
Vetted Data Engineers

Pre-screened and vetted.


PAVAN VARMA PENMETHSA

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY · 6y exp
Avanade · University of North Texas

“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”

Machine Learning, Generative AI, Large Language Models (LLMs), Prompt Engineering, Retrieval-Augmented Generation (RAG), Embeddings, +131 more

Mohan Naik Megavath

Screened

Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms

Remote, USA · 4y exp
Truist · Elmhurst University

“Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.”

Amazon DynamoDB, Amazon EC2, Amazon Redshift, Amazon S3, AngularJS, Apache Hadoop, +137 more

Veera Mallipudi

Screened

Senior DevOps & Release Engineer specializing in CI/CD automation and AWS IaC

Raleigh, NC · 12y exp
Vidmob · University of Central Missouri

“Infrastructure/DevOps engineer (Vidmob) focused on AWS + containers, owning GitLab CI/CD and Terraform-managed environments. Led a high-impact CI incident by correlating runner queue time, Docker pull latency, and NAT egress; implemented ECR pull-through caching and VPC endpoints to restore performance and then standardized the fix in Terraform for future scale-ups.”

Claude, CI/CD, GitLab CI, Jenkins, Git, GitHub, +168 more

Manichandra Reddy Bethi

Screened

Mid-level GenAI Engineer specializing in production AI agents and evaluation pipelines

Overland Park, Kansas · 5y exp
Minutentag · Wilmington University

“Built and shipped a production LLM-powered internal operations automation platform using LangChain RAG (Pinecone) and FastAPI microservices, deployed on AWS EKS, serving 10k+ daily interactions. Implemented a rigorous evaluation/observability stack (golden datasets, prompt regression tests, MLflow, retrieval metrics, hallucination monitoring) that drove hallucinations below 2% and improved reliability, and partnered closely with non-technical ops leaders to cut manual lookup work by 60%+.”

A/B Testing, Alerting, AWS, AWS Lambda, BERT, CI/CD, +120 more

Ram Kottala

Screened

Mid-level Data & GenAI Engineer specializing in lakehouse, streaming, and RAG platforms

Michigan, USA · 5y exp
Ford · Webster University

“Built a production internal LLM-powered knowledge assistant using a RAG architecture (Python, LLM APIs, cloud services) that answers employee questions with sourced, grounded responses from internal documents. Demonstrates strong practical depth in retrieval tuning (chunking/metadata filters), orchestration with LangChain, and production reliability practices (latency optimization, automated embedding refresh, evaluation metrics, logging/monitoring) while partnering closely with non-technical operations teams.”

Python, PySpark, Scala, Java, R, SQL, +173 more

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA · 4y exp
S&P Global · University of Central Missouri

“Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.”

Python, PySpark, SQL, Scala, Batch Processing, Data Transformation, +119 more

Manali Shetye

Screened

Mid-level Applied AI & Data Engineer specializing in automation and enterprise analytics

Irving, Texas · 4y exp
Trend Micro · University of Texas at Arlington

“Backend engineer with experience evolving a high-volume agricultural loan processing platform (APMS) at HDFC Bank, emphasizing transactional integrity, auditability, and modularity while integrating with credit bureaus, document management, and risk engines. Also improved automation/reporting robustness at Trend Micro by catching duplicate-event retry edge cases and adding idempotency safeguards.”

Python, R, C#, SQL, JavaScript, C, +95 more

Sailaja Lokasani

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and healthcare analytics

Dallas, TX · 5y exp
Lightbeam Health Solutions · Syracuse University

“Healthcare-focused data engineer/ML practitioner with experience at Lightbeam Health Solutions and Humana building production entity-resolution and semantic similarity pipelines across EMR, lab, and claims data. Uses NLP/ML (spaCy, scikit-learn, BioBERT/LightGBM) plus Snowflake/Airflow and vector search (Pinecone) to improve linkage accuracy (reported 90%) and semantic match quality (reported +12–15%), while reducing manual cleanup by 40%+.”

Apache Airflow, AWS, AWS Glue, AWS Lambda, Agile, C++, +134 more

Revanth Goli

Screened

Senior Data & Backend Engineer specializing in cloud data pipelines and LLM/RAG systems

Morrisville, NC · 6y exp
Syneos Health · University of Alabama at Birmingham

“Data engineer with end-to-end ownership of large-scale retail and clinical data ingestion/processing on AWS, including real-time streaming and batch pipelines. Delivered measurable outcomes: 20M daily transactions processed, latency cut from 4 hours to 5 minutes, ~70% fewer failures, and 120+ pipelines running at 99.8% reliability with full audit compliance.”

Python, Pandas, PySpark, FastAPI, LangChain, SQL, +97 more

Avantik Tiwari

Screened

Junior Data Scientist / Big Data Engineer specializing in ML, LLMs, and analytics platforms

Tempe, Arizona · 3y exp
Arizona State University · Arizona State University

“Backend/data platform engineer who led a major redesign of a hybrid streaming+batch analytics platform processing 10+ TB/day (Airflow/Hive/BigQuery) with strong data-quality automation. Also built a production RAG PDF assistant with concrete mitigations for hallucinations and prompt injection (re-ranking, grounding, verifier step) and has deep experience executing low-risk migrations (dual-write, blue-green, rapid rollback) and implementing JWT-based row-level security.”

Python, SQL, Java, JavaScript, MySQL, PostgreSQL, +112 more

Neeraj Jawahirani

Screened

Mid-level Data & AI Engineer specializing in healthcare data pipelines and MLOps

FL, USA · 4y exp
Humana · Florida State University

“Built and deployed a production LLM-powered clinical note summarization system used by care managers to speed review of 5–20 page unstructured medical records. Implemented safety-focused validation (prompt constraints, rule-based and section-level checks, human-in-the-loop) to reduce hallucinations while maintaining low latency and meeting privacy/regulatory constraints, integrating via APIs into existing clinical tools.”

Agile, Amazon CloudWatch, Amazon EMR, Amazon Redshift, Amazon S3, Amazon SageMaker, +122 more

Hrishikesh Raghunath

Screened

Mid-level Data Engineer specializing in scalable ETL, streaming analytics, and cloud data platforms

Remote, USA · 7y exp
Dreamline AI · California State University, Fullerton

“At Dreamline AI, built and productionized an AWS-based incentive intelligence platform that uses Llama-2/GPT-4 to extract eligibility rules from unstructured state policy documents into structured JSON, then processes them with Glue/PySpark and serves results via Lambda/SageMaker/API Gateway. Designed state-specific ingestion connectors plus schema validation and automated checks/alerts to handle frequent policy/format changes without breaking the pipeline, and partnered with business/analytics stakeholders to deliver interpretable eligibility decisions via explanations and dashboards.”

A/B Testing, Amazon CloudWatch, Amazon Kinesis, Amazon Redshift, Amazon S3, Amazon SageMaker, +114 more

Rakesh Kolagani

Screened

Mid-level AI/ML Engineer specializing in MLOps and LLM-powered applications

Mountain View, CA · 5y exp
Intuit · University of Central Missouri

“AI/ML engineer with production experience building a RAG-based internal analytics assistant (Databricks + ADF ingestion, Pinecone vector store, LangChain orchestration) deployed via Docker on AWS SageMaker with CI/CD and MLflow. Strong focus on real-world constraints—latency/cost optimization (LoRA ~60% compute reduction), hallucination control with citation grounding, and enterprise security/governance. Previously at Intuit, delivered an interpretable churn prediction system (PySpark/Databricks, Airflow/Azure ML) that improved retention targeting ~12%.”

A/B Testing, Amazon S3, Apache Airflow, AWS Glue, AWS Lambda, AWS Step Functions, +126 more

Supriya Mattapelly

Screened

Mid-level AI/ML Engineer specializing in GenAI agents, RAG pipelines, and MLOps

USA · 6y exp
UnitedHealthcare · Kent State University

“AI/ML engineer who built a production RAG-based internal document intelligence assistant (LangChain + Pinecone) to let employees query enterprise reports in natural language. Demonstrated hands-on pipeline orchestration with Apache Airflow and tackled real production issues like retrieval grounding and latency using tuning, caching, and token optimization, while partnering closely with non-technical business stakeholders through iterative demos.”

A/B Testing, Amazon CloudWatch, Amazon EC2, Amazon EMR, Amazon Redshift, Amazon S3, +152 more

HarshaSree Gudapati

Screened

Senior Data Engineer specializing in cloud-native data platforms for finance and healthcare

Charlotte, NC · 4y exp
Bank of America · University of Cincinnati

“Data engineer/backend data services practitioner with Bank of America experience building real-time and batch transaction-monitoring pipelines and APIs (Kafka + databases, REST/GraphQL). Highlights include a reported 45% response-time improvement through performance optimizations and use of Delta Lake schema evolution plus CI/CD (GitHub Actions/Jenkins) and operational reliability patterns like CloudWatch monitoring and dead-letter queues.”

Azure Data Factory, AWS, Amazon S3, AWS Glue, Amazon Redshift, Amazon EMR, +125 more

Ramya Latha

Screened

Senior AI/ML & Data Engineer specializing in Generative AI and RAG systems

Birmingham, AL · 8y exp
Regions Bank

“GenAI/RAG engineer who has deployed a production policy/regulatory search assistant for a financial client using LangChain + Vertex AI, FastAPI, Docker/Kubernetes, and Airflow-orchestrated data pipelines. Demonstrated measurable impact with 50–60% latency reduction and 70% fewer pipeline failures, plus KPI-driven grounding evaluation (90%+ target) and strong cross-functional collaboration with compliance/business teams.”

Amazon EMR, Amazon Redshift, Amazon S3, Apache Airflow, Apache Cassandra, Apache Hadoop, +200 more

Sarthak Kar

Mid-level AI/ML Engineer specializing in scalable ML, NLP, and time-series forecasting

USA · 4y exp
MetLife · San Diego State University
Python, R, SQL, MATLAB, Jupyter Notebook, TensorFlow, +125 more

Akshith Ullal

Senior Research/Data Engineer specializing in AR/VR telepresence and healthcare AI

Gainesville, FL · 10y exp
University of Florida · Vanderbilt University
AWS, Blender, C, C#, C++, Data Engineering, +83 more

Sahiti Nallamolu

Mid-level AI/ML Engineer specializing in RAG, LLMs, and MLOps for finance

Boston, MA · 4y exp
Humanitarians.AI · Northeastern University
Generative AI, Machine Learning, Deep Learning, Retrieval-Augmented Generation (RAG), Large Language Models (LLMs), GPT, +94 more

Siva Reddy

Mid-level AI/ML & Data Engineer specializing in GenAI, MLOps, and cloud data platforms

Pennsylvania, USA · 6y exp
QVC · University of Texas at Arlington
Apache Airflow, Apache Kafka, Apache Spark, AWS CodePipeline, AWS Glue, AWS Lambda, +112 more

Prathyusha Bandi

Senior Data Engineer specializing in cloud data platforms and real-time pipelines

Phoenix, AZ · 8y exp
Wells Fargo · Texas A&M University–Kingsville
Apache Airflow, Apache Hadoop, Apache Hive, Apache Kafka, Apache Spark, AWS, +132 more

Aditya Singaravelu

Mid-level Software/Data Engineer specializing in AI-driven data platforms and cloud ETL

Sunnyvale, CA · 4y exp
Aspen Aerogels · UC Riverside
Python, C#, C++, C, Java, Oracle Database, +76 more

Lahari Sudhini

Mid-level Data Scientist / ML Engineer specializing in financial risk, NLP, and MLOps

TX, State · 5y exp
Charles Schwab · University of North Texas
A/B Testing, Agile, Amazon Bedrock, Anomaly Detection, Apache Airflow, Apache Kafka, +139 more

Bhavika Uppala

Mid-level Data Engineer specializing in cloud data platforms and scalable ETL pipelines

Richmond, Texas · 4y exp
Capital One · Wichita State University
AWS, Amazon S3, AWS Glue, AWS Lambda, Amazon EMR, Amazon Redshift, +136 more