Reval

Vetted Data Engineers

Pre-screened and vetted.

Python, SQL, Docker, CI/CD, ETL, AWS

Gautam Sanka

Senior Python AI/ML Engineer specializing in MLOps, data engineering, and LLM applications

Austin, TX · 12y exp
Elevance Health · University of Texas at Austin
Python, SQL, Shell Scripting, Java, JavaScript, React (+157 more)

Purvansh Jain

Senior AI Engineer specializing in LLMs, RAG, and scalable data platforms

USA · 5y exp
Programmers.ai · University of Pennsylvania
Python, Java, SQL, PySpark, Databricks, Snowflake (+67 more)

Hiroaki Oshima

Mid-level Machine Learning & Data Engineer specializing in MLOps and cloud data platforms

San Francisco, CA · 4y exp
Blue River Technology · UC Berkeley
Airflow, Amazon Athena, Apache Spark, AWS CloudFront, AWS CloudWatch, AWS DMS (+64 more)

Daniel Stephens

Senior Data Engineer specializing in cloud data platforms and scalable ETL pipelines

Rosharon, TX · 11y exp
AssistRx · University of Texas at Austin
Access Control, Agile, Airflow, Allure, Amazon Redshift, Apache Airflow (+107 more)

Kristopher Ali

Senior Data Engineer specializing in cloud lakehouse platforms and healthcare data

Remote · 13y exp
Deloitte · University of Michigan
Python, SQL, Scala, Bash, Java, R (+180 more)

Priya Kotha

Mid-level Data Engineer specializing in real-time pipelines across FinTech and Healthcare

USA · 4y exp
Plaid · Sacred Heart University
Python, SQL, Pandas, NumPy, Apache Spark, PySpark (+75 more)

Jiajun Huo

Mid-Level Software Engineer specializing in data infrastructure and LLM applications

Remote · 3y exp
H60 Consulting · University of Illinois Urbana-Champaign
Python, TypeScript, JavaScript, Go, SQL, Java (+65 more)

Crystal Chen

Mid-level Data Engineer specializing in analytics engineering, ML forecasting, and modern data stacks

Cupertino, CA · 4y exp
Apple · Northeastern University
SQL, Python, NumPy, Pandas, Scikit-learn, Matplotlib (+32 more)

Vismay Devjee

Screened References · Moderate rec.

Mid-level GenAI Engineer specializing in AI agents, RAG, and LLM evaluation

Boston, MA · 2y exp
Fidelity Investments · Northeastern University

Asset Management Risk professional at Fidelity Investments who built and productionized an agentic RAG platform enabling compliance and analysts to query 10,000+ fund documents with cited answers in seconds. Implemented structure-aware semantic chunking (AWS Textract), hierarchical retrieval, and hybrid search to raise accuracy from 68% to 94%, and built an evaluation framework tracking accuracy/latency/cost/hallucinations—delivering 40+ hours/month saved and zero critical production failures.

AI Agents, Agent Architectures, Agent Evaluation Pipelines, Apache Airflow, AWS, AWS Lambda (+85 more)

Travoy Spelling

Screened

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX · 10y exp
Tredence · University of Texas at Austin

ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).

A/B Testing, Active Learning, Agentic Workflows, Airflow, Amundsen, API Development (+247 more)

Sahithi K

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Boston, MA · 4y exp
Moderna · University of Massachusetts Dartmouth

Data engineer with experience at Moderna and Block owning high-volume (≈10TB/day) production pipelines on AWS, using Kafka/S3/Glue/dbt/Snowflake with strong data quality and observability practices (schema validation, anomaly detection, CloudWatch monitoring). Also built external financial API ingestion with Airflow retries, throttling/token rotation, and schema versioning, and helped stand up an early-stage biomedical data platform with CI/CD and incident debugging.

Python, SQL, PySpark, Apache Spark, Hadoop, Hive (+94 more)

Venkata Sai Pavan Dema

Screened

Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps

5y exp
Capital One · University of the Cumberlands

AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.

A/B Testing, Adversarial Prompt Testing, Airflow, Amazon EC2, Amazon Redshift, Amazon S3 (+163 more)

Nikita Vivek Kolhe

Screened

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States · 1y exp
WorkUp · USC

ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).

Python, SQL, C, C++, R, MATLAB (+90 more)

Sai Venkata

Screened

Senior Data Engineer specializing in cloud lakehouse and real-time streaming pipelines

Texas, USA · 6y exp
CVS Health · University of Central Missouri

Senior data engineer with experience in both healthcare (CVS Health) and financial services (Bank of America), building large-scale Azure lakehouse pipelines (30+ EHR sources, ~5TB) and real-time streaming services (Event Hubs/Kafka) for patient vitals. Strong focus on reliability and data quality (Great Expectations, monitoring/alerting, schema drift automation), with measurable outcomes like 50% runtime reduction and 99%+ uptime for regulatory reporting pipelines.

Python, SQL, Scala, Java, Shell Scripting, Apache Spark (+117 more)

Abhay Naik

Screened

Mid-level Data Engineer specializing in cloud-native analytics and enterprise integrations

Remote · 3y exp
The Groove · UC Berkeley

Built and productionized an LLM-powered clinical assistant at a healthcare startup, re-architecting a prototype into a robust RAG system on AWS with guardrails, citations, monitoring, and automated tests for clinical reliability. Works closely with clinicians to convert workflow feedback into evaluation criteria and iterative system improvements, and has hands-on experience debugging agentic systems in real time (including during live client demos).

AWS, Amazon S3, Amazon EKS, Amazon EC2, Amazon ECS, AWS IAM (+91 more)

Shriya Bannikop

Screened

Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems

Seattle, WA · 5y exp
Amazon Web Services · KLE Technological University

Full-stack engineer who built and owned an AI-assisted job-matching dashboard in Next.js App Router/TypeScript, keeping LLM logic server-side and improving performance via deduplication, caching/revalidation, and streaming (35% fewer duplicate LLM calls; 40% faster first render). Also has strong data/backend chops: designed Postgres models and optimized queries at million-record scale (1.8s to 120ms) and built durable AWS multi-region telemetry workflows with idempotency, retries, and monitoring.

Adobe XD, Agile, Airflow, Amazon Athena, Amazon CloudWatch, Amazon DynamoDB (+170 more)

Apoorva Nanabolu

Screened

Senior Data Scientist / Generative AI Engineer specializing in fraud, risk, and MLOps

5y exp
PayPal · University of New Haven

Built and deployed a production LLM/RAG fraud investigation system to replace manual investigator workflows, combining transaction data, historical cases, and policy documents with agent-style steps and LoRA fine-tuning. Demonstrates strong reliability engineering (grounding, citations, abstention paths), performance optimization (retrieval/indexing/caching), and end-to-end MLOps orchestration using Azure ML Pipelines/MLflow plus Kubernetes/Argo with canary and rollback deployments.

Python, R, SQL, NoSQL, Snowflake, BigQuery (+178 more)

Vivek Reddy

Screened

Mid-level Data Scientist/Data Engineer specializing in ML pipelines, insurance and healthcare analytics

Los Angeles, CA · 7y exp
Venture Connect · UC Berkeley

Built a production assistive-vision iPhone app to help visually impaired users find grocery items, training a custom YOLO detector on 2,000+ self-collected/annotated images and deploying via CoreML with a cloud multimodal LLM for navigation instructions. Brings hands-on AWS serverless + ECS container deployment (CDK/GitHub Actions) and a disciplined approach to AI workflow reliability (state-machine design, offline evals, stress tests, logging/metrics), plus experience communicating model insights to non-technical stakeholders (MOTER Technologies).

A/B Testing, Airflow, Amazon Athena, Amazon Bedrock, Amazon CDK, Amazon ECS (+109 more)

Sahithi Tummala

Mid-level AI Engineer specializing in Generative AI, LLMs, and RAG systems

Dallas, TX · 6y exp
Newmark · University of North Texas
A/B Testing, Access Controls, Aerospike, Amazon Bedrock, Amazon DynamoDB, Amazon EC2 (+159 more)

Saad Mahmood

Senior AI/ML Engineer specializing in GenAI, LLMs, and MLOps

Beaverton, OR · 10y exp
Nike · Illinois Institute of Technology
Python, Java, Scala, Kotlin, C#, .NET 6 (+174 more)

Abraheem Irheem

Senior Data Engineer specializing in AI/LLM platforms

Chicago, IL · 9y exp
Nike · University of Illinois Chicago
Python, Java, Scala, Kotlin, C#, .NET 6 (+155 more)

Alekya Bachala

Principal Big Data & Software Engineer specializing in Spark/Scala and GCP data platforms

San Jose, CA · 13y exp
Verizon · University at Buffalo
Google Cloud Platform (GCP), BigQuery, Spanner, Dataproc, Cloud Composer, Cloud Run (+68 more)
