Reval

Vetted Remote Data Engineers

Pre-screened and vetted for remote work.

Python, SQL, Apache Airflow, Apache Spark, Amazon S3, ETL

Julian Smith

Senior Data Engineer specializing in cloud data platforms and real-time analytics

Remote · 10y exp
Scout Motors · University of Texas at Austin
A/B Testing, Agile Methodologies, Amazon EC2, Amazon EMR, Amazon S3, Apache Airflow, +109 more

Kristopher Ali

Senior Data Engineer specializing in cloud lakehouse platforms and healthcare data

Remote · 13y exp
Deloitte · University of Michigan
Python, SQL, Scala, Bash, Java, R, +180 more

Jiajun Huo

Mid-level Software Engineer specializing in data infrastructure and LLM applications

Remote · 3y exp
H60 Consulting · University of Illinois Urbana-Champaign
Python, TypeScript, JavaScript, Go, SQL, Java, +65 more

Abhay Naik

Screened

Mid-level Data Engineer specializing in cloud-native analytics and enterprise integrations

Remote · 3y exp
The Groove · UC Berkeley

Built and productionized an LLM-powered clinical assistant at a healthcare startup, re-architecting a prototype into a robust RAG system on AWS with guardrails, citations, monitoring, and automated tests for clinical reliability. Works closely with clinicians to convert workflow feedback into evaluation criteria and iterative system improvements, and has hands-on experience debugging agentic systems in real time (including during live client demos).

AWS, Amazon S3, Amazon EKS, Amazon EC2, Amazon ECS, AWS IAM, +91 more

Shambhavi Pandala

Mid-level Data Engineer specializing in cloud lakehouse, ETL, and streaming pipelines

Remote, USA · 4y exp
Databricks · University of Central Missouri
Python, Pandas, NumPy, PySpark, SQL, Bash, +64 more

Gopala Nelapati

Mid-level Data Engineer specializing in cloud-native ETL and data warehousing

Remote, USA · 4y exp
PayPal · Lamar University
Python, Pandas, PySpark, SQL, T-SQL, PostgreSQL, +52 more

Pooja Dokuri

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and cloud MLOps

Remote, USA · 4y exp
UnitedHealth Group · East Texas A&M University

Built and deployed a production LLM + vector search clinical decision support system at UnitedHealth Group, retrieving medical evidence and patient context in real time for prior authorization and risk scoring. Strong in end-to-end RAG architecture (Hugging Face embeddings, Pinecone/FAISS, SageMaker, Redis) plus orchestration (Airflow/Kubeflow) and rigorous evaluation/monitoring, with demonstrated ability to align solutions with clinical operations stakeholders.

Python, Pandas, NumPy, PySpark, Scikit-learn, SQL, +133 more

Vaibhav Sharma

Screened

Mid-level Software Engineer specializing in AI/ML and data platforms

Remote, USA · 5y exp
Google · Indiana University Bloomington

AI/ML engineer who built a production agentic system to automate computational research experiments (simulation execution, parameter exploration, and numerical analysis) and mitigated context-window failures using constrained tool-calling/prompt-chaining patterns in LangChain with OpenAI tool-enabled models. Also has adtech/big-data pipeline experience at InMobi, orchestrating Spark jobs in Airflow to filter bot-like user IDs and publish clean IDs to an online NoSQL store for live serving, plus Apache open-source collaboration experience.

A/B Testing, Apache Airavata, Apache Airflow, Apache Hadoop, Apache Hive, Apache Kafka, +100 more

Venkatasaibalakrishna Batchu

Mid-level AI/MLOps Engineer specializing in GenAI, RAG, and production ML platforms

Remote, USA · 3y exp
Northern Trust · California State University
AI Governance, API Development, API Governance, Application Insights, Apache Airflow, Auditability, +114 more

Nandini Tarigopula

Mid-level AI/ML Engineer specializing in healthcare NLP and MLOps

Remote, USA · 5y exp
CVS Health · Trine University
A/B Testing, Amazon Redshift, Apache Airflow, Apache Flink, Apache Hadoop, Apache Kafka, +107 more

Hema Sai Charan Bandi

Mid-level Data Engineer specializing in AWS, real-time pipelines, and ML/GenAI data platforms

Remote, USA · 4y exp
Asana · University of Maryland, Baltimore County
Python, Scala, C++, Node.js, Ruby, SQL, +83 more

Ahsan A Shawl

Mid-level Data Engineer specializing in AWS, Spark, and Python ETL

Remote, USA · 4y exp
Albert Invent · Georgia Tech
Python, Java, C++, SQL, Scala, AWS, +42 more

Mohan Naik Megavath

Screened

Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms

Remote, USA · 4y exp
Truist · Elmhurst University

Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.

Agile Methodology, Airflow, Airflow DAGs, Airflow Sensors, Airflow Operators, Amazon Athena, +137 more

Hrishikesh Raghunath

Screened

Mid-level Data Engineer specializing in scalable ETL, streaming analytics, and cloud data platforms

Remote, USA · 7y exp
Dreamline AI · California State University, Fullerton

At Dreamline AI, built and productionized an AWS-based incentive intelligence platform that uses Llama-2/GPT-4 to extract eligibility rules from unstructured state policy documents into structured JSON, then processes them with Glue/PySpark and serves results via Lambda/SageMaker/API Gateway. Designed state-specific ingestion connectors plus schema validation and automated checks/alerts to handle frequent policy/format changes without breaking the pipeline, and partnered with business/analytics stakeholders to deliver interpretable eligibility decisions via explanations and dashboards.

A/B Testing, Amazon Athena, Amazon CloudWatch, Amazon EventBridge, Amazon Kinesis, Amazon QuickSight, +114 more

Bheema Sabilla

Screened

Mid-level Data Engineer specializing in Lakehouse, Streaming, and ML/LLM data systems

Remote, USA · 3y exp
Discover · University of South Dakota

Built and productionized an enterprise retrieval-augmented generation platform for internal knowledge over large unstructured corpora, emphasizing trust via strict citation/grounding and hybrid retrieval (BM25 + FAISS + cross-encoder re-ranking). Demonstrates strong scaling and cost/latency optimization through incremental indexing/embedding and index partitioning, plus disciplined evaluation/observability practices. Has experience operationalizing pipelines with Airflow/Databricks/GitHub Actions and partnering closely with risk & compliance stakeholders on auditability requirements.

Python, PySpark, SQL, Scala, Pandas, NumPy, +157 more

Bharath Reddy

Mid-level Data Engineer specializing in financial risk, compliance, and real-time streaming

Remote, USA · 4y exp
LTIMindtree · Concordia University, St. Paul
Python, SQL, PySpark, Scala, NumPy, Pandas, +69 more

Kavya Sree Maniga

Mid-level Data Engineer specializing in real-time streaming pipelines and healthcare data

Remote, USA · 6y exp
Digimarc · University of Maryland, Baltimore County
AI-Assisted Coding, Amazon Redshift, Amazon S3, Amazon EMR, Anomaly Detection, Apache Airflow, +107 more

HarshavardhanReddy Ala

Mid-level Data Engineer specializing in reliable data pipelines, metadata enrichment, and APIs

Remote, USA · 4y exp
CVS Health · Florida Atlantic University
Python, SQL, R, ETL, ELT, Data pipeline design, +86 more

Shashank Bijarapu

Screened

Mid-level AI/ML & Data Engineer specializing in MLOps and cloud data pipelines

Remote, USA · 4y exp
Merkle · University of North Carolina at Charlotte

AI/ML engineer (Merkle) with hands-on experience deploying RAG-based LLM applications and real-time recommendation engines into production. Strong in cloud/on-prem architectures, GPU autoscaling, caching, and network optimization. Delivered measurable latency reductions (40–70%) and improved retrieval relevance by systematically benchmarking chunking/embedding configurations and validating pipelines via CI/CD.

Python, SQL, R, Java, Bash, Scikit-learn, +103 more

Lerone Pieters

Senior Data Engineer specializing in cloud data platforms and real-time analytics

Remote, USA · 10y exp
Scale Media · New York City College of Technology (CUNY)
Python, SQL, Java, Scala, Go, JavaScript, +87 more

Shravani Kodari

Mid-level Data Engineer specializing in cloud data platforms and ETL/ELT

Remote, USA · 5y exp
Optum · Nalla Malla Reddy Engineering College
AWS, AWS Glue, Amazon EMR, AWS Lambda, Apache Spark, PySpark, +65 more