Reval Logo

Vetted Apache Airflow Professionals

Pre-screened and vetted.

DT

Executive Engineering Leader specializing in E-commerce, SaaS, and EdTech platforms

Delaware, USA22y exp
ChitChatShopOsmania University
View profile
AS

Senior Software Engineer specializing in cloud, data platforms, and LLM/RAG applications

Fremont, CA7y exp
Volvo GroupSan José State University
View profile
AB

Mid-level AI/ML Engineer specializing in cloud MLOps and GenAI for fraud detection

New York, NY4y exp
StripeNJIT
View profile
EP

Ethan Pribble

Screened ReferencesStrong rec.

Senior Software Engineer specializing in cloud cost intelligence and FinOps platforms

21y exp
CloudZeroNorthwestern University

Backend/data engineer with strong authorization and compliance-domain experience: led a phased migration from a simplistic role model to modern RBAC on a Python serverless stack (Auth0 + AWS Lambda/API Gateway), coordinating changes across 5 repos with extensive manual and automated validation. Previously built and operated custom ETL pipelines (Airflow + Groovy/Java on Spark/YARN/Hadoop) to normalize messy customer email/chat/voice data for NLP-driven financial compliance indicators, including complex email journaling metadata enrichment and large-scale remediation reprocessing after production bugs.

View profile
PT

Senior Data Engineer specializing in cloud big data pipelines and real-time streaming

Seattle, WA6y exp
AmazonUniversity of North Texas

Amazon data engineer who built a real-time fraud detection pipeline for AWS Lambda, tackling multi-region telemetry quality issues and scaling stream processing for billions of daily requests. Strong in production-grade data/ML workflows on AWS (EMR, Glue, Kinesis, SageMaker) with hands-on entity resolution and anomaly detection.

View profile
DM

Mid-level Software Engineer specializing in cloud automation and data/ETL platforms

Arlington, Virginia6y exp
AmazonVirginia Tech

Backend engineer with AWS multi-region production experience building APIs and workflow automation for data center/storage hardware operations (firmware orchestration, maintenance checks, ticketing, dashboards). Also shipped an internal AI chat tool that parses hardware runbooks and incorporates user feedback to retrain the model, and has a strong testing/quality discipline (95%+ coverage) plus database performance tuning via indexing and query monitoring.

View profile
SJ

Sourabh Jain

Screened

Director of Software Engineering specializing in enterprise Data, ML & AI platforms

Bay Area, CA23y exp
RSA SecurityShri G. S. Institute of Technology and Science

Former Walmart Director of Software Engineering who left in March 2025 to build products for clients. Recently delivered an LLM/RAG-based UNSPSC classification solution for an MRO client using a multi-stage retrieval + web search + prompt-engineering workflow, and has led large-scale retail forecasting initiatives and high-severity cloud-migration incidents end-to-end.

View profile
BS

Mid-level Full-Stack Developer specializing in cloud-native backend services and real-time data platforms

Remote, USA4y exp
NetflixUniversity of Dayton

Backend/data engineering candidate with Netflix experience designing and migrating analytics platforms from batch to real-time streaming (Kafka/Flink) across AWS and GCP. Delivered measurable improvements (40% lower data delay, 99.9% accuracy) using phased rollouts, automated data validation (Great Expectations), and strong observability (Prometheus/Grafana), and proactively hardened pipelines with idempotency to prevent duplicate Kafka processing.

View profile
SF

Sara Fang

Screened

Mid-level Software Engineer specializing in cloud data platforms and distributed systems

Remote6y exp
Terra Byte XUniversity of Delaware

Backend/data engineer with production experience building FastAPI services with strong reliability patterns (circuit breaker, rate limiting, caching, graceful degradation) and JWT/OAuth2 auth. Has delivered AWS EKS deployments via Terraform with Secrets Manager/IRSA and HPA autoscaling, and built Glue/Spark ETL pipelines on S3 Parquet with schema-evolution and idempotent reruns; also demonstrated measurable SQL tuning impact (20–30s to <10s).

View profile
JZ

Mid-level Machine Learning Engineer specializing in LLMs, fairness, and healthcare ML

Illinois, USA4y exp
iSchool Statistical ML & AI LabUniversity of Illinois Urbana-Champaign

ML/NLP practitioner with a master’s thesis focused on domain-adaptive knowledge distillation for LLMs (LLaMA2/sheared LLaMA), showing improved perplexity and ROUGE-L on biomedical data. Also built real-world data linking and search systems: integrated ClinicalTrials.gov with FAERS using fuzzy matching + embeddings, and delivered an LLM-powered FAQ recommender at Hyperledger using sentence-transformers, FAISS, and fine-tuning to mitigate embedding drift.

View profile
PY

Staff/Lead Software Architect specializing in Contact Center platforms and GenAI automation

Campbell, CA21y exp
HyperAnalyticsUniversity of Toledo

Built and deployed production LLM systems in healthcare and at LinkedIn: automated pen-and-paper clinical trial evaluations with a 40x efficiency gain and created an evidence-based Evaluation Agent focused on accuracy and speed. Also used Temporal to orchestrate resilient data-ingestion workflows for customer support staffing prediction, improving prediction outcomes by 40% while handling missing data, retries, and backfills.

View profile
DK

Dheeraj Kumar

Screened

Intern Data Scientist specializing in marketing analytics and data engineering

Tucson, Arizona2y exp
RochePurdue University

AI/LLM practitioner with internships at Dell Technologies and Roche who built and deployed a healthcare-focused "Doctor LLM" by fine-tuning Meta Llama 3.2 on healthcaremagic.json, emphasizing safety guardrails to prevent harmful medical advice. Experienced in productionizing AI workflows with monitoring, testing, and orchestration (Airflow, Kubernetes), and in delivering AI-agent-driven competitive landscape insights to non-technical business stakeholders.

View profile
SC

Shweta Chavan

Screened

Junior Computer Vision & ML Engineer specializing in autonomous perception systems

Pittsburgh, PA2y exp
Magna InternationalCarnegie Mellon University

LLM/RAG engineer who built a production-style multi-agent orchestrator for resume-to-recommendation workflows (PDF ingestion through screening and recommendations), emphasizing prompt tuning and strict JSON output contracts. Currently building a RAG application for an NGO using Airflow (DAGs + embeddings) and tackling messy, missing/imbalanced data; has hands-on retrieval stack experience (FAISS/HNSW, bge embeddings) and uses rigorous evaluation metrics for groundedness and hallucination control.

View profile
ZW

Zheng Wu

Screened

Junior Software Engineer specializing in backend systems and cloud messaging

Mountain View, CA1y exp
NewsBreakRice University

Data/ML engineer who has owned end-to-end systems across email deliverability/segmentation and production LLM apps. Built a Spark+Airflow segmentation engine that materially improved deliverability (99.9%) and open rates (>50%), and shipped a PDF-to-quiz RAG product using LangChain/Vertex AI/Chroma with strong guardrails and an eval loop that cut hallucinations to <5%.

View profile
AV

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and scalable ML systems

Harrison, NJ4y exp
AdobeNJIT

ML/LLM engineer at Adobe who deployed a transformer-based personalization and campaign-targeting recommender system end-to-end, including PySpark/Airflow pipelines processing 12M+ events/day and containerized inference on AWS SageMaker (Docker/Kubernetes). Also has hands-on LLM workflow experience (RAG, semantic search, prompt optimization, hallucination mitigation) with a metrics-driven approach to reliability, drift monitoring, and reproducible retraining via MLflow.

View profile
SM

Executive ML/AI Founder specializing in agentic analytics and data infrastructure

10y exp
Photosphere LabsUniversity of Texas at Dallas

Founder of Photosphere Labs (agentic AI for ecommerce data synthesis/analysis) who worked directly with customers to scope, build, demo, and iterate LLM-based solutions, including an AI chat product for brand owners. Previously at Block, built and explained a nuanced causal inference/propensity model tied to Square POS integrations, translating model specs and outputs into business impact for varied client contexts.

View profile
KD

Junior ML Engineer specializing in Generative AI and LLM applications

Thousand Oaks, California3y exp
NVIDIACalifornia Lutheran University

Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.

View profile
PV

Praveen V

Screened

Mid-Level Software Engineer specializing in Generative AI and RAG systems

Remote, USA5y exp
MetaUniversity of North Carolina at Charlotte

Built a production RAG-based natural-language-to-SQL system at Global Atlantic to replace slow, expensive manual analytics ticket workflows, focusing heavily on retrieval quality and measurable evaluation (200-question ground-truth set; recall@5 improved 0.65→0.78 via semantic chunking). Also built a custom MCP-style agent orchestrator for a personal project (arxiv-ai) to improve flexibility and Langfuse-aligned observability, and has hands-on experience with LangGraph, CrewAI, and n8n.

View profile
VM

Vishal Mittal

Screened

Director-level Engineering Manager specializing in cloud security platforms and AI-driven automation

Fremont, CA18y exp
Palo Alto NetworksStanford University

Senior engineering leader in the Bay Area with experience spanning VMware, Hortonworks/Cloudera, Barracuda, and Palo Alto Networks, including leading open-source work (Apache Knox) and architecting large-scale security platforms. Has driven disaster recovery and cloud security products, designed Python microservices for Microsoft 365 security, and scaled teams (3x) while formalizing enterprise readiness practices with automated documentation using Notebook LLM.

View profile
JL

Joseph Lee

Screened

Staff Software Engineer specializing in cloud platforms for healthcare and financial workflows

Dallas, TX10y exp
OptumUniversity of Texas at Dallas

Backend/data engineer with Optum healthcare claims domain experience building high-reliability Python microservices (FastAPI/Kafka/Postgres) and AWS data platforms (EKS, Glue, Redshift). Demonstrated strong production ownership: fixed duplicate Kafka processing via transactional outbox/idempotency, scaled to millions of daily events, and delivered major SQL performance gains (40+ min to <5 min, ~60% CPU reduction). Seeking remote-only work; targets $130k base.

View profile
CK

Senior Software Engineer specializing in Python, cloud platforms, and distributed systems

Nashville, TN13y exp
i3 VerticalsUniversity of Chicago

Backend/data engineer with production experience at Walmart and HealthSnap building Python services and data pipelines on AWS (EKS, Lambda, Glue, Airflow). Strong reliability and operations focus—implemented idempotency + circuit breakers for peak-traffic consistency issues, GitOps CI/CD, and observability. Demonstrated measurable performance wins (Postgres p95 45s to <5s, ~60% CPU reduction) and modernized SAS batch workflows to Python with parallel-run parity validation and feature-flagged rollout.

View profile
JL

Jiaqi Li

Screened

Junior AI Engineer specializing in healthcare analytics and compliance AI

Pittsburgh, PA1y exp
CustomerInsights.AICarnegie Mellon University

Built and shipped a production LLM-driven multi-agent platform (ciATHENA) at CustomerInsights.AI to automate analytics/ML/compliance workflows in healthcare and life sciences. Implemented LangGraph/LangChain orchestration with strong backend-style rigor (schemas, Pydantic validation, retries, auditability) and optimized latency/cost while keeping the system usable for non-technical users via guided natural-language interactions and structured/visual outputs.

View profile

Need someone specific?

AI Search