Vetted Apache Airflow Professionals

Pre-screened and vetted.

SZ

Junior AI/Backend Software Engineer specializing in ML and scalable systems

Dallas, TX2y exp
PMGUniversity of Maryland, College Park

Backend engineer with strong AWS/CI/CD experience (multi-repo deployments, Lambda + core app, immutable ECR and image promotion) and a published master’s thesis building an ML framework for Solar PV energy prediction and CO2 reduction impact modeling using ensemble and meta-learning approaches benchmarked against SAM.

View profile
UK

Mid-level Generative AI Engineer specializing in LLM agents and RAG systems

4y exp
Capital OneLindsey Wilson College

Built and deployed a production LLM/RAG knowledge assistant integrating internal docs, wikis, and ticket histories to reduce tribal-knowledge dependency and repetitive questions. Emphasizes reliability via grounding + a validation layer, and achieved major latency gains (>50%) through vector index optimization, caching, quantization, and selective re-validation. Comfortable orchestrating end-to-end LLM/data workflows with Airflow, Prefect, and Dagster, including monitoring and alerting.

View profile
BG

Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI

Baltimore, MD17y exp
IntelIllinois Institute of Technology

ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness—data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).

View profile
HG

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

NJ, USA4y exp
Red HatOklahoma Christian University

Red Hat ML/LLM engineer who designed and deployed a production LLM-powered customer support automation system using RAG, improving latency by 30% via PEFT and vector search optimization. Built security and governance into retrieval (access-level filtering, encrypted Pinecone/ChromaDB) and delivered SHAP-based explainability via a dashboard for non-technical stakeholders. Experienced orchestrating distributed ML/RAG pipelines across AWS SageMaker and OpenShift with Airflow/Prefect, plus multi-agent workflows using CrewAI and LangGraph.

View profile
SM

Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps

New York, NY6y exp
CitigroupKent State University

Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).

View profile
SA

Samuel Audu

Screened

Staff Platform Engineer specializing in multi-cloud platforms and internal developer portals

Dallas, TX8y exp
Dell TechnologiesNew Mexico State University

Infrastructure reliability/capacity-focused engineer with hands-on IBM Power/AIX (LPAR/DLPAR, HMC, VIOS) performance troubleshooting and modern cloud-native delivery experience. Built production CI/CD and Terraform-managed AWS/EKS environments, and has led real incident recoveries spanning Kubernetes autoscaling and AWS quota constraints with concrete RCA and prevention improvements.

View profile
SG

Mid-level AI/ML Engineer specializing in GenAI, LLMs, RAG, and MLOps

St. Louis, MO5y exp
CenteneSaint Louis University

Built and deployed a production LLM-powered RAG document intelligence/Q&A system for healthcare prior authorization, reducing manual medical document review time and improving decision efficiency. Strong in end-to-end LLM application engineering (LangChain/LangGraph), retrieval quality improvements (hybrid search, embedding tuning, chunking strategies), and rigorous evaluation/monitoring for reliability.

View profile
YL

Yu Liu

Screened

Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms

New York, NY17y exp
CitigroupUniversity of Missouri

Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.

View profile
NM

Mid-level Data Engineer specializing in Analytics & AI/ML

Virginia, USA6y exp
SonyFitchburg State University

Data engineer with experience at Sony and Walmart building high-volume, near-real-time analytics and ingestion systems. Has owned end-to-end pipelines from Kafka/Spark streaming through S3/Parquet and Redshift/Looker, emphasizing data quality (Great Expectations), observability (CloudWatch/Azure Monitor), and reliability (Airflow SLAs, retries, checkpointing), including measurable performance and latency improvements.

View profile
BS

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA8y exp
First National BankTexas A&M University-Corpus Christi

Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.

View profile
Abhinav Gupta - Junior Machine Learning Engineer specializing in LLMs and applied data science

Abhinav Gupta

Screened

Junior Machine Learning Engineer specializing in LLMs and applied data science

2y exp
EsriUSC

Built and shipped multiple production AI systems, including Auto DocGen (LLM-generated OpenAPI docs kept in sync via AST diffs, schema-constrained generation, and CI/CD on Render) and a multimodal sign-language recognition pipeline at USC orchestrated with FastAPI, MediaPipe, and PyTorch. Also partnered with Esri’s non-technical community team to fine-tune an LLaMA-based spam classifier with a review UI, cutting moderation time by 70%.

View profile
Nikhil Soni - Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation in New York, NY

Nikhil Soni

Screened

Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation

New York, NY2y exp
Quant AI ResearchNYU

Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.

View profile
Sanjana Duvva - Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp
Wells FargoUniversity of North Texas

Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.

View profile
Nishantkumar Asodariya - Mid-level Supply Chain Analyst specializing in global logistics automation and forecasting in USA

Mid-level Supply Chain Analyst specializing in global logistics automation and forecasting

USA4y exp
HoneywellIndiana Wesleyan University

Built and shipped a production LLM-powered recruiting workflow that ranks resumes against job descriptions, generates evidence-based justifications, and finds "hidden fit" candidates using embeddings + RAG. Demonstrates strong production engineering around hallucination control, latency, and predictable LLM cost management (budget checks, top-K pruning, tenant caps), plus orchestration experience with Airflow/Prefect/Kubernetes and a structured evaluation/monitoring methodology for AI agents.

View profile
Manasa Mangipudi - Mid-level Machine Learning Engineer specializing in NLP and computer vision

Mid-level Machine Learning Engineer specializing in NLP and computer vision

3y exp
Columbia UniversityRutgers University–New Brunswick

AI/ML engineer with production experience building an LLM-powered resume-to-job matching and feedback product using RAG, with a strong focus on latency, hallucination control, and scalable deployment. Experienced orchestrating ML inference and backend services on Kubernetes and applying rigorous evaluation/guardrail practices; also partnered with business/product stakeholders at Walmart to improve an NLP-based supplier support system.

View profile
Bhanu Prakash Reddy Dakilli - Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing in Framingham, MA

Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing

Framingham, MA4y exp
Bank of AmericaNew England College

Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.

View profile
DM

Mid-level Data Engineer specializing in real-time analytics and regulated domains

NC, USA5y exp
JPMorgan ChaseSaint Louis University

Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable gains in latency, fraud accuracy, false positives, and analyst decision speed.

View profile
Harrishkumar Loganathan - Mid AI/Machine Learning Engineer specializing in FinTech and Generative AI in Remote, USA

Mid AI/Machine Learning Engineer specializing in FinTech and Generative AI

Remote, USA3y exp
SocureArizona State University

AI/ML engineer with hands-on ownership of enterprise LLM deployments at Freshworks, including a large-scale RAG chatbot serving 15,000+ users across six departments. Stands out for combining deep production engineering skills—AWS microservices, Kubernetes, observability, retrieval quality, and faithfulness evaluation—with strong cross-functional stakeholder leadership and prior large-scale fraud data pipeline experience at Socure.

View profile
AJ

Mid-level AI/ML Engineer specializing in generative AI, NLP, and MLOps

San Jose, CA4y exp
ServiceNowUniversity of North Carolina at Charlotte

ML/AI engineer with hands-on ownership of production GenAI and computer vision systems, spanning experimentation, deployment, monitoring, and iterative optimization. Stands out for shipping an enterprise RAG platform that cut manual review by 50% and a defect detection pipeline that reduced report generation from 15 minutes to under 1 second while maintaining high uptime and strong operational discipline.

View profile
DD

Drew Dunn

Screened

Senior AI Engineer specializing in generative AI and production ML systems

Aledo, TX14y exp
Elevance HealthTexas Tech University

ML/AI engineer with hands-on ownership of production computer vision, speech, and legal RAG systems. Notably improved a key-duplication CV pipeline enough to unblock commercial launch and remove specialist manual measurement, and also shipped a live Quran recitation detection feature for a product with 1M+ users.

View profile
Wei-Hsien Wang - Entry-level AI Engineer specializing in full-stack generative AI systems in San Jose, CA

Entry-level AI Engineer specializing in full-stack generative AI systems

San Jose, CA1y exp
AzazieUC San Diego

AI/full-stack product engineer who has shipped both user-facing and internal LLM products, from a photo-to-music recommendation app to an experimentation agent at Azazie. Stands out for combining modern app development with production-grade agent and GraphRAG systems, including a 500k+ email analysis platform and measurable impact like 3x experiment velocity, 75% setup-time reduction, and 65% faster task discovery.

View profile
SB

Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing

Boston, USA3y exp
Fidelity InvestmentsNortheastern University

Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.

View profile
JS

Jash Shah

Screened

Mid-level Data Scientist specializing in LLMs, MLOps, and predictive analytics in healthcare and finance

New Jersey, USA4y exp
Johnson & JohnsonStevens Institute of Technology

Built and deployed a production LLM/RAG clinical decision support system that enables real-time semantic search over unstructured EHR notes and delivers patient risk insights. Strong in healthcare-grade MLOps and compliance (HIPAA, PHI handling, encryption, RBAC, audit logs) and scaled embedding/retrieval pipelines using Spark/Databricks and Airflow. Partnered with clinicians via Power BI dashboards and explainability, contributing to an 18% reduction in patient readmissions.

View profile

Need someone specific?

AI Search