Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Airflow Professionals

Pre-screened and vetted.

Apache Airflow Python Docker SQL AWS CI/CD

Ankush Banthia

Screened

Senior Data & Platform Engineer specializing in cloud-native streaming and distributed systems

USA10y exp

JPMorgan ChaseNew York Institute of Technology

“Financial data engineer who has built and operated high-volume batch + streaming pipelines (200–300 GB/day; 5–10k events/sec) using AWS, Spark/Delta, Airflow, Kafka, and Snowflake, with strong emphasis on data quality and reliability. Demonstrated measurable impact via 99.9% SLA adherence, major reductions in bad records/nulls, MTTR improvements, and significant latency/runtime/query performance gains; also built a distributed web-scraping system processing 5–10M records/day with anti-bot and schema-drift defenses.”

Team Building Onboarding Mentoring Agile Scrum Jira+150

View profile

Madhupal Singu

Screened

Mid-level Data Engineer specializing in multi-cloud data platforms for healthcare and finance

USA6y exp

CignaUniversity of Cincinnati

“Data engineer with Cigna experience building and operating an end-to-end AWS-based healthcare claims pipeline processing ~2TB/day, using Glue/Kafka/PySpark/SQL into Redshift. Strong focus on data quality and reliability (schema validation, monitoring/alerting, retries/checkpointing/backfills), reporting improved accuracy (~99%) and reduced latency, plus experience serving real-time Kafka/Spark data to downstream analytics with documented data contracts.”

Python Pandas PySpark SQL Scala Java+88

View profile

Abhishek Gawali

Screened

Mid-level Data Engineer specializing in cloud ETL and real-time streaming

New York, NY6y exp

PNCRochester Institute of Technology

“Data engineer focused on AWS + Spark/Databricks pipelines, including an end-to-end nightly loan-data ingestion flow (~2.2M records) from Postgres/S3 through Glue and Databricks into a DWH with layered validation and alerting. Also built real-time streaming with Kafka + Spark Structured Streaming and a master’s project streaming Reddit data for sentiment analysis under ambiguous requirements and tight budget constraints.”

SDLC Agile Waterfall Python SQL R+105

View profile

Rohan Varma Bandari

Screened

Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG

USA4y exp

Wells FargoUniversity of North Texas

“Built production LLM + hybrid RAG and multi-agent orchestration systems at Wells Fargo to automate complaint document/audio transcript understanding and categorization, addressing vocabulary drift via embedding + vector index updates instead of frequent retraining. Strong in LLM workflow reliability (testing/benchmarks/observability) and stakeholder-facing delivery with explainability (citations/SHAP-style justifications) and Tableau dashboards.”

Python SQL Jupyter Notebook Amazon SageMaker Visual Studio Code NumPy+128

View profile

Harideep Balusa

Screened

Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems

USA6y exp

Freddie MacUniversity of Wisconsin

“Built and productionized Azure-based LLM/RAG systems for regulatory/compliance use cases, including automating analyst research and compliance report generation across large unstructured document sets. Demonstrates strong practical depth in hallucination mitigation, hybrid retrieval tuning (BM25 + embeddings), and production MLOps (Databricks, Cognitive Search, AKS, Airflow/MLflow), plus proven ability to deliver auditable, explainable solutions with non-technical compliance teams.”

Python R SQL Scala Machine Learning Deep Learning+125

View profile

Shivani Sharma

Screened

Mid-Level Software Engineer specializing in Cloud, DevOps, and MLOps

Boston, MA3y exp

Northeastern UniversityNortheastern University

“Built and productionized a recommendation system from notebook prototype into a low-latency, scalable Cloud Run service using Docker, FastAPI, Terraform, CI/CD (GitHub Actions), and MLOps tooling (Vertex AI, MLflow). Experienced diagnosing real-time workflow issues using structured logging/ELK and GCP metrics, including resolving intermittent 504s by fixing unbounded SQL and adding caching. Also partners with sales/customer teams (Wasabi) to deliver tailored demos, troubleshoot, and drive onboarding/adoption.”

AWS AWS Lambda Azure Functions BigQuery CI/CD C#+96

View profile

Srilekha Jakkula

Screened

Senior Data Engineer specializing in scalable data pipelines and API-driven data services

Chicago, IL5y exp

Northern TrustNorthern Illinois University

“Data engineer focused on building scalable, reliable end-to-end data pipelines and backend REST data services, spanning API ingestion plus batch/stream processing with Airflow, Kafka, Spark/PySpark, and SQL. Emphasizes strong data quality validation, monitoring/fault tolerance, and performance tuning for large datasets, with experience deploying in cloud environments using containerization and CI/CD.”

Python SQL REST APIs API Integration JSON XML+51

View profile

keerthana s

Screened

Mid-level Backend Software Engineer specializing in Python/FastAPI on AWS

Los Angeles, California4y exp

McKessonUniversity of North Texas

“Backend engineer with healthcare domain experience building AI-driven radiology workflow systems. Evolved tightly coupled APIs into secure, reliable FastAPI-based services by moving heavy imaging/data processing into idempotent asynchronous pipelines with retries, feature-flagged incremental rollout, and strong data-integrity controls (constraints, backfills, validation). Strong focus on defense-in-depth security for sensitive patient data (OAuth2/JWT, RBAC, and database-level protections).”

Python JavaScript C C++C#PL/SQL+119

View profile

Srijitha Katkuri

Screened

Mid-level Data Analyst specializing in healthcare and business intelligence

Michigan, USA4y exp

Banner HealthTrine University

“Healthcare analytics candidate with hands-on experience turning messy EHR, billing, and operational data into validated SQL datasets and automated Python/Airflow pipelines. They appear strongest in hospital KPI reporting—especially length of stay, readmissions, retention, and bed utilization—and have owned projects from metric definition through Power BI delivery and impact measurement.”

SQL Python Pandas NumPy Power BI Tableau+70

View profile

Cia Hang

Screened

Executive IT and Operations leader specializing in digital transformation and security

Chicago, IL13y exp

Halo Branded SolutionsCentral Michigan University

“Candidate is very familiar with the venture capital and broader investment landscape, but is not interested in founding a company. They have worked with several TPG-backed or TPG-owned organizations, helping drive business scaling, cost reduction, and execution against investor governance requirements.”

Operations Management Strategic Planning P&L Management Budget Management Project Management Performance Management+571

View profile

Alekya Battu

Screened

Mid-level Data Scientist specializing in machine learning, MLOps, and cloud analytics

USA5y exp

Wells FargoWilmington University

“Senior data scientist with ~5 years’ experience building production ML/NLP systems in finance (Wells Fargo) and deep learning for sensor analytics in connected vehicles (Medtronic). Has delivered end-to-end platforms combining time-series forecasting with transformer-based NLP, including automated drift monitoring/retraining (MLflow + Airflow) and standardized Docker/CI/CD deployments; achieved a reported 22% precision improvement after domain fine-tuning.”

Python SQL R Classification XGBoost Random Forest+171

View profile

Steven Fritsch

Screened

Principal Product Manager specializing in AI and document intelligence

New York, NY12y exp

DatasiteNew York City College of Technology (CUNY)

“Enterprise product leader with significant experience turning AI experiments into scalable, workflow-native capabilities at Datasite, especially in high-stakes M&A environments. Stands out for combining strong AI product strategy with nuanced UX judgment, emphasizing trust, transparency, and human-in-the-loop design over flashy automation.”

Generative AI LLM Integration Agentic AI Predictive Analytics Product Strategy Agile+75

View profile

Hard Parikh

Screened

Mid-level Software Engineer specializing in data platforms, distributed systems, and applied AI

Austin, TX3y exp

Compass GroupUC Riverside

“AI/full-stack product engineer currently owning Fleck Intelligent Survey Chatbot at E15, a production RAG analytics assistant embedded in Compass Group dashboards for 300+ field operators. Stands out for combining LLM orchestration, analytics engineering, and strong systems thinking—cutting hallucinated numeric answers from 14% to 2%, reducing backlog 62%, and previously delivering a low-level protocol redesign at Amadeus that cut P99 latency by 56%.”

Python SQL C++Java TypeScript JavaScript+113

View profile

Sai Prasad Gudari

Screened

Junior Software Engineer specializing in backend systems and AI infrastructure

Redwood City, CA2y exp

WindBorne SystemsEmory University

“Backend/full-stack engineer with deep experience building weather and geospatial data systems at WindBorne, spanning Next.js/TypeScript frontends through PostgreSQL, Redis, Sidekiq, Rails, Rust, and object-storage-backed forecast pipelines. Particularly strong in production reliability work—self-healing jobs, zero-downtime migrations, query/index optimization, and event-driven ingestion architectures that reduce latency and operational waste.”

Python Ruby Rust Java SQL TypeScript+132

View profile

Rohan Reddy

Screened

Mid Software Engineer specializing in Python backend systems for FinTech

Kansas City, MO3y exp

State StreetUniversity at Buffalo

“Full-stack Python engineer who has owned internal automation products from requirements through production, including a financial reporting platform that improved deployment time by 45% and raised reporting efficiency to 98%. Also built an AI-powered movie recommendation engine using collaborative and content-based filtering, with hands-on experience across frontend, backend, data pipelines, and ML evaluation.”

Python Flask Django FastAPI SQLAlchemy Celery+97

View profile

Ramin Mohammadi

Screened

Principal AI/ML Leader specializing in Generative AI, MLOps, and NLP

CA, USA11y exp

iBase-tNortheastern University

“Founding member of Tausight, building AI systems to detect and protect PHI for healthcare organizations; helped take the company through post–Series A funding and exited after ~6 years. Drove a strategic collaboration with Intel’s OpenVINO team—becoming the first to deploy it in a real production system and improving model performance by ~30% on customer Intel-CPU machines.”

A/B Testing Agentic AI Anomaly Detection Change Management CI/CD Classification+149

View profile

BHEEMA SABILLA

Screened

Mid-level Data Engineer specializing in Lakehouse, Streaming, and ML/LLM data systems

Remote, USA3y exp

DiscoverUniversity of South Dakota

“Built and productionized an enterprise retrieval-augmented generation platform for internal knowledge over large unstructured corpora, emphasizing trust via strict citation/grounding and hybrid retrieval (BM25 + FAISS + cross-encoder re-ranking). Demonstrates strong scaling and cost/latency optimization through incremental indexing/embedding and index partitioning, plus disciplined evaluation/observability practices. Has experience operationalizing pipelines with Airflow/Databricks/GitHub Actions and partnering closely with risk & compliance stakeholders on auditability requirements.”

Python PySpark SQL Scala Pandas NumPy+157

View profile

Thrinesh Thode

Screened

Mid-level AI/ML Engineer specializing in MLOps and LLM applications

New York, NY4y exp

BNY MellonUniversity at Albany

“BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.”

A/B Testing Agentic AI Apache Kafka Apache Spark AWS AWS Lambda+86

View profile

Koushik Gunjala

Screened

Senior AI Engineer specializing in Agentic AI and distributed systems

Charlotte, NC4y exp

UnitedHealth GroupUniversity of North Carolina at Charlotte

“LLM/agentic workflow engineer with healthcare domain experience who built a HIPAA-compliant multi-agent RAG system for clinical review automation at UnitedHealth Group, achieving 92% precision and cutting latency 40% through async orchestration and Redis semantic caching. Also has strong data engineering orchestration background (Airflow on AWS EMR with Great Expectations) and a proven clinician-in-the-loop feedback process that improved model faithfulness by 18%.”

Agentic AI Distributed Systems Retrieval-Augmented Generation (RAG)GPT-4 LangChain LangGraph+95

View profile

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS3y exp

HCA HealthcareVNR Vignana Jyothi Institute of Engineering and Technology

“Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.”

Agile AWS Azure DevOps Azure Functions Azure Machine Learning Bash+131

View profile

Akshata Vijay Kulkarni

Screened

AI & Full-Stack Software Engineer specializing in LLM-powered applications

Atlanta, GA4y exp

PRGXArizona State University

“Full-stack engineer focused on productionizing LLM applications, including an Android privacy-policy risk summarization app (Kotlin/React Native + FastAPI + Ollama) that cut response times from ~10s to ~5–6s via batching, caching, async, and event-driven architecture. Currently at PRGX building an LLM-based legal contract clause extraction system, partnering closely with legal/procurement SMEs to create schemas, labeled datasets, and evaluation pipelines that improved accuracy from 70% to 85%. Also has experience architecting real-time voice/LLM systems with streaming microservices (Kafka, Kubernetes, gRPC/WebSockets) and an avatar chatbot pipeline (TalkingHead, Google TTS, AnythingLLM).”

Python JavaScript TypeScript Java SQL C+++95

View profile

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG systems, and real-time risk modeling

PA, USA4y exp

Capital OneRobert Morris University

“AI/ML Engineer with 4+ years of experience (Capital One, Odin Technologies) and a master’s in Data Analytics (4.0 GPA) who has deployed LLM/RAG systems to production for compliance/risk and document review. Strong in orchestration and MLOps (Airflow, Kubernetes, MLflow, GitHub Actions) and in tackling real-world LLM constraints like latency, context limits, and data privacy, with measurable impact (20%+ manual review reduction; 33% faster release cycles).”

Agentic AI Anomaly Detection Apache Hadoop Apache Hive Apache Kafka Apache Spark+115

View profile

Vardhan Addakattu

Screened

Mid-level Data Scientist specializing in Generative AI and NLP for financial risk

Glassboro, NJ4y exp

S&P GlobalRowan University

“Built and shipped production generative AI/RAG assistants in regulated financial contexts (S&P Global), automating compliance-oriented Q&A over earnings reports/filings with grounded answers and citations. Experienced across the full stack—AWS-based ingestion (PySpark/Glue), vector retrieval + LangChain agents, GPT-4/Claude model selection, and production reliability (monitoring, caching, retries) plus rigorous evaluation and regression testing.”

Python R SQL PySpark Pandas Apache Spark+111

View profile

Hema Edavalapati

Screened

Mid-level AI/ML Engineer specializing in cloud data engineering and GenAI

Florida, USA6y exp

LexisNexisUniversity of South Florida

“AI/LLM engineer with production experience in legal tech: built a GPT-4 + LangChain RAG summarization system at Govpanel that reduced legal case-file review time by 50%+. Previously at LexisNexis, orchestrated end-to-end Airflow data/AI pipelines processing 5M+ legal documents daily, improving ETL runtime by 35% with robust validation, monitoring, and SLAs.”

SQL SQL query optimization Python Pandas NumPy PySpark+159

View profile

Machine Learning Engineers Software Engineers Data Scientists Data Engineers AI Engineers Data Analysts AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?