Vetted Azure Data Factory Professionals

Pre-screened and vetted.

PK

priya kotha

Screened

Mid-level Data Engineer specializing in real-time pipelines across FinTech and Healthcare

USA, USA4y exp
PlaidSacred Heart University

Data engineer at Plaid who built greenfield, end-to-end real-time transaction pipelines and FastAPI data services for fraud detection and analytics, handling millions of events per day. Strong focus on reliability and data integrity via Great Expectations validation, Airflow-based monitoring/SLAs, quarantine/staging patterns, and robust external data ingestion with schema versioning and backfills (reported 50% fewer anomalies and ~40% fewer failures).

View profile
PK

Mid-level Machine Learning Engineer specializing in Generative AI and real-time ML systems

California, USA4y exp
UberUniversity of North Texas

ML/GenAI engineer with hands-on experience shipping LLM-powered support systems at Uber, including real-time feedback analysis, ticket summarization, and retrieval-grounded knowledge systems. Stands out for combining fine-tuning, RAG, safety evaluation, and production optimization to drive measurable support outcomes like faster handling times, better resolution rates, and lower latency/cost.

View profile
CP

Senior Software Engineer specializing in .NET microservices for Healthcare IT and FinTech

Dallas, TX10y exp
athenahealthGeorgia Tech
View profile
DS

Senior Data Engineer specializing in cloud data platforms and scalable ETL pipelines

Rosharon, TX11y exp
AssistRxUniversity of Texas at Austin
View profile
BK

Mid-level Data Engineer specializing in real-time streaming and ML feature pipelines

Atlanta, GA6y exp
LyftGrand Valley State University
View profile
SS

Mid-level Data Analyst specializing in cloud analytics and BI

Chicago, IL5y exp
AirbnbLewis University
View profile
AU

Mid-level AI/ML Engineer specializing in generative AI and data engineering

Chicago, IL3y exp
Hugging FaceIllinois Institute of Technology
View profile
SK

Mid-level AI/ML Engineer specializing in production ML, NLP, and computer vision

USA6y exp
UberUniversity of Maryland, Baltimore County
View profile
SR

Mid-level Data Engineer specializing in cloud lakehouse and streaming analytics

Remote, US4y exp
RampUniversity of Colorado Boulder
View profile
TS

Senior Data Engineer specializing in healthcare ETL/ELT and ML

Pasadena, CA12y exp
Doheny Eye InstituteUniversity of Texas at Austin
View profile
KA

Senior Data Engineer specializing in cloud lakehouse platforms and healthcare data

Remote13y exp
DeloitteUniversity of Michigan
View profile
VS

Mid-level AI Data Engineer specializing in real-time streaming and LLM-powered fraud analytics

California, USA6y exp
PayPalCalifornia State University, East Bay
View profile
ZJ

Senior Software Engineer specializing in Healthcare IT and cloud-native microservices

Wesley Chapel, FL11y exp
OptumGeorgia Tech
View profile
VK

Senior Data Analyst specializing in healthcare and financial analytics

Miami, FL5y exp
AmgenTrine University
View profile
WK

Staff Full-Stack Software Engineer specializing in cloud-native microservices

Dallas, TX10y exp
JetBlueGeorgia Tech
View profile
TM

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Austin, TX11y exp
Accenture
View profile
LT

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA5y exp
City and County of San FranciscoSan Francisco State University

Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).

View profile
SK

Sahithi K

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Boston, MA4y exp
ModernaUniversity of Massachusetts Dartmouth

Data engineer with experience at Moderna and Block owning high-volume (≈10TB/day) production pipelines on AWS, using Kafka/S3/Glue/dbt/Snowflake with strong data quality and observability practices (schema validation, anomaly detection, CloudWatch monitoring). Also built external financial API ingestion with Airflow retries, throttling/token rotation, and schema versioning, and helped stand up an early-stage biomedical data platform with CI/CD and incident debugging.

View profile
LM

Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines

Austin, TX5y exp
eBayTexas Tech University

Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.

View profile
Travoy Spelling - Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP in Texarkana, TX

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX10y exp
TredenceUniversity of Texas at Austin

ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).

View profile
Byron Pineda - Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps in Pascagoula, MS

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS10y exp
TuringMississippi State University

Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.

View profile
Saiteja Gaddam - Mid-Level Data Engineer specializing in cloud data platforms and streaming analytics

Mid-Level Data Engineer specializing in cloud data platforms and streaming analytics

3y exp
IntuitUniversity at Buffalo

Data engineer (Intuit) who owned an end-to-end telemetry and subscription analytics platform processing ~22M events/day, built on Kinesis/S3/Glue/Spark/Airflow/Redshift. Strong focus on reliability and data quality (schema drift controls, quarantine layers, idempotent reruns) and performance tuning, achieving a reporting latency reduction from ~15 minutes to under 4 minutes while enabling revenue and churn analytics for business teams.

View profile

Need someone specific?

AI Search