
Vetted Azure Data Factory Professionals

Pre-screened and vetted.

Azure Data Factory, Python, SQL, Docker, CI/CD, Amazon S3

Tyler Swanson

Senior Data Engineer specializing in healthcare ETL/ELT and ML

Pasadena, CA | 12y exp
Doheny Eye Institute | University of Texas at Austin
Amazon EC2, Amazon Kinesis, Amazon Redshift, Amazon S3, Apache Airflow, Apache Kafka, +128 more

Kristopher Ali

Senior Data Engineer specializing in cloud lakehouse platforms and healthcare data

Remote | 13y exp
Deloitte | University of Michigan
Python, SQL, Scala, Bash, Java, R, +180 more

Priya Kotha

Mid-level Data Engineer specializing in real-time pipelines across FinTech and Healthcare

USA | 4y exp
Plaid | Sacred Heart University
Python, SQL, Pandas, NumPy, Apache Spark, PySpark, +75 more

Leela Tikkisetty

Screened

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA | 5y exp
City and County of San Francisco | San Francisco State University

“Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).”

A/B Testing, Agile, Amazon Bedrock, Amazon EKS, Amazon Redshift, Authentication, +198 more

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS | 10y exp
Turing | Mississippi State University

“Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.”

Python, R, SQL, Pandas, NumPy, Scikit-learn, +132 more

Rushi Reddy Lambu

Screened

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

Remote, USA | 5y exp
McKinsey & Company | University of North Texas

“GenAI/LLM engineer and architect who built and deployed a production generative AI financial forecasting and scenario analysis platform at McKinsey, leveraging Claude (Anthropic), LangChain, Airflow, MLflow, and AWS SageMaker. Demonstrates strong LLMOps/MLOps rigor (monitoring, drift detection, automated retraining) and deep experience implementing global privacy controls (GDPR, differential privacy, audit trails) while partnering closely with finance executives and legal/IT stakeholders.”

Python, SQL, R, Java, C++, Bash, +192 more

Travoy Spelling

Screened

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX | 10y exp
Tredence | University of Texas at Austin

“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”

A/B Testing, API Development, AWS, AWS Lambda, AWS Step Functions, Azure Data Factory, +247 more

Sahithi K

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Boston, MA | 4y exp
Moderna | University of Massachusetts Dartmouth

“Data engineer with experience at Moderna and Block owning high-volume (≈10TB/day) production pipelines on AWS, using Kafka/S3/Glue/dbt/Snowflake with strong data quality and observability practices (schema validation, anomaly detection, CloudWatch monitoring). Also built external financial API ingestion with Airflow retries, throttling/token rotation, and schema versioning, and helped stand up an early-stage biomedical data platform with CI/CD and incident debugging.”

Python, SQL, PySpark, Apache Spark, Apache Kafka, Amazon Kinesis, +94 more

Prachi Jain

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

Remote, US | 6y exp
JPMorgan Chase | University of Massachusetts Amherst

“Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.”

Python, C, C++, CUDA, SQL, MATLAB, +129 more

Sai Venkata

Screened

Senior Data Engineer specializing in cloud lakehouse and real-time streaming pipelines

Texas, USA | 6y exp
CVS Health | University of Central Missouri

“Senior data engineer with experience in both healthcare (CVS Health) and financial services (Bank of America), building large-scale Azure lakehouse pipelines (30+ EHR sources, ~5TB) and real-time streaming services (Event Hubs/Kafka) for patient vitals. Strong focus on reliability and data quality (Great Expectations, monitoring/alerting, schema drift automation), with measurable outcomes like 50% runtime reduction and 99%+ uptime for regulatory reporting pipelines.”

Python, SQL, Scala, Java, Shell Scripting, Apache Spark, +117 more

Kevin Lim

Screened

Intern Software Engineer specializing in data science and machine learning

Remote | 2y exp
StylistGem | UC Berkeley

“Backend engineer with hands-on experience building Flask REST APIs (auth, CRUD, S3 media uploads) and driving measurable Postgres/SQLAlchemy performance gains (p95 reduced to 200–400ms by eliminating N+1s and switching to keyset pagination). Implemented multi-tenant isolation with strict tenant scoping plus Postgres RLS, and built an OpenAI-powered quiz generation pipeline using queued workers, structured JSON outputs, and Celery/Redis optimizations to stabilize high-throughput workloads.”

API Development, AWS, Azure Functions, CI/CD, Cloud Computing, CSS, +108 more

Praveen Nutulapati

Screened

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY | 6y exp
JPMorgan Chase | University of Central Missouri

“Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.”

A/B Testing, Agile, Amazon Bedrock, Amazon EC2, Amazon EMR, Amazon RDS, +184 more

Sandeep Reddy Karumudi

Screened

Mid-level Data & Business Analyst specializing in analytics engineering and BI

6y exp
Adobe | University of Wisconsin–Madison

“Data/analytics professional with experience across manufacturing and enterprise environments (Wisconsin School of Business project with CNH Industrial; roles/projects at Ascensia Technologies, S&C, and Adobe). Has hands-on work combining warranty/lifecycle tables with technician free-text notes using TF-IDF + tree models (XGBoost/Random Forest), and deep experience in entity resolution/reconciliation across mismatched financial systems using Python/SQL and fuzzy matching, with production-grade pipeline practices in Azure Data Factory/Databricks.”

Python, Pandas, NumPy, scikit-learn, R, SQL, +119 more

Palak Siroya

Screened

Senior Site Reliability Engineer specializing in Azure cloud reliability and data analytics

Renton, WA | 10y exp
Microsoft | Central Washington University

“AppSec-focused customer advisor with hands-on experience integrating SAST/DAST/SCA into production CI/CD (Azure DevOps) and designing secure agent/scanning deployments in AWS (least-privilege IAM, private subnets, VPC endpoints). Demonstrates strong incident troubleshooting using logs/metrics/traces to diagnose load-related failures (timeouts/retry storms) and drive durable fixes, while tailoring risk/tradeoff communication across engineering, security, and leadership stakeholders.”

Automation, Azure Data Factory, Azure DevOps, Azure SQL Database, CI/CD, C, +125 more

Apoorva Nanabolu

Screened

Senior Data Scientist / Generative AI Engineer specializing in fraud, risk, and MLOps

5y exp
PayPal | University of New Haven

“Built and deployed a production LLM/RAG fraud investigation system to replace manual investigator workflows, combining transaction data, historical cases, and policy documents with agent-style steps and LoRA fine-tuning. Demonstrates strong reliability engineering (grounding, citations, abstention paths), performance optimization (retrieval/indexing/caching), and end-to-end MLOps orchestration using Azure ML Pipelines/MLflow plus Kubernetes/Argo with canary and rollback deployments.”

Python, R, SQL, NoSQL, Snowflake, BigQuery, +178 more

Vishnu Varma

Screened

Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps

Milpitas, California | 8y exp
Databricks | Campbellsville University

“AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.”

Python, SQL, PySpark, Bash, TensorFlow, PyTorch, +106 more

Keerthana Tammina

Screened

Mid-level Data Scientist specializing in machine learning and generative AI

Saint Louis, MO | 5y exp
DoorDash | Saint Louis University

“ML/LLM engineer who has shipped a production transformer-based document understanding system on AWS, owning the full pipeline from domain fine-tuning to Dockerized CI/CD deployment. Demonstrates strong production rigor—latency optimization (distillation/quantization, async batching, autoscaling), orchestration with Airflow/Step Functions/Azure Data Factory, and monitoring/drift detection—plus experience translating ops stakeholder needs into adopted AI automation via dashboards.”

Agile, Amazon Redshift, Amazon S3, Amazon SageMaker, Anomaly Detection, Apache Hadoop, +157 more

Milan Gurumurthy

Mid-level Data Scientist specializing in fraud detection, NLP/LLMs, and MLOps

USA | 5y exp
JPMorgan Chase | Northeastern University
Python, Pandas, NumPy, SciPy, Matplotlib, Plotly, +71 more

Sahithi Tummala

Mid-level AI Engineer specializing in Generative AI, LLMs, and RAG systems

Dallas, TX | 6y exp
Newmark | University of North Texas
A/B Testing, Amazon Bedrock, Amazon DynamoDB, Amazon EC2, Amazon ECS, Amazon EKS, +159 more

Saiteja Gaddam

Mid-Level Data Engineer specializing in cloud data platforms and streaming analytics

3y exp
Intuit | University at Buffalo
Scala, Hibernate, JDBC, JSON, HTML, CSS, +94 more

Sai Kurra

Mid-level AI/ML Engineer specializing in NLP/LLMs and real-time data pipelines

Remote, USA | 4y exp
Plaid | George Mason University
Python, Pandas, spaCy, Scikit-learn, SQL, PySpark, +138 more

Vinay Kumar

Mid-level Full-Stack Developer specializing in cloud-native microservices

NJ, USA | 4y exp
Uber | University of Texas at Arlington
Python, Django, Flask, FastAPI, Java, Spring Boot, +147 more