Vetted Azure Data Factory Professionals

Pre-screened and vetted.

sai Pavan - Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines

sai Pavan

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines

5y exp
American Family InsuranceGeorge Mason University

Built a production, real-time insurance claims document-understanding and fraud-detection pipeline using TensorFlow + fine-tuned BERT, deployed on AWS (SageMaker/Lambda/API Gateway) with automated retraining via MLflow and Jenkins. Addressed noisy documents and latency using augmentation and model distillation (3x faster), cutting claims ops manual review by ~50% and reducing fraudulent payouts.

View profile
Teja Babu Mandaloju - Mid-level Data Scientist/MLOps Engineer specializing in NLP, GenAI, and cloud ML platforms in Chicago, USA

Mid-level Data Scientist/MLOps Engineer specializing in NLP, GenAI, and cloud ML platforms

Chicago, USA5y exp
VosynUniversity of North Texas

AI/ML engineer who led production deployment of a multimodal (text/video/image) RAG system on GCP using Gemini 2.5 + Vertex AI Vector Search, scaling to 10M+ documents with sub-second latency and +40% retrieval accuracy. Strong MLOps/orchestration background (Kubernetes, CI/CD, Airflow, MLflow) with proven impact on reliability (75% fewer incidents) and deployment speed (92% faster), plus experience delivering explainable ML (XGBoost + SHAP + Tableau) to non-technical retail stakeholders.

View profile
Phanideep P - Senior Data Engineer specializing in cloud lakehouse and streaming data platforms

Phanideep P

Screened

Senior Data Engineer specializing in cloud lakehouse and streaming data platforms

5y exp
Cadence BankWright State University

Data platform/data engineer with cross-industry experience in banking and healthcare, building cloud-native lakehouse architectures across AWS/Azure/GCP. Has owned high-volume (millions of records; TB/day) pipelines with strong data quality automation (dbt/Great Expectations), observability (Grafana/Prometheus), and real-time streaming (Kafka/Spark) for fraud monitoring; also delivered an early-stage migration from SQL Server to BigQuery with 40% batch latency reduction.

View profile
MR

Senior DevSecOps Engineer specializing in Azure cloud infrastructure and CI/CD

Virginia, USA8y exp
OdysseyReUniversity of Dayton

GCP-focused database/infrastructure engineer with hands-on production support for Cloud SQL and Firestore, spanning provisioning, IAM, scaling, backups, and performance tuning. They also described supporting a hybrid GCP architecture for a monolithic on-prem PostgreSQL workload and resolving a major latency incident by tracing cascading failures and fixing indexing issues.

View profile
Kevin Delong - Senior AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems in Irvine, CA

Kevin Delong

Screened

Senior AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems

Irvine, CA12y exp
StfineTechLawrence Technological University

AI/ML engineer with hands-on experience shipping production systems across fintech, travel, and legal use cases. They’ve built end-to-end chatbot, generative content, and RAG solutions on AWS with CI/CD, monitoring, and guardrails, including a loan application platform that generated $3,000 in sales in its first month.

View profile
SR

Shruti Rawat

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services

Jersey City, NJ4y exp
State StreetPace University

Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.

View profile
Ramya Konda - Mid-level AI/ML Engineer specializing in healthcare ML and generative AI in Remote, USA

Ramya Konda

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and generative AI

Remote, USA5y exp
HumanaUniversity of New Haven

AI/LLM engineer at Humana who built and deployed a HIPAA-aware RAG system for clinical record retrieval, cutting search time dramatically and improving retrieval efficiency by 30%. Experienced with Spark-scale data preprocessing, QLoRA fine-tuning, LangChain orchestration, and MLflow+SageMaker integration, with a strong testing/evaluation discipline (A/B tests, human eval) to hit 95%+ accuracy and production latency targets.

View profile
Nikhil Chagi - Intern Data Analyst specializing in data pipelines and LLM/RAG applications in San Francisco, CA

Nikhil Chagi

Screened

Intern Data Analyst specializing in data pipelines and LLM/RAG applications

San Francisco, CA1y exp
CignaUniversity of North Texas

Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.

View profile
SC

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

Atlanta, GA4y exp
Universal Health ServicesUniversity of New Haven

Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.

View profile
HK

Mid-Level Full-Stack .NET Developer specializing in cloud-native microservices and AI integration

Orlando, FL4y exp
State of FloridaFlorida International University

Software engineer with hands-on experience building and maintaining a React accessibility utility/component library (open-source-style) used across university portals, emphasizing WCAG 2.2 compliance, robust focus/keyboard behavior, and Jest/React Testing Library coverage. Also built and maintained .NET Core microservices at the Florida Department of Transportation, including integrating AI-driven features, with strong ownership around observability, incident response, and performance-focused refactoring.

View profile
TP

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp
W. R. BerkleySacred Heart University

Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.

View profile
YP

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp
Humanitarians.AINortheastern University

AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).

View profile
AM

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices

Sanford, FL4y exp
HCLTechUniversity of Massachusetts Lowell

Backend engineer with cloud-native Python/Flask experience building high-throughput financial platforms (loan origination intelligent document processing and real-time fraud detection). Has scaled microservices on AKS with event-driven Azure messaging, delivered measurable performance gains (e.g., 700ms→180ms query latency; ~40% API improvements), and implemented strong security controls (OAuth2/JWT, Azure AD RBAC, audit logging, AES-256/TLS) for sensitive regulated data.

View profile
Snehitha Penumaka - Mid-level AI/ML Engineer specializing in predictive modeling and cloud ML pipelines in Dallas, TX

Mid-level AI/ML Engineer specializing in predictive modeling and cloud ML pipelines

Dallas, TX3y exp
Cambard LLCUniversity of Texas at Dallas

LLM engineer/data engineer who has deployed production RAG systems for internal-document Q&A, building end-to-end ingestion, embedding, vector search, and FastAPI serving while actively reducing hallucinations and latency through rigorous retrieval tuning and caching. Also experienced in orchestrating cloud data pipelines (Airflow, AWS Glue, Azure Data Factory) and partnering with non-technical business teams to deliver AI solutions like automated document review.

View profile
Asanti Mokwala - Junior Data & Insights Analyst specializing in BI, dashboards, and automation in Remote

Junior Data & Insights Analyst specializing in BI, dashboards, and automation

Remote3y exp
CanvaSan José State University

Worked on taking an LLM-based system at Soundmakr from prototype to production by adding prompt constraints, validation/guardrails, deterministic ranking, and robust logging/monitoring with feedback loops. Also partnered with product/marketing during an internship on Thea: Study Smart to analyze onboarding drop-offs and run A/B tests on AI-driven flows, translating results into actions that improved retention and conversion.

View profile
Shabari Vignesh - Mid-level Data Engineer specializing in cloud data platforms and AI agents in Santa Clara, CA

Mid-level Data Engineer specializing in cloud data platforms and AI agents

Santa Clara, CA6y exp
SwirepaySan José State University

Data/Backend engineer who has owned end-to-end merchant analytics systems on AWS: orchestrated multi-source ingestion (FISERV/Shopify/Clover) with Step Functions/Lambda, enforced strong data quality gates, and served curated datasets via Redshift and a FastAPI layer. Also built an early-stage Merchant Insights AI agent that converts natural language questions into SQL using OpenAI models, with full CI/CD and observability.

View profile
RC

RIYA CHADDHA

Screened

Mid-level Data Engineer and Business Analyst specializing in cloud ETL and analytics

Remote, US5y exp
MellicellNortheastern University

Data analyst with cross-industry experience spanning insurance analytics at L&T Infotech and experimental imaging analytics at Mylyser. Stands out for building scalable SQL/PySpark data pipelines, standardizing business-critical metrics like claims lifecycle and policy retention, and delivering measurable impact such as 50%+ faster query performance and a 15% reduction in claims settlement time.

View profile
AG

Mid-level AI/ML Engineer specializing in MLOps and cloud-deployed ML systems

Austin, TX3y exp
PurevisitxUniversity of Illinois Springfield

ML/AI engineer who built and productionized an NLP system at PurevisitX, orchestrating end-to-end ML workflows with Airflow (S3 ingestion through auto-retraining) and optimizing for drift and low-latency inference. Also partnered with Citibank risk teams on a fraud detection model, translating results via dashboards and iterating thresholds based on stakeholder feedback.

View profile
GD

Mid-level GenAI/ML Engineer specializing in LLM systems and RAG chatbots

Houston, TX3y exp
University of HoustonUniversity of Houston

Built and shipped a production agentic LLM analytics platform that lets non-SQL business users query relational databases in plain English via a RAG + LangChain/LangGraph workflow and FastAPI service. Emphasizes safety and reliability with guardrails (validation/access control), testing/evaluation frameworks, and performance optimization (caching, monitoring, Dockerized scalable deployment), reducing dependency on data teams and speeding analytics turnaround.

View profile
Andrew Clayman - Senior Data Scientist specializing in ML, NLP, and production AI systems in Remote

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote8y exp
AppstemUniversity of Southampton

Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.

View profile
NM

Mid-level Machine Learning Engineer specializing in cloud-native generative AI for healthcare

Seattle, WA4y exp
Cleveland ClinicUniversity of the Cumberlands

AI engineer at Cleveland Clinic building production LLM/NLP systems for radiology documentation, focused on HIPAA-aware, real-time performance across ~298 campuses. Re-architected infrastructure with AWS event-driven services to handle scaling and improved SLA compliance ~40%, and complements this with a personal multi-agent debate system (CrewAI) using local Llama/Mistral plus rigorous evaluation (A/B tests, red teaming, observability).

View profile
TK

Mid-level AI/ML Engineer specializing in healthcare imaging and GenAI/LLM systems

New York, USA6y exp
UnitedHealthcareAuburn University at Montgomery

Built and deployed a production LLM/RAG clinical document understanding and summarization system for healthcare, focused on reducing manual review time while meeting strict accuracy, latency, and compliance needs. Demonstrates strong MLOps/orchestration depth (Airflow, Kubernetes, Azure ML Pipelines) and a rigorous approach to hallucination mitigation through layered, source-grounded safeguards and stakeholder-driven requirements with physicians/compliance teams.

View profile
PA

Mid-level Automation Developer specializing in RPA, test automation, and data/ETL pipelines

Riverwoods, IL5y exp
DiscoverUniversity of South Alabama

Python backend engineer who owned an end-to-end Django/DRF authentication and account-management module (JWT, RBAC, email verification) and optimized token validation performance. Has hands-on Kubernetes + Helm delivery with GitOps via ArgoCD (multi-environment app-of-apps, drift detection/rollback) and has supported a cloud-to-on-prem migration using staged testing and phased cutover. Also built and scaled a Kafka-based real-time user activity tracking pipeline with reliability and backpressure controls.

View profile
SB

Mid-level AI/ML & Data Engineer specializing in MLOps and cloud data pipelines

Remote, USA4y exp
MerkleUniversity of North Carolina at Charlotte

AI/ML engineer (Merkle) with hands-on experience deploying RAG-based LLM applications and real-time recommendation engines into production. Strong in cloud/on-prem architectures, GPU autoscaling, caching, and network optimization—delivered measurable latency reductions (40–70%) and improved retrieval relevance by systematically benchmarking chunking/embedding configurations and validating pipelines via CI/CD.

View profile

Need someone specific?

AI Search