Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Amazon EMR Professionals

Pre-screened and vetted.

Amazon EMR Python Amazon S3 SQL Docker CI/CD

Utkarsh Joshi

Screened

Senior Data Scientist specializing in ML, NLP, and GenAI analytics

Remote, US7y exp

University of MinnesotaUniversity of Minnesota

“Built and deployed an LLM-powered analytics assistant enabling business users to ask questions in plain English and receive validated Spark SQL executed in Databricks, with a Streamlit/Flask UI. Addressed strict client schema-privacy constraints by implementing a RAG strategy and ultimately leveraging AWS Bedrock and fine-tuned reference docs. Also has production ML pipeline experience using Docker + Airflow and AWS (S3/ECS/EC2) for financial classification models.”

Python Pandas NumPy Scikit-learn R SQL+107

View profile

Harini Kv

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Dallas, TX7y exp

EquinixFitchburg State University

“GenAI/data engineering practitioner with production experience across Equinix, Optum, and Citibank—built an Azure OpenAI (GPT-4) + LangChain document intelligence platform processing 1.5M+ docs/month and a HIPAA-compliant Airflow healthcare pipeline handling 5M+ claims/day. Also delivered a real-time fraud detection + explainability system using LightGBM and a fine-tuned T5 NLG component, improving fraud accuracy by 15%+ while partnering closely with compliance stakeholders.”

Python SQL PySpark Bash Java JavaScript+169

View profile

Pooja Dokuri

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and cloud MLOps

Remote, USA4y exp

UnitedHealth GroupEast Texas A&M University

“Built and deployed a production LLM + vector search clinical decision support system at UnitedHealth Group, retrieving medical evidence and patient context in real time for prior authorization and risk scoring. Strong in end-to-end RAG architecture (Hugging Face embeddings, Pinecone/FAISS, SageMaker, Redis) plus orchestration (Airflow/Kubeflow) and rigorous evaluation/monitoring, with demonstrated ability to align solutions with clinical operations stakeholders.”

Python Pandas NumPy PySpark Scikit-learn SQL+133

View profile

Samatha Amsala

Screened

Mid-level Data Engineer specializing in cloud data warehousing and analytics

Omaha, NE6y exp

American ExpressBellevue University

“Data engineer at American Express who owned end-to-end pipelines for transaction and customer data used in finance reporting and risk analytics, processing ~5–8M records/day. Built Airflow-orchestrated ingestion (including external APIs/web sources) with strong data quality controls, monitoring/alerts, and resilient backfill/retry patterns, and also shipped a versioned REST API serving aggregated metrics to analytics teams.”

Data Engineering Data Warehousing Analytics Fraud Detection ETL Data Validation+167

View profile

Adithya Chittajallu

Screened

Mid-level AI/ML Engineer specializing in LLM systems, MLOps, and Healthcare AI

Remote, USA5y exp

CVS HealthUniversity of Missouri-Kansas City

“Built and shipped a production-grade agentic RAG system at CVS Health for patient adherence and medication recommendations, processing 20k+ patient records/day. Strong focus on real-world reliability: hybrid retrieval tuned with re-ranking (<400ms latency), strict JSON/schema validation and tool guardrails, and monitoring/drift detection that reduced MTTD from 6 days to 18 hours while improving recommendation accuracy (+8%) and cutting escalations (~23%).”

Python SQL Bash Git PyTorch TensorFlow+107

View profile

Manish Challa

Screened

Mid-level AI/ML Engineer specializing in Generative AI and financial services

OR, USA5y exp

JPMorgan ChaseSeattle University

“ML/AI engineer with hands-on experience shipping regulated financial AI systems at JPMC and Capgemini, spanning credit risk, fraud detection, and generative AI assistants. Stands out for combining modern LLM/RAG architectures with strong MLOps, real-time infrastructure, and explainability/compliance practices, while delivering measurable business impact in latency, accuracy, cost, and risk reduction.”

Python SQL Java PyTorch TensorFlow Keras+134

View profile

Vasavi Mittapalli

Screened

Senior Data Scientist specializing in GenAI, LLMs and RAG

Dallas, TX5y exp

Texas InstrumentsTrine University

“Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.”

A/B Testing Agile Amazon DynamoDB Amazon EC2 Amazon EMR Amazon Kinesis+195

View profile

Anagha Rumade

Screened

Senior Applied AI/ML Engineer specializing in GenAI, LLMs, RAG and agents

Palo Alto, California9y exp

JPMorgan ChaseStevens Institute of Technology

“Applied AI/ML Engineer at JPMorgan Chase who led a banker-facing LLM chatbot from an OpenAI-API POC to a production RAG workflow, including hallucination mitigation, automated evaluation in SageMaker, and operational monitoring with Dynatrace. Also delivers external technical education—hosted a hands-on Grace Hopper Celebration 2025 workshop teaching LangChain/LangGraph agentic workflows.”

AWS AWS Lambda CI/CD Compliance Data Analysis Data Ingestion+58

View profile

Bernard Griffin

Screened

Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI

Baltimore, MD17y exp

IntelIllinois Institute of Technology

“ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness—data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).”

A/B Testing Amazon Athena Amazon Bedrock Amazon EC2 Amazon EMR Amazon Kinesis+130

View profile

Yu Liu

Screened

Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms

New York, NY17y exp

CitigroupUniversity of Missouri

“Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.”

Python JavaScript Shell Scripting SQL MySQL PostgreSQL+110

View profile

Nafeezuddin Mohammed

Screened

Mid-level Data Engineer specializing in Analytics & AI/ML

Virginia, USA6y exp

SonyFitchburg State University

“Data engineer with experience at Sony and Walmart building high-volume, near-real-time analytics and ingestion systems. Has owned end-to-end pipelines from Kafka/Spark streaming through S3/Parquet and Redshift/Looker, emphasizing data quality (Great Expectations), observability (CloudWatch/Azure Monitor), and reliability (Airflow SLAs, retries, checkpointing), including measurable performance and latency improvements.”

Agile Amazon Athena Amazon CloudWatch Amazon EMR Amazon Redshift Amazon S3+124

View profile

Bhavya Sree Ganja

Screened

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA8y exp

First National BankTexas A&M University-Corpus Christi

“Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.”

Agile Amazon API Gateway Amazon Athena Amazon CloudWatch Amazon DynamoDB Amazon EC2+210

View profile

Ajay Madhusudhan Thumala

Screened

Junior Software Engineer specializing in data engineering and LLM applications

Irvine, CA1y exp

GeisingerUC Irvine

“Computer science engineer and master’s graduate who independently built a mechatronics-heavy capstone prototype: a smartphone concept for deafblind users using micro-actuator arrays for braille reading. Also has platform engineering experience at Quantiphi, deploying webhooks to Kubernetes and implementing GitOps-based CI/CD using AWS CodeCommit/CodeBuild and ECR.”

API Development API Gateway AWS Bash C C+++206

View profile

Deepthi Mundarinti

Screened

Mid-level Data Engineer specializing in real-time analytics and regulated domains

NC, USA5y exp

JPMorgan ChaseSaint Louis University

“Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable gains in latency, fraud accuracy, false positives, and analyst decision speed.”

Python NumPy Pandas PySpark Scikit-learn TensorFlow+120

View profile

Rajeev Sai Nitturu

Screened

Mid-level Software Engineer specializing in cloud-native backend and AI systems

Long Beach, CA4y exp

JPMorgan ChaseCalifornia State University, Long Beach

“Candidate takes a disciplined, developer-in-the-loop approach to AI-assisted coding, using AI primarily for brainstorming, suggestions, and optimization while retaining full ownership of architecture and final code decisions. They also actively stay current on AI developments through research papers, communities, and emerging tools.”

Java Python TypeScript JavaScript SQL Data Structures & Algorithms+113

View profile

Bhuvan Chandi

Screened

Mid-level Data Engineer specializing in AI/ML data platforms

NY, NY6y exp

BlackRockWebster University

“Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.”

Python SQL Shell Scripting Apache Spark PySpark Apache Hadoop+103

View profile

Sravani Kasaraneni

Screened

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA4y exp

ServiceNowRivier University

“Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.”

SDLC Agile Waterfall Python R Java+104

View profile

Shanmukh Sai Madhu

Screened

Mid-level Data Engineer specializing in real-time pipelines and cloud analytics

Chicago, IL5y exp

JPMorgan ChaseUniversity of South Dakota

“Researcher from the University of South Dakota who built a production medical RAG system to help interpret model predictions by retrieving relevant clinical notes and medical literature, overcoming retrieval accuracy and imaging-dataset challenges through semantic chunking and metadata-driven indexing. Also has hands-on orchestration experience with Airflow and Azure Data Factory, plus a pragmatic approach to LLM evaluation and stakeholder-driven iteration.”

Agile Amazon EMR Apache Airflow Apache Kafka Apache Spark AWS+122

View profile

Avinash Pancheneni

Screened

Mid-level Machine Learning Engineer specializing in fraud detection and LLM applications

Charlotte, NC5y exp

Bank of AmericaUniversity of North Carolina at Charlotte

“Unreal Engine UI engineer focused on scalable, production-ready UI architecture (C++/Slate/UMG/CommonUI) with strong designer enablement via decoupled, interface-driven patterns and MVVM. Demonstrated measurable performance wins: replaced 200+ per-frame Blueprint bindings to cut UI prepass/paint from 4.2ms to 0.5ms and reduced VRAM by ~120MB using texture streaming proxies.”

Machine Learning Artificial Intelligence Supervised Learning Unsupervised Learning Predictive Modeling Fraud Detection+119

View profile

Prasannakumar B Vardi

Screened

Senior Software Engineer specializing in low-latency ad targeting and distributed backend systems

Santa Clara, CA9y exp

CardlyticsStony Brook University

“Backend/platform engineer who built a high-scale audience segmentation and real-time targeting system using Spark/Glue + S3/Hudi and low-latency API services backed by Redis/relational stores. Demonstrates strong production rigor: Spark performance tuning to eliminate OOM failures, API idempotency/caching to cut p95 latency ~40%, and careful dual-run/feature-flag migrations with reconciliation and rollback runbooks. Experienced implementing layered security with JWT/OAuth, RBAC/ABAC, and database row-level security to prevent privilege escalation.”

Java Python Go .NET C#Scala+114

View profile

Arjun Sharma

Screened

Staff Data Scientist specializing in AI/ML engineering and MLOps

Austin, TX10y exp

AccentureTexas State University

“ML/NLP engineer with experience at Flatiron Health building a production NLP platform that processed millions of clinical notes, using BERT/BiLSTM-CRF and spaCy to extract and normalize entities from noisy EMR text with oncologist-in-the-loop validation. Also built scalable retail ML workflows (Spark + Kubernetes + feature store caching) and applied vector databases plus contrastive-learning fine-tuning to improve retrieval relevance and recommendations.”

Python SQL Java Scala PyTorch TensorFlow+122

View profile

Radhika Fichadia

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices and FinTech

Jersey City, USA4y exp

JPMorgan ChaseSyracuse University

“Backend/DevOps-focused engineer with healthcare and financial systems experience, including an ICU readmission risk platform delivering real-time ML scores via a secure FastAPI service (PyTorch model serving, PostgreSQL, Celery/Redis) deployed on AWS with strong observability. Has hands-on Kubernetes GitOps delivery (Helm, ArgoCD, HPA) and has supported a JPMC on-prem-to-AWS microservices migration using phased validation and blue-green cutovers, plus Kafka/Avro streaming for real-time transaction processing.”

Python Java JavaScript C C++PL/SQL+134

View profile

Pandari G

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

San Francisco, USA5y exp

SephoraSaint Mary's College of California

“GenAI/LLM engineer with production deployments in both fintech and retail: built an AI-powered mortgage document analysis/automated underwriting pipeline at Fannie Mae (OCR + custom LLM) cutting underwriting review from 3–4 hours to under an hour with privacy-by-design controls. Also helped build Sephora’s GenAI product advisory bot using LangChain-orchestrated RAG (Azure GPT-4, Azure AI Search, MySQL HeatWave vector search), focusing on grounding, evaluation, and compliance-aware architecture choices.”

Python SQL R PySpark PowerShell Generative AI+158

View profile

Prem Kumar

Screened

Senior Data Engineer specializing in cloud data platforms and regulated analytics

McLean, VA6y exp

Capital OneRowan University

“Data engineer at Capital One building AWS-based real-time and batch pipelines and backend data services for financial/fraud use cases. Has owned end-to-end pipelines processing millions of records/day, implemented dbt/Great Expectations quality gates, and tuned Redshift/Snowflake workloads (cutting query latency ~22–25% and reducing pipeline failures ~30–40%) while supporting 15+ downstream consumers.”

Python SQL PySpark Scala Java Bash+152

View profile

Data Engineers Machine Learning Engineers Software Engineers Data Scientists Data Analysts Software Development Engineers Data & Analytics Engineering AI & Machine Learning Education

Need someone specific?

AI Search

Related

Need someone specific?