Vetted Amazon SageMaker Professionals

Pre-screened and vetted.

VS

Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps

5y exp
Capital One | University of the Cumberlands

AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.

Prachi Jain

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

Remote, US | 6y exp
JPMorgan Chase | University of Massachusetts Amherst

Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.

NV

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States | 1y exp
WorkUp | USC

ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).

Shanmukha Koganti

Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision

Bay Area, CA | 6y exp
Shopify | University of North Texas

ML/AI engineer with production experience at Shopify and Intel, building a deep learning product ranking system that lifted add-to-cart ~14% and serving real-time similarity search via FAISS+Redis under <20ms latency at massive scale. Also deployed computer vision models to 100+ retail edge locations using Docker/Ansible/k3s with zero-downtime rollouts, and applies strong MLOps practices (A/B testing, canary/shadow, observability) plus performance optimization (OpenVINO, INT8).

Nagarjuna Vaddineni

Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and data pipelines

Seattle, WA | 6y exp
Amazon | Texas A&M University-Kingsville

Amazon backend engineer who built and operated high-scale Java Spring Boot microservices on AWS (EKS/EC2) handling millions of daily transactions, with deep experience debugging p95 latency and database/ORM bottlenecks. Shipped an AI-driven real-time personalization feature by integrating SageMaker model inference end-to-end with low-latency caching and graceful fallbacks, and designed robust order/payment orchestration with retries, compensations, and DLQ-based escalation.

Amit Sharma

Screened

Principal Software Engineer specializing in AI/LLM platforms, payments, and healthcare systems

San Francisco, CA | 25y exp
Fambot | University of Delhi

Engineering player-coach who recently shipped an agent-based workflow to extract key info from unstructured web data (browser agents + CDP) and populate daily digests/calendars, owning architecture through testing. Also built a Flask-based LLM evaluation and regression testing system using G-Eval/Confident AI dashboards, and applies a rigorous, research-driven approach to selecting third-party tools with stakeholder buy-in; has healthcare ops/onboarding workflow experience at Vivio Health.

Harsh Sanas

Screened

Intern-level Software Engineer specializing in GenAI, RAG, and backend systems

San Francisco, CA | 2y exp
Scale AI | USC

AI/LLM engineer focused on shipping production-grade agents that automate support, sales intake, and ERP-connected workflows. Stands out for combining strong orchestration and guardrails with measurable business outcomes, including 45% faster support handling, ~$1.2M annual savings, 18% higher customer satisfaction, and 99.5%+ reliability in production.

Kevin Cruz

Screened

Senior Gen AI Engineer specializing in agentic LLM systems

Tempe, AZ | 15y exp
Opendoor | USC

Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.

HY

Mid-level AI/ML Engineer specializing in telematics, embedded systems, and MLOps

Mossville, IL | 5y exp
Caterpillar | Georgia Tech

Built and deployed a retail customer review intelligence platform by fine-tuning BERT for sentiment/topic extraction and pairing it with a recommendation component. Demonstrates strong production ML rigor (error analysis, relabeling/active sampling, thresholding/guardrails, OOD checks) and AWS-based orchestration at scale (Lambda + SageMaker with batching and concurrency controls), plus proven ability to align non-technical stakeholders on measurable outcomes.

SG

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps for healthcare and finance

6y exp
CVS Health | University of New Haven

Built a production LLM-powered RAG agent for healthcare/insurance operations that retrieves and summarizes patient medical documents with grounded citations, scaling to ~4.5M records. Addressed medical shorthand and terminology by fine-tuning ~120 lightweight DistilBERT models by specialty and validating entities against SNOMED/RxNorm, while using SHAP/LIME and human-in-the-loop review to make decisions explainable to stakeholders.

Jeevan Aher

Screened

Junior AI Engineer specializing in fraud detection, credit risk, and LLMs in FinTech

Remote, USA | 3y exp
JPMorgan Chase | University of Illinois Urbana-Champaign

AI engineer with production experience building a high-accuracy (98%) fraud detection system operating at real-time latency (1–2s) over millions of transactions, using a multi-model pipeline approach to meet performance constraints. Also implemented Airflow-orchestrated workflows (DAGs, retries, alerts) to replace brittle cron scripts and is currently pursuing a master’s project on real-time ASL-to-text conversion.

Akhil Jaggari

Screened

Mid-level Full-Stack Software Engineer specializing in scalable web platforms and cloud microservices

CA, CA | 6y exp
Uber | University of North Texas

Backend engineer with fintech/real-estate lending domain experience (Berkadia) building Python/Flask services for indicative loan pricing across Fannie/Freddie workflows. Strong in scalable AWS architectures (S3, Lambda, SageMaker), database performance (PostgreSQL read replicas, indexing, pooling), and high-throughput optimizations (streaming exports, Redis caching) with measurable production impact.

Vamshikrishna Bandi

Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

6y exp
PayPal | Trine University

Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.

Praveen Nutulapati

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY | 6y exp
JPMorgan Chase | University of Central Missouri

Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.

Vasudha Prerepa

Mid-Level Java Full-Stack Developer specializing in cloud-native microservices

5y exp
BMO | Texas Tech University

QA/validation-focused engineer with experience at Meta testing an ML+LLM content classification/summarization system, including production-vs-test behavior gaps. Built automated E2E validation and drift monitoring (PSI, KL divergence, embedding cosine similarity) run daily/multiple times per day and gated via CI. Also implemented Jenkins-orchestrated Selenium/API test suites in Docker at Capgemini and partnered with a business analyst to convert business rules into automated AI-driven validation checks.

Sirisha Maddikunta

Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions

O'Fallon, MO | 6y exp
Mastercard | University of Texas at Arlington

Built and owned an end-to-end LLM-powered fraud investigation assistant that automated case summaries and risk analysis, cutting analyst investigation/documentation time by 40%. Stands out for translating RAG concepts into a production-grade internal platform with strong evaluation, monitoring, and reusable Python service architecture that improved both analyst trust and engineering velocity.

Victor Pirie

Screened

Senior AI/ML Engineer specializing in LLMs, NLP, and enterprise conversational AI

Des Moines, IA | 11y exp
AssistRx | Monash University

Built and owned a production conversational AI platform for a healthcare contact center, including RAG-based agent assist, hybrid retrieval, safety guardrails, and production monitoring. Stands out for combining LLM product delivery with strong operational rigor, driving a reported 25-30% improvement in handling time in a sensitive healthcare environment.

BM

Mid-level AI/ML Engineer specializing in fraud detection and recommendation systems

California, USA | 3y exp
PayPal | Florida Atlantic University

ML engineer with production experience at PayPal and Flipkart, owning high-scale systems across fraud detection, recommendations, and LLM tooling. Stands out for combining strong modeling judgment with practical platform engineering, delivering measurable impact like 22% fewer fraud false positives, 18% CTR lift, 40% less LLM manual review, and 30% faster redeployments.

Kiran Kumar

Screened

Mid-level Software Engineer specializing in Java microservices and GenAI automation

USA | 4y exp
Airbnb | Auburn University at Montgomery

Software engineer (4+ years) with hands-on production GenAI experience: built an AI incident triage assistant that summarizes production logs for on-call engineers and iterated it using real incident metrics (time-to-signal, triage duration). Also shipped a RAG-based customer support knowledge assistant using embeddings + vector retrieval with strong guardrails (relevance thresholds/abstain, sanitization, auditing) and a formal eval loop (500-query gold set) that drove measurable retrieval improvements.

Sowmya Baluvu

Screened

Mid-Level Software Engineer specializing in full-stack development and AWS

Santa Clara, California | 3y exp
Frugal Innovation Hub | Santa Clara University

Backend-focused Python engineer who built an end-to-end personalized chatbot service integrating Amazon Redshift context retrieval with Amazon Bedrock, including prompt construction and production-grade reliability controls. Strong platform experience deploying containerized services to Kubernetes with GitOps/ArgoCD, plus hands-on Kafka streaming and phased infrastructure migration execution.

Tanveer Nazir

Screened

Senior Cloud & DevOps Engineer specializing in enterprise cloud automation and Kubernetes

Remote, NY | 11y exp
Bank of America | College of Staten Island, CUNY

Infrastructure/DevOps engineer with primary ownership in enterprise Linux and AWS/Azure production environments (including financial systems). Built secure, repeatable CI/CD pipelines deploying containerized workloads to EKS/ECS and implemented Terraform/CloudFormation IaC with drift detection and rollback practices; lacks direct IBM Power/AIX/PowerHA experience.

RK

Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems

NJ, USA | 4y exp
Scale AI | Rowan University

Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.
