Vetted Amazon SageMaker Professionals

Pre-screened and vetted.

VP

Mid-level Generative AI Engineer specializing in LLMs, RAG, and agentic systems

Houston, TX | 5y exp
Asuitech Solutions | University of Houston

Built a production "Mini RAG Assistant" for internal document Q&A, focusing on grounded answers (anti-hallucination), retrieval quality, and latency/cost optimization. Uses LangChain/LangGraph for orchestration and applies a metrics-driven evaluation loop (including reranking and semantic chunking improvements) while collaborating closely with product stakeholders.

Harsh Patel

Screened

Senior Data Scientist specializing in LLM applications, RAG systems, and production ML

New York, NY | 6y exp
Fulcrum Analytics | University of Maryland, Robert H. Smith School of Business

Senior Data Scientist in consulting who has built production RAG systems for insurance/annuity document search at large scale (100K+ PDF pages), emphasizing grounded answers, guardrails, and low-latency retrieval. Experienced in end-to-end MLOps for LLM apps—monitoring, evaluation sets, drift handling, and safe rollouts—and in orchestrating complex pipelines with Prefect/Airflow and deploying services on Kubernetes.

Dhrumi Patel

Screened

Mid-level Software Engineer specializing in Java/Spring Boot microservices

Boston, MA | 3y exp
IPSER LAB LLC | Northeastern University

Full-stack AI engineer who built Skillmatch AI, an LLM/RAG-based job matching platform using FastAPI microservices, Airflow-orchestrated async pipelines, and Pinecone vector search (sub-second retrieval across 50k+ vectors) deployed on GCP with autoscaling. Also partnered directly with a cancer researcher to automate SEER + PubMed-driven report generation via an AI pipeline, emphasizing rapid prototyping and outcome-focused communication.

Satya VM

Screened

Mid-level GenAI/Data Engineer specializing in LLMs, RAG systems, and fraud detection

Ruston, LA | 7y exp
Origin Bank | Osmania University

ML/NLP engineer with banking domain experience who built a GenAI-powered fraud detection and risk intelligence system at Origin Bank, combining RAG (LangChain + FAISS), fine-tuned BERT NER, and GPT-4/Sentence-BERT embeddings. Delivered measurable impact (25% higher fraud detection accuracy, 40% less manual review) and emphasizes production-grade pipelines on AWS SageMaker/Airflow with strong data validation and scalable PySpark processing.

Shreya Thakur

Screened

Mid-level Software Engineer specializing in Python backend and LLM/ML systems

New York, USA | 4y exp
Saayam for All | University at Buffalo

Backend/AI engineer who has shipped production LLM systems end-to-end, including an AI request-routing service (FastAPI + BART MNLI + OpenAI/Gemini) that improved accuracy ~25% after launch via eval-driven prompt/category iteration. Also built an enterprise document intelligence/RAG platform on Azure (Blob/SharePoint/Teams ingestion, OCR/NLP chunking, embeddings in Azure Cognitive Search) with PII guardrails (Presidio), confidence gating, and scalable event-driven pipelines handling millions of documents.

Vaishnavi M

Screened

Mid-level AI/ML Engineer specializing in MLOps and Generative AI

5y exp
Liberty Mutual | University of Maryland, Baltimore County

At Liberty Mutual, built a production underwriting decision assistant combining LLM reasoning with quantitative models and strong auditability. Implemented a claims-based response verification pipeline that cut hallucinations from 18% to 3% and materially improved user trust/validation scores. Experienced orchestrating ML/LLM workflows end-to-end with Airflow, Kubeflow Pipelines, and Jenkins, including SLA-focused pipeline hardening.

Jaykumar Kotiya

Mid-level Machine Learning & AI Engineer specializing in Generative AI, NLP, and MLOps

Boston, MA | 6y exp
CitiusTech | Northeastern University

Built and deployed production LLM systems for summarizing sensitive legal and financial documents, emphasizing GDPR-aligned privacy controls and scalable hybrid cloud architecture. Experienced with Kubernetes/Airflow orchestration and rigorous testing/monitoring practices, and has delivered measurable business impact (18% conversion lift) by translating AI outputs for non-technical marketing stakeholders.

Kunal Sanghvi

Screened

Mid-level Data Engineer specializing in AI, NLP, and LLM systems

USA | 3y exp
Unique Designs | Pace University

Built and deployed a production AI customer support chatbot at Unique Design Inc. using FastAPI, AWS, Docker, and retrieval-based grounding on internal documents. Stands out for hands-on ownership across discovery, deployment, incident debugging, and post-launch iteration, with a strong focus on making LLM systems reliable and safe in real business workflows.

Manisha M

Screened

Senior AI/ML Engineer specializing in Generative AI and MLOps

Hollywood, FL | 7y exp
First Commonwealth Bank | Jawaharlal Nehru Technological University

ML engineer with hands-on experience building banking AI systems end-to-end, including a customer-targeting model that improved campaign response rates by about 10%. Also shipped a RAG-based banking FAQ/support feature with safety guardrails and production optimizations around retrieval quality, latency, and cost, plus reusable Python services that reduced duplicate work for other engineers.

Hanif Lashari

Screened

Mid-level Data & Machine Learning Engineer specializing in anomaly detection and forecasting

Ames, IA | 3y exp
Mary Greeley Medical Center | Iowa State University

Built and productionized an agentic RAG assistant using Ollama + LangChain + MCP + ChromaDB to speed up and standardize access to operational knowledge from tickets and runbooks. Focused on real-world reliability: mitigated timeouts/latency with retries and concurrency limits, improved retrieval via chunking/embedding iteration, and reduced hallucinations through citation-grounding and confidence-based abstention. Also partnered with non-technical ops staff to deliver anomaly detection/monitoring by translating operational needs into model signals, thresholds, and alerting logic.

BK

Mid-level AI Engineer specializing in ML, NLP, and Generative AI

Atlanta, GA | 4y exp
CGI | University of New Haven

AI/LLM engineer with production experience building an LLM-powered investment recommendation system using RAG and chatbots, deployed via Docker/CI/CD and scaled on Kubernetes. Demonstrated measurable performance wins (sub-200ms latency) through QLoRA fine-tuning and TensorRT INT8/INT4 quantization, plus strong MLOps/orchestration background (Airflow ETL + scoring, MLflow monitoring) and stakeholder-facing delivery using demos and Tableau dashboards.

Mehul Parmar

Screened

Mid-level Data Scientist specializing in insurance, healthcare, and cloud analytics

Somerset, NJ | 4y exp
P&F Solutions | Long Island University

Built a production-style LLM document summarization/generation workflow that mitigates token limits and reduces hallucinations using semantic chunking, FAISS-based embedding retrieval (top-k via cosine similarity), and section-wise generation. Orchestrated the end-to-end pipeline with AWS Step Functions and aligned outputs with sales stakeholders through demos, visuals, and documentation.

BP

Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems

Fort Worth, Texas | 8y exp
Ingram Micro | University of North Texas

LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.

Kunal Patil

Screened

Senior Game Developer specializing in Unreal/Unity gameplay and graphics systems

Remote | 6y exp
Steel Wool Games | Rochester Institute of Technology

Unreal Engine gameplay programmer with shipped experience on Five Nights at Freddy’s (including Ruin), spanning end-to-end systems like save/load + checkpoints, math-heavy spline-based AI movement, and player movement tuning. Also implemented a networked PvP dash using Unreal’s prediction pipeline (FSavedMove_Character) with server-authoritative validation, and has demonstrated strong debugging under stress-test conditions.

Harsh Chauhan

Screened

Junior AI Engineer specializing in Generative AI, RAG, and NLP

Remote, US | 3y exp
Ticker | Indiana University Bloomington

AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.

Srinivasan Gomadam Ramesh

Mid-level AI/Data Engineer specializing in agentic AI and data platforms

Redmond, WA | 7y exp
Quadrant Technologies | University of Texas at Dallas

AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.

NH

Senior Full-Stack Engineer specializing in AI, cloud, data, and healthcare tech

Van Nuys, CA | 9y exp
SmartiStack | University of South Florida

Backend/data engineer with hands-on production experience across Python/Flask microservices and AWS serverless/data platforms (Lambda, DynamoDB, S3, Glue/PySpark). Demonstrated strong reliability and operations mindset (JWT/RBAC, retries/timeouts/circuit breakers, CloudWatch/SNS alerting) and measurable performance wins (SQL report runtime cut from 10 minutes to 30 seconds). Seeking ~$150k base and cannot travel for onsite meetings for the next 5–6 months due to family medical constraints.

VR

Mid-level Machine Learning Engineer specializing in LLMs and MLOps

5y exp
Device Thread | University at Buffalo

Founding-engineer-style full-stack and AI product builder who has shipped production conversational agents in hospitality and earlier helped build an AI sports analytics startup through acquisition. Stands out for combining React/TypeScript frontend ownership, FastAPI backend design, and sophisticated LLM/RAG/agent systems with strong production monitoring and clear business impact.

PT

Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps

New York, USA | 2y exp
University at Buffalo | University at Buffalo

Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.

GA

Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems

USA | 4y exp
CitiusTech | Northwest Missouri State University

Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.

Rahul Vemuri

Screened

Mid-level Data Engineer specializing in AI/ML, RAG systems, and cloud data pipelines

Malvern, PA | 4y exp
PQ Corporation | Penn State Great Valley School of Graduate Professional Studies

Built a production lead-generation system using AI agents that researches the internet for relevant leads and integrates RAG-based contact enrichment/shortlisting aligned to existing CRM data, enabling sales reps to focus more on selling. Also has hands-on AWS data orchestration experience (Glue, Step Functions) moving raw data into Redshift and evaluates agent performance with human-in-the-loop plus BLEU/perplexity metrics.

Nagendra Reddy Palugulla

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

Florida, United States | 4y exp
Community Dreams Foundation | University of Houston

Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.

Kundhana Paruchuru

Mid-level Data Scientist specializing in ML, LLM pipelines, and MLOps

Remote, USA | 3y exp
Heartland Community Network | Indiana University Bloomington

Built and deployed a production LLM-driven document understanding pipeline using LangChain/LangGraph, focusing on reliability via step-by-step prompting, validation checks, and monitoring. Also partnered with non-technical marketing stakeholders at Heartland Community Network to deliver an XGBoost targeting model surfaced in Power BI, improving campaign conversion by 12%.

Satish Kumar Reddy

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

Remote, NJ | 5y exp
Tungsten Automation | Pace University

Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.
