Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation Python Docker SQL AWS CI/CD

Sai Dinesh Pusapati

Screened

Senior AI/ML Engineer specializing in GenAI agents and LLM workflows

San Francisco, CA6y exp

Scale AIBelhaven University

“LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.”

Python TypeScript Java C#JavaScript R+103

View profile

Akhil Kunala

Screened

Mid-level Software Engineer specializing in backend systems and cloud-native FinTech

Seattle, WA5y exp

AmazonUniversity of North Texas

“Amazon engineer with 5+ years of experience who built an AI-assisted log investigation and triage workflow that cut debugging time by about 30% during on-call incidents. Combines observability tooling like CloudWatch and Splunk with Python, prompt engineering, and RAG-based diagnostics, and has practical experience orchestrating agentic AI workflows with a strong human-in-the-loop reliability focus.”

Java Python TypeScript JavaScript SQL Spring Boot+101

View profile

Piyush Kautkar

Screened

Junior Software Engineer specializing in full-stack systems and distributed log analytics

Miami, FL1y exp

NeocisCarnegie Mellon University

“CMU candidate with hands-on experience taking LLM concepts from research prototypes toward production-ready designs (structured outputs, guardrails, failure-scenario evaluation). Also partnered with sales/customer teams at Mazecare to drive adoption with Dontia Alliance (largest dental clinic chain in Singapore) and engaged Singapore government stakeholders, bridging clinical workflow needs with IT security/integration concerns.”

Agile Analytics Anomaly Detection Authentication AWS C+++190

View profile

Ming-Kai Liu

Screened

Junior AI Engineer specializing in LLM pipelines, RAG, and computer vision

Raleigh, NC2y exp

Citrus OncologyUC San Diego

“Built and deployed an on-prem, HIPAA-compliant LLM pipeline for oncology-focused clinical note generation and decision support, emphasizing grounded differential diagnosis and explainable reasoning via RAG to reduce hallucinations. Also created a LangGraph-based multi-agent academic paper search system integrating Tavily, arXiv, and Semantic Scholar with an orchestrator that routes tasks to specialized sub-agents.”

Linux C C++Python Java SQL+81

View profile

Svachuta Gollavilli

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps for healthcare and finance

6y exp

CVS HealthUniversity of New Haven

“Built a production LLM-powered RAG agent for healthcare/insurance operations that retrieves and summarizes patient medical documents with grounded citations, scaling to ~4.5M records. Addressed medical shorthand and terminology by fine-tuning ~120 lightweight DistilBERT models by specialty and validating entities against SNOMED/RxNorm, while using SHAP/LIME and human-in-the-loop review to make decisions explainable to stakeholders.”

A/B Testing Anomaly Detection API Testing AWS Glue AWS Lambda BERT+107

View profile

Cassandra Sullivan

Screened

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA5y exp

Aurora AIUniversity of Chicago

“ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.”

A/B Testing Automation Classification Dashboarding Data Cleaning Data Visualization+109

View profile

Harsh Chaudhari

Screened

Intern Software Engineer specializing in ML/NLP and LLM applications

Boulder, CO0y exp

SplunkUniversity of Colorado Boulder

“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”

C++Data Pipelines Data Preprocessing Docker Embeddings FAISS+70

View profile

Jeffrey Saelee

Screened

Mid-Level Software Engineer specializing in full-stack systems and developer tooling

Austin, TX3y exp

AppleCollege of the Sequoias

“Built and productionized an AI extension for JetBrains IDEs providing coding assistance, testing, security sweeps, and documentation generation using both an internal LLM and third-party models (e.g., Gemini, Claude). Experienced in diagnosing customer issues in real time (Slack) with structured follow-through (GitHub Issues) and driving adoption through developer-oriented walkthroughs and video demos.”

Agile AI Agents Angular AWS Bootstrap Docker+68

View profile

Rakesh Munaga

Screened

Mid-level Full-Stack Engineer specializing in AI and FinTech platforms

TX, USA4y exp

JPMorgan ChaseUniversity of Texas at Arlington

“Full-stack engineer building real-time internal banking operations dashboards (Java/Spring Boot microservices + React/TypeScript) with Kafka-based streaming and post-launch performance optimizations. Also shipped a production internal AI support assistant using RAG (Confluence/PDF/support docs ingestion, embeddings + vector DB retrieval) with guardrails, evaluation loops, and observability to reduce hallucinations and prevent regressions.”

AI Agents Amazon API Gateway Amazon CloudWatch Amazon EC2 Amazon RDS Amazon S3+132

View profile

Alex ZhuZhou

Screened

Intern Full-Stack Software Engineer specializing in AI/LLM platforms and data systems

Berkeley, CA2y exp

EmbraerUC Davis

“Backend/LLM engineer with experience productionizing RAG systems (legal-case natural language querying) and optimizing for latency/cost, including a reported ~40% reduction via Redis caching and batching. Built monitoring and real-time debugging workflows (FastAPI, structured logging, correlation IDs, sandbox repro) and regularly delivered technical demos/workshops. Also partners with BD/sales to translate LLM capabilities into business value, including ESG-metric extraction from corporate filings.”

Python TypeScript JavaScript Java Node.js SQL+78

View profile

Vamshikrishna Bandi

Screened

Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

6y exp

PayPalTrine University

“Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.”

A/B Testing Agile AWS Azure Machine Learning BigQuery Caching+138

View profile

Kunal Singh Pundir

Screened

Mid-level Full-Stack Developer specializing in cloud microservices and GenAI systems

USA, USA5y exp

UberNortheastern University

“Built and owned an end-to-end AI-driven decisioning platform at Uber, combining LLM orchestration with typed tool contracts and a Snowflake-based RAG pipeline to make decisions fully auditable. Delivered large-scale measurable impact (120k requests/day, 18k cases auto-resolved/month) while improving ops SLA from 3 days to 6 hours and cutting incident response time nearly in half. Previously led a high-risk strangler-fig modernization of a legacy insurance platform across 120+ microsites at Accenture, coordinating across multiple squads with feature-flagged parallel cutovers.”

C#Java .NET Flask Spring Boot Node.js+140

View profile

Vedant Kharwal

Screened

Intern AI/ML Engineer specializing in Generative AI and applied machine learning

Mumbai, India1y exp

LTIMindtreeBoston University

“New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.”

Agentic AI Algorithms Angular API Development Artificial Intelligence Authentication+93

View profile

Yashkumar Patel

Screened

Mid-level Software Engineer specializing in backend, distributed systems, and AI infrastructure

Menlo Park, CA4y exp

SnowflakeUSC

“Built Baioniq, an enterprise LLM platform for automating extraction from massive unstructured documents like contracts and insurance claims. They demonstrate unusually strong production depth in agentic AI—scaling to 100k+ requests/day, processing 1M+ claim documents, and improving extraction accuracy through rigorous RAG architecture, evaluation, and fallback design.”

C++Python C Java Go JavaScript+124

View profile

Manvir Singh

Screened

Senior Full-Stack & Mobile Software Engineer specializing in cloud-based applications

Englewood, NJ10y exp

Cobalt BrandsUniversity of Washington

“Data/ML backend engineer with hands-on production experience spanning RAG services (LlamaIndex/OpenAI) and AWS data platforms. Has delivered Terraform-managed AWS architectures (Lambda + ECS Fargate) with secure secrets handling, built Glue-to-Redshift ETL with schema evolution controls, modernized SAS reporting into Python microservices, and achieved major Redshift query speedups (2+ hours to under 15 minutes).”

React React Native TypeScript Next.js Redux Tailwind CSS+117

View profile

Lekha Karanam

Screened

Mid-level AI/Analytics Product & Data Professional specializing in LLM and dashboard automation

Dallas, TX3y exp

Goldman SachsUniversity of Texas at Dallas

“Built and shipped open-source LLM/RAG systems, including a generative AI assistant grounded on ~30,000 scraped university web pages, improving response accuracy ~30% by moving from TF-IDF-only retrieval to a hybrid sentence-transformer approach with fallback controls. Also partnered with non-technical leadership at Securi.ai to deliver real-time predictive analytics dashboards (Elasticsearch + Jira/ServiceNow) that reduced project overhead by 18%.”

Python SQL R Scikit-learn TensorFlow PyTorch+61

View profile

Prakash Nidhi Verma

Screened

Mid-level Full-Stack Engineer specializing in scalable APIs, cloud infrastructure, and GenAI apps

San Francisco, CA6y exp

DoorDashCal State Chico

“Backend/platform engineer with experience across edtech, logistics, and AWS internal systems—owned a production course recommender end-to-end (model serving + APIs + caching/observability), delivering +30% CTR and -20% latency. Has scaled real-time delivery visibility/rerouting on Kubernetes/EKS to sub-200ms P95 during demand spikes and built billion-events/day telemetry pipelines on AWS (Kinesis Firehose, Lambda, S3, Redshift) with schema evolution, dedupe, and replay support.”

JavaScript TypeScript Python Go C#React+119

View profile

Akshay Koneti

Screened

Mid-Level Full-Stack Software Engineer specializing in AWS cloud and microservices

Dallas, TX6y exp

AmazonUniversity of North Texas

“Backend/LLM engineer who built a production-critical Amazon Bedrock + RAG correction and compliance layer for employee communications, integrating tightly with existing Spring Boot/AWS microservices to reduce manual review while keeping outputs explainable and auditable. Also designed an event-driven system processing 10M+ events/day (SQS/Lambda/DynamoDB/Elasticsearch) and handled on-call incidents with strong observability and reliability patterns (idempotency, retries, hotspot mitigation).”

Java Python JavaScript TypeScript JSON XML+138

View profile

Raghav Konduri

Screened

Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems

NJ, USA4y exp

Scale AIRowan University

“Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.”

A/B Testing Agile Amazon CloudWatch Amazon EC2 Amazon EKS Amazon Kinesis+151

View profile

Vidhi Upadhyay

Screened

Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems

Remote8y exp

Saayam for AllCarnegie Mellon University

“Independently built a production-grade, containerized enterprise agentic AI platform (stateful orchestration + RAG) focused on real-world reliability—guardrails, citation-based outputs, reranking, query rewriting, and evaluation harnesses to reduce hallucinations. Hands-on with OpenAI SDK, CrewAI, and LangGraph, and has delivered AI solutions for non-technical NGO stakeholders via demos and practical POCs.”

Python C++SQL MySQL .NET Generative AI+150

View profile

Deepika Gotla

Screened

Senior Technical Support Engineer specializing in Azure Cloud & Generative AI

Bellevue, WA7y exp

MicrosoftSUNY New Paltz

“Microsoft cloud/infra engineer with 5+ years supporting enterprise Azure environments, specializing in security-focused networking (private endpoints, DNS) and production troubleshooting across Azure Front Door/App Gateway WAF/AKS. Has implemented posture improvements via Defender for Cloud, Azure Policy, and RBAC tightening, and also designs secure AWS agent/scanner integrations and modern EKS/GitHub Actions/Secrets Manager observability-enabled SDK rollouts.”

Azure DevOps Azure Machine Learning Bash ChatGPT CI/CD Cloud migration+145

View profile

Niyaz Nurbhasha

Screened

Mid-level Machine Learning Engineer specializing in computer vision and LLM pipelines

4y exp

BlueHaloDuke University

“ML/LLM engineer who built production systems to speed up artist content-creation workflows, including a fine-tuned image captioning model paired with a RAG layer over image embeddings/captions to improve consistency across changing domains. Experienced orchestrating multi-tool agents with LangChain/LangGraph (planning + critic/reflection) and setting up practical monitoring (caption rejection rate) plus evaluation sets for tool-calling accuracy, output quality, and latency.”

Python C++SQL JavaScript TypeScript PyTorch+75

View profile

Software Engineers Machine Learning Engineers Data Scientists AI Engineers Research Assistants Software Developers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?