Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Sai Dinesh Pusapati - Senior AI/ML Engineer specializing in GenAI agents and LLM workflows in San Francisco, CA

Senior AI/ML Engineer specializing in GenAI agents and LLM workflows

San Francisco, CA6y exp
Scale AIBelhaven University

LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.

View profile
Akhil Kunala - Mid-level Software Engineer specializing in backend systems and cloud-native FinTech in Seattle, WA

Akhil Kunala

Screened

Mid-level Software Engineer specializing in backend systems and cloud-native FinTech

Seattle, WA5y exp
AmazonUniversity of North Texas

Amazon engineer with 5+ years of experience who built an AI-assisted log investigation and triage workflow that cut debugging time by about 30% during on-call incidents. Combines observability tooling like CloudWatch and Splunk with Python, prompt engineering, and RAG-based diagnostics, and has practical experience orchestrating agentic AI workflows with a strong human-in-the-loop reliability focus.

View profile
PK

Junior Software Engineer specializing in full-stack systems and distributed log analytics

Miami, FL1y exp
NeocisCarnegie Mellon University

CMU candidate with hands-on experience taking LLM concepts from research prototypes toward production-ready designs (structured outputs, guardrails, failure-scenario evaluation). Also partnered with sales/customer teams at Mazecare to drive adoption with Dontia Alliance (largest dental clinic chain in Singapore) and engaged Singapore government stakeholders, bridging clinical workflow needs with IT security/integration concerns.

View profile
ML

Ming-Kai Liu

Screened

Junior AI Engineer specializing in LLM pipelines, RAG, and computer vision

Raleigh, NC2y exp
Citrus OncologyUC San Diego

Built and deployed an on-prem, HIPAA-compliant LLM pipeline for oncology-focused clinical note generation and decision support, emphasizing grounded differential diagnosis and explainable reasoning via RAG to reduce hallucinations. Also created a LangGraph-based multi-agent academic paper search system integrating Tavily, arXiv, and Semantic Scholar with an orchestrator that routes tasks to specialized sub-agents.

View profile
SG

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps for healthcare and finance

6y exp
CVS HealthUniversity of New Haven

Built a production LLM-powered RAG agent for healthcare/insurance operations that retrieves and summarizes patient medical documents with grounded citations, scaling to ~4.5M records. Addressed medical shorthand and terminology by fine-tuning ~120 lightweight DistilBERT models by specialty and validating entities against SNOMED/RxNorm, while using SHAP/LIME and human-in-the-loop review to make decisions explainable to stakeholders.

View profile
CS

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA5y exp
Aurora AIUniversity of Chicago

ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.

View profile
HC

Intern Software Engineer specializing in ML/NLP and LLM applications

Boulder, CO0y exp
SplunkUniversity of Colorado Boulder

Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.

View profile
JS

Mid-Level Software Engineer specializing in full-stack systems and developer tooling

Austin, TX3y exp
AppleCollege of the Sequoias

Built and productionized an AI extension for JetBrains IDEs providing coding assistance, testing, security sweeps, and documentation generation using both an internal LLM and third-party models (e.g., Gemini, Claude). Experienced in diagnosing customer issues in real time (Slack) with structured follow-through (GitHub Issues) and driving adoption through developer-oriented walkthroughs and video demos.

View profile
RM

Rakesh Munaga

Screened

Mid-level Full-Stack Engineer specializing in AI and FinTech platforms

TX, USA4y exp
JPMorgan ChaseUniversity of Texas at Arlington

Full-stack engineer building real-time internal banking operations dashboards (Java/Spring Boot microservices + React/TypeScript) with Kafka-based streaming and post-launch performance optimizations. Also shipped a production internal AI support assistant using RAG (Confluence/PDF/support docs ingestion, embeddings + vector DB retrieval) with guardrails, evaluation loops, and observability to reduce hallucinations and prevent regressions.

View profile
Alex ZhuZhou - Intern Full-Stack Software Engineer specializing in AI/LLM platforms and data systems in Berkeley, CA

Alex ZhuZhou

Screened

Intern Full-Stack Software Engineer specializing in AI/LLM platforms and data systems

Berkeley, CA2y exp
EmbraerUC Davis

Backend/LLM engineer with experience productionizing RAG systems (legal-case natural language querying) and optimizing for latency/cost, including a reported ~40% reduction via Redis caching and batching. Built monitoring and real-time debugging workflows (FastAPI, structured logging, correlation IDs, sandbox repro) and regularly delivered technical demos/workshops. Also partners with BD/sales to translate LLM capabilities into business value, including ESG-metric extraction from corporate filings.

View profile
Vamshikrishna Bandi - Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

6y exp
PayPalTrine University

Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.

View profile
Kunal Singh Pundir - Mid-level Full-Stack Developer specializing in cloud microservices and GenAI systems in USA, USA

Mid-level Full-Stack Developer specializing in cloud microservices and GenAI systems

USA, USA5y exp
UberNortheastern University

Built and owned an end-to-end AI-driven decisioning platform at Uber, combining LLM orchestration with typed tool contracts and a Snowflake-based RAG pipeline to make decisions fully auditable. Delivered large-scale measurable impact (120k requests/day, 18k cases auto-resolved/month) while improving ops SLA from 3 days to 6 hours and cutting incident response time nearly in half. Previously led a high-risk strangler-fig modernization of a legacy insurance platform across 120+ microsites at Accenture, coordinating across multiple squads with feature-flagged parallel cutovers.

View profile
Vedant Kharwal - Intern AI/ML Engineer specializing in Generative AI and applied machine learning in Mumbai, India

Intern AI/ML Engineer specializing in Generative AI and applied machine learning

Mumbai, India1y exp
LTIMindtreeBoston University

New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.

View profile
YP

Mid-level Software Engineer specializing in backend, distributed systems, and AI infrastructure

Menlo Park, CA4y exp
SnowflakeUSC

Built Baioniq, an enterprise LLM platform for automating extraction from massive unstructured documents like contracts and insurance claims. They demonstrate unusually strong production depth in agentic AI—scaling to 100k+ requests/day, processing 1M+ claim documents, and improving extraction accuracy through rigorous RAG architecture, evaluation, and fallback design.

View profile
MS

Manvir Singh

Screened

Senior Full-Stack & Mobile Software Engineer specializing in cloud-based applications

Englewood, NJ10y exp
Cobalt BrandsUniversity of Washington

Data/ML backend engineer with hands-on production experience spanning RAG services (LlamaIndex/OpenAI) and AWS data platforms. Has delivered Terraform-managed AWS architectures (Lambda + ECS Fargate) with secure secrets handling, built Glue-to-Redshift ETL with schema evolution controls, modernized SAS reporting into Python microservices, and achieved major Redshift query speedups (2+ hours to under 15 minutes).

View profile
LK

Lekha Karanam

Screened

Mid-level AI/Analytics Product & Data Professional specializing in LLM and dashboard automation

Dallas, TX3y exp
Goldman SachsUniversity of Texas at Dallas

Built and shipped open-source LLM/RAG systems, including a generative AI assistant grounded on ~30,000 scraped university web pages, improving response accuracy ~30% by moving from TF-IDF-only retrieval to a hybrid sentence-transformer approach with fallback controls. Also partnered with non-technical leadership at Securi.ai to deliver real-time predictive analytics dashboards (Elasticsearch + Jira/ServiceNow) that reduced project overhead by 18%.

View profile
PN

Mid-level Full-Stack Engineer specializing in scalable APIs, cloud infrastructure, and GenAI apps

San Francisco, CA6y exp
DoorDashCal State Chico

Backend/platform engineer with experience across edtech, logistics, and AWS internal systems—owned a production course recommender end-to-end (model serving + APIs + caching/observability), delivering +30% CTR and -20% latency. Has scaled real-time delivery visibility/rerouting on Kubernetes/EKS to sub-200ms P95 during demand spikes and built billion-events/day telemetry pipelines on AWS (Kinesis Firehose, Lambda, S3, Redshift) with schema evolution, dedupe, and replay support.

View profile
AK

Akshay Koneti

Screened

Mid-Level Full-Stack Software Engineer specializing in AWS cloud and microservices

Dallas, TX6y exp
AmazonUniversity of North Texas

Backend/LLM engineer who built a production-critical Amazon Bedrock + RAG correction and compliance layer for employee communications, integrating tightly with existing Spring Boot/AWS microservices to reduce manual review while keeping outputs explainable and auditable. Also designed an event-driven system processing 10M+ events/day (SQS/Lambda/DynamoDB/Elasticsearch) and handled on-call incidents with strong observability and reliability patterns (idempotency, retries, hotspot mitigation).

View profile
RK

Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems

NJ, USA4y exp
Scale AIRowan University

Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.

View profile
Vidhi Upadhyay - Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems in Remote

Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems

Remote8y exp
Saayam for AllCarnegie Mellon University

Independently built a production-grade, containerized enterprise agentic AI platform (stateful orchestration + RAG) focused on real-world reliability—guardrails, citation-based outputs, reranking, query rewriting, and evaluation harnesses to reduce hallucinations. Hands-on with OpenAI SDK, CrewAI, and LangGraph, and has delivered AI solutions for non-technical NGO stakeholders via demos and practical POCs.

View profile
Deepika Gotla - Senior Technical Support Engineer specializing in Azure Cloud & Generative AI in Bellevue, WA

Deepika Gotla

Screened

Senior Technical Support Engineer specializing in Azure Cloud & Generative AI

Bellevue, WA7y exp
MicrosoftSUNY New Paltz

Microsoft cloud/infra engineer with 5+ years supporting enterprise Azure environments, specializing in security-focused networking (private endpoints, DNS) and production troubleshooting across Azure Front Door/App Gateway WAF/AKS. Has implemented posture improvements via Defender for Cloud, Azure Policy, and RBAC tightening, and also designs secure AWS agent/scanner integrations and modern EKS/GitHub Actions/Secrets Manager observability-enabled SDK rollouts.

View profile
Niyaz Nurbhasha - Mid-level Machine Learning Engineer specializing in computer vision and LLM pipelines

Mid-level Machine Learning Engineer specializing in computer vision and LLM pipelines

4y exp
BlueHaloDuke University

ML/LLM engineer who built production systems to speed up artist content-creation workflows, including a fine-tuned image captioning model paired with a RAG layer over image embeddings/captions to improve consistency across changing domains. Experienced orchestrating multi-tool agents with LangChain/LangGraph (planning + critic/reflection) and setting up practical monitoring (caption rejection rate) plus evaluation sets for tool-calling accuracy, output quality, and latency.

View profile

Need someone specific?

AI Search