Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

MW

Senior Full-Stack AI Engineer specializing in Azure OpenAI and RAG/GraphRAG systems

Eagle Mountain, UT24y exp
GoEngineerBrigham Young University

Built GoEngineer’s first production AI systems, including an end-to-end RAG pipeline for SolidWorks technical support using Azure Blob Storage, Azure AI Search, and Azure OpenAI, plus an AI summarization feature adopted by sales/customer success. Strong in productionizing LLM workflows with evaluation harnesses (golden sets, LLM-as-judge, red teaming, shadow deploys) and Azure infrastructure integrations (Redis, Service Bus, App Insights), and has also implemented a custom MCP server for agentic monitoring.

View profile
SM

Sai Macherla

Screened

Mid-level Full-Stack Java Developer specializing in Healthcare and Financial Services AI

Rochester, MN4y exp
Mayo ClinicRowan University

Built and shipped production LLM/RAG systems at Mayo Clinic, including a conversational AI assistant for patient pre-consultation and a clinical-trial matching tool for doctors. Implemented HIPAA-compliant de-identification and guardrails, plus real-time feedback logging and fine-tuning that improved response accuracy by 15% and reduced admin workload by 25%.

View profile
KS

Mid-level Full-Stack Java Developer specializing in enterprise banking and healthcare systems

4y exp
JPMorgan ChaseAuburn University at Montgomery

Built and shipped a production LLM-powered customer support triage/resolution agent that automated ~60% of tickets, cutting response times from hours to seconds and improving first-response resolution by ~40%. Experienced designing multi-tenant, tenant-isolated agent architectures with RAG, schema-based tool calling/strict JSON validation, and strong reliability practices (guardrails, retries, fallbacks, monitoring), including safe integration with messy ERP-like data.

View profile
PK

Prem Kumar

Screened

Senior Data Engineer specializing in cloud data platforms and regulated analytics

McLean, VA6y exp
Capital OneRowan University

Data engineer at Capital One building AWS-based real-time and batch pipelines and backend data services for financial/fraud use cases. Has owned end-to-end pipelines processing millions of records/day, implemented dbt/Great Expectations quality gates, and tuned Redshift/Snowflake workloads (cutting query latency ~22–25% and reducing pipeline failures ~30–40%) while supporting 15+ downstream consumers.

View profile
AO

Alex Olson

Screened

Junior AI & Full-Stack Developer specializing in generative AI and web platforms

Remote1y exp
JerseySTEMBoston University

Recent graduate with internship experience at Bausch + Lomb building Copilot Studio HR chatbots that reduced HR time spent on repetitive inquiries. Strong focus on conversational flow design, prompt-based steering for predictability, and thorough technical/end-user documentation; also building a personal YouTube AI SEO analyzer.

View profile
Jayanti Lahoti - Junior Full-Stack Software Engineer specializing in AI and cloud-native systems in San Diego, USA

Junior Full-Stack Software Engineer specializing in AI and cloud-native systems

San Diego, USA2y exp
HPEUC San Diego

Backend/systems-oriented engineer focused on building production-constrained LLM agent workflows that automate repetitive operator tasks via intent/entity extraction, retrieval grounding, and structured action recommendations with human-in-the-loop review. Emphasizes reliability through deterministic orchestration, strict tool/function schemas, observability, and disciplined evaluation/feedback loops, with strong experience handling messy multi-service operational data and idempotent execution.

View profile
Dinesh Kumar Patibandla - Mid-level Machine Learning Engineer specializing in LLMs and RAG for finance and healthcare in Texas, USA

Mid-level Machine Learning Engineer specializing in LLMs and RAG for finance and healthcare

Texas, USA4y exp
Goldman SachsUniversity of North Texas

ML Engineer with recent Goldman Sachs experience building and deploying a production RAG/LLM assistant for summarization, drafting, and internal knowledge retrieval across financial, risk, and compliance documents. Designed for heavy regulatory constraints and scaled to 10,000+ concurrent users using Kubernetes-based orchestration, dynamic LLM routing, and rigorous testing (adversarial prompts, A/B tests, load simulations) with privacy controls like differential privacy.

View profile
Pavan Kumar Malasani - Mid-level AI/ML Engineer specializing in financial risk, fraud detection, and GenAI in Remote, USA

Mid-level AI/ML Engineer specializing in financial risk, fraud detection, and GenAI

Remote, USA4y exp
CitigroupUniversity of Colorado Boulder

GenAI/ML engineer in Citigroup’s finance environment who has deployed production RAG systems for investment banking under strict privacy and model-risk constraints. Built an internal-VPC Llama2 + Pinecone + LangChain solution with NER redaction and citation-based verification to prevent hallucinations, delivering major time savings, and also partnered with global finance executives to ship an AI early-warning indicator for treasury/liquidity risk.

View profile
HEMANTH KUMAR KOTTAPALLI - Mid-level Machine Learning Engineer specializing in GPU-accelerated LLMs and MLOps in GA, USA

Mid-level Machine Learning Engineer specializing in GPU-accelerated LLMs and MLOps

GA, USA4y exp
BlackRockMercer University

Built and deployed a production LLM-powered decision-support system for supply-chain planners that explains demand forecast changes using grounded retrieval from sales, promotion, inventory, and supplier data. Implemented strict anti-hallucination guardrails and latency optimizations, deployed as a real-time AWS API with monitoring, and reported ~15% forecast accuracy improvement and ~12% supply-chain risk reduction. Experienced orchestrating data/ML/LLM workflows with Airflow, LangChain/LangGraph-style patterns, and AWS Step Functions while partnering closely with non-technical business users via demos and example-based requirements.

View profile
Anishkumar Mahalingam Iyer - Intern Software Engineer specializing in AI/ML infrastructure and applied machine learning in Palo Alto, CA

Intern Software Engineer specializing in AI/ML infrastructure and applied machine learning

Palo Alto, CA2y exp
RivianUSC

Interned at Rivian where they built and deployed a production Whisper-based ASR + LLM real-time event labeling pipeline to help autonomous-vehicle engineers diagnose failures and route issues to triage teams. Also built a stateful multi-agent "Code Partner" developer assistant using LangGraph/LangChain (planner/router/coder/critique/tester) with evaluation, adversarial testing, and stakeholder-friendly communication practices.

View profile
Sudhan Louis - Director of Enterprise Architecture specializing in digital transformation, AI, and API strategy in Rolling Hills Estates, CA

Sudhan Louis

Screened

Director of Enterprise Architecture specializing in digital transformation, AI, and API strategy

Rolling Hills Estates, CA26y exp
HerbalifeBoston University

Hands-on architect/technology leader who builds prototypes (including Agentic AI wellness/biomarkers) and then scales teams to execute. Led a ~$400M global e-commerce transformation spanning 95 countries with active-active US/EU multi-region resilience, microservices/MFE (MACH), and strong security patterns (service mesh + API gateway + Ping Identity), plus modern data foundations (customer hub/MDM/Snowflake, data fabric/medallion).

View profile
Sachin Reddy Kunta - Mid-Level Backend Software Engineer specializing in payments, fraud systems, and AI agent infrastructure in San Francisco, CA

Mid-Level Backend Software Engineer specializing in payments, fraud systems, and AI agent infrastructure

San Francisco, CA3y exp
Saayam for AllNYU

Early-career engineer who owned an end-to-end objective assessment/coding contest platform at an edtech startup, using Postgres + S3 and Redis (queues + ZSET) to decouple and scale code submission processing with worker sandboxes. Also implemented idempotency controls and set up monitoring and CI/CD while the rest of the team focused on curriculum.

View profile
Qice Sun - Junior GenAI Software Engineer specializing in multimodal RAG and agentic workflows in Sunnyvale, CA

Qice Sun

Screened

Junior GenAI Software Engineer specializing in multimodal RAG and agentic workflows

Sunnyvale, CA2y exp
WalmartCalifornia State University, Fullerton

AI/LLM engineer with production experience building a multimodal RAG agent for Walmart driver support, combining hybrid retrieval (dense+BM25) and fine-tuned Llama 3 served via vLLM on Azure AKS to reach sub-second latency. Drove measurable impact (25% fewer escalations, 60% lower token costs, 33% lower storage costs) and also built Kafka-based microservices that cut batch runtime from 2 hours to 15 minutes and reduced DB load by 80%.

View profile
Pandari G - Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems in San Francisco, USA

Pandari G

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

San Francisco, USA5y exp
SephoraSaint Mary's College of California

GenAI/LLM engineer with production deployments in both fintech and retail: built an AI-powered mortgage document analysis/automated underwriting pipeline at Fannie Mae (OCR + custom LLM) cutting underwriting review from 3–4 hours to under an hour with privacy-by-design controls. Also helped build Sephora’s GenAI product advisory bot using LangChain-orchestrated RAG (Azure GPT-4, Azure AI Search, MySQL HeatWave vector search), focusing on grounding, evaluation, and compliance-aware architecture choices.

View profile
Shram Kadia - Mid-level Software Engineer specializing in backend systems, cloud-native apps, and AI platforms in Santa Clara, CA

Shram Kadia

Screened

Mid-level Software Engineer specializing in backend systems, cloud-native apps, and AI platforms

Santa Clara, CA4y exp
ServiceNowNorth Carolina State University

Backend/full-stack engineer who has owned production systems end-to-end, including a Dockerized Node.js/TypeScript probabilistic fault-tree analysis service for nuclear safety research deployed on AWS. Also built and operated a FastAPI-based RAG pipeline over 200+ PDFs using FAISS, focusing on low-latency, idempotent workflows and strong observability; experienced with API design and Playwright E2E automation across React/Angular projects.

View profile
JC

Jamie Cook

Screened

Senior Machine Learning Engineer specializing in AI search and recommendation systems

Plantation, FL8y exp
ChewyUniversity of Miami

Built internal production LLM tools for engineering and support, including a customer-health assistant and a RAG-based incident explainer grounded in logs, metrics, and deploy data. Stands out for combining strong GenAI safety/evaluation practices with pragmatic backend engineering, delivering measurable impact like a 40% drop in data-help requests and answers in seconds instead of minutes or hours.

View profile
PP

Pini Pur

Screened

Executive AI product and platform leader specializing in enterprise SaaS and applied AI

San Francisco, CA15y exp
TAU Ventures Innovation LabTel Aviv University

AI product leader working at TAU Ventures Innovation Lab and previously at Verint and NYSHEX, with a track record of turning legacy and ambiguous AI opportunities into shippable, human-centered products. Notably led Verint’s evolution into a low-code AI orchestration platform with 20+ capabilities and measurable gains in CSAT, handle time, and recurring revenue, while consistently emphasizing trust, workflow fit, and human-in-the-loop design.

View profile
Vasudev Konde - Mid-level Full-Stack Java Developer specializing in APIs and cloud microservices in Phoenix, AZ

Vasudev Konde

Screened

Mid-level Full-Stack Java Developer specializing in APIs and cloud microservices

Phoenix, AZ5y exp
American Express

AI/LLM engineer who has shipped a production document-intelligence agent that automated internal support workflows using RAG, tool calling, and robust fallback controls. Stands out for combining hands-on architecture with measurable business impact: 85% faster query resolution, 35% lower LLM cost, 40% fewer LLM calls, and enough automation to avoid adding 2-3 support engineers.

View profile
Harshal Sawant - Senior AI Engineer specializing in LLMs, RAG, and MLOps on multi-cloud

Senior AI Engineer specializing in LLMs, RAG, and MLOps on multi-cloud

8y exp
Wells Fargo

Built and productionized a secure internal RAG-based AI assistant (LangChain/FastAPI/FAISS on GCP), tackling real-world issues like latency, retrieval speed, and hallucinations—delivering 25% faster retrieval and 99.9% uptime. Also implemented scalable, reliable ML retraining orchestration with AWS Step Functions/SageMaker/Lambda and partners closely with compliance analysts to iteratively refine prompts and outputs to meet governance standards.

View profile
JO

Junior Data Infrastructure Software Engineer specializing in distributed pipelines and AI extraction

Irvine, CA1y exp
Tax Relief AdvocatesGeorgia Tech
View profile
CV

Mid-level Full-Stack Engineer specializing in backend systems, streaming, and GenAI

Florham Park, New Jersey4y exp
PrudentialNJIT
View profile
AG

Junior Software Engineer specializing in AI/ML and full-stack product development

Irvine, California1y exp
WalmartUC Irvine
View profile
GA

Mid-level AI/ML Engineer specializing in fraud detection and Generative AI

St. Louis, MO6y exp
PNCSoutheast Missouri State University
View profile
Pavan Sainath Atukuri - Senior Backend Software Engineer specializing in Supply Chain and Generative AI in Bentonville, AR

Senior Backend Software Engineer specializing in Supply Chain and Generative AI

Bentonville, AR6y exp
WalmartUniversity of Maryland, College Park
View profile

Need someone specific?

AI Search