Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Dylan Zhu - Mid-level Machine Learning Engineer specializing in computer vision and generative AI in Hoboken, NJ

Dylan Zhu

Screened

Mid-level Machine Learning Engineer specializing in computer vision and generative AI

Hoboken, NJ7y exp
Stevens Institute of TechnologyPurdue University

Built and deployed an LLM/RAG system that uses differential privacy and distributional similarity checks to transform private data into a non-sensitive knowledge base while preserving utility. Also has experience demonstrating adversarial ML concepts (FGSM) to non-technical audiences by focusing on observable model behavior rather than implementation details.

View profile
AD

Ananya Dandi

Screened

Junior Machine Learning Researcher specializing in knowledge distillation

College Park, MD1y exp
University of Maryland Department of Computer ScienceUniversity of Maryland, College Park

Built and shipped LLM-powered agents including a production RAG research assistant that cut research lookup time from ~20 minutes to ~10–20 seconds using caching, retrieval thresholds, and citation-enforced grounded answers. Also designed multi-step, tool-calling workflows with stateful critique/revision loops and pragmatic monitoring (retry/schema-failure/low-confidence signals) plus normalization/validation layers for messy notes/spreadsheet-style data.

View profile
SR

Mid-level Generative AI Engineer specializing in LLMs and enterprise AI

Texas, USA5y exp
PNCUniversity of Texas at Arlington

Built and owned an enterprise LLM/RAG document intelligence platform for PNC Financial Services in a compliance-heavy environment, focused on grounded answers over internal finance and policy documents. Stands out for combining GenAI product delivery with production engineering discipline, delivering 60% faster document review and materially better answer quality while creating reusable FastAPI-based AI services for multiple teams.

View profile
BK

Mid-level Full-Stack Developer specializing in cloud-native enterprise platforms

4y exp
CignaAuburn University at Montgomery

Built Nexthire-AI, shipping an end-to-end LLM-powered resume–job description matching product (React + Node.js) using embeddings and retrieval to generate match scores and skill-gap recommendations. Improved post-launch engagement by making feedback cleaner and more actionable, and added production guardrails (validation, timeouts, fallbacks) to handle messy resume formats and AI API instability.

View profile
Siva Harini Sri Janaki Raman - Mid-level Data Engineer specializing in cloud data platforms in Dallas, TX

Mid-level Data Engineer specializing in cloud data platforms

Dallas, TX3y exp
CVS HealthTexas Tech University

Built an AI-powered internal support assistant at CVS Health using GPT-4, LangChain, and Pinecone, applying RAG, validation, and monitoring to reduce repetitive support tickets while protecting sensitive healthcare data. Stands out for a pragmatic approach to AI engineering: using multi-agent and LLM workflows to accelerate development while keeping systems constrained, observable, and production-friendly.

View profile
VT

Mid-level AI/ML Engineer specializing in Generative AI and agentic systems

4y exp
WalmartUniversity of Central Missouri

Backend/platform engineer who has owned a Python/FastAPI results API and deployed it on Kubernetes with Helm and GitHub Actions-driven CI/CD. Demonstrates strong production operations mindset across performance tuning, monitoring, safe rollouts/rollbacks, and phased migrations, plus hands-on Kafka streaming experience focused on ordering and idempotency.

View profile
Anudeep Eloori - Mid-level Software Developer specializing in full-stack enterprise applications in USA

Mid-level Software Developer specializing in full-stack enterprise applications

USA3y exp
EpsilonUniversity of South Florida

Software engineer with experience building and iterating high-volume Spring Boot microservices on AWS (Docker/Kubernetes) and integrating with React front-ends. Also delivered an LLM-powered document summarization system using embeddings + retrieval (RAG) with grounding/guardrails and built evaluation loops that directly drove retrieval and chunking improvements. Has scaled Kafka-based pipelines processing millions of messy financial/infrastructure records with reliability and cost/latency tradeoff management.

View profile
Anuja Nayak - Junior Software Engineer specializing in distributed systems and reliability in Seattle, WA

Anuja Nayak

Screened

Junior Software Engineer specializing in distributed systems and reliability

Seattle, WA3y exp
OracleArizona State University

Oracle engineer focused on reliability and internal platform tooling, with hands-on experience automating regional traffic failover and building an LLM-assisted incident investigation workflow. Stands out for owning production-impacting systems end-to-end and delivering measurable operational gains, including cutting failover recovery to under five minutes and reducing incident triage from hours to minutes.

View profile
SK

Shirisha K

Screened

Mid Software Engineer specializing in backend microservices and FinTech systems

Illinois, USA4y exp
ServiceNowUniversity of Central Missouri

Full-stack engineer with experience shipping analytics dashboards and an AI-driven support assistant for a cloud analytics platform. They combine Java/Spring Boot backend work with TypeScript frontend development and showed practical knowledge of LLM production concerns like retrieval grounding, latency, caching, retries, and graceful fallbacks. Their shipped dashboard feature improved load times by 35-40% and reduced support issues tied to delayed analytics.

View profile
RD

Rudra Dudhat

Screened

Entry-level Applied AI Engineer specializing in LLMs and ML systems

Navi Mumbai, India0y exp
CCPS, IIT BhilaiIndian Institute of Technology Bhilai

AI automations intern at a lean US-based marketing agency who works directly with founders and builds practical GTM systems end-to-end. He combines ML/LLM tooling with outbound execution, including a clustering-based recommender that improved client lead generation by 30% in two weeks and a personal cold outreach engine that achieved a 12%+ reply rate.

View profile
Yuanmu Li - Junior Software Engineer specializing in full-stack web applications in Sunnyvale, CA

Yuanmu Li

Screened

Junior Software Engineer specializing in full-stack web applications

Sunnyvale, CA2y exp
WalmartPenn State University

Full-stack product engineer in the health and wellness space who has owned prescription workflow features end to end, including React/TypeScript frontend work and GraphQL-backed API aggregation across messy downstream systems. Also shipped an internal OpenAI-powered support assistant with grounded context, structured outputs, validation, and human-in-the-loop safeguards—showing strong practical judgment at the intersection of healthcare workflows and applied AI.

View profile
RK

Rohith kollu

Screened

Senior Software Engineer specializing in backend microservices, cloud, and full-stack systems

Dallas, TX7y exp
CiscoIndiana Wesleyan University

Backend/platform engineer who has built and scaled production Java/Spring Boot + Kafka services on AWS/Kubernetes (1M+ msgs/day) and led reliability/performance fixes that restored SLAs (25–30% latency improvement; 99.9% uptime). Also shipped an AI customer-support chatbot end-to-end using retrieval + guardrails and rigorous evaluation/observability, improving resolution time 40% and satisfaction 25%, with a strong plan/execute/verify approach to agentic workflow reliability.

View profile
SR

Sahithi Reddy

Screened

Mid-level Machine Learning Engineer specializing in LLM-powered products

Dallas, TX4y exp
VerizonUniversity of Massachusetts Dartmouth

Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.

View profile
HA

Habtom Asfaha

Screened

Senior Java Full-Stack & DevOps Engineer specializing in cloud-native microservices

California, USA9y exp
Syneos HealthSan Francisco State University

Software engineer with a CS/Computer Engineering background who has worked on ML/NLP (Hugging Face, clinical NLP, text generation and structured extraction) and has a school robotics project integrating a trained ML model with microprocessor-controlled hardware to drive motor movement and writing. Currently focused on building and deploying applications and ML models to AWS/Azure using Docker, Kubernetes, and CI/CD; targeting ~$150K compensation.

View profile
PK

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA4y exp
GE HealthCareFranklin University

LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.

View profile
TJ

Mid-Level Full-Stack Engineer specializing in LLM and RAG applications

San Jose, CA5y exp
MedRevealSaint Louis University

LLM/RAG engineer who took a PDF-heavy agent from prototype to production for an Africa-based client, combining Pinecone retrieval with robust PDF parsing (unstructured.io, OCR, structured table extraction). Demonstrates strong production mindset (eval metrics, prompt hardening, security/scalability) and measurable optimization impact (30% efficiency gain, 2x faster responses), and has helped close deals by building security-focused POCs for skeptical IT stakeholders.

View profile
PV

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY6y exp
AvanadeUniversity of North Texas

Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.

View profile
VM

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and Clinical AI

Chicago, Illinois4y exp
OptumIllinois Institute of Technology

Built and productionized a HIPAA-compliant LLM+RAG Clinical AI assistant at Optum, fine-tuning GPT/LLaMA on de-identified patient notes and integrating FAISS/Pinecone for sub-second retrieval; reported to cut diagnosis time by ~20 minutes per case. Experienced in orchestrating ML pipelines (Airflow, AWS Step Functions, Azure Data Factory) and in reliability techniques for LLM systems (grounding, citations, confidence filters, monitoring) while partnering closely with clinicians and compliance teams.

View profile
KR

Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps

Texas, USA4y exp
McKessonUniversity of Texas at Arlington

AI/ML engineer with healthcare domain depth who led a HIPAA-compliant, production LLM system at McKesson to automate clinical document understanding—extracting entities, summarizing provider notes, and supporting authorization decisions. Hands-on across Spark/Python ETL, Hugging Face + LoRA/QLoRA fine-tuning, RAG, and cloud-native MLOps (Airflow/Kubernetes/Step Functions, MLflow, blue-green on EKS/GKE), with explicit work on PHI handling and hallucination reduction.

View profile
KS

Mid-level AI/ML Engineer specializing in Generative AI and LLMOps

USA6y exp
UnitedHealth GroupKent State University

Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.

View profile
KS

Kumud Sharma

Screened

Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and AI integrations

USA6y exp
IntuitIndiana University

Backend engineer who has delivered large, measurable performance wins (10x throughput, 67% latency reduction) by combining Flask microservices, Redis caching, and AWS autoscaling/observability. Has hands-on depth in SQLAlchemy/Postgres optimization and production scaling pitfalls (cache consistency, connection exhaustion), plus experience deploying real-time ML inference (XGBoost) on AWS Lambda and building secure multi-tenant Kubernetes isolation.

View profile
HW

Huihai Wang

Screened

Mid-level Applied AI Engineer specializing in knowledge graphs, GraphRAG, and urban mobility

Austin, TX5y exp
Urban Information Lab, The University of Texas at AustinUniversity of Texas at Austin

ML/NLP practitioner focused on knowledge-graph-based retrieval for LLM question answering, including an urban/autonomous-vehicle decision-making use case. Built a hierarchical GraphRAG + vector database system and an entity-resolution pipeline that blends spatial and semantic similarity, validated using LLM-generated synthetic datasets; uses Python tooling like RDFLib, GraphDB, OpenAI APIs, and LangChain.

View profile
KK

Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning

United States5y exp
CitigroupUniversity of North Texas

Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.

View profile
MR

Mid-level AI/ML Engineer specializing in enterprise ML, MLOps, and Generative AI

Springfield, Missouri5y exp
O'Reilly Auto PartsSaint Louis University

ML/LLM engineer who has shipped production RAG systems (LangChain + HF Transformers + FAISS) with hybrid retrieval and cross-encoder re-ranking, deployed via FastAPI/Docker/Kubernetes and monitored with MLflow. Also partnered with wealth advisors at Edward Jones to deliver a client retention model with SHAP-driven explanations and a dashboard that improved trust, adoption, and reduced high-value client churn.

View profile

Need someone specific?

AI Search