Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

VV

Mid AI/ML Engineer specializing in LLM alignment and scalable AI systems

Harrison, NJ5y exp
AnthropicNJIT
View profile
RP

Mid-level Full-Stack Software Engineer specializing in AI-powered applications

Seattle, WA5y exp
AmazonUniversity at Buffalo
View profile
DL

Senior AI Engineer specializing in LLM, multimodal, and XR systems

Miami Beach, FL13y exp
Tacocat InteractiveGeorgia Tech
View profile
MN

Senior AI/ML Engineer specializing in NLP, computer vision, and MLOps

Ohio, USA10y exp
Pixolat LLC
View profile
AK

Staff AI Systems Engineer specializing in multi-agent and distributed platforms

San Francisco Bay Area, CA18y exp
Reddit
View profile
NB

Executive Product & Technical Services Leader specializing in AI, Crypto, and FinTech

San Francisco, CA17y exp
Blink AI
View profile
Surya Teja - Mid-level Backend/Full-Stack Engineer specializing in AI and FinTech payments in Tempe, AZ

Surya Teja

Screened

Mid-level Backend/Full-Stack Engineer specializing in AI and FinTech payments

Tempe, AZ4y exp
StripeArizona State University

Full-stack engineer who has owned an operational reporting/dashboard product end-to-end—building a React UI, designing/implementing FastAPI services, and deploying/operating on AWS. Demonstrates strong performance engineering (Postgres query/index tuning using EXPLAIN ANALYZE) with concrete impact (reports reduced from tens of seconds to a few seconds) and a reliability mindset across observability, migrations, and resilient third-party/ETL integrations.

View profile
Sourabh Jain - Director of Software Engineering specializing in enterprise Data, ML & AI platforms in Bay Area, CA

Sourabh Jain

Screened

Director of Software Engineering specializing in enterprise Data, ML & AI platforms

Bay Area, CA23y exp
RSA SecurityShri G. S. Institute of Technology and Science

Former Walmart Director of Software Engineering who left in March 2025 to build products for clients. Recently delivered an LLM/RAG-based UNSPSC classification solution for an MRO client using a multi-stage retrieval + web search + prompt-engineering workflow, and has led large-scale retail forecasting initiatives and high-severity cloud-migration incidents end-to-end.

View profile
Kevin Allen - Senior AI/ML Engineer specializing in conversational and generative AI in Austin, TX

Kevin Allen

Screened

Senior AI/ML Engineer specializing in conversational and generative AI

Austin, TX12y exp
General MotorsUniversity of Kentucky

Built and productionized an LLM-based support assistant end-to-end, including RAG, APIs, monitoring, guardrails, and agent feedback loops. Stands out for translating GenAI prototypes into reliable production systems with structured evaluation, safety controls, and reusable Python infrastructure that improved both support quality and engineering velocity.

View profile
BK

Balpreet Kaur

Screened

Junior Machine Learning Engineer specializing in LLMs and data pipelines

Amherst, MA2y exp
Google DeepMindUniversity of Massachusetts Amherst

Research Extern at Google DeepMind and former AWS Software Development Engineer Intern with a strong focus on practical, trustworthy AI engineering. Built a multi-agent RAG system for personalized news headline generation using a fine-tuned Flan-T5 model, parallel critic agents, FAISS retrieval, and style embeddings, while also leading a 3-person team on the project.

View profile
EM

Eric Majewicz

Screened

Staff Frontend Engineer specializing in enterprise SaaS, analytics, and AI-powered products

Remote19y exp
HubSpotRochester Institute of Technology

Frontend tech lead at HubSpot who shipped an LLM-powered insights dashboard that analyzed complex customer interaction histories and surfaced sentiment, challenges, and next-best actions for sales users. Stands out for having taken an AI feature beyond prototype into beta and full production, with strong emphasis on testing, maintainability, and practical production tradeoffs.

View profile
YD

Yunqi Dong

Screened

Intern Software Engineer specializing in AI, data systems, and recommendation platforms

Pittsburgh, PA0y exp
MeituanCarnegie Mellon University

Full-stack engineer with a strong mix of real-time product engineering and applied AI experience. Built and deployed a production stock trading simulator on AWS and an LLM-based customer support agent with RAG/tooling, and also shipped a zero-to-one in-store detection feature at Meituan that improved CTR by 7% and conversion by 11%.

View profile
SB

Suraj Botcha

Screened

Intern AI/ML Engineer specializing in LLM systems and industrial AI

Remote1y exp
ControlRooms.AICarnegie Mellon University

Full-stack AI engineer who has built both document-intelligence products and agentic investigation systems end to end. At ControlRooms.AI, they helped ship a production-facing root cause investigation workflow for industrial operations using Neo4j, FastMCP, RAG, OCR/VLM inputs, and multiple LLMs, contributing to roughly a 10x reduction in manual investigation time. They stand out for designing explainable, traceable AI systems that surface evidence, uncertainty, and missing context rather than forcing overconfident answers.

View profile
Andrew Liang - Intern Software Engineer specializing in full-stack and AI/ML systems

Andrew Liang

Screened

Intern Software Engineer specializing in full-stack and AI/ML systems

2y exp
AmazonUCLA

Software engineer with experience at Amazon and Agora building end-to-end systems: a knowledge-base AI chatbot (React/TypeScript UI + retrieval/response backend + Docker deployment) and an internal approval governance platform using AWS Step Functions and DynamoDB. Emphasizes fast iteration without sacrificing trust via feature-flag rollouts, citation-required answers, abstention on low-confidence retrieval, regression query sets, and strong observability (request IDs, structured logs, latency/error monitoring).

View profile
XL

Xicheng Liang

Screened

Intern AI/Full-Stack Engineer specializing in backend systems and applied machine learning

Chicago, IL1y exp
Becker’s HealthcareUniversity of Pennsylvania

Built and shipped a production agentic RAG system for healthcare analysts that automated compliance/operations knowledge retrieval across PDFs, reports, and databases. Emphasizes production reliability (monitoring, retries, fallbacks, async queues), strong evaluation/iteration loops, and measurable impact (3–10s responses and ~98% top-k retrieval accuracy).

View profile
CR

Senior Machine Learning Engineer specializing in conversational AI and Generative AI

San Francisco, CA6y exp
Scale AIDallas Baptist University

ML/AI engineer with experience at Uber and Scale AI, focused on customer service automation across both classical NLP and generative AI systems. Has owned systems from experimentation through production on AWS, including LLM fine-tuning, RAG optimization, safety evaluation, and internal Python platform tooling that improved consistency and engineering velocity.

View profile
VK

Senior Software Engineer specializing in backend systems, cloud, and AI automation

Houston, TX5y exp
NetflixUniversity of Houston-Clear Lake

Built a production AI-powered workflow automation system at Netflix that integrated OpenAI and LangChain with FastAPI services on AWS, cutting roughly 320 hours of manual operational effort. Brings a mix of full-stack product development and practical AI systems experience, with strong attention to reliability, maintainability, and non-technical user adoption.

View profile
SC

Shweta Chavan

Screened

Junior Computer Vision & ML Engineer specializing in autonomous perception systems

Pittsburgh, PA2y exp
Magna InternationalCarnegie Mellon University

LLM/RAG engineer who built a production-style multi-agent orchestrator for resume-to-recommendation workflows (PDF ingestion through screening and recommendations), emphasizing prompt tuning and strict JSON output contracts. Currently building a RAG application for an NGO using Airflow (DAGs + embeddings) and tackling messy, missing/imbalanced data; has hands-on retrieval stack experience (FAISS/HNSW, bge embeddings) and uses rigorous evaluation metrics for groundedness and hallucination control.

View profile
SB

Mid-level Backend & Reliability Engineer specializing in AWS, Kubernetes, and automation

New Mexico, US5y exp
MetaUniversity of North Carolina at Charlotte

Meta engineer focused on reliability/operations tooling who built a unified real-time health dashboard and scalable telemetry pipelines (AWS + Datadog) for thousands of devices. Also shipped an internal LLM-powered knowledge assistant using RAG over wikis/runbooks/logs with strong guardrails and a rigorous eval loop that drove measurable accuracy improvements via automated doc ingestion and embedding updates.

View profile
Dheeraj Kumar - Intern Data Scientist specializing in marketing analytics and data engineering in Tucson, Arizona

Dheeraj Kumar

Screened

Intern Data Scientist specializing in marketing analytics and data engineering

Tucson, Arizona2y exp
RochePurdue University

AI/LLM practitioner with internships at Dell Technologies and Roche who built and deployed a healthcare-focused "Doctor LLM" by fine-tuning Meta Llama 3.2 on healthcaremagic.json, emphasizing safety guardrails to prevent harmful medical advice. Experienced in productionizing AI workflows with monitoring, testing, and orchestration (Airflow, Kubernetes), and in delivering AI-agent-driven competitive landscape insights to non-technical business stakeholders.

View profile
Muhan Zhang - Junior AI Software Engineer specializing in LLM pipelines, OCR, and RAG in Palo Alto, USA

Muhan Zhang

Screened

Junior AI Software Engineer specializing in LLM pipelines, OCR, and RAG

Palo Alto, USA2y exp
Platflow.AICornell University

Built and shipped a production LLM pipeline for nursing home Medicare reimbursement (PDF OCR + fact extraction + keyword RAG + QA) that reportedly increased payouts by ~$1K/month per patient. Strong in LLM ops/benchmarking (ground truth, LLM-as-judge, cost/I-O tracking) and pragmatic optimization—swapped retrieval approaches, fine-tuned a small model to cut OCR cost 90%, and migrated workloads to Azure/Temporal to scale nightly processing 10x.

View profile
Zheng Wu - Junior Software Engineer specializing in backend systems and cloud messaging in Mountain View, CA

Zheng Wu

Screened

Junior Software Engineer specializing in backend systems and cloud messaging

Mountain View, CA1y exp
NewsBreakRice University

Data/ML engineer who has owned end-to-end systems across email deliverability/segmentation and production LLM apps. Built a Spark+Airflow segmentation engine that materially improved deliverability (99.9%) and open rates (>50%), and shipped a PDF-to-quiz RAG product using LangChain/Vertex AI/Chroma with strong guardrails and an eval loop that cut hallucinations to <5%.

View profile
Asrith Velireddy - Mid-level AI/ML Engineer specializing in MLOps, LLMs, and scalable ML systems in Harrison, NJ

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and scalable ML systems

Harrison, NJ4y exp
AdobeNJIT

ML/LLM engineer at Adobe who deployed a transformer-based personalization and campaign-targeting recommender system end-to-end, including PySpark/Airflow pipelines processing 12M+ events/day and containerized inference on AWS SageMaker (Docker/Kubernetes). Also has hands-on LLM workflow experience (RAG, semantic search, prompt optimization, hallucination mitigation) with a metrics-driven approach to reliability, drift monitoring, and reproducible retraining via MLflow.

View profile

Need someone specific?

AI Search