Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

Rohit Bisht

Screened

Junior Data Scientist / ML Engineer specializing in LLMs and RAG systems

Dehradun, India · 2y exp
Project On Track · IIIT Ranchi

Built and deployed a production enterprise LLM-powered RAG assistant for the construction domain, enabling natural-language querying across PDFs/reports and structured sources (SQL/CSV). Implemented an agent-based routing and multi-agent orchestration approach (LangChain/LangGraph) to reduce hallucinations, improve latency, and deliver actionable, structured responses based on stakeholder feedback.

OT

Intern AI/Data Scientist specializing in LLMs, RAG, and MLOps

Maryland, USA · 2y exp
University of Maryland · University of Maryland, College Park

Internship project at Builder Market: built an end-to-end production multimodal LLM application that estimates renovation/replacement costs from appliance photos (CLIP embeddings) or text descriptions, combining fine-tuning with agentic RAG. Focused heavily on real-world performance constraints—latency and cost—using parallel agent workflows, model routing to smaller/open-source models, re-ranking, and retrieval chunking, and collaborated closely with CEO/co-founders to deliver the solution.

Shubham Patil

Screened

Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics

New York, NY · 4y exp
Syracuse University · Syracuse University

Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).

MY

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

USA · 4y exp
State Street · Webster University

Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.

SJ

Mid-level Data Scientist / ML Engineer specializing in MLOps and Generative AI

Alexandria, Virginia · 3y exp
Schizophrenia & Psychosis Action Alliance · Stony Brook University

Built and deployed an AI agent to help patients navigate complex housing information by scraping and normalizing unstructured data across all 50 U.S. states, then layering a LangChain RAG system with MMR re-ranking to reduce hallucinations. Experienced in orchestrating multi-agent workflows (LangGraph/CrewAI) and production reliability practices (Pydantic-validated outputs, LLM-as-judge evals, tracing). Also delivered stakeholder-facing explainability via SHAP dashboards for a loan-approval predictive model at Welspot.

Deep Patel

Screened

Junior AI/ML Engineer specializing in NLP, LLMs, and MLOps deployment

Seattle, WA · 1y exp
Firenix Technologies Pvt. Ltd. · University of Oklahoma

Built and deployed NeuroDoc, a production-grade RAG system for PDF Q&A that delivers citation-backed answers with strong anti-hallucination guardrails. Experienced in orchestrating and scaling ML/LLM pipelines with Kubernetes, Airflow/Prefect, and PyTorch Distributed, and in building rigorous evaluation and citation-verification tooling to ensure reliability in production.

Yashi Agarwal

Screened

Mid-level Machine Learning Engineer specializing in NLP, Generative AI, and RAG systems

Los Angeles, CA · 4y exp
Kaiyros · California State University, East Bay

Built and deployed a production LLM-powered phone assistant for a healthcare clinic, combining streaming STT/TTS with RAG over approved clinic documents and strict safety guardrails to prevent unverified medical advice, plus seamless human handoff. Also has hands-on Apache Airflow experience building robust daily ML/data pipelines with data validation, retries/timeouts, monitoring, and metric-gated model deployment, and iterates closely with clinic staff using real call reviews.

Sai Erramada

Screened

Mid-level Full-Stack Java Developer specializing in microservices and cloud-native systems

Wisconsin, USA · 6y exp
Walgreens · Concordia University Wisconsin

Backend engineer with hands-on experience building real-time, event-driven systems at Walgreens, including a Kafka-based prescription status notification service and scalable pipelines for messy prescription/inventory data. Strong focus on reliability patterns (retries, idempotency, DLQs) and iterating based on pharmacist feedback to improve usability.

Rahul Ganesan

Screened

Intern AI Engineer specializing in LLM systems, RAG, and cloud data pipelines

Washington, PA · 0y exp
Frazier Simplex Machine Company · University of Colorado Boulder

Built and deployed a production Dockerized multimodal (voice+text) LLM agent for knowledge management that retrieves from Notion and documents and falls back to Tavily-powered web search with citations when internal notes are missing. Emphasizes production reliability via model-switching fallbacks, caching, strict structured outputs (Pydantic/JSON schema), and MCP-based orchestration with state-aware gating and monitoring to reduce redundant tool calls and improve success rates.

Uttam Kumar

Screened

Intern AI Engineer specializing in LLM agents, RAG, and scalable cloud deployment

Atlanta, GA · 2y exp
GPT Integrators · Arizona State University

AI/LLM engineer at GPT Integrators who built a production multi-agent enterprise workflow integration system, tackling hard problems in agent orchestration, layered memory, and custom RAG over enterprise/user data. Also built an education-focused agent solution integrating with Canvas, Zoom, and email to automate classroom admin tasks, and is currently applying agentic AI to insurance underwriting workflows in collaboration with underwriters.

Sai Leela Kuragayala

Mid-level Full-Stack Software Engineer specializing in scalable web apps and automation

Los Angeles, CA · 5y exp
S&S Fashions Inc. · NJIT

UE5 UI engineer who has shipped production-ready HUD/menu frameworks using C++/Slate/UMG and CommonUI, emphasizing MVVM-style architecture for maintainability and designer-friendly iteration. Strong in UI profiling/optimization (Unreal Insights + Slate Profiler), including Slate list virtualization and event-driven updates that improved UI frame time by ~30% in heavy menu scenarios.

Srinandh Reddy

Mid-Level Software Engineer specializing in backend, cloud, and event-driven systems

Aurora, Illinois · 5y exp
McKesson · Lewis University

Robotics software engineer focused on backend and distributed systems for real-time robot operations, including sensor ingestion, robot state management, and robot-to-cloud communication. Hands-on with ROS/ROS2 integration and real-time navigation debugging, plus production-grade monitoring, CI/CD, and containerized deployments (Docker/Kubernetes) to improve stability and performance.

Abhishek Basu

Screened

Junior Backend Software Engineer specializing in cloud and AI systems

Chicago, IL · 2y exp
Carpl.ai · University of Illinois Chicago

Built and shipped LLM-enabled decision systems focused on real production reliability rather than chatbot demos, including a multimodal radiology retrieval platform with 28% relevance gains and 35% lower latency. Also architected a 4-agent employee analytics workflow with structured outputs, traceable orchestration, and strong safeguards for messy real-world data.

AR

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

Kansas City, MO · 5y exp
NAIC · University of Central Missouri

ML/AI engineer with hands-on ownership of fraud detection and investigator-assist systems, combining anomaly detection with RAG-based LLM summarization in production. Stands out for translating research ideas into reliable cloud-deployed workflows that improved precision to 92%, cut review time by 25-30%, and increased investigator throughput by roughly 30% while also building reusable Python infrastructure for team-wide velocity.

Chin-yu Wu

Screened

Junior Data Analyst specializing in sports analytics and business intelligence

Indianapolis, IN · 2y exp
Indianapolis Colts · Indiana University Indianapolis

Analytics professional in the sports industry who has owned high-impact revenue and compliance data projects for the Colts, turning fragmented Ticketmaster and Salesforce data into trusted real-time reporting. Stands out for combining strong SQL/Snowflake engineering, rigorous validation practices, and stakeholder-facing metric design that drove a record 98% compliance rate and meaningful revenue recovery.

AK

Junior Software Engineer specializing in full-stack systems and AI applications

New York, NY · 2y exp
Sentari AI · Santa Clara University

Full-stack AI engineer who has owned production deployments for both a voice journaling/emotional insights app and a RAG-based research assistant. Stands out for turning messy, failure-prone LLM and document pipelines into reliable user-facing systems through strong debugging, staged workflow design, and post-launch stabilization.

PN

Mid-level AI Engineer specializing in distributed systems and LLM applications

Syracuse, NY · 4y exp
Syracuse University · Syracuse University

Built production AI agents that convert natural-language requests into structured workflows using LangChain, tool calling, and a Kafka/Kubernetes backend, with strong emphasis on tracing, validation, and self-correcting failure handling. Also drove a zero-to-one Research Day judging platform spanning React, Flask, RAG, and ILP-based assignment optimization for ~100 live posters, achieving 99% uptime and winning Best Web App.

Rohan Chodapunedi

Entry-level Data Scientist specializing in LLMs and analytics

Folsom, CA · 1y exp
App Orchid · Virginia Tech

Built a zero-to-one AI contract/policy QA agent for compliance and data teams, with a strong emphasis on trust, traceability, and clause-level citations rather than just fluent answers. They combine full-stack product ownership with practical LLM systems design, including hybrid retrieval, structured outputs, and evaluation pipelines to improve reliability, latency, and cost.

RK

Mid-level Software Engineer specializing in AI, backend systems, and data platforms

San Ramon, CA · 7y exp
StackGen · University of Illinois Chicago

Built and shipped production AI features for Aiden, including a natural-language agent and a Knowledge Hub ingestion/retrieval system. Stands out for hands-on debugging of real LLM production issues across providers like OpenAI and AWS Bedrock, improving reliability and achieving 90% response/retrieval consistency through direct LiteLLM integration, validation, monitoring, and async system design.

Humera Naaz

Screened

Mid-level Full-Stack Developer specializing in cloud-native enterprise applications

Remote, USA · 3y exp
Cyber Infrastructure Inc. · San Francisco Bay University

Engineer with hands-on experience embedding AI into software delivery workflows, including Claude-powered PR review, testing, debugging, and multi-agent coding pipelines. They pair AI automation with strong systems thinking around microservices, fault tolerance, multi-AZ design, caching, and security controls like WAF and rate limiting, and also experiment independently with RAG and multi-agent search projects.

George Platon

Screened

Principal Full-Stack Engineer specializing in AI, DevOps, and cloud platforms

Romania, Romania · 16y exp
Healing.care · Babeș-Bolyai University

Built a production end-to-end AI video-to-reels clip extraction system using a multi-agent architecture with transcription, captioning, effects generation, and centralized orchestration. Demonstrates unusually strong systems thinking around reliability, observability, evaluation, and production tradeoffs for LLM-powered workflows, including Kubernetes/Kafka-based deployment and regression-driven prompt governance.

Kevin Delong

Screened

Senior AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems

Irvine, CA · 12y exp
StfineTech · Lawrence Technological University

AI/ML engineer with hands-on experience shipping production systems across fintech, travel, and legal use cases. They’ve built end-to-end chatbot, generative content, and RAG solutions on AWS with CI/CD, monitoring, and guardrails, including a loan application platform that generated $3,000 in sales in its first month.

SK

Mid-level AI Software Engineer specializing in backend systems and FinTech AI

USA · 4y exp
PNC · Concordia University, St. Paul

Data engineering/software development candidate who built a stock market pipeline and uses that project to demonstrate strong architectural thinking across Kafka, Spark, and Airflow. They stand out for a pragmatic approach to AI: using tools like Copilot, ChatGPT, LangChain, and AutoGen to accelerate development while maintaining human oversight, testing, and system-level decision making.

Angel Paudel

Screened

Intern Data Engineer specializing in healthcare analytics and machine learning

Akron, OH · 1y exp
Akron Children’s Hospital · Ohio State University

Early-career engineer with undergraduate research and hospital internship experience building Python/LLM automation systems, including a Study Planner AI and internal RAG tools for messy legal and clinical data workflows. Stands out for combining web scraping, vector search, and frontend integration to replace manual CSV-heavy processes under tight timelines.
