Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Suparshwa Patil

Screened | References | Strong rec.

Mid-level Software Engineer specializing in AI platforms and full-stack systems

Santa Clara, CA | 4y exp
One Community | Purdue University

Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.

TejaSree Chiluveru

Screened | References | Moderate rec.

Mid-level Software Engineer specializing in FinTech and cloud-native microservices

Austin, TX | 5y exp
JPMorgan Chase | Webster University

Built and launched an internal AI troubleshooting assistant focused on safe, retrieval-first root cause analysis for enterprise systems, with strong attention to monitoring, fallback behavior, and post-launch iteration. Also owns full-stack product work across React and Java/Spring Boot, including high-volume financial operations workflows, and reports measurable LLM improvements such as ~30-40% latency reduction.

Manasa Pantra

Screened | References | Strong rec.

Junior Software Engineer specializing in AI, LLM systems, and full-stack development

Stony Brook, NY | 2y exp
Stony Brook University | Stony Brook University

Product-focused full-stack engineer at a startup (Zippy) who shipped a production multi-agent AI system for restaurant operations and payments workflows. Built end-to-end: RAG grounded on a Notion knowledge base, structured function-calling task routing, a FastAPI/JWT multi-tenant backend, and a polished React+TypeScript owner dashboard. Has real production incident experience (duplicate Stripe webhooks) and reports ~94% task-routing accuracy under load.

Abnik Ahilasamy

Screened | References | Moderate rec.

Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference

Chennai, India | 0y exp
Larsen & Toubro | Arizona State University

Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.

ST

Mid-level AI/ML Engineer specializing in GenAI and predictive modeling

Fullerton, California | 5y exp
UnitedHealth Group | George Washington University

Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.

Julian Lee

Screened

Intern Software Engineer specializing in AI/LLMs and full-stack development

New York, New York | 1y exp
Highlight.AI | USC

AI/ML infrastructure-focused engineer who has built production RAG systems from scratch (Supabase/pgvector + OpenAI embeddings) and iterated using formal eval metrics to improve retrieval quality. Also debugged real-time audio issues in a LiveKit-based pipeline by correlating packet loss with VAD behavior, and has deep experience building brittle, customer-specific financial platform integrations in Python/Playwright (2FA, redirects, token refresh, rate limits).

Vamsi Koppala

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

Barrington, IL | 4y exp
Comerica | Texas Tech University

LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.

Alex Nguyen

Screened

Junior Applied AI Engineer specializing in LLMs, RAG, and agentic systems

La Jolla, CA | 2y exp
Uniwise.ai | UC San Diego

Co-founded a healthcare AI startup building and deploying software directly with end users, emphasizing rapid shipping, deep user interviews, and workflow-first adoption. Has hands-on production deployment experience on AWS (including diagnosing a silent AWS App Runner failure caused by an ARM vs amd64 Docker build mismatch) and is motivated by customer-facing, travel-heavy roles to keep engineering tightly connected to real-world usage.

AK

Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps

San Francisco Bay Area, CA | 5y exp
Verizon | California State University

Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.

SK

Mid-level AI/ML Engineer specializing in Generative AI and healthcare data

NJ, USA | 6y exp
Johnson & Johnson | Wichita State University

Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.

Anurag Reddy

Screened

Mid-level Data Scientist specializing in ML, MLOps, and Generative AI

TX, USA | 5y exp
Caterpillar | University of Illinois Chicago

ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.

David Wisdom

Screened

Mid-level Data & Machine Learning Engineer specializing in production ML and data platforms

San Francisco, CA | 5y exp
Spice Data | William & Mary

Built and deployed a production LLM system that scraped Google Maps menu photos, extracted structured prices via OpenAI, and cross-validated them against website-scraped data to automate data-quality verification at scale (replacing costly manual contractor checks). Demonstrates strong reliability instincts—precision-first prompting, output gating with image-quality metadata, and fuzzy matching/RAG techniques—plus solid orchestration (Dagster/Airflow) and observability (Sentry, Prometheus/Grafana).

Sarthak Singh

Screened

Mid-level Full-Stack Engineer specializing in cloud-native systems and LLM applications

Remote, USA | 4y exp
Influenced | University of Maryland, College Park

Customer-support/engineering background spanning Informatica PowerCenter ETL and IBM demos/workshops, with hands-on experience hardening data workflows for production (error tables/reject links, validation, restart strategies, alerting, performance tuning). Also demonstrates a clear, systems-level approach to diagnosing LLM/agentic workflow issues (prompt/RAG/tooling/memory) using instrumentation and iterative fixes, and has partnered with sales on POCs by defining success metrics and mapping solutions to customer architectures.

Asif Mulla

Screened

Mid-Level Software Engineer specializing in Java microservices and event-driven systems

Maryland, USA | 6y exp
Morgan Stanley | University of Alabama at Birmingham

Backend engineer on Morgan Stanley’s trade risk and compliance platform, building Java/Spring Boot microservices that validate equity and fixed-income trades at multi-million-events/day scale. Shipped an LLM-assisted trade exception analysis feature using RAG over internal policy documents and trade history, with production-grade guardrails (confidence thresholds, audit logs, human-in-the-loop) and measurable performance wins (~30–35% faster reporting) through PostgreSQL tuning and Redis caching.

Ananya Bojja

Screened

Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps

USA | 4y exp
Cigna | University of New Hampshire

AI/ML engineer at Cigna Healthcare building a production, HIPAA-compliant LLM-powered clinical insights platform that summarizes unstructured medical notes using a fine-tuned transformer + RAG on AWS. Demonstrates strong end-to-end MLOps and cloud optimization (distillation, Spot/Lambda/Auto Scaling) with quantified outcomes (~28% accuracy lift, ~40% less manual review, ~25% lower ops cost) and strong clinician-facing explainability via SHAP and dashboards.

SG

Mid-level Generative AI Engineer specializing in LLM systems and RAG

5y exp
Huntington Bank | Central Michigan University

Currently at Huntington Bank, built a production-grade RAG system that helps business/operations teams get grounded answers from large volumes of internal enterprise documents. Owns ingestion and FastAPI backend, tuned hybrid BM25+vector retrieval and chunking for relevance, and evaluates reliability with metrics and observability (LangSmith, CloudWatch, Prometheus/Grafana) while partnering closely with non-technical stakeholders.

HV

Junior Software Engineer specializing in Full-Stack and ML for FinTech

Hyderabad, Telangana | 1y exp
Volksoft Technologies | USC

Full-stack engineer with fintech trading-platform experience who shipped and operated a real-time portfolio P&L/performance feature end-to-end (React + Node/WebSockets + MongoDB) on AWS, including significant performance tuning under peak trading load. Also built a Spark-based trading analytics pipeline with idempotency and reconciliation for auditability, and has a personal React/TS + Node/Express project (Artsy) with JWT auth and schema-evolution practices.

NK

Senior Data Engineer specializing in Palantir Foundry and Snowflake for regulated industries

USA | 5y exp
American Express | University of Massachusetts Boston

Data engineer focused on high-volume transaction pipelines (2M+ per day) using Snowflake/Snowpipe, Spark/PySpark, Kafka, and Airflow, with a strong emphasis on schema/data-quality enforcement and reliability improvements. Also built a greenfield compliance-focused RAG solution, using CloudWatch monitoring and adding ingestion validation to prevent malformed OCR documents from degrading search quality.

Jiyang Wu

Screened

Junior Software Engineer specializing in cloud microservices and database systems

Stony Brook, NY | 2y exp
Stony Brook University | Stony Brook University

Grad student who co-developed a safety-oriented mental health LLM consulting agent using RAG + Gemini and Hugging Face emotion detection to assess user crisis level and adapt responses. Implemented a key reliability improvement for CRISIS scenarios by bypassing generative output and returning direct, emotionless, knowledge-base guidance to seek immediate real-world help.

Cristian Vega

Screened

Senior AI/ML Engineer specializing in Generative AI and RAG

California | 9y exp
Morf Health | University of Texas at Austin

ML/NLP practitioner at Morf Health focused on unifying fragmented healthcare data by linking structured patient/encounter records with unstructured clinical notes. Has hands-on experience with transformer embeddings, vector databases, and domain fine-tuning, plus rigorous evaluation (precision/recall) and human-in-the-loop validation with clinical SMEs to make pipelines production-grade.

Nikitha Kommidi

Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps

6y exp
Citibank | University of Texas at Arlington

Built a production real-time fraud detection and customer-support automation platform at Citibank, tackling extreme class imbalance (reported ~1:5000) and strict latency constraints. Combines hands-on MLOps (Airflow, Kubernetes, MLflow; Snowflake/Spark/S3 integrations; CI/CD model promotion) with cross-functional delivery to Risk & Compliance focused on interpretability and reducing false positives.

Himanshu Sharma

Mid-level AI Solutions Engineer specializing in enterprise GenAI and automation

Orlando, FL | 6y exp
Kore.ai | University of South Florida

Built and shipped multiple production LLM/agentic systems, including an agentic RAG NL-to-SQL analytics app that cut manual reporting from 9 hours/week to 15 minutes by grounding on schema-aware retrieval and robust fallback/monitoring. Also implemented a LangChain supervisor-orchestrated enterprise IT automation agent that routes requests for search, identity validation, and action execution, and created a RAG search tool spanning Jira/Confluence/SharePoint for operations stakeholders.

Kasireddy Kumar Reddy

Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems

Missouri, USA | 6y exp
Centene | University of Central Missouri

Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.

Fnu Pallavi Sharma

Intern Data Scientist specializing in ML, NLP, and MLOps for healthcare and enterprise AI

Madison, WI | 1y exp
University of Wisconsin–Madison | University of Wisconsin–Madison

Built a production multi-cloud LLM-driven IT ticket automation system using LangGraph, Azure + Pinecone RAG, and an Ollama-hosted LLM on AWS, with Terraform-managed infra and PostgreSQL audit/state tracking for reliability. Also partnered with UW School of Medicine & Public Health students to deliver a glioma survival risk-ranking model, translating clinical feedback into practical pipeline improvements (imputation, site harmonization) and stakeholder-friendly visualizations.