Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

TS

Trisha Shah

Screened

Mid-level Software Engineer specializing in AI, cloud, and full-stack systems

null5y exp
eSentireUniversity of Washington

Full-stack and AI product engineer with strong AWS/Snowflake experience who built an internal feature flag platform and helped migrate a cybersecurity insights product into a multi-agent AI chat interface. They report production scale of 1M+ embeddings and 50k+ monthly queries, with outcomes including an 80% reduction in analyst work and dashboard generation in 7 minutes; the work was also featured by Claude and AWS.

View profile
NS

Nisarg Shah

Screened

Junior Software Engineer specializing in data, systems, and AI engineering

Arizona, USA2y exp
Arizona State UniversityArizona State University

Early-career/new-grad candidate who built TrendScout AI, an evidence-first market intelligence agent that ingests messy news, extracts entities/events, builds a Neo4j knowledge graph, and answers questions via RAG with citations. Achieved ~95% retrieval relevance by combining ChromaDB semantic search with graph-based retrieval and validating outputs through human evaluation and guardrails to prevent hallucinations.

View profile
SR

Senior Full-Stack Software Engineer specializing in AI agents and data platforms

Remote7y exp
AT&TCalifornia State University, Los Angeles

Full-stack and AI-focused builder who has shipped both customer-facing personalization at AT&T and internal LLM-powered automation/agent systems in startup environments. Stands out for combining TypeScript-heavy engineering rigor with practical AI orchestration, evaluation, and measurable business impact—from reducing support escalation through personalization to saving 10-11 hours per week by automating fragmented operational workflows.

View profile
Sai Naveen Reddy Reganti - Mid-level Full-Stack Engineer specializing in cloud-native FinTech and Healthcare systems in null

Mid-level Full-Stack Engineer specializing in cloud-native FinTech and Healthcare systems

null4y exp
AESWichita State University

Full-stack engineer working on customer-facing utility and banking platforms, with hands-on experience across React/React Native, Java/Spring Boot, Python/Django, AWS, and SQL performance tuning. Stands out for owning production systems end to end, improving CI/CD and deployment reliability, and delivering a measurable database optimization that cut CPU utilization from about 80% to 40%.

View profile
AB

Aryan Bhide

Screened

Junior Software Engineer specializing in LLM agentic workflows and full-stack systems

Santa Cruz, CA3y exp
PaystandUC Santa Cruz

Paystand engineer/intern who built a multi-agent LLM orchestration system (with logging/feedback loops) that became part of the team workflow and reportedly cut development time ~70%. Partnered with sales/product on enterprise demos and implemented a dynamic RBAC system that helped drive adoption of an intern-built product to multiple enterprise clients, contributing to seven-figure ARR. Also founded and pitched a student-entrepreneur business management/payments project (HustleHub) and won a university startup competition.

View profile
AR

Intern Full-Stack Software Engineer specializing in cloud, voice AI, and billing systems

Los Angeles, CA1y exp
SyncratikUSC

Product-minded full-stack engineer at a B2B startup who ships high-stakes customer-facing features fast: delivered a Spanish AI support agent in 2 weeks by benchmarking LLMs and using native Spanish system prompts, reaching 90% resolution. Built the company’s first monetization system (hybrid subscription + usage) with Stripe/Firebase, emphasizing secure JWT-based flows and idempotent webhooks, and led a microservices decoupling effort that cut developer onboarding time by 50%.

View profile
RE

Mid-level AI/ML Engineer specializing in NLP and Generative AI

Indiana, USA6y exp
Elevance HealthIndiana University Indianapolis

Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.

View profile
JJ

Jigeesha Jain

Screened

Senior Software Engineer specializing in backend systems, microservices, and AI-enhanced workflows

Boston, MA6y exp
VicorBinghamton University

Significant contributor/maintainer to an open-source JavaScript event-tracking client SDK, owning API consistency/backward compatibility, high-load batching and retry/backoff improvements, and test/CI + documentation upgrades. Diagnosed production-like issues (missing events under load) via reproduction and logging, then reduced GC pressure and improved predictability with a ring-buffer-based batching redesign while actively triaging issues and reviewing PRs.

View profile
OR

Mid-level Data Scientist specializing in predictive modeling, NLP/LLMs, and RAG search systems

Des Moines, IA6y exp
CDS GlobalUniversity of Massachusetts

Built production LLM/RAG platforms for financial services to enable natural-language Q&A over large policy/compliance document sets stored in Snowflake and SharePoint. Strong in MLOps and orchestration (Airflow, ADF, Step Functions, MLflow) and in solving real production issues like stale embeddings and model performance, including an incremental Snowflake Streams sync that cut processing time from hours to minutes.

View profile
SS

Sameer Shaik

Screened

Senior AI Engineer specializing in Generative AI, NLP, and applied deep learning

Chicago, IL8y exp
Live NationDePaul University

Built a production multi-agent LLM system at Live Nation on Databricks (LangGraph/LangChain) that let venue/event teams ask questions in Slack, auto-generated optimized route schedules, and produced inventory/stocking recommendations from historical SQL data and venue trends. Improved reliability by tightening prompts with strict JSON schemas, providing sample questions/SQL, and adding guardrails plus synthetic/edge-case testing, while iterating with event managers and senior VPs via prototypes and feedback loops.

View profile
RA

Rahul Alle

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

USA4y exp
CVS HealthAnderson University

Built a production internal LLM/RAG assistant at CVS Health to cut time spent searching long policy and clinical guideline PDFs, combining fine-tuned BERT/GPT models with FAISS retrieval and a FastAPI service on AWS. Demonstrates strong real-world reliability work (document cleanup, hallucination controls, monitoring/drift tracking with MLflow) and close collaboration with non-technical clinical operations teams via demos and feedback-driven iteration.

View profile
YT

Yash Tobre

Screened

Mid-level AI/ML Engineer specializing in computer vision, NLP/LLMs, and MLOps

Bentonville, AR4y exp
DyneticsUniversity of Texas at Arlington

ML/AI engineer with defense and commercial analytics experience: deployed a real-time aerial object detection system at Dynetics (YOLOv5 + TorchServe in Docker on AWS EC2) with drift-triggered retraining and 99.5% uptime, tackling ambiguous targets and weather degradation. Previously at Fractal Analytics, built and explained a churn prediction model for marketing stakeholders using SHAP and delivered it via a Flask API into dashboards, driving a reported 22% attrition reduction.

View profile
HK

Mid-level Data Analyst specializing in cloud ETL, BI, and machine learning

Texas, 752235y exp
UnitedHealth GroupUniversity of Texas at Arlington

Data/ML practitioner with experience at UnitedHealth Group building a fraud claims detection solution combining structured claims data and unstructured notes, validated with compliance stakeholders to improve actionable accuracy. Also applied embeddings, vector databases, and fine-tuned language models in a Bank of America capstone to detect threats/anomalies in financial documents, with production-minded Python ETL workflows using Airflow.

View profile
SS

Junior Software Engineer specializing in AI agents and full-stack cloud systems

Irvine, CA1y exp
OrangePeopleUC Irvine

Backend-focused engineer who has built and refactored FastAPI services backed by MongoDB, emphasizing async concurrency, stateless design for horizontal scaling, and performance tuning via indexing and request-level timing. Has implemented production authentication patterns (JWT, SSO, OAuth2 + PKCE) and user/org-scoped access controls, and improved reliability of LLM document-extraction APIs with fallback mechanisms.

View profile
UO

Principal Data Scientist specializing in Generative AI, NLP, and MLOps

San Francisco, CA12y exp
CognizantUniversity at Buffalo

ML/NLP practitioner with banking experience (M&T Bank) who has built a GPT-4 RAG system using LangChain and Pinecone to connect unstructured customer data with internal knowledge bases, improving accuracy and reducing manual lookup time by 50%+. Strong in entity resolution and productionizing scalable Python data workflows, including major performance wins by migrating bottleneck joins from Pandas to Dask.

View profile
MN

Mid-Level Software Developer specializing in full-stack, cloud-native microservices and AI integrations

Remote, USA4y exp
Ally FinancialUniversity of North Texas

Backend/AI engineer who has built production Spring Boot APIs on AWS (JWT auth, Redis/MySQL) and solved a real-world silent data integrity issue by implementing idempotency keys plus DB constraints/transactions. Also shipped an LLM-based document Q&A feature using a RAG pipeline with evaluation + human review, and designed multi-step agent workflows with verification, retries, and escalation guardrails.

View profile
KO

Karthik O

Screened

Mid-level AI Software Engineer specializing in LLM systems and cloud APIs

Kansas, USA3y exp
DeloitteUniversity of Central Missouri

Built and productionized an LLM-powered support/knowledge pipeline using embeddings and retrieval (RAG) to deliver more grounded, higher-quality responses while reducing manual effort. Focused on real-world reliability and performance—adding structured validation/guardrails, optimizing vector search and context size for latency/scale, and monitoring failure patterns in production. Experienced with orchestration via LangChain for LLM workflows and Airflow for production data/ML pipelines, and iterates closely with operations stakeholders through demos and feedback.

View profile
HS

Mid-level Full-Stack Engineer specializing in cloud data platforms and LLM-powered apps

New York City, NY4y exp
CenteneUniversity of Maryland, Baltimore County

Full-stack engineer with healthcare and finance experience who has owned end-to-end production systems across Azure and AWS. Built a real-time clinical dashboard at Centene (React + FastAPI + Azure Event Hubs) that cut data latency from ~12 minutes to under 1 minute and was associated with a 30% reduction in intervention delays. Also delivered MVPs in high-ambiguity environments at Accenture during monolith-to-microservices migration, improving uptime and maintainability with measurable results.

View profile
VS

Vinay shetty

Screened

Mid-Level Java Full-Stack Developer specializing in cloud-native microservices

4y exp
Huntington BankUniversity of North Texas

Full-stack engineer with ~3.5 years of Java Spring Boot and React experience who built an end-to-end banking transaction platform using microservices, Kafka streaming, AWS RDS, and Dockerized CI/CD. Demonstrates strong performance and reliability engineering (async processing, DLQ/retries, idempotency, caching) plus secure cloud deployment practices; has also worked across banking, healthcare, and insurance domains.

View profile
Harikiran Jangam - Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG systems in California, USA

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG systems

California, USA3y exp
McKessonCalifornia Lutheran University

Backend engineer who built and evolved a PHI-compliant RAG system (FastAPI + LangChain + embeddings/FAISS) for internal document search and summarization, delivering <400ms p95 latency at ~2,500 daily requests and measurable impact (30% faster investigations, +17% retrieval relevance). Demonstrates strong security and rollout discipline (RBAC/RLS/JWT, redaction/audits, shadow mode, dual writes, canaries) and a focus on reducing hallucination risk via grounded guardrails and confidence-based fallbacks.

View profile
Surya Vamshi Sriperambudooru - Mid-level AI Engineer specializing in healthcare claims analytics and RAG copilots in Remote, US

Mid-level AI Engineer specializing in healthcare claims analytics and RAG copilots

Remote, US4y exp
CodoxoUniversity of Texas at Dallas

Built a production "appeals co-pilot" for a healthcare claims appeals team, combining an XGBoost/logistic ranking model with a Python/LangChain RAG stack (FAISS + Mistral 7B) to surface high-probability appeal wins and speed policy-grounded drafting. Emphasizes reliability and trust: hybrid retrieval with metadata routing, citation/eval scripts, guardrails, and an explainability layer that non-technical stakeholders could understand and override.

View profile
Gary Chang - Senior Full-Stack Software Engineer specializing in AI-first cloud-native systems in Los Angeles, CA

Gary Chang

Screened

Senior Full-Stack Software Engineer specializing in AI-first cloud-native systems

Los Angeles, CA9y exp
System1UCLA

End-to-end engineer who has productionized AI automation and RAG capabilities, building full-stack systems (React/Node/Redis/Postgres + vector DB) with evaluation-driven quality gates and monitoring. Reported ~60% reduction in manual ops time and major turnaround improvements, and has experience modernizing legacy systems safely via feature flags and parallel runs while working across product, data, and ops teams (System1).

View profile
TJ

Tejasri Joshi

Screened

Mid-level Machine Learning Engineer specializing in Healthcare AI and Generative AI

Chicago, IL6y exp
UnitedHealth GroupDePaul University

Analytics professional with Intuit experience spanning modern data stack work, behavioral segmentation, and applied AI. They built dbt/Snowflake pipelines powering retention and churn dashboards, automated feedback classification with OpenAI/LangChain, and partnered closely with product and marketing teams to turn analytics into onboarding, targeting, and lifecycle messaging decisions.

View profile
Shashwat Negi - Mid-level Software Engineer specializing in AI/ML and full-stack systems in San Jose, CA

Shashwat Negi

Screened

Mid-level Software Engineer specializing in AI/ML and full-stack systems

San Jose, CA3y exp
InfrrdUniversity of Wisconsin–Madison

Data Scientist (2–3 years) at ZS Associates who has built and productionized agentic LLM systems, including a LangGraph-based multi-LLM prompt-optimization pipeline for entity extraction deployed as a Spring Boot microservice via Jenkins. Also built an Insightmate.ai chatbot and improved its RAG accuracy by diagnosing vector retrieval issues and implementing HyDE query expansion, while partnering with sales and pharma stakeholders to drive adoption (e.g., Zimmer Biomet platform migration into a multi-year partnership).

View profile

Need someone specific?

AI Search