Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Manpreet Kour - Senior Data Scientist specializing in Generative AI and NLP in Seattle, USA

Manpreet Kour

Screened

Senior Data Scientist specializing in Generative AI and NLP

Seattle, USA6y exp
SOTIDr. B. R. Ambedkar National Institute of Technology, Jalandhar

ML/NLP engineer with recent Scotiabank experience building production-grade indexing automation over large-scale emails and customer databases, combining LLM fine-tuning (Mistral, XLM-R) with fuzzy matching to exceed 95% accuracy under strict banking constraints. Also built a RAG-based chat agent using Gecko embeddings, Vertex AI Search, Gemini, and cross-encoder reranking, and delivered a text-to-SQL chatbot at SOTI through iterative fine-tuning and benchmark-driven experimentation.

View profile
Sharanya Rao - Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for finance and healthcare in Remote, USA

Sharanya Rao

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for finance and healthcare

Remote, USA3y exp
Ally FinancialUniversity of Maryland, Baltimore County

Built an AI lending assistant (RAG + DeBERTa) used by credit analysts to retrieve policies and past loan decisions, tackling real production issues like hallucinations, document quality, and sub-second latency. Deployed a modular, Dockerized AWS architecture (ECS/EMR + load balancer) with load testing, caching/precomputed embeddings, and CloudWatch monitoring, and used Airflow to automate scheduled data/embedding/vector DB refresh pipelines with retries and alerts.

View profile
Yogita Adari - Mid-level AI Engineer specializing in generative and multimodal systems in San Francisco, CA

Yogita Adari

Screened

Mid-level AI Engineer specializing in generative and multimodal systems

San Francisco, CA4y exp
Handshake AISyracuse University

Built and productionized an agentic LLM automation system for an insurance client to determine medication eligibility, using prompt-chaining plus a RAG pipeline over policy rules and deploying on AWS (Lambda/Step Functions, Bedrock) with a serverless architecture. Addressed major data/schema mismatch issues via a semantic matching pipeline and validated performance through human agreement scoring, A/B testing, KPI monitoring, and confidence-based human-in-the-loop review.

View profile
AL

Aaron Lao

Screened

Intern Software Engineer specializing in agentic RAG and full-stack web development

San Francisco, CA1y exp
ConnectionLoopsUniversity of San Francisco

Entry-level software engineer who built an agentic AI backend in Python/FastAPI, including APIs for conversation history retrieval and user data storage, and worked through async/concurrency challenges for multiple agents querying simultaneously. Also has practical AWS experience using S3 for static hosting with Lambda and RDS for backend/data access.

View profile
SP

Surya Pavan

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications

Baltimore, MD5y exp
AcerCalifornia State University, Northridge

GenAI engineer who has deployed production LLM/RAG chatbots for internal document search, focusing on reliability (hallucination reduction via prompt guardrails + retrieval filtering) and performance (latency improvements via caching). Experienced with LangChain/LangGraph orchestration for multi-step agent workflows and iterates using monitoring/logs and benchmark-driven evaluation while partnering closely with product and business teams.

View profile
SG

Entry-Level AI/ML Engineer specializing in LLM apps, RAG pipelines, and production ML systems

1y exp
iFrog Marketing SolutionsUC San Diego

AI/LLM practitioner at iFrog Marketing Solutions who drove a RAG chatbot from prototype to production in a legacy, AI-resistant environment by validating customer needs and building a business case. Implemented production-grade LLM practices (CI/CD eval gating, rollbacks, prompt/context engineering) and led internal workshops to bring non-AI-native developers up to speed while partnering with sales on tailored demos to drive adoption.

View profile
YN

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Michigan, USA3y exp
Ally FinancialUniversity of Michigan-Dearborn

GenAI/ML engineer with production experience at Cognizant and Ally Financial, building end-to-end LLM/RAG systems and ML pipelines. Delivered a domain chatbot trained from 90k tickets and 45k docs, improving intent accuracy (65%→83%), scaling to 800+ concurrent users with 99.2% uptime and sub-150ms latency, and driving +14% customer satisfaction. Strong in Azure ML + DevOps CI/CD, Dockerized deployments, and explainable/PII-safe modeling using SHAP/LIME to satisfy stakeholder trust and GDPR needs.

View profile
AG

Senior Backend Engineer specializing in AI/LLM and Healthcare Claims

8y exp
UnitedHealth GroupIndiana University Bloomington

JavaScript/React performance-focused engineer who contributed upstream to an open-source virtualization/pagination library, fixing overlapping-fetch race conditions and introducing prefetch/deduping patterns that cut load times from ~3s to <900ms and reduced render thrash ~35%. Also built healthcare automation systems (clinical summary and claims triage), including a FastAPI + RAG pipeline that retrieved CPT/ICD evidence, improving decision accuracy from 67% to 86% and reducing turnaround time by 40%.

View profile
AN

Anudeep Nayak

Screened

Junior Software Engineer specializing in cloud APIs, security testing, and AI web apps

Boulder, Colorado3y exp
ActualizeUniversity of Colorado Boulder

Software engineer with experience delivering customer-facing and internal tools across GE Renewables, GE Healthcare (supply chain/production systems), and a Boulder-based event app startup. Recently focused on scaling backend performance using Redis and RabbitMQ, and has hands-on experience resolving hard-to-reproduce production issues in legacy authentication/session systems; also deployed a personal project (Journal Buddy) publicly.

View profile
HT

Mid-level Machine Learning Engineer specializing in LLMs, agentic AI, and risk/fraud modeling

San Francisco, CA3y exp
The Research Foundation for SUNYUniversity at Buffalo

Built and productionized an agentic LLM workflow during a summer internship to transform unstructured clinical reports into analytics-ready structured data, using a LangChain multi-agent design plus an LLM-as-a-judge layer to control quality in a regulated setting. Also has experience orchestrating ML pipelines at Piramal Capital using AWS Step Functions/EventBridge/CloudWatch, with strong emphasis on observability, evaluation rigor, and measurable impact (80–90% reduction in manual data entry).

View profile
SP

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices and data pipelines

New York, NY4y exp
DeloitteSaint Louis University

Engineer with Deloitte experience building real-time analytics products and scalable Kafka/Go/Postgres pipelines, plus production LLM features using RAG and embeddings. Demonstrates strong focus on performance, reliability, and guardrails/evaluation loops to reduce hallucinations and improve real-world AI system quality.

View profile
SP

Samuel Park

Screened

Mid-level Software Engineer specializing in backend systems, DevOps/SRE, and AI workflows

Los Angeles, CA5y exp
xAIUC Santa Barbara

Built an end-to-end automated trading system for Polymarket, including Go/Python execution services, Terraform-scheduled ETL/feature pipelines, and monitoring on modest hardware. Also shipped a production LLM+RAG signal verifier/explainer that grounds trade decisions in external context (news/social) with vector DB retrieval and guardrails, plus a lightweight RAGAS-style eval loop on ~50 resolved markets that improved signal faithfulness by ~15%.

View profile
HM

Mid-Level Full-Stack Software Engineer specializing in cloud-native and GenAI solutions

Remote, USA5y exp
Capital OneUniversity of North Carolina at Charlotte

Built and shipped production RAG-based LLM agents automating multi-step document query workflows, emphasizing reliability via monitoring, retries, structured exception handling, and fallback retrieval (alternative embeddings/keyword search). Demonstrated measurable gains (18% latency improvement, 25% retrieval efficiency, 12% precision) and has experience integrating agents with messy tax and transaction data at RSM using validation/cleaning and idempotent design.

View profile
Yukta Chikate - Mid-level Machine Learning Engineer specializing in safety-critical and uncertainty-aware ML systems in Brooklyn, NY

Yukta Chikate

Screened

Mid-level Machine Learning Engineer specializing in safety-critical and uncertainty-aware ML systems

Brooklyn, NY5y exp
MTech DistributorsNortheastern University

Built and productionized an LLM-powered assistant for company documents and support questions, focused on reducing time spent searching PDFs/policies/tickets while preventing hallucinations by grounding answers in approved sources. Demonstrates strong production engineering (Kubernetes/orchestration, caching, monitoring, fallbacks) plus security-minded permissioning and close collaboration with operations/support stakeholders.

View profile
Rohan Varma Bandari - Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG in USA

Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG

USA4y exp
Wells FargoUniversity of North Texas

Built production LLM + hybrid RAG and multi-agent orchestration systems at Wells Fargo to automate complaint document/audio transcript understanding and categorization, addressing vocabulary drift via embedding + vector index updates instead of frequent retraining. Strong in LLM workflow reliability (testing/benchmarks/observability) and stakeholder-facing delivery with explainability (citations/SHAP-style justifications) and Tableau dashboards.

View profile
Ramcharan Reddy - Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices in Texas, USA

Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices

Texas, USA6y exp
Morgan StanleyUniversity of Central Missouri

Backend engineer focused on AI-enabled systems, having built a production-style RAG pipeline (vector search + LLM) exposed via Python/Flask endpoints with strong observability and hallucination-reduction techniques. Demonstrates deep performance work in PostgreSQL/SQLAlchemy (5x faster analytics queries) and high-throughput optimization using Celery + Redis (800ms to 120ms latency, 3x throughput), plus schema-per-tenant multi-tenancy with tenant-aware middleware and logging.

View profile
Harideep Balusa - Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems in USA

Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems

USA6y exp
Freddie MacUniversity of Wisconsin

Built and productionized Azure-based LLM/RAG systems for regulatory/compliance use cases, including automating analyst research and compliance report generation across large unstructured document sets. Demonstrates strong practical depth in hallucination mitigation, hybrid retrieval tuning (BM25 + embeddings), and production MLOps (Databricks, Cognitive Search, AKS, Airflow/MLflow), plus proven ability to deliver auditable, explainable solutions with non-technical compliance teams.

View profile
Nagaveda Sai Kumar Reddy Gajjala - Mid-Level Full-Stack Software Engineer specializing in distributed systems and cloud integrations in Richardson, TX

Mid-Level Full-Stack Software Engineer specializing in distributed systems and cloud integrations

Richardson, TX4y exp
PaycomUniversity of Texas at Dallas

Backend engineer with enterprise SaaS experience (Zoho) who owned an end-to-end cloud integration between Endpoint Central and ServiceDesk Plus, redesigning device onboarding across 64+ scenarios and building a fault-tolerant sync engine that recovered 100% failed transactions. Also built and operated production systems across the stack—FastAPI services with strong testing/observability, React+TypeScript portals, PostgreSQL performance tuning, and AWS deployments with real incident response (RDS CPU saturation resolved with zero downtime).

View profile
Ojasmitha Pedirappagari - Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms in Jersey City, NJ

Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms

Jersey City, NJ5y exp
Nurture HoldingsUC Santa Cruz

Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.

View profile
Zubair Shaik - Mid-level Full-Stack Developer specializing in AI-driven FinTech platforms in Remote, USA

Zubair Shaik

Screened

Mid-level Full-Stack Developer specializing in AI-driven FinTech platforms

Remote, USA4y exp
Bank of AmericaIndiana Wesleyan University

Built and productionized an LLM-powered loan decisioning agent at Bank of America, integrating RAG with microservices to automate creditworthiness assessment and recommendations. Emphasizes real-world reliability and governance (EKS autoscaling, observability, SOC2/PCI security controls), and drove measurable outcomes including 20% faster loan decisions and a reduction in agent failures/fallbacks to under 2% through schema enforcement and confidence-based routing.

View profile
Tara Worrell - Senior Software Engineer specializing in backend systems, AI/LLM integration, and cloud infrastructure in Brooklyn, NY

Tara Worrell

Screened

Senior Software Engineer specializing in backend systems, AI/LLM integration, and cloud infrastructure

Brooklyn, NY6y exp
AuraCodeBinghamton University

Backend engineer with experience in highly regulated and high-stakes systems, including an airline crew messaging platform requiring near-zero-error real-time operations and a HIPAA-compliant mental health application built from an early-stage concept. They also show strong operational maturity, having owned a GoDaddy production incident through resolution and then led deployment pipeline improvements that reduced build failures by 40% and doubled deployment frequency.

View profile
SV

Mid-level Generative AI Engineer specializing in LLMs and RAG systems

5y exp
Summit Design and TechnologyNorthwest Missouri State University

Built and shipped a production RAG-based enterprise knowledge assistant to replace slow/inaccurate search across millions of documents, using LangChain orchestration with GPT-4/LLaMA and vector databases. Strong focus on production constraints—latency, hallucination control, and cost—using hybrid retrieval, guardrails, LLM-as-judge validation, and model routing, and has experience translating non-technical stakeholder pain points into measurable outcomes.

View profile
BS

Mid-level Data Engineer specializing in Lakehouse, Streaming, and ML/LLM data systems

Remote, USA3y exp
DiscoverUniversity of South Dakota

Built and productionized an enterprise retrieval-augmented generation platform for internal knowledge over large unstructured corpora, emphasizing trust via strict citation/grounding and hybrid retrieval (BM25 + FAISS + cross-encoder re-ranking). Demonstrates strong scaling and cost/latency optimization through incremental indexing/embedding and index partitioning, plus disciplined evaluation/observability practices. Has experience operationalizing pipelines with Airflow/Databricks/GitHub Actions and partnering closely with risk & compliance stakeholders on auditability requirements.

View profile
SP

SASI PAILA

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

PA, USA4y exp
BNY MellonFranklin University

Built and deployed a production SecureAIChatBot (RAG-based) for secure internal information retrieval, using embeddings/vector search, GPT models, monitoring, and safety filters. Focused on real-world production challenges like latency and output consistency, applying caching, retrieval scoping, smaller models, and controlled prompting, and used LangChain to orchestrate the end-to-end workflow.

View profile

Need someone specific?

AI Search