Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Nikhil Soni - Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation in New York, NY

Nikhil Soni

Screened

Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation

New York, NY2y exp
Quant AI ResearchNYU

Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.

View profile
Manasa Mangipudi - Mid-level Machine Learning Engineer specializing in NLP and computer vision

Mid-level Machine Learning Engineer specializing in NLP and computer vision

3y exp
Columbia UniversityRutgers University–New Brunswick

AI/ML engineer with production experience building an LLM-powered resume-to-job matching and feedback product using RAG, with a strong focus on latency, hallucination control, and scalable deployment. Experienced orchestrating ML inference and backend services on Kubernetes and applying rigorous evaluation/guardrail practices; also partnered with business/product stakeholders at Walmart to improve an NLP-based supplier support system.

View profile
Jhansi Bendi - Senior Software Engineer specializing in cloud-native microservices and event-driven systems in Antioch, CA

Jhansi Bendi

Screened

Senior Software Engineer specializing in cloud-native microservices and event-driven systems

Antioch, CA18y exp
SephoraRashtriya Sanskrit Sansthan

Senior engineer/tech lead with 18+ years building large-scale distributed applications, specializing in performance and reliability improvements. Recently owned multiple apps on an email personalization team, shipping major optimizations (including a push-update feature and audience-count architecture redesign) that reportedly lifted system performance from ~50% to ~99% while also leading code standards, reviews, and mentoring.

View profile
Ruthvik Bacha - Mid-level Data Engineer specializing in financial data pipelines and reliability in North Carolina, USA

Ruthvik Bacha

Screened

Mid-level Data Engineer specializing in financial data pipelines and reliability

North Carolina, USA7y exp
Wells FargoUniversity of South Florida

Systems/robotics-oriented software engineer focused on real-time orchestration and reliability: built a central control layer coordinating multiple concurrent agents with safe state machines, failure isolation, and recovery. Has hands-on ROS/ROS 2 integration experience in simulation (DDS/QoS, lifecycle, nodes in Python/C++) and emphasizes observability (structured JSON logs, correlation IDs) and low-latency control-loop performance under load.

View profile
Sanjana Duvva - Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp
Wells FargoUniversity of North Texas

Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.

View profile
Nishantkumar Asodariya - Mid-level Supply Chain Analyst specializing in global logistics automation and forecasting in USA

Mid-level Supply Chain Analyst specializing in global logistics automation and forecasting

USA4y exp
HoneywellIndiana Wesleyan University

Built and shipped a production LLM-powered recruiting workflow that ranks resumes against job descriptions, generates evidence-based justifications, and finds "hidden fit" candidates using embeddings + RAG. Demonstrates strong production engineering around hallucination control, latency, and predictable LLM cost management (budget checks, top-K pruning, tenant caps), plus orchestration experience with Airflow/Prefect/Kubernetes and a structured evaluation/monitoring methodology for AI agents.

View profile
Sri Harshitha Yannam - Junior Software Engineer specializing in AI/ML and cloud platforms in Austin, TX

Junior Software Engineer specializing in AI/ML and cloud platforms

Austin, TX2y exp
AmazonUniversity of Wisconsin–Milwaukee

LLM/agent engineer who shipped a production "Memory Assistant" at HydroX AI, building a LangChain/LlamaIndex RAG memory pipeline on ChromaDB/FAISS with robust fallbacks (BERT/BART), prompt-injection mitigation, and 99.9% uptime monitoring. Also built a multi-step customer support agent using Rasa + OpenAI Assistants API with structured tool calling, guardrails, and human-in-the-loop escalation, and has experience hardening agents against messy ERP data via Pydantic validation, idempotency, and transactional outbox patterns.

View profile
Ishaan Nanal - Intern-level Software Engineer specializing in backend systems and AI/ML in Ithaca, NY

Ishaan Nanal

Screened

Intern-level Software Engineer specializing in backend systems and AI/ML

Ithaca, NY1y exp
QuorAgraCornell University

Built and shipped an LLM-powered RAG research copilot used by 20+ users across biology, physics, and ML, cutting literature review from days to minutes. Strong focus on production reliability—iterated on chunking/retrieval/prompting, added validation and modular pipelines for debuggability, and is now containerizing and scaling the system with Docker and GCP.

View profile
Harrishkumar Loganathan - Mid AI/Machine Learning Engineer specializing in FinTech and Generative AI in Remote, USA

Mid AI/Machine Learning Engineer specializing in FinTech and Generative AI

Remote, USA3y exp
SocureArizona State University

AI/ML engineer with hands-on ownership of enterprise LLM deployments at Freshworks, including a large-scale RAG chatbot serving 15,000+ users across six departments. Stands out for combining deep production engineering skills—AWS microservices, Kubernetes, observability, retrieval quality, and faithfulness evaluation—with strong cross-functional stakeholder leadership and prior large-scale fraud data pipeline experience at Socure.

View profile
DD

Drew Dunn

Screened

Senior AI Engineer specializing in generative AI and production ML systems

Aledo, TX14y exp
Elevance HealthTexas Tech University

ML/AI engineer with hands-on ownership of production computer vision, speech, and legal RAG systems. Notably improved a key-duplication CV pipeline enough to unblock commercial launch and remove specialist manual measurement, and also shipped a live Quran recitation detection feature for a product with 1M+ users.

View profile
RS

Mid-level Software Engineer specializing in cloud-native backend and AI systems

Long Beach, CA4y exp
JPMorgan ChaseCalifornia State University, Long Beach

Candidate takes a disciplined, developer-in-the-loop approach to AI-assisted coding, using AI primarily for brainstorming, suggestions, and optimization while retaining full ownership of architecture and final code decisions. They also actively stay current on AI developments through research papers, communities, and emerging tools.

View profile
VJ

Vedant Jagtap

Screened

Junior AI/NLP Engineer specializing in LLM systems and applied research

New York, NY2y exp
NYU’s Center for Social Media, AI, and PoliticsNYU

LLM/agent engineer who shipped a two-stage AI recruitment screening platform at Foursquare that automated resume ingestion through behavioral assessment, delivering an 85% reduction in screening time across 5,000+ applications with auditability and confidence-gated decisions. Also built a multi-agent benchmarking framework using MCP tool interfaces and a RAGAS + LangSmith evaluation/observability stack, including async re-architecture that cut production latency by 50%.

View profile
PS

Pooja Shindd

Screened

Mid-level Full-Stack Software Engineer specializing in scalable web and AI systems

Illinois, USA4y exp
University of Illinois Chicago Technology SolutionsUniversity of Illinois Chicago

Full-stack engineer who has built both a TypeScript-based HR/payroll platform and a production agentic AI support system end to end. Stands out for combining strong product judgment with deep LLM systems thinking: RAG architecture, confidence-based routing, evals, observability, and human-in-the-loop design in a greenfield environment.

View profile
JS

Jash Shah

Screened

Mid-level Data Scientist specializing in LLMs, MLOps, and predictive analytics in healthcare and finance

New Jersey, USA4y exp
Johnson & JohnsonStevens Institute of Technology

Built and deployed a production LLM/RAG clinical decision support system that enables real-time semantic search over unstructured EHR notes and delivers patient risk insights. Strong in healthcare-grade MLOps and compliance (HIPAA, PHI handling, encryption, RBAC, audit logs) and scaled embedding/retrieval pipelines using Spark/Databricks and Airflow. Partnered with clinicians via Power BI dashboards and explainability, contributing to an 18% reduction in patient readmissions.

View profile
SM

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Connecticut, USA5y exp
PfizerUniversity of New Haven

Built and deployed an enterprise GenAI knowledge assistant over thousands of internal PDFs/reports using a RAG stack (GPT-4 + Hugging Face embeddings + vector DB) to reduce manual search and SME escalations. Uses LangGraph/LangChain to orchestrate modular agent workflows with relevance filtering and fallback handling, and applies rigorous evaluation (golden datasets, edge cases, A/B tests) with production monitoring metrics.

View profile
PE

Mid-Level Software Engineer specializing in distributed systems and cloud-native backends

Dallas, USA5y exp
T-MobilePurdue University

AI/LLM engineer with production experience at Charles Schwab building a RAG-based assistant to help 5,000+ reps answer complex financial policy questions. Implemented a multi-layer anti-hallucination approach (GNN-driven ontology/graph retrieval + citation-only answers) and compliance-focused guardrails (Azure AI Content Safety) in partnership with audit/compliance stakeholders.

View profile
SS

Junior Software Engineer specializing in ML, distributed systems, and LLM applications

Austin, TX1y exp
ZondaUC San Diego

Interned at Zonda where he built an AI-driven semantic search solution over ~280M housing/builder records. Iterated from local LLMs via llama.cpp quantization to a vector-embedding retrieval system, then boosted semantic accuracy with a custom spaCy NER layer and re-ranking, optimizing for latency through precomputation. Collaborated with economics-focused stakeholders to reduce manual document/paperwork time by enabling natural-language search over internal data.

View profile
BC

Bhuvan Chandi

Screened

Mid-level Data Engineer specializing in AI/ML data platforms

NY, NY6y exp
BlackRockWebster University

Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.

View profile
SK

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA4y exp
ServiceNowRivier University

Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.

View profile
MS

Min-Han Shih

Screened

Junior Machine Learning Engineer specializing in speech and multimodal AI

Taipei, Taiwan2y exp
FurboUSC

New grad who has shipped a production vision-language recommendation feature for a pet camera/mobile app, including building a tagged video dataset with human annotators and optimizing inference by FPS downsampling under device compute limits. Also built a multimodal MLLM benchmark using an LLM-as-judge (GPT-5-thinking) with a feedback loop, validated against human scoring, and measured post-feedback quality gains (12% average score improvement).

View profile
JG

Junior Software Engineer specializing in AI, security, and cloud systems

Trondheim, Norway1y exp
Norwegian University of Science and TechnologyUniversity of Waterloo

Built and deployed an LLM + RAG + memory system on a Furhat social robot, adding continuous face/voice recognition embeddings over WebSockets to enable persistent, natural conversations across sessions. Experienced working around real-world hardware/latency constraints and uses Datadog plus structured debugging/rollback practices for stabilizing customer-facing LLM workflows.

View profile
SV

Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI

Remote5y exp
InfosysUniversity at Buffalo

GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.

View profile
SS

Mid-level Data Engineer specializing in real-time pipelines and cloud analytics

Chicago, IL5y exp
JPMorgan ChaseUniversity of South Dakota

Researcher from the University of South Dakota who built a production medical RAG system to help interpret model predictions by retrieving relevant clinical notes and medical literature, overcoming retrieval accuracy and imaging-dataset challenges through semantic chunking and metadata-driven indexing. Also has hands-on orchestration experience with Airflow and Azure Data Factory, plus a pragmatic approach to LLM evaluation and stakeholder-driven iteration.

View profile
RK

Principal Software Engineer specializing in AI/ML and cloud-native backend systems

New York, NY16y exp
McKinsey & CompanyNJIT

McKinsey data/ML practitioner who led production deployment of an entity resolution + semantic search platform for unstructured finance and healthcare data, integrating with legacy systems under HIPAA constraints. Deep hands-on stack across transformers (spaCy/HF BERT), embeddings + FAISS, and production MLOps/workflow tooling (Airflow, Docker, CI/CD, Prometheus/Grafana), with reported gains of +30% decision speed and +25% search relevance.

View profile

Need someone specific?

AI Search