Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

SC

Senior Full-Stack AI/ML Engineer specializing in cloud data platforms and GenAI

Orlando, FL11y exp
Scale AIFlorida State University
View profile
RT

Executive Engineering Leader specializing in data platforms, cloud modernization, and AI

San Francisco, CA27y exp
Warner Bros. DiscoveryOsmania University
View profile
MG

Principal Machine Learning Scientist specializing in GenAI, LLMs, and RAG

Austin, TX13y exp
Season HealthGeorgia Tech
View profile
BM

Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps

San Francisco, CA6y exp
Scale AISaint Louis University
View profile
ML

Senior Software Engineer specializing in AI agents and cloud platforms

Louisiana, USA7y exp
NotionSanta Clara University
View profile
TC

Mid-level Software Engineer specializing in Python, distributed systems, and AI backend services

San Francisco, CA6y exp
OpenAIWebster University
View profile
KK

Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure

San Francisco, CA6y exp
PerplexityStevens Institute of Technology
View profile
JI

Executive AI/ML Cloud Architect specializing in enterprise and humanitarian AI systems

Washington, DC13y exp
CloudsCockpit Inc.Indiana University Bloomington
View profile
BW

Senior Machine Learning Engineer specializing in GenAI, NLP, and recommendation systems

Seattle, WA10y exp
eBayUniversity of Illinois Urbana-Champaign
View profile
PV

Director-level Software Development Manager specializing in large-scale cloud platforms

San Jose, California13y exp
Amazon
View profile
Sumer Joshi - Senior Full-Stack Engineer specializing in backend systems and AI applications in Remote

Sumer Joshi

Screened ReferencesStrong rec.

Senior Full-Stack Engineer specializing in backend systems and AI applications

Remote13y exp
MercorSanta Clara University

Candidate is deeply focused on AI-native software development, using a deliberate planner/implementer agent workflow with tools like Cursor, Claude, and Kimi. They also built a personal project called Config Proctor, an AI-agent-driven Terraform/AWS self-healing system that identifies infrastructure configuration gaps and proposes fixes.

View profile
Yuan-Hsuan Wen - Intern Software Engineer specializing in AI agents, RAG pipelines, and semiconductor systems in Taipei, Taiwan

Intern Software Engineer specializing in AI agents, RAG pipelines, and semiconductor systems

Taipei, Taiwan3y exp
NVIDIAUSC

Built a web-based interface that connects an internal bug system to an LLM for initial debugging and issue classification, aiming to boost QA and software engineer efficiency while balancing latency and accuracy. Worked as a one-person project and managed constraints like limited hardware and difficulty extracting team debugging context, relying on manager communication and rapid modeling to validate direction.

View profile
RT

Rana Taki

Screened

Junior Mechanical Engineering & Software Developer specializing in aviation autonomy and retrieval systems

Stanford, CA2y exp
Stanford UniversityStanford University

Robotics/embedded builder who trained an aviation-specific LLM and deployed it offline on an NVIDIA Jetson for an in-flight voice assistant, solving performance and cabling constraints with NVMe storage and Bluetooth. Also has hands-on Raspberry Pi/Arduino robot builds (including a cigarette-butt picking prototype with hydraulic actuation) plus Docker-based FEA work using FEniCS/Gmsh and strong CI/CD + automated testing practices.

View profile
KT

Kenil Tanna

Screened

Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services

New York, NY7y exp
JPMorgan ChaseIIT Guwahati

Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).

View profile
SS

Sai supriya

Screened

Mid-level AI/ML Engineer specializing in LLM alignment, safety, and scalable inference

St. Louis, MO7y exp
AnthropicSaint Louis University

Built and productionized an AWS-hosted, Kubernetes-orchestrated RAG assistant that enables natural-language Q&A over internal document repositories with grounded answers and citations. Demonstrates strong applied LLM engineering: hallucination mitigation, hybrid retrieval + re-ranking, and rigorous evaluation via benchmarks and A/B testing, plus real-world scaling of compute-heavy inference with dynamic batching and monitoring.

View profile
Nishitha Thummala - Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference in San Francisco, CA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp
PerplexityUniversity of Nebraska Omaha

Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.

View profile
Kowshika M - Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety in Santa Clara, CA

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA5y exp
NVIDIAOregon State University

AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.

View profile
Geetika Jain - Mid-Level Software Engineer specializing in Azure AI and full-stack development in Park City, UT

Geetika Jain

Screened

Mid-Level Software Engineer specializing in Azure AI and full-stack development

Park City, UT6y exp
NICEUniversity of Texas at Dallas

Hands-on AI/LLM engineer who built a RAG-based product feature end-to-end, including prompt engineering, safety guardrails, and an automated adversarial + load-testing harness. Diagnosed real production issues (null responses) via Azure logs/metrics and drove an architectural fix by separating model deployments to address token/quota limits. Also runs internal developer enablement through short theory-to-hands-on AI workshops after completing a Microsoft AI certification.

View profile
DS

Executive CTO and Founder specializing in AI platforms and hyper-scale SaaS

South San Francisco, CA26y exp
Deep OriginUC Berkeley

CTO-minded builder seeking to join a startup; previously created an AI-driven platform that abstracted away DevOps and infrastructure for drug discovery researchers. Emphasizes high-leverage, zero-to-one execution with managed cloud/open-source tooling, and a strong reliability/reproducibility mindset validated against existing scientific pipelines.

View profile
Nikhil Reddy - Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms in San Francisco, CA

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp
NVIDIASaint Louis University

Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.

View profile
Krishna Reddy - Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants in New York, NY

Krishna Reddy

Screened

Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants

New York, NY6y exp
StripeIndiana Wesleyan University

Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.

View profile
AR

Anagha Ram

Screened

Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search

Los Altos, CA2y exp
Columbia UniversityCornell University

Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.

View profile

Need someone specific?

AI Search