Vetted Machine Learning Engineers in the Bay Area

Pre-screened and vetted in the Bay Area.

RC

Senior AI/ML Engineer specializing in computer vision, NLP, and real-time forecasting

Newark, CA10y exp
OutlierUC Berkeley
View profile
PK

Mid AI/ML Engineer specializing in LLMs, RAG, and multimodal systems

San Francisco, CA5y exp
AnthropicUniversity of Central Missouri
View profile
HK

Harish Kasu

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

San Francisco, CA5y exp
NVIDIATexas A&M University-Kingsville

AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.

View profile
Vinay Ramrupe - Mid AI/ML Engineer specializing in LLM and enterprise generative AI in San Francisco, CA

Vinay Ramrupe

Screened

Mid AI/ML Engineer specializing in LLM and enterprise generative AI

San Francisco, CA5y exp
DatabricksCleveland State University

ML/AI engineer focused on taking LLM systems from experimentation to reliable production, including enterprise copilot and RAG-based knowledge retrieval use cases. Stands out for combining data pipelines, model training, inference optimization, automated evaluation, and safety guardrails, with cited impact including 20% throughput gains and 30% less manual evaluation effort.

View profile
Chetan Panthukala - Mid-level AI/ML Engineer specializing in LLMs, RAG, and distributed MLOps in San Francisco, CA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and distributed MLOps

San Francisco, CA6y exp
PerplexityUniversity of North Texas
View profile
SM

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and scalable GPU inference

Bay Area, CA5y exp
PerplexitySaint Louis University
View profile
BM

Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps

San Francisco, CA6y exp
Scale AISaint Louis University
View profile
KK

Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure

San Francisco, CA6y exp
PerplexityStevens Institute of Technology
View profile
RK

Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems

San Francisco, CA4y exp
PlaidSaint Louis University
View profile
Nishitha Thummala - Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference in San Francisco, CA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp
PerplexityUniversity of Nebraska Omaha

Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.

View profile
Kowshika M - Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety in Santa Clara, CA

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA5y exp
NVIDIAOregon State University

AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.

View profile
Nikhil Reddy - Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms in San Francisco, CA

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp
NVIDIASaint Louis University

Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.

View profile
Vinnie Yerramadha - Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps in San Francisco, CA

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

San Francisco, CA6y exp
ShopifyUniversity of North Texas
View profile
BhanuPrasad Pothagani - Mid-level Full-Stack Java Engineer specializing in scalable microservices and real-time data systems in Bay Area, CA

Mid-level Full-Stack Java Engineer specializing in scalable microservices and real-time data systems

Bay Area, CA5y exp
MetaFlorida Institute of Technology
View profile
SR

Mid-level Machine Learning Engineer specializing in LLMs and RAG systems

San Francisco, CA5y exp
Scale AIUniversity of New Haven
View profile
RG

Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems

San Francisco, CA5y exp
NVIDIAArizona State University
View profile
SR

Mid-level AI/ML Engineer specializing in LLM fine-tuning and RAG systems

San Francisco, CA5y exp
Scale AIConcordia University
View profile
HP

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and GPU-accelerated cloud systems

Santa Clara, CA4y exp
NVIDIAConcordia University Wisconsin
View profile
VP

Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps

Mountain View, CA5y exp
MetaUniversity of North Carolina at Charlotte
View profile
RT

Rhutwij Tulankar

Screened ReferencesStrong rec.

Engineering Manager and ML/Data Architect specializing in scalable data platforms and personalization

San Francisco, CA11y exp
RecruiticsRochester Institute of Technology

Hands-on engineering manager at a marketing company leading a highly senior, distributed team (10 direct reports) while personally coding ~60–70% and owning end-to-end architecture across three interconnected products. Built agentic CRM automation and a reinforcement-learning-driven distribution layer for channel spend/bidding, with a strong focus on scalable design and observability (Prometheus/APM/logging) enabling frequent releases and few production incidents.

View profile
GK

Mid-level AI/ML Engineer specializing in LLMs, RAG, and multimodal deep learning

San Francisco, CA5y exp
MetaUniversity of Central Missouri

ML/LLM engineer who has built and productionized a large multimodal LLM pipeline end-to-end—fine-tuning a 20B+ parameter model with distributed/FSDP training and deploying on Kubernetes via Triton for ~5x throughput. Strong focus on reliability and safety (monitoring with SHAP, guardrails, A/B testing) with reported ~22% relevance lift and reduced harmful/incorrect outputs, plus experience orchestrating ETL/retraining workflows with Airflow across S3/Snowflake/RDS.

View profile
Dhruv Arora - Senior Generative AI Implementation Consultant specializing in RAG and agentic AI on cloud in Bay Area, CA

Dhruv Arora

Screened

Senior Generative AI Implementation Consultant specializing in RAG and agentic AI on cloud

Bay Area, CA3y exp
CapgeminiDuke University

LLM/RAG practitioner who built an AWS-based enterprise document search and summarization platform with RBAC and scaled it to 10K+ users, solving relevance issues via contextual chunking and hybrid retrieval. Also designed agentic workflows for a telecom forecast-validation use case using sub-agents, tool APIs, and strict context management, and has proven pre-sales influence (supported a $300K manufacturing deal with a roadmap-driven pitch).

View profile

Need someone specific?

AI Search