Vetted Machine Learning Engineers in California

Pre-screened and vetted in California.

MC

Executive engineering leader and full-stack engineer specializing in FinTech and AI platforms

San Francisco, CA16y exp
NavigateAICornell University
View profile
Nishitha Thummala - Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference in San Francisco, CA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp
PerplexityUniversity of Nebraska Omaha

Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.

View profile
Kowshika M - Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety in Santa Clara, CA

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA5y exp
NVIDIAOregon State University

AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.

View profile
Nikhil Reddy - Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms in San Francisco, CA

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp
NVIDIASaint Louis University

Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.

View profile
KS

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

CA, USA4y exp
AnthropicCalifornia State University, Long Beach

ML/LLM engineer who built a production RAG system (GPT-4 + FAISS + FastAPI) to deliver fast, grounded answers from proprietary documents, optimizing for sub-200ms latency and high-concurrency scale. Strong MLOps/observability background: drift monitoring with Prometheus + Streamlit, automated retraining via Airflow, Kubernetes autoscaling, and MLflow-managed model lifecycle, plus inference cost reduction through quantization and structured pruning.

View profile
Vinnie Yerramadha - Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps in San Francisco, CA

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

San Francisco, CA6y exp
ShopifyUniversity of North Texas
View profile
BhanuPrasad Pothagani - Mid-level Full-Stack Java Engineer specializing in scalable microservices and real-time data systems in Bay Area, CA

Mid-level Full-Stack Java Engineer specializing in scalable microservices and real-time data systems

Bay Area, CA5y exp
MetaFlorida Institute of Technology
View profile
SR

Mid-level Machine Learning Engineer specializing in LLMs and RAG systems

San Francisco, CA5y exp
Scale AIUniversity of New Haven
View profile
TR

Mid-level Machine Learning Engineer specializing in NLP, recommender systems, and on-device ML

CA, USA5y exp
AppleTexas Tech University
View profile
RG

Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems

San Francisco, CA5y exp
NVIDIAArizona State University
View profile
SR

Mid-level AI/ML Engineer specializing in LLM fine-tuning and RAG systems

San Francisco, CA5y exp
Scale AIConcordia University
View profile
HP

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and GPU-accelerated cloud systems

Santa Clara, CA4y exp
NVIDIAConcordia University Wisconsin
View profile
VP

Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps

Mountain View, CA5y exp
MetaUniversity of North Carolina at Charlotte
View profile
RT

Rhutwij Tulankar

Screened ReferencesStrong rec.

Engineering Manager and ML/Data Architect specializing in scalable data platforms and personalization

San Francisco, CA11y exp
RecruiticsRochester Institute of Technology

Hands-on engineering manager at a marketing company leading a highly senior, distributed team (10 direct reports) while personally coding ~60–70% and owning end-to-end architecture across three interconnected products. Built agentic CRM automation and a reinforcement-learning-driven distribution layer for channel spend/bidding, with a strong focus on scalable design and observability (Prometheus/APM/logging) enabling frequent releases and few production incidents.

View profile
GK

Mid-level AI/ML Engineer specializing in LLMs, RAG, and multimodal deep learning

San Francisco, CA5y exp
MetaUniversity of Central Missouri

ML/LLM engineer who has built and productionized a large multimodal LLM pipeline end-to-end—fine-tuning a 20B+ parameter model with distributed/FSDP training and deploying on Kubernetes via Triton for ~5x throughput. Strong focus on reliability and safety (monitoring with SHAP, guardrails, A/B testing) with reported ~22% relevance lift and reduced harmful/incorrect outputs, plus experience orchestrating ETL/retraining workflows with Airflow across S3/Snowflake/RDS.

View profile
RS

Mid-level AI & ML Engineer specializing in NLP, LLMs, and scalable ML systems

Cupertino, CA6y exp
AppleVisvesvaraya Technological University

AI/ML engineer with experience spanning Accenture healthcare NLP systems, academic research, and Apple on-device LLM integration. Stands out for owning regulated production pipelines end-to-end—from HIPAA-compliant clinical NLP and EHR integrations to incident prevention, experiment tracking, and optimized on-device inference with LLaMA 3.

View profile
Dhruv Arora - Senior Generative AI Implementation Consultant specializing in RAG and agentic AI on cloud in Bay Area, CA

Dhruv Arora

Screened

Senior Generative AI Implementation Consultant specializing in RAG and agentic AI on cloud

Bay Area, CA3y exp
CapgeminiDuke University

LLM/RAG practitioner who built an AWS-based enterprise document search and summarization platform with RBAC and scaled it to 10K+ users, solving relevance issues via contextual chunking and hybrid retrieval. Also designed agentic workflows for a telecom forecast-validation use case using sub-agents, tool APIs, and strict context management, and has proven pre-sales influence (supported a $300K manufacturing deal with a roadmap-driven pitch).

View profile
CS

Mid-level Machine Learning Engineer specializing in fraud detection and real-time personalization

San Francisco, CA6y exp
StripeUniversity of Tampa

ML/LLM engineer with Stripe and Adobe experience who productionized a transformer-based Payments Foundation Model for real-time fraud detection at global scale (billions of transactions). Built petabyte-scale ETL/feature pipelines (Spark/EMR, Airflow, dbt, Kafka/Flink) and achieved <100ms multi-region inference (EKS, TorchServe, edge/Lambda, GPU/CPU routing) with strong PCI-DSS/GDPR compliance and explainability (SHAP/LIME), reporting a 64% fraud accuracy improvement.

View profile
Keerthana Senthilnathan - Junior Machine Learning Engineer specializing in LLM systems and inference reliability in California, USA

Junior Machine Learning Engineer specializing in LLM systems and inference reliability

California, USA1y exp
llm-dUC San Diego

ML/LLM infrastructure-focused engineer who built a production stateful LLM inference service that cuts latency and GPU compute for repeated/overlapping prompts via caching with correctness guardrails. Strong in Kubernetes-based deployment and reliability engineering, using A/B testing and similarity-based evaluation to quantify performance gains without sacrificing output quality.

View profile

Need someone specific?

AI Search