Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Machine Learning Engineers in California

Pre-screened and vetted in California.

Python Docker SQL PyTorch AWS Kubernetes

Michael Chen

Executive engineering leader and full-stack engineer specializing in FinTech and AI platforms

San Francisco, CA16y exp

NavigateAICornell University

Python Go Java C#Node.js TypeScript+90

View profile

Nishitha Thummala

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp

PerplexityUniversity of Nebraska Omaha

“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”

Python FastAPI Flask Django gRPC JavaScript+167

View profile

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA5y exp

NVIDIAOregon State University

“AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.”

A/B Testing Ansible Apache Kafka Apache Spark Automated Testing AWS+113

View profile

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp

NVIDIASaint Louis University

“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”

Python Java Spring Boot JavaScript TypeScript React+129

View profile

Krishna Sahith Poruri

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

CA, USA4y exp

AnthropicCalifornia State University, Long Beach

“ML/LLM engineer who built a production RAG system (GPT-4 + FAISS + FastAPI) to deliver fast, grounded answers from proprietary documents, optimizing for sub-200ms latency and high-concurrency scale. Strong MLOps/observability background: drift monitoring with Prometheus + Streamlit, automated retraining via Airflow, Kubernetes autoscaling, and MLflow-managed model lifecycle, plus inference cost reduction through quantization and structured pruning.”

Python SQL R C++Git Classification+101

View profile

Vinnie Yerramadha

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

San Francisco, CA6y exp

ShopifyUniversity of North Texas

Python SQL Bash C JavaScript PHP+173

View profile

BhanuPrasad Pothagani

Mid-level Full-Stack Java Engineer specializing in scalable microservices and real-time data systems

Bay Area, CA5y exp

MetaFlorida Institute of Technology

Java Python JavaScript TypeScript C C+++221

View profile

Sumukh Ramagiri

Mid-level Machine Learning Engineer specializing in LLMs and RAG systems

San Francisco, CA5y exp

Scale AIUniversity of New Haven

Python SQL Bash Data Structures Algorithms Multithreading+133

View profile

Thanmayee Reddy

Mid-level Machine Learning Engineer specializing in NLP, recommender systems, and on-device ML

CA, USA5y exp

AppleTexas Tech University

A/B Testing Amazon DynamoDB Amazon EC2 Amazon S3 Amazon SageMaker Anomaly Detection+111

View profile

Roop Gundu

Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems

San Francisco, CA5y exp

NVIDIAArizona State University

A/B Testing Agile Amazon Bedrock Anomaly Detection Apache Spark AWS+178

View profile

Sravanthi REDDY

Mid-level AI/ML Engineer specializing in LLM fine-tuning and RAG systems

San Francisco, CA5y exp

Scale AIConcordia University

Amazon EC2 Amazon EKS Amazon S3 Amazon SageMaker Apache Kafka API Development+90

View profile

Hemalatha Papasani

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and GPU-accelerated cloud systems

Santa Clara, CA4y exp

NVIDIAConcordia University Wisconsin

Python Pandas Java Spring Boot Node.js TypeScript+126

View profile

Vrushank Prasanna

Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps

Mountain View, CA5y exp

MetaUniversity of North Carolina at Charlotte

Python Java C C++MATLAB Bash+154

View profile

Rhutwij Tulankar

Screened ReferencesStrong rec.

Engineering Manager and ML/Data Architect specializing in scalable data platforms and personalization

San Francisco, CA11y exp

RecruiticsRochester Institute of Technology

“Hands-on engineering manager at a marketing company leading a highly senior, distributed team (10 direct reports) while personally coding ~60–70% and owning end-to-end architecture across three interconnected products. Built agentic CRM automation and a reinforcement-learning-driven distribution layer for channel spend/bidding, with a strong focus on scalable design and observability (Prometheus/APM/logging) enabling frequent releases and few production incidents.”

Amazon DynamoDB Amazon ECS Amazon Kinesis Amazon Redshift Amazon S3 Amazon SQS+263

View profile

Gowri Kajipuram

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and multimodal deep learning

San Francisco, CA5y exp

MetaUniversity of Central Missouri

“ML/LLM engineer who has built and productionized a large multimodal LLM pipeline end-to-end—fine-tuning a 20B+ parameter model with distributed/FSDP training and deploying on Kubernetes via Triton for ~5x throughput. Strong focus on reliability and safety (monitoring with SHAP, guardrails, A/B testing) with reported ~22% relevance lift and reduced harmful/incorrect outputs, plus experience orchestrating ETL/retraining workflows with Airflow across S3/Snowflake/RDS.”

Python SQL PyTorch TensorFlow Scikit-learn XGBoost+158

View profile

Ritesh Somashekar

Screened

Mid-level AI & ML Engineer specializing in NLP, LLMs, and scalable ML systems

Cupertino, CA6y exp

AppleVisvesvaraya Technological University

“AI/ML engineer with experience spanning Accenture healthcare NLP systems, academic research, and Apple on-device LLM integration. Stands out for owning regulated production pipelines end-to-end—from HIPAA-compliant clinical NLP and EHR integrations to incident prevention, experiment tracking, and optimized on-device inference with LLaMA 3.”

Python NumPy Pandas Scikit-learn TensorFlow PyTorch+130

View profile

Dhruv Arora

Screened

Senior Generative AI Implementation Consultant specializing in RAG and agentic AI on cloud

Bay Area, CA3y exp

CapgeminiDuke University

“LLM/RAG practitioner who built an AWS-based enterprise document search and summarization platform with RBAC and scaled it to 10K+ users, solving relevance issues via contextual chunking and hybrid retrieval. Also designed agentic workflows for a telecom forecast-validation use case using sub-agents, tool APIs, and strict context management, and has proven pre-sales influence (supported a $300K manufacturing deal with a roadmap-driven pitch).”

A/B Testing API Gateway AWS AWS Glue AWS Lambda AWS Step Functions+81

View profile

Chandra sai kiran Kammari

Screened

Mid-level Machine Learning Engineer specializing in fraud detection and real-time personalization

San Francisco, CA6y exp

StripeUniversity of Tampa

“ML/LLM engineer with Stripe and Adobe experience who productionized a transformer-based Payments Foundation Model for real-time fraud detection at global scale (billions of transactions). Built petabyte-scale ETL/feature pipelines (Spark/EMR, Airflow, dbt, Kafka/Flink) and achieved <100ms multi-region inference (EKS, TorchServe, edge/Lambda, GPU/CPU routing) with strong PCI-DSS/GDPR compliance and explainability (SHAP/LIME), reporting a 64% fraud accuracy improvement.”

Python PyTorch TensorFlow Scikit-learn Pandas NumPy+164

View profile

Keerthana Senthilnathan

Screened

Junior Machine Learning Engineer specializing in LLM systems and inference reliability

California, USA1y exp

llm-dUC San Diego

“ML/LLM infrastructure-focused engineer who built a production stateful LLM inference service that cuts latency and GPU compute for repeated/overlapping prompts via caching with correctness guardrails. Strong in Kubernetes-based deployment and reliability engineering, using A/B testing and similarity-based evaluation to quantify performance gains without sacrificing output quality.”

LoRA PyTorch TensorFlow Python C C+++87

View profile

Machine Learning Engineers in Bay Area Machine Learning Engineers in Los Angeles Metro Machine Learning Engineers in San Diego Metro

Need someone specific?

AI Search

Related

Need someone specific?