“Backend/ML infrastructure engineer with experience at Perplexity and Meta building production evaluation, monitoring, and retrieval systems for AI search, autonomous agents, and LLM-powered workflows. Particularly strong in turning messy manual quality-review processes into reusable Python/FastAPI automation with measurable impact, including major gains in search relevance, latency, and grounded answer quality.”

Python SQL Bash Shell Scripting Data Structures Algorithms+170

View profile

Harish Kasu

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

San Francisco, CA5y exp

NVIDIATexas A&M University-Kingsville

“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”

Python FastAPI Flask R SQL Java+204

View profile

Vinay Ramrupe

Screened

Mid-level AI/ML Engineer specializing in LLM and enterprise generative AI

San Francisco, CA5y exp

DatabricksCleveland State University

“ML/AI engineer focused on taking LLM systems from experimentation to reliable production, including enterprise copilot and RAG-based knowledge retrieval use cases. Stands out for combining data pipelines, model training, inference optimization, automated evaluation, and safety guardrails, with cited impact including 20% throughput gains and 30% less manual evaluation effort.”

Python SQL Bash Shell Scripting Data Structures Supervised Learning+129

View profile

Satish Mattam

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and scalable GPU inference

Bay Area, CA5y exp

PerplexitySaint Louis University

A/B Testing Agile Anomaly Detection Apache Hive Apache Kafka Apache Spark+165

View profile

Bharath Mamidi

Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps

San Francisco, CA6y exp

Scale AISaint Louis University

Python FastAPI Flask TypeScript Machine Learning Deep Learning+130

View profile

Mohammed Ahmed

Mid-level Machine Learning Engineer specializing in search, retrieval, and generative AI

San Francisco, CA4y exp

PerplexityTrine University

Python SQL Bash Data Structures & Algorithms System Design Supervised Learning+111

View profile

KEERTHI KOTHA

Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure

San Francisco, CA6y exp

PerplexityStevens Institute of Technology

Machine Learning Large Language Models (LLMs)Generative AI Retrieval-Augmented Generation (RAG)Vector Search Embeddings+102

View profile

Raghunath Kunigiri

Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems

San Francisco, CA4y exp

PlaidSaint Louis University

Machine Learning Deep Learning Generative AI Python PyTorch TensorFlow+99

View profile

Michael Chen

Executive engineering leader and full-stack engineer specializing in FinTech and AI platforms

San Francisco, CA16y exp

NavigateAICornell University

Python Go Java C#Node.js TypeScript+90

View profile

Nishitha Thummala

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp

PerplexityUniversity of Nebraska Omaha

“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”

Python FastAPI Flask Django gRPC JavaScript+167

View profile

Kowshika M

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety

Santa Clara, CA5y exp

NVIDIAOregon State University

“AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.”

A/B Testing Ansible Apache Kafka Apache Spark Automated Testing AWS+113

View profile

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp

NVIDIASaint Louis University

“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”

Python Java Spring Boot JavaScript TypeScript React+129

View profile

Vinnie Yerramadha

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

San Francisco, CA6y exp

ShopifyUniversity of North Texas

Python SQL Bash C JavaScript PHP+173

View profile

BhanuPrasad Pothagani

Mid-level Full-Stack Java Engineer specializing in scalable microservices and real-time data systems

Bay Area, CA5y exp

MetaFlorida Institute of Technology

Java Python JavaScript TypeScript C C+++221

View profile

Sumukh Ramagiri

Mid-level Machine Learning Engineer specializing in LLMs and RAG systems

San Francisco, CA5y exp

Scale AIUniversity of New Haven

Python SQL Bash Data Structures Algorithms Multithreading+133

View profile

Roop Gundu

Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems

San Francisco, CA5y exp

NVIDIAArizona State University

A/B Testing Agile Amazon Bedrock Anomaly Detection Apache Spark AWS+178

View profile

Sravanthi REDDY

Mid-level AI/ML Engineer specializing in LLM fine-tuning and RAG systems

San Francisco, CA5y exp

Scale AIConcordia University

Amazon EC2 Amazon EKS Amazon S3 Amazon SageMaker Apache Kafka API Development+90

View profile

Hemalatha Papasani

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and GPU-accelerated cloud systems

Santa Clara, CA4y exp

NVIDIAConcordia University Wisconsin

Python Pandas Java Spring Boot Node.js TypeScript+126

View profile

Vrushank Prasanna

Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps

Mountain View, CA5y exp

MetaUniversity of North Carolina at Charlotte

Python Java C C++MATLAB Bash+154

View profile

Rhutwij Tulankar

Screened ReferencesStrong rec.

Engineering Manager and ML/Data Architect specializing in scalable data platforms and personalization

San Francisco, CA11y exp

RecruiticsRochester Institute of Technology

“Hands-on engineering manager at a marketing company leading a highly senior, distributed team (10 direct reports) while personally coding ~60–70% and owning end-to-end architecture across three interconnected products. Built agentic CRM automation and a reinforcement-learning-driven distribution layer for channel spend/bidding, with a strong focus on scalable design and observability (Prometheus/APM/logging) enabling frequent releases and few production incidents.”

Amazon DynamoDB Amazon ECS Amazon Kinesis Amazon Redshift Amazon S3 Amazon SQS+263

View profile

Need someone specific?

AI Search