“ML/NLP leader with 12+ years of impact across LinkedIn, TikTok, and Levi's, building and productionizing multimodal recommendation and embedding-based search systems. Deep experience in entity resolution, vector retrieval, and rigorous evaluation, with cloud-native deployment/monitoring (MLflow, Airflow, SageMaker/Lambda, Azure ML, Kubernetes) and demonstrated double-digit relevance gains at millions-of-users scale.”

Python, C, C++, Java, JavaScript, R (+158 more)

Harish Kasu

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

San Francisco, CA · 5y exp

NVIDIA · Texas A&M University-Kingsville

“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”

Python, FastAPI, Flask, R, SQL, Java (+204 more)

Gaurav Madhav

Staff Software Engineer specializing in LLMs and ML platforms

Pleasanton, CA · 19y exp

Blackhawk Network · Jaypee Institute of Information Technology

Agentic AI, LangGraph, Amazon Bedrock, Prompt Engineering, Machine Learning, Model Evaluation (+68 more)

Sony Arravena

Mid-level AI/ML Engineer specializing in LLMs, RAG, and production NLP

CA, USA · 6y exp

Meta · University of Central Missouri

A/B Testing, Amazon EKS, Amazon Redshift, Amazon S3, Amazon SageMaker, AWS CodePipeline (+144 more)

Satish Mattam

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and scalable GPU inference

Bay Area, CA · 5y exp

Perplexity · Saint Louis University

A/B Testing, Agile, Anomaly Detection, Apache Hive, Apache Kafka, Apache Spark (+165 more)

Mason Gallo

Principal Machine Learning Scientist specializing in GenAI, LLMs, and RAG

Austin, TX · 13y exp

Season Health · Georgia Tech

A/B Testing, Apache Airflow, Apache Kafka, Apache Spark, Azure Machine Learning, BERT (+108 more)

Bharath Mamidi

Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps

San Francisco, CA · 6y exp

Scale AI · Saint Louis University

Python, FastAPI, Flask, TypeScript, Machine Learning, Deep Learning (+130 more)

Keerthi Kotha

Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure

San Francisco, CA · 6y exp

Perplexity · Stevens Institute of Technology

Machine Learning, Large Language Models (LLMs), Generative AI, Retrieval-Augmented Generation (RAG), Vector Search, Embeddings (+102 more)

Raghunath Kunigiri

Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems

San Francisco, CA · 4y exp

Plaid · Saint Louis University

Machine Learning, Deep Learning, Generative AI, Python, PyTorch, TensorFlow (+99 more)

Mani Kishore Kamanaboina

Mid-level Machine Learning Engineer specializing in generative AI, NLP, and MLOps

4y exp

NVIDIA · Florida State University

A/B Testing, Apache Cassandra, Apache Hadoop, Apache Spark, AWS, AWS Glue (+88 more)

Niteesh Singh

Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference

New York City, NY · 4y exp

Perplexity · Cleveland State University

A/B Testing, Amazon EC2, Amazon EKS, Amazon S3, Apache Spark, Argo CD (+145 more)

Ben Wang

Senior Machine Learning Engineer specializing in GenAI, NLP, and recommendation systems

Seattle, WA · 10y exp

eBay · University of Illinois Urbana-Champaign

Python, Java, Scala, SQL, Bash, C++ (+128 more)

Manaswini Gogineni

Screened · References · Strong rec.

Mid-level Software Engineer specializing in cloud infrastructure and full-stack web development

San Francisco, CA · 2y exp

Cisco · University of Wisconsin–Madison

“Backend engineer at Electric Hydrogen who built a serverless device-log ingestion and processing platform in Python/Flask, scaling throughput (4x peak ingestion) while keeping sub-300ms API latency. Strong in Postgres/SQLAlchemy performance (partitioning, materialized views) and production ML integration (ONNX model served via FastAPI microservice with async batch inference, Redis feature caching, and drift monitoring via S3/Lambda). Experienced designing secure multi-tenant systems with schema-per-tenant isolation and KMS-backed encryption.”

Go, Python, JavaScript, TypeScript, Java, C++ (+140 more)

Christopher Bun

Screened

Executive AI/ML technology leader specializing in healthcare, biotech, and legal AI

Irvine, CA · 17y exp

Augnition Labs · University of Chicago

“Repeat founder and startup advisor with experience spanning academic, health tech, legal tech, sports, and gaming. Has participated in fundraising and due diligence and has built companies, engineering teams, and software platforms from scratch, with a strong product-design-first approach to product-market fit and market selection.”

Machine learning, Deep learning, Model evaluation, Model deployment, Retrieval-augmented generation, Agentic AI (+323 more)

Shailaja Domala

Screened

Executive product and AI leader specializing in data platforms and analytics

California, USA · 24y exp

Cerenity · Cornell University

“Engineering leader with deep experience at Visa building and modernizing large-scale analytics platforms, including refactoring legacy systems into globally available microservices on AWS. Combines hands-on technical judgment in architecture, search platform evaluation, and service reliability with management of distributed international engineering and data teams.”

Analytics, Data Science, Product Management, MLOps, Cloud Computing, Data Quality (+297 more)

Kenil Tanna

Screened

Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services

New York, NY · 7y exp

JPMorgan Chase · IIT Guwahati

“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”

Python, R, SQL, JavaScript, REST APIs, gRPC (+124 more)

Sai Supriya

Screened

Mid-level AI/ML Engineer specializing in LLM alignment, safety, and scalable inference

St. Louis, MO · 7y exp

Anthropic · Saint Louis University

“Built and productionized an AWS-hosted, Kubernetes-orchestrated RAG assistant that enables natural-language Q&A over internal document repositories with grounded answers and citations. Demonstrates strong applied LLM engineering: hallucination mitigation, hybrid retrieval + re-ranking, and rigorous evaluation via benchmarks and A/B testing, plus real-world scaling of compute-heavy inference with dynamic batching and monitoring.”

Apache Spark, AWS, CI/CD, Data Ingestion, Data Pipelines, Data Preprocessing (+127 more)

Nishitha Thummala

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA · 6y exp

Perplexity · University of Nebraska Omaha

“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”

Python, FastAPI, Flask, Django, gRPC, JavaScript (+167 more)

Krishna Reddy

Screened

Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants

New York, NY · 6y exp

Stripe · Indiana Wesleyan University

“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”

Agile, Amazon Bedrock, Apache Hadoop, Apache Hive, Apache Kafka, Apache Spark (+143 more)

Alfred Dwan

Screened

Intern Software Engineer specializing in AI, cloud-native systems, and MLOps

Hong Kong, China · 1y exp

PredictX · NYU

“Backend/full-stack engineer who has owned a production recruiting platform end-to-end (TypeScript/Node microservices for scraping/cleaning/serving job data, RabbitMQ for spike handling, MongoDB + Elasticsearch, AWS containers) with pragmatic CI, logging/alerts, and Docker Compose E2E tests. Also operated high-traffic event pipelines during a Binance internship using Kafka + Redis idempotency, with strong observability and failure-mode/rollback/degradation practices, and has experience designing developer-friendly REST APIs and resilient browser automation for E2E flows.”

API Gateway, C++, CI/CD, Cypress, Design systems, Distributed systems (+119 more)
