ScreenedIdentity Verified
No cost, no commitment - we'll make a personal intro
HK

Harish Kasu

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

NVIDIATexas A&M University-KingsvilleSan Francisco, CA5 Years ExperienceMid LevelWorks On-Site

Connect with Harish

Harish already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

Typically responds within 24 hours

Recommended

Already have an account?

About

AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.

Hire with Reval

Find your next great hire

Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.

$250one-time kickoff
10%on successful hire
Post a Role90-day money-back guarantee

Key Strengths

  • Built and deployed production RAG enterprise knowledge assistant at NVIDIA
  • Improved retrieval/answer accuracy by 42% via high-precision retrieval pipeline
  • Low-latency, high-throughput LLM inference optimization using TensorRT-LLM and Triton
  • Scaled system to thousands of queries with performance stability under load
  • Implemented quantization and speculative decoding to reduce GPU usage
  • Strong MLOps: drift detection, metrics dashboards, and continuous evaluation (Prometheus/Grafana, MLflow, LangSmith)
  • Designed reliable agent/workflow evaluation with synthetic tests, unit tests, and red teaming
  • End-to-end orchestration across multi-cloud using Airflow + Kubeflow with reproducibility via containers and Git config
  • Effective collaboration with non-technical stakeholders; translated requirements into AI solution improving accuracy by ~20%

Like what you see? We'll introduce you to Harish directly.

Experience

AI/ML ENGINEERNVIDIA · May 2024 – Present
AI/ML ENGINEERMicrosoft Corporation · Apr 2020 – Jul 2023

Education

Texas A&M University-Kingsvillemaster, Computer Science

Languages

English

Certifications

AWS Certified Solutions Architect - AssociateCareer Essentials in Generative AI - Microsoft and LinkedIn

Similar Candidates

SG

Mid-level AI/ML Engineer specializing in LLM training, RAG, and scalable inference

Bay Area, CA3y exp
OpenAICarnegie Mellon University
View profile
VP

Staff AI/ML Engineer specializing in LLMs, fraud detection, and MLOps

Menlo Park, CA9y exp
MetaMetropolitan State University
View profile
JM

Mid-level AI/ML Engineer specializing in LLM training, RAG, and scalable inference

Bay Area, CA5y exp
OpenAICalifornia State University, East Bay
View profile
EP

Esha Pahwa

Screened

Intern Machine Learning Engineer specializing in LLM agents and multimodal reasoning

Mountain View, CA2y exp
Corvic AICarnegie Mellon University

LLM/agent engineer who built a production code-generation agent at Corvic AI that lets non-technical users query CSV/tabular data in natural language by generating and executing Python. Focused on making LLM systems reliable and scalable via schema-aware validation, sandboxed execution-feedback retries, prompt caching/embeddings, async execution, and high-throughput data processing with Polars; also partnered with Adobe product/marketing to ship brand-aligned AI content generation for email and push notifications.

View profile
SS

Senior Machine Learning Engineer specializing in LLMs and scalable MLOps

San Francisco, CA7y exp
MetaIndiana University
View profile

Interested in Harish?

We'll personally introduce you - no strings attached.

For Hiring Teams

Build your dream team with Reval

Our AI agents source, screen, and vet candidates for your open roles. Get qualified, high-intent candidates on your desk within 48 hours.

$250one-time kickoff
10%on successful hire
48hrsto first candidates
Post a Role90-day money-back guarantee. A fraction of traditional agency fees.

Discover more candidates like Harish

Search across thousands of pre-screened, high-quality, high-intent candidates on Reval.

Search Talent

Connect with Harish

Harish already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

Typically responds within 24 hours

Recommended

Already have an account?

Hire with Reval

Find your next great hire

Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.

$250one-time kickoff
10%on successful hire
Post a Role90-day money-back guarantee
Harish KasuMid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps