Keerthana Senthilnathan

Junior Machine Learning Engineer specializing in LLM systems and inference reliability

llm-d · UC San Diego · California, USA · 1 Year Experience · Junior Level · Works On-Site

About

ML/LLM infrastructure-focused engineer who built a production stateful LLM inference service that cuts latency and GPU compute for repeated/overlapping prompts via caching with correctness guardrails. Strong in Kubernetes-based deployment and reliability engineering, using A/B testing and similarity-based evaluation to quantify performance gains without sacrificing output quality.

Key Strengths

Built and deployed a stateful LLM inference service to reduce latency and GPU cost for overlapping prompts
Balances correctness vs performance using similarity thresholds and automatic cache bypass
Strong production reliability focus (bounded caches, eviction strategies, memory pressure monitoring to prevent OOM)
Metrics-driven evaluation using controlled A/B experiments (latency, cache hit rate, GPU savings, output similarity distributions)
Hands-on container orchestration in Kubernetes (resource limits/requests, scheduling, restart policies, horizontal scaling)
Designs AI agents as observable, composable, idempotent stages with measurable contracts and guardrails
Pragmatic model/retrieval/prompting selection driven by constraints and empirical validation (offline eval + A/B tests)

Experience

Open-Source Contributor, ML Health & Inference Reliability · llm-d · Jan 2026 – Present

Software Development Engineer Intern, Cross-Border Transportation · Amazon.com Services LLC · Jun 2025 – Sep 2025 · Internship

Research Intern · Carnegie Mellon University, Xu Lab · Dec 2023 – Jun 2024 · Internship

Generative AI Intern · Samsung R&D Institute · Jun 2023 – Aug 2023 · Internship

Education

UC San Diego · Master's, Data Science (Artificial Intelligence & Machine Learning) (2026)

National Institute of Technology Tiruchirappalli · Bachelor's, Electronics & Communication Engineering (2024)

Skills

LoRA, PyTorch, CUDA, TensorFlow, Python, C, C++, SQL, Docker, Kubernetes, AWS, Git, Apache Spark, ETL Pipelines, React

Languages

English

Publications

4 publications

Scalable multi-model training
Knowledge distillation
Named entity recognition (NER)
Healthcare question answering
Medical NLP token classification
Cloud forecasting
Machine learning