ScreenedIdentity Verified
No cost, no commitment - we'll make a personal intro
Muhan Zhang - AI Software Engineer in Palo Alto, USA

Muhan Zhang

Junior AI Software Engineer specializing in LLM pipelines, OCR, and RAG

Platflow.AICornell UniversityPalo Alto, USA2 Years ExperienceJunior LevelWorks On-Site

Connect with Muhan

Muhan already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

Typically responds within 24 hours

Recommended

Already have an account?

About

Built and shipped a production LLM pipeline for nursing home Medicare reimbursement (PDF OCR + fact extraction + keyword RAG + QA) that reportedly increased payouts by ~$1K/month per patient. Strong in LLM ops/benchmarking (ground truth, LLM-as-judge, cost/I-O tracking) and pragmatic optimization—swapped retrieval approaches, fine-tuned a small model to cut OCR cost 90%, and migrated workloads to Azure/Temporal to scale nightly processing 10x.

Hire with Reval

Find your next great hire

Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.

$250one-time kickoff
10%on successful hire
Post a Role90-day money-back guarantee

Key Strengths

  • Deployed LLM system that increased Medicare reimbursement by ~$1,000/month per patient for nursing home facilities
  • Reduced OCR costs ~90% by distilling data and fine-tuning a 2B model and self-hosting
  • Cut QA inference cost ~10x by benchmarking prompts/models and switching from GPT-4.1 to Grok-4-fast-reasoning with minor prompt changes
  • Built LLM infra for prompt/model benchmarking with ground-truth labeling, prompt versioning, and cost/I-O observability
  • Improved throughput ~10x by migrating pipeline tasks to Azure Container Apps Jobs
  • Designed robust Temporal workflows with retries, reset points, and operational visibility for nightly ingestion pipelines
  • Effective collaboration with non-technical SMEs; translates output issues into upstream retrieval/pipeline fixes

Like what you see? We'll introduce you to Muhan directly.

Experience

AI Software EngineerPlatflow.AI · Jan 2024 – Present
AI Software Engineer InternPlatflow.AI · Jul 2024 – Dec 2024internship
AI Engineer InternGenerative Alpha · Feb 2024 – Jul 2024internship
Software Development Engineer InternTencent Company · Mar 2023 – Jul 2023internship

Education

Cornell Universitymaster, Information Science (2024)
University of Illinois Urbana-Champaignbachelor, Mathematics (2022)

Awards

  • Graduating with high distinction in Mathematics
  • Department of Mathematics Dean’s Honoree list

Languages

English

Similar Candidates

Aditya Gupta - Senior Full-Stack Software Engineer specializing in cloud-native distributed systems and AI in San Francisco, CA

Senior Full-Stack Software Engineer specializing in cloud-native distributed systems and AI

San Francisco, CA13y exp
GoogleStanford University
View profile
JZ

Senior AI/ML Engineer specializing in applied AI and scalable backend systems

Palo Alto, CA14y exp
WaymoHarvard University
View profile
Fan Wang - Staff-level Software Engineer specializing in LLM inference infrastructure and scalable model serving in San Pablo, California

Staff-level Software Engineer specializing in LLM inference infrastructure and scalable model serving

San Pablo, California11y exp
OpenAINorthwestern University
View profile
CZ

Senior Full-Stack Engineer specializing in AI/ML platforms and cloud-native systems

Redwood City, CA12y exp
Fireworks AIUC Berkeley
View profile
RW

Russell Wong

Screened ReferencesStrong rec.

Senior Software Engineer specializing in large-scale backend reliability and media platforms

San Bruno, California6y exp
GoogleSan Francisco State University

Backend/data engineer with experience on large-scale consumer platforms (Google and Meta), building high-traffic Python microservices (REST/gRPC) on Kubernetes with strong reliability/observability practices. Delivered AWS container-based deployments with CI/CD and IaC, and built AWS Glue ETL pipelines on S3 with schema evolution and data quality controls; also has demonstrated SQL tuning impact (15% latency reduction) and incident ownership for batch pipelines.

View profile
RW

Senior AI Engineer specializing in LLMs, RAG, and production ML systems

San Francisco, CA11y exp
OpenAIUniversity of North Texas
View profile

Interested in Muhan?

We'll personally introduce you - no strings attached.

For Hiring Teams

Build your dream team with Reval

Our AI agents source, screen, and vet candidates for your open roles. Get qualified, high-intent candidates on your desk within 48 hours.

$250one-time kickoff
10%on successful hire
48hrsto first candidates
Post a Role90-day money-back guarantee. A fraction of traditional agency fees.

Discover more candidates like Muhan

Search across thousands of pre-screened, high-quality, high-intent candidates on Reval.

Search Talent

Connect with Muhan

Muhan already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

Typically responds within 24 hours

Recommended

Already have an account?

Hire with Reval

Find your next great hire

Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.

$250one-time kickoff
10%on successful hire
Post a Role90-day money-back guarantee
Muhan ZhangJunior AI Software Engineer specializing in LLM pipelines, OCR, and RAG