ScreenedIdentity Verified
No cost, no commitment - we'll make a personal intro
CH

Chengzhu He

Staff/Principal Cloud Infrastructure Engineer specializing in Kubernetes and OpenStack

TikTokShanghai University14 Years ExperienceStaff LevelWorks On-Site

Connect with Chengzhu

Chengzhu already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

Typically responds within 24 hours

Recommended

Already have an account?

About

Platform/backend engineer focused on Kubernetes at scale: built a Java control-plane service for multi-region cluster provisioning/monitoring/upgrades using Kafka-driven async workers, and solved peak-load provisioning failures by eliminating blocking I/O and dynamically scaling consumers. Also shipped an LLM-assisted Kubernetes troubleshooting/remediation feature that pulls Prometheus logs/metrics into prompts and uses guardrails (confidence thresholds + human-in-the-loop) to prevent risky actions.

Hire with Reval

Find your next great hire

Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.

$250one-time kickoff
10%on successful hire
Post a Role90-day money-back guarantee

Key Strengths

  • Built and operated a Java control-plane service managing Kubernetes clusters across multiple cloud regions
  • Debugged and resolved high-load reliability failures (thread pool exhaustion + blocking I/O) by moving to non-blocking async calls and dynamically scaling workers/consumers
  • Designed event-driven provisioning/upgrade workflows using Kafka with asynchronous execution
  • Shipped an LLM-powered cluster troubleshooting feature integrating Prometheus logs/metrics into context-rich prompts with remediation recommendations
  • Designed multi-step automated cluster maintenance/upgrade workflows with step tracking in SQL + etcd and typed error handling (timeouts, retries, escalation/stop rules)
  • Improved slow relational queries by adding composite indexes aligned to real query patterns

Like what you see? We'll introduce you to Chengzhu directly.

Experience

Infrastructure Engineer SRETikTok · Oct 2025 – Feb 2026
Member of Technical Staff IIeBay · May 2022 – Dec 2024
Software Engineering Manager & Senior Tech ExpertAnt Group · May 2021 – May 2022
Member of Technical Staff IeBay · Sep 2017 – May 2021
Software Engineer IIMicrosoft · Sep 2015 – Aug 2017
Principal Software EngineeriQIYI · Jul 2015 – Sep 2015
Senior Software EngineeriQIYI · Nov 2014 – Jul 2015
Software developerEMC · Jul 2011 – Oct 2014
Software Engineer InternHP · Oct 2010 – May 2011internship

Education

Shanghai Universitybachelor, Computer Science & Technology (2011)

Awards

  • Bronze prize (global) - EMC (for automating mainframe HA standby system upgrade feature)

Languages

English

Similar Candidates

AO

Staff DevOps Engineer specializing in SRE, Kubernetes, and hybrid cloud platforms

New York, NY9y exp
D. E. ShawMIT
View profile
JR

Staff ML Platform Engineer specializing in distributed training and inference

Menlo Park, CA9y exp
MetaPolytechnic University of Puerto Rico
View profile
HW

Mid-Level Software Engineer specializing in Cloud SRE and LLM-powered automation

Seattle, WA4y exp
GoogleUSC
View profile
IN

Staff Full-Stack & Platform Engineer specializing in cloud-native distributed systems

Houston, TX14y exp
AtlassianUniversity of Houston
View profile
YH

Mid-level Site Reliability Engineer specializing in AI training infrastructure and GPU platforms

Sunnyvale, CA2y exp
Alibaba CloudUC San Diego
View profile
Gregory Walton - Senior Software Engineer specializing in SRE, cloud automation, and reliability tooling in Sunnyvale, CA

Senior Software Engineer specializing in SRE, cloud automation, and reliability tooling

Sunnyvale, CA11y exp
LinkedIn

Technically oriented builder focused on taking ambiguous LLM/agentic workflow needs from scoped problem definition to iterative prototypes, with emphasis on edge cases and customer feedback. Has hands-on experience debugging RAG/agent systems, including resolving a real integration issue involving search query quoting differences between manual search and an API, and is comfortable fielding deep developer Q&A in demos.

View profile

Interested in Chengzhu?

We'll personally introduce you - no strings attached.

For Hiring Teams

Build your dream team with Reval

Our AI agents source, screen, and vet candidates for your open roles. Get qualified, high-intent candidates on your desk within 48 hours.

$250one-time kickoff
10%on successful hire
48hrsto first candidates
Post a Role90-day money-back guarantee. A fraction of traditional agency fees.

Discover more candidates like Chengzhu

Search across thousands of pre-screened, high-quality, high-intent candidates on Reval.

Search Talent

Connect with Chengzhu

Chengzhu already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

Typically responds within 24 hours

Recommended

Already have an account?

Hire with Reval

Find your next great hire

Our AI agents source, screen, and vet candidates for your open roles. Get qualified candidates within 48 hours.

$250one-time kickoff
10%on successful hire
Post a Role90-day money-back guarantee
Chengzhu HeStaff/Principal Cloud Infrastructure Engineer specializing in Kubernetes and OpenStack