Zhiwen Zhao

Junior Data Engineer specializing in cloud ETL and big data platforms

New York, NY | Data Engineer Associate 2 | 3 years experience | Junior
Industries: Financial Services, Transportation & Logistics, Government
Screened | Identity Verified

Connect with Zhiwen

Zhiwen already has a relationship with Reval, so a warm intro from us gets a much better response than cold outreach.

About

Data engineer focused on transit/transportation datasets, building Spark-based pipelines that ingest from Oracle/APIs, apply PySpark data-quality fixes, and publish star-schema fact tables to Azure Data Lake. Experienced troubleshooting complex Spark failures (using checkpointing to manage long lineage) and operating Airflow-driven backfills and GitLab CI deployments for production DAGs.

Experience

Data Engineer Associate 2, Bank of China America Data Center
Azure Data Engineer Associate, Metropolitan Transportation Authority
Teaching Assistant of Big Data Management & Analytics, NYU Tandon
Data Scientist Intern, Shanghai Big Data Center

Education

New York University: Master's, Computer Science (2025)
Shanghai University of Finance and Economics: Bachelor's, Data Science and Big Data Technologies (2022)

Key Strengths

  • Owned end-to-end data pipelines from Oracle ingestion through PySpark transformations to Azure Data Lake serving layer
  • Hands-on data quality validation with route coverage and record-count checks (e.g., routes 1–6, 300–500 records per route)
  • Debugged Spark execution failures (Java-side errors surfaced in PySpark) by isolating the root cause and using local checkpointing to break long lineage and reduce recomputation
  • Designed and ran backfills via Airflow DAGs and manual spark-submit runs with parameterized date conditions; enabled parallel backfill execution
  • Pragmatic technology selection in ambiguous projects (chose GraphFrames over NetworkX for Spark DataFrame compatibility)
  • Implemented daily freshness checks using last-modified timestamps to ensure timely updates
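
The validation and freshness checks named in the list above could look roughly like the following. This is a minimal sketch in plain Python, not the candidate's actual code: the function name `validate_batch`, the one-day freshness window, and the input shapes are all assumptions made for illustration, and the thresholds simply reuse the example figures quoted above (routes 1–6, 300–500 records per route).

```python
from datetime import datetime, timedelta, timezone

# Hypothetical thresholds, reusing the example figures from the profile.
EXPECTED_ROUTES = set(range(1, 7))     # routes 1 through 6
MIN_RECORDS, MAX_RECORDS = 300, 500    # accepted per-route record-count band
MAX_AGE = timedelta(days=1)            # assumed "daily" freshness window


def validate_batch(route_counts, last_modified, now=None):
    """Return a list of data-quality problems; an empty list means the batch passed.

    route_counts  -- dict mapping route id to number of records in the batch
    last_modified -- timezone-aware datetime of the dataset's last update
    """
    now = now or datetime.now(timezone.utc)
    problems = []

    # Route coverage: every expected route must appear in the batch.
    missing = sorted(EXPECTED_ROUTES - set(route_counts))
    if missing:
        problems.append(f"missing routes: {missing}")

    # Record counts: each expected route that is present must fall inside the band.
    for route in sorted(EXPECTED_ROUTES & set(route_counts)):
        count = route_counts[route]
        if not MIN_RECORDS <= count <= MAX_RECORDS:
            problems.append(
                f"route {route}: {count} records outside {MIN_RECORDS}-{MAX_RECORDS}"
            )

    # Freshness: the last-modified timestamp must be within the window.
    if now - last_modified > MAX_AGE:
        problems.append(f"stale data: last modified {last_modified.isoformat()}")

    return problems


if __name__ == "__main__":
    check_time = datetime(2025, 1, 2, tzinfo=timezone.utc)
    updated_at = datetime(2025, 1, 1, 12, tzinfo=timezone.utc)
    sample = {1: 320, 2: 410, 3: 299, 4: 500, 5: 350}
    for problem in validate_batch(sample, updated_at, check_time):
        print(problem)
```

In a Spark pipeline the `route_counts` mapping would typically come from a grouped count on the fact table, and the last-modified timestamp from the storage layer's file metadata; both are left outside the sketch so it stays runnable without a cluster.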

Similar Candidates

Thuc Duong

Screened

Senior Data Engineer specializing in AI-driven GTM analytics and LLM evaluation

Long Island City, NY | 5y exp
Meta | Temple University

“Data/analytics engineer who stood up foundational pipelines and services at Meta for the Ray-Ban Meta launch—building a retailer sales ingestion system (S3/Hive) with rigorous DQ checks, 1-day SLAs, and dimensional rollups used by GTM to track sales trends. Also built a modular multi-retailer web-scraping system for out-of-stock alerts and shipped internal GraphQL APIs and an n8n-like workflow builder using serverless (AWS Lambda) with strong testing and observability practices.”

Data Engineering, Data Quality, Docker, ETL, JSON, Machine Learning (+63 more)

Tianming Zhang

Mid-level Data Engineer specializing in big data platforms and analytics infrastructure

New York, NY | 7y exp
Meta | University of Illinois Chicago
Python, Scala, Go, Java, Apache Spark, Apache Airflow (+57 more)

John Villarraga

Staff-level Software Engineer specializing in AI, data platforms, and cloud infrastructure

New York, NY | 8y exp
GrowthLoop | Carnegie Mellon University
Python, Node.js, SQL, TypeScript, Celery, SQLAlchemy (+50 more)

Sanketh Reddy

Screened

Senior Data Engineer specializing in cloud data platforms and large-scale ETL

Jersey City, NJ | 6y exp
JPMorgan Chase | University of Texas at Dallas

“Data engineer focused on large-scale ETL/ELT pipelines across cloud stacks (GCP and AWS), including Spark-based transformations and orchestration with Airflow. Has experience loading up to ~2TB per BigQuery target table and designing atomic loads to multiple downstream systems (Elasticsearch + Kafka), with Kubernetes deployment and Jenkins CI/CD.”

Python, SQL, Scala, Java, R, C++ (+81 more)

Languages

English

Skills

Python, Java, Scala, R, SQL, C#, MATLAB, HTML, Apache Spark, PySpark, TensorFlow, PyTorch, Scikit-learn, GitLab, Microsoft Azure