Vetted Data Engineers in the NYC Metro

Pre-screened and vetted in the NYC Metro.

Harshitha Parupalli - Mid-level Data Engineer specializing in multi-cloud real-time and batch data pipelines in Jersey City, NJ

Mid-level Data Engineer specializing in multi-cloud real-time and batch data pipelines

Jersey City, NJ4y exp
Elevance HealthNJIT

Data engineer with healthcare domain experience who owned 100M+ record pipelines end-to-end (Kafka/Kinesis/ADF → PySpark/dbt validation → Spark SQL transforms → Snowflake/Power BI serving). Built production-grade reliability practices (Airflow orchestration, CloudWatch/Grafana monitoring, pytest + contract/regression tests, idempotent ingestion/backfills) and delivered measurable improvements: 35% lower latency and 40% better query performance.

View profile
KP

Mid-level Data Engineer specializing in capital markets post-trade data platforms

Whippany, NJ3y exp
BarclaysUniversity of Connecticut

Data/streaming engineer in capital markets who led an end-to-end trade settlement data product (Kafka→MongoDB→data lake) with rigorous data-quality logic and ~$175K first-year operational impact. Also built a low-latency Go-based CME market data engine feeding SOFR curve generation, using MSK on EKS with performance tuning (idempotency, compression, partitioning) to achieve sub-100ms delivery.

View profile
SS

Mid-level Azure Data Engineer specializing in Databricks lakehouse and Spark pipelines

Jersey City, NJ6y exp
CitibankUniversity of Cincinnati
View profile
CC

Principal Cloud Data Engineering Leader specializing in lakehouse and streaming platforms

Astoria, NY14y exp
Success Academy Charter SchoolsPurchase College (SUNY)
View profile
VM

Mid-level Data Engineer specializing in cloud ETL/ELT and analytics platforms

New York, USA5y exp
Quantegy AnalyticsStevens Institute of Technology
View profile
RG

Senior AI/ML Engineer specializing in Python, LLMs, and agentic AI on cloud platforms

New York, NY9y exp
PVHUniversity of Texas at Arlington
View profile
SM

Senior Data Engineer specializing in Azure Lakehouse and LLM/ML data platforms

New York, NY8y exp
MedFilo IncGeorge Washington University
View profile
SR

Mid-level Data Engineer specializing in cloud ETL, big data, and analytics

Newark, NJ6y exp
Cosette PharmaceuticalsWilmington University
View profile
AG

Mid-level Data Engineer specializing in cloud ETL and real-time streaming

New York, NY6y exp
PNCRochester Institute of Technology

Data engineer focused on AWS + Spark/Databricks pipelines, including an end-to-end nightly loan-data ingestion flow (~2.2M records) from Postgres/S3 through Glue and Databricks into a DWH with layered validation and alerting. Also built real-time streaming with Kafka + Spark Structured Streaming and a master’s project streaming Reddit data for sentiment analysis under ambiguous requirements and tight budget constraints.

View profile
SH

Mid-level Data Engineer specializing in cloud ETL/ELT and lakehouse architecture

Jersey City, NJ4y exp
State StreetUniversity of New Haven

Data engineer focused on sales/marketing analytics pipelines, owning ingestion from CRMs/ad platforms through warehouse serving and dashboards at ~hundreds of thousands of records/day. Built reliability-focused systems including dbt/SQL/Python data quality gates with alerting, a resilient web-scraping pipeline (retries/backoff, anti-bot tactics, schema-change detection, backfills), and a versioned internal REST API with caching and strong developer usability.

View profile
SP

Mid-level Data Engineer specializing in real-time streaming and cloud data platforms

New York, NY4y exp
Wells FargoUniversity of Birmingham

Data engineer with Wells Fargo experience owning an end-to-end lakehouse ETL pipeline on Databricks/Azure Data Factory, processing ~480GB daily and implementing robust data quality/reconciliation across 40+ tables to reach ~99.3% reliability. Strong in performance optimization (cut runtime 5.5h→3.8h), CI/CD and monitoring, and resilient external/API ingestion with retries, schema validation, and backfills.

View profile
AC

Senior Backend/Cloud Developer specializing in Python and AWS-native data workflows

New York, NY11y exp
PVHNorthern Illinois University
View profile
SB

Mid-level Data Engineer specializing in cloud data pipelines and warehousing

New York, NY3y exp
CitibankUniversity of West Florida
View profile
YG

Mid-level Data Engineer specializing in cloud ETL/ELT, Spark, and streaming pipelines

New York, USA3y exp
S&P GlobalUniversity at Albany
View profile
IS

Mid-Level Data Engineer specializing in cloud data platforms (AWS & GCP)

Brooklyn, NY4y exp
NovisDiego Portales University
View profile
AS

Mid-level Data Engineer specializing in cloud lakehouse and streaming analytics for financial services

New York, NY3y exp
AccentureUniversity of Alabama at Birmingham
View profile
AC

Senior Data Engineer specializing in cloud data platforms and lakehouse architecture

New York, USA6y exp
KinshipNJIT
View profile
AR

Mid-level AI/Data Engineer specializing in LLM agents, RAG, and cloud data pipelines

New York, NY4y exp
American Arbitration AssociationNortheastern University
View profile
AH

Senior Lead Data Engineer specializing in cloud data platforms and real-time ML pipelines

Hillside, NJ13y exp
NexusMontclair State University
View profile
AS

Mid-level Data Analyst/Data Engineer specializing in machine learning and NLP

New York3y exp
Bright Mind Enrichment and SchoolingRochester Institute of Technology
View profile
AS

Mid-level sales and data professional specializing in FinTech, telecom, and insurance

Woodbridge, NJ3y exp
Plymouth Rock AssuranceRowan University
View profile
BB

Mid-Level Data Engineer specializing in cloud data pipelines and big data platforms

Newark, NJ3y exp
Horizon Blue Cross Blue Shield of NJUniversity of Memphis

Data engineer with ~4 years of experience building Python-based data ingestion/processing services and real-time streaming pipelines (Kafka/PubSub + Spark Structured Streaming). Has deployed containerized data applications on Kubernetes with GitLab CI/Jenkins pipelines and applied GitOps to cut deployment time ~40% while reducing config drift. Also supported a legacy on-prem data warehouse/backend migration to GCP using phased migration and parallel validation to meet strict reliability/SLA needs.

View profile
PN

Junior Data Engineer specializing in cloud ETL/ELT and lakehouse platforms

Newark, NJ2y exp
Horizon Blue Cross Blue Shield of NJUniversity of Central Missouri
View profile

Need someone specific?

AI Search