Pre-screened and vetted candidates in the NYC metro area.
Mid-level Data Engineer specializing in multi-cloud real-time and batch data pipelines
“Data engineer with healthcare domain experience who owned 100M+ record pipelines end-to-end (Kafka/Kinesis/ADF → PySpark/dbt validation → Spark SQL transforms → Snowflake/Power BI serving). Built production-grade reliability practices (Airflow orchestration, CloudWatch/Grafana monitoring, pytest + contract/regression tests, idempotent ingestion/backfills) and delivered measurable improvements: 35% lower latency and 40% better query performance.”
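The “idempotent ingestion/backfills” practice named above is typically an upsert keyed on a record’s natural key, so replaying a batch or backfill never duplicates rows. A minimal Python sketch under that assumption (the in-memory `store` and `upsert_batch` names are illustrative, not from the profile):

```python
from typing import Dict, Iterable


def upsert_batch(store: Dict[str, dict], records: Iterable[dict],
                 key: str = "record_id") -> Dict[str, dict]:
    """Idempotent load: each record overwrites any prior version with the
    same natural key, so replaying a batch (e.g. a backfill) is a no-op."""
    for rec in records:
        store[rec[key]] = rec  # last-write-wins on the natural key
    return store


# Replaying the same batch leaves the store unchanged.
batch = [{"record_id": "a1", "value": 10}, {"record_id": "a2", "value": 20}]
store: Dict[str, dict] = {}
upsert_batch(store, batch)
upsert_batch(store, batch)  # second run is a harmless replay
```

In a warehouse the same idea is usually a `MERGE` on the natural key rather than an in-memory dict.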
Mid-level Data Engineer specializing in capital markets post-trade data platforms
“Data/streaming engineer in capital markets who led an end-to-end trade settlement data product (Kafka→MongoDB→data lake) with rigorous data-quality logic and ~$175K first-year operational impact. Also built a low-latency Go-based CME market data engine feeding SOFR curve generation, using MSK on EKS with performance tuning (idempotency, compression, partitioning) to achieve sub-100ms delivery.”
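The “partitioning” tuning mentioned above generally means keying messages by instrument so each instrument’s updates stay ordered within one Kafka partition. A hedged sketch of that key-to-partition mapping (a stable hash standing in for Kafka’s default partitioner; the symbol `SOFR-3M` is illustrative):

```python
import hashlib


def partition_for(key: str, num_partitions: int) -> int:
    """Deterministically map a message key (e.g. an instrument symbol)
    to a partition, so all updates for one instrument stay ordered."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_partitions


# The same key always lands on the same partition; different keys spread out.
p1 = partition_for("SOFR-3M", 12)
p2 = partition_for("SOFR-3M", 12)
```

Per-key ordering plus idempotent producers is what makes sub-100ms delivery safe to retry without reordering or duplicating ticks.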
Mid-level Azure Data Engineer specializing in Databricks lakehouse and Spark pipelines
Principal Cloud Data Engineering Leader specializing in lakehouse and streaming platforms
Mid-level Data Engineer specializing in cloud ETL/ELT and analytics platforms
Senior AI/ML Engineer specializing in Python, LLMs, and agentic AI on cloud platforms
Senior Data Engineer specializing in Azure Lakehouse and LLM/ML data platforms
Mid-level Data Engineer specializing in cloud ETL, big data, and analytics
Mid-level Data Engineer specializing in cloud ETL and real-time streaming
“Data engineer focused on AWS + Spark/Databricks pipelines, including an end-to-end nightly loan-data ingestion flow (~2.2M records) from Postgres/S3 through Glue and Databricks into a DWH with layered validation and alerting. Also built real-time streaming pipelines with Kafka + Spark Structured Streaming, including a master’s project that streamed Reddit data for sentiment analysis under ambiguous requirements and tight budget constraints.”
Mid-level Data Engineer specializing in cloud ETL/ELT and lakehouse architecture
“Data engineer focused on sales/marketing analytics pipelines, owning ingestion from CRMs/ad platforms through warehouse serving and dashboards at ~hundreds of thousands of records/day. Built reliability-focused systems including dbt/SQL/Python data quality gates with alerting, a resilient web-scraping pipeline (retries/backoff, anti-bot tactics, schema-change detection, backfills), and a versioned internal REST API with caching and strong developer usability.”
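The “retries/backoff” resilience pattern in the scraping pipeline above is usually exponential backoff with jitter. A minimal sketch, assuming a generic zero-argument `fetch` callable (names are illustrative, not from the profile):

```python
import random
import time


def fetch_with_backoff(fetch, max_attempts=5, base_delay=0.5, sleep=time.sleep):
    """Retry a flaky fetch with exponential backoff plus full jitter,
    re-raising only after the final attempt fails."""
    for attempt in range(max_attempts):
        try:
            return fetch()
        except Exception:
            if attempt == max_attempts - 1:
                raise
            # Full jitter spreads retries out to avoid thundering-herd bursts.
            sleep(random.uniform(0, base_delay * 2 ** attempt))


# Example: a fetch that fails twice with a transient error, then succeeds.
attempts = {"n": 0}

def flaky():
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise ConnectionError("transient")
    return "ok"

result = fetch_with_backoff(flaky, sleep=lambda s: None)  # no real sleeping in the demo
```

Injecting `sleep` as a parameter keeps the helper unit-testable without slowing the test suite, which is the same testability concern the dbt/pytest quality gates address.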
Mid-level Data Engineer specializing in real-time streaming and cloud data platforms
“Data engineer with Wells Fargo experience owning an end-to-end lakehouse ETL pipeline on Databricks/Azure Data Factory, processing ~480GB daily and implementing robust data quality/reconciliation across 40+ tables to reach ~99.3% reliability. Strong in performance optimization (cut runtime 5.5h→3.8h), CI/CD and monitoring, and resilient external/API ingestion with retries, schema validation, and backfills.”
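Multi-table reconciliation like the ~99.3% figure above is commonly built on per-table row-count comparisons against a drift tolerance. An illustrative sketch (the `reconcile` helper and table names are hypothetical, not from the profile):

```python
def reconcile(source_counts: dict, target_counts: dict,
              tolerance: float = 0.0) -> dict:
    """Compare per-table row counts between a source system and the
    lakehouse target; return tables whose relative drift exceeds tolerance."""
    failures = {}
    for table, src in source_counts.items():
        tgt = target_counts.get(table, 0)
        drift = abs(src - tgt) / src if src else (1.0 if tgt else 0.0)
        if drift > tolerance:
            failures[table] = {"source": src, "target": tgt, "drift": drift}
    return failures


# "members" is off by 2% against a 1% tolerance, so it is flagged.
src = {"claims": 1_000_000, "members": 50_000}
tgt = {"claims": 1_000_000, "members": 49_000}
bad = reconcile(src, tgt, tolerance=0.01)
```

Production versions typically add per-column checksums on top of row counts, but the count-plus-tolerance gate is the usual first line of defense.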
Senior Backend/Cloud Developer specializing in Python and AWS-native data workflows
Mid-level Data Engineer specializing in cloud data pipelines and warehousing
Mid-level Data Engineer specializing in cloud ETL/ELT, Spark, and streaming pipelines
Mid-level Data Engineer specializing in cloud data platforms (AWS & GCP)
Mid-level Data Engineer specializing in cloud lakehouse and streaming analytics for financial services
Senior Data Engineer specializing in cloud data platforms and lakehouse architecture
Mid-level AI/Data Engineer specializing in LLM agents, RAG, and cloud data pipelines
Senior Lead Data Engineer specializing in cloud data platforms and real-time ML pipelines
Mid-level Data Analyst/Data Engineer specializing in machine learning and NLP
Mid-level Sales and Data Professional specializing in FinTech, telecom, and insurance
Mid-level Data Engineer specializing in cloud data pipelines and big data platforms
“Data engineer with ~4 years of experience building Python-based data ingestion/processing services and real-time streaming pipelines (Kafka/PubSub + Spark Structured Streaming). Deployed containerized data applications on Kubernetes with GitLab CI/Jenkins pipelines and applied GitOps to cut deployment time by ~40% while reducing config drift. Also supported a legacy on-prem data warehouse/backend migration to GCP, using phased migration and parallel validation to meet strict reliability/SLA requirements.”
Junior Data Engineer specializing in cloud ETL/ELT and lakehouse platforms