Reval Logo
Home Browse Talent Data Engineers NYC Metro

Vetted Data Engineers in the NYC Metro

Pre-screened and vetted in the NYC Metro.

AWS LambdaApache AirflowCI/CDPythonAWSAmazon S3
JV

John Villarraga

Staff-level Software Engineer specializing in AI, data platforms, and cloud infrastructure

New York, NY8y exp
GrowthLoopCarnegie Mellon University
PythonNode.jsSQLTypeScriptRuby on RailsCelery+50
View profile
RH

Randy Hollins

Senior Data & AI/ML Engineer specializing in LLM/NLP platforms and cloud data engineering

Bronx, NY11y exp
CBRENYU
PythonRSQLJavaJavaScriptScala+147
View profile
RP

Revanth Peddi

Mid-level Data Engineer specializing in LLM agents, RAG pipelines, and LLMOps

New York, US6y exp
mcSquared AIUniversity at Buffalo
PythonSQLShellTypeScriptOpenAI APIsLangChain+70
View profile
RR

Roberto Rodas-Herndon

Junior Data Scientist specializing in analytics automation and BI dashboards

Newark, NJ2y exp
Public Service Enterprise GroupBoston University
PythonJavaScriptSQLFlaskReactReact.js+36
View profile
PV

PAVAN VARMA PENMETHSA

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY6y exp
AvanadeUniversity of North Texas

“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”

Machine LearningGenerative AILarge Language Models (LLMs)Agentic SystemsAutonomous AgentsLLM Applications+131
View profile
SS

Sekhar Sabbisetti

Mid-level Azure Data Engineer specializing in Databricks lakehouse and Spark pipelines

Jersey City, NJ6y exp
CitibankUniversity of Cincinnati
PythonSQLScalaPySparkUnix Shell ScriptingApache Spark+68
View profile
RG

Ramesh Giri

Senior AI/ML Engineer specializing in Python, LLMs, and agentic AI on cloud platforms

New York, NY9y exp
PVHUniversity of Texas at Arlington
PythonJavaScalaKotlinC#.NET+156
View profile
SR

Subramanyam R

Mid-level Data Engineer specializing in cloud ETL, big data, and analytics

Newark, NJ6y exp
Cosette PharmaceuticalsWilmington University
AgileAirtableAmazon S3Amazon SNSApache AirflowApache Flink+62
View profile
AC

Ajay Chauhan

Senior Backend/Cloud Developer specializing in Python and AWS-native data workflows

New York, NY11y exp
PVHNorthern Illinois University
PythonJavaTypeScriptJavaScriptBashSQL+163
View profile
YG

Yogitha Goli

Mid-level Data Engineer specializing in cloud ETL/ELT, Spark, and streaming pipelines

New York, USA3y exp
S&P GlobalUniversity at Albany
PythonPandasNumPyPySparkSQLPostgreSQL+90
View profile
IS

Ignacio Silva Bartholomaus

Mid-Level Data Engineer specializing in cloud data platforms (AWS & GCP)

Brooklyn, NY4y exp
NovisDiego Portales University
API IntegrationsApache AirflowApache BeamAthenaAWSAWS Athena+36
View profile
AR

Anirudha Raghava Sarma Kuchibhotla

Mid-level AI/Data Engineer specializing in LLM agents, RAG, and cloud data pipelines

New York, NY4y exp
American Arbitration AssociationNortheastern University
LangChainLangGraphRetrieval-Augmented Generation (RAG)RAG PipelinesOpenAIGPT-4o+64
View profile
AH

Ashanti Hameed

Senior Lead Data Engineer specializing in cloud data platforms and real-time ML pipelines

Hillside, NJ13y exp
NexusMontclair State University
ETLELTData pipelinesData modelingData warehousingData lakes+75
View profile
AS

Ananya Singh

Mid-level Data Analyst/Data Engineer specializing in machine learning and NLP

New York3y exp
Bright Mind Enrichment and SchoolingRochester Institute of Technology
A/B TestingBERTBERT Fine-tuningChromadbClusteringCustomer Segmentation+41
View profile
BB

bharath burgoju

Screened

Mid-Level Data Engineer specializing in cloud data pipelines and big data platforms

Newark, NJ3y exp
Horizon Blue Cross Blue Shield of NJUniversity of Memphis

“Data engineer with ~4 years of experience building Python-based data ingestion/processing services and real-time streaming pipelines (Kafka/PubSub + Spark Structured Streaming). Has deployed containerized data applications on Kubernetes with GitLab CI/Jenkins pipelines and applied GitOps to cut deployment time ~40% while reducing config drift. Also supported a legacy on-prem data warehouse/backend migration to GCP using phased migration and parallel validation to meet strict reliability/SLA needs.”

AgileAmazon AthenaAmazon CloudFrontAmazon CloudTrailAmazon DynamoDBAmazon EC2+114
View profile
PN

Phani Narla

Junior Data Engineer specializing in cloud ETL/ELT and lakehouse platforms

Newark, NJ2y exp
Horizon Blue Cross Blue Shield of NJUniversity of Central Missouri
Agile MethodologyAmazon AthenaAmazon CloudFrontAmazon CloudTrailAmazon DynamoDBAmazon EC2+94
View profile
YM

Yang MA

Screened

Junior Backend Software Engineer specializing in search, data systems, and LLM applications

New York, NY2y exp
Bevel HealthUniversity of Pittsburgh

“Built a contract and customer documentation retrieval solution for Urban Studio, designing a RAG + Elasticsearch hybrid search stack (RRF + cross-encoder reranking) with a strong emphasis on chunking/data quality and hallucination reduction. Experienced in diagnosing LLM workflow issues via observability traces and tailoring technical demos to developer concerns like reliability and high concurrency.”

PythonTypeScriptGoSQLJavaScriptNext.js+102
View profile

Need someone specific?

AI Search