Vetted Data Engineers in New York

Pre-screened and vetted in New York.

AWS | AWS Lambda | Amazon S3 | ETL | Amazon Redshift | Apache Airflow

John Villarraga

Staff-level Software Engineer specializing in AI, data platforms, and cloud infrastructure

New York, NY | 8y exp
GrowthLoop | Carnegie Mellon University
Python | Node.js | SQL | TypeScript | Ruby on Rails | Celery | +50

Randy Hollins

Senior Data & AI/ML Engineer specializing in LLM/NLP platforms and cloud data engineering

Bronx, NY | 11y exp
CBRE | NYU
Python | R | SQL | Java | JavaScript | Scala | +147

Revanth Peddi

Mid-level Data Engineer specializing in LLM agents, RAG pipelines, and LLMOps

New York, US | 6y exp
mcSquared AI | University at Buffalo
Python | SQL | Shell | TypeScript | OpenAI APIs | LangChain | +70

Bhuvan Chandi

Screened

Mid-level Data Engineer specializing in AI/ML data platforms

NY, NY | 6y exp
BlackRock | Webster University

Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.

Python | SQL | Shell Scripting | Apache Spark | PySpark | Spark Structured Streaming | +103

Sandhya Nunemunthala

Senior Data Engineer specializing in cloud data platforms and Generative AI

Albany, NY | 12y exp
JPMorgan Chase | Osmania University
AWS | Amazon S3 | Amazon Redshift | Amazon RDS | Amazon EC2 | AWS Glue | +176

Vinay Chandra

Mid-level Data Engineer specializing in cloud lakehouse and scalable data pipelines

Syracuse, NY | 4y exp
HubSpot | Syracuse University
Python | SQL | PySpark | Spark SQL | Scala | Bash | +110

PAVAN VARMA PENMETHSA

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY | 6y exp
Avanade | University of North Texas

Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale, using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour. Partners closely with non-technical business teams to refine fields and deliver usable dashboards.

Machine Learning | Generative AI | Large Language Models (LLMs) | Agentic Systems | Autonomous Agents | LLM Applications | +131

Vaibhav Vikas

Mid-level Data Engineer specializing in cloud ETL, Spark, and analytics platforms

Syracuse, NY | 3y exp
American Express | Syracuse University
Python | SQL | PySpark | R | TypeScript | Node.js | +68

Ramesh Giri

Senior AI/ML Engineer specializing in Python, LLMs, and agentic AI on cloud platforms

New York, NY | 9y exp
PVH | University of Texas at Arlington
Python | Java | Scala | Kotlin | C# | .NET | +156

Ajay Chauhan

Senior Backend/Cloud Developer specializing in Python and AWS-native data workflows

New York, NY | 11y exp
PVH | Northern Illinois University
Python | Java | TypeScript | JavaScript | Bash | SQL | +163

Yogitha Goli

Mid-level Data Engineer specializing in cloud ETL/ELT, Spark, and streaming pipelines

New York, USA | 3y exp
S&P Global | University at Albany
Python | Pandas | NumPy | PySpark | SQL | PostgreSQL | +90

Ignacio Silva Bartholomaus

Mid-Level Data Engineer specializing in cloud data platforms (AWS & GCP)

Brooklyn, NY | 4y exp
Novis | Diego Portales University
API Integrations | Apache Airflow | Apache Beam | Athena | AWS | AWS Athena | +36

Shane Stevens

Senior Data Engineer specializing in cloud data platforms and real-time streaming

Syracuse, NY | 14y exp
Haithem Tech | University of Vermont
Python | SQL | Java | Scala | NoSQL | PostgreSQL | +68

Anirudha Raghava Sarma Kuchibhotla

Mid-level AI/Data Engineer specializing in LLM agents, RAG, and cloud data pipelines

New York, NY | 4y exp
American Arbitration Association | Northeastern University
LangChain | LangGraph | Retrieval-Augmented Generation (RAG) | RAG Pipelines | OpenAI | GPT-4o | +64

Ananya Singh

Mid-level Data Analyst/Data Engineer specializing in machine learning and NLP

New York | 3y exp
Bright Mind Enrichment and Schooling | Rochester Institute of Technology
A/B Testing | BERT | BERT Fine-tuning | Chromadb | Clustering | Customer Segmentation | +41

Yang MA

Screened

Junior Backend Software Engineer specializing in search, data systems, and LLM applications

New York, NY | 2y exp
Bevel Health | University of Pittsburgh

Built a contract and customer documentation retrieval solution for Urban Studio, designing a RAG + Elasticsearch hybrid search stack (RRF + cross-encoder reranking) with a strong emphasis on chunking/data quality and hallucination reduction. Experienced in diagnosing LLM workflow issues via observability traces and tailoring technical demos to developer concerns like reliability and high concurrency.

Python | TypeScript | Go | SQL | JavaScript | Next.js | +102

Harshaditya Mallipudi

Mid-level Data Scientist/ML Engineer specializing in LLMs and Generative AI

Rochester, NY | 3y exp
Coverdoc AI | Rochester Institute of Technology
A/B Testing | Agentic Workflows | Authentication | AWS | AWS EC2 | AWS Lambda | +75