Reval Logo
Home Browse Talent Skilled in Apache Hadoop

Vetted Apache Hadoop Professionals

Pre-screened and vetted.

Apache HadoopPythonDockerSQLApache SparkAWS
FL

Fang-Yu Lin

Mid-level Software Engineer specializing in cloud infrastructure automation and ML systems

Spring, Texas3y exp
HPERice University
PythonSQLJavaC++GitApache Airflow+37
View profile
DS

Daniel Stephens

Senior Data Engineer specializing in cloud data platforms and scalable ETL pipelines

Rosharon, TX11y exp
AssistRxUniversity of Texas at Austin
AgileAmazon RedshiftApache AirflowApache HiveApache KafkaAWS+107
View profile
JJ

Jaynil Jaiswal

Mid-level Software Engineer specializing in MLOps, AI infrastructure, and distributed systems

Raleigh, NC4y exp
UC San DiegoUC San Diego
JavaSpring BootPythonPyTorchGoSQL+117
View profile
SK

SRIDHAR KANDI

Mid-level AI/ML Engineer specializing in production ML, NLP, and computer vision

USA6y exp
UberUniversity of Maryland, Baltimore County
A/B TestingAnomaly DetectionApache HadoopApache HiveApache KafkaApache Spark+127
View profile
SG

SriSaiKiranReddy Gorla

Mid-level Machine Learning Engineer specializing in GenAI, LLM agents, and MLOps

Seattle, WA3y exp
AmazonUniversity of Illinois Chicago
A/B TestingAmazon BedrockAmazon CloudWatchAmazon EKSAmazon EMRAmazon ECS+114
View profile
VR

Venkat Ram

Staff Machine Learning Engineer specializing in Generative AI, MLOps, and Computer Vision

18y exp
SyndioUniversity of Nevada, Las Vegas
Apache HadoopApache HiveApache SparkAutomationAWSBigQuery+106
View profile
ME

Madhu Eadara

Principal AI Platform Architect specializing in agentic AI and enterprise LLM infrastructure

Sunnyvale, CA21y exp
CrowdStrikeUniversity of Massachusetts Boston
A/B TestingAPI GatewayAmazon BedrockAnomaly DetectionAWSClustering+154
View profile
SR

Saketh Reddy Dodda

Mid-level Data Engineer specializing in cloud lakehouse and streaming analytics

Remote, US4y exp
RampUniversity of Colorado Boulder
PythonSQLApache SparkApache KafkaApache AirflowDatabricks+88
View profile
TS

Tyler Swanson

Senior Data Engineer specializing in healthcare ETL/ELT and ML

Pasadena, CA12y exp
Doheny Eye InstituteUniversity of Texas at Austin
Amazon EC2Amazon KinesisAmazon RedshiftAmazon S3Apache AirflowApache Kafka+128
View profile
MG

Manny Garcia

Senior Software Developer specializing in Python, AWS, and Big Data

Chicago, Illinois11y exp
CVS HealthCal Poly San Luis Obispo
PythonJavaScriptTypeScriptSQLShell ScriptingBash+94
View profile
KA

Kristopher Ali

Senior Data Engineer specializing in cloud lakehouse platforms and healthcare data

Remote13y exp
DeloitteUniversity of Michigan
PythonSQLScalaBashJavaR+180
View profile
VS

Vudityala Srinidh

Mid-level AI Data Engineer specializing in real-time streaming and LLM-powered fraud analytics

California, USA6y exp
PayPalCalifornia State University, East Bay
PythonSQLPostgreSQLBigQueryPySparkApache Spark+102
View profile
KZ

Khushang Zaveri

Intern AI Researcher specializing in NLP, multimodal AI, and medical ML

2y exp
Johns Hopkins UniversityJohns Hopkins University
Apache HadoopApache HiveApache SparkBashBERTChromaDB+80
View profile
NK

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and fraud/risk analytics in Financial Services

PA, USA4y exp
Capital OneRobert Morris University

“Built and shipped a production-grade GenAI Fraud & Compliance Investigation Copilot for a large US bank, integrating OCR docs, structured data, and prior case history to generate grounded, regulator-friendly summaries and red-flag highlights. Demonstrates strong end-to-end LLM systems engineering (LangGraph/LangChain, hybrid retrieval with FAISS+BM25, guardrails/citations, streaming/latency optimization) plus rigorous evaluation and close partnership with compliance stakeholders.”

A/B TestingAnomaly DetectionApache HadoopApache HiveApache KafkaApache Spark+137
View profile
MP

Mahesh Purushothaman

Screened

Senior Director of Software Engineering specializing in cloud-native microservices for streaming platforms

San Jose, CA20y exp
XperiAnna University

“Engineering leader who drove TiVo IPTV’s client-facing API modernization from a monolith to AWS-based microservices (API Gateway, Lambda, EKS, Kafka, DynamoDB/RDS), including phased/blue-green production routing of millions of calls. Emphasizes org scaling through skill-based hiring, mentorship, and a you-build-you-run ownership culture, while balancing technical leadership with executive stakeholder communication and budgeting.”

JavaGoJavaScriptSpring BootSpring FrameworkAWS+184
View profile
VD

Varshith Dupati

Screened

Mid-level Software Engineer specializing in AWS, full-stack development, and AI data systems

Seattle, Washington3y exp
AmazonArizona State University

“Backend engineer who built a Python-based data profiling/statistics platform processing up to 50M rows and ~300 metrics, using a DAG execution model, multithreading, and smart caching to cut processing time by up to 70%. Also improved PostgreSQL query performance from 12s to 2s via indexing/query rewrites, integrated an LLM (LangChain + OpenAI) for explainable “chat with the pipeline” functionality, and designed an AWS EC2+SQS architecture for scalable, isolated per-user processing.”

JavaJUnitSpring BootPythonCC+++84
View profile
SK

Sai Krishna Yemineni

Screened

Mid-level AI/ML Engineer specializing in healthcare NLP, real-time risk systems, and ML platforms

Massachusetts, USA5y exp
Johnson & JohnsonRivier University

“LLM-focused customer-facing engineer who repeatedly takes document Q&A and agentic prototypes into secure, monitored production systems. Experienced in reducing hallucinations via RAG + guardrails, diagnosing retrieval/embedding issues in real time, and partnering with sales to run metrics-driven PoCs that overcome accuracy/security objections and drive adoption.”

PythonRC++SQLBashTensorFlow+107
View profile
SK

Sahithi K

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Boston, MA4y exp
ModernaUniversity of Massachusetts Dartmouth

“Data engineer with experience at Moderna and Block owning high-volume (≈10TB/day) production pipelines on AWS, using Kafka/S3/Glue/dbt/Snowflake with strong data quality and observability practices (schema validation, anomaly detection, CloudWatch monitoring). Also built external financial API ingestion with Airflow retries, throttling/token rotation, and schema versioning, and helped stand up an early-stage biomedical data platform with CI/CD and incident debugging.”

PythonSQLPySparkApache SparkApache KafkaAmazon Kinesis+94
View profile
YK

Yukta Kulkarni

Screened

Junior AI/ML Engineer specializing in applied LLMs, security, and reinforcement learning

New York, USA2y exp
New York UniversityNYU

“Built and shipped a production LLM-powered investor research feature for a fintech product, focused on grounded answers and minimizing hallucinations. Implemented retrieval-quality and evidence-coverage gating with clear refusal fallbacks, and evaluates systems with regression tests and metrics like correct-refusal rate, hallucination rate, and latency. Comfortable orchestrating workflows with LangChain or custom Python depending on production needs.”

PythonCC++SQLTypeScriptJavaScript+82
View profile
NV

Nikita Vivek Kolhe

Screened

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States1y exp
WorkUpUSC

“ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).”

PythonSQLCC++RMATLAB+90
View profile
SV

sai venkata

Screened

Senior Data Engineer specializing in cloud lakehouse and real-time streaming pipelines

Texas, USA6y exp
CVS HealthUniversity of Central Missouri

“Senior data engineer with experience in both healthcare (CVS Health) and financial services (Bank of America), building large-scale Azure lakehouse pipelines (30+ EHR sources, ~5TB) and real-time streaming services (Event Hubs/Kafka) for patient vitals. Strong focus on reliability and data quality (Great Expectations, monitoring/alerting, schema drift automation), with measurable outcomes like 50% runtime reduction and 99%+ uptime for regulatory reporting pipelines.”

PythonSQLScalaJavaShell ScriptingApache Spark+117
View profile
PN

Praveen Nutulapati

Screened

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY6y exp
JPMorgan ChaseUniversity of Central Missouri

“Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.”

A/B TestingAgileAmazon BedrockAmazon EC2Amazon EMRAmazon RDS+184
View profile
1...91011...56

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersData AnalystsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search