Reval Logo
Home Browse Talent Skilled in Apache Hadoop

Vetted Apache Hadoop Professionals

Pre-screened and vetted.

Apache HadoopPythonDockerSQLApache SparkAWS
PK

Parth Kasat

Screened

Mid-level Forward Deployed Engineer specializing in AI automation for finance and data platforms

Remote2y exp
ArganoGeorge Washington University

“LLM/agentic workflow specialist with healthcare deployment experience who has taken LLM-based automation from prototype to production using operator-in-the-loop validation, RAG-style retrieval, RBAC, and monitoring for sensitive data compliance. Demonstrated real-time incident resolution (retrieval timeouts due to network/proxy misconfig) and strong GTM support—hands-on developer workshops and sales demos translating technical safeguards and real-time ETL into measurable ROI (70% ops reduction, ~$200K/year savings).”

A/B TestingAPI IntegrationAzure Data FactoryAzure DevOpsC++Containerization+124
View profile
CV

Cristian Vega

Screened

Senior AI/ML Engineer specializing in Generative AI and RAG

California, null9y exp
Morf HealthUniversity of Texas at Austin

“ML/NLP practitioner at Morf Health focused on unifying fragmented healthcare data by linking structured patient/encounter records with unstructured clinical notes. Has hands-on experience with transformer embeddings, vector databases, and domain fine-tuning, plus rigorous evaluation (precision/recall) and human-in-the-loop validation with clinical SMEs to make pipelines production-grade.”

PythonRJavaJavaScriptSQLMySQL+154
View profile
AR

Anurag Reddy

Screened

Mid-level Data Scientist specializing in ML, MLOps, and Generative AI

TX, USA5y exp
CaterpillarUniversity of Illinois Chicago

“ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.”

A/B TestingAgileAnomaly DetectionAnsibleApache AirflowApache Hadoop+138
View profile
NT

nithish thotla

Screened

Mid-level Full-Stack Java Developer specializing in cloud-native microservices and data streaming

Atlanta, GA6y exp
VisaMissouri University of Science and Technology

“Software engineer with payments-domain experience (Visa) building real-time transaction monitoring and analytics systems. Strong end-to-end ownership across Spring Boot/Kafka microservices, PostgreSQL modeling, and AWS/Kubernetes operations, plus React+TypeScript dashboards—focused on low-latency processing, secure APIs, and zero-downtime production releases.”

JavaSpring FrameworkSpring BootSpring MVCSpring CloudHibernate+206
View profile
RK

Rohith kollu

Screened

Senior Software Engineer specializing in backend microservices, cloud, and full-stack systems

Dallas, TX7y exp
CiscoIndiana Wesleyan University

“Backend/platform engineer who has built and scaled production Java/Spring Boot + Kafka services on AWS/Kubernetes (1M+ msgs/day) and led reliability/performance fixes that restored SLAs (25–30% latency improvement; 99.9% uptime). Also shipped an AI customer-support chatbot end-to-end using retrieval + guardrails and rigorous evaluation/observability, improving resolution time 40% and satisfaction 25%, with a strong plan/execute/verify approach to agentic workflow reliability.”

Amazon CloudFrontAmazon CloudWatchAmazon EC2Amazon RDSAmazon S3Apache Hadoop+154
View profile
SH

Sri Harsha patallapalli

Screened

Mid-level Machine Learning & Data Infrastructure Engineer specializing in MLOps on AWS

Boston, MA5y exp
Dextr.aiNortheastern University

“Built and deployed a fine-tuned Qwen 2.5 14B model into production at Dextr.ai as the backbone for hotel-operations agentic workflows, running on AWS EKS with Triton and TensorRT-LLM. Demonstrates strong cost-aware LLM engineering (QLoRA, FP8/BF16 on H100) plus rigorous benchmarking/observability (Prometheus, LangSmith) with reported sub-30ms TTNT. Previously handled long-running ETL orchestration with Airflow at GE Healthcare and Lowe's.”

PythonJavaC++SQLJavaScriptBash+113
View profile
PV

PAVAN VARMA PENMETHSA

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY6y exp
AvanadeUniversity of North Texas

“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”

Machine LearningGenerative AILarge Language Models (LLMs)Prompt EngineeringRetrieval-Augmented Generation (RAG)Embeddings+131
View profile
MN

Mohan Naik Megavath

Screened

Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms

Remote, USA4y exp
TruistElmhurst University

“Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.”

Amazon DynamoDBAmazon EC2Amazon RedshiftAmazon S3AngularJSApache Hadoop+137
View profile
YL

Yaoxin Liu

Screened

Intern Full-Stack Software Engineer specializing in real-time web systems

New York, NY0y exp
VenuePilotNYU

“Built and iterated an end-to-end virtual waiting room for a real-time ticketing prototype, making concrete architecture tradeoffs (polling + Redis Pub/Sub) and improving performance post-launch with Redis caching (+30% throughput, -15% p99 latency). Also has hands-on experience building Spark/HDFS ETL pipelines with strong reliability/observability patterns and running disciplined NLP model evaluation loops on review-rating classification.”

PythonJavaJavaScriptTypeScriptSQLC+89
View profile
PR

Preetham Reddy Konuganti

Screened

Junior Full-Stack Engineer specializing in AI applications and scalable web platforms

San Jose, CA2y exp
Cognia SecurityArizona State University

“Full-stack engineer with customer-facing delivery experience who built and deployed a multi-platform social media automation product (Next.js/Node/MongoDB) and optimized it using BullMQ/Redis background jobs, retries, and rate limiting for reliable posting at scale. Also delivered an AI-powered false-positive analysis service in a cybersecurity context, resolving production pipeline stalls via log-driven debugging, parallelization, caching, and LLM guardrails.”

AgileAmazon EC2Amazon S3Amazon SNSAmazon SQSAngular+126
View profile
UK

Uday kumar swamy

Screened

Senior Machine Learning Engineer specializing in MLOps and NLP/GenAI

Chicago, USA9y exp
UnitedHealth GroupIllinois Institute of Technology

“Built a production LLM-agent framework for a startup that performs daily financial/trading analysis by combining live market data with internal tools, including a centralized memory module to prevent context drift and reduce hallucinations. Also implemented an Airflow-orchestrated retail price forecasting pipeline deployed to AWS endpoints, scaling parallel workloads via Kubernetes Executor and validating systems with rigorous functional + LLM-specific metrics and cross-team collaboration.”

PythonSQLRJavaScikit-learnTensorFlow+126
View profile
KR

Krishnakaanth Reddy Yeduguru

Screened

Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps

Texas, USA4y exp
McKessonUniversity of Texas at Arlington

“AI/ML engineer with healthcare domain depth who led a HIPAA-compliant, production LLM system at McKesson to automate clinical document understanding—extracting entities, summarizing provider notes, and supporting authorization decisions. Hands-on across Spark/Python ETL, Hugging Face + LoRA/QLoRA fine-tuning, RAG, and cloud-native MLOps (Airflow/Kubernetes/Step Functions, MLflow, blue-green on EKS/GKE), with explicit work on PHI handling and hallucination reduction.”

PythonC++SQLBashTensorFlowPyTorch+129
View profile
BB

BHARATH BHOOTHPUR

Screened

Mid-level Data Analyst specializing in healthcare and finance analytics

New Jersey, USA5y exp
Omada HealthRowan University

“Built an end-to-end Alexa smart-home IoT application controlling a Wi-Fi bulb, including ESP32 firmware (MQTT) and an AWS serverless backend (IoT Core/Device Shadow, Lambda, DynamoDB) with a REST API. Demonstrates strong real-time scalability patterns (streaming ingestion, stateless processing, partition-key design) and full-stack delivery with Spring Boot + React (JWT auth, CORS, data-heavy dashboards).”

PythonSQLRNumPyPandasMatplotlib+113
View profile
SP

Sushma Puchakayala

Screened

Mid-level Data Analyst specializing in AI/ML and advanced analytics

USA3y exp
AccentureMurray State University

“Accenture data/ML practitioner who deployed a retail churn prediction and BERT-based sentiment analysis system to production, integrating behavioral + feedback data and operationalizing it with ETL automation, orchestration, and CI/CD. Experienced managing 2TB+ multi-source data, monitoring drift in Databricks, and translating results into Power BI dashboards for marketing teams (including K-means customer segmentation).”

PythonPandasNumPyMatplotlibScikit-learnSeaborn+122
View profile
JB

Jaideep bommidi

Screened

Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps

Denton, TX8y exp
Webster BankUniversity of North Texas

“Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.”

A/B TestingAgileAmazon EC2Amazon EKSAmazon ECSAmazon Kinesis+181
View profile
RG

Rohan Gore

Screened

Intern AI/ML Engineer specializing in agentic systems and full-stack development

New York City, NY0y exp
MARV CapitalNYU

“Built and scaled a multi-agent LLM automation pipeline during a fintech internship, growing from a rapid 1-week proof-of-concept to a 15+ agent hierarchical system that cut market brief report generation time from ~5 hours to under 30 minutes. Hands-on with agent frameworks (Haystack, CrewAI, LangChain) and experienced in debugging agent communication issues via sandboxed modular testing and context/token management; also regularly gives architecture-first technical demos at multiple hackathons and university events.”

Apache CassandraApache HadoopApache KafkaAWSAWS LambdaC#+93
View profile
KK

Krishna Kandlakunta

Screened

Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning

United States5y exp
CitigroupUniversity of North Texas

“Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.”

A/B TestingAgileAnomaly DetectionApache HadoopApache HiveApache Kafka+167
View profile
RK

Ram Kottala

Screened

Mid-level Data & GenAI Engineer specializing in lakehouse, streaming, and RAG platforms

Michigan, USA5y exp
FordWebster University

“Built a production internal LLM-powered knowledge assistant using a RAG architecture (Python, LLM APIs, cloud services) that answers employee questions with sourced, grounded responses from internal documents. Demonstrates strong practical depth in retrieval tuning (chunking/metadata filters), orchestration with LangChain, and production reliability practices (latency optimization, automated embedding refresh, evaluation metrics, logging/monitoring) while partnering closely with non-technical operations teams.”

PythonPySparkScalaJavaRSQL+173
View profile
DL

Dharanidharan Loganathan

Screened

Senior Python Developer specializing in data engineering, MLOps, and cloud platforms

Dallas, TX13y exp
CBREAnna University

“Backend/data engineer with production experience building secure Django/DRF APIs (JWT RS256 + rotating refresh tokens), background processing with Celery, and strong reliability practices (timeouts, retries/backoff, structured logging, audit trails). Has delivered AWS solutions spanning Lambda + ECS with IaC/CI-CD and built Glue/PySpark ETL pipelines with schema evolution and data-quality quarantine patterns; also modernized a legacy SAS pipeline to Python/PySpark with parallel-run parity validation and phased rollout.”

PythonC#C++GoJavaJavaScript+170
View profile
YC

Yijun Chen

Screened

Senior Full-Stack Software Developer specializing in IoT and cloud systems

Toronto, ON4y exp
PulsenicsUniversity of Toronto

“Frontend-focused engineer who built a full movie recommendation system from concept to production, comparing classic collaborative filtering with LLM-based recommendation approaches on AWS. Emphasizes scalable architecture, strict TypeScript data contracts, and high-quality Next.js/React UI patterns (defensive states, scoped state management, performance optimization) with disciplined QA and feature-flagged rollouts.”

AgileApache HadoopApache KafkaApache SparkAzure Data FactoryAzure DevOps+82
View profile
TM

Tejal Mane

Screened

Mid-level Machine Learning Engineer specializing in GenAI, LLMs, and real-time ML systems

Moundsville, WV4y exp
CitiusTechUniversity of Michigan

“Built and deployed a production long-form article summarization system using BART/T5/PEGASUS, tackling real-world constraints like token limits, latency/quality tradeoffs, and factual drift via chunking/merge logic and constrained decoding. Uses pragmatic Python-based pipeline orchestration (scheduled jobs, modular scripts, logging/retries) and iterates with stakeholder feedback to make outputs genuinely useful for content workflows.”

AgileApache HadoopApache KafkaAWSCI/CDCUDA+112
View profile
RB

Rushir Bhavsar

Screened

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp
Cadence Design SystemsArizona State University

“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”

AngularApache SparkAWSAWS CloudFormationAWS LambdaBitbucket+121
View profile
KK

Kranthi Kumar Karupati

Screened

Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps

Remote, United States6y exp
AccentureEastern Illinois University

“LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).”

Amazon API GatewayAmazon BedrockAmazon CloudWatchAmazon DynamoDBAmazon EKSAmazon ECS+168
View profile
HV

Harini Vinu

Screened

Intern Software Engineer specializing in cloud, big data, and test automation

New York, United States1y exp
QualitestNYU

“Internship experience at Qualitest building and deploying an LLM-powered test automation system that reduced manual test creation and improved efficiency (~40%). Demonstrates strong production engineering for LLM systems (timeouts/retries/monitoring/caching, prompt optimization, batching) and has scaled workflows to 100+ concurrent jobs; also has orchestration experience with AWS Step Functions and Kubernetes.”

Amazon CloudWatchAmazon DynamoDBAmazon KinesisAmazon S3Amazon SQSAmazon API Gateway+149
View profile
1...242526...57

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersData AnalystsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search