Reval Logo
Home Browse Talent Skilled in Apache Hadoop

Vetted Apache Hadoop Professionals

Pre-screened and vetted.

Apache HadoopPythonDockerSQLApache SparkAWS
NG

Niteesh Ganipisetty

Screened

Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision

Grand Rapids, MI4y exp
IntuitGrand Valley State University

“Built an LLM-powered learning assistant (EduQuizPro/EduCrest Pro) that uses RAG over URLs and PDFs to generate quizzes, notes, and explanations for students/professors. Emphasizes production robustness—implemented dependency fallbacks (FAISS/Sentence Transformers/Gradio), CLI-safe mode, and NumPy-based indexing—along with a custom orchestration layer to keep multi-step AI workflows reliable.”

A/B TestingAgileApache HadoopApache HiveApache KafkaApache Spark+112
View profile
AK

Aashna Kunkolienker

Screened

Junior AI Engineer specializing in agentic workflows and ML platforms

San Ramon, CA2y exp
SearceNYU

“Building a production LLM/agent system for a leading US dental provider that extracts rules from payer handbooks/portals and EDI 271 responses to validate and improve patient cost estimates. Combines GCP stack (BigQuery, GKE, Cloud Run, Pub/Sub, Vertex AI) with strong agent reliability practices (observability, validator agents, grounding, PII/hallucination guardrails, confidence scoring) and has led non-technical customer stakeholders on enterprise ServiceNow↔Aha sync and AI-powered enterprise search/summarization.”

PythonCC++JavaJavaScriptSQL+105
View profile
PP

Prateek Patil

Screened

Engineering Leader specializing in Digital Health, AI, and Cloud Platforms

Santa Clara, CA16y exp
RocheIllinois Institute of Technology

“Senior Engineering Manager at Roche leading two Scrum teams building internally shared (“inner-sourced”) tools and libraries for a healthcare enterprise. Has led security/compliance-first architecture decisions (e.g., Python AI modules running inside a Java container) and front-end modularization (Angular monorepo to module federation), with a strong focus on developer experience via automated Swagger/OpenAPI documentation and robust testing/versioning practices.”

JavaPythonObject-oriented programming (OOP)Design patternsAlgorithmsDistributed systems+112
View profile
SK

Sasi Katamneni

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications

Dallas, TX5y exp
Baylor Scott & WhiteUniversity of North Texas

“Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).”

A/B TestingAgileAjaxAmazon API GatewayAmazon BedrockAmazon CloudWatch+267
View profile
SR

Santhosh Reddy

Screened

Mid-level AI/ML Engineer specializing in deep learning, NLP/LLMs, and MLOps

MA, USA6y exp
Flatiron HealthClark University

“Built and shipped a real-time oncology risk prediction system used by doctors during patient visits, trained on clinical data in AWS SageMaker and deployed via FastAPI with sub-second responses. Emphasizes clinician-trust features (SHAP explainability, validation checks) and HIPAA-compliant controls (encryption, RBAC, audit logging), plus Kubernetes-based production operations with autoscaling, monitoring, and drift/retraining workflows; collaborated closely with oncologists at Flatiron Health.”

PythonRSQLJavaC++Bash+123
View profile
BK

Bharath kumar

Screened

Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps

Draper, UT12y exp
ThorneBharathiar University

“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”

A/B TestingAPI DevelopmentAPI TestingApache HadoopApache HiveApache Kafka+251
View profile
RW

Rebecca Witmer

Screened

Principal Data Scientist specializing in NLP and Generative AI

Chicago, IL9y exp
Witmer Consulting CorporationGeorgetown University

“ML/NLP practitioner with experience building an embedding-based ad matching and search system at Vericast (BERT embeddings + similarity search) to replace a third-party taxonomy approach, evaluated via a human-curated gold standard. Also built a custom NER pipeline at Allstate for auto accident claims calls using a bidirectional LSTM and achieved 90%+ F1, with a strong emphasis on production-grade ML workflows (testing, CI/CD, orchestration, versioning, validation).”

PythonPySparkRetrieval Augmented Generation (RAG)SQLOpenAIChatGPT+81
View profile
VR

Vineeth Reddy Vallapureddy

Screened

Mid-level Full-Stack Software Engineer specializing in backend microservices and enterprise AI tools

Redwood City, California5y exp
C3 AIUniversity at Buffalo

“Backend/platform engineer with experience across C3.ai (supply chain demand planning) and Amdocs (telecom), working on large-scale data systems and microservices. Has driven first-time adoption experiments of Snowflake + Spark to handle billion-record workloads, built Jenkins-to-Kubernetes delivery pipelines with Nexus artifact management, and implemented Kafka streaming between microservices with HA and retry/error-handling patterns.”

AWSBackend DevelopmentCC++CI/CDDebugging+117
View profile
MS

Mohan Shri Harsha Guntu

Screened

Mid-level Data Scientist / Machine Learning Engineer specializing in fraud, risk, and MLOps

Remote, MO7y exp
Northern TrustWebster University

“AI/ML practitioner with Northern Trust experience who has shipped production LLM systems (internal support assistant) using RAG, vector databases, orchestration (LangChain/custom pipelines), and rigorous monitoring/feedback loops. Also built AI-driven fraud detection/risk monitoring solutions in a regulated financial environment, emphasizing explainability (SHAP), audit readiness, and stakeholder trust through dashboards and clear communication.”

PythonRSQLPandasNumPyScikit-learn+137
View profile
NP

Nikita Prasad

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and scalable data pipelines

Remote, USA5y exp
JPMorgan ChaseUniversity of Dayton

“Built and shipped a production LLM-powered personalized client engagement assistant in the financial domain, balancing real-time recommendations with strict privacy/compliance requirements. Demonstrates strong MLOps/LLMOps depth (Airflow + MLflow, containerized microservices, drift monitoring) and a privacy-by-design approach validated in collaboration with risk and compliance teams.”

PythonPandasspaCyRSQLPySpark+199
View profile
NT

Niteesha Thottempudi

Screened

Mid-level Software Engineer specializing in cloud-native microservices and data platforms

Downingtown, PA5y exp
Pike SolutionsNYU

“Backend engineer with experience at Comcast and in healthcare/pharmacy automation (PrimeRx), building Python/Flask services that orchestrate large-scale batch workflows (Airflow) and high-throughput event processing (Kafka). Demonstrated measurable performance wins (cut provisioning latency to ~150–200ms) and strong multi-tenant isolation strategies (Postgres RLS, partitioning), plus practical integration of ML model outputs into production systems with validation and fallback controls.”

PythonJavaCC++JavaScriptHTML+113
View profile
HK

Harini Kv

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Dallas, TX7y exp
EquinixFitchburg State University

“GenAI/data engineering practitioner with production experience across Equinix, Optum, and Citibank—built an Azure OpenAI (GPT-4) + LangChain document intelligence platform processing 1.5M+ docs/month and a HIPAA-compliant Airflow healthcare pipeline handling 5M+ claims/day. Also delivered a real-time fraud detection + explainability system using LightGBM and a fine-tuned T5 NLG component, improving fraud accuracy by 15%+ while partnering closely with compliance stakeholders.”

PythonSQLPySparkBashJavaJavaScript+169
View profile
DD

Dhyey Desai

Screened

Intern AI/ML Engineer specializing in RAG, multimodal AI, and LLM systems

Los Angeles, California0y exp
NalaUSC

“Built and shipped 'PetPulse,' a production AI pet-health note system that records voice notes, transcribes them, converts transcripts into structured symptom/event data, and supports grounded Q&A over a user’s notes and vet PDFs. Demonstrates full-stack LLM product execution (FastAPI + GPT-4 + Firebase), with concrete reliability/performance work (async endpoints, caching, RAG/embeddings, function calling) and user-centered iteration with a non-technical product stakeholder.”

Apache HadoopBERTCCachingData VisualizationDatabricks+87
View profile
SS

Siva Sai Kumar Mogalluru

Screened

Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare

Remote, USA4y exp
EYUniversity of South Florida

“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”

A/B TestingAgileAnomaly DetectionApache AirflowApache SparkAzure DevOps+138
View profile
UC

Uday Chilakala

Screened

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and RAG systems

Atlanta, GA5y exp
Morgan StanleyKennesaw State University

“Machine learning/NLP engineer who built a production-oriented retrieval-based AI system at Morgan Stanley for healthcare use cases, combining RAG over unstructured patient records with deep-learning medical image segmentation (U-Net/Mask R-CNN). Strong in end-to-end pipelines and MLOps (Spark/MongoDB, AWS SageMaker, CI/CD, monitoring, automated retraining) and in entity resolution/data quality validation for noisy clinical data.”

PythonSQLFlaskApache SparkgRPCTensorFlow+125
View profile
AM

Ajay Madhusudhan Thumala

Screened

Junior Software Engineer specializing in data engineering and LLM applications

Irvine, CA1y exp
GeisingerUC Irvine

“Computer science engineer and master’s graduate who independently built a mechatronics-heavy capstone prototype: a smartphone concept for deafblind users using micro-actuator arrays for braille reading. Also has platform engineering experience at Quantiphi, deploying webhooks to Kubernetes and implementing GitOps-based CI/CD using AWS CodeCommit/CodeBuild and ECR.”

API DevelopmentAPI GatewayAWSBashCC+++206
View profile
VM

Vasavi Mittapalli

Screened

Senior Data Scientist specializing in GenAI, LLMs and RAG

Dallas, TX5y exp
Texas InstrumentsTrine University

“Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.”

A/B TestingAgileAmazon DynamoDBAmazon EC2Amazon EMRAmazon Kinesis+195
View profile
JV

Jaswanth Vakkala

Screened

Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP

Iselin, NJ5y exp
Wells FargoSt. Francis College

“Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.”

A/B TestingAnomaly DetectionApache HadoopApache HiveApache SparkAWS+224
View profile
HK

Harshitha Kotari

Screened

Mid-level Data/ML Engineer specializing in NLP, GenAI, and scalable data pipelines

5y exp
AbbottClarkson University

“AI/ML engineer with production experience building LLM-powered document intelligence and customer support systems in healthcare/insurance, emphasizing high-accuracy RAG, long-document processing, and robust monitoring/fallback mechanisms. Also automates and scales ML lifecycle workflows using Apache Airflow and Kubeflow, and partners closely with non-technical operations stakeholders to drive adoption.”

PythonRSQLJavaMATLABHTML+148
View profile
AJ

Aditya Jhaveri

Screened

Mid-level Software Engineer specializing in AI, big data, and distributed systems

Jersey City, NJ3y exp
New York UniversityNYU

“Software Developer at NYU (GEMSS) focused on scaling and optimizing a data-heavy asset management web app, including migrating/optimizing data access via Google Sheets API and Firestore. Previously an SDE at Sainapse working on Spring Boot microservices POCs (Kafka, Hadoop at 2B+ record scale). Built an end-to-end Apple Wallet coupon generation/redemption system using PassKit + Google Apps Script with measurable ops impact (40% efficiency gain).”

AgileAlgorithmsAnomaly DetectionApache HadoopApache HiveApache Kafka+124
View profile
SD

Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp
Wells FargoUniversity of North Texas

“Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.”

PythonSQLJavaScalaShell ScriptingTypeScript+153
View profile
AG

Abhinav Gupta

Screened

Junior Machine Learning Engineer specializing in LLMs and applied data science

2y exp
EsriUSC

“Built and shipped multiple production AI systems, including Auto DocGen (LLM-generated OpenAPI docs kept in sync via AST diffs, schema-constrained generation, and CI/CD on Render) and a multimodal sign-language recognition pipeline at USC orchestrated with FastAPI, MediaPipe, and PyTorch. Also partnered with Esri’s non-technical community team to fine-tune an LLaMA-based spam classifier with a review UI, cutting moderation time by 70%.”

PythonPandasNumPyScikit-learnJavaScriptTypeScript+126
View profile
BG

Bernard Griffin

Screened

Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI

Baltimore, MD17y exp
IntelIllinois Institute of Technology

“ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness—data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).”

A/B TestingAmazon BedrockAmazon EC2Amazon EMRAmazon KinesisAmazon Redshift+130
View profile
1...151617...57

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersData AnalystsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search