Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Hadoop Professionals

Pre-screened and vetted.

Apache Hadoop Python SQL Docker AWS Apache Spark

Jathin Shettigar

Screened

Intern Software Engineer specializing in edge AI deployment and distributed systems

San Francisco, CA1y exp

Zetic AISan José State University

“Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).”

Python Go JavaScript TypeScript Bash C+205

View profile

Aashna Kunkolienker

Screened

Junior AI Engineer specializing in agentic workflows and ML platforms

San Ramon, CA2y exp

SearceNYU

“Building a production LLM/agent system for a leading US dental provider that extracts rules from payer handbooks/portals and EDI 271 responses to validate and improve patient cost estimates. Combines GCP stack (BigQuery, GKE, Cloud Run, Pub/Sub, Vertex AI) with strong agent reliability practices (observability, validator agents, grounding, PII/hallucination guardrails, confidence scoring) and has led non-technical customer stakeholders on enterprise ServiceNow↔Aha sync and AI-powered enterprise search/summarization.”

Python C C++Java JavaScript SQL+105

View profile

Prateek Patil

Screened

Engineering Leader specializing in Digital Health, AI, and Cloud Platforms

Santa Clara, CA16y exp

RocheIllinois Institute of Technology

“Senior Engineering Manager at Roche leading two Scrum teams building internally shared (“inner-sourced”) tools and libraries for a healthcare enterprise. Has led security/compliance-first architecture decisions (e.g., Python AI modules running inside a Java container) and front-end modularization (Angular monorepo to module federation), with a strong focus on developer experience via automated Swagger/OpenAPI documentation and robust testing/versioning practices.”

Java Python Object-oriented programming (OOP)Design patterns Algorithms Distributed systems+112

View profile

Niteesh Ganipisetty

Screened

Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision

Grand Rapids, MI4y exp

IntuitGrand Valley State University

“Built an LLM-powered learning assistant (EduQuizPro/EduCrest Pro) that uses RAG over URLs and PDFs to generate quizzes, notes, and explanations for students/professors. Emphasizes production robustness—implemented dependency fallbacks (FAISS/Sentence Transformers/Gradio), CLI-safe mode, and NumPy-based indexing—along with a custom orchestration layer to keep multi-step AI workflows reliable.”

A/B Testing Agile Apache Hadoop Apache Hive Apache Kafka Apache Spark+112

View profile

John Ward

Screened

Executive Applications Development & Analytics Leader specializing in enterprise transformation

Chicago, IL30y exp

AppsIntelligentUniversity of Illinois Urbana-Champaign

“Candidate has prior startup experience building systems and has firsthand experience with a venture that lost angel funding. They show thoughtful reflection on why the startup failed—emphasizing unclear success criteria, weak funding planning, and lack of team consensus—and would seek experienced advisors earlier in future ventures.”

Angular AWS Budgeting Digital transformation Docker Financial modeling+70

View profile

Mishika Garg

Screened

Junior Data Analyst specializing in business analytics and BI

Bengaluru, India3y exp

DeloitteMichigan State University

“Analytics-focused candidate with hands-on experience building SQL data pipelines and Python-based forecasting workflows for inventory and planning use cases. They emphasize data quality, stakeholder trust, and operational adoption, citing a 19% forecast accuracy improvement and strong experience translating analytics into dashboard-ready business metrics.”

SQL Python Pandas NumPy Matplotlib Scikit-learn+70

View profile

Abhishek Adinarayanappa

Screened

Junior Software Engineer specializing in backend, cloud, and machine learning systems

Miami, FL3y exp

Marketeq Digital Inc.NYU

“Built Digipulse, a university project that ingested and clustered Bluesky tweet data at scale and used Gemini to generate near-real-time topic summaries, processing 1M+ tweets per day. Also brings Intel experience with Prometheus and Kubernetes, including production monitoring and incident troubleshooting.”

Python C C++Go Java JavaScript+92

View profile

Zahra Shergadwala

Screened

Junior Machine Learning Engineer specializing in AI, computer vision, and data systems

Los Angeles, CA2y exp

WiDeS Lab, USCUSC

“Built and owned an end-to-end AV operations automation and dashboarding platform for USC event operations, used daily to coordinate hundreds of live events. Delivered a React/TypeScript full-stack system integrating Smartsheet APIs with strong reliability practices (typed contracts, validation/fallbacks, safe rollouts) and experience with queue-based microservice patterns (idempotency, retries, DLQs, monitoring).”

Python SQL JavaScript Java MySQL PostgreSQL+221

View profile

Ansh Bajaj

Screened

Senior Data Engineer specializing in cloud analytics and data modernization

Los Angeles, CA9y exp

DeloitteUniversity of the Cumberlands

“Candidate has hands-on experience delivering production data and AI systems, including an AWS-based real-time data platform for a financial client at Deloitte and a production RAG workflow that cut manual search time by 40%. They stand out for combining strong data engineering depth with practical LLM governance, incident debugging, and stakeholder management across business and risk/compliance teams.”

Auto-scaling ETL Pipelines Data Governance Analytics Data Analysis Training+105

View profile

Neeshma Narahari

Screened

Mid-level Software Developer specializing in backend microservices and cloud platforms

Irving, TX6y exp

McKessonUniversity of Central Missouri

“Full-stack product engineer with strong React and TypeScript depth who has owned dashboard features end-to-end, from UI architecture and rendering optimization through Spring Boot APIs and database query tuning. Particularly compelling for startup or high-growth teams: they’ve shipped 0→1 internal operations platforms, prioritized MVP workflows effectively, and iterated post-launch using user feedback, logs, and usage metrics.”

Java Python JavaScript TypeScript SQL Spring Boot+113

View profile

Aakash Khepar

Screened

Mid-level Full-Stack AI Engineer specializing in agentic AI systems

Tempe, AZ4y exp

Arizona State UniversityArizona State University

“AI/full-stack builder with hands-on experience shipping healthcare, career-tech, nonprofit, and fintech products, spanning speech AI, browser extensions, agentic RAG systems, and enterprise ML monitoring. Stands out for combining strong technical depth with measurable outcomes, including reducing clinical call WER from 26% to 3%, building safe tool-using agents with rollback/RBAC, and delivering zero-to-one multi-tenant platform features in ambiguous environments.”

Python TypeScript JavaScript Java SQL NoSQL+259

View profile

Mohan Shri Harsha Guntu

Screened

Mid-level Data Scientist / Machine Learning Engineer specializing in fraud, risk, and MLOps

Remote, MO7y exp

Northern TrustWebster University

“AI/ML practitioner with Northern Trust experience who has shipped production LLM systems (internal support assistant) using RAG, vector databases, orchestration (LangChain/custom pipelines), and rigorous monitoring/feedback loops. Also built AI-driven fraud detection/risk monitoring solutions in a regulated financial environment, emphasizing explainability (SHAP), audit readiness, and stakeholder trust through dashboards and clear communication.”

Python R SQL Pandas NumPy Scikit-learn+137

View profile

Nikita Prasad

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and scalable data pipelines

Remote, USA5y exp

JPMorgan ChaseUniversity of Dayton

“Built and shipped a production LLM-powered personalized client engagement assistant in the financial domain, balancing real-time recommendations with strict privacy/compliance requirements. Demonstrates strong MLOps/LLMOps depth (Airflow + MLflow, containerized microservices, drift monitoring) and a privacy-by-design approach validated in collaboration with risk and compliance teams.”

Python Pandas spaCy R SQL PySpark+199

View profile

Harini Kv

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Dallas, TX7y exp

EquinixFitchburg State University

“GenAI/data engineering practitioner with production experience across Equinix, Optum, and Citibank—built an Azure OpenAI (GPT-4) + LangChain document intelligence platform processing 1.5M+ docs/month and a HIPAA-compliant Airflow healthcare pipeline handling 5M+ claims/day. Also delivered a real-time fraud detection + explainability system using LightGBM and a fine-tuned T5 NLG component, improving fraud accuracy by 15%+ while partnering closely with compliance stakeholders.”

Python SQL PySpark Bash Java JavaScript+169

View profile

Siva Sai Kumar Mogalluru

Screened

Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare

Remote, USA4y exp

EYUniversity of South Florida

“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”

A/B Testing Agile Anomaly Detection Apache Airflow Apache Spark Azure DevOps+138

View profile

Uday Chilakala

Screened

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and RAG systems

Atlanta, GA5y exp

Morgan StanleyKennesaw State University

“Machine learning/NLP engineer who built a production-oriented retrieval-based AI system at Morgan Stanley for healthcare use cases, combining RAG over unstructured patient records with deep-learning medical image segmentation (U-Net/Mask R-CNN). Strong in end-to-end pipelines and MLOps (Spark/MongoDB, AWS SageMaker, CI/CD, monitoring, automated retraining) and in entity resolution/data quality validation for noisy clinical data.”

Python SQL Flask Apache Spark gRPC TensorFlow+125

View profile

Sai Gowtham Madaka

Screened

Mid-level Data Engineer specializing in streaming and cloud data platforms for financial services

Edison, NJ3y exp

Morgan StanleyPace University

“Data engineering-focused candidate (internship/project experience) who built end-to-end pipelines processing a few million transactional records/day for fraud detection and reporting, using Airflow, Python/SQL, and PySpark with strong emphasis on data quality gates, idempotency, and monitoring. Also implemented an external web/API data collection system with anti-bot tactics and schema-change quarantine, and shipped a versioned Flask API to serve curated warehouse data.”

Apache Airflow Apache Hadoop Apache Hive Apache Kafka Apache Spark AWS+82

View profile

Vineeth Reddy Vallapureddy

Screened

Mid-level Full-Stack Software Engineer specializing in backend microservices and enterprise AI tools

Redwood City, California5y exp

C3 AIUniversity at Buffalo

“Backend/platform engineer with experience across C3.ai (supply chain demand planning) and Amdocs (telecom), working on large-scale data systems and microservices. Has driven first-time adoption experiments of Snowflake + Spark to handle billion-record workloads, built Jenkins-to-Kubernetes delivery pipelines with Nexus artifact management, and implemented Kafka streaming between microservices with HA and retry/error-handling patterns.”

AWS Backend Development C C++CI/CD Debugging+117

View profile

Dhyey Desai

Screened

Intern AI/ML Engineer specializing in RAG, multimodal AI, and LLM systems

Los Angeles, California0y exp

NalaUSC

“Built and shipped 'PetPulse,' a production AI pet-health note system that records voice notes, transcribes them, converts transcripts into structured symptom/event data, and supports grounded Q&A over a user’s notes and vet PDFs. Demonstrates full-stack LLM product execution (FastAPI + GPT-4 + Firebase), with concrete reliability/performance work (async endpoints, caching, RAG/embeddings, function calling) and user-centered iteration with a non-technical product stakeholder.”

AI Agents Apache Hadoop BERT C Caching Data Visualization+87

View profile

Adithya Chittajallu

Screened

Mid-level AI/ML Engineer specializing in LLM systems, MLOps, and Healthcare AI

Remote, USA5y exp

CVS HealthUniversity of Missouri-Kansas City

“Built and shipped a production-grade agentic RAG system at CVS Health for patient adherence and medication recommendations, processing 20k+ patient records/day. Strong focus on real-world reliability: hybrid retrieval tuned with re-ranking (<400ms latency), strict JSON/schema validation and tool guardrails, and monitoring/drift detection that reduced MTTD from 6 days to 18 hours while improving recommendation accuracy (+8%) and cutting escalations (~23%).”

Python SQL Bash Git PyTorch TensorFlow+107

View profile

Jing Yang

Screened

Senior Machine Learning Engineer specializing in NLP and generative AI

McLean, VA8y exp

Capital OneUniversity of Utah

“ML/AI engineer focused on production NLP and voice AI systems in the restaurant tech space, with hands-on work spanning ASR, intent classification, LLM fine-tuning, and deployment monitoring at Presto AI. They highlight a 15% improvement in full-AI ordering rate and also built a restaurant sentiment analysis product at Wisely that they say became a standout feature in a $10M acquisition context.”

Deep Learning TensorFlow PyTorch AWS Amazon SageMaker OpenAI+107

View profile

Chaitanya Prasad Reddy Narala

Screened

Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems

USA4y exp

ServiceNowSaint Louis University

“Senior AI/ML engineer focused on production LLM systems, combining RAG, fine-tuning, distributed training, and AI safety to ship scalable real-time moderation and conversational AI platforms. Stands out for pairing deep AWS/Kubernetes MLOps expertise with measurable impact: 40% lower latency/cost, 30-50% fewer hallucinations, and major reliability gains through observability and automation.”

Python Java SQL R Scikit-learn XGBoost+139

View profile

Sachin Komati

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG, and healthcare ML

Florida, USA5y exp

BlackRockFlorida International University

“Built an end-to-end GenAI/RAG platform for financial compliance and research at BlackRock, focused on safe, auditable answers in a highly regulated environment. Combines strong LLM engineering depth with production platform skills and delivered clear business impact, including reducing research/compliance turnaround from hours to seconds, improving retrieval relevance by 22%, and cutting inference costs by 75%.”

SDLC Agile MLOps Cross-Functional Collaboration Machine Learning Deep Learning+134

View profile

Machine Learning Engineers Software Engineers Data Engineers Data Scientists Data Analysts Software Developers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?