Vetted Apache Hadoop Professionals

Pre-screened and vetted.

DT

Mid-level Full-Stack Engineer specializing in cloud-native enterprise and FinTech systems

Sunnyvale, CA · 6y exp
Walmart · California State University, East Bay
View profile
NJ

Director-level Technology & Management Consultant specializing in software delivery, cloud, and healthcare IT

Clarence Center, NY · 24y exp
Bruin Biometrics
View profile
ST

Senior AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems

7y exp
CVS Health
View profile
SD

Senior Data Scientist specializing in NLP, MLOps, and cloud ML platforms

Westfield Center, OH · 7y exp
Westfield Insurance
View profile
KR

Senior AI Python Engineer specializing in Generative AI and MLOps

San Francisco, CA · 8y exp
Silicon Valley Bank
View profile
VN

Mid-level Software Engineer specializing in ML, LLM apps, and cloud data systems

Tracy, California · 4y exp
Genea · UC Santa Cruz

Built a production SQL chatbot for access-log analytics that replaced manual custom report requests with natural-language querying, using LangGraph and a ChromaDB-backed RAG pipeline for grounded, consistent answers. Implemented a privacy-preserving design in which the LLM never sees raw customer data (only query metadata). Also has experience building multi-agent/tool-calling systems with LangGraph (DeepAgents), including resolving sub-agent communication drift via self-reflection.

View profile
YL

Yurong Luo

Screened

Senior Data Scientist/ML Engineer specializing in scalable ML and LLM systems

Remote · 9y exp
dataAnnotation · Virginia Commonwealth University

Built and deployed an end-to-end product that brings a research-paper approach into production for large-scale time-series clustering, with attention to partitioning, latency, and scalability. Also designed a Python-based backend validation service (comparing outputs to database ground truths) and handled production reliability issues by reproducing dataset-specific crashes and hardening corner-case behavior with client-friendly errors.

View profile
SN

Senior Data Engineer specializing in cloud data platforms and ML pipelines

Atlanta, GA · 8y exp
Berkshire Hathaway · University of Alabama at Birmingham

Data engineer focused on AWS-based enterprise data platforms, owning end-to-end pipelines from multi-source batch/stream ingestion (Glue/Kinesis/StreamSets/Airflow) through PySpark transformations into curated datasets for Redshift/Snowflake. Emphasizes production reliability with strong monitoring/observability and data quality gates, and reports ~30% performance improvement plus improved SLAs and latency after optimization.

View profile
RG

Mid-level Backend Python Engineer specializing in APIs, microservices, and data pipelines

USA · 4y exp
Marsh McLennan · Florida Atlantic University

Backend engineer (Marsh McLennan) who evolved a high-volume claims automation pipeline in Python, emphasizing thin APIs with background job processing, strong validation/retries, and production-grade observability. Experienced in secure FastAPI API design (centralized JWT/RBAC), multi-tenant Postgres/Supabase-style row-level security, and low-risk refactors using parallel runs and feature flags; targeting founding-engineer scope roles.

View profile
VK

Varshitha K

Screened

Mid-level Data Engineer specializing in cloud data platforms and lakehouse architectures

Lakewood, CO · 4y exp
First Bank · University of Central Missouri

Data engineer in a banking context who has owned end-to-end Azure lakehouse pipelines ingesting financial/vendor data from APIs, Azure SQL, and flat files into Databricks/Delta (bronze-silver-gold). Emphasizes production reliability via schema-drift validation, data quality controls, monitoring/alerting, retries/checkpointing, and Spark/Delta performance tuning, with outputs served to BI/reporting teams (e.g., Tableau).

View profile
Chandan Chalumuri

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Tempe, AZ · 4y exp
MetLife · Arizona State University

Data engineering / ML practitioner with experience at MetLife building transformer-based sentiment analysis over large unstructured datasets and productionizing pipelines with Airflow/PySpark/Hadoop (reported 52% efficiency gain). Also implemented embedding-based semantic search using Pinecone/Weaviate to improve retrieval relevance and enable RAG for customer support and document matching use cases.

View profile
Mike Gardiner

Screened

Technology Executive / Engineering Director specializing in AI-driven platform transformation

Lehi, UT · 12y exp
Vivint · Weber State University

Built a 0-to-1 iOS mobile gardening application that helps users plan, track, and harvest crops with pest control guidance, weather, and climate-zone-based planting date recommendations. Demonstrated strong customer discovery and MVP-first product execution, including a major data challenge: compiling US climate zone data for every ZIP code from widely dispersed public sources into an app-ready database.

View profile
AA

Agna Antony

Screened

Mid-level Data Engineer specializing in cloud-native healthcare and enterprise data platforms

Michigan, USA · 5y exp
MedStar Health · APJ Abdul Kalam Technological University

Data Engineer (TCS) who owned an end-to-end CRM analytics pipeline for Bayer’s eSalesWeb integration, ingesting from Salesforce APIs/databases/S3 and serving analytics-ready datasets via PostgreSQL/S3 for Tableau. Drove measurable outcomes: ~60% reduction in manual data-quality effort, ~30% lower latency through SQL optimization, and ~35% improved stability via monitoring, retries, and idempotent processing.

View profile
FM

Senior AI/ML Engineer specializing in healthcare AI and MLOps

Mansfield, TX · 16y exp
McKesson · Sam Houston State University

Healthcare AI engineer with hands-on ownership of production ML and LLM systems at McKesson, spanning clinical risk prediction and RAG-based documentation tools. Stands out for combining deep clinical-data experience, HIPAA-aware deployment practices, and measurable impact through reduced readmissions, clinician workflow gains, and 20% to 30% faster ML delivery for engineering teams.

View profile
Apoorv Bankey

Screened

Mid-level Backend Engineer specializing in distributed systems and FinTech

New York City, NY · 6y exp
Rutgers University · Rutgers University

Engineer who uses AI and multi-agent workflows as a force multiplier while keeping architecture, security, scalability, and production quality under human control. Shared a concrete example of accelerating a backend-heavy SaaS email ingestion platform with authentication, role-based APIs, database models, and deployment setup using agent-style development and review.

View profile
Yogita Adari

Screened

Mid-level AI Engineer specializing in generative and multimodal systems

San Francisco, CA · 4y exp
Handshake AI · Syracuse University

Built and productionized an agentic LLM automation system for an insurance client to determine medication eligibility, using prompt-chaining plus a RAG pipeline over policy rules and deploying on AWS (Lambda/Step Functions, Bedrock) with a serverless architecture. Addressed major data/schema mismatch issues via a semantic matching pipeline and validated performance through human agreement scoring, A/B testing, KPI monitoring, and confidence-based human-in-the-loop review.

View profile
YN

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Michigan, USA · 3y exp
Ally Financial · University of Michigan-Dearborn

GenAI/ML engineer with production experience at Cognizant and Ally Financial, building end-to-end LLM/RAG systems and ML pipelines. Delivered a domain chatbot trained from 90k tickets and 45k docs, improving intent accuracy (65%→83%), scaling to 800+ concurrent users with 99.2% uptime and sub-150ms latency, and driving +14% customer satisfaction. Strong in Azure ML + DevOps CI/CD, Dockerized deployments, and explainable/PII-safe modeling using SHAP/LIME to satisfy stakeholder trust and GDPR needs.

View profile
KP

Mid-level Full-Stack Java Developer specializing in cloud-native microservices and React

5y exp
Northern Trust · Central Michigan University

Full-stack engineer who owned enterprise workflow platforms end-to-end at Northern Trust and Elevance Health, building NestJS/Java Spring Boot APIs, React UIs, and cloud deployments on GCP Cloud Run. Strong in data-heavy applications (hundreds of thousands of records) with proven production performance tuning (indexing/query rewrites, Cloud Run concurrency/min instances) and secure RBAC via Azure AD.

View profile
AB

Senior Data & Platform Engineer specializing in cloud-native streaming and distributed systems

USA · 10y exp
JPMorgan Chase · New York Institute of Technology

Financial data engineer who has built and operated high-volume batch + streaming pipelines (200–300 GB/day; 5–10k events/sec) using AWS, Spark/Delta, Airflow, Kafka, and Snowflake, with strong emphasis on data quality and reliability. Demonstrated measurable impact via 99.9% SLA adherence, major reductions in bad records/nulls, MTTR improvements, and significant latency/runtime/query performance gains; also built a distributed web-scraping system processing 5–10M records/day with anti-bot and schema-drift defenses.

View profile
MS

Mid-level Data Engineer specializing in multi-cloud data platforms for healthcare and finance

USA · 6y exp
Cigna · University of Cincinnati

Data engineer with Cigna experience building and operating an end-to-end AWS-based healthcare claims pipeline processing ~2TB/day, using Glue/Kafka/PySpark/SQL into Redshift. Strong focus on data quality and reliability (schema validation, monitoring/alerting, retries/checkpointing/backfills), reporting improved accuracy (~99%) and reduced latency, plus experience serving real-time Kafka/Spark data to downstream analytics with documented data contracts.

View profile