Vetted Apache Hadoop Professionals

Pre-screened and vetted.

NJ

Director-level Technology & Management Consultant specializing in software delivery, cloud, and healthcare IT

Clarence Center, NY
24y exp
Bruin Biometrics
ST

Senior AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems

7y exp
CVS Health
SD

Senior Data Scientist specializing in NLP, MLOps, and cloud ML platforms

Westfield Center, OH
7y exp
Westfield Insurance
KR

Senior AI Python Engineer specializing in Generative AI and MLOps

San Francisco, CA
8y exp
Silicon Valley Bank
VN

Mid-level Software Engineer specializing in ML, LLM apps, and cloud data systems

Tracy, California
4y exp
Genea
UC Santa Cruz

Built a production SQL chatbot for access-log analytics that replaced manual custom report requests with natural-language querying, using LangGraph and a ChromaDB-backed RAG pipeline for grounded, consistent answers. Implemented a privacy-preserving design in which the LLM never sees raw customer data (only query metadata). Also has experience building multi-agent/tool-calling systems with LangGraph (DeepAgents), including resolving sub-agent communication drift via self-reflection.

YL

Yurong Luo

Screened

Senior Data Scientist/ML Engineer specializing in scalable ML and LLM systems

Remote
9y exp
dataAnnotation
Virginia Commonwealth University

Built and deployed an end-to-end product that brings a research-paper approach into production for large-scale time-series clustering, with attention to partitioning, latency, and scalability. Also designed a Python-based backend validation service (comparing outputs to database ground truths) and handled production reliability issues by reproducing dataset-specific crashes and hardening corner-case behavior with client-friendly errors.

SN

Senior Data Engineer specializing in cloud data platforms and ML pipelines

Atlanta, GA
8y exp
Berkshire Hathaway
University of Alabama at Birmingham

Data engineer focused on AWS-based enterprise data platforms, owning end-to-end pipelines from multi-source batch/stream ingestion (Glue/Kinesis/StreamSets/Airflow) through PySpark transformations into curated datasets for Redshift/Snowflake. Emphasizes production reliability with strong monitoring/observability and data quality gates, and reports ~30% performance improvement plus improved SLAs and latency after optimization.

RG

Mid-level Backend Python Engineer specializing in APIs, microservices, and data pipelines

USA
4y exp
Marsh McLennan
Florida Atlantic University

Backend engineer (Marsh McLennan) who evolved a high-volume claims automation pipeline in Python, emphasizing thin APIs with background job processing, strong validation/retries, and production-grade observability. Experienced in secure FastAPI API design (centralized JWT/RBAC), multi-tenant Postgres/Supabase-style row-level security, and low-risk refactors using parallel runs and feature flags; targeting founding-engineer scope roles.

VK

Varshitha K

Screened

Mid-level Data Engineer specializing in cloud data platforms and lakehouse architectures

Lakewood, CO
4y exp
First Bank
University of Central Missouri

Data engineer in a banking context who has owned end-to-end Azure lakehouse pipelines ingesting financial/vendor data from APIs, Azure SQL, and flat files into Databricks/Delta (bronze-silver-gold). Emphasizes production reliability via schema-drift validation, data quality controls, monitoring/alerting, retries/checkpointing, and Spark/Delta performance tuning, with outputs served to BI/reporting teams (e.g., Tableau).

Chandan Chalumuri

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Tempe, AZ
4y exp
MetLife
Arizona State University

Data engineering / ML practitioner with experience at MetLife building transformer-based sentiment analysis over large unstructured datasets and productionizing pipelines with Airflow/PySpark/Hadoop (reported 52% efficiency gain). Also implemented embedding-based semantic search using Pinecone/Weaviate to improve retrieval relevance and enable RAG for customer support and document matching use cases.

Mike Gardiner

Screened

Technology Executive / Engineering Director specializing in AI-driven platform transformation

Lehi, UT
12y exp
Vivint
Weber State University

Built a 0-to-1 iOS mobile gardening application that helps users plan, track, and harvest crops with pest control guidance, weather, and climate-zone-based planting date recommendations. Demonstrated strong customer discovery and MVP-first product execution, including a major data challenge: compiling US climate zone data for every ZIP code from widely dispersed public sources into an app-ready database.

AA

Agna Antony

Screened

Mid-level Data Engineer specializing in cloud-native healthcare and enterprise data platforms

Michigan, USA
5y exp
MedStar Health
APJ Abdul Kalam Technological University

Data Engineer (TCS) who owned an end-to-end CRM analytics pipeline for Bayer’s eSalesWeb integration, ingesting from Salesforce APIs/databases/S3 and serving analytics-ready datasets via PostgreSQL/S3 for Tableau. Drove measurable outcomes: ~60% reduction in manual data-quality effort, ~30% lower latency through SQL optimization, and ~35% improved stability via monitoring, retries, and idempotent processing.

FM

Senior AI/ML Engineer specializing in healthcare AI and MLOps

Mansfield, TX
16y exp
McKesson
Sam Houston State University

Healthcare AI engineer with hands-on ownership of production ML and LLM systems at McKesson, spanning clinical risk prediction and RAG-based documentation tools. Stands out for combining deep clinical-data experience, HIPAA-aware deployment practices, and measurable impact through reduced readmissions, clinician workflow gains, and 20% to 30% faster ML delivery for engineering teams.

Apoorv Bankey

Screened

Mid-level Backend Engineer specializing in distributed systems and FinTech

New York City, NY
6y exp
Rutgers University
Rutgers University

Engineer who uses AI and multi-agent workflows as a force multiplier while keeping architecture, security, scalability, and production quality under human control. Shared a concrete example of accelerating a backend-heavy SaaS email ingestion platform with authentication, role-based APIs, database models, and deployment setup using agent-style development and review.

YN

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Michigan, USA
3y exp
Ally Financial
University of Michigan-Dearborn

GenAI/ML engineer with production experience at Cognizant and Ally Financial, building end-to-end LLM/RAG systems and ML pipelines. Delivered a domain chatbot trained from 90k tickets and 45k docs, improving intent accuracy (65%→83%), scaling to 800+ concurrent users with 99.2% uptime and sub-150ms latency, and driving +14% customer satisfaction. Strong in Azure ML + DevOps CI/CD, Dockerized deployments, and explainable/PII-safe modeling using SHAP/LIME to satisfy stakeholder trust and GDPR needs.

KP

Mid-level Full-Stack Java Developer specializing in cloud-native microservices and React

5y exp
Northern Trust
Central Michigan University

Full-stack engineer who owned enterprise workflow platforms end-to-end at Northern Trust and Elevance Health—building NestJS/Java Spring Boot APIs, React UIs, and cloud deployments on GCP Cloud Run. Strong in data-heavy applications (hundreds of thousands of records) with proven production performance tuning (indexing/query rewrites, Cloud Run concurrency/min instances) and secure RBAC via Azure AD.

AB

Senior Data & Platform Engineer specializing in cloud-native streaming and distributed systems

USA
10y exp
JPMorgan Chase
New York Institute of Technology

Financial data engineer who has built and operated high-volume batch + streaming pipelines (200–300 GB/day; 5–10k events/sec) using AWS, Spark/Delta, Airflow, Kafka, and Snowflake, with strong emphasis on data quality and reliability. Demonstrated measurable impact via 99.9% SLA adherence, major reductions in bad records/nulls, MTTR improvements, and significant latency/runtime/query performance gains; also built a distributed web-scraping system processing 5–10M records/day with anti-bot and schema-drift defenses.

MS

Mid-level Data Engineer specializing in multi-cloud data platforms for healthcare and finance

USA
6y exp
Cigna
University of Cincinnati

Data engineer with Cigna experience building and operating an end-to-end AWS-based healthcare claims pipeline processing ~2TB/day, using Glue/Kafka/PySpark/SQL into Redshift. Strong focus on data quality and reliability (schema validation, monitoring/alerting, retries/checkpointing/backfills), reporting improved accuracy (~99%) and reduced latency, plus experience serving real-time Kafka/Spark data to downstream analytics with documented data contracts.

AG

Mid-level Data Engineer specializing in cloud ETL and real-time streaming

New York, NY
6y exp
PNC
Rochester Institute of Technology

Data engineer focused on AWS + Spark/Databricks pipelines, including an end-to-end nightly loan-data ingestion flow (~2.2M records) from Postgres/S3 through Glue and Databricks into a DWH with layered validation and alerting. Also built real-time streaming with Kafka + Spark Structured Streaming and a master’s project streaming Reddit data for sentiment analysis under ambiguous requirements and tight budget constraints.

Subhash Krishnamoorthy

Executive Technology Leader specializing in digital transformation, headless e-commerce, and cloud architecture

Chesterfield, VA
25y exp
Hamilton Beach
University of Phoenix

Technology leader focused on business-aligned roadmaps and integration-heavy ecommerce platforms. Recently delivered an on-time launch for lutusooking.com (a premium Hamilton Beach brand) by coordinating UX/UI, component-based middleware, BigCommerce, Algolia search, personalization/recommendations, payments, and supply chain integrations, and later improved scalability via a Jitterbit iPaaS approach proven during Black Friday/Cyber Monday traffic.

SK

Mid-level Data Analyst specializing in healthcare and business intelligence

Michigan, USA
4y exp
Banner Health
Trine University

Healthcare analytics candidate with hands-on experience turning messy EHR, billing, and operational data into validated SQL datasets and automated Python/Airflow pipelines. They appear strongest in hospital KPI reporting—especially length of stay, readmissions, retention, and bed utilization—and have owned projects from metric definition through Power BI delivery and impact measurement.

