
Vetted Apache Hadoop Professionals

Pre-screened and vetted.

Apache Hadoop Python SQL Docker AWS Apache Spark

Hariom Vyas

Screened

Senior Business Analyst specializing in BFSI reporting and BI

Dallas, TX | 4y exp

Goldman Sachs | University of Maryland, Baltimore County

“Forward-deployed, full-stack/platform engineer who owns production features end-to-end across frontend, backend, data, and infrastructure (AWS serverless, Terraform, React). Has modernized critical fintech/payment systems (zero-downtime monolith-to-microservices with Kafka event sourcing) and productionized AI-native support workflows (LLM + RAG on Pinecone) with measurable gains in latency, incidents, CSAT, and support efficiency.”

SQL Python Power BI Tableau Business Intelligence Reporting +280

Gaurav Sherwani

Screened

Mid-level product and data engineer specializing in FinTech and AI

Washington, DC | 5y exp

Omnic.AI | Georgetown University

“Data engineer and technical product manager with a strong pre-sales-style background in highly regulated financial environments, translating complex data infrastructure work into business cases, compliance signoffs, and leadership buy-in. At UBS, they drove a cloud migration with $2M savings and built regulatory automation that beat benchmarks, while at Omnic.ai they worked directly with users on a GPT-4 feature that doubled MAU and lifted subscriptions 20%.”

Go-to-Market Strategy Market Research A/B Testing Prompt Engineering GPT-4 Agentic AI +90

Vasavi Mittapalli

Screened

Senior Data Scientist specializing in GenAI, LLMs and RAG

Dallas, TX | 5y exp

Texas Instruments | Trine University

“Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.”

A/B Testing Agile Amazon DynamoDB Amazon EC2 Amazon Kinesis Amazon Redshift +195

Jaswanth Vakkala

Screened

Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP

Iselin, NJ | 5y exp

Wells Fargo | St. Francis College

“Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.”

A/B Testing Anomaly Detection Apache Hadoop Apache Hive Apache Spark AWS +224

Harshitha Kotari

Screened

Mid-level Data/ML Engineer specializing in NLP, GenAI, and scalable data pipelines

5y exp

Abbott | Clarkson University

“AI/ML engineer with production experience building LLM-powered document intelligence and customer support systems in healthcare/insurance, emphasizing high-accuracy RAG, long-document processing, and robust monitoring/fallback mechanisms. Also automates and scales ML lifecycle workflows using Apache Airflow and Kubeflow, and partners closely with non-technical operations stakeholders to drive adoption.”

Python R SQL Java MATLAB HTML +148

Bernard Griffin

Screened

Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI

Baltimore, MD | 17y exp

Intel | Illinois Institute of Technology

“ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness—data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).”

A/B Testing Amazon Bedrock Amazon EC2 Amazon Kinesis Amazon Redshift Amazon S3 +130

Subhasmita Maharana

Screened

Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps

New York, NY | 6y exp

Citigroup | Kent State University

“Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).”

A/B Testing Apache Airflow Azure Machine Learning BERT CI/CD Clustering +170

Chad Thomas

Screened

Executive Technology Leader (CTO/Chief Architect) specializing in AI, FinTech, and scalable platforms

Remote, FL | 34y exp

Intech Investments | Colorado State University

“Serial entrepreneur who built Verb Technology from a garage startup to a Nasdaq IPO, raising multiple rounds of capital along the way. Invented interactive live streaming technology that was acquired by Amazon and demonstrated rapid product/market response during COVID by prototyping and launching a solution for users while tightly managing AWS costs.”

AWS AWS Lambda Amazon Kinesis Amazon Redshift AWS Glue Kubernetes +81

Yu Liu

Screened

Senior Big Data Engineer specializing in AML/KYC compliance and cloud data platforms

New York, NY | 17y exp

Citigroup | University of Missouri

“Data engineer with experience delivering an end-to-end pipeline handling ~3.5TB in a star-schema setup (fact + dimensions) and producing business-facing tables in Hive/Spark. Identified and resolved UAT-reported duplicate issues caused by joins through root-cause analysis, and also built automation to run Spark SQL metrics on weekly/monthly/quarterly cadences and distribute results to users.”

Python JavaScript Shell Scripting SQL MySQL PostgreSQL +110

Bhavya Sree Ganja

Screened

Senior Data Engineer specializing in cloud lakehouse platforms and streaming analytics

Pittsburgh, PA | 8y exp

First National Bank | Texas A&M University-Corpus Christi

“Data engineer focused on fraud and banking analytics who has owned end-to-end batch + streaming pipelines at very large scale (hundreds of millions of records/day). Built robust data quality/observability layers (schema validation, anomaly detection, alerting) and delivered low-latency serving via AWS Lambda/API Gateway with DynamoDB + Redis, plus external data ingestion/scraping pipelines orchestrated in Airflow with anti-bot protections.”

Agile Amazon API Gateway Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon Kinesis +210

Abhinav Gupta

Screened

Junior Machine Learning Engineer specializing in LLMs and applied data science

2y exp

Esri | USC

“Built and shipped multiple production AI systems, including Auto DocGen (LLM-generated OpenAPI docs kept in sync via AST diffs, schema-constrained generation, and CI/CD on Render) and a multimodal sign-language recognition pipeline at USC orchestrated with FastAPI, MediaPipe, and PyTorch. Also partnered with Esri’s non-technical community team to fine-tune an LLaMA-based spam classifier with a review UI, cutting moderation time by 70%.”

Python Pandas NumPy Scikit-learn JavaScript TypeScript +126

Ajay Madhusudhan Thumala

Screened

Junior Software Engineer specializing in data engineering and LLM applications

Irvine, CA | 1y exp

Geisinger | UC Irvine

“Computer science engineer and master’s graduate who independently built a mechatronics-heavy capstone prototype: a smartphone concept for deafblind users using micro-actuator arrays for braille reading. Also has platform engineering experience at Quantiphi, deploying webhooks to Kubernetes and implementing GitOps-based CI/CD using AWS CodeCommit/CodeBuild and ECR.”

API Development API Gateway AWS Bash C C++ +206

Aditya Jhaveri

Screened

Mid-level Software Engineer specializing in AI, big data, and distributed systems

Jersey City, NJ | 3y exp

New York University | NYU

“Software Developer at NYU (GEMSS) focused on scaling and optimizing a data-heavy asset management web app, including migrating/optimizing data access via Google Sheets API and Firestore. Previously an SDE at Sainapse working on Spring Boot microservices POCs (Kafka, Hadoop at 2B+ record scale). Built an end-to-end Apple Wallet coupon generation/redemption system using PassKit + Google Apps Script with measurable ops impact (40% efficiency gain).”

Agile Algorithms Anomaly Detection Apache Hadoop Apache Hive Apache Kafka +124

Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp

Wells Fargo | University of North Texas

“Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.”

Python SQL Java Scala Shell Scripting TypeScript +153

Deepthi Mundarinti

Screened

Mid-level Data Engineer specializing in real-time analytics and regulated domains

NC, USA | 5y exp

JPMorgan Chase | Saint Louis University

“Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable gains in latency, fraud accuracy, false positives, and analyst decision speed.”

Python NumPy Pandas PySpark Scikit-learn TensorFlow +120

Saisureshreddy Challa

Screened

Mid-level Data Scientist specializing in AI/ML, LLMs, and domain analytics

California, USA | 6y exp

BlackRock | Northeastern University

“BlackRock AI/ML engineer who built and owned a production LLM document intelligence system for regulatory and investment analysis end-to-end. They combined RAG, multi-agent validation, strong evaluation/monitoring, and reusable Python services to process 50K+ documents, cut review time 40-50%, and improve decision accuracy by about 25%.”

Python PySpark SQL Scala PostgreSQL MySQL +174

Ashutosh Jitendra Zawar

Screened

Mid-level AI/ML Engineer specializing in generative AI, NLP, and MLOps

San Jose, CA | 4y exp

ServiceNow | University of North Carolina at Charlotte

“ML/AI engineer with hands-on ownership of production GenAI and computer vision systems, spanning experimentation, deployment, monitoring, and iterative optimization. Stands out for shipping an enterprise RAG platform that cut manual review by 50% and a defect detection pipeline that reduced report generation from 15 minutes to under 1 second while maintaining high uptime and strong operational discipline.”

SDLC Agile MLOps Cross-Functional Collaboration Machine Learning Deep Learning +154

Vedant Jagtap

Screened

Junior AI/NLP Engineer specializing in LLM systems and applied research

New York, NY | 2y exp

NYU’s Center for Social Media, AI, and Politics | NYU

“LLM/agent engineer who shipped a two-stage AI recruitment screening platform at Foursquare that automated resume ingestion through behavioral assessment, delivering an 85% reduction in screening time across 5,000+ applications with auditability and confidence-gated decisions. Also built a multi-agent benchmarking framework using MCP tool interfaces and a RAGAS + LangSmith evaluation/observability stack, including async re-architecture that cut production latency by 50%.”

Python JavaScript TypeScript SQL R Java +162

Ashlesh Bhardwaj

Screened

Director-level Product Leader specializing in FinTech and enterprise finance platforms

Charlotte, NC | 19y exp

Wells Fargo | IEC College of Engineering and Technology

“Senior product and technology leader with 23+ years of experience driving modernization in complex enterprise finance and operations environments. He stands out for turning legacy, paper-based or fragmented systems into scalable digital products—cutting a warranty claims process from 30 days to near-instant and using AI to improve service efficiency and reduce testing effort by 30%+. Strong C-suite-facing operator who bridges strategy, architecture, UX, and organizational change.”

Product Strategy Go-to-Market Strategy Digital Transformation Agile Scrum Risk Management +122

Pooja Shindd

Screened

Mid-level Full-Stack Software Engineer specializing in scalable web and AI systems

Illinois, USA | 4y exp

University of Illinois Chicago Technology Solutions | University of Illinois Chicago

“Full-stack engineer who has built both a TypeScript-based HR/payroll platform and a production agentic AI support system end to end. Stands out for combining strong product judgment with deep LLM systems thinking: RAG architecture, confidence-based routing, evals, observability, and human-in-the-loop design in a greenfield environment.”

Java Scala Python JavaScript TypeScript SQL +108

Sathyavarthan Balachandar

Screened

Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing

Boston, USA | 3y exp

Fidelity Investments | Northeastern University

“Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.”

Python SQL Go Apache Spark PySpark Databricks +125

Jash Shah

Screened

Mid-level Data Scientist specializing in LLMs, MLOps, and predictive analytics in healthcare and finance

New Jersey, USA | 4y exp

Johnson & Johnson | Stevens Institute of Technology

“Built and deployed a production LLM/RAG clinical decision support system that enables real-time semantic search over unstructured EHR notes and delivers patient risk insights. Strong in healthcare-grade MLOps and compliance (HIPAA, PHI handling, encryption, RBAC, audit logs) and scaled embedding/retrieval pipelines using Spark/Databricks and Airflow. Partnered with clinicians via Power BI dashboards and explainability, contributing to an 18% reduction in patient readmissions.”

A/B Testing API Integration Apache Airflow Apache Hadoop Apache Kafka Apache Spark +102

Sravani Kasaraneni

Screened

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA | 4y exp

ServiceNow | Rivier University

“Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.”

SDLC Agile Waterfall Python R Java +104
