Vetted Apache Hadoop Professionals

Pre-screened and vetted.

VC

Mid-level AI/ML Engineer specializing in Generative AI and cloud MLOps

Peoria, IL3y exp
Bradley UniversityBradley University
View profile
HT

Mid-level AI/ML Engineer specializing in LLMs, RAG, and agentic AI systems

Brooklyn, NY3y exp
CARA SYSTEMSNortheastern University
View profile
SV

Mid-Level Full-Stack Software Engineer specializing in cloud-native and data platforms

Bloomington, Indiana4y exp
Indiana University BloomingtonIndiana University Bloomington
View profile
JL

Junior Full-Stack Software Developer specializing in web and cloud applications

Culver City, CA1y exp
Property MatrixSanta Clara University
View profile
BS

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG systems

CA, USA5y exp
DXC TechnologyCalifornia State University, Long Beach
View profile
NK

Junior Data Scientist specializing in cybersecurity and AI/ML

Saint Paul, Minnesota3y exp
ONCODEAUniversity of St. Thomas
View profile
PG

Mid-level AI/ML Engineer specializing in cloud AI, MLOps, and NLP

Washington, USA4y exp
iLink DigitalFlorida Atlantic University
View profile
NS

Mid-level AI/ML Engineer specializing in MLOps, streaming data, and NLP/CV

USA4y exp
CGIUniversity of Central Missouri
View profile
MK

Senior Java Full-Stack Developer specializing in microservices and cloud (AWS)

Plano, TX9y exp
Beal BankFitchburg State University
View profile
JH

Executive Cloud & Enterprise Architect specializing in AWS and FedRAMP transformations

Silver Spring, MD32y exp
Clickit Clockit CorporationNortheastern University
View profile
Pranava Reddy Kothapally - Junior Data Engineer specializing in Azure, CRM data pipelines, and marketing personalization in Hyderabad, India

Pranava Reddy Kothapally

Screened ReferencesStrong rec.

Junior Data Engineer specializing in Azure, CRM data pipelines, and marketing personalization

Hyderabad, India2y exp
TechwaveCleveland State University

LLM/AI engineer who has deployed production RAG conversational analytics and Text-to-SQL systems over Snowflake and curated data marts, emphasizing enterprise-grade guardrails for accuracy, security, and cost. Notable for a structured approach to reducing hallucinations (curated metric/table registry, SQL validation, RBAC, and citation-backed responses) and for building resilient, observable multi-step agent workflows using LangChain/LlamaIndex and Airflow.

View profile
VP

Vikesh Patel

Screened ReferencesStrong rec.

Senior AI/ML Engineer & Data Scientist specializing in LLMs, RAG, and MLOps

Eagan, MN8y exp
Intertech, Inc.Metropolitan State University

ML/NLP practitioner who has delivered production systems in regulated domains, including a healthcare compliance pipeline using RAG (GPT-4/Claude) plus TF-IDF retrieval that increased document review throughput 4.5x. Also has hands-on experience improving fraud detection data quality via entity resolution (Levenshtein, Dedupe.py) validated with A/B testing, and building scalable, monitored workflows with Airflow, CI/CD, and AWS SageMaker.

View profile
AS

Adithya Sharma

Screened ReferencesModerate rec.

Mid-level AI/ML Engineer specializing in MLOps, NLP, and Generative AI

Remote, USA5y exp
EncoraUniversity of Michigan-Dearborn

Built and deployed a production LLM-powered text-to-SQL/document intelligence chatbot on AWS that lets non-technical business users query complex enterprise databases in plain English. Demonstrates deep practical expertise in schema-aware prompting, embeddings-based schema retrieval, SQL safety/validation guardrails, and rigorous offline/online evaluation with human-in-the-loop approvals for risky queries.

View profile
HS

Helly Shah

Screened ReferencesModerate rec.

Junior Data Analyst specializing in business analytics and machine learning

New York, NY2y exp
Handshake AI Solutions, LLCBaruch College (CUNY)

Analytics-focused candidate with hands-on project experience in SQL data preparation and Python-based churn modeling. They demonstrated a practical approach to turning messy multi-source data into reporting tables, validating data quality rigorously, and translating churn insights into targeted retention strategies.

View profile
SS

Mid-level AI Engineer and Data Scientist specializing in LLM agents and RAG systems

Palo Alto, CA5y exp
LemmataUniversity at Buffalo

Built a production-grade LLM evaluation and regression system that stress-tests models across hundreds of iterations, combining LLM-as-judge, semantic similarity, statistical metrics, and rule-based checks, with results delivered via stakeholder-friendly HTML reports and dashboards. Experienced orchestrating multi-agent RAG workflows using LangChain/LangGraph and event-driven GenAI pipelines in n8n integrating OCR, speech-to-text, and external APIs, with strong emphasis on reliability, observability, and explainable failures.

View profile
AG

Junior Data Analyst specializing in marketing analytics and machine learning

Dallas, Texas1y exp
Maverick Digital TechnologiesUniversity of Texas at Arlington

Built and deployed a production LLM-assisted recommendation and insights platform that unifies structured, semi-structured, and unstructured data via a modular ingestion pipeline, canonical schemas, embeddings, and late-fusion modeling. Experienced in operationalizing ML/LLM systems with Airflow and Kubernetes (Dockerized services, autoscaling, rolling updates) and emphasizes reliability through layered testing, guardrails, monitoring, and A/B experimentation while partnering closely with non-technical stakeholders.

View profile
SS

Mid-level Data Scientist specializing in Generative AI and LLMOps

Dover, USA4y exp
Visual TechnologiesUniversity of Houston

Built a production-grade, semi-automated document recognition and classification system for large volumes of scanned PDFs, starting from little/no labeled data and handling highly variable scan quality. Deployed on AWS using SageMaker + Docker and orchestrated on EKS with a microservices design that scales CPU-heavy OCR separately from GPU inference, with strong reliability controls (validation, fallbacks, retries, readiness probes).

View profile
ES

Mid-level AI Engineer specializing in RAG, conversational AI, and agentic systems

Remote6y exp
MedLibIowa State University

Built and deployed a production RAG-based clinical decision support assistant at MedLib, focused on fast, trustworthy answers from large medical documents. Demonstrates deep practical experience improving retrieval accuracy (semantic chunking + metadata-aware search), controlling hallucinations with grounded generation and thresholds, and adding clinician-requested citations using chunk metadata, with evaluation driven by healthcare professional review.

View profile
VS

VIJAY SAGI

Screened

Mid-level Data Engineer specializing in cloud-native batch and streaming pipelines

Prosper, TX5y exp
ACL DigitalTrine University

Data/ML platform engineer with ~6 years in financial services and enterprise data platforms, building regulated fraud/credit-risk pipelines on AWS (Airflow, EMR/Spark, MLflow) and an Azure lakehouse ingesting 50+ sources and serving ~100M records/day. Also led an early-stage deployment of a RAG-based internal AI search tool using AWS Bedrock and LangChain with automated evaluation to validate LLM accuracy.

View profile
VB

Mid-level AI Engineer specializing in NLP, computer vision, and healthcare analytics

Dartmouth, US3y exp
Integrated MonitoringUniversity of Massachusetts Dartmouth

Data scientist who has built production LLM agents (GPT-4o + LangChain + RAG) to automate analyst-style ad hoc CSV analysis with guardrails and GPT-as-a-judge evaluation. Also delivered an explainable healthcare NLP system for ICD code classification by collaborating closely with clinicians, using a hybrid rule-based decision tree + BERT model to reach 97% accuracy and cut manual review time.

View profile
RM

Senior Data Scientist / AI Engineer specializing in LLMs, RAG, and production ML

New York, NY5y exp
Bluesap SolutionsDePaul University

Data science professional who has built a production RAG-based LLM question-answering system ("Flash Query") to deliver fast, accurate answers over large document collections, focusing on retrieval quality and grounded responses. Also collaborates with non-technical retail/jewelry stakeholders to turn business questions into predictive models and dashboards for decision-making.

View profile
Homak Patel - Junior Software Engineer specializing in Agentic AI and Data Systems

Homak Patel

Screened

Junior Software Engineer specializing in Agentic AI and Data Systems

2y exp
EasyBee AINorth Carolina State University

Forward Deployed Engineer at EasyBee AI who productionized a self-storage customer’s multi-agent LLM system end-to-end—rebuilding it with LangGraph/CrewAI, integrating with real property management + CRM systems via an MCP server, and adding observability/guardrails for reliable daily use. Experienced in live troubleshooting of agentic workflows, developer demos/workshops (including an open-source project, MerryQuery), and partnering with sales to close deals through customer-specific technical demos and fast integration feedback loops.

View profile

Need someone specific?

AI Search