Vetted Apache Hadoop Professionals

Pre-screened and vetted.

SC

Sai Charan C

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS

CT, USA3y exp
HCLTechUniversity of New Haven

Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.

View profile
AC

Principal Data Scientist specializing in cybersecurity ML and MLOps

New York, NY15y exp
Beyond IdentityIowa State University

ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).

View profile
Swati Swati - Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps in Florida, United States

Swati Swati

Screened

Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps

Florida, United States5y exp
Voltihost LLCStony Brook University

AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.

View profile
Haritha Kuraparthi - Mid-level Full-Stack Developer specializing in cloud data engineering and analytics in West Haven, CT

Mid-level Full-Stack Developer specializing in cloud data engineering and analytics

West Haven, CT4y exp
BlackbaudUniversity of Bridgeport

Software developer with hands-on experience owning customer-facing work end-to-end (requirements, implementation, testing, and feedback-driven iteration) using Python and React.js. Also described remodeling an internal legacy page/tool to improve performance and accuracy, and has exposure to microservices and RabbitMQ plus ETL-based system work.

View profile
Dishank Kailash Oza - Mid-level Software Engineer specializing in distributed systems and cloud infrastructure in Santa Clara, CA

Mid-level Software Engineer specializing in distributed systems and cloud infrastructure

Santa Clara, CA4y exp
Toir Inc.Santa Clara University

Engineer with a thoughtful, production-oriented approach to AI-assisted development, including multi-agent workflows for planning, coding, review, testing, and debugging. Stands out for treating AI systems like distributed pipelines with explicit interfaces, validation layers, and guardrails to improve reliability and reduce hallucinations.

View profile
KG

Mid-level Generative AI Engineer specializing in LLM agents and RAG

Chesterfield, MO4y exp
Reinsurance Group of AmericaUniversity of Central Missouri

GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.

View profile
MS

Mid-level Full-Stack Software Engineer specializing in Java/Spring Boot, React, and cloud

United States4y exp
SubaruCentral Michigan University

Backend/platform engineer who built real-time connected-vehicle telemetry analytics at Subaru, spanning Kafka streaming, Python/FastAPI ETL, and low-latency WebSocket delivery (minutes to <2s). Strong Kubernetes + GitOps practitioner across AWS EKS and Azure AKS (Helm, ArgoCD, Jenkins/GitLab) and has led major on-prem-to-cloud migrations for financial microservices using Terraform and AWS DMS with measurable cost and reliability gains.

View profile
VK

Vaishnavi K

Screened

Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection

USA5y exp
TCSUniversity of New Haven

LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.

View profile
TP

Thilak P

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

5y exp
W. R. BerkleySacred Heart University

Backend/data engineer who builds Python (FastAPI) data-processing API services for internal analytics/reporting, emphasizing modular architecture, async performance tuning, and reliability patterns (health checks, retries, observability). Also migrated legacy on-prem ETL pipelines to Azure using ADF/Data Lake/Functions and implemented a near-real-time ingestion flow with Event Hubs plus watermarking to handle late events and deduplication.

View profile
AJ

Aman Jain

Screened

Mid-level Software Engineer specializing in cloud-native data pipelines and ML platforms

Boston, MA4y exp
Community Dreams FoundationBoston University

Backend engineer who has owned end-to-end delivery of Python/FastAPI microservices for real-time data processing and alerting, including performance tuning (Postgres optimization, caching, async processing). Strong DevOps/GitOps background: Docker + Kubernetes deployments with GitHub Actions CI/CD and ArgoCD-driven GitOps, plus experience supporting phased on-prem to AWS migrations and building Kafka-based streaming pipelines.

View profile
YP

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp
Humanitarians.AINortheastern University

AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).

View profile
NB

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp
LumenUniversity of South Florida

AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.

View profile
SB

Senior DevOps/Platform Engineer specializing in Kubernetes and cloud infrastructure

12y exp
DXC TechnologyVinayaka Missions University

DevOps/Infrastructure engineer with hands-on production experience building Jenkins CI/CD pipelines that provision Kubernetes infrastructure and process data into a MapR cluster. Uses Terraform to provision AWS resources (EC2, S3, VPC, subnets) with remote state in S3, separate environment state files, and code review/validation practices; targeting $135k base.

View profile
Vidit Naik - Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems in San Francisco, CA

Vidit Naik

Screened

Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems

San Francisco, CA2y exp
Checksum AIUC Riverside

Forward-deployed engineer who built a production AI drone-control chatbot that lets users fly a drone via natural language while viewing a real-time feed. Implemented RAG over drone SDK documentation (vector DB + top-k retrieval) and LoRA fine-tuning, with a focus on latency, token efficiency, and cost reduction, and regularly works with non-technical clients to integrate and explain AI system architecture.

View profile
Santhoshi Priya Sunchu - Mid-level Data Scientist specializing in NLP and predictive modeling in Massachusetts, USA

Mid-level Data Scientist specializing in NLP and predictive modeling

Massachusetts, USA5y exp
Blue Cross Blue Shield of MassachusettsUniversity of Massachusetts Dartmouth

AI/ML practitioner in healthcare/insurance (Blue Cross Blue Shield) who built and deployed a production NLP system to classify patient risk from unstructured clinical notes. Experienced in end-to-end pipeline orchestration (Airflow, AWS Step Functions/Lambda/SageMaker) and real-time optimization (BERT to DistilBERT on AWS GPUs), with strong clinician collaboration to drive adoption.

View profile
Shabari Vignesh - Mid-level Data Engineer specializing in cloud data platforms and AI agents in Santa Clara, CA

Mid-level Data Engineer specializing in cloud data platforms and AI agents

Santa Clara, CA6y exp
SwirepaySan José State University

Data/Backend engineer who has owned end-to-end merchant analytics systems on AWS: orchestrated multi-source ingestion (FISERV/Shopify/Clover) with Step Functions/Lambda, enforced strong data quality gates, and served curated datasets via Redshift and a FastAPI layer. Also built an early-stage Merchant Insights AI agent that converts natural language questions into SQL using OpenAI models, with full CI/CD and observability.

View profile
AK

Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps

KS, USA4y exp
Black & VeatchUniversity of Central Missouri

Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.

View profile
SA

Sai Addala

Screened

Mid-level AI/ML Engineer specializing in financial risk, fraud analytics, and forecasting

USA4y exp
Northern TrustSyracuse University

Built and productionized an LLM-powered financial intelligence and forecasting platform at Northern Trust using a RAG architecture (LangChain + Hugging Face + FAISS) with end-to-end MLOps (Docker/Kubernetes, Airflow, MLflow). Emphasized regulatory-grade explainability (SHAP/Power BI) and hallucination control (retrieval-only grounding), achieving ~30% forecasting accuracy improvement and ~65% reduction in analyst research time, with sub-second inference and 95% uptime on EKS/AKS.

View profile
JC

Mid-level Machine Learning Engineer specializing in LLMs, NLP, and MLOps

USA5y exp
McKessonSUNY

Built a production LLM-RAG system at McKesson to let internal healthcare operations teams query large volumes of unstructured operational documents via natural language with source-backed answers, designed with HIPAA/FHIR compliance in mind. Demonstrated strong production engineering across hallucination mitigation, retrieval quality tuning, and latency/scalability optimization, using LangChain/LangGraph and Airflow plus rigorous evaluation/monitoring practices.

View profile
SK

Mid-level ML Engineer specializing in NLP and Generative AI

Houston, TX4y exp
Epic SystemsUniversity of Central Missouri

Healthcare AI/ML engineer with Epic experience who built and deployed a HIPAA-compliant GPT-4 RAG clinical assistant over large medical document sets, emphasizing privacy controls and low-latency performance. Also automated end-to-end retraining and deployment of patient risk models using orchestration/CI-CD (Jenkins, SageMaker, MLflow), cutting deployment time from hours to minutes while improving reliability.

View profile
DG

Dimple Galla

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics

Lawrence, KS4y exp
PaycomUniversity of Kansas

Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.

View profile
ND

Staff Software Engineer/Architect specializing in Java microservices and multi-cloud (AWS/Azure)

California, USA19y exp
NTT DATAUniversity of Hyderabad

Backend/platform engineer with State Farm experience modernizing and scaling an enterprise consolidated payment data platform and event-driven pipelines. Built cloud-native payment architecture (ECS->EKS) handling millions of financial transactions/day and high-volume telemetry (~100M events/day), with strong schema governance (Avro + schema registry) and production operations/incident mitigation driven by observability.

View profile
Dhairya Desai - Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics in Chicago, IL

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL13y exp
OptumUniversity of Texas at Dallas

ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.

View profile
Navya Sureddy - Mid-level Full-Stack Developer specializing in Angular/React and Spring Boot in Columbus, IN

Navya Sureddy

Screened

Mid-level Full-Stack Developer specializing in Angular/React and Spring Boot

Columbus, IN5y exp
CumminsUniversity of Missouri-Kansas City

Full-stack engineer with experience at Cummins owning production features end-to-end (React/TypeScript + Node + Postgres) and operating them in AWS (EC2/RDS/S3/IAM) with CloudWatch-based observability. Also built resilient ETL and third-party integrations, including an AWS Glue–S3–Redshift pipeline hardened with validation, idempotent UPSERTs, retries/backfills, and quarantine handling to prevent bad or duplicate data.

View profile

Need someone specific?

AI Search