Vetted Apache Spark Professionals

Pre-screened and vetted.

SK

Mid-level AI Developer & Machine Learning Engineer specializing in LLM and MLOps systems

Champaign, IL5y exp
CenteneEastern Illinois University

Built and deployed an enterprise RAG application at Centene to help clinical teams retrieve insights from large internal policy document sets, cutting manual research by 30–40%. Implemented custom domain-adapted embeddings (SageMaker + BERT transfer learning) and hybrid retrieval (BM25 + Pinecone) to drive a 22% relevance lift, and ran the system in production on AWS EKS with CI/CD, MLflow, and Prometheus monitoring (99% uptime, ~40% latency reduction).

View profile
LG

Junior Business Analytics & SAP BASIS professional specializing in AI and predictive modeling

Denton, TX3y exp
University of North TexasUniversity of North Texas

Built and deployed a production LLM-powered email assistant (“wood flow”) for a local pet resort to automate after-hours inbound email handling, including email categorization and context-aware auto-responses. Uses n8n for orchestration and applies CRISP-DM, load/edge-case testing, and RAG-based context retrieval, and has experience presenting AI solutions with budgeting and ROI to a non-technical founder.

View profile
PG

Mid-level Data Scientist specializing in healthcare ML and GenAI

San Marcos, TX4y exp
UnitedHealth GroupTexas State University

Healthcare data/NLP practitioner with experience at UnitedHealthcare building production ML systems that connect unstructured call center transcripts and medical notes to structured claims data. Has delivered measurable impact (25% classification accuracy lift; ~30% relevance improvement) using classical NLP, embeddings (Sentence-BERT + FAISS), and AWS SageMaker deployments with robust validation and drift monitoring.

View profile
LD

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps

Atlanta, GA3y exp
AIGKennesaw State University

Data professional with ~4 years of experience, most recently at AIG (insurance), building ML/NLP systems for fraud detection and policy automation using transformers, CNNs, and clustering/anomaly detection. Also developed a RAG-based knowledge retrieval system, iterating across embedding models and moving to production based on precision and latency SLAs, then containerizing and deploying with SageMaker and CI/CD.

View profile
RT

Rakesh Thota

Screened

Mid-level Data Engineer specializing in multi-cloud real-time data pipelines

California, USA5y exp
Molina HealthcareUniversity at Buffalo

Data engineer with healthcare/clinical trial domain experience who owned a 100TB+/month AWS pipeline end-to-end (Glue/S3/Redshift/Airflow) and drove measurable outcomes (20% lower latency, 99.9% reliability, 40% less manual reporting). Also built production data services and API-based ingestion on GCP (Cloud Run/Functions/BigQuery) with strong validation, versioning, and safe migration practices, and launched an early-stage RAG solution (LangChain + GPT-4) for researchers.

View profile
GM

Mid-level Data Engineer specializing in Azure, Spark, and scalable ETL/ELT pipelines

Charleston, IL4y exp
Eastern Illinois UniversityEastern Illinois University

Data engineer with banking FP&A experience who led an end-to-end migration of 10+ TB from Teradata to Azure (ADF + Data Lake + Databricks/PySpark + Synapse). Emphasizes reliability (multi-stage validation, monitoring/alerts) and performance (Spark tuning, incremental loads, autoscaling), reporting ~99.5% pipeline reliability while supporting downstream consumers with stable schemas and clear change management.

View profile
LP

Senior Data Engineer specializing in cloud data platforms and real-time analytics

Remote, USA10y exp
Scale MediaNew York City College of Technology (CUNY)

Data/analytics engineer focused on finance and e-commerce integrations, building end-to-end pipelines and services across Odoo, QuickBooks, Snowflake, and Tableau. Replaced a costly third-party Walmart connector with a serverless AWS Lambda pipeline deployed via Terraform/GitHub and monitored with CloudWatch/Datadog, and shipped a bi-directional Odoo↔QuickBooks invoice sync with distributed locking plus Slack-based finance approvals.

View profile
SB

Mid-level Data Engineer specializing in cloud ETL and streaming data pipelines

Detroit, MI5y exp
HarmonecareAuburn University at Montgomery

Data engineer in healthcare/clinical data platforms (HarmonCare) who built and operated an end-to-end lakehouse pipeline ingesting HL7/FHIR at ~2–3M records/day on AWS (Glue/Lambda/S3/Spark) and serving trusted datasets in Snowflake. Implemented strong validation/reconciliation gates and a data quality framework that reduced discrepancies ~40%, plus CI/CD (GitHub Actions/Terraform) and monitoring (Airflow/CloudWatch).

View profile
Anay Dongre - Junior Machine Learning Engineer specializing in GenAI and LLM fine-tuning in Pomona, California

Anay Dongre

Screened

Junior Machine Learning Engineer specializing in GenAI and LLM fine-tuning

Pomona, California1y exp
Aerolift.AICal Poly Pomona

Robotics software engineer focused on hard real-time autonomy for legged robots, building a quadruped navigation stack that combines vision SLAM with MPC and maintains a deterministic 500Hz control loop. Deep performance optimization experience across CUDA (sub-2ms perception latency), ROS 2/DDS real-time tuning, and motion planning (cut 500ms spikes to sub-5ms). Also designed distributed ROS 2 + Zenoh communications between quadrupeds and aerial drones and validated robustness under lossy wireless conditions.

View profile
Sachin Dulla - Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps in Kentwood, MI

Sachin Dulla

Screened

Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps

Kentwood, MI3y exp
Fifth Third BankCalifornia State University, San Bernardino

Built and deployed a domain-specific LLM chatbot for research/support, cutting manual effort by ~50%. Demonstrates strong applied LLM engineering: RAG, prompt grounding with citations and fallbacks, embedding/top-k tuning, and production monitoring (confidence, latency, feedback loops). Experienced orchestrating agent workflows with LangChain-style pipelines and continuous evaluation to maintain reliability.

View profile
Shrinivas Bhusannavar - Mid-level AI Engineer specializing in agentic LLM systems and RAG platforms in San Jose, CA

Mid-level AI Engineer specializing in agentic LLM systems and RAG platforms

San Jose, CA5y exp
SquareShiftSan José State University

Built and shipped Serrano AI, a multi-tenant SaaS conversational AI platform that automates Odoo ERP workflows and lets ops/finance/supply-chain teams query ERP data in natural language. Implemented a multi-agent architecture (LangChain/LangGraph/CrewAI) with hybrid RAG over ERP schemas, deployed on Heroku/Vercel with production observability, cutting reporting time by ~80% while addressing hallucinations, latency, and schema complexity.

View profile
Bhavishyasai Chigurupati - Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms in Overland Park, KS

Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms

Overland Park, KS5y exp
CignaUniversity of Central Missouri

Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.

View profile
Arunima Mishra - Senior Technical Product Lead specializing in Data Governance and MDM SaaS platforms in Bengaluru, India

Senior Technical Product Lead specializing in Data Governance and MDM SaaS platforms

Bengaluru, India7y exp
InforManipal University Jaipur

Technical/product lead at Albanero (acquired by Infor in 2024; now at Infor) who built a Data Mesh-focused “Governance as a Product” module from early persona-based policies through a highly configurable multi-ERP governance platform (MDM, multi-source mastering, match/merge, automated review workflows). Also troubleshoots agentic/LLM workflows in production using auditability, guardrails, monitoring, and real-time validation—fixing a P0 false-positive security flagging issue and contributing to significant deal/adoption growth (~50%) after V2 launch.

View profile
SaiSindhu Beeravolu - Mid AI/ML Engineer specializing in LLMs, RAG, and healthcare AI in Illinois, USA

Mid AI/ML Engineer specializing in LLMs, RAG, and healthcare AI

Illinois, USA4y exp
UnitedHealth GroupUniversity of Maryland, Baltimore County

Healthcare ML/AI engineer with production experience at UnitedHealth Group, including an end-to-end readmission prediction system built on 50M+ patient records that improved accuracy by 18% and reduced preventable readmissions by 12%. Also shipped a clinically grounded LLM/RAG referral generator with human-in-the-loop safety controls, showing strong depth in regulated, high-stakes AI systems.

View profile
Almas Fathimah - Senior AI Engineer specializing in Generative AI and ML platforms in USA

Senior AI Engineer specializing in Generative AI and ML platforms

USA7y exp
EnteraUniversity of Maryland, Baltimore County

Built and owned a production RAG-based conversational AI system at Entera for real estate analysis, taking it from experimentation through AWS deployment, monitoring, and iterative improvement. Demonstrates strong practical judgment in retrieval design, LLM safety, and scalable Python service architecture, with measurable impact including 30-40% reduction in manual analysis time and roughly 30% better response accuracy.

View profile
SC

Mid-level Software Engineer specializing in Python backend and AI/GenAI

Jersey City, NJ4y exp
PTCSt. Francis College

Backend/infrastructure-focused engineer building AI-agent products for small businesses, including a customer-service agent platform with intent routing, RAG over Pinecone, and external booking API integration. Has shipped Python/FastAPI services with JWT auth, versioned APIs, Docker deployments to AWS EC2 via GitHub Actions, and production monitoring with Prometheus/Grafana.

View profile
RB

Raghav Bajaj

Screened

Junior Software Engineer specializing in backend systems and ML applications

Modesto, CA2y exp
Center for Human Services (CHS)UC Riverside

Full-stack engineer with hands-on experience building and shipping production web products across AI, frontend, backend, and DevOps. Notably built an end-to-end resume-job matching platform during an internship that processed 1000+ resumes/day and cut recruiter screening effort by 60%, and later shipped an internal operations dashboard at CHS with measurable performance gains.

View profile
Baliram Maurya - Senior Full-Stack Engineer specializing in Java microservices and Healthcare IT in USA

Senior Full-Stack Engineer specializing in Java microservices and Healthcare IT

USA11y exp
Bluethink

Backend engineer with hands-on experience modernizing healthcare platforms in a startup-like team of 8-12 across engineering, QA, DevOps, and product. They personally drove scalable Java/Spring Boot microservices for healthcare workflows, including FHIR integrations, real-time data pipelines, and resilient integrations with legacy and third-party systems.

View profile
Saketh Kota - Mid-level Data Scientist / ML Engineer specializing in Generative AI, RAG, and MLOps in Irving, TX

Saketh Kota

Screened

Mid-level Data Scientist / ML Engineer specializing in Generative AI, RAG, and MLOps

Irving, TX4y exp
U.S. Bank

Built and productionized a RAG-based LLM research assistant for biomedical and regulatory document search using Mixtral 7B on SageMaker, LangChain, and Milvus, cutting research time by ~40%. Has hands-on multi-cloud MLOps experience across AWS/Azure/GCP with Kubeflow/Airflow/Composer plus Terraform + ArgoCD, and applies rigorous evaluation/monitoring (latency, accuracy, hallucinations). Also partnered with a non-technical PM to deliver an insurance policy Q&A chatbot that reduced customer response time by 30%+.

View profile
NS

Intern Software Engineer specializing in AI agents, MLOps, and data engineering

Westborough, MA5y exp
LG Energy SolutionNortheastern University
View profile
NK

Mid-level Prompt Engineer specializing in Generative AI and RAG systems

Birmingham, AL4y exp
Regions BankUniversity of North Texas
View profile
CD

Mid-Level Software Development Engineer specializing in Healthcare IT and FinTech

California, USA4y exp
PfizerCal State Long Beach
View profile
SS

Intern Software Engineer specializing in distributed systems, cloud, and LLM/RAG data platforms

Palo Alto, California3y exp
AkashX.aiUniversity of Michigan-Dearborn
View profile
MK

Mid-level Machine Learning Engineer specializing in Generative AI, NLP, and MLOps

PA, USA4y exp
AllstateGannon University
View profile

Need someone specific?

AI Search