Vetted Apache Spark Professionals

Pre-screened and vetted.

NK

Senior Data Scientist and AI Engineer specializing in NLP, LLMs, and MLOps

Milwaukee, WI10y exp
CaterpillarWest Virginia University
View profile
WL

Senior Machine Learning Engineer specializing in GenAI, LLMs, and MLOps

Houston, TX11y exp
Paramount+University of Houston
View profile
RA

Senior AI/ML Engineer specializing in LLM, NLP, and production ML systems

Plano, TX11y exp
CignaUniversity of North Texas
View profile
AL

Senior Machine Learning Engineer specializing in GenAI, LLMs, and MLOps

Houston, TX11y exp
Paramount+University of Houston
View profile
RK

Mid-level Software Engineer specializing in distributed backend systems for FinTech

Los Angeles, CA5y exp
BlackRockCalifornia State University, Long Beach
View profile
DT

Mid-level Full-Stack Engineer specializing in cloud-native enterprise and FinTech systems

Sunnyvale, CA6y exp
WalmartCalifornia State University, East Bay
View profile
NT

Senior Full-Stack Developer specializing in FinTech microservices

Morristown, NJ8y exp
Valley Bank
View profile
SH

Senior AI Architect specializing in Generative AI and LLM systems

New York City, NY8y exp
Rezolve AI
View profile
MA

Senior Full-Stack Python Developer specializing in Django, FastAPI, and cloud platforms

Matamoras, Pennsylvania9y exp
Weston Chase
View profile
SS

Mid-level AI Engineer specializing in production LLM, RAG, and agentic AI systems

6y exp
Bank of America
View profile
ST

Senior AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems

7y exp
CVS Health
View profile
SD

Senior Data Scientist specializing in NLP, MLOps, and cloud ML platforms

Westfield Center, OH7y exp
Westfield Insurance
View profile
HS

Mid-level Java Full-Stack Developer specializing in cloud-native microservices

Dallas, TX4y exp
Baylor Scott & White
View profile
RG

Senior Full-Stack Java Developer specializing in Spring Boot microservices and cloud platforms

Orlando, FL10y exp
HD Supply
View profile
PK

Mid-level AI/ML Engineer specializing in NLP, GenAI, and MLOps in healthcare and finance

USA5y exp
CVS HealthUniversity of Houston

AI/ML engineer with CVS Health experience deploying production LLM systems in regulated healthcare settings, including a large-scale RAG solution (1M+ documents) built for compliance-grade, auditable policy/regulatory Q&A with strong anti-hallucination controls. Also delivered an NLP summarization system for physician notes/case narratives by partnering closely with non-technical care operations stakeholders and iterating via prototypes, dashboards, and feedback loops.

View profile
GS

Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG

Auburn Hills, MI4y exp
StellantisUniversity of Cincinnati

ML/NLP practitioner who built a retrieval-augmented generation (RAG) system for large financial and operational document sets using Sentence-Transformers (all-mpnet-base-v2) and a vector DB (e.g., Pinecone), with a strong focus on retrieval evaluation and chunking strategy optimization. Experienced in entity resolution (rules + embedding similarity with type-specific thresholds) and in productionizing scalable Python data workflows using Airflow/Dagster and Spark.

View profile
AR

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

3y exp
State FarmCleveland State University

Built a secure, on-prem/private GPT assistant to replace manual SharePoint-style search across thousands of policies/SOPs/engineering docs, using a production RAG stack (LangChain/LangGraph, FAISS/Chroma, PyMuPDF+OCR, vLLM). Implemented layout-aware ingestion (including table-to-JSON) and a multi-agent retrieval/generation/verification workflow with strong observability and compliance guardrails, delivering ~70% reduction in search time.

View profile
YL

Yurong Luo

Screened

Senior Data Scientist/ML Engineer specializing in scalable ML and LLM systems

Remote9y exp
dataAnnotationVirginia Commonwealth University

Built and deployed an end-to-end product that brings a research-paper approach into production for large-scale time-series clustering, with attention to partitioning, latency, and scalability. Also designed a Python-based backend validation service (comparing outputs to database ground truths) and handled production reliability issues by reproducing dataset-specific crashes and hardening corner-case behavior with client-friendly errors.

View profile
SN

Senior Data Engineer specializing in cloud data platforms and ML pipelines

Atlanta, GA8y exp
Berkshire HathawayUniversity of Alabama at Birmingham

Data engineer focused on AWS-based enterprise data platforms, owning end-to-end pipelines from multi-source batch/stream ingestion (Glue/Kinesis/StreamSets/Airflow) through PySpark transformations into curated datasets for Redshift/Snowflake. Emphasizes production reliability with strong monitoring/observability and data quality gates, and reports ~30% performance improvement plus improved SLAs and latency after optimization.

View profile
GK

Mid-level Backend Software Engineer specializing in cloud-native distributed systems (Healthcare IT)

USA3y exp
UnitedHealth GroupNJIT

Data engineer with healthcare domain experience who has owned end-to-end pipelines and APIs at UnitedHealth Group, processing ~8M records per batch. Strong focus on data quality (multi-layer validation), reliability (monitoring/logging, retries/idempotency), and performance (Spark/SQL tuning, caching), with experience standing up early-stage systems using Python, Docker, and CI/CD.

View profile
PS

Mid-level Data Engineer specializing in AWS lakehouse platforms and scalable ETL/ELT

Texas, USA4y exp
HumanaUniversity of Texas at Dallas

Data engineer focused on reliable, production-grade pipelines and data services: has owned end-to-end ingestion-to-serving workflows processing millions of records/day, using Airflow, Python/SQL, and PySpark. Demonstrates strong operational rigor (monitoring, retries, idempotency, backfills) and measurable outcomes (98% stability, ~30% faster processing), plus experience exposing curated warehouse data via versioned REST APIs.

View profile
VK

Varshitha K

Screened

Mid-level Data Engineer specializing in cloud data platforms and lakehouse architectures

Lakewood, CO4y exp
First BankUniversity of Central Missouri

Data engineer in a banking context who has owned end-to-end Azure lakehouse pipelines ingesting financial/vendor data from APIs, Azure SQL, and flat files into Databricks/Delta (bronze-silver-gold). Emphasizes production reliability via schema-drift validation, data quality controls, monitoring/alerting, retries/checkpointing, and Spark/Delta performance tuning, with outputs served to BI/reporting teams (e.g., Tableau).

View profile

Need someone specific?

AI Search