Reval Logo

Vetted PySpark Professionals

Pre-screened and vetted.

SB

Mid-Level Full-Stack Software Engineer specializing in FinTech and data platforms

Jersey City, NJ4y exp
Capital OneUniversity of Cincinnati
View profile
AP

Mid-level Software Engineer specializing in FinTech payments and compliance systems

San Francisco, CA6y exp
StripeUniversity of Memphis
View profile
HO

Mid-level Machine Learning & Data Engineer specializing in MLOps and cloud data platforms

San Francisco, CA4y exp
Blue River TechnologyUC Berkeley
View profile
DS

Senior Data Engineer specializing in cloud data platforms and scalable ETL pipelines

Rosharon, TX11y exp
AssistRxUniversity of Texas at Austin
View profile
BK

Mid-level Data Engineer specializing in real-time streaming and ML feature pipelines

Atlanta, GA6y exp
LyftGrand Valley State University
View profile
SK

Mid-level AI & Machine Learning Engineer specializing in computer vision and MLOps

United States6y exp
NVIDIAUniversity of Massachusetts Lowell
View profile
SK

Mid-level AI/ML Engineer specializing in production ML, NLP, and computer vision

USA6y exp
UberUniversity of Maryland, Baltimore County
View profile
MB

Mid-Level Software Engineer specializing in AI, cloud-native microservices, and full-stack systems

CO, USA5y exp
HubSpotCalifornia State University, Fullerton
View profile
ME

Principal AI Platform Architect specializing in agentic AI and enterprise LLM infrastructure

Sunnyvale, CA21y exp
CrowdStrikeUniversity of Massachusetts Boston
View profile
VB

Mid-level AI/ML Engineer specializing in GenAI, MLOps, and big data on cloud platforms

USA5y exp
DatabricksAuburn University at Montgomery
View profile
TS

Senior Data Engineer specializing in healthcare ETL/ELT and ML

Pasadena, CA12y exp
Doheny Eye InstituteUniversity of Texas at Austin
View profile
MG

Senior Software Developer specializing in Python, AWS, and Big Data

Chicago, Illinois11y exp
CVS HealthCal Poly San Luis Obispo
View profile
VS

Mid-level AI Data Engineer specializing in real-time streaming and LLM-powered fraud analytics

California, USA6y exp
PayPalCalifornia State University, East Bay
View profile
LT

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA5y exp
City and County of San FranciscoSan Francisco State University

Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).

View profile
BP

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS10y exp
TuringMississippi State University

Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.

View profile
RR

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

Remote, USA5y exp
McKinsey & CompanyUniversity of North Texas

GenAI/LLM engineer and architect who built and deployed a production generative AI financial forecasting and scenario analysis platform at McKinsey, leveraging Claude (Anthropic), LangChain, Airflow, MLflow, and AWS SageMaker. Demonstrates strong LLMOps/MLOps rigor (monitoring, drift detection, automated retraining) and deep experience implementing global privacy controls (GDPR, differential privacy, audit trails) while partnering closely with finance executives and legal/IT stakeholders.

View profile
TS

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX10y exp
TredenceUniversity of Texas at Austin

ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).

View profile
RM

Mid-level Software Engineer specializing in cloud-native distributed systems and streaming data

Austin, TX7y exp
TeslaGeorge Mason University

Backend/product engineer with Tesla experience building and operating a real-time OTA update monitoring and fleet analytics platform at massive scale (telemetry from 3M+ vehicles). Delivered end-to-end systems across Kafka-based ingestion, TimescaleDB/Postgres analytics modeling, FastAPI/GraphQL APIs, and React/TypeScript dashboards, and handled production scaling incidents on AWS EKS during major rollout spikes.

View profile
YL

Yuqi Lei

Screened

Mid-level Software Engineer specializing in financial data platforms and quantitative research tooling

New York City, NY3y exp
BloombergWashington University in St. Louis

Owned and built Bloomberg’s end-to-end bitemporal dividend & dividend-forecast data platform powering BQL for 400k+ terminal users. Architected real-time Kafka ingestion (5k–10k msgs/sec) across 100k+ tickers with strong correctness guarantees (PIT/bitemporal time-travel, immutable history to avoid look-ahead bias) and achieved sub-100ms p95 query latency through indexing and caching, deployed with Kubernetes + DLQ and robust monitoring.

View profile
JA

Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps

McKinney, TX6y exp
Globe LifeTexas A&M University

Built a production LLM/RAG-based “model excellence scoring” system at Uber to automatically evaluate hundreds of ML models, standardizing quality assessment and cutting evaluation time from days to minutes on GCP. Also delivered an NLP document classification solution for insurance claims at Globe Life, partnering closely with compliance/operations and improving routing accuracy from ~85% manual to 93% with the model.

View profile

Need someone specific?

AI Search