Reval Logo

Vetted PySpark Professionals

Pre-screened and vetted.

BG

Brian Gomez

Screened

Senior Machine Learning Engineer specializing in AI/ML, NLP, and computer vision

United States12y exp
McKinsey & CompanyCornell University

McKinsey & Company ML/NLP practitioner who builds production-grade AI systems across sectors (notably healthcare and finance), including RAG/LLM solutions, entity resolution pipelines, and embedding-powered search with vector databases. Demonstrated measurable impact (40% reduction in data duplication) and strong MLOps/data workflow practices (Airflow, MLflow, Spark, AWS/GCP, Prometheus, CI/CD).

View profile
DG

Mid-level Full-Stack Developer specializing in Java/Spring Boot and React

Seattle, WA5y exp
ShopifySaint Louis University

NVIDIA engineer who built and shipped a production LLM-powered enterprise knowledge system (summarization, transcription, and Q&A) that cut document retrieval time ~30%. Deep hands-on experience with RAG (FAISS/Pinecone), GPU-accelerated microservices on AWS, and reliability/safety practices (Guardrails AI, prompt A/B testing, canary releases) plus strong MLOps orchestration across Airflow, Step Functions, and Kubernetes GitOps.

View profile
SF

Executive AI/IoT Engineering Leader specializing in full-stack and edge AI systems

San Luis Obispo, CA20y exp
AI LabCornell University
View profile
SN

Mid-Level Backend Software Engineer specializing in payments and real-time analytics

CA5y exp
StripeSaint Louis University
View profile
MM

Senior Data Scientist / ML Engineer specializing in LLMs, generative AI, and MLOps

New York, NY7y exp
MetaColumbia University
View profile
YW

Senior Research Scientist specializing in LLM verification and fraud/risk modeling

San Mateo, CA10y exp
UpstartStanford University
View profile
LS

Senior Full-Stack Python Engineer specializing in cloud microservices and AI/LLM systems

Redmond, WA13y exp
MicrosoftMontclair State University
View profile
VS

Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and scalable inference

Seattle, WA6y exp
MetaNortheastern University
View profile
VK

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable MLOps

Cupertino, CA5y exp
OpenAIUniversity of North Carolina at Charlotte
View profile
SS

Mid-level Full-Stack Software Engineer specializing in FinTech analytics and security

San Francisco, CA6y exp
StripeMontclair State University
View profile
AB

Mid-level Software Engineer specializing in backend APIs, data pipelines, and cloud microservices

CA, USA6y exp
NVIDIAConcordia University Wisconsin
View profile
AM

Mid-level Software Engineer specializing in full-stack and distributed backend systems

San Francisco, CA5y exp
StripeSaint Louis University
View profile
HV

Senior Data Engineer specializing in cloud-native data platforms and streaming pipelines

7y exp
GoogleUniversity of Cincinnati
View profile
SO

Mid-level AI/ML Engineer specializing in LLMs, multilingual NLP, and low-latency MLOps

CA, USA6y exp
MetaClarkson University
View profile
NV

Senior AI/ML Engineer specializing in LLM agents, RAG, and production ML systems

San Francisco, CA7y exp
OpenAISaint Louis University
View profile
MC

Senior Software Engineer specializing in AI for Healthcare and Enterprise SaaS

Seattle, WA9y exp
Amazon
View profile
SC

Mid-level AI/ML Engineer specializing in Generative AI, LLM alignment, and RAG

CA6y exp
Scale AIUniversity of Texas at Arlington

Built and productionized a real-time enterprise RAG pipeline to improve factual accuracy and reduce LLM hallucinations by grounding responses in constantly changing internal knowledge bases (policies, manuals, FAQs). Experienced in orchestrating end-to-end ML workflows (Airflow/Kubernetes), handling messy multi-format data with schema enforcement (Pydantic/Hydra), and maintaining freshness via streaming incremental embeddings plus batch refresh. Also delivers applied ML solutions with non-technical teams (marketing/CRM) for segmentation and personalized engagement.

View profile
KC

Mid-level Data Engineer specializing in AI/ML platforms and cloud data pipelines

USA4y exp
MetaTexas Tech University

Built and shipped an LLM-powered data quality assistant that generates maintainable validation checks from metadata while executing validations via Great Expectations, exposed through FastAPI and integrated into Airflow-managed pipelines. Emphasizes production reliability (structured outputs, guardrails, monitoring, versioning, human review) and works closely with compliance/operations teams to deliver clear, auditable, user-friendly AI outputs.

View profile
AB

Junior Data Scientist specializing in Generative AI and agentic LLM systems

San Jose, CA1y exp
SAPUniversity of Pennsylvania

LLM/agentic-systems builder who has shipped production tools for investment research and procurement insights, including a company screener that processes thousands of conference-listed companies using FireCrawl + Google Search + Gemini. Demonstrates strong orchestration expertise (LangGraph multi-agent graphs), performance optimization (async/batching to sub-30s), and pragmatic reliability/evaluation practices with stakeholder-friendly UX (real-time cost tracking and model/parameter toggles).

View profile
YX

Yihao Xie

Screened

Senior Backend Engineer specializing in Python and AWS serverless systems

Austin, TX3y exp
AmazonTexas A&M University

Backend/data engineer with Amazon supply-chain experience building production serverless Python services and ETL pipelines on AWS (Lambda, API Gateway, S3, RDS, Glue). Has modernized legacy SAS jobs into Python with rigorous parity testing and phased migrations, and has delivered major SQL performance gains (minutes down to seconds) through indexing and partitioning.

View profile
AM

Alex M Lee

Screened

Staff Full-Stack Engineer specializing in Healthcare AI and FinTech payments

Irving, TX9y exp
Oscar HealthUniversity of Texas at Dallas

Backend/data engineer from Oscar Health specializing in healthcare claims systems on AWS. Built HIPAA-compliant real-time services (FastAPI/Postgres/Kafka on EKS) and serverless ingestion pipelines, and led modernization of a legacy SAS claims pricing system to Python/Spark with rigorous parity validation. Demonstrated measurable impact with high uptime/low latency services and major Snowflake performance and cost reductions.

View profile

Need someone specific?

AI Search