Reval Logo
Home Browse Talent Skilled in Apache Spark

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache SparkPythonDockerSQLAWSCI/CD
MG

Manny Garcia

Senior Software Developer specializing in Python, AWS, and Big Data

Chicago, Illinois11y exp
CVS HealthCal Poly San Luis Obispo
PythonJavaScriptTypeScriptSQLShell ScriptingBash+94
View profile
AG

Ankit Gundewar

Intern AI/ML Engineer specializing in generative AI and multimodal agentic systems

Boston, MA1y exp
NTT DATANortheastern University
AgileApache SparkAWSAzure FunctionsCC+++108
View profile
AR

AROKIA R ARPUTHAM

Executive VP of Engineering specializing in FinTech platforms, cloud modernization, and AI/ML

New York, NY20y exp
Genesis Wealth & Asset ManagementJönköping University
AWSMicrosoft AzureKubernetesDockerServerless ArchitectureLoad Balancing+133
View profile
RK

Richard Kerr

Executive AI Engineering Leader specializing in research-to-production LLM systems

Las Vegas, NV22y exp
The Spodio GroupWharton School
Automated TestingAWSCI/CDCloud-Native ArchitectureComputer VisionData Governance+63
View profile
FS

Fabian Schonholz

Executive Technology & Data Leader specializing in AI/ML strategy and digital transformation

Salt Lake City, UT13y exp
AlchemeeUC Santa Barbara
AnsibleAWSBudgetingCI/CDCloud ComputingCompliance+99
View profile
KA

Kristopher Ali

Senior Data Engineer specializing in cloud lakehouse platforms and healthcare data

Remote13y exp
DeloitteUniversity of Michigan
PythonSQLScalaBashJavaR+180
View profile
JH

Jiajun Huo

Mid-Level Software Engineer specializing in data infrastructure and LLM applications

Remote3y exp
H60 ConsultingUniversity of Illinois Urbana-Champaign
PythonTypeScriptJavaScriptGoSQLJava+65
View profile
TS

Tyler Swanson

Senior AI/ML Engineer specializing in production AI systems for healthcare and finance

Austin, TX13y exp
AspirusUniversity of Texas at Austin
PythonScalaSQLJavaC++TensorFlow+72
View profile
VS

Vudityala Srinidh

Mid-level AI Data Engineer specializing in real-time streaming and LLM-powered fraud analytics

California, USA6y exp
PayPalCalifornia State University, East Bay
PythonSQLPostgreSQLBigQueryPySparkApache Spark+102
View profile
AD

Andrew Dyer

Senior AI/Machine Learning Engineer specializing in RAG and MLOps

Odessa, TX8y exp
DataRobotJohns Hopkins University
A/B TestingAgileApache KafkaApache SparkAWSCI/CD+36
View profile
RB

Rajan Bhargav Souda

Mid-level Generative AI Engineer specializing in LLMs, NLP, and multimodal systems

St. Louis, MO6y exp
BJC HealthCareNorthwest Missouri State University
PythonSQLBashPyTorchTensorFlowKeras+94
View profile
KZ

Khushang Zaveri

Intern AI Researcher specializing in NLP, multimodal AI, and medical ML

2y exp
Johns Hopkins UniversityJohns Hopkins University
Apache HadoopApache HiveApache SparkBashBERTChromaDB+80
View profile
SK

Sumanth Kumar Sri perumbuduri

Mid-level Full-Stack Software Engineer specializing in cloud-native and AI-driven applications

6y exp
Fidelity InvestmentsUniversity of Texas at Dallas
PythonJavaJavaScriptTypeScriptC#Node.js+119
View profile
VP

Vismay Patel

Screened

Senior AI & Machine Learning Engineer specializing in NLP, GenAI, and MLOps

Berkeley, CA7y exp
Kaiser PermanenteSan Francisco State University

“ML/GenAI practitioner with healthcare domain depth who built and deployed a production cervical-cancer EMR classification system using a hybrid rules + medical BERT approach, optimized for high recall under severe class imbalance and PHI constraints. Experienced running end-to-end production ML/LLM pipelines with Apache Airflow (validation, promotion/rollback, monitoring, retraining) and partnering closely with clinicians to calibrate thresholds and implement human-in-the-loop review.”

PythonSQLJavaGoJavaScriptREST APIs+121
View profile
DV

Devisri Veeramachaneni

Screened

Senior Software Engineer specializing in cloud backend systems and LLM-powered agents

Seattle, WA5y exp
AmazonSan José State University

“Amazon Fire TV Devices engineer who built and shipped a production LLM-powered lab triage and validation system that grounds recommendations in internal runbooks/known-issue data and pushes evidence-based actions via dashboards and Slack. Emphasizes safety and measurability with structured JSON outputs, replay-based evaluation on historical incidents, and production metrics (e.g., disagreement rate and time-to-first-action), plus cost/latency optimizations like caching, batching, and rule-based fast paths.”

PythonJavaJavaScriptTypeScriptC++Bash+130
View profile
NK

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and fraud/risk analytics in Financial Services

PA, USA4y exp
Capital OneRobert Morris University

“Built and shipped a production-grade GenAI Fraud & Compliance Investigation Copilot for a large US bank, integrating OCR docs, structured data, and prior case history to generate grounded, regulator-friendly summaries and red-flag highlights. Demonstrates strong end-to-end LLM systems engineering (LangGraph/LangChain, hybrid retrieval with FAISS+BM25, guardrails/citations, streaming/latency optimization) plus rigorous evaluation and close partnership with compliance stakeholders.”

A/B TestingAnomaly DetectionApache HadoopApache HiveApache KafkaApache Spark+137
View profile
VD

Varshith Dupati

Screened

Mid-level Software Engineer specializing in AWS, full-stack development, and AI data systems

Seattle, Washington3y exp
AmazonArizona State University

“Backend engineer who built a Python-based data profiling/statistics platform processing up to 50M rows and ~300 metrics, using a DAG execution model, multithreading, and smart caching to cut processing time by up to 70%. Also improved PostgreSQL query performance from 12s to 2s via indexing/query rewrites, integrated an LLM (LangChain + OpenAI) for explainable “chat with the pipeline” functionality, and designed an AWS EC2+SQS architecture for scalable, isolated per-user processing.”

JavaJUnitSpring BootPythonCC+++84
View profile
LT

Leela Tikkisetty

Screened

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA5y exp
City and County of San FranciscoSan Francisco State University

“Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).”

A/B TestingAgileAmazon BedrockAmazon EKSAmazon RedshiftAuthentication+198
View profile
BP

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS10y exp
TuringMississippi State University

“Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.”

PythonRSQLPandasNumPyScikit-learn+132
View profile
RR

Rushi Reddy Lambu

Screened

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

Remote, USA5y exp
McKinsey & CompanyUniversity of North Texas

“GenAI/LLM engineer and architect who built and deployed a production generative AI financial forecasting and scenario analysis platform at McKinsey, leveraging Claude (Anthropic), LangChain, Airflow, MLflow, and AWS SageMaker. Demonstrates strong LLMOps/MLOps rigor (monitoring, drift detection, automated retraining) and deep experience implementing global privacy controls (GDPR, differential privacy, audit trails) while partnering closely with finance executives and legal/IT stakeholders.”

PythonSQLRJavaC++Bash+192
View profile
TS

Travoy Spelling

Screened

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX10y exp
TredenceUniversity of Texas at Austin

“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”

A/B TestingAPI DevelopmentAWSAWS LambdaAWS Step FunctionsAzure Data Factory+247
View profile
1...222324...118

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersAI EngineersEngineeringAI & Machine LearningData & AnalyticsEducation

Need someone specific?

AI Search