
Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark, Python, Docker, SQL, AWS, CI/CD

David Finol

Principal DevOps/SRE Engineer specializing in multi-cloud infrastructure and DevSecOps

Houston, TX | 12y exp
Comcast | University of Texas at Austin
AWS, Microsoft Azure, Serverless computing, AWS Lambda, Azure Functions, CI/CD, +175

Pulkit Bhardwaj

Senior Data Scientist specializing in Generative AI, NLP, and MLOps

San Bruno, CA | 10y exp
Walmart | Purdue University
Machine Learning, Artificial Intelligence, Deep Learning, Generative AI, Large Language Models (LLMs), GPT, +86

Kushal Shah

Senior Applications Engineer specializing in ERP Financial Systems and GenAI automation

CA, USA | 5y exp
Oracle | NYU
Agile, Apache Airflow, Apache Kafka, Apache Spark, BERT, BigQuery, +55

Bhanu Chander

Senior Data Engineer specializing in cloud data platforms and real-time pipelines

New York, NY | 6y exp
Disney | Indiana Wesleyan University
Python, SQL, Scala, C#, JavaScript, Java, +134

Gopi CHINTHAGUMPULA

Mid-level Software Engineer specializing in backend microservices and cloud-native systems

USA | 4y exp
Uber | Texas A&M University–Kingsville
Java, Kotlin, Python, TypeScript, SQL, Swift, +92

Jahnavi Koneni

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

San Antonio, TX | 4y exp
USAA | Clark University
Python, SQL, Scala, Shell Scripting, Java, Apache Spark, +98

Sainath Myakala

Mid-level Data Engineer specializing in AI/ML data platforms and real-time streaming

Arkansas, USA | 6y exp
Walmart | University of Central Missouri
Python, Java, Scala, Shell Scripting, SQL, Apache Kafka, +79

Sai Meka

Mid-level Data Engineer specializing in cloud lakehouse and streaming pipelines

California, USA | 5y exp
JPMorgan Chase | California State University, Fullerton
Python, Java, Scala, R, Shell Scripting, SQL, +97

Supriya Chopra

Executive Product & Technology Leader specializing in AI and healthcare platforms

14y exp
Healthy Vignettes
Product Management, Program Management, Data Governance, Go-to-Market Strategy, SaaS, Large Language Models (LLMs), +60

Abhishikth Meesala

Screened | References | Strong rec.

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and fraud detection

Dallas, TX | 4y exp
PwC | Campbellsville University

“At PwC, built and productionized an agentic RAG enterprise search assistant over 6M internal documents (8M embeddings), deployed across AWS and GCP. Drove major retrieval gains (72%→92% precision via BM25+dense hybrid with RRF and cross-encoder re-ranking), reduced hallucinations 30%, achieved <2s latency at 50–60K queries/month, and cut support tickets 30%—boosting adoption to 2,500 users by adding source-cited answers.”

A/B Testing, Agile, Anomaly Detection, Apache Airflow, Apache Spark, Auto-scaling, +135

Matthew Mullins

Screened

Senior Software Engineer specializing in AI/ML backend and cloud infrastructure

Bentonville, AR | 11y exp
Walmart | University of Houston

“Backend/data platform engineer with production experience at Walmart and Molina Healthcare, building Python microservices on AWS (EKS + Lambda) for real-time inventory and recommendation systems. Strong in reliability/observability and incident leadership, plus modernizing legacy healthcare workflows and building resilient AWS Glue/PySpark pipelines with schema evolution and data quality controls.”

Python, FastAPI, Django, SQLAlchemy, PySpark, SQL, +132

Varaprasad Bathula

Screened

Intern AI/ML Engineer specializing in LLM applications and data infrastructure

Redmond, Washington, USA | 3y exp
Uber | University of Memphis

“Hands-on LLM practitioner who built a production document-processing pipeline in Python, tackling long-document handling and latency with chunking/batching and a user-driven correction feedback loop. Experienced operationalizing AI workflows with Kubernetes (CronJobs, autoscaling, scheduled data cleaning and weekly retraining) and applying structured testing/evaluation (E2E, LLM-as-judge, HITL) while communicating solutions clearly to non-technical clients using visual diagrams.”

Python, C, C++, C#, Java, SQL, +110

Hari Kiran Reddy Rommala

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud-native data platforms

Austin, TX | 5y exp
Northeastern University | Penn State University

“LLM/agentic systems practitioner who specializes in moving customer prototypes into production within microservices environments, emphasizing reliability, latency, security, and measurable success metrics. Experienced in real-time troubleshooting using logs/traces and in enabling adoption through hands-on developer workshops (including live coding in Java Spring Boot) and pre-sales POCs that address technical objections and integration risk.”

Alerting, Amazon API Gateway, Amazon CloudWatch, Amazon DynamoDB, Amazon EC2, Amazon Kinesis, +181

Swastik Chowdhury

Screened

Mid-level Data Engineer specializing in cloud data platforms and real-time streaming

5y exp
Vertisage Technologies | Carnegie Mellon University

“Worked on onboarding a Middle East logistics client processing thousands of invoices/month, building a production-ready pipeline that routes known vendor PDFs to deterministic regex parsers via Tax ID matching and falls back to LlamaParse for unknown layouts. Added financial consistency validation plus human-in-the-loop review and logging/metrics to continuously reduce LLM usage and improve template coverage.”

Python, Scala, Java, SQL, React, MySQL, +96

Siliang Zhang

Screened

Intern Machine Learning Engineer specializing in LLMs, RAG, and vision-language systems

Shanghai, China | 2y exp
Carizon | USC

“Robotics ML/software engineer focused on Vision-Language-Action control for 7-DoF robots, replacing tokenized action decoding with continuous regression heads (including a logit-weighted expectation approach) to improve stability and real-time behavior. Strong in ROS1/ROS2 systems integration and debugging closed-loop manipulation issues via latency instrumentation, QoS-aware distributed messaging, and sim-to-real validation using Gazebo/Unity, Docker, and CI pipelines.”

Python, C++, Java, JavaScript, Go, SQL, +137

Shankar Koduvayur Ramaswami

Screened

Mid-level Machine Learning Engineer specializing in industrial deep learning and predictive control

Houston, TX | 5y exp
oPRO.ai | Carnegie Mellon University

“AI engineer building and deploying deep-learning-based optimization/control systems for petrochemical plants, with a focus on maintaining operational stability under real-world constraints. Core contributor to model and inference design; introduced a stability-focused non-linear objective and sped up second-layer optimization via on-the-fly first-order approximations. Experienced using Kubernetes for end-to-end testing and effective in translating customer expectations into measurable evaluation plots for non-technical stakeholders.”

Algorithms, Amazon EMR, Apache Kafka, Apache Spark, AWS, Bash, +75

Shayan Shokri

Screened

Intern ML Engineer specializing in LLMs and NLP research

Seattle, WA | 0y exp
Truveta | City University of New York

“ML/LLM practitioner with experience at Truveta building an LLM-based evaluation framework; identified non-overlapping evaluator failure modes and proposed an ensemble approach that enabled scaling training data and drove ~5% performance gains across multiple internal projects. Strong focus on robustness to distribution shift (augmentation/domain adaptation/meta-learning) and production reliability via monitoring, drift detection, and safe fallbacks.”

Machine Learning, Deep Learning, Large Language Models (LLMs), Generative AI, Model Evaluation, Reinforcement Learning, +70

Priyanshu Maurya

Screened

Mid-level Data Scientist specializing in insurance, finance, and healthcare analytics

New York, NY | 3y exp
MetLife | Rowan University

“Built and productionized LLM-driven sentiment scoring for earnings call transcripts at Goldman Sachs, replacing legacy NLP to deliver a cleaner trading signal while managing latency/cost via batching, caching, and distilled models. Also implemented an Airflow-orchestrated fraud modeling pipeline at MetLife with drift-based retraining and SageMaker deployment, and has a disciplined evaluation/rollout framework for reliable AI workflows.”

Anomaly Detection, AWS, AWS Glue, BigQuery, Blue/Green Deployment, CI/CD, +105

Aarati Dulal

Screened

Senior Full-Stack Java Engineer specializing in cloud-native microservices

Dallas, TX | 6y exp
Goldman Sachs | Avila University

“Backend/platform engineer who owned high-volume Java/Spring Boot microservices on AWS (Kafka + RDS/DynamoDB) and has hands-on experience debugging complex production latency incidents across DB, JVM/GC, and async consumers. Also shipped applied AI features for ops, including an LLM-powered log analysis assistant and an incident-response agent with strong safety guardrails (schema-validated tool use, retries/backoff, and human-in-the-loop escalation).”

Java, Python, JavaScript, TypeScript, SQL, PL/SQL, +155

Aigo Madakimova

Screened

Senior Data Analyst specializing in audit analytics, automation, and financial data platforms

Malvern, PA | 6y exp
Vanguard | NYU

“Full-stack engineer with strong Next.js App Router + TypeScript experience who built and owned a production internal analytics dashboard end-to-end, including server-component data fetching, route handlers for secure proxying, and post-launch monitoring/caching fixes. Also designed Postgres data models and performance-tuned analytics queries, and built reliable BullMQ/Redis-based order-fulfillment workflows with idempotency, retries, and compensating refunds—comfortable operating with high ownership in early-stage teams.”

Python, Java, C, C++, SQL, Eclipse, +106

Haider Shah

Screened

Principal AI/ML Architect specializing in GenAI, LLMs, RAG, and Agentic AI

California, USA | 13y exp
Pinecone | Preston University

“FinTech/AI engineer who has shipped an end-to-end discrepancy-detection product for financial managers using Next.js, FastAPI/GraphQL, Pinecone, and AWS (with dev/staging/prod, observability, A/B testing, and documentation). Also built an AI-native ‘AI Genesis’ system with agentic cyclic workflows, routing, and tool use, and has experience modernizing legacy systems via the strangler fig pattern while coordinating with senior stakeholders on a 5G autonomous simulation platform.”

Python, Java, Scala, Go, C++, Bash, +162

Ryan McDowell

Screened

Senior Software Engineer specializing in pricing, marketplaces, and data engineering

Remote | 9y exp
Ballast Point Analytics | University of Chicago

“Built and operationalized intelligent pricing infrastructure for live event ticketing at StubHub, emphasizing iterative prototyping with traders and production-grade monitoring (Splunk, API/data-stream thresholding). Also partnered with customer-facing teams to drive adoption and helped win a significant consignment revenue-share deal by demoing the system to the Philadelphia 76ers and quantifying pricing efficacy and business impact.”

PHP, TypeScript, Python, Pandas, NumPy, SciPy, +74

Yash Vishe

Screened

Junior Software Engineer specializing in LLM systems, data engineering, and ML

San Diego, CA | 2y exp
San Diego Supercomputer Center | UC San Diego

“Backend/ML systems engineer with experience at SDSC, UCSD, and Media.net, building production semantic dataset/model discovery using embeddings + Solr KNN and LLM-based intent/reranking at 5M+ dataset scale. Emphasizes offline/online separation for predictable serving, has delivered measurable gains (23% retrieval accuracy, 38% latency reduction) and helped secure a $3M+ NSF grant.”

Anomaly Detection, Apache Spark, AWS, BigQuery, C, C++, +97

Sayuj Shah

Screened

Mid-level Data Analyst & AI Practitioner specializing in ML, LLMs, and analytics platforms

Schaumburg, IL | 4y exp
U.S. Cellular | Georgia Tech

“Data Analyst at U.S. Cellular who built production LLM solutions, including a Tableau-embedded chatbot that converts natural language questions into Oracle SQL and returns actionable KPI insights for non-technical users. Also authored MAD-CTI, a multi-agent LLM system for dark web hacker forum threat intelligence (published in IEEE Access) that outperformed single-agent approaches by 14%.”

Python, SQL, PostgreSQL, R, MATLAB, JavaScript, +92
