Reval Logo
Home Browse Talent Skilled in Apache Spark

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache SparkPythonDockerSQLAWSCI/CD
RC

Rohan Chickalkar

Senior Data/GenAI Engineer specializing in cloud-native ML, RAG, and real-time data platforms

Richardson, TX8y exp
ToyotaTexas A&M University
PythonScalaJavaRSQLShell Scripting+178
View profile
PF

Patrick Ford

Senior Data Engineer specializing in BI Analytics and AI/ML

Lighthouse Point, FL11y exp
DropboxJacksonville University
PythonSQLScalaJavaShell ScriptingBash+110
View profile
DS

Dhruv Susheelkar

Junior AI/ML Engineer specializing in agentic AI and cloud optimization

Cupertino, CA1y exp
AdvantisUC San Diego
PythonGoJavaC++CSQL+71
View profile
AB

Amandeep Bhullar

Mid-Level Python Developer specializing in Django, data pipelines, and automation

Sunnyvale, CA5y exp
AppleI.K. Gujral Punjab Technical University
PythonDjangoSQLPySparkApache SparkETL+46
View profile
SN

Sai Navyanth Penumaka

Mid-level Software Development Engineer specializing in backend systems and ML platforms

New York, USA2y exp
FlipkartNYU
JavaCC++PythonScalaSQL+82
View profile
SV

Suhuruth Veeramalla

Mid-level AI/ML Engineer specializing in recommendation, retrieval, and MLOps

San Francisco, CA5y exp
MetaConcordia University
PythonPyTorchTensorFlowScikit-learnNumPyPandas+127
View profile
SN

Sharath Nyalakonda

Screened

Mid-level AI/ML Engineer specializing in NLP, graph models, and MLOps for FinTech and Healthcare

Remote, USA5y exp
StripeKent State University

“AI/ML engineer who has deployed production LLM/transformer-based systems for merchant intelligence and fraud/support optimization, delivering +27% merchant engagement and +18% payment success. Deep experience in privacy-preserving, PCI DSS-compliant data/ML pipelines (Airflow, AWS Glue, Spark, Delta Lake) and scalable microservices on Kubernetes, plus proven cross-functional delivery in healthcare claims analytics at UnitedHealth Group (12% HEDIS claim reduction).”

PythonpandasspaCyRSQLPySpark+185
View profile
PP

Poorna Pedapudi

Screened

Mid-Level Software Engineer specializing in distributed backend systems and cloud-native microservices

Seattle, WA5y exp
UberGeorge Mason University

“Software engineer focused on data platforms and applied LLM systems: built an internal data quality monitoring layer to catch silent data drift and iterated post-launch after finding ~30% false-positive alerts, reducing noise via dynamic baselines and improved structured logging. Also shipped a production RAG-based internal knowledge assistant over Jira/Confluence with citations, confidence-based fallbacks, and nightly automated evals to prevent regressions.”

GoPythonJavaJavaScriptTypeScriptC+++115
View profile
SS

Shuju Sun

Screened

Mid-Level Software Engineer specializing in real-time data pipelines and ML deployment

PA, USA4y exp
VanguardUSC

“Ticketmaster data engineer who built CDC-driven Kafka pipelines feeding Snowflake for analytics and data science teams. Hands-on in production operations—scaled Kafka during sudden playoff-driven transaction spikes and improved monitoring for preemptive scaling. Known for using small-batch experiments and quantitative metrics to align stakeholders and drive cost-saving architecture changes (e.g., buffering to reduce AWS Lambda invocation frequency).”

PythonJavaCC++ScalaGo+132
View profile
KR

Kaustubh Rai

Screened

Junior Software Engineer specializing in scalable distributed systems and cloud platforms

Pittsburgh, PA2y exp
eParts Services LLCCarnegie Mellon University

“Backend engineer with experience at UnitedHealth Group redesigning a high-traffic Spring Boot microservice from blocking to reactive architecture during peak season, cutting median latency by 47% for a service used by ~10M customers annually. Strong in Kubernetes-based deployment/scaling and pragmatic rollout strategies (blue-green/incremental traffic shifting) with performance and database troubleshooting.”

.NETApache HadoopApache KafkaAWSAWS LambdaAzure Data Factory+70
View profile
DS

Darsh Sharma

Screened

Mid-level Software Engineer specializing in ML systems and microservices

Madison, WI2y exp
TeradataUniversity of Wisconsin–Madison

“Teradata Text Security intern who built a production LLM-powered planner agent that decomposes complex tasks into dependency-aware subtasks (DAG/topological graph) and executes them via a custom orchestrator with parallelism, status tracking, and error handling. Also contributed to an HR-facing internal document chatbot concept to streamline onboarding, showing cross-functional collaboration.”

CC++CUDAPythonJavaSQL+101
View profile
SL

Sri Lekha Kandadai

Screened

Mid-level Machine Learning Engineer specializing in MLOps and multimodal AI

KS, USA5y exp
AppleUniversity of Central Missouri

“ML/AI engineer focused on production-grade model reliability: built a monitoring and validation framework to detect drift, trigger anomaly alerts/retraining, and maintain consistent performance for device intelligence workflows at scale. Strong MLOps background with Python pipelines, Docker/Kubernetes deployments, Airflow orchestration, and real-time monitoring dashboards; experienced partnering with product managers to deliver business-facing insights.”

PythonSQLRC++JavaMachine Learning+80
View profile
PP

Pranav Puranik

Screened

Senior AI Engineer specializing in LLMs, RAG, and multimodal NLP

Austin, TX5y exp
Health Care Service CorporationUniversity of Florida

“Built a production LLM/RAG assistant for insurance/health claims agents that ingests 100–200 page patient PDFs via OCR (migrated from local Tesseract to Azure Document Intelligence) and delivers grounded claim detail retrieval plus summaries with PII/PHI guardrails. Experienced orchestrating large workflows with Celery worker pipelines and AWS Step Functions (S3-triggered, Fargate-based batch inference/accuracy aggregation), and collaborates closely with non-technical SMEs (claims agents/nurses) through shadowing, iterative demos, and SME-defined evaluation.”

PythonSQLJavaTypeScriptBashUnix+120
View profile
JY

Jiacheng Yin

Screened

Intern Software Engineer specializing in data engineering and AI agent systems

Beijing, China1y exp
JD.comCornell University

“AI engineer at Anote.ai who built and shipped a production multi-agent LangGraph/LangChain/Ray RAG platform for enterprise search and workflow automation, supporting 3 commercial products and 100+ developers. Drove measurable gains (30% accuracy improvement, lower latency) and improved reliability with Redis-based state checkpointing, message-queue synchronization, and Milvus retrieval optimizations, while partnering with PMs/clients to add transparency features like confidence scores and real-time logs.”

.NETAgileAmazon CloudFrontAngularAnomaly detectionAPI Gateway+158
View profile
MS

Manjory saran

Screened

Senior Backend & Infrastructure Engineer specializing in cloud-native distributed systems

5y exp
WalmartSan José State University

“LLM infrastructure engineer who built a production-critical real-time personalization and memory retrieval system for a user-facing product, adding <100ms P99 latency while improving relevance ~20–25% and holding SLA through 3x traffic. Experienced designing tiered retrieval backends (Redis + vector store), deploying on Kubernetes with autoscaling/circuit breakers, and running rigorous observability, incident response, and agent evaluation (shadow traffic, A/B tests, regression/replay).”

API DesignAsynchronous ProcessingAWSAWS CloudFormationCachingCI/CD+105
View profile
FA

Feras Alsaiari

Screened

Senior Software Engineer specializing in AWS data platforms and event-driven systems

4y exp
Capital OneGeorgia Tech

“Capital One engineer leading the architecture and delivery of a large-scale AWS Glue/Spark/Delta Lake batch messaging pipeline that decoupled batch from real-time flows, added multi-region failover and automated retries, and delivered ~40% AWS cost savings with ~3x performance gains. Currently building an LLM-powered Slack bot using RAG to automate message investigations by querying CloudWatch, Snowflake, and internal documentation with privacy-aware masking of NPI/PII.”

PythonJavaJavaScriptSQLCC+++91
View profile
PK

priya kotha

Screened

Mid-level Data Engineer specializing in real-time pipelines across FinTech and Healthcare

USA, USA4y exp
PlaidSacred Heart University

“Data engineer at Plaid who built greenfield, end-to-end real-time transaction pipelines and FastAPI data services for fraud detection and analytics, handling millions of events per day. Strong focus on reliability and data integrity via Great Expectations validation, Airflow-based monitoring/SLAs, quarantine/staging patterns, and robust external data ingestion with schema versioning and backfills (reported 50% fewer anomalies and ~40% fewer failures).”

PythonSQLPandasNumPyApache SparkPySpark+97
View profile
TJ

Travis Johnson

Screened

Senior Full-Stack Software Engineer specializing in FinTech payments and fraud systems

Plano, TX8y exp
AffirmTexas Tech University

“Backend/data engineer with production experience building credit/fraud enrichment services and checkout pipelines on AWS (EKS + Lambda) using FastAPI, Kafka, Postgres, and Redis, with a strong focus on reliability patterns (timeouts/retries/circuit breakers) and observability. Has also built AWS Glue/PySpark ETL into S3/Redshift with schema evolution and data quality controls, and modernized legacy credit decisioning into Java/Node microservices with parallel-run parity validation and feature-flag rollouts.”

JavaJavaScriptTypeScriptPythonSQLAWS+98
View profile
CM

Chaitanya Mahajan

Screened

Intern/Junior Software Engineer specializing in ML, networking telemetry, and full-stack web apps

Louisville, CO1y exp
CableLabsUniversity of Colorado Boulder

“Backend-focused engineer with hands-on experience modernizing a legacy SNMP/PNM data collection system at CableLabs into a cloud-accessible Kubernetes pipeline, feeding Prometheus-formatted metrics into VictoriaMetrics and visualizing real-time network health in Grafana for 100+ modems. Also built a FastAPI + Supabase appointment booking portal for a clinic with encryption and phone-number-based auth, and has frontend experience debugging S3-based HEIF image rendering issues.”

PythonCC++JavaHTMLCSS+103
View profile
SK

Sai Kiran

Mid-level Full-Stack Python Engineer specializing in cloud-native payments and data pipelines

New York, NY6y exp
StripeUniversity of Central Missouri
PythonJavaTypeScriptJavaScriptC++SQL+153
View profile
CA

Cyrus Azima

Executive Technology & Product Leader specializing in cloud platforms, security, and global engineering scale

Omaha, NE16y exp
Spark MembershipUCLA
Data warehousingAnalyticsDevOpsRisk managementSystem designPerformance optimization+87
View profile
1...192021...118

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersAI EngineersEngineeringAI & Machine LearningData & AnalyticsEducation

Need someone specific?

AI Search