Reval Logo
Home Browse Talent Skilled in PySpark

Vetted PySpark Professionals

Pre-screened and vetted.

PySparkPythonDockerSQLCI/CDAWS
NA

Navyasri Arekatla

Mid-level AI/ML Engineer specializing in GenAI agents and production ML systems

Dallas, TX5y exp
PerplexityUniversity of North Texas
PythonJavaCC++MATLABBash+159
View profile
HS

Hemant Sathish

Mid-level Machine Learning Engineer specializing in GenAI, forecasting, and MLOps

Pittsburgh, PA3y exp
CalixCarnegie Mellon University
AgileApache HadoopApache KafkaAWSBigQueryC+76
View profile
DG

Devdatt Golwala

Mid-level Data Scientist/ML Engineer specializing in LLMs, NLP, and recommender systems

New York, NY3y exp
AdobeColumbia University
A/B TestingAlgorithmsAWSBashChromaDBC+81
View profile
AB

Amulya Baddam

Mid-level AI/ML Engineer specializing in LLMs and MLOps

5y exp
GoogleUniversity of North Texas
PythonSQLBashShell ScriptingJavaScriptTypeScript+102
View profile
RG

Ramya Gurrala

Mid-level Machine Learning Engineer specializing in fraud detection and recommendations

Bay Area, CA6y exp
StripeBinghamton University
A/B TestingAgileAmazon RedshiftAmazon SageMakerAmazon S3Anomaly Detection+179
View profile
SE

Samuel Evans

Staff AI & Data Engineer specializing in LLM systems and real-time data platforms

Salt Lake City, UT10y exp
Jump AILouisiana Tech University
A/B TestingAmazon ECSAmazon EMRAmazon KinesisAmazon RedshiftAmazon S3+177
View profile
IJ

Ikenna Joe-Nweke

Junior Data Scientist & Data Engineer specializing in ML and scalable data pipelines

2y exp
MicrosoftUSC
PythonSQLRJavaScriptMachine LearningEmbeddings+62
View profile
AS

Ananta Singh

Senior Software Engineer specializing in cloud, data platforms, and LLM/RAG applications

Fremont, CA7y exp
Volvo GroupSan José State University
AgileApache AirflowApache KafkaAWSAzure DevOpsAzure Functions+99
View profile
AB

Abhinav Bachu

Mid-level AI/ML Engineer specializing in cloud MLOps and GenAI for fraud detection

New York, NY4y exp
StripeNJIT
PythonNumPyPandasScikit-learnTensorFlowPyTorch+124
View profile
PT

Pavanika Thotakura

Screened

Senior Data Engineer specializing in cloud big data pipelines and real-time streaming

Seattle, WA6y exp
AmazonUniversity of North Texas

“Amazon data engineer who built a real-time fraud detection pipeline for AWS Lambda, tackling multi-region telemetry quality issues and scaling stream processing for billions of daily requests. Strong in production-grade data/ML workflows on AWS (EMR, Glue, Kinesis, SageMaker) with hands-on entity resolution and anomaly detection.”

PythonSQLPySparkScalaJavaBash+139
View profile
SF

Sara Fang

Screened

Mid-level Software Engineer specializing in cloud data platforms and distributed systems

Remote6y exp
Terra Byte XUniversity of Delaware

“Backend/data engineer with production experience building FastAPI services with strong reliability patterns (circuit breaker, rate limiting, caching, graceful degradation) and JWT/OAuth2 auth. Has delivered AWS EKS deployments via Terraform with Secrets Manager/IRSA and HPA autoscaling, and built Glue/Spark ETL pipelines on S3 Parquet with schema-evolution and idempotent reruns; also demonstrated measurable SQL tuning impact (20–30s to <10s).”

JavaPythonScalaGoSQLJavaScript+101
View profile
DB

Damik Bermudez

Screened

Staff Software Engineer specializing in Healthcare platforms and AI data pipelines

Remote10y exp
DrwellBinghamton University

“Backend/data engineer with hands-on production AWS experience spanning serverless APIs (Chalice/Lambda/API Gateway/Cognito) and data pipelines (Glue PySpark + Step Functions). Has modernized a legacy SAS reporting system into AWS microservices and implemented schema-drift detection and incident prevention for ETL workflows, plus measurable SQL tuning wins (30 min to <10 min runtime).”

PythonJavaScriptTypeScriptC#DjangoFlask+93
View profile
SC

Shweta Chavan

Screened

Junior Computer Vision & ML Engineer specializing in autonomous perception systems

Pittsburgh, PA2y exp
Magna InternationalCarnegie Mellon University

“LLM/RAG engineer who built a production-style multi-agent orchestrator for resume-to-recommendation workflows (PDF ingestion through screening and recommendations), emphasizing prompt tuning and strict JSON output contracts. Currently building a RAG application for an NGO using Airflow (DAGs + embeddings) and tackling messy, missing/imbalanced data; has hands-on retrieval stack experience (FAISS/HNSW, bge embeddings) and uses rigorous evaluation metrics for groundedness and hallucination control.”

PythonC++OpenCVMATLABPyTorchTensorFlow+126
View profile
KV

KARTHIKBABU VADLOORI

Screened

Mid-level Full-Stack Developer specializing in Spring Boot, React, and cloud microservices

San Francisco, CA5y exp
MetaUniversity of Texas at Arlington

“Backend engineer with experience at Meta and Accenture building regulated-data systems (healthcare/financial) using Python/Flask and Postgres. Has scaled high-throughput services to millions of daily requests, delivering measurable latency wins (~40% API latency reduction; ~35% faster DB-backed endpoints), and has productionized ML inference services using Docker/Kubernetes and AWS (ECS/SageMaker).”

AgileAnsibleAWS CodePipelineAWS LambdaAzure App ServiceAzure Functions+165
View profile
AV

Asrith Velireddy

Screened

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and scalable ML systems

Harrison, NJ4y exp
AdobeNJIT

“ML/LLM engineer at Adobe who deployed a transformer-based personalization and campaign-targeting recommender system end-to-end, including PySpark/Airflow pipelines processing 12M+ events/day and containerized inference on AWS SageMaker (Docker/Kubernetes). Also has hands-on LLM workflow experience (RAG, semantic search, prompt optimization, hallucination mitigation) with a metrics-driven approach to reliability, drift monitoring, and reproducible retraining via MLflow.”

A/B TestingApache AirflowAuto ScalingAWSAWS IAMAWS Lambda+123
View profile
SM

Sagnik Mazumder

Screened

Executive ML/AI Founder specializing in agentic analytics and data infrastructure

10y exp
Photosphere LabsUniversity of Texas at Dallas

“Founder of Photosphere Labs (agentic AI for ecommerce data synthesis/analysis) who worked directly with customers to scope, build, demo, and iterate LLM-based solutions, including an AI chat product for brand owners. Previously at Block, built and explained a nuanced causal inference/propensity model tied to Square POS integrations, translating model specs and outputs into business impact for varied client contexts.”

A/B TestingAWSAWS GlueBERTData AnalysisData Pipelines+63
View profile
PV

Praveen V

Screened

Mid-Level Software Engineer specializing in Generative AI and RAG systems

Remote, USA5y exp
MetaUniversity of North Carolina at Charlotte

“Built a production RAG-based natural-language-to-SQL system at Global Atlantic to replace slow, expensive manual analytics ticket workflows, focusing heavily on retrieval quality and measurable evaluation (200-question ground-truth set; recall@5 improved 0.65→0.78 via semantic chunking). Also built a custom MCP-style agent orchestrator for a personal project (arxiv-ai) to improve flexibility and Langfuse-aligned observability, and has hands-on experience with LangGraph, CrewAI, and n8n.”

PythonJavaC#JavaScriptTypeScriptPostgreSQL+105
View profile
JL

Joseph Lee

Screened

Staff Software Engineer specializing in cloud platforms for healthcare and financial workflows

Dallas, TX10y exp
OptumUniversity of Texas at Dallas

“Backend/data engineer with Optum healthcare claims domain experience building high-reliability Python microservices (FastAPI/Kafka/Postgres) and AWS data platforms (EKS, Glue, Redshift). Demonstrated strong production ownership: fixed duplicate Kafka processing via transactional outbox/idempotency, scaled to millions of daily events, and delivered major SQL performance gains (40+ min to <5 min, ~60% CPU reduction). Seeking remote-only work; targets $130k base.”

ReactNext.jsAngularVue.jsTypeScriptJavaScript+167
View profile
GF

greg farhadian

Screened

Senior Software Engineer specializing in cloud data platforms and Java microservices

Remote4y exp
IBMUC Irvine

“Backend/data engineer with experience building Kafka-driven real-time pipelines that support ML code deployment and downstream integrations. Currently migrating high-throughput mainframe (COBOL/assembly) processing to Java, using Spark/Databricks to preserve performance and employing rigorous A/B testing across dev/pre-prod/prod with years of historical data.”

JavaSpring BootPythonPySparkJavaScriptReact+59
View profile
CK

Christopher Khan

Screened

Senior Software Engineer specializing in Python, cloud platforms, and distributed systems

Nashville, TN13y exp
i3 VerticalsUniversity of Chicago

“Backend/data engineer with production experience at Walmart and HealthSnap building Python services and data pipelines on AWS (EKS, Lambda, Glue, Airflow). Strong reliability and operations focus—implemented idempotency + circuit breakers for peak-traffic consistency issues, GitOps CI/CD, and observability. Demonstrated measurable performance wins (Postgres p95 45s to <5s, ~60% CPU reduction) and modernized SAS batch workflows to Python with parallel-run parity validation and feature-flagged rollout.”

PythonRDjangoFlaskFastAPIReact+153
View profile
CS

Chappidi Sasi

Screened

Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference

Bay Area, CA5y exp
NVIDIAWebster University

“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”

A/B TestingApache SparkAWSAWS LambdaBigQueryClaude+141
View profile
1...678...78

Related

Machine Learning EngineersData ScientistsSoftware EngineersData EngineersAI EngineersData AnalystsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search