Reval Logo
Home Browse Talent Skilled in Apache Spark

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache SparkPythonDockerSQLAWSCI/CD
JJ

Jay Joshi

Senior Full-Stack AI/ML Engineer specializing in personalization, NLP, and GenAI platforms

Remote15y exp
DisneyRutgers University–Newark
A/B TestingAgileAmazon S3AngularApache HiveApache Kafka+242
View profile
VP

Vani Pulluri

Mid-level Data Engineer specializing in cloud data platforms and FinTech analytics

Des Moines, IA5y exp
Principal Financial GroupUniversity of Cincinnati
PythonPySparkSQLPandasSciPyStatistical Analysis+88
View profile
IK

Islombek Kobiljonov

Senior Data Engineer specializing in Azure, Databricks, and BI/ETL platforms

Orlando, FL9y exp
EY
AgileApache KafkaApache SparkAWS GlueAzure Blob StorageAzure Data Factory+161
View profile
GD

Gourav Deshmukh

Senior Data Engineer specializing in cloud data platforms and real-time streaming pipelines

Rosemont, IL11y exp
Wintrust
API DevelopmentApache AirflowApache HadoopApache HiveApache KafkaApache Spark+132
View profile
TS

Timur Stadnichenko

Senior Data Engineer specializing in multi-cloud data platforms and real-time analytics

Sunny Isles Beach, FL10y exp
Capgemini
Azure FunctionsAWSAmazon S3AWS GlueAmazon RedshiftAWS Lambda+84
View profile
AM

Ayesha Mazzy

Senior Data Scientist specializing in healthcare analytics and scalable ML pipelines

Philadelphia, PA11y exp
CoverMyMeds
AgileApache HadoopApache KafkaApache SparkAWSAWS Glue+96
View profile
RO

Rafael Ortega

Screened ReferencesStrong rec.

Senior Full-Stack & AI Engineer specializing in LLM integrations and cloud-native systems

Remote (Texas)8y exp
BlocUnitedNew Mexico Tech

“Backend/data engineer with hands-on production experience building FastAPI Python APIs and AWS-native platforms (Lambda/API Gateway, SQS, ECS Fargate) with Terraform + GitHub Actions CI/CD and strong reliability practices (JWT/RBAC, retries/timeouts, structured errors/logging). Also built AWS Glue ETL pipelines (S3/RDS to curated S3/Athena) with schema evolution and data quality controls, modernized legacy processing via parallel-run validation and phased cutovers, and has demonstrated SQL tuning impact (seconds to <200ms) plus incident ownership for batch pipeline SLAs.”

AgileAngularAuthenticationAWSAWS LambdaAzure Data Factory+240
View profile
MS

Mounika S

Senior Machine Learning Engineer specializing in MLOps and Generative AI

St. Louis, Missouri7y exp
Emerson
A/B TestingAmazon RedshiftAmazon S3Anomaly DetectionApache AirflowApache Hadoop+158
View profile
PK

Praneeth kumar Rangineni

Senior Data Engineer specializing in multi-cloud data platforms and generative AI

Weston, FL5y exp
UKGUniversity of Alabama at Birmingham
PythonSQLScalaJavaPySparkApache Spark+113
View profile
TL

Tan Le

Senior Software Engineer specializing in ML/AI and scalable data platforms

San Jose, CA11y exp
LabelboxNational University of Singapore
RubyPythonJavaREST APIsGraphQLVue.js+56
View profile
YR

Yaswanth Reddy Seelam

Mid-level AI/ML Developer specializing in FinTech fraud detection and GenAI assistants

MO, USA4y exp
Edward JonesUniversity of Central Missouri
A/B TestingAnomaly DetectionApache HadoopApache SparkAWSCI/CD+70
View profile
RR

Rishika Reddy

Mid-level Data Scientist specializing in financial ML, NLP, and MLOps

San Diego, CA5y exp
Morgan StanleySan Diego State University
A/B TestingAgileAmazon S3Anomaly DetectionApache AirflowApache Kafka+135
View profile
JF

Joel Franklin Stalin Vijayakumar

Mid-level AI/ML Software Engineer specializing in Generative AI and NLP

Remote5y exp
EmerjenceBoston University
Generative AIDeep LearningMachine LearningComputer VisionArtificial IntelligenceData Analysis+103
View profile
NG

Naga Gayatri Bandaru

Screened ReferencesModerate rec.

Mid-level AI/ML Engineer specializing in MLOps and production ML systems

Cleveland, Ohio3y exp
Cleveland ClinicSan José State University

“Backend/ML engineer who has shipped high-scale real-time systems across e-commerce and healthcare: built a PharmEasy real-time recommendation engine for ~2M monthly users (cut feature latency 5 min→30 sec; +15% cross-sell) and architected a HIPAA-compliant multimodal clinical diagnostic workflow (DICOM+EHR) with XAI, MLOps (MLflow/Airflow/K8s), and drift/monitoring guardrails supporting 10k+ daily predictions.”

PythonSQLPySparkJavaRScala+157
View profile
HW

Hsi-Chun Wang

Screened

Mid-level Data Scientist specializing in LLM development and scalable ML pipelines

Remote4y exp
GearFactory.aiUniversity of Maryland, College Park

“Built and deployed production LLM pipelines for evidence-based scoring in two domains: biomedical literature mining (scoring ~2700 drug compounds vs gene targets/mechanisms) and long-horizon news analytics (35 years of Chinese articles). Emphasizes reliability at scale (retries/checkpointing/validation), rigorous empirical model benchmarking (GPT-4o/mini/5), and translating results into stakeholder-friendly visual narratives.”

A/B TestingAWSAWS IAMAWS LambdaClassificationClustering+80
View profile
SP

Soham Patel

Screened

Mid-level Machine Learning Engineer specializing in healthcare NLP and MLOps

Piscataway, NJ3y exp
Syneos HealthRutgers University - New Brunswick

“ML/AI practitioner in healthcare (Syneos Health) who has deployed production clinical NLP and risk models. Built a BERT-based physician-note information extraction system on Docker + AWS SageMaker (reported ~42% retrieval improvement) and automated retraining/deployment with Airflow and drift detection, while partnering closely with clinicians to drive adoption (reported ~18% readmission reduction).”

PythonRSQLJavaScriptJavaBash+118
View profile
AI

Anirudh Indurthi

Screened

Mid-level Full-Stack Java Engineer specializing in cloud-native microservices

NC, USA6y exp
Bank of AmericaUniversity of Central Missouri

“Software engineer with strong full-stack and platform experience (TypeScript/React/Node.js) who has built real-time analytics dashboards and microservices using RabbitMQ. Demonstrates production-minded decision-making under launch pressure (manual fallback for payment-impacting third-party API issues) and has delivered internal DevOps tooling that automates compliance checks via GitHub/Jira integrations.”

JavaPythonJavaScriptTypeScriptC++C#+122
View profile
ST

Srinivas Tenneti

Screened

Mid-level AI/ML Engineer specializing in GenAI and predictive modeling

Fullerton, California5y exp
UnitedHealth GroupGeorge Washington University

“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”

A/B TestingAmazon ECSApache SparkAWSAWS GlueBigQuery+110
View profile
VK

Vamsi Koppala

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

Barrington, IL4y exp
ComericaTexas Tech University

“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”

AgileApache SparkAzure Blob StorageBashBERTBitbucket+178
View profile
SA

Sathwik Alavala

Screened

Mid-level Data Scientist specializing in AI/ML, MLOps, and LLM-powered analytics

Charlotte, NC6y exp
Bank of AmericaCampbellsville University

“Built and deployed a production LLM-powered document Q&A system enabling natural-language querying of large PDFs, focusing on retrieval quality (overlapped chunking) and low-latency performance (optimized embeddings + vector search). Experienced with scaling ML/LLM workflows using async/batch processing, caching, cloud storage, and orchestration via Apache Airflow with robust testing, monitoring, and failure handling.”

A/B TestingAnomaly DetectionAPI DevelopmentAWSAzure Machine LearningChromaDB+94
View profile
1...545556...119

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersAI EngineersEngineeringAI & Machine LearningData & AnalyticsEducation

Need someone specific?

AI Search