Reval Logo
Home Browse Talent Skilled in Apache Spark

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache SparkPythonDockerSQLAWSCI/CD
FL

Felix Li

Screened

Intern Software Engineer specializing in data pipelines and full-stack web development

New York, NY1y exp
RadarUniversity of Waterloo

“Internship at Radar (geolocation infrastructure) where they owned automation of multiple geospatial data ingestion pipelines (including US/Canadian address ingestion), orchestrating Spark (Scala) jobs via Python-based Airflow and using GitOps-style CI/CD workflows.”

AWSBashCC++CypressData Pipelines+60
View profile
VP

Vasudha Prerepa

Screened

Mid-Level Java Full-Stack Developer specializing in cloud-native microservices

5y exp
BMOTexas Tech University

“QA/validation-focused engineer with experience at Meta testing an ML+LLM content classification/summarization system, including production-vs-test behavior gaps. Built automated E2E validation and drift monitoring (PSI, KL divergence, embedding cosine similarity) run daily/multiple times per day and gated via CI. Also implemented Jenkins-orchestrated Selenium/API test suites in Docker at Capgemini and partnered with a business analyst to convert business rules into automated AI-driven validation checks.”

AJAXApache KafkaApache TomcatAWSAWS CloudFormationAWS Glue+141
View profile
JA

Jeevan aher

Screened

Junior AI Engineer specializing in fraud detection, credit risk, and LLMs in FinTech

Remote, USA3y exp
JPMorgan ChaseUniversity of Illinois Urbana-Champaign

“AI engineer with production experience building a high-accuracy (98%) fraud detection system operating at real-time latency (1–2s) over millions of transactions, using a multi-model pipeline approach to meet performance constraints. Also implemented Airflow-orchestrated workflows (DAGs, retries, alerts) to replace brittle cron scripts and is currently pursuing a master’s project on real-time ASL-to-text conversion.”

PythonRSQLJavaScriptBashC+107
View profile
SR

Sandeep Reddy Karumudi

Screened

Mid-level Data & Business Analyst specializing in analytics engineering and BI

6y exp
AdobeUniversity of Wisconsin–Madison

“Data/analytics professional with experience across manufacturing and enterprise environments (Wisconsin School of Business project with CNH Industrial; roles/projects at Ascensia Technologies, S&C, and Adobe). Has hands-on work combining warranty/lifecycle tables with technician free-text notes using TF-IDF + tree models (XGBoost/Random Forest), and deep experience in entity resolution/reconciliation across mismatched financial systems using Python/SQL and fuzzy matching, with production-grade pipeline practices in Azure Data Factory/Databricks.”

PythonPandasNumPyscikit-learnRSQL+119
View profile
CS

Cassandra Sullivan

Screened

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA5y exp
Aurora AIUniversity of Chicago

“ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.”

A/B TestingAutomationClassificationDashboardingData CleaningData Visualization+109
View profile
KS

Kunal Singh Pundir

Screened

Mid-level Full-Stack Developer specializing in cloud microservices and GenAI systems

USA, USA5y exp
UberNortheastern University

“Built and owned an end-to-end AI-driven decisioning platform at Uber, combining LLM orchestration with typed tool contracts and a Snowflake-based RAG pipeline to make decisions fully auditable. Delivered large-scale measurable impact (120k requests/day, 18k cases auto-resolved/month) while improving ops SLA from 3 days to 6 hours and cutting incident response time nearly in half. Previously led a high-risk strangler-fig modernization of a legacy insurance platform across 120+ microsites at Accenture, coordinating across multiple squads with feature-flagged parallel cutovers.”

C#Java.NETFlaskSpring BootNode.js+140
View profile
LC

Lavanya Chilakalapudi

Screened

Mid-level Full-Stack Developer specializing in cloud-native web apps and APIs

Tampa, FL5y exp
DatabricksUniversity of South Florida

“Backend engineer with experience building microservice-based systems that integrate LLM workflows (code review suggestions, documentation generation, test scaffolding) using REST APIs, Celery/Redis, and OpenTelemetry for observability. Demonstrates hands-on database and performance optimization in PostgreSQL/SQLAlchemy (bulk inserts, lock mitigation, cursor-based pagination) plus multi-tenant data isolation via tenant-aware models, middleware scoping, and schema/row-level strategies.”

AjaxAnsibleApache AirflowApache KafkaApache SparkAPI Gateway+164
View profile
RV

Ramyasri Veerapaneni

Screened

Mid-Level Full-Stack Developer specializing in FinTech

Remote, USA4y exp
IntuitMississippi State University

“Backend-heavy full-stack engineer with experience at Intuit (TurboTax Live) and Paytm payments, building and scaling Java/Spring Boot microservices for high-traffic transaction systems. Has hands-on wins improving peak-load performance using Redis/disk caching and Kafka event-driven patterns, plus React/Redux work for web app integration and strong monitoring practices with ELK.”

Apache KafkaApache SparkAPI DesignAWSCC#+83
View profile
SV

Skanda Vyas Srinivasan

Screened

Intern Software Engineer specializing in full-stack, ML, and optimization

New York, NY0y exp
GeminiUniversity of Wisconsin–Madison

“Built a production-style PyTorch LSTM system that generates structured piano compositions from 1200+ MIDI files, then significantly improved long-range musical coherence by implementing Bahdanau attention based on research literature. Also has internship experience using Docker Compose for containerized backend workloads and has independently used Ray to scale ML experiments across multiple GPUs, including dealing with GPU scheduling/memory oversubscription issues.”

AlgorithmsAngularBashCC#C+++104
View profile
SB

Shriya Bannikop

Screened

Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems

Seattle, WA5y exp
Amazon Web ServicesKLE Technological University

“Full-stack engineer who built and owned an AI-assisted job-matching dashboard in Next.js App Router/TypeScript, keeping LLM logic server-side and improving performance via deduplication, caching/revalidation, and streaming (35% fewer duplicate LLM calls; 40% faster first render). Also has strong data/backend chops: designed Postgres models and optimized queries at million-record scale (1.8s to 120ms) and built durable AWS multi-region telemetry workflows with idempotency, retries, and monitoring.”

AgileAmazon CloudWatchAmazon DynamoDBAmazon EC2Amazon ECSAmazon EKS+170
View profile
VU

Vidhi Upadhyay

Screened

Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems

Remote8y exp
Saayam for AllCarnegie Mellon University

“Independently built a production-grade, containerized enterprise agentic AI platform (stateful orchestration + RAG) focused on real-world reliability—guardrails, citation-based outputs, reranking, query rewriting, and evaluation harnesses to reduce hallucinations. Hands-on with OpenAI SDK, CrewAI, and LangGraph, and has delivered AI solutions for non-technical NGO stakeholders via demos and practical POCs.”

PythonC++SQLMySQL.NETGenerative AI+150
View profile
NM

Nicholas Moore

Screened

Senior Full-Stack Engineer specializing in scalable cloud-native systems

Lehi, Utah13y exp
KomBeaMidwestern State University

“Backend/data engineer with production experience building high-concurrency customer engagement platforms at KomBea on AWS (EKS + Lambda) using FastAPI/Django, PostgreSQL, Redis, and strong observability. Has modernized legacy batch systems into modular Python services with parallel-run parity validation and phased rollouts, and has delivered resilient AWS Glue ETL pipelines with schema evolution and data quality controls.”

PythonDjangoFastAPIFlaskGoNode.js+138
View profile
NZ

Nate Zaidi

Screened

Senior Full-Stack Python Engineer specializing in AI/ML and cloud-native systems

Dumfries, Virginia10y exp
CodingQnaVirginia Commonwealth University

“Backend/data engineer with hands-on production experience across FastAPI/PostgreSQL APIs and AWS (Lambda, ECS) delivered via Terraform + GitHub Actions. Built Glue-based ETL pipelines into Redshift with schema evolution and data quality checks, modernized legacy reporting into Python microservices, and has demonstrated measurable SQL performance wins (multi-second query reduced to sub-300ms).”

PythonDjangoFastAPIFlaskJavaScriptReact+94
View profile
MI

Moses Immanuel

Screened

Mid-level Data Scientist specializing in machine learning and big data analytics

Bentonville, AR6y exp
WalmartUniversity of North Texas

“Walmart engineer who built and shipped a production LLM+RAG system to automate triage and analysis of computer support chats/tickets, producing grounded, schema-constrained JSON outputs for summaries, urgency, and routing recommendations. Emphasizes reliability (hallucination control, confidence thresholds, human-in-the-loop) and runs end-to-end pipelines with Airflow and AWS-native orchestration, plus rigorous evaluation and monitoring tied to business KPIs.”

AgileAmazon EC2Amazon EMRAmazon RedshiftAmazon S3Apache Hadoop+172
View profile
ZJ

ZHIYONG JIANG

Screened

Senior AI & Machine Learning Engineer specializing in GenAI, Agentic AI, and RAG

19y exp
DisneyUniversity of Utah

“Built a production agentic AI system to automate data science work using a layered architecture (executive-summary handling, tool-based execution, and on-the-fly code generation). Demonstrates strong end-to-end agent development practices including RAG with vector databases, prompt engineering, and multi-method evaluation (LLM-as-judge/human/code-based), plus Airflow-based orchestration for ML data pipelines and close collaboration with business end users.”

PythonCSQLMATLABJavaMachine Learning+110
View profile
SM

Srushti Manjunath

Screened

Mid-level Data Scientist specializing in NLP, LLMs, and cloud ML platforms

Remote, USA5y exp
Wells FargoUniversity of Illinois Urbana-Champaign

“LLM/MLOps engineer who has shipped production systems for complaint intelligence and contact-center NLU, including LoRA/RLHF-tuned LLaMA models deployed on GKE with vLLM and Vertex AI batch pipelines to BigQuery. Demonstrates strong practical focus on hallucination control, data imbalance mitigation, and production monitoring (Langfuse) with regression testing and canary rollouts, plus experience orchestrating complex workflows with AWS Step Functions.”

PythonRSQLMATLABC++Scala+169
View profile
SD

Sakshi Dinesh Deore

Screened

Mid-level Software Engineer specializing in AWS, DevOps automation, and data platforms

Bellevue, USA3y exp
AmazonUC San Diego

“Engineer with Securonix experience deploying and operating production microservices and real-time data-processing systems at high throughput. Led AWS infrastructure, CI/CD, monitoring, and customer-driven customization for a threat-report classification solution, including rule adjustments and model retraining based on live client feedback.”

AgileAmazon API GatewayAmazon DynamoDBAmazon EKSAmazon EMRAmazon S3+105
View profile
VR

Vivek Reddy

Screened

Mid-level Data Scientist/Data Engineer specializing in ML pipelines, insurance and healthcare analytics

Los Angeles, CA7y exp
Venture ConnectUC Berkeley

“Built a production assistive-vision iPhone app to help visually impaired users find grocery items, training a custom YOLO detector on 2,000+ self-collected/annotated images and deploying via CoreML with a cloud multimodal LLM for navigation instructions. Brings hands-on AWS serverless + ECS container deployment (CDK/GitHub Actions) and a disciplined approach to AI workflow reliability (state-machine design, offline evals, stress tests, logging/metrics), plus experience communicating model insights to non-technical stakeholders (MOTER Technologies).”

A/B TestingAmazon BedrockAmazon ECSAmazon RDSAWS LambdaCI/CD+109
View profile
VV

Vishnu Varma

Screened

Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps

Milpitas, California8y exp
DatabricksCampbellsville University

“AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.”

PythonSQLPySparkBashTensorFlowPyTorch+106
View profile
KT

Keerthana Tammina

Screened

Mid-level Data Scientist specializing in machine learning and generative AI

Saint Louis, MO5y exp
DoorDashSaint Louis University

“ML/LLM engineer who has shipped a production transformer-based document understanding system on AWS, owning the full pipeline from domain fine-tuning to Dockerized CI/CD deployment. Demonstrates strong production rigor—latency optimization (distillation/quantization, async batching, autoscaling), orchestration with Airflow/Step Functions/Azure Data Factory, and monitoring/drift detection—plus experience translating ops stakeholder needs into adopted AI automation via dashboards.”

AgileAmazon RedshiftAmazon S3Amazon SageMakerAnomaly DetectionApache Hadoop+157
View profile
RR

Rishitha Reddy K

Screened

Mid-level Data Scientist specializing in risk, forecasting, and segmentation across finance and healthcare

McLean, Virginia5y exp
Capital OneUniversity of Cincinnati

“Data/ML engineer with experience across pharma (Dr. Reddy Laboratories) and financial services (Cincinnati Financial, Capital One), building production NLP and entity-resolution systems that connect messy unstructured text with enterprise SQL data. Delivered semantic search with BERT + vector DB and domain fine-tuning (reported ~35% relevance lift), and builds robust pipelines using Airflow/dbt/Spark with strong validation, monitoring, and stakeholder-aligned rollout practices.”

PythonRSQLScalaJavaScikit-learn+139
View profile
1...242526...119

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersAI EngineersEngineeringAI & Machine LearningData & AnalyticsEducation

Need someone specific?

AI Search