Reval Logo
Home Browse Talent Skilled in Apache Spark

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache SparkPythonDockerSQLAWSCI/CD
MN

Meghana Nandivada

Screened

Junior Machine Learning Engineer specializing in production ML systems and MLOps

2y exp
TCSStevens Institute of Technology

“ML/AI engineer (TCS) who built and productionized a customer segmentation and personalized-offer recommendation pipeline end-to-end (data cleaning/feature engineering/clustering through Flask API deployment in Docker with monitoring). Emphasizes reliability and operational rigor via validation checks, periodic retraining, model/API versioning, and latency optimization, and has experience translating marketing KPIs into usable dashboards for non-technical teams.”

PythonSQLJavaScalaMachine LearningMLOps+99
View profile
SS

Sumit Sahu

Screened

Mid-level Machine Learning Engineer specializing in computer vision and MLOps on GCP

Atlanta, GA4y exp
NCR VoyixUniversity of Georgia

“ML/AI engineer who deployed a real-time, edge-based computer-vision pipeline for produce recognition in retail self-checkout to reduce shrink. Demonstrates strong end-to-end production chops: multi-camera data calibration/sync, ranking-based modeling for fine-grained classes, latency-focused optimization, and continuous A/B testing/monitoring with guardrails. Experienced with ML orchestration (Kubeflow Pipelines, Airflow) and CI/CD via GitHub Actions, and collaborates closely with store operations to make interventions usable in the checkout flow.”

PythonC++SQLJavaPyTorchTensorFlow+100
View profile
AC

Andrew Clayman

Screened

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote8y exp
AppstemUniversity of Southampton

“Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.”

PythonC++SQLDockerFlaskCI/CD+133
View profile
VK

Varun Kothapalli

Screened

Mid-level AI/Machine Learning Engineer specializing in Generative AI, NLP, and MLOps

Saint Louis, MO6y exp
EquifaxWebster University

“Built a production LLM/RAG document analysis system for large financial documents (credit reports/PDFs) to help business analysts extract insights faster. Implemented end-to-end pipeline orchestration with LangChain, vector search (e.g., FAISS), and hallucination controls (context grounding, similarity thresholds, and no-answer fallback), delivered as a Dockerized Python API.”

Artificial IntelligenceMachine LearningDeep LearningSupervised LearningUnsupervised LearningFeature Engineering+89
View profile
DB

Dinesh Battula

Screened

Mid-level Full-Stack Java Developer specializing in microservices and cloud-native systems

Kansas, null5y exp
Cardinal HealthUniversity of Central Missouri

“Senior full-stack engineer with strong healthcare domain experience who has shipped an Azure OpenAI RAG-based patient medication support chatbot to production, driving ~10K queries/month and a reported 38% reduction in call center volume. Also builds polished real-time React/TypeScript pharmacy tooling and operates large-scale Python/Spark ETL pipelines (~12M records/day) with strong API design, observability, and cloud deployment experience across Azure/Kubernetes and AWS.”

SDLCAgileScrumKanbanMicroservices ArchitectureJava+136
View profile
AD

Anay Dongre

Screened

Junior Machine Learning Engineer specializing in GenAI and LLM fine-tuning

Pomona, California1y exp
Aerolift.AICal Poly Pomona

“Robotics software engineer focused on hard real-time autonomy for legged robots, building a quadruped navigation stack that combines vision SLAM with MPC and maintains a deterministic 500Hz control loop. Deep performance optimization experience across CUDA (sub-2ms perception latency), ROS 2/DDS real-time tuning, and motion planning (cut 500ms spikes to sub-5ms). Also designed distributed ROS 2 + Zenoh communications between quadrupeds and aerial drones and validated robustness under lossy wireless conditions.”

AWSApache SparkC++CI/CDCUDAChromaDB+118
View profile
SD

Sachin Dulla

Screened

Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps

Kentwood, MI3y exp
Fifth Third BankCalifornia State University, San Bernardino

“Built and deployed a domain-specific LLM chatbot for research/support, cutting manual effort by ~50%. Demonstrates strong applied LLM engineering: RAG, prompt grounding with citations and fallbacks, embedding/top-k tuning, and production monitoring (confidence, latency, feedback loops). Experienced orchestrating agent workflows with LangChain-style pipelines and continuous evaluation to maintain reliability.”

Amazon EC2Amazon EKSAWSAWS LambdaAzure Machine LearningAzure Monitor+93
View profile
SC

Sahil Chaubal

Screened

Senior AI/ML Engineer specializing in financial risk, fraud detection, and GenAI analytics

USA7y exp
Northern TrustSyracuse University

“AI/ML engineer with experience at Northern Trust and Persistent Systems building production LLM + RAG systems for regulated financial use cases, including liquidity forecasting, anomaly detection, and credit scoring. Emphasizes compliance-first design with explainability (SHAP), traceability (MLflow), and hallucination controls (FAISS + citation-grounded prompting), and has delivered drift-triggered retraining pipelines using Airflow and Kubernetes while translating model outputs into business-ready marketing segments.”

PythonRSQLPostgreSQLMySQLMicrosoft SQL Server+114
View profile
TK

Tadigotla Kumar Reddy

Screened

Mid-level AI/ML Engineer specializing in healthcare imaging and GenAI/LLM systems

New York, USA6y exp
UnitedHealthcareAuburn University at Montgomery

“Built and deployed a production LLM/RAG clinical document understanding and summarization system for healthcare, focused on reducing manual review time while meeting strict accuracy, latency, and compliance needs. Demonstrates strong MLOps/orchestration depth (Airflow, Kubernetes, Azure ML Pipelines) and a rigorous approach to hallucination mitigation through layered, source-grounded safeguards and stakeholder-driven requirements with physicians/compliance teams.”

PythonSQLRJavaJavaScriptBash+157
View profile
JD

Jimmy Dani

Screened

Mid-level AI Researcher specializing in privacy-preserving ML and applied cryptography

College Station, TX6y exp
Texas A&M UniversityTexas A&M University

“Graduate researcher who builds production-grade AI systems spanning LLM security evaluation and on-device RAG. Created HoneyLearner, a self-learning attack framework using GPT-4-class models as structured black-box attackers against honeywords defenses, with rigorous metrics and reproducible orchestration (Airflow/Spark/Kafka/Docker). Also partnered with agriculture scientists at Texas A&M–Corpus Christi to deliver UAV + 3D point-cloud crop-stress maps that cut time-to-insight ~40% and enabled ~30% earlier interventions.”

PythonCC++JavaSQLBash+74
View profile
JC

Jen-Ting Chang

Screened

Mid-Level Backend Software Engineer specializing in FinTech and distributed systems

Taipei, Taiwan5y exp
Crypto-ArsenalUSC

“Backend engineer who built an AI RAG quoting system for the fastener industry, reducing quote turnaround from weeks to ~30 minutes and raising retrieval accuracy to ~90% by solving a semantic-collision issue with a parent-document retrieval design. Strong in production AWS integrations (Cognito auth, S3 pre-signed uploads), performance optimization (multithreading/out-of-core), and real-time streaming (Kafka/Spark Kappa architecture achieving sub-second latency), plus Kubernetes logging and GitHub Actions CI/CD to ECR.”

API GatewayAWSAWS LambdaAlgorithmsCI/CDC+++80
View profile
SK

SUJAY Kanakamedala

Screened

Mid-level AI Developer & Machine Learning Engineer specializing in LLM and MLOps systems

Champaign, IL5y exp
CenteneEastern Illinois University

“Built and deployed an enterprise RAG application at Centene to help clinical teams retrieve insights from large internal policy document sets, cutting manual research by 30–40%. Implemented custom domain-adapted embeddings (SageMaker + BERT transfer learning) and hybrid retrieval (BM25 + Pinecone) to drive a 22% relevance lift, and ran the system in production on AWS EKS with CI/CD, MLflow, and Prometheus monitoring (99% uptime, ~40% latency reduction).”

A/B TestingAgileApache KafkaApache SparkAWSAWS Lambda+145
View profile
LG

Likhitha Gandi

Screened

Junior Business Analytics & SAP BASIS professional specializing in AI and predictive modeling

Denton, TX3y exp
University of North TexasUniversity of North Texas

“Built and deployed a production LLM-powered email assistant (“wood flow”) for a local pet resort to automate after-hours inbound email handling, including email categorization and context-aware auto-responses. Uses n8n for orchestration and applies CRISP-DM, load/edge-case testing, and RAG-based context retrieval, and has experience presenting AI solutions with budgeting and ROI to a non-technical founder.”

PythonPandasNumPyScikit-LearnSQLR+77
View profile
TS

Tanmay Sharma

Screened

Mid-level Backend Software Engineer specializing in microservices and AI/ML

Chandigarh, India3y exp
Excellence EducationUniversity at Buffalo

“JavaScript engineer with open-source experience on a database visualization library, focused on real-time rendering performance for large datasets (virtualized DOM rendering, requestAnimationFrame/debouncing, memoization) and on raising project quality via tests and CI performance benchmarks. Also built Kafka-based messaging documentation and sample producer/consumer apps to speed onboarding, and has experience diagnosing production issues including concurrency-related duplicate data problems.”

AgileAmazon S3Apache KafkaAPI DevelopmentAWSAWS Lambda+99
View profile
PG

Pandraju Gamanapriya

Screened

Mid-level Data Scientist specializing in healthcare ML and GenAI

San Marcos, TX4y exp
UnitedHealth GroupTexas State University

“Healthcare data/NLP practitioner with experience at UnitedHealthcare building production ML systems that connect unstructured call center transcripts and medical notes to structured claims data. Has delivered measurable impact (25% classification accuracy lift; ~30% relevance improvement) using classical NLP, embeddings (Sentence-BERT + FAISS), and AWS SageMaker deployments with robust validation and drift monitoring.”

AgileAnomaly DetectionAPI IntegrationAWSAWS GlueBash+106
View profile
SB

Shrinivas Bhusannavar

Screened

Mid-level AI Engineer specializing in agentic LLM systems and RAG platforms

San Jose, CA5y exp
SquareShiftSan José State University

“Built and shipped Serrano AI, a multi-tenant SaaS conversational AI platform that automates Odoo ERP workflows and lets ops/finance/supply-chain teams query ERP data in natural language. Implemented a multi-agent architecture (LangChain/LangGraph/CrewAI) with hybrid RAG over ERP schemas, deployed on Heroku/Vercel with production observability, cutting reporting time by ~80% while addressing hallucinations, latency, and schema complexity.”

Apache HadoopApache KafkaApache SparkAWSAWS LambdaAzure Data Factory+154
View profile
AM

Arunima Mishra

Screened

Senior Technical Product Lead specializing in Data Governance and MDM SaaS platforms

Bengaluru, India7y exp
InforManipal University Jaipur

“Technical/product lead at Albanero (acquired by Infor in 2024; now at Infor) who built a Data Mesh-focused “Governance as a Product” module from early persona-based policies through a highly configurable multi-ERP governance platform (MDM, multi-source mastering, match/merge, automated review workflows). Also troubleshoots agentic/LLM workflows in production using auditability, guardrails, monitoring, and real-time validation—fixing a P0 false-positive security flagging issue and contributing to significant deal/adoption growth (~50%) after V2 launch.”

Risk managementData governanceCross-functional leadershipJavaSpring BootMicroservices+92
View profile
LD

Leelakarthik Devisetty

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps

Atlanta, GA3y exp
AIGKennesaw State University

“Data professional with ~4 years of experience, most recently at AIG (insurance), building ML/NLP systems for fraud detection and policy automation using transformers, CNNs, and clustering/anomaly detection. Also developed a RAG-based knowledge retrieval system, iterating across embedding models and moving to production based on precision and latency SLAs, then containerizing and deploying with SageMaker and CI/CD.”

AWSAWS LambdaBERTBigQueryCI/CDClaude+143
View profile
BC

Bhavishyasai Chigurupati

Screened

Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms

Overland Park, KS5y exp
CignaUniversity of Central Missouri

“Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.”

SDLCAgileWaterfallPythonSQLR+179
View profile
RT

Rakesh Thota

Screened

Mid-level Data Engineer specializing in multi-cloud real-time data pipelines

California, USA5y exp
Molina HealthcareUniversity at Buffalo

“Data engineer with healthcare/clinical trial domain experience who owned a 100TB+/month AWS pipeline end-to-end (Glue/S3/Redshift/Airflow) and drove measurable outcomes (20% lower latency, 99.9% reliability, 40% less manual reporting). Also built production data services and API-based ingestion on GCP (Cloud Run/Functions/BigQuery) with strong validation, versioning, and safe migration practices, and launched an early-stage RAG solution (LangChain + GPT-4) for researchers.”

PythonSQLJavaPySparkApache SparkApache Kafka+136
View profile
GM

Gopichand Muppaneni

Screened

Mid-level Data Engineer specializing in Azure, Spark, and scalable ETL/ELT pipelines

Charleston, IL4y exp
Eastern Illinois UniversityEastern Illinois University

“Data engineer with banking FP&A experience who led an end-to-end migration of 10+ TB from Teradata to Azure (ADF + Data Lake + Databricks/PySpark + Synapse). Emphasizes reliability (multi-stage validation, monitoring/alerts) and performance (Spark tuning, incremental loads, autoscaling), reporting ~99.5% pipeline reliability while supporting downstream consumers with stable schemas and clear change management.”

PythonSQLPySparkETLData PipelinesData Modeling+47
View profile
LP

Lerone Pieters

Screened

Senior Data Engineer specializing in cloud data platforms and real-time analytics

Remote, USA10y exp
Scale MediaNew York City College of Technology (CUNY)

“Data/analytics engineer focused on finance and e-commerce integrations, building end-to-end pipelines and services across Odoo, QuickBooks, Snowflake, and Tableau. Replaced a costly third-party Walmart connector with a serverless AWS Lambda pipeline deployed via Terraform/GitHub and monitored with CloudWatch/Datadog, and shipped a bi-directional Odoo↔QuickBooks invoice sync with distributed locking plus Slack-based finance approvals.”

PythonSQLJavaScalaGoJavaScript+110
View profile
SK

Saketh Kota

Screened

Mid-level Data Scientist / ML Engineer specializing in Generative AI, RAG, and MLOps

Irving, TX4y exp
U.S. Bank

“Built and productionized a RAG-based LLM research assistant for biomedical and regulatory document search using Mixtral 7B on SageMaker, LangChain, and Milvus, cutting research time by ~40%. Has hands-on multi-cloud MLOps experience across AWS/Azure/GCP with Kubeflow/Airflow/Composer plus Terraform + ArgoCD, and applies rigorous evaluation/monitoring (latency, accuracy, hallucinations). Also partnered with a non-technical PM to deliver an insurance policy Q&A chatbot that reduced customer response time by 30%+.”

AgileA/B TestingAmazon SageMakerAPI DevelopmentArgo CDAWS+185
View profile
PS

Parv Shah

Intern Data Engineer specializing in web-scale ingestion and cloud data pipelines

3y exp
XtriumArizona State University
AgileApache AirflowApache KafkaApache SparkAWSBackend Development+88
View profile
1...899091...119

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersAI EngineersEngineeringAI & Machine LearningData & AnalyticsEducation

Need someone specific?

AI Search