Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

Michael Miller

Screened

Executive technology leader specializing in model risk and regulatory technology

Waco, TX19y exp

Campton CorpPortland State University

“Candidate is pursuing a CTO role and has helped multiple startups turn early technology concepts into concrete, real-world technical requirements. They cite a systems science and mathematics background, along with experience at JPMorgan Chase, and appear strongest in technical strategy, concept fleshing, and identifying strong people to help teams succeed.”

Data Pipelines Statistical Analysis Machine Learning Python R SQL+113

View profile

Shruti Pangare

Screened

Junior AI/ML Software Engineer specializing in LLMs and data-intensive systems

New York, NY3y exp

NYU Langone HealthNYU

“AI/backend engineer who has owned production applied-ML systems end to end, including a Jitsi meeting intelligence platform with custom RoBERTa boundary detection, LLM summarization, and automated retraining from user feedback. Also has healthcare AI experience building a diabetes medication titration system with strict validation, drift monitoring, and safety guardrails—showing both product speed and high-stakes engineering rigor.”

Python SQL PL/SQL R Pandas NumPy+147

View profile

Prathyusha Mardhi

Screened

Mid-level AI/ML Engineer specializing in LLM agents and workflow automation

4y exp

UnitedHealth GroupKansas State University

“AI/LLM engineer with strong healthcare domain depth who has shipped production-grade agents for care coordination and clinical workflow automation. Stands out for combining Knowledge Graph RAG, LangGraph orchestration, and rigorous eval/guardrail systems to improve reliability in high-stakes environments, with measurable gains in review time, hallucination reduction, latency, and clinician adoption.”

Python R SQL PySpark Java PyTorch+115

View profile

Mounya Bonuga

Screened

Mid-level AI/ML Engineer specializing in multimodal AI and recommendation systems

USA4y exp

Goldman SachsUniversity of Central Oklahoma

“ML/AI engineer with hands-on ownership of a production LLM/RAG system at Goldman Sachs, focused on workflow automation and large-scale document search for operational teams. They combine strong MLOps and backend engineering skills with practical GenAI evaluation and safety practices, and cite measurable impact including 22% better task guidance accuracy and sub-second search across millions of records.”

Deep Learning Reinforcement Learning Natural Language Processing Time-Series Forecasting Feature Engineering A/B Testing+114

View profile

Richard Wicaksono

Screened

Junior Data Engineer and Analyst specializing in ETL, analytics, and e-commerce data

Walnut, CA3y exp

Dreamstream, LLCUC Irvine

“Data engineer with a Master's in Data Science who has owned 30+ customer-facing K-12 SIS migrations end-to-end, building ETL, validation, and SOP-driven deployment processes in a PII-sensitive environment. Also brings recent hands-on agentic AI experience from a biotech capstone, where they led a production-oriented NLP-to-SQL + RAG support system that handled about 30% of support queries in testing.”

Python Pandas SQL R Java C+++66

View profile

Sreekar Praneeth Marri

Screened

Junior Robotics & AI Researcher specializing in soft robotics and real-time ML control

Boston, MA2y exp

Boston UniversityBoston University

“Early-career robotics engineer who has integrated LLM/NLP command interfaces (OpenAI/LLaMA) into ROS-controlled industrial manipulators and built data-driven controls for underwater soft robotic actuators. Combines hands-on fabrication (balloon actuator with embedded copper traces) with sensor debugging (IMU/Aurora) and simulation work in Gazebo, with practical exposure to edge deployment constraints on Jetson Nano and model quantization.”

Reinforcement Learning MATLAB Python SQL PyTorch TensorFlow+85

View profile

Sai Teja Challa

Screened

Mid-Level AI Engineer specializing in NLP, computer vision, and LLM applications

Austin, TX3y exp

BookedByUniversity of Maryland, Baltimore County

“LLM/RAG practitioner who productionized an LLM-driven customer communication and transaction understanding system at PayPal, emphasizing privacy/compliance guardrails and large-scale data normalization. Experienced in real-time debugging of hallucinations via retrieval pipeline tuning and in leading hands-on developer workshops and sales-aligned POCs to drive adoption.”

Python PySpark SQL NoSQL NumPy Pandas+169

View profile

KAUSHIK KUMAR KOLAR RAVINDRA KUMAR

Screened

Intern-level Software Engineer specializing in AI/ML and time-series forecasting for finance

Bangalore, Karnataka, India0y exp

CiscoNJIT

“Built a production AI-driven QA automation platform using a multi-agent architecture (MCPs + LangGraph) to run parallel website tests across multiple device environments via automated image building and containerization. Currently collaborating with restaurant operators and managers to deliver an agentic restaurant analytics system, emphasizing deep domain discovery with non-technical stakeholders.”

AWS Bitbucket Caching Data analysis Data cleaning Data preprocessing+96

View profile

Arjun Sharma

Screened

Staff Data Scientist specializing in AI/ML engineering and MLOps

Austin, TX10y exp

AccentureTexas State University

“ML/NLP engineer with experience at Flatiron Health building a production NLP platform that processed millions of clinical notes, using BERT/BiLSTM-CRF and spaCy to extract and normalize entities from noisy EMR text with oncologist-in-the-loop validation. Also built scalable retail ML workflows (Spark + Kubernetes + feature store caching) and applied vector databases plus contrastive-learning fine-tuning to improve retrieval relevance and recommendations.”

Python SQL Java Scala PyTorch TensorFlow+122

View profile

Giri Nathan

Screened

Executive Technology Leader (CTO/CIO) specializing in cloud, AI/ML, and cybersecurity

38y exp

Production Resource GroupCharter Oak State College

“CTO who ties technology strategy directly to business outcomes, building multi-year roadmaps with measurable ROI. Led major modernization (cloud, data platform, unified API, microservices + CI/CD) delivering 5x faster releases/deployments, 99.8% uptime, and 40% user growth without headcount increases, while scaling engineering from 15 to 80+ in ~18 months.”

Leadership Strategic planning Mentoring Coaching Team management Budgeting+108

View profile

Pavan Kumar Malasani

Screened

Mid-level AI/ML Engineer specializing in financial risk, fraud detection, and GenAI

Remote, USA4y exp

CitigroupUniversity of Colorado Boulder

“GenAI/ML engineer in Citigroup’s finance environment who has deployed production RAG systems for investment banking under strict privacy and model-risk constraints. Built an internal-VPC Llama2 + Pinecone + LangChain solution with NER redaction and citation-based verification to prevent hallucinations, delivering major time savings, and also partnered with global finance executives to ship an AI early-warning indicator for treasury/liquidity risk.”

A/B Testing Amazon CloudWatch Apache Airflow Apache Hive Apache Kafka Apache Spark+137

View profile

Dinesh Kumar Patibandla

Screened

Mid-level Machine Learning Engineer specializing in LLMs and RAG for finance and healthcare

Texas, USA4y exp

Goldman SachsUniversity of North Texas

“ML Engineer with recent Goldman Sachs experience building and deploying a production RAG/LLM assistant for summarization, drafting, and internal knowledge retrieval across financial, risk, and compliance documents. Designed for heavy regulatory constraints and scaled to 10,000+ concurrent users using Kubernetes-based orchestration, dynamic LLM routing, and rigorous testing (adversarial prompts, A/B tests, load simulations) with privacy controls like differential privacy.”

A/B Testing Apache Hadoop Apache Hive Apache Spark AWS BERT+118

View profile

Pandari G

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

San Francisco, USA5y exp

SephoraSaint Mary's College of California

“GenAI/LLM engineer with production deployments in both fintech and retail: built an AI-powered mortgage document analysis/automated underwriting pipeline at Fannie Mae (OCR + custom LLM) cutting underwriting review from 3–4 hours to under an hour with privacy-by-design controls. Also helped build Sephora’s GenAI product advisory bot using LangChain-orchestrated RAG (Azure GPT-4, Azure AI Search, MySQL HeatWave vector search), focusing on grounding, evaluation, and compliance-aware architecture choices.”

Python SQL R PySpark PowerShell Generative AI+158

View profile

HEMANTH KUMAR KOTTAPALLI

Screened

Mid-level Machine Learning Engineer specializing in GPU-accelerated LLMs and MLOps

GA, USA4y exp

BlackRockMercer University

“Built and deployed a production LLM-powered decision-support system for supply-chain planners that explains demand forecast changes using grounded retrieval from sales, promotion, inventory, and supplier data. Implemented strict anti-hallucination guardrails and latency optimizations, deployed as a real-time AWS API with monitoring, and reported ~15% forecast accuracy improvement and ~12% supply-chain risk reduction. Experienced orchestrating data/ML/LLM workflows with Airflow, LangChain/LangGraph-style patterns, and AWS Step Functions while partnering closely with non-technical business users via demos and example-based requirements.”

Agile Apache Hadoop Apache Kafka Apache Spark AWS AWS Lambda+110

View profile

Sudhan Louis

Screened

Director of Enterprise Architecture specializing in digital transformation, AI, and API strategy

Rolling Hills Estates, CA26y exp

HerbalifeBoston University

“Hands-on architect/technology leader who builds prototypes (including Agentic AI wellness/biomarkers) and then scales teams to execute. Led a ~$400M global e-commerce transformation spanning 95 countries with active-active US/EU multi-region resilience, microservices/MFE (MACH), and strong security patterns (service mesh + API gateway + Ping Identity), plus modern data foundations (customer hub/MDM/Snowflake, data fabric/medallion).”

Digital Transformation Microservices Architecture Agile CRM Salesforce E-commerce+155

View profile

Sachin Reddy Kunta

Screened

Mid-Level Backend Software Engineer specializing in payments, fraud systems, and AI agent infrastructure

San Francisco, CA3y exp

Saayam for AllNYU

“Early-career engineer who owned an end-to-end objective assessment/coding contest platform at an edtech startup, using Postgres + S3 and Redis (queues + ZSET) to decouple and scale code submission processing with worker sandboxes. Also implemented idempotency controls and set up monitoring and CI/CD while the rest of the team focused on curriculum.”

Go Python Java Node.js TypeScript SQL+136

View profile

Prem Kumar

Screened

Senior Data Engineer specializing in cloud data platforms and regulated analytics

McLean, VA6y exp

Capital OneRowan University

“Data engineer at Capital One building AWS-based real-time and batch pipelines and backend data services for financial/fraud use cases. Has owned end-to-end pipelines processing millions of records/day, implemented dbt/Great Expectations quality gates, and tuned Redshift/Snowflake workloads (cutting query latency ~22–25% and reducing pipeline failures ~30–40%) while supporting 15+ downstream consumers.”

Python SQL PySpark Scala Java Bash+152

View profile

Snehitha Borra

Screened

Mid-level Data Engineer specializing in cloud data platforms and big data pipelines

5y exp

Molina HealthcareUniversity of Michigan-Dearborn

“Healthcare data engineer with hands-on ownership of claims/member data pipelines on a cloud analytics platform, spanning batch and streaming ingestion (Airflow/Kafka/Spark/Databricks) through serving for reporting. Emphasizes reliability and data quality via embedded validation, schema-drift detection, deduplication, and operational monitoring/incident response, plus pragmatic CI/CD and observability setup in early-stage/ambiguous projects.”

Python Scala SQL PySpark Shell Scripting PowerShell+128

View profile

Akashreddy Madduri

Screened

Senior Backend Engineer specializing in real-time data platforms for FinTech and Healthcare

Plano, Texas6y exp

JPMorgan ChaseNorthern Arizona University

“Backend/data engineer with experience at JPMorgan building near real-time payment risk and fraud scoring pipelines using Python, Spark Structured Streaming, and Delta Lake, emphasizing auditability, security, and data correctness (dedupe/late events) to reduce false positives. Also led a legacy-to-cloud migration of claims/eligibility data at Cogna with parallel runs, phased rollout, and healthcare-specific validation (ICD-CPT mapping).”

Python FastAPI Flask SQL PySpark Shell Scripting+102

View profile

Sanat Ahuja

Screened

Senior Engineering Manager specializing in platform, data/ML, and identity/access systems

Los Angeles, CA16y exp

GoodyearUSC

“Senior engineering leader from Goodyear’s AndGo startup-like division who scaled the org from 12 to 30+ across pod-based teams and introduced an Architect Guild/ARD governance model. Led a 4-month Europe launch requiring AWS regional infrastructure, GDPR compliance, i18n/l10n, and new EMEA reporting pipelines, and has hands-on depth in API performance, incident response, and GraphQL/Hasura adoption to boost product velocity.”

Leadership Performance Optimization Incident Response Cloud-Native Architecture High Availability Event-Driven Architecture+139

View profile

Sushma Mangalampati

Screened

Mid-level Data Engineer specializing in lakehouse ETL and analytics engineering

Boston, MA6y exp

ServiceNowNortheastern University

“Data engineer with strong end-to-end ownership of production lakehouse pipelines (Snowflake + Databricks + Airflow + dbt + Great Expectations), handling 8M+ records/month and 500K+ daily CDC updates. Delivered measurable reliability and efficiency gains (41% cost reduction, freshness improved from 4h to 30m, 35% fewer downstream incidents) and has experience building a lakehouse platform from scratch across 12 source systems.”

Python SQL PySpark Apache Spark Stored Procedures ETL+89

View profile

Zhiwen Zhao

Screened

Junior Data Engineer specializing in cloud ETL and big data platforms

New York, NY3y exp

Bank of ChinaNYU

“Data engineer focused on transit/transportation datasets, building Spark-based pipelines that ingest from Oracle/APIs, apply PySpark data-quality fixes, and publish star-schema fact tables to Azure Data Lake. Experienced troubleshooting complex Spark failures (using checkpointing to manage long lineage) and operating Airflow-driven backfills and GitLab CI deployments for production DAGs.”

Python Java Scala R SQL C#+75

View profile

Jamie Cook

Screened

Senior Machine Learning Engineer specializing in AI search and recommendation systems

Plantation, FL8y exp

ChewyUniversity of Miami

“Built internal production LLM tools for engineering and support, including a customer-health assistant and a RAG-based incident explainer grounded in logs, metrics, and deploy data. Stands out for combining strong GenAI safety/evaluation practices with pragmatic backend engineering, delivering measurable impact like a 40% drop in data-help requests and answers in seconds instead of minutes or hours.”

Machine Learning Artificial Intelligence Data Pipelines Data Ingestion Feature Engineering Model Deployment+115

View profile

Preeti Pandey

Screened

Senior AI/ML Engineer specializing in predictive analytics and NLP

Birmingham, AL10y exp

Blue Cross and Blue Shield of AlabamaLiverpool John Moores University

“ML/AI engineer with hands-on experience building production healthcare AI systems across predictive modeling and GenAI. They built an end-to-end patient risk prediction platform and a RAG-based clinical summarization feature, combining strong NLP/LLM skills with AWS deployment, monitoring, drift detection, and reusable Python service design to deliver measurable clinical and operational impact.”

Python Pandas NumPy PySpark SQL MLOps+125

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?