Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

Bhanu Prakash Reddy Dakilli

Screened

Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing

Framingham, MA4y exp

Bank of AmericaNew England College

“Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.”

Python SQL PySpark Apache Spark Java Power BI+97

View profile

Deepthi Mundarinti

Screened

Mid-level Data Engineer specializing in real-time analytics and regulated domains

NC, USA5y exp

JPMorgan ChaseSaint Louis University

“Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable gains in latency, fraud accuracy, false positives, and analyst decision speed.”

Python NumPy Pandas PySpark Scikit-learn TensorFlow+120

View profile

Harrishkumar Loganathan

Screened

Mid AI/Machine Learning Engineer specializing in FinTech and Generative AI

Remote, USA3y exp

SocureArizona State University

“AI/ML engineer with hands-on ownership of enterprise LLM deployments at Freshworks, including a large-scale RAG chatbot serving 15,000+ users across six departments. Stands out for combining deep production engineering skills—AWS microservices, Kubernetes, observability, retrieval quality, and faithfulness evaluation—with strong cross-functional stakeholder leadership and prior large-scale fraud data pipeline experience at Socure.”

Python R PySpark Node.js JavaScript TypeScript+135

View profile

Saisureshreddy Challa

Screened

Mid-level Data Scientist specializing in AI/ML, LLMs, and domain analytics

California, USA6y exp

BlackRockNortheastern University

“BlackRock AI/ML engineer who built and owned a production LLM document intelligence system for regulatory and investment analysis end-to-end. They combined RAG, multi-agent validation, strong evaluation/monitoring, and reusable Python services to process 50K+ documents, cut review time 40-50%, and improve decision accuracy by about 25%.”

Python PySpark SQL Scala PostgreSQL MySQL+174

View profile

Ashutosh Jitendra Zawar

Screened

Mid-level AI/ML Engineer specializing in generative AI, NLP, and MLOps

San Jose, CA4y exp

ServiceNowUniversity of North Carolina at Charlotte

“ML/AI engineer with hands-on ownership of production GenAI and computer vision systems, spanning experimentation, deployment, monitoring, and iterative optimization. Stands out for shipping an enterprise RAG platform that cut manual review by 50% and a defect detection pipeline that reduced report generation from 15 minutes to under 1 second while maintaining high uptime and strong operational discipline.”

SDLC Agile MLOps Cross-Functional Collaboration Machine Learning Deep Learning+154

View profile

Drew Dunn

Screened

Senior AI Engineer specializing in generative AI and production ML systems

Aledo, TX14y exp

Elevance HealthTexas Tech University

“ML/AI engineer with hands-on ownership of production computer vision, speech, and legal RAG systems. Notably improved a key-duplication CV pipeline enough to unblock commercial launch and remove specialist manual measurement, and also shipped a live Quran recitation detection feature for a product with 1M+ users.”

Large Language Models Generative AI PyTorch TensorFlow FAISS Transformers+113

View profile

Aditya Goverdhana

Screened

Mid-level Full-Stack Java Developer specializing in FinTech

New York, NY5y exp

JPMorgan ChaseKent State University

“Built a production AI-powered insights platform for marketing teams analyzing large-scale social and news data, combining Java microservices, Kafka, Spark, React, and LLM-based retrieval workflows. Stands out for shipping customer-facing AI features with measurable gains in accuracy and latency, plus solid reliability practices for high-volume backend systems.”

Java HTML CSS Python C .NET+125

View profile

Rajeev Sai Nitturu

Screened

Mid-level Software Engineer specializing in cloud-native backend and AI systems

Long Beach, CA4y exp

JPMorgan ChaseCalifornia State University, Long Beach

“Candidate takes a disciplined, developer-in-the-loop approach to AI-assisted coding, using AI primarily for brainstorming, suggestions, and optimization while retaining full ownership of architecture and final code decisions. They also actively stay current on AI developments through research papers, communities, and emerging tools.”

Java Python TypeScript JavaScript SQL Data Structures & Algorithms+113

View profile

Naveen Chava

Screened

Mid-level Software Engineer specializing in Generative AI and FinTech systems

Chicago, IL4y exp

PayPalDePaul University

“Candidate brings practical GenAI engineering experience with a disciplined approach to AI-assisted development. They have designed lightweight multi-agent workflows for a RAG-based support copilot, including retrieval, relevance validation, response generation, and groundedness checks to reduce hallucinations.”

React Next.js TypeScript JavaScript Tailwind CSS Node.js+131

View profile

Wei-Hsien Wang

Screened

Entry-level AI Engineer specializing in full-stack generative AI systems

San Jose, CA1y exp

AzazieUC San Diego

“AI/full-stack product engineer who has shipped both user-facing and internal LLM products, from a photo-to-music recommendation app to an experimentation agent at Azazie. Stands out for combining modern app development with production-grade agent and GraphRAG systems, including a 500k+ email analysis platform and measurable impact like 3x experiment velocity, 75% setup-time reduction, and 65% faster task discovery.”

Python SQL R TypeScript JavaScript MATLAB+96

View profile

Ashlesh Bhardwaj

Screened

Director-level Product Leader specializing in FinTech and enterprise finance platforms

Charlotte, NC19y exp

Wells FargoIEC College of Engineering and Technology

“Senior product and technology leader with 23+ years of experience driving modernization in complex enterprise finance and operations environments. He stands out for turning legacy, paper-based or fragmented systems into scalable digital products—cutting a warranty claims process from 30 days to near-instant and using AI to improve service efficiency and reduce testing effort by 30%+. Strong C-suite-facing operator who bridges strategy, architecture, UX, and organizational change.”

Product Strategy Go-to-Market Strategy Digital Transformation Agile Scrum Risk Management+122

View profile

Pooja Shindd

Screened

Mid-level Full-Stack Software Engineer specializing in scalable web and AI systems

Illinois, USA4y exp

University of Illinois Chicago Technology SolutionsUniversity of Illinois Chicago

“Full-stack engineer who has built both a TypeScript-based HR/payroll platform and a production agentic AI support system end to end. Stands out for combining strong product judgment with deep LLM systems thinking: RAG architecture, confidence-based routing, evals, observability, and human-in-the-loop design in a greenfield environment.”

Java Scala Python JavaScript TypeScript SQL+108

View profile

Rithvik Mysore Suresh

Screened

Junior Full-Stack Software Engineer specializing in React and AI-powered applications

Bloomington, IN4y exp

Indiana UniversityIndiana University Bloomington

“Full-stack/AI-focused builder who shipped a production Career Advisor app using LLMs + RAG + vector DB (React/Node/MongoDB/Claude API) and grew it to 2000+ users, handling real deployment issues and CI/CD on Vercel/Render. Also developing an AI-powered iOS “3D World Explorer” (text-to-3D) and has cloud experience across Azure and AWS (S3/SageMaker/EC2).”

Python JavaScript TypeScript C SQL HTML+96

View profile

Sathyavarthan Balachandar

Screened

Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing

Boston, USA3y exp

Fidelity InvestmentsNortheastern University

“Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.”

Python SQL Go Apache Spark PySpark Databricks+125

View profile

Jash Shah

Screened

Mid-level Data Scientist specializing in LLMs, MLOps, and predictive analytics in healthcare and finance

New Jersey, USA4y exp

Johnson & JohnsonStevens Institute of Technology

“Built and deployed a production LLM/RAG clinical decision support system that enables real-time semantic search over unstructured EHR notes and delivers patient risk insights. Strong in healthcare-grade MLOps and compliance (HIPAA, PHI handling, encryption, RBAC, audit logs) and scaled embedding/retrieval pipelines using Spark/Databricks and Airflow. Partnered with clinicians via Power BI dashboards and explainability, contributing to an 18% reduction in patient readmissions.”

A/B Testing API Integration Apache Airflow Apache Hadoop Apache Kafka Apache Spark+102

View profile

SUSENDRANATH MUSANI

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Connecticut, USA5y exp

PfizerUniversity of New Haven

“Built and deployed an enterprise GenAI knowledge assistant over thousands of internal PDFs/reports using a RAG stack (GPT-4 + Hugging Face embeddings + vector DB) to reduce manual search and SME escalations. Uses LangGraph/LangChain to orchestrate modular agent workflows with relevance filtering and fallback handling, and applies rigorous evaluation (golden datasets, edge cases, A/B tests) with production monitoring metrics.”

A/B Testing Agile Apache Kafka Apache Spark AWS Lambda BERT+103

View profile

Aisha Sartaj

Screened

Mid-level AI Engineer specializing in LLM systems, RAG, and MLOps

Remote3y exp

ILMAscentUCLA

“Built an LLM multi-agent “ingredient safety” analyzer for cosmetics that cuts consumer research time from ~20+ minutes to minutes, using LangGraph orchestration, hybrid retrieval (Qdrant + Tavily), and safety-focused critic validation (false rejections reduced ~30%→~8%). Also has research-internship experience building computer-vision pipelines to classify emerald color/clarity by translating gem-expert heuristics into quantitative model features.”

A/B Testing API Gateway AWS AWS Glue AWS Lambda CI/CD+118

View profile

Avijit Saha

Screened

Junior Software Engineer specializing in cloud-native microservices and AI/ML observability

Bedford, TX3y exp

JPMorgan ChaseUniversity of the Cumberlands

“Engineer with banking and industrial/IoT experience who has deployed a payment-processing microservice with zero downtime, handling Protobuf schema evolution and sensitive data migration via dual-write/checksum techniques. Demonstrates strong cross-stack troubleshooting (pinpointed intermittent distributed timeouts to a failing ToR switch port) and customer-facing Python ETL customization using plugin-based parsers and Pydantic validation, plus hands-on monitoring/alerting improvements with operators.”

Agile Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon EKS Amazon S3+103

View profile

Bhuvan Chandi

Screened

Mid-level Data Engineer specializing in AI/ML data platforms

NY, NY6y exp

BlackRockWebster University

“Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.”

Python SQL Shell Scripting Apache Spark PySpark Apache Hadoop+103

View profile

Sravani Kasaraneni

Screened

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA4y exp

ServiceNowRivier University

“Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.”

SDLC Agile Waterfall Python R Java+104

View profile

Kevin Fang

Screened

Intern Software Engineer specializing in full-stack and data systems

Beverly Hills, CA1y exp

Alo YogaUC Irvine

“Software developer with healthcare operations experience at Epic Systems (Referrals & Authorizations), delivering customer-facing tooling to speed manual insurance authorization/denial documentation and support future automation. Also supported an HRIS migration to Workday at Aloe Yoga, solving legacy ID interoperability via scripting and mapping, and demonstrates strong production debugging and test-driven maintainability practices.”

Apache Hadoop Apache Kafka API Development AWS C C#+79

View profile

Min-Han Shih

Screened

Junior Machine Learning Engineer specializing in speech and multimodal AI

Taipei, Taiwan2y exp

FurboUSC

“New grad who has shipped a production vision-language recommendation feature for a pet camera/mobile app, including building a tagged video dataset with human annotators and optimizing inference by FPS downsampling under device compute limits. Also built a multimodal MLLM benchmark using an LLM-as-judge (GPT-5-thinking) with a feedback loop, validated against human scoring, and measured post-feedback quality gains (12% average score improvement).”

Python C C++MySQL Go Apache Spark+61

View profile

Rohit Khoja

Screened

Mid-level Full-Stack Engineer specializing in cloud microservices and NLP/LLM systems

Tempe, AZ4y exp

CitigroupArizona State University

“Full-stack engineer with 3+ years using Java/Spring Boot (Citi) and React, who built a production observability dashboard monitoring 53 microservices across 17 clusters with real-time health/latency tracing and significant performance improvements (cut load time from ~10s). Also designed a serverless AWS face-recognition system (Lambda/S3/SQS) built to handle burst traffic (~1000 concurrent requests), demonstrating strength in scalable, event-driven architectures.”

Agile Amazon EC2 Amazon S3 Amazon SQS Apache Kafka AWS Lambda+106

View profile

Shanmukh Sai Madhu

Screened

Mid-level Data Engineer specializing in real-time pipelines and cloud analytics

Chicago, IL5y exp

JPMorgan ChaseUniversity of South Dakota

“Researcher from the University of South Dakota who built a production medical RAG system to help interpret model predictions by retrieving relevant clinical notes and medical literature, overcoming retrieval accuracy and imaging-dataset challenges through semantic chunking and metadata-driven indexing. Also has hands-on orchestration experience with Airflow and Azure Data Factory, plus a pragmatic approach to LLM evaluation and stakeholder-driven iteration.”

Agile Apache Airflow Apache Kafka Apache Spark AWS AWS Lambda+122

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?