Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

Ramyasri Veerapaneni

Screened

Mid-Level Full-Stack Developer specializing in FinTech

Remote, USA4y exp

IntuitMississippi State University

“Backend-heavy full-stack engineer with experience at Intuit (TurboTax Live) and Paytm payments, building and scaling Java/Spring Boot microservices for high-traffic transaction systems. Has hands-on wins improving peak-load performance using Redis/disk caching and Kafka event-driven patterns, plus React/Redux work for web app integration and strong monitoring practices with ELK.”

Apache Kafka Apache Spark API Design AWS C C#+83

View profile

Skanda Vyas Srinivasan

Screened

Intern Software Engineer specializing in full-stack, ML, and optimization

New York, NY0y exp

GeminiUniversity of Wisconsin–Madison

“Built a production-style PyTorch LSTM system that generates structured piano compositions from 1200+ MIDI files, then significantly improved long-range musical coherence by implementing Bahdanau attention based on research literature. Also has internship experience using Docker Compose for containerized backend workloads and has independently used Ray to scale ML experiments across multiple GPUs, including dealing with GPU scheduling/memory oversubscription issues.”

Algorithms Angular Bash C C#C+++104

View profile

Rahul Reddy

Screened

Senior Data Engineer specializing in cloud data platforms and big data pipelines

New York, NY6y exp

CVS HealthSouthern Arkansas University

“Data engineer with healthcare (CVS Health) experience who migrated production PySpark workloads to native BigQuery SQL and built a Great Expectations-based validation microservice on GKE (Flask + REST) integrated into Cloud Composer. Has operated high-volume pipelines (~300–400GB/day) and designed external vendor ingestion on AWS (Lambda/Step Functions/Glue) with schema-drift detection, alerting, and backfill-safe controls to protect downstream Snowflake/BigQuery tables.”

Python Java SQL MySQL PostgreSQL Apache Hive+118

View profile

Raghav Konduri

Screened

Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems

NJ, USA4y exp

Scale AIRowan University

“Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.”

A/B Testing Agile Amazon CloudWatch Amazon EC2 Amazon EKS Amazon Kinesis+151

View profile

Lavanya Chilakalapudi

Screened

Mid-level Full-Stack Developer specializing in cloud-native web apps and APIs

Tampa, FL5y exp

DatabricksUniversity of South Florida

“Backend engineer with experience building microservice-based systems that integrate LLM workflows (code review suggestions, documentation generation, test scaffolding) using REST APIs, Celery/Redis, and OpenTelemetry for observability. Demonstrates hands-on database and performance optimization in PostgreSQL/SQLAlchemy (bulk inserts, lock mitigation, cursor-based pagination) plus multi-tenant data isolation via tenant-aware models, middleware scoping, and schema/row-level strategies.”

Ajax Ansible Apache Airflow Apache Kafka Apache Spark API Gateway+164

View profile

Shriya Bannikop

Screened

Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems

Seattle, WA5y exp

Amazon Web ServicesKLE Technological University

“Full-stack engineer who built and owned an AI-assisted job-matching dashboard in Next.js App Router/TypeScript, keeping LLM logic server-side and improving performance via deduplication, caching/revalidation, and streaming (35% fewer duplicate LLM calls; 40% faster first render). Also has strong data/backend chops: designed Postgres models and optimized queries at million-record scale (1.8s to 120ms) and built durable AWS multi-region telemetry workflows with idempotency, retries, and monitoring.”

Agile Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon ECS Amazon EKS+170

View profile

Vidhi Upadhyay

Screened

Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems

Remote8y exp

Saayam for AllCarnegie Mellon University

“Independently built a production-grade, containerized enterprise agentic AI platform (stateful orchestration + RAG) focused on real-world reliability—guardrails, citation-based outputs, reranking, query rewriting, and evaluation harnesses to reduce hallucinations. Hands-on with OpenAI SDK, CrewAI, and LangGraph, and has delivered AI solutions for non-technical NGO stakeholders via demos and practical POCs.”

Python C++SQL MySQL .NET Generative AI+150

View profile

Bhanu Chander

Screened

Senior Data Engineer specializing in cloud data platforms and real-time pipelines

New York, NY6y exp

DisneyIndiana Wesleyan University

“Data engineer focused on reliability and observability, building end-to-end pipelines processing millions of records/day from sources like S3 and Kafka. Has hands-on experience with Airflow-based data quality automation, PySpark/Databricks transformations, and shipping versioned Python REST APIs deployed via Docker/Kubernetes with CI/CD (Jenkins) and monitoring (CloudWatch/Azure Logs).”

Python SQL Scala C#JavaScript Java+140

View profile

Binita Chourasia

Screened

Mid-level GenAI Engineer specializing in RAG, LLMs, and enterprise AI

4y exp

Cardinal HealthRivier University

“Built and shipped production LLM agents that automate document processing and decision workflows, with a strong focus on reliability, guardrails, and measurable business impact. Stands out for combining RAG, tool calling, evals/monitoring, and ERP integration to deliver 30-35% manual effort reduction and higher throughput without additional headcount.”

Python SQL Generative AI Large Language Models Prompt Engineering Retrieval-Augmented Generation+142

View profile

Shimao Du

Screened

Junior Full-Stack Engineer specializing in cloud, AI, and distributed systems

Pittsburgh, PA2y exp

Snapbit LLCCarnegie Mellon University

“Full-stack engineer from early-stage startups who has owned AI products end to end, from B2B document intelligence platforms on AWS to an HVAC voice assistant and a GCP-based RAG research system. Stands out for combining hands-on backend/infra depth with team leadership in lean environments, and for shipping scalable AI systems that contributed to roughly 1 million yuan in sponsorship.”

Python Java C++TypeScript Go Spring Boot+112

View profile

Moses Immanuel

Screened

Mid-level Data Scientist specializing in machine learning and big data analytics

Bentonville, AR6y exp

WalmartUniversity of North Texas

“Walmart engineer who built and shipped a production LLM+RAG system to automate triage and analysis of computer support chats/tickets, producing grounded, schema-constrained JSON outputs for summaries, urgency, and routing recommendations. Emphasizes reliability (hallucination control, confidence thresholds, human-in-the-loop) and runs end-to-end pipelines with Airflow and AWS-native orchestration, plus rigorous evaluation and monitoring tied to business KPIs.”

Agile Amazon EC2 Amazon Redshift Amazon S3 Apache Hadoop Apache Hive+172

View profile

ZHIYONG JIANG

Screened

Senior AI & Machine Learning Engineer specializing in GenAI, Agentic AI, and RAG

19y exp

DisneyUniversity of Utah

“Built a production agentic AI system to automate data science work using a layered architecture (executive-summary handling, tool-based execution, and on-the-fly code generation). Demonstrates strong end-to-end agent development practices including RAG with vector databases, prompt engineering, and multi-method evaluation (LLM-as-judge/human/code-based), plus Airflow-based orchestration for ML data pipelines and close collaboration with business end users.”

Python C SQL MATLAB Java Machine Learning+110

View profile

Srushti Manjunath

Screened

Mid-level Data Scientist specializing in NLP, LLMs, and cloud ML platforms

Remote, USA5y exp

Wells FargoUniversity of Illinois Urbana-Champaign

“LLM/MLOps engineer who has shipped production systems for complaint intelligence and contact-center NLU, including LoRA/RLHF-tuned LLaMA models deployed on GKE with vLLM and Vertex AI batch pipelines to BigQuery. Demonstrates strong practical focus on hallucination control, data imbalance mitigation, and production monitoring (Langfuse) with regression testing and canary rollouts, plus experience orchestrating complex workflows with AWS Step Functions.”

Python R SQL MATLAB C++Scala+169

View profile

Vishnu Varma

Screened

Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps

Milpitas, California8y exp

DatabricksCampbellsville University

“AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.”

Python SQL PySpark Bash TensorFlow PyTorch+106

View profile

Keerthana Tammina

Screened

Mid-level Data Scientist specializing in machine learning and generative AI

Saint Louis, MO5y exp

DoorDashSaint Louis University

“ML/LLM engineer who has shipped a production transformer-based document understanding system on AWS, owning the full pipeline from domain fine-tuning to Dockerized CI/CD deployment. Demonstrates strong production rigor—latency optimization (distillation/quantization, async batching, autoscaling), orchestration with Airflow/Step Functions/Azure Data Factory, and monitoring/drift detection—plus experience translating ops stakeholder needs into adopted AI automation via dashboards.”

Agile Amazon Redshift Amazon S3 Amazon SageMaker Anomaly Detection Apache Hadoop+157

View profile

Rishitha Reddy K

Screened

Mid-level Data Scientist specializing in risk, forecasting, and segmentation across finance and healthcare

McLean, Virginia5y exp

Capital OneUniversity of Cincinnati

“Data/ML engineer with experience across pharma (Dr. Reddy Laboratories) and financial services (Cincinnati Financial, Capital One), building production NLP and entity-resolution systems that connect messy unstructured text with enterprise SQL data. Delivered semantic search with BERT + vector DB and domain fine-tuning (reported ~35% relevance lift), and builds robust pipelines using Airflow/dbt/Spark with strong validation, monitoring, and stakeholder-aligned rollout practices.”

Python R SQL Scala Java Scikit-learn+139

View profile

Uddesh Singh

Screened

Mid-level Software Engineer specializing in AI agents and cloud-native microservices

Irving, TX4y exp

PaycomUniversity of Texas at Dallas

“Built and shipped a production LLM-powered multi-agent system that autonomously generates and publishes YouTube videos end-to-end (trend discovery, script writing, image/caption generation, timestamped video assembly). Emphasizes production readiness with extensive automated testing, Redis/Postgres/TimescaleDB state orchestration, and Prometheus/Grafana monitoring, reporting ~100x faster content production and improved engagement/viewership.”

AI Agents Apache Kafka Apache Spark AWS AWS Lambda BigQuery+82

View profile

Vivek Reddy

Screened

Mid-level Data Scientist/Data Engineer specializing in ML pipelines, insurance and healthcare analytics

Los Angeles, CA7y exp

Venture ConnectUC Berkeley

“Built a production assistive-vision iPhone app to help visually impaired users find grocery items, training a custom YOLO detector on 2,000+ self-collected/annotated images and deploying via CoreML with a cloud multimodal LLM for navigation instructions. Brings hands-on AWS serverless + ECS container deployment (CDK/GitHub Actions) and a disciplined approach to AI workflow reliability (state-machine design, offline evals, stress tests, logging/metrics), plus experience communicating model insights to non-technical stakeholders (MOTER Technologies).”

A/B Testing Amazon Bedrock Amazon ECS Amazon RDS AWS Lambda CI/CD+109

View profile

Nicholas Moore

Screened

Senior Full-Stack Engineer specializing in scalable cloud-native systems

Lehi, Utah13y exp

KomBeaMidwestern State University

“Backend/data engineer with production experience building high-concurrency customer engagement platforms at KomBea on AWS (EKS + Lambda) using FastAPI/Django, PostgreSQL, Redis, and strong observability. Has modernized legacy batch systems into modular Python services with parallel-run parity validation and phased rollouts, and has delivered resilient AWS Glue ETL pipelines with schema evolution and data quality controls.”

Python Django FastAPI Flask Go Node.js+138

View profile

Sakshi Dinesh Deore

Screened

Mid-level Software Engineer specializing in AWS, DevOps automation, and data platforms

Bellevue, USA3y exp

AmazonUC San Diego

“Engineer with Securonix experience deploying and operating production microservices and real-time data-processing systems at high throughput. Led AWS infrastructure, CI/CD, monitoring, and customer-driven customization for a threat-report classification solution, including rule adjustments and model retraining based on live client feedback.”

Agile Amazon API Gateway Amazon DynamoDB Amazon EKS Amazon S3 Ansible+105

View profile

Eric Low

Screened

Principal Engineering Leader specializing in platform, product, and AI advisory

14y exp

Catalyst AICal State East Bay

“Fractional CTO/lead engineer who shipped an end-to-end Next.js + FastAPI product experience (login, data processing results, chatbot Q&A) with an architecture designed to support future ML model integration. Has led large-scale engineering enablement (continuous delivery across ~150 devs/200 systems), owned production incident response with lasting test/contract improvements, and delivered a 3x productivity gain by fixing debugging/tooling bottlenecks while mentoring junior teams into independent delivery.”

Ansible Angular AWS CI/CD Claude Data Pipelines+65

View profile

Akhilesh Padala

Screened

Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision

USA4y exp

DatabricksGannon University

“ML/AI engineer with strong end-to-end production ownership across predictive ML and Generative AI use cases. They built a churn prediction platform that cut churn 12% and preserved about $1.2M in annual revenue, and also shipped a RAG-based support assistant that reduced ticket resolution time 30% while improving agent satisfaction and onboarding speed.”

Python Java R SQL PySpark Apache Spark+130

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?