Vetted Apache Spark Professionals

Pre-screened and vetted.

RV

Mid-Level Full-Stack Developer specializing in FinTech

Remote, USA4y exp
IntuitMississippi State University

Backend-heavy full-stack engineer with experience at Intuit (TurboTax Live) and Paytm payments, building and scaling Java/Spring Boot microservices for high-traffic transaction systems. Has hands-on wins improving peak-load performance using Redis/disk caching and Kafka event-driven patterns, plus React/Redux work for web app integration and strong monitoring practices with ELK.

View profile
SV

Intern Software Engineer specializing in full-stack, ML, and optimization

New York, NY0y exp
GeminiUniversity of Wisconsin–Madison

Built a production-style PyTorch LSTM system that generates structured piano compositions from 1200+ MIDI files, then significantly improved long-range musical coherence by implementing Bahdanau attention based on research literature. Also has internship experience using Docker Compose for containerized backend workloads and has independently used Ray to scale ML experiments across multiple GPUs, including dealing with GPU scheduling/memory oversubscription issues.

View profile
RR

Rahul Reddy

Screened

Senior Data Engineer specializing in cloud data platforms and big data pipelines

New York, NY6y exp
CVS HealthSouthern Arkansas University

Data engineer with healthcare (CVS Health) experience who migrated production PySpark workloads to native BigQuery SQL and built a Great Expectations-based validation microservice on GKE (Flask + REST) integrated into Cloud Composer. Has operated high-volume pipelines (~300–400GB/day) and designed external vendor ingestion on AWS (Lambda/Step Functions/Glue) with schema-drift detection, alerting, and backfill-safe controls to protect downstream Snowflake/BigQuery tables.

View profile
RK

Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems

NJ, USA4y exp
Scale AIRowan University

Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.

View profile
Lavanya Chilakalapudi - Mid-level Full-Stack Developer specializing in cloud-native web apps and APIs in Tampa, FL

Mid-level Full-Stack Developer specializing in cloud-native web apps and APIs

Tampa, FL5y exp
DatabricksUniversity of South Florida

Backend engineer with experience building microservice-based systems that integrate LLM workflows (code review suggestions, documentation generation, test scaffolding) using REST APIs, Celery/Redis, and OpenTelemetry for observability. Demonstrates hands-on database and performance optimization in PostgreSQL/SQLAlchemy (bulk inserts, lock mitigation, cursor-based pagination) plus multi-tenant data isolation via tenant-aware models, middleware scoping, and schema/row-level strategies.

View profile
Shriya Bannikop - Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems in Seattle, WA

Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems

Seattle, WA5y exp
Amazon Web ServicesKLE Technological University

Full-stack engineer who built and owned an AI-assisted job-matching dashboard in Next.js App Router/TypeScript, keeping LLM logic server-side and improving performance via deduplication, caching/revalidation, and streaming (35% fewer duplicate LLM calls; 40% faster first render). Also has strong data/backend chops: designed Postgres models and optimized queries at million-record scale (1.8s to 120ms) and built durable AWS multi-region telemetry workflows with idempotency, retries, and monitoring.

View profile
Vidhi Upadhyay - Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems in Remote

Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems

Remote8y exp
Saayam for AllCarnegie Mellon University

Independently built a production-grade, containerized enterprise agentic AI platform (stateful orchestration + RAG) focused on real-world reliability—guardrails, citation-based outputs, reranking, query rewriting, and evaluation harnesses to reduce hallucinations. Hands-on with OpenAI SDK, CrewAI, and LangGraph, and has delivered AI solutions for non-technical NGO stakeholders via demos and practical POCs.

View profile
Bhanu Chander - Senior Data Engineer specializing in cloud data platforms and real-time pipelines in New York, NY

Bhanu Chander

Screened

Senior Data Engineer specializing in cloud data platforms and real-time pipelines

New York, NY6y exp
DisneyIndiana Wesleyan University

Data engineer focused on reliability and observability, building end-to-end pipelines processing millions of records/day from sources like S3 and Kafka. Has hands-on experience with Airflow-based data quality automation, PySpark/Databricks transformations, and shipping versioned Python REST APIs deployed via Docker/Kubernetes with CI/CD (Jenkins) and monitoring (CloudWatch/Azure Logs).

View profile
BC

Mid-level GenAI Engineer specializing in RAG, LLMs, and enterprise AI

4y exp
Cardinal HealthRivier University

Built and shipped production LLM agents that automate document processing and decision workflows, with a strong focus on reliability, guardrails, and measurable business impact. Stands out for combining RAG, tool calling, evals/monitoring, and ERP integration to deliver 30-35% manual effort reduction and higher throughput without additional headcount.

View profile
SD

Shimao Du

Screened

Junior Full-Stack Engineer specializing in cloud, AI, and distributed systems

Pittsburgh, PA2y exp
Snapbit LLCCarnegie Mellon University

Full-stack engineer from early-stage startups who has owned AI products end to end, from B2B document intelligence platforms on AWS to an HVAC voice assistant and a GCP-based RAG research system. Stands out for combining hands-on backend/infra depth with team leadership in lean environments, and for shipping scalable AI systems that contributed to roughly 1 million yuan in sponsorship.

View profile
MI

Mid-level Data Scientist specializing in machine learning and big data analytics

Bentonville, AR6y exp
WalmartUniversity of North Texas

Walmart engineer who built and shipped a production LLM+RAG system to automate triage and analysis of computer support chats/tickets, producing grounded, schema-constrained JSON outputs for summaries, urgency, and routing recommendations. Emphasizes reliability (hallucination control, confidence thresholds, human-in-the-loop) and runs end-to-end pipelines with Airflow and AWS-native orchestration, plus rigorous evaluation and monitoring tied to business KPIs.

View profile
ZJ

ZHIYONG JIANG

Screened

Senior AI & Machine Learning Engineer specializing in GenAI, Agentic AI, and RAG

19y exp
DisneyUniversity of Utah

Built a production agentic AI system to automate data science work using a layered architecture (executive-summary handling, tool-based execution, and on-the-fly code generation). Demonstrates strong end-to-end agent development practices including RAG with vector databases, prompt engineering, and multi-method evaluation (LLM-as-judge/human/code-based), plus Airflow-based orchestration for ML data pipelines and close collaboration with business end users.

View profile
SM

Mid-level Data Scientist specializing in NLP, LLMs, and cloud ML platforms

Remote, USA5y exp
Wells FargoUniversity of Illinois Urbana-Champaign

LLM/MLOps engineer who has shipped production systems for complaint intelligence and contact-center NLU, including LoRA/RLHF-tuned LLaMA models deployed on GKE with vLLM and Vertex AI batch pipelines to BigQuery. Demonstrates strong practical focus on hallucination control, data imbalance mitigation, and production monitoring (Langfuse) with regression testing and canary rollouts, plus experience orchestrating complex workflows with AWS Step Functions.

View profile
VV

Vishnu Varma

Screened

Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps

Milpitas, California8y exp
DatabricksCampbellsville University

AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.

View profile
KT

Mid-level Data Scientist specializing in machine learning and generative AI

Saint Louis, MO5y exp
DoorDashSaint Louis University

ML/LLM engineer who has shipped a production transformer-based document understanding system on AWS, owning the full pipeline from domain fine-tuning to Dockerized CI/CD deployment. Demonstrates strong production rigor—latency optimization (distillation/quantization, async batching, autoscaling), orchestration with Airflow/Step Functions/Azure Data Factory, and monitoring/drift detection—plus experience translating ops stakeholder needs into adopted AI automation via dashboards.

View profile
RR

Mid-level Data Scientist specializing in risk, forecasting, and segmentation across finance and healthcare

McLean, Virginia5y exp
Capital OneUniversity of Cincinnati

Data/ML engineer with experience across pharma (Dr. Reddy Laboratories) and financial services (Cincinnati Financial, Capital One), building production NLP and entity-resolution systems that connect messy unstructured text with enterprise SQL data. Delivered semantic search with BERT + vector DB and domain fine-tuning (reported ~35% relevance lift), and builds robust pipelines using Airflow/dbt/Spark with strong validation, monitoring, and stakeholder-aligned rollout practices.

View profile
US

Uddesh Singh

Screened

Mid-level Software Engineer specializing in AI agents and cloud-native microservices

Irving, TX4y exp
PaycomUniversity of Texas at Dallas

Built and shipped a production LLM-powered multi-agent system that autonomously generates and publishes YouTube videos end-to-end (trend discovery, script writing, image/caption generation, timestamped video assembly). Emphasizes production readiness with extensive automated testing, Redis/Postgres/TimescaleDB state orchestration, and Prometheus/Grafana monitoring, reporting ~100x faster content production and improved engagement/viewership.

View profile
Vivek Reddy - Mid-level Data Scientist/Data Engineer specializing in ML pipelines, insurance and healthcare analytics in Los Angeles, CA

Vivek Reddy

Screened

Mid-level Data Scientist/Data Engineer specializing in ML pipelines, insurance and healthcare analytics

Los Angeles, CA7y exp
Venture ConnectUC Berkeley

Built a production assistive-vision iPhone app to help visually impaired users find grocery items, training a custom YOLO detector on 2,000+ self-collected/annotated images and deploying via CoreML with a cloud multimodal LLM for navigation instructions. Brings hands-on AWS serverless + ECS container deployment (CDK/GitHub Actions) and a disciplined approach to AI workflow reliability (state-machine design, offline evals, stress tests, logging/metrics), plus experience communicating model insights to non-technical stakeholders (MOTER Technologies).

View profile
Nicholas Moore - Senior Full-Stack Engineer specializing in scalable cloud-native systems in Lehi, Utah

Senior Full-Stack Engineer specializing in scalable cloud-native systems

Lehi, Utah13y exp
KomBeaMidwestern State University

Backend/data engineer with production experience building high-concurrency customer engagement platforms at KomBea on AWS (EKS + Lambda) using FastAPI/Django, PostgreSQL, Redis, and strong observability. Has modernized legacy batch systems into modular Python services with parallel-run parity validation and phased rollouts, and has delivered resilient AWS Glue ETL pipelines with schema evolution and data quality controls.

View profile
Sakshi Dinesh Deore - Mid-level Software Engineer specializing in AWS, DevOps automation, and data platforms in Bellevue, USA

Mid-level Software Engineer specializing in AWS, DevOps automation, and data platforms

Bellevue, USA3y exp
AmazonUC San Diego

Engineer with Securonix experience deploying and operating production microservices and real-time data-processing systems at high throughput. Led AWS infrastructure, CI/CD, monitoring, and customer-driven customization for a threat-report classification solution, including rule adjustments and model retraining based on live client feedback.

View profile
Eric Low - Principal Engineering Leader specializing in platform, product, and AI advisory

Eric Low

Screened

Principal Engineering Leader specializing in platform, product, and AI advisory

14y exp
Catalyst AICal State East Bay

Fractional CTO/lead engineer who shipped an end-to-end Next.js + FastAPI product experience (login, data processing results, chatbot Q&A) with an architecture designed to support future ML model integration. Has led large-scale engineering enablement (continuous delivery across ~150 devs/200 systems), owned production incident response with lasting test/contract improvements, and delivered a 3x productivity gain by fixing debugging/tooling bottlenecks while mentoring junior teams into independent delivery.

View profile
AP

Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision

USA4y exp
DatabricksGannon University

ML/AI engineer with strong end-to-end production ownership across predictive ML and Generative AI use cases. They built a churn prediction platform that cut churn 12% and preserved about $1.2M in annual revenue, and also shipped a RAG-based support assistant that reduced ticket resolution time 30% while improving agent satisfaction and onboarding speed.

View profile

Need someone specific?

AI Search