Vetted PySpark Professionals

Pre-screened and vetted.

SN

Mid-level Software Development Engineer specializing in backend systems and ML platforms

New York, USA | 2y exp
Flipkart | NYU

SM

Senior Software Engineer specializing in AWS, DevOps, and automation

9y exp
Capital One | Georgia Tech

SC

Mid AI/ML Engineer specializing in LLM systems and inference optimization

Bay Area, CA | 5y exp
NVIDIA | Webster University

ST

Senior Software Engineer specializing in data engineering, BI analytics, and AI/ML

Moulton, AL | 11y exp
Dropbox | Florida State University

MM

Senior AI/ML Engineer specializing in NLP, computer vision, and MLOps

Ohio, USA | 10y exp
Pixolat LLC

Peeyush Dyavarashetty

Screened References | Moderate rec.

Intern AI/ML Engineer specializing in GenAI, LLMs, and agentic RAG systems

Miami, FL | 2y exp
Scale Up 360 | University of Maryland, College Park

AI/LLM practitioner who built a GPT-2-like language model from scratch at the University of Maryland using PyTorch and multi-GPU distributed training, with experiment tracking in Weights & Biases. As an AI Operations intern at ScaleUp360, delivered multiple production-style AI agent automations (Gmail classification and Fireflies-to-Claude workflows that extract and assign CEO tasks) and set up measurable evaluation using test cases and classification metrics.

SN

Mid-level AI/ML Engineer specializing in NLP, graph models, and MLOps for FinTech and Healthcare

Remote, USA | 5y exp
Stripe | Kent State University

AI/ML engineer who has deployed production LLM/transformer-based systems for merchant intelligence and fraud/support optimization, delivering +27% merchant engagement and +18% payment success. Deep experience in privacy-preserving, PCI DSS-compliant data/ML pipelines (Airflow, AWS Glue, Spark, Delta Lake) and scalable microservices on Kubernetes, plus proven cross-functional delivery in healthcare claims analytics at UnitedHealth Group (12% HEDIS claim reduction).

NM

Mid-level Full-Stack Python Developer specializing in cloud-native banking applications

6y exp
Truist | Pace University

Backend engineer who built a low-latency real-time transaction API in Python/Flask, with strong depth in PostgreSQL/SQLAlchemy performance tuning (time-based partitioning, indexing, connection pooling). Has production experience integrating ML scoring and OpenAI-style APIs with safety/latency controls, and designing multi-tenant isolation strategies including per-tenant pooling/caching and premium-tenant isolation.

Sanketh Reddy

Screened

Senior Data Engineer specializing in cloud data platforms and large-scale ETL

Jersey City, NJ | 6y exp
JPMorgan Chase | University of Texas at Dallas

Data engineer focused on large-scale ETL/ELT pipelines across cloud stacks (GCP and AWS), including Spark-based transformations and orchestration with Airflow. Has experience loading up to ~2TB per BigQuery target table and designing atomic loads to multiple downstream systems (Elasticsearch + Kafka), with Kubernetes deployment and Jenkins CI/CD.

Vikas Ravula

Screened

Senior Data Engineer specializing in cloud data platforms and real-time streaming for financial services

Chicago, IL | 6y exp
Bloomberg | University of Illinois Urbana-Champaign

Data engineer with experience at Bloomberg, UBS, and Bank of America building high-volume financial data platforms and services. Owned an end-to-end pipeline processing ~150–200M records/day (Kafka/Cassandra/S3 → Spark/PySpark → Snowflake) with strong data quality controls and Airflow reliability practices, reporting ~99% reliability and major performance gains. Also built large-scale external API ingestion with compliance-minded rate limiting, schema versioning, and quarantine/validation layers.

SR

Senior Infrastructure Platform Architect specializing in Kubernetes and hybrid cloud

Chicago, IL | 9y exp
Exelon | George Mason University

Platform/infra engineer with strong ownership of Kubernetes on VMware and day-to-day hybrid on-prem-to-AWS operations. Has hands-on experience automating infrastructure delivery with Terraform/Ansible/CI-CD, and has resolved real production issues spanning CSI storage reattachment during upgrades, vSphere storage-latency performance degradation, and hybrid connectivity/routing failures with improved validation, monitoring, and failover.

Vignesh Shanmugasundaram

Junior Software Engineer specializing in full-stack development and applied ML

New York, NY | 2y exp
Amazon | NYU

Full-stack engineer with experience at Zoho and Amazon who has owned production systems end-to-end, including a monolith-to-microservices migration using Kafka and Cassandra that improved search latency ~25% and increased throughput without data loss. Also built a hackathon project (Buildwise) into a sold product for a construction company (AI-driven document compliance checks) and shipped an IoT-based parking availability MVP in 3 weeks.

Poorna Pedapudi

Mid-Level Software Engineer specializing in distributed backend systems and cloud-native microservices

Seattle, WA | 5y exp
Uber | George Mason University

Software engineer focused on data platforms and applied LLM systems: built an internal data quality monitoring layer to catch silent data drift and iterated post-launch after finding ~30% false-positive alerts, reducing noise via dynamic baselines and improved structured logging. Also shipped a production RAG-based internal knowledge assistant over Jira/Confluence with citations, confidence-based fallbacks, and nightly automated evals to prevent regressions.

Abhay Murjani

Screened

Director-level Data Science Manager specializing in ML forecasting, experimentation, and MLOps

New York, NY | 6y exp
American Express | University at Buffalo

Data/ML engineer with experience at American Express and Amazon, owning an end-to-end rewards redemption/liability ML pipeline (~200GB) with rigorous regulatory/audit validation and quarterly executive reporting. Also built web-scraped product datasets with anti-bot protections at a startup and helped modernize an authn/authz service using AWS, plus led early-stage migration work from an internal warehouse to GCP with CI/CD and cloud observability.

SB

Mid-level Software Engineer specializing in cloud backend and distributed systems

Seattle, WA | 3y exp
Amazon | USC

Built a production GenAI support agent at Amazon for FBA on-call operations, using Bedrock, Lambda, RAG, and confidence-based human fallback to safely automate ticket triage. The system materially reduced ticket volume and manual workload while improving MTTR, showing strong depth in reliable LLM agent architecture under real operational constraints.

Grusha Shetty

Screened

Senior Data Analyst specializing in product analytics and experimentation

Berkeley, CA | 3y exp
Games24x7 | UC Berkeley

Analytics candidate with strong product and growth analytics experience across SQL, Spark, Python, and Tableau. They have built clickstream funnel pipelines, automated Bayesian experiment evaluation, and used Markov chain journey modeling to uncover onboarding friction that led to a 5% conversion improvement. They also show strong cross-functional influence by standardizing churn definitions across product and marketing teams and operationalizing adoption in shared dashboards.

Yuanhui Yang

Screened

Senior Software Engineer specializing in Python backend systems on AWS

Livermore, CA | 8y exp
ASML | Shanghai Jiao Tong University

Backend/data engineer from ASML who modernized a legacy SAS-based statistical processing system into a cloud-native AWS platform (Lambda/FastAPI, Step Functions/EventBridge, Glue, S3/RDS) with strong reliability and data-quality practices. Demonstrated measurable performance wins (RDS query reduced from 90+ seconds to <5 seconds) and hands-on incident ownership for production ETL pipelines.

Sarthak Gupta

Screened

Mid-level AI/ML Engineer specializing in LLMs, NLP, and real-time AI systems

New York, NY | 4y exp
New York University | NYU

Backend engineer who built a real-time pipeline for recording, transcribing, and analyzing audio from 400+ news radio stations, scaling Whisper on an HPC cluster with 7 H100 GPUs. Has strong performance optimization experience (30% latency reduction via SQL/query design; 50% DB call reduction via Redis caching) and has implemented region-based data isolation and PII protections in a regulated environment (JPMorgan Chase).

Priya Kotha

Screened

Mid-level Data Engineer specializing in real-time pipelines across FinTech and Healthcare

USA, USA | 4y exp
Plaid | Sacred Heart University

Data engineer at Plaid who built greenfield, end-to-end real-time transaction pipelines and FastAPI data services for fraud detection and analytics, handling millions of events per day. Strong focus on reliability and data integrity via Great Expectations validation, Airflow-based monitoring/SLAs, quarantine/staging patterns, and robust external data ingestion with schema versioning and backfills (reported 50% fewer anomalies and ~40% fewer failures).

Rojin Bakhti

Screened

Junior Software Engineer specializing in Edge AI and ML deployment

San Diego, CA | 3y exp
Qualcomm | USC

Qualcomm engineer building Android applications that run on Qualcomm AI accelerators, with hands-on experience in C++ concurrency, chipset stress testing, and power/performance tuning. Has deployed on-device AI models and built deployment/log post-processing workflows using Docker/Kubernetes and CI/CD; interested in translating this embedded AI/performance background into robotics (perception/real-time systems).

Abhinay Dodda

Screened

Mid-Level AI Engineer specializing in data pipelines and scalable ML systems

United States, USA | 5y exp
Scale AI | Saint Louis University

Data engineer/backend developer with experience owning end-to-end, high-volume data pipelines for ML/analytics using Python, Airflow, SQL, and PySpark, reporting ~30% error reduction through improved reliability and data quality checks. Has also built Django-based REST APIs with caching/pagination and strong versioning practices, and operated external data collection/web scraping pipelines with anti-bot measures, monitoring, retries, and idempotent backfills.
