Vetted PySpark Professionals

Pre-screened and vetted.

MM

Senior Software Engineer specializing in AI/ML backend and cloud infrastructure

Bentonville, AR · 11y exp
Walmart · University of Houston

Backend/data platform engineer with production experience at Walmart and Molina Healthcare, building Python microservices on AWS (EKS + Lambda) for real-time inventory and recommendation systems. Strong in reliability/observability and incident leadership, plus modernizing legacy healthcare workflows and building resilient AWS Glue/PySpark pipelines with schema evolution and data quality controls.

VB

Intern AI/ML Engineer specializing in LLM applications and data infrastructure

Redmond, Washington, USA · 3y exp
Uber · University of Memphis

Hands-on LLM practitioner who built a production document-processing pipeline in Python, tackling long-document handling and latency with chunking/batching and a user-driven correction feedback loop. Experienced operationalizing AI workflows with Kubernetes (CronJobs, autoscaling, scheduled data cleaning and weekly retraining) and applying structured testing/evaluation (E2E, LLM-as-judge, HITL) while communicating solutions clearly to non-technical clients using visual diagrams.

CK

Mid-level Data Engineer specializing in cloud data platforms and FinTech analytics

Chicago, IL · 4y exp
Intuit · DePaul University

Solutions architect/technical consultant with experience across Intuit, Deloitte, and CodeNest Solutions, focused on enterprise data modernization, AI adoption, and real-time streaming in B2B environments. Particularly strong in regulated financial use cases, where they combine hands-on POC building, security/compliance diligence, and modern data stack expertise to help clients modernize legacy systems and close complex enterprise deals.

Siliang Zhang

Screened

Intern Machine Learning Engineer specializing in LLMs, RAG, and vision-language systems

Shanghai, China · 2y exp
Carizon · USC

Robotics ML/software engineer focused on Vision-Language-Action control for 7-DoF robots, replacing tokenized action decoding with continuous regression heads (including a logit-weighted expectation approach) to improve stability and real-time behavior. Strong in ROS1/ROS2 systems integration and debugging closed-loop manipulation issues via latency instrumentation, QoS-aware distributed messaging, and sim-to-real validation using Gazebo/Unity, Docker, and CI pipelines.

Max Matkovski

Screened

Junior Machine Learning Engineer specializing in data pipelines and applied AI

San Francisco Bay Area, CA · 3y exp
Ontra Mobility · Georgia Tech

Built a production AI agent for phishing fraud detection using n8n orchestration, Claude (Sonnet 4/MCP), VirusTotal, and JavaScript formatting to generate and deliver email-based reports via Gmail. Has experience evaluating detection accuracy against known examples, iterating via feedback, and presenting AI solutions to non-technical teams.

MK

Mid-level Data Analyst specializing in retention, churn, and customer analytics

Chicago, IL · 5y exp
Optum · Northern Illinois University

Analytics professional with experience across healthcare and fintech, including building SQL/Python data pipelines at Optum and owning a fraud detection initiative at Razorpay. Stands out for combining messy-data cleanup, reproducible analytics workflows, and stakeholder-driven metric design, with a reported 25% improvement in fraud detection while keeping false positives under control.

Haider Shah

Screened

Principal AI/ML Architect specializing in GenAI, LLMs, RAG, and Agentic AI

California, USA · 13y exp
Pinecone · Preston University

FinTech/AI engineer who has shipped an end-to-end discrepancy-detection product for financial managers using Next.js, FastAPI/GraphQL, Pinecone, and AWS (with dev/staging/prod, observability, A/B testing, and documentation). Also built an AI-native “AI Genesis” system with agentic cyclic workflows, routing, and tool use, and has experience modernizing legacy systems via the strangler fig pattern while coordinating with senior stakeholders on a 5G autonomous simulation platform.

Priyanshu Maurya

Mid-level Data Scientist specializing in insurance, finance, and healthcare analytics

New York, NY · 3y exp
MetLife · Rowan University

Built and productionized LLM-driven sentiment scoring for earnings call transcripts at Goldman Sachs, replacing legacy NLP to deliver a cleaner trading signal while managing latency/cost via batching, caching, and distilled models. Also implemented an Airflow-orchestrated fraud modeling pipeline at MetLife with drift-based retraining and SageMaker deployment, and has a disciplined evaluation/rollout framework for reliable AI workflows.

Hongxi Chen

Screened

Intern Software Engineer specializing in distributed systems and backend infrastructure

Beijing, China · 0y exp
Chinese Academy of Sciences · University of Nottingham

Backend engineer with deep experience building event-driven logistics systems (orders, warehouse execution, real-time delivery tracking) using Spring Boot/PostgreSQL/Redis and strong observability (Prometheus/Grafana). Led a zero-downtime migration from monolithic MySQL to a sharded architecture for ~2M users with dual-write, checksum validation, and fast auto-rollback, and has strong security expertise including PostgreSQL RLS for multi-tenant SaaS and robust OAuth/JWT handling.

Rakesh Eleti

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

Florida, USA · 4y exp
BlackRock · University of Florida

Healthcare ML/AI engineer at Cigna who has owned a clinical RAG pipeline from prototype through production, monitoring, compliance, and iteration. Stands out for combining LLM product delivery with healthcare-grade safety and explainability, driving a 38% retrieval precision gain, 42% hallucination reduction, and meaningful improvements in team velocity and system reliability.

Yash Vishe

Screened

Junior Software Engineer specializing in LLM systems, data engineering, and ML

San Diego, CA · 2y exp
San Diego Supercomputer Center · UC San Diego

Backend/ML systems engineer with experience at SDSC, UCSD, and Media.net, building production semantic dataset/model discovery using embeddings + Solr KNN and LLM-based intent/reranking at 5M+ dataset scale. Emphasizes offline/online separation for predictable serving; delivered measurable gains (23% retrieval accuracy, 38% latency reduction) and helped secure a $3M+ NSF grant.

David Kidwell

Screened

Senior AI/ML Data Scientist specializing in NLP, computer vision, and MLOps

New York, NY · 10y exp
Canoe Intelligence · Binghamton University

Applied LLMs and a graph-RAG architecture in Neo4j to automate an accounting firm's cross-checking of transactional books against tax regulations, indexing 1,000+ pages into a knowledge graph with vector search. Combines agentic LLM workflows with classical NER (Hugging Face/NLTK) and validates using expert-labeled held-out data plus precision/recall and measured accountant time savings after deployment.

HS

Senior Data Engineer specializing in multi-cloud data platforms and streaming pipelines

4y exp
Northern Trust · University of Texas at Arlington

Data platform engineer with hands-on ownership of high-volume financial data pipelines (millions of transactions/day) on Azure (ADF, Databricks, Delta Lake, Synapse), emphasizing schema-drift protection and automated data-quality gates. Also built resilient web scraping pipelines with anti-bot and backfill strategies, and shipped a versioned FastAPI + Redis data API with autoscaling, testing, and CI/CD via GitHub Actions.

SY

Mid-level Software Engineer specializing in FinTech and Healthcare systems

Arizona, USA · 4y exp
PayPal

Data engineer who has owned end-to-end production pipelines ingesting ~500GB/day from APIs/databases/Kafka into an S3 data lake (Glue/Spark) with Airflow-orchestrated Great Expectations quality gates. Built resilient external data collection systems with idempotent jobs, exponential-backoff retries, raw data capture, and backfills; also shipped Snowflake-backed APIs with caching, versioned endpoints, and backward-compatible data contracts. Led an early-stage Azure data platform build with phased delivery and GitHub Actions CI/CD, resolving schema-mismatch incidents quickly without downstream corruption.

Matt Salomon

Senior Data Scientist specializing in GenAI, LLM systems, and production ML

Los Angeles, CA · 17y exp
Cigna · MIT

Priya Gandhi

Junior Software Engineer specializing in full-stack development and data engineering

1y exp
LexisNexis · North Carolina State University

Krishna Katakam

Senior Data Engineer specializing in cloud data platforms and analytics

Eden Prairie, MN · 5y exp
Optum · University of Texas at Dallas

SK

Mid-level Full-Stack Developer specializing in Python, React, and cloud-native AI microservices

San Francisco, CA · 6y exp
Shopify · Saint Louis University

Jayavibhav Kogundi

Junior AI/ML Engineer specializing in LLMs, RAG, and multimodal agents

Los Angeles, CA · 2y exp
Scale AI · USC

SA

Mid-Level Software Engineer specializing in cloud-native backend and LLM/RAG systems

New York, NY · 3y exp
LOCOMeX · NYU

SM

Mid-level SDET specializing in test automation, API/microservices QA, and cloud-native CI/CD

Jersey City, NJ · 6y exp
Amazon · Leeds Beckett University

MZ

Senior Software Engineer / Solutions Architect specializing in data platforms and AI/LLM integration

Remote · 6y exp
Gruve AI · UC Berkeley

SS

Mid-level ML Engineer specializing in computer vision and robotics

Buffalo, NY · 3y exp
Nissha Medical Technologies · University at Buffalo
