Vetted Apache Spark Professionals

Pre-screened and vetted.

Aarati Dulal

Screened

Senior Full-Stack Java Engineer specializing in cloud-native microservices

Dallas, TX | 6y exp
Goldman Sachs | Avila University

Backend/platform engineer who owned high-volume Java/Spring Boot microservices on AWS (Kafka + RDS/DynamoDB) and has hands-on experience debugging complex production latency incidents across DB, JVM/GC, and async consumers. Also shipped applied AI features for ops, including an LLM-powered log analysis assistant and an incident-response agent with strong safety guardrails (schema-validated tool use, retries/backoff, and human-in-the-loop escalation).

Haider Shah

Screened

Principal AI/ML Architect specializing in GenAI, LLMs, RAG, and Agentic AI

California, USA | 13y exp
Pinecone | Preston University

FinTech/AI engineer who has shipped an end-to-end discrepancy-detection product for financial managers using Next.js, FastAPI/GraphQL, Pinecone, and AWS (with dev/staging/prod, observability, A/B testing, and documentation). Also built an AI-native “AI Genesis” system with agentic cyclic workflows, routing, and tool use, and has experience modernizing legacy systems via the strangler fig pattern while coordinating with senior stakeholders on a 5G autonomous simulation platform.

Priyanshu Maurya

Mid-level Data Scientist specializing in insurance, finance, and healthcare analytics

New York, NY | 3y exp
MetLife | Rowan University

Built and productionized LLM-driven sentiment scoring for earnings call transcripts at Goldman Sachs, replacing legacy NLP to deliver a cleaner trading signal while managing latency/cost via batching, caching, and distilled models. Also implemented an Airflow-orchestrated fraud modeling pipeline at MetLife with drift-based retraining and SageMaker deployment, and has a disciplined evaluation/rollout framework for reliable AI workflows.

Aigo Madakimova

Senior Data Analyst specializing in audit analytics, automation, and financial data platforms

Malvern, PA | 6y exp
Vanguard | NYU

Full-stack engineer with strong Next.js App Router + TypeScript experience who built and owned a production internal analytics dashboard end-to-end, including server-component data fetching, route handlers for secure proxying, and post-launch monitoring/caching fixes. Also designed Postgres data models and performance-tuned analytics queries, and built reliable BullMQ/Redis-based order-fulfillment workflows with idempotency, retries, and compensating refunds. Comfortable operating with high ownership in early-stage teams.

Satya Pithani

Screened

Mid-level AI/ML Engineer specializing in healthcare and financial analytics

Texas, USA | 4y exp
Oracle Health | University of Texas at Dallas

ML engineer with production experience across healthcare and fraud domains, including end-to-end ownership of a telecare patient deterioration system at Oracle Health and a GPT-4/RAG fraud reporting solution at Cognizant. Stands out for combining scalable data/ML infrastructure, clinical NLP, and GenAI delivery with measurable gains in model quality and workflow efficiency.

Ryan McDowell

Screened

Senior Software Engineer specializing in pricing, marketplaces, and data engineering

Remote | 9y exp
Ballast Point Analytics | University of Chicago

Built and operationalized intelligent pricing infrastructure for live event ticketing at StubHub, emphasizing iterative prototyping with traders and production-grade monitoring (Splunk, API/data-stream thresholding). Also partnered with customer-facing teams to drive adoption and helped win a significant consignment revenue-share deal by demoing the system to the Philadelphia 76ers and quantifying pricing efficacy and business impact.

Yash Vishe

Screened

Junior Software Engineer specializing in LLM systems, data engineering, and ML

San Diego, CA | 2y exp
San Diego Supercomputer Center | UC San Diego

Backend/ML systems engineer with experience at SDSC, UCSD, and Media.net, building production semantic dataset/model discovery using embeddings + Solr KNN and LLM-based intent/reranking at 5M+ dataset scale. Emphasizes offline/online separation for predictable serving, has delivered measurable gains (23% retrieval accuracy, 38% latency reduction) and helped secure a $3M+ NSF grant.

Sayuj Shah

Screened

Mid-level Data Analyst & AI Practitioner specializing in ML, LLMs, and analytics platforms

Schaumburg, IL | 4y exp
U.S. Cellular | Georgia Tech

Data Analyst at U.S. Cellular who built production LLM solutions, including a Tableau-embedded chatbot that converts natural language questions into Oracle SQL and returns actionable KPI insights for non-technical users. Also authored MAD-CTI, a multi-agent LLM system for dark web hacker forum threat intelligence (published in IEEE Access) that outperformed single-agent approaches by 14%.

David Kidwell

Screened

Senior AI/ML Data Scientist specializing in NLP, computer vision, and MLOps

New York, NY | 10y exp
Canoe Intelligence | Binghamton University

Applied LLMs and a graph-RAG architecture in Neo4j to automate an accounting firm's cross-checking of transactional books against tax regulations, indexing 1,000+ pages into a knowledge graph with vector search. Combines agentic LLM workflows with classical NER (Hugging Face/NLTK) and validates using expert-labeled held-out data plus precision/recall and measured accountant time savings after deployment.

RS

Executive Technology Leader specializing in B2C marketplaces, cloud platforms, and AI products

San Francisco, CA | 17y exp
KeyCentrix | Madurai Kamaraj University

20-year technology builder with ~8 years in healthcare AI, currently at KeyCentrix modernizing a legacy pharmacy solutions business. Shipped an OCR MVP within days and delivered a rebate-based product generating ~$50K/month, leveraging Claude/LangGraph agentic automation to replace work typically requiring a much larger engineering team. Developing a "Longevity AI Copilot" B2B platform that synthesizes research, labs, and wearable data into personalized longevity protocols for HNW and corporate wellness markets; concept validated but not yet incorporated or funded.

Jones Pavan

Screened

Director-level Engineering Leader specializing in platform modernization and AI integration

Burbank, CA | 15y exp
BlackLine | California State University, Northridge

Engineering leader from BlackLine who has repeatedly rescued and delivered high-visibility products by resetting roadmaps, tightening execution (better specs/estimation), and accelerating team velocity. Scaled a distributed org from ~20 to ~40 engineers by building a new India team with strong hiring rubrics and governance-as-code/SDLC consistency. Also modernized legacy systems into microservices (Kafka/Kubernetes/Apigee) and drove hackathon-to-production innovation using Google Vertex AI.

Pratik Jaiswal

Mid-level AI/ML Engineer specializing in financial services ML and MLOps

Remote, USA | 4y exp
M&T Bank | University of South Florida

ML engineer/data scientist with M&T Bank experience who built a production reinforcement-learning portfolio analytics tool for wealth management, emphasizing near real-time performance via batch/serving separation and robust generalization through stress-scenario backtesting and RL regularization. Strong MLOps background (Airflow, Grafana, MLflow) and proven ability to drive adoption with non-technical stakeholders using KPI alignment and SHAP-based explanations.

HS

Senior Data Engineer specializing in multi-cloud data platforms and streaming pipelines

4y exp
Northern Trust | University of Texas at Arlington

Data platform engineer with hands-on ownership of high-volume financial data pipelines (millions of transactions/day) on Azure (ADF, Databricks, Delta Lake, Synapse), emphasizing schema-drift protection and automated data-quality gates. Also built resilient web scraping pipelines with anti-bot and backfill strategies, and shipped a versioned FastAPI + Redis data API with autoscaling, testing, and CI/CD via GitHub Actions.

Xiaoai Zhu

Screened

Entry-level Software Engineer specializing in AI and full-stack data systems

Atlanta, GA | 1y exp
Georgia Institute of Technology | Georgia Tech

Backend/AI engineer who has built an offline, citation-grounded RAG system end-to-end with hybrid retrieval, local LLM inference, and quantitative evaluation via RAGAS. Also brings real-time systems experience from an Airbnb-like booking platform and data pipeline/ML quality work from a Bilibili internship, with a strong emphasis on reliability, privacy, and measurable correctness.

Nate Zaidi

Screened

Senior Full-Stack Engineer specializing in Python, AI/ML, and cloud applications

Dumfries, VA | 10y exp
CodingQna | Virginia Commonwealth University

Backend/data engineer with hands-on production experience across FastAPI/PostgreSQL APIs and AWS (Lambda, ECS) delivered via Terraform + GitHub Actions. Built Glue-based ETL pipelines into Redshift with schema evolution and data quality checks, modernized legacy reporting into Python microservices, and has demonstrated measurable SQL performance wins (multi-second query reduced to sub-300ms).

SY

Mid-level Software Engineer specializing in FinTech and Healthcare systems

Arizona, USA | 4y exp
PayPal

Data engineer who has owned end-to-end production pipelines ingesting ~500GB/day from APIs/databases/Kafka into an S3 data lake (Glue/Spark) with Airflow-orchestrated Great Expectations quality gates. Built resilient external data collection systems with idempotent jobs, exponential-backoff retries, raw data capture, and backfills; also shipped Snowflake-backed APIs with caching, versioned endpoints, and backward-compatible data contracts. Led an early-stage Azure data platform build with phased delivery and GitHub Actions CI/CD, resolving schema-mismatch incidents quickly without downstream corruption.

YV

Intern Data Scientist / Software Engineer specializing in ML, computer vision, and cloud

United States | 2y exp
CCC Intelligent Solutions | Johns Hopkins University
PK

Staff Machine Learning Engineer specializing in LLM agents and ML systems

San Francisco, CA | 6y exp
Infosys | Georgia State University
AK

Mid-level Software Engineer specializing in cloud-native microservices and real-time data pipelines

Boston, MA | 4y exp
Cisco | Northeastern University
KM

Junior Full-Stack Software Developer specializing in cloud APIs and data platforms

Calgary, Canada | 2y exp
DV8 Energy | University of Waterloo
DS

Intern Software Engineer specializing in cloud data platforms and full-stack systems

Seattle, WA | 1y exp
Amazon Web Services | Stony Brook University
Matt Salomon

Senior Data Scientist specializing in GenAI, LLM systems, and production ML

Los Angeles, CA | 17y exp
Cigna | MIT
Ying-Han Chen

Intern software engineer specializing in AI, cloud, and full-stack systems

San Mateo, CA | 1y exp
Maxima | Arizona State University