Vetted PySpark Professionals

Pre-screened and vetted.

AA

Aayush Anand

Screened

Intern Full-Stack/Software Engineer specializing in web apps, cloud, and data/ML systems

New York, NY1y exp
The NorthStar GroupNYU

Built and productionized LLM-driven content intelligence/SEO agents for a high-traffic media platform, automating tagging/summarization/metadata with FastAPI + async orchestration and strict JSON-schema outputs. Demonstrated measurable impact (40% faster publishing, +20% organic traffic in 3 months) and strong reliability practices (offline evals, shadow mode, canaries, fallbacks, idempotency, and monitoring).

View profile
SR

Mid-level Data Engineer specializing in cloud ETL/ELT and big data pipelines

Columbus, OH4y exp
Western Alliance BankUniversity of Missouri-Kansas City

Data engineer focused on production-grade pipelines and data services: ingests millions of records/day into S3, performs SQL/Python quality validation and PySpark/SQL transformations, and serves curated datasets via Athena/Redshift. Has experience hardening external data collection with retries/rate-limit handling and shipping versioned internal data APIs with backward compatibility, monitoring, and CI/CD in early-stage environments.

View profile
NS

Mid-level ML Data Engineer specializing in MLOps and scalable healthcare data pipelines

Boston, MA5y exp
CignaNortheastern University

Data/ML platform engineer with healthcare (Cigna) experience owning an end-to-end pipeline spanning Airflow + Debezium CDC ingestion, PySpark/SQL transformations, rigorous data quality gates, and feature-store/API serving for ML training and inference. Worked at 10+ TB scale and cites a ~30% latency reduction plus stronger reliability via idempotent design, monitoring, and backfill-safe reprocessing; also built pragmatic early-stage data pipelines at Frankenbuild Ventures.

View profile
Keerthi Kalluri - Senior Full-Stack & GenAI Engineer specializing in healthcare and financial services

Senior Full-Stack & GenAI Engineer specializing in healthcare and financial services

6y exp
Kaiser PermanenteTexas Tech University

Built and deployed a production LLM-powered customer support assistant using a RAG backend in Python, focused on deflecting repetitive Tier-1 tickets and reducing resolution time. Demonstrates strong production engineering instincts around reliability (confidence scoring + human fallback), scalability/cost optimization (multi-stage pipelines), and workflow orchestration/observability (LangChain, custom DAGs, structured logging, step metrics).

View profile
Shweta Gupta - Senior Backend Software Engineer specializing in Java microservices, Kafka, and AWS in Seattle, WA

Shweta Gupta

Screened

Senior Backend Software Engineer specializing in Java microservices, Kafka, and AWS

Seattle, WA6y exp
EasyBee AIUC Irvine

AI engineer who shipped a production chat assistant for a storage company by building the underlying RAG-style knowledge base (document ingestion, chunking/embeddings, FAISS vector store) and an admin update interface to keep content current. Also has full-stack delivery experience (Python REST APIs + React/TypeScript UI) and AWS operations using Terraform/Jenkins, including handling a real production performance incident by optimizing DB queries and adding auto-scaling.

View profile
Nikitha Margadi - Mid-level Data Engineer specializing in cloud lakehouse, streaming, and MLOps in Texas, USA

Mid-level Data Engineer specializing in cloud lakehouse, streaming, and MLOps

Texas, USA5y exp
AT&TCal State Fullerton

Data engineer at AT&T focused on large-scale telecom (5G/IoT) data platforms, owning end-to-end pipelines from Kafka/Azure ingestion through Databricks/Delta Lake transformations to serving analytics and ML. Has operated at very high volumes (~50+ TB/day) and delivered measurable performance gains (25–30% faster processing) plus improved reliability via Airflow monitoring, robust data quality checks, and resilient external data collection patterns (rate limiting, retries, dynamic schemas).

View profile
Krishnamraju Penumatsa - Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines in Fort Worth, TX

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Fort Worth, TX6y exp
American AirlinesUniversity of North Texas

Data engineer currently at American Airlines who built and owned end-to-end flight operations and booking data pipelines (batch + real-time) using Azure Data Factory, Kafka, Spark/Databricks, Synapse, and Snowflake—processing hundreds of GBs/day. Strong focus on reliability and data quality (idempotency, checkpointing, retries, validation/alerts) and delivered near-real-time analytics powering Power BI dashboards; previously helped stand up an early-stage data platform at Sysco on AWS (Glue/S3/Redshift) with Airflow and Jenkins CI/CD.

View profile
LN

Junior Data Analyst specializing in analytics, BI, and machine learning

College Park, MD1y exp
USA TODAYUniversity of Maryland, College Park

Analytics-focused candidate with experience owning end-to-end data projects across AI transcription, retail forecasting, and transportation revenue analytics. They combine strong SQL/Python pipeline skills with dashboarding and stakeholder alignment, citing measurable impact including 60% lower ETL latency, 18% better forecast accuracy, and 25% operational efficiency gains.

View profile
AM

Senior Machine Learning Engineer specializing in conversational AI and healthcare ML

Chicago, IL5y exp
OptumUniversity of Illinois Chicago

ML/AI engineer focused on taking LLM products from experiment to production, with hands-on ownership of a RAG-based customer support system that improved response quality by 35% and cut latency by 30%. Stands out for combining product impact with production rigor across retrieval tuning, safety guardrails, monitoring, and reusable Python/FastAPI services that accelerated adoption across teams.

View profile
JN

Mid-level Software Engineer specializing in AI backend and LLM systems

Texas, USA4y exp
Encando AITexas A&M University

Founding engineer at an edtech startup who combines hands-on engineering leadership with advanced AI-native development workflows. They’ve built an AI grading pipeline and a multi-agent SDLC tool, and stand out for treating AI agents like an engineering team with planning, parallel execution, QA, and rigorous validation.

View profile
SL

Mid-level Data Scientist specializing in experimentation, NLP, and ML

USA4y exp
Capital OneUniversity of Memphis

Data science and AI professional with Capital One experience building churn prediction and GenAI-powered document intelligence solutions. Stands out for pairing hands-on technical depth in NLP, LLMs, and analytics with strong business communication, including driving adoption across teams and contributing to a 25% reduction in customer churn.

View profile
JK

Mid-level Machine Learning & GenAI Engineer specializing in LLMs, RAG, and NLP

New York, NY6y exp
Morgan Stanley

Built and deployed an LLM-powered customer support assistant (“Notable Assistant”) focused on automating common post-customer queries while maintaining multi-turn context and meeting scalability/latency needs. Experienced with production orchestration and operations using Kubernetes and Apache Airflow (DAG-based ETL, scheduling, monitoring/alerts), and has partnered closely with customer service stakeholders to align chatbot behavior with brand voice through iterative testing.

View profile
BT

Bharath TVS

Screened

Senior Data Scientist specializing in NLP, LLMs, and Computer Vision

Westlake, OH7y exp
KeyBank

Applied NLP/ML engineer with experience at KeyBank and Novartis building production document intelligence and entity-resolution systems in finance and healthcare. Has delivered end-to-end pipelines (Airflow + AWS) using transformers (DistilBERT/Sentence-BERT), vector search (FAISS/Milvus/Pinecone), and human-in-the-loop labeling to achieve measurable gains (40%+ faster queries; up to 88% F1 and 93% precision/90% recall in entity linking).

View profile
Janvitha Mandyam - Mid-level AI/ML Engineer specializing in Generative AI and NLP in Chicago, IL

Mid-level AI/ML Engineer specializing in Generative AI and NLP

Chicago, IL4y exp
Citibank

GenAI/LLMOps practitioner who deployed a production RAG-based customer service and knowledge retrieval system for a global bank using LangChain, FAISS/Azure Cognitive Search, GPT-4/Claude, and Guardrails—driving a reported 35% Q&A accuracy lift while reducing handle time and escalations. Also partnered with non-technical leaders at CVS Health to deliver ML-driven supply chain risk and inventory insights via anomaly detection, NLG summaries, and stakeholder-friendly dashboards.

View profile
RL

Ramya Latha

Screened

Senior AI/ML & Data Engineer specializing in Generative AI and RAG systems

Birmingham, AL8y exp
Regions Bank

GenAI/RAG engineer who has deployed a production policy/regulatory search assistant for a financial client using LangChain + Vertex AI, FastAPI, Docker/Kubernetes, and Airflow-orchestrated data pipelines. Demonstrated measurable impact with 50–60% latency reduction and 70% fewer pipeline failures, plus KPI-driven grounding evaluation (90%+ target) and strong cross-functional collaboration with compliance/business teams.

View profile
DP

Dhrumil Patel

Screened

Mid-level AI/ML Engineer specializing in Generative AI and NLP

Boston, MA5y exp
TD Bank

Built an end-to-end GenAI underwriting copilot at TD Bank for complex financial documents, combining RoBERTa-based risk classification with Azure OpenAI RAG to deliver grounded, citation-based insights. Drove a 40-50% reduction in manual underwriting review time and created reusable FastAPI ML services that cut integration effort for other teams by 30-40%.

View profile
TK

Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps

Dallas, USA4y exp
AT&TSaint Louis University
View profile
SK

Mid-Level Software Engineer specializing in backend, cloud, and AI/LLM systems

TX, USA3y exp
ServiceNowUniversity of Texas at Arlington
View profile
VC

Mid-level AI/ML Engineer specializing in Generative AI and LLM solutions

Fort Lauderdale, FL4y exp
U.S. BankFlorida Atlantic University
View profile
AK

Senior Machine Learning Engineer specializing in agentic systems, RAG, and edge AI

Plano, TX7y exp
SonicsterUniversity of Texas at Arlington
View profile
BR

Mid-level Data Engineer specializing in financial risk, compliance, and real-time streaming

Remote, USA4y exp
LTIMindtreeConcordia University, St. Paul
View profile
GA

Mid-level Data Engineer specializing in cloud ETL and data platforms (AWS/Azure)

USA, USA5y exp
Liberty MutualIndiana State University
View profile
Shuvam Chatterjee - Mid-level AI/ML Engineer specializing in NLP, recommender systems, and Generative AI in Remote, USA

Mid-level AI/ML Engineer specializing in NLP, recommender systems, and Generative AI

Remote, USA5y exp
Allianz LifeUniversity at Buffalo
View profile
Sai Rakesh Penumetcha - Mid-level GenAI/ML Engineer specializing in LLMs, NLP, and RAG in USA

Mid-level GenAI/ML Engineer specializing in LLMs, NLP, and RAG

USA3y exp
CitigroupGeorge Mason University
View profile

Need someone specific?

AI Search