Vetted PySpark Professionals

Pre-screened and vetted.

JS

Principal Data Scientist specializing in LLMs, RAG, and enterprise AI products

Winchester, TN | 9y exp
SambaNova | Sewanee: The University of the South

KP

Mid-level Data Engineer specializing in GCP, Spark, and healthcare analytics

New York, NY | 3y exp
CVS Health | Columbia University

AA

Senior AI/ML Engineer specializing in LLMs and enterprise conversational AI

Northbrook, IL | 16y exp
CVS Health | University of Illinois Chicago

RD

Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps

USA | 4y exp
Scale AI | University of Texas at Arlington

MP

Senior Software Engineer specializing in backend data platforms for FinTech

Irving, TX | 9y exp
Cottonwood Financial | University of Texas at Austin

AH

Senior Full-Stack Engineer specializing in backend, cloud, and AI systems

New York, NY | 8y exp
Istream Solution

Adithya Rajendra

Screened | References | Strong rec.

Junior Data Engineer specializing in Azure data platforms and GenAI analytics

Bengaluru, India | 1y exp
ZEISS | UC Irvine

Data/ML practitioner with experience spanning medical imaging (retinal vessel analysis for hypertension/CVD risk prediction) and enterprise data engineering at Carl Zeiss. Built large-scale SAP data cleaning/validation pipelines (10M+ daily records, ~99% accuracy) and RAG-based semantic search with LangChain/vector DBs that cut manual querying by 82%, plus automation that reduced data onboarding from 8 hours to 12 minutes.


Neha Kolan

Screened

Mid-Level Software Engineer specializing in microservices and cloud data pipelines

Texas, USA | 4y exp
Cigna | University of North Texas

Full-stack engineer with end-to-end ownership across React/TypeScript frontends, Spring Boot/Node microservices, and production ops on Docker/Kubernetes and AWS (ECS/CloudWatch). Built real-time healthcare eligibility and analytics systems at Cigna and an early-stage seller onboarding platform at Flipkart, driving measurable performance gains (35–40% latency/throughput improvements) through event-driven Kafka pipelines, Redis caching, and strong reliability/observability practices.


Anurag Patil

Screened

Mid-level Data Analyst specializing in machine learning, ETL, and real-world evidence analytics

California, USA | 6y exp
AbbVie | UC Irvine

Developed and productionized an AI-driven "indication finding" system for AbbVie to identify additional diseases a drug could target, working closely with clinical research teams on cohort inclusion/exclusion criteria and disease rollups. Leveraged an LLM to map clinical inputs to ICD codes and built configuration-driven ML pipelines (Cloudera ML, YAML, scheduled jobs) with structured testing and evaluation for reliability.


Shravya M

Screened

Senior AI/ML Engineer specializing in NLP, LLMs, and MLOps

Texas, USA | 6y exp
CVS Health | University of North Texas

LLM/agent workflow engineer with healthcare experience (CVS Health) who built and deployed a production call-insights platform using Azure OpenAI + LangChain/LangGraph, including sentiment and compliance checks. Demonstrates deep HIPAA/PHI handling (tenant-contained processing, redaction, RBAC/encryption/audit logging) and production rigor (testing, eval sets, validation/retries, autoscaling) to scale to thousands of transcripts.

KA

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and multimodal modeling

Ann Arbor, USA | 3y exp
University of Michigan | University of Michigan

Built and productionized a telecom-focused RAG assistant by LoRA fine-tuning LLaMA-2 and integrating LangChain+FAISS behind a FastAPI service, with dashboards and a human feedback UI for engineers. Demonstrated measurable impact (≈40% faster document lookup, +8–10% retrieval precision) and strong MLOps rigor via Airflow orchestration, CI/CD, and monitoring for drift and failures.

SK

Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications

Dallas, TX | 5y exp
Baylor Scott & White | University of North Texas

Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).

SR

Mid-level AI/ML Engineer specializing in deep learning, NLP/LLMs, and MLOps

MA, USA | 6y exp
Flatiron Health | Clark University

Built and shipped a real-time oncology risk prediction system used by doctors during patient visits, trained on clinical data in AWS SageMaker and deployed via FastAPI with sub-second responses. Emphasizes clinician-trust features (SHAP explainability, validation checks) and HIPAA-compliant controls (encryption, RBAC, audit logging), plus Kubernetes-based production operations with autoscaling, monitoring, and drift/retraining workflows; collaborated closely with oncologists at Flatiron Health.


Bharath Kumar

Screened

Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps

Draper, UT | 12y exp
Thorne | Bharathiar University

ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.

RW

Principal Data Scientist specializing in NLP and Generative AI

Chicago, IL | 9y exp
Witmer Consulting Corporation | Georgetown University

ML/NLP practitioner with experience building an embedding-based ad matching and search system at Vericast (BERT embeddings + similarity search) to replace a third-party taxonomy approach, evaluated via a human-curated gold standard. Also built a custom NER pipeline at Allstate for auto accident claims calls using a bidirectional LSTM and achieved 90%+ F1, with a strong emphasis on production-grade ML workflows (testing, CI/CD, orchestration, versioning, validation).

RG

Mid-level GenAI Engineer specializing in production RAG and LLM fine-tuning

San Jose, California | 5y exp
eBay | Texas Tech University

LLM engineer who built a production seller-support RAG system at eBay using hybrid retrieval (BM25 + Pinecone vectors) with Cohere reranking, LangGraph orchestration, and citation-grounded answers. Strong focus on reliability: semantic/structure-aware chunking, automated Ragas-based evaluation with nightly regressions, and production observability (LangSmith) plus drift monitoring (Arize). Also implemented a multi-agent fraud pipeline with AutoGen using JSON-schema contracts and explicit termination conditions.


Yupeng Tang

Screened

Junior Machine Learning Engineer specializing in LLM systems and GPU inference

Atlanta, GA | 1y exp
GMI Cloud | Georgia Tech

LLM/agent engineer who shipped a production RAG-based recommendation + explanation system that replaced a traditional recommender stack, delivering ~20% CTR lift (and +8% after a reliability iteration) with strong cold-start performance. Demonstrates strong production rigor: schema-constrained generation, typed tool calling, explicit state/orchestration, deep monitoring/feedback loops, and safe integration with messy ERP inventory/order data using normalization, idempotency, and conflict-resolution guardrails.

Anvith Reddy Dodda

Mid-level AI Engineer specializing in GenAI, NLP, and MLOps

Remote, USA | 3y exp
PayPal | University of Central Missouri

LLM/agentic-systems engineer with PayPal experience hardening an LLM-powered fraud support assistant from prototype to production, focusing on low-latency distributed architecture, rigorous evaluation/testing, and security/compliance. Comfortable in customer-facing and GTM contexts—runs technical demos/workshops, builds tailored pilots, and aligns sales/CS with engineering to close deals and drive adoption.

Mounika Gunturu

Senior Python Full-Stack Developer specializing in cloud-native microservices and data platforms

New York, NY | 9y exp
Oliver Wyman | Narayanamma Institute of Technology and Science

Backend/data engineer from Oliver Wyman who built and ran production Python (FastAPI) services on AWS (ECS/Lambda/API Gateway) supporting risk modeling and regulatory reporting. Strong in reliability/observability, Glue-based ETL with data quality controls, and legacy SAS-to-Python modernization with rigorous parity validation; also demonstrated measurable SQL performance wins and cost-control improvements in serverless scaling. Based in Raleigh, NC and can travel onsite for important Bethesda-area meetings.

Silin Liu

Screened

Mid-level AI/ML Engineer specializing in LLM agents, RAG, and enterprise ML systems

New York City, NY | 5y exp
Metropolitan Transportation Authority | Stevens Institute of Technology

Built a production multi-agent recommendation/RAG system for internal data analysts to speed up weekly report creation by improving document discovery and automating report/SQL generation. Implemented LangGraph-based orchestration with deterministic agent routing, robust error handling (interrupt/resume), and metadata-driven semantic chunking for diverse PDF/document formats, plus monitoring for latency, throughput, and token/cost efficiency.

Swathi Sankaran

Senior Python Full-Stack Developer specializing in cloud, data engineering, and ML/GenAI

New York, NY | 10y exp
East West Bank | Jawaharlal Nehru Technological University

Backend/data engineer with hands-on production experience building FastAPI services on AWS and implementing strong reliability/observability (CloudWatch, ELK, correlation IDs, alarms). Has delivered serverless + container solutions with IaC (CloudFormation/Terraform) and Jenkins CI/CD, and built AWS Glue/PySpark pipelines into S3/Redshift with schema-evolution and data-quality safeguards; demonstrated large-scale SQL tuning (45 min to 3 min on a 500M-row workload).

Shravya Shashidhar

Intern Software Engineer specializing in LLM agents and full-stack development

Seattle, USA | 1y exp
Unwind AI | USC

Embedded C++ engineer with Bosch automotive infotainment experience, owning real-time audio middleware modules with strict latency/memory constraints. Strong in profiling/optimizing deterministic behavior, debugging hardware-specific intermittent issues, and building automated test + CI pipelines; currently ramping up on ROS2 concepts (DDS, nodes/topics/services) to transition toward robotics.


Ansh Bajaj

Screened

Senior Data Engineer specializing in cloud analytics and data modernization

Los Angeles, CA | 9y exp
Deloitte | University of the Cumberlands

Data engineer with hands-on experience delivering production data and AI systems, including an AWS-based real-time data platform for a financial client at Deloitte and a production RAG workflow that cut manual search time by 40%. Combines strong data engineering depth with practical LLM governance, incident debugging, and stakeholder management across business and risk/compliance teams.

Aarushi Mahajan

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps

New York, USA | 4y exp
Intuit | University of Massachusetts Amherst

Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.
