Vetted PySpark Professionals

Pre-screened and vetted.

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

USA | 4y exp
Northern Trust | Syracuse University

Mid-level Data Scientist / AI/ML Engineer specializing in Generative AI and healthcare analytics

Maryland Heights, MO | 4y exp
Kroger | Saint Louis University

Senior GenAI Engineer specializing in LLM agents and insurance automation

West Bend, WI | 5y exp
Coforge | Texas A&M University

Senior Machine Learning Engineer specializing in GenAI, RAG, and NLP

United States | 10y exp
Birlasoft | Drexel University

Mid-level Machine Learning Engineer specializing in MLOps and applied data science

Dallas, TX | 4y exp
Southern Glazer's Wine & Spirits | San José State University

Mid-level Full-Stack Software Engineer specializing in GenAI and SaaS platforms

Harrison, NJ | 5y exp
MetLife | Stevens Institute of Technology

Kush Shah

Screened

Senior Frontend/Full-Stack Engineer specializing in scalable React/Next.js systems

Denver, CO | 7y exp
Amplifire | Texas A&M University

Backend/data engineer who reports building production Python services (FastAPI + JWT) backed by Postgres and Redis, and modernizing data workflows using AWS Glue + PySpark with S3/RDS. States experience delivering AWS solutions (S3, SES, Cognito) and using golden datasets/snapshot testing for migration parity, with many details withheld due to NDAs. Seeking fully remote work with a $300k base salary expectation.


Mid-level Machine Learning Engineer specializing in production ML, forecasting, NLP and computer vision

IL, USA | 4y exp
Cigna | Chicago State University

Built and deployed a production LLM-powered support assistant for customer support agents using a RAG architecture over internal docs and past tickets, with human-in-the-loop review. Demonstrates strong applied LLM engineering focused on real-world constraints (hallucinations, latency, cost) using routing to smaller models, reranking, caching, and rigorous evaluation/monitoring (offline eval sets, A/B tests, KPI tracking).


Mid-level AI/ML Engineer specializing in fraud detection, recommender systems, and forecasting

Remote, USA | 4y exp
Citigroup | University of Dayton

ML engineer/data scientist who built and deployed a real-time fraud detection platform at Citi on AWS SageMaker, processing 3M+ daily transactions and improving fraud response by 28%. Combines unsupervised anomaly detection (autoencoders) with ensemble models (XGBoost/Random Forest) plus Airflow/Step Functions orchestration, drift monitoring, and explainability (SHAP) to keep models reliable and compliant in production.


Shubham Patil

Screened

Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics

New York, NY | 4y exp
Syracuse University | Syracuse University

Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).


Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

USA | 4y exp
State Street | Webster University

Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.


Jay Patel

Screened

Mid-level AI/ML Engineer specializing in NLP, Document AI, and MLOps

USA | 6y exp
State Street | Pace University

ML/LLM engineer with production experience building a RAG-based LLM support assistant (FastAPI, Redis, Kafka) with multi-layer validation and human-in-the-loop feedback loops to improve accuracy over time. Has orchestration and MLOps depth using Airflow and Kubeflow on Kubernetes (autoscaling, alerting, monitoring) and delivered measurable ops impact (40% ticket efficiency improvement) by partnering closely with customer support teams.


Mid-level Data Scientist / ML Engineer specializing in MLOps and Generative AI

Alexandria, Virginia | 3y exp
Schizophrenia & Psychosis Action Alliance | Stony Brook University

Built and deployed an AI agent to help patients navigate complex housing information by scraping and normalizing unstructured data across all 50 U.S. states, then layering a LangChain RAG system with MMR re-ranking to reduce hallucinations. Experienced in orchestrating multi-agent workflows (LangGraph/CrewAI) and production reliability practices (Pydantic-validated outputs, LLM-as-judge evals, tracing). Also delivered stakeholder-facing explainability via SHAP dashboards for a loan-approval predictive model at Welspot.


Rajeev Reddy

Screened

Mid-level AI/ML Engineer specializing in NLP and production ML on cloud

4y exp
The Hartford | Florida Atlantic University

ML engineer/data scientist who deployed a production credit risk + insurance claims triage platform at Hartford Financial, combining XGBoost default prediction with BERT-based document classification. Demonstrated strong MLOps by cutting inference latency to sub-500ms and building drift monitoring plus automated retraining/deployment pipelines (MLflow, CloudWatch, GitHub Actions, SageMaker) with human-in-the-loop review and SHAP-based explainability for underwriting adoption.


Mid-level Data Scientist specializing in cloud ML, MLOps, and predictive analytics

Dallas, TX | 4y exp
UnitedHealth Group | Jawaharlal Nehru Technological University, Hyderabad

NLP/ML engineer with hands-on healthcare and support-ticket text experience, building clinical-note structuring and semantic linking systems using spaCy, BERT clinical embeddings, and FAISS. Emphasizes production-grade delivery (Airflow/Databricks, PySpark, Docker, AWS/FastAPI/Lambda) and rigorous validation via clinician-labeled datasets, retrieval metrics, and user feedback.


Mid-level Full-Stack & ML Engineer specializing in AI SaaS, MLOps, and cloud infrastructure

Edison, NJ | 3y exp
AffirmoAI | NYU

Built and shipped an AI-powered driver ranking/assignment system at AffirmoAI using LLM intent classification + RAG over pgvector/Postgres, served via FastAPI with a React UI that explains scores. Drove measurable improvements through optimization and iteration (latency down to <800ms, adoption 60%→90%+) and implemented rigorous eval loops with dispatcher ground truth plus cold-start handling for new drivers.

Sreeja Reddy Konda

Mid-level AI/ML Engineer specializing in NLP, MLOps, and predictive analytics

Kentwood, MI | 6y exp
Fifth Third Bank | University of Central Missouri

AI/ML Engineer at Fifth Third Bank who has shipped production fraud detection and risk analysis systems combining ML models with LLM-powered insights/explanations, including real-time monitoring, drift detection, and automated retraining under regulatory explainability constraints. Also built a hybrid-retrieval internal knowledge-base QA system (+20% top-5 relevance) and delivered a customer support chatbot that reduced first response time by 30% through strong stakeholder collaboration.

Teja Babu Mandaloju

Mid-level Data Scientist/MLOps Engineer specializing in NLP, GenAI, and cloud ML platforms

Chicago, USA | 5y exp
Vosyn | University of North Texas

AI/ML engineer who led production deployment of a multimodal (text/video/image) RAG system on GCP using Gemini 2.5 + Vertex AI Vector Search, scaling to 10M+ documents with sub-second latency and +40% retrieval accuracy. Strong MLOps/orchestration background (Kubernetes, CI/CD, Airflow, MLflow) with proven impact on reliability (75% fewer incidents) and deployment speed (92% faster), plus experience delivering explainable ML (XGBoost + SHAP + Tableau) to non-technical retail stakeholders.

Sai Pavan

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines

5y exp
American Family Insurance | George Mason University

Built a production, real-time insurance claims document-understanding and fraud-detection pipeline using TensorFlow + fine-tuned BERT, deployed on AWS (SageMaker/Lambda/API Gateway) with automated retraining via MLflow and Jenkins. Addressed noisy documents and latency using augmentation and model distillation (3x faster), cutting claims ops manual review by ~50% and reducing fraudulent payouts.

Allen Saunders

Senior DevOps/Solutions Engineer specializing in CI/CD, cloud platforms, and API integrations

San Francisco, California | 11y exp
SpiderOak | San Francisco State University

Solutions Architect with 5+ years leading pre- and post-sales engagements, focused on taking complex tooling from test/prototype to secure production through a structured discovery-to-deployment approach. Experienced in LLM workflow troubleshooting using tools like Langfuse/Gopher and in developer enablement via concise, hands-on workshops (e.g., Jenkins on Kubernetes at scale). Has navigated internal and external blockers to drive adoption and keep enterprise deals moving (including a Jenkins sale to Love's).

Bhavana Anna

Screened

Mid-level AI/ML Engineer specializing in fraud detection and Generative AI (RAG)

USA | 5y exp
USAA | Kennesaw State University

AI/ML engineer who has shipped production LLM and ML systems, including a RAG pipeline that ingested ~500k insurance/client documents to help adjusters answer questions faster and more consistently. Experienced in handling messy real-world document formats, tuning retrieval/chunking, and reducing latency via vector search optimization, precomputed embeddings, and caching. Also built orchestrated fraud-detection deployment workflows using AWS Step Functions and SageMaker, and partners closely with non-technical operations teams on NLP automation.

Phanideep P

Screened

Senior Data Engineer specializing in cloud lakehouse and streaming data platforms

5y exp
Cadence Bank | Wright State University

Data platform/data engineer with cross-industry experience in banking and healthcare, building cloud-native lakehouse architectures across AWS/Azure/GCP. Has owned high-volume (millions of records; TB/day) pipelines with strong data quality automation (dbt/Great Expectations), observability (Grafana/Prometheus), and real-time streaming (Kafka/Spark) for fraud monitoring; also delivered an early-stage migration from SQL Server to BigQuery with 40% batch latency reduction.

Sai Bandaru

Screened

Mid-level Machine Learning Engineer specializing in fraud detection and LLM systems

Boston, MA | 6y exp
FiVerity | Northeastern University

At FiVerity, built and deployed a production LLM/RAG-based Information Gathering Tool for credit union fraud analysts that generates auditable investigation summaries from verified evidence. Focused on high-stakes constraints (hallucination prevention, cross-entity leakage controls, compliance/PII-safe monitoring, and latency) while also shipping customer-facing agentic workflows using CrewAI and LangGraph in close partnership with fraud and compliance stakeholders.

Rajeshwar Peri

Mid-level Data Analyst specializing in healthcare and financial analytics

Chicago, IL | 5y exp
Elevance Health | Indiana Wesleyan University

Healthcare analytics candidate with hands-on experience turning messy claims and CRM data into validated reporting tables, automating monthly reporting in Python/Airflow, and operationalizing churn metrics in SQL and Tableau. They appear especially strong in stakeholder-aligned metric design and delivered a reported ~10% churn reduction through cohort analysis, segmentation, and at-risk member targeting.
