Vetted PySpark Professionals

Pre-screened and vetted.

Anuj Bubna

Screened

Senior DevOps/SRE Engineer specializing in cloud automation, reliability, and data pipelines

10y exp
Intuit
University of Texas at Dallas

Hands-on technical professional experienced in taking LLM/AI-adjacent integrations from prototype to production, using customer observation to refine UX and uncover edge cases. Diagnoses workflow issues in real time using logs and Sankey-style workflow analysis, and communicates fixes with clear short/long-term plans plus proactive alerting. Also partners cross-functionally to drive adoption and cost savings, including a POC around IBM Sterling Integrator that reduced licensing costs by $30K/year.

Prem Kumar

Screened

Senior Data Engineer specializing in cloud data platforms and regulated analytics

McLean, VA
6y exp
Capital One
Rowan University

Data engineer at Capital One building AWS-based real-time and batch pipelines and backend data services for financial/fraud use cases. Has owned end-to-end pipelines processing millions of records/day, implemented dbt/Great Expectations quality gates, and tuned Redshift/Snowflake workloads (cutting query latency ~22–25% and reducing pipeline failures ~30–40%) while supporting 15+ downstream consumers.

SB

Mid-level Data Engineer specializing in cloud data platforms and big data pipelines

5y exp
Molina Healthcare
University of Michigan-Dearborn

Healthcare data engineer with hands-on ownership of claims/member data pipelines on a cloud analytics platform, spanning batch and streaming ingestion (Airflow/Kafka/Spark/Databricks) through serving for reporting. Emphasizes reliability and data quality via embedded validation, schema-drift detection, deduplication, and operational monitoring/incident response, plus pragmatic CI/CD and observability setup in early-stage/ambiguous projects.

Pavan Kumar Malasani

Mid-level AI/ML Engineer specializing in financial risk, fraud detection, and GenAI

Remote, USA
4y exp
Citigroup
University of Colorado Boulder

GenAI/ML engineer in Citigroup’s finance environment who has deployed production RAG systems for investment banking under strict privacy and model-risk constraints. Built an internal-VPC Llama2 + Pinecone + LangChain solution with NER redaction and citation-based verification to prevent hallucinations, delivering major time savings, and also partnered with global finance executives to ship an AI early-warning indicator for treasury/liquidity risk.

Akashreddy Madduri

Senior Backend Engineer specializing in real-time data platforms for FinTech and Healthcare

Plano, Texas
6y exp
JPMorgan Chase
Northern Arizona University

Backend/data engineer with experience at JPMorgan building near real-time payment risk and fraud scoring pipelines using Python, Spark Structured Streaming, and Delta Lake, emphasizing auditability, security, and data correctness (dedupe/late events) to reduce false positives. Also led a legacy-to-cloud migration of claims/eligibility data at Cogna with parallel runs, phased rollout, and healthcare-specific validation (ICD-CPT mapping).

Krishna Kodur

Screened

Mid-level Robotics & AI Researcher specializing in human-robot interaction and reinforcement learning

Santa Clara, CA
8y exp
AMD
Santa Clara University

Robotics software engineer who built an end-to-end mobile manipulation platform (Franka Panda on a Clearpath Ridgeback) for a simulated-kitchen human-robot interaction study with natural speech commands, implemented in Python/ROS. Has hands-on experience integrating diverse sensors (RealSense, LiDAR, biosignals) with deep learning frameworks (PyTorch, Hugging Face) and fine-tuning GPT-Neo, plus simulation (Gazebo) and modern deployment practices (Docker/Kubernetes, CI/CD).

Pandari G

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

San Francisco, USA
5y exp
Sephora
Saint Mary's College of California

GenAI/LLM engineer with production deployments in both fintech and retail: built an AI-powered mortgage document analysis/automated underwriting pipeline at Fannie Mae (OCR + custom LLM) cutting underwriting review from 3–4 hours to under an hour with privacy-by-design controls. Also helped build Sephora’s GenAI product advisory bot using LangChain-orchestrated RAG (Azure GPT-4, Azure AI Search, MySQL HeatWave vector search), focusing on grounding, evaluation, and compliance-aware architecture choices.

Sushma Mangalampati

Mid-level Data Engineer specializing in lakehouse ETL and analytics engineering

Boston, MA
6y exp
ServiceNow
Northeastern University

Data engineer with strong end-to-end ownership of production lakehouse pipelines (Snowflake + Databricks + Airflow + dbt + Great Expectations), handling 8M+ records/month and 500K+ daily CDC updates. Delivered measurable reliability and efficiency gains (41% cost reduction, freshness improved from 4h to 30m, 35% fewer downstream incidents) and has experience building a lakehouse platform from scratch across 12 source systems.

Zhiwen Zhao

Screened

Junior Data Engineer specializing in cloud ETL and big data platforms

New York, NY
3y exp
Bank of China
NYU

Data engineer focused on transit/transportation datasets, building Spark-based pipelines that ingest from Oracle/APIs, apply PySpark data-quality fixes, and publish star-schema fact tables to Azure Data Lake. Experienced troubleshooting complex Spark failures (using checkpointing to manage long lineage) and operating Airflow-driven backfills and GitLab CI deployments for production DAGs.

Yash Rangucha

Screened

Mid-level Software Engineer specializing in backend microservices and real-time streaming

Illinois, USA
4y exp
ServiceNow
Illinois Institute of Technology

Built and owned an end-to-end LLM-powered enterprise retrieval pipeline at ServiceNow, spanning ingestion of structured/semi-structured sources through vector retrieval and real-time API serving. Focused heavily on reliability and quality (multi-stage validation, monitoring, evaluation pipelines) while also driving performance improvements (~35% faster responses) via caching, async processing, and SQL/query optimization.

YV

Mid-level Data Analyst specializing in banking and product analytics

Memphis, TN
4y exp
Bank of America
University of Memphis

Analytics engineer/data analyst with Bank of America experience turning fragmented financial data across SQL Server, PostgreSQL, Kafka, and flat files into trusted Snowflake/dbt reporting models. Stands out for unifying disputed business definitions like churn and payment success rate, automating manual analysis in Python, and pairing strong data quality rigor with stakeholder adoption through self-service dashboards.

Preeti Pandey

Screened

Senior AI/ML Engineer specializing in predictive analytics and NLP

Birmingham, AL
10y exp
Blue Cross and Blue Shield of Alabama
Liverpool John Moores University

ML/AI engineer with hands-on experience building production healthcare AI systems across predictive modeling and GenAI. They built an end-to-end patient risk prediction platform and a RAG-based clinical summarization feature, combining strong NLP/LLM skills with AWS deployment, monitoring, drift detection, and reusable Python service design to deliver measurable clinical and operational impact.

HS

Mid-level AI/ML Engineer specializing in scalable ML, NLP, and MLOps

USA
5y exp
Cisco
University of North Texas

ML/AI engineer with strong production depth across classical ML, MLOps, LLM/RAG, and scalable Python data platforms, with experience at Cisco and Accenture. Stands out for tying technical decisions to measurable business outcomes, including $1.2M annual savings, 40% faster support resolution, and broad internal adoption of shared engineering frameworks.

SV

Mid-level AI/ML Engineer specializing in cybersecurity and fraud analytics

USA
4y exp
Accenture
University of Massachusetts Lowell

AI/ML engineer with production experience across both classical ML and Generative AI, including a real-time banking fraud detection platform at Deloitte and a RAG-based cybersecurity threat analysis feature at Accenture. Stands out for owning systems end-to-end—from feature pipelines and model tuning through deployment, monitoring, retraining, and API/platform reliability—with measurable impact on fraud accuracy, false positives, and SOC analyst efficiency.

Abhinita Sanabada

Senior Software Engineer specializing in AI/ML systems and FinTech platforms

San Jose, CA
9y exp
Wells Fargo
San Jose State University

Master’s student in Data Science at San Jose State University with prior software engineering experience at JPMorgan Chase and Zap Labs. She combines enterprise backend reliability work in financial systems with hands-on full-stack AI workflow projects, including a recruiting automation system built with React/Next.js, FastAPI/Node, Kafka, and WebSockets, with a strong emphasis on observability, human-in-the-loop controls, and maintainability.

Harshal Sawant

Senior AI Engineer specializing in LLMs, RAG, and MLOps on multi-cloud

8y exp
Wells Fargo

Built and productionized a secure internal RAG-based AI assistant (LangChain/FastAPI/FAISS on GCP), tackling real-world issues like latency, retrieval speed, and hallucinations, and delivering 25% faster retrieval and 99.9% uptime. Also implemented scalable, reliable ML retraining orchestration with AWS Step Functions/SageMaker/Lambda, and works closely with compliance analysts to iteratively refine prompts and outputs to meet governance standards.

SG

Senior Data Scientist specializing in GenAI, fraud/credit risk, and cloud MLOps

Chicago, IL
5y exp
Bankers Life
Northern Illinois University

GA

Mid-level AI/ML Engineer specializing in fraud detection and Generative AI

St. Louis, MO
6y exp
PNC
Southeast Missouri State University

HD

Mid-level Data Engineer specializing in scalable ETL pipelines and data quality automation

USA
6y exp
Centene
Purdue University

Rahul Chowdary Kolla

Mid-Level Software Engineer specializing in microservices, cloud, and machine learning

Little Rock, AR
3y exp
JPMorgan Chase
University of Arkansas

DP

Mid-level Data Analyst specializing in analytics, machine learning, and financial services

San Francisco, CA
6y exp
JPMorgan Chase
Arizona State University

RR

Mid-level GenAI Engineer specializing in LLM, RAG, and ML for finance and healthcare

Milwaukee, WI
7y exp
Bank of America
University of Wisconsin–Milwaukee

BK

Senior Data Scientist specializing in Generative AI, LLMs, and insurance analytics

Boston, MA
20y exp
John Hancock
Texas A&M University

CC

Mid-level AI/ML Engineer specializing in MLOps, NLP, computer vision, and Generative AI

United States
7y exp
McKinsey & Company
University of Central Missouri
