Vetted PySpark Professionals

Pre-screened and vetted.

Harikiran Jangam - Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG systems in California, USA

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG systems

California, USA3y exp
McKessonCalifornia Lutheran University

Backend engineer who built and evolved a PHI-compliant RAG system (FastAPI + LangChain + embeddings/FAISS) for internal document search and summarization, delivering <400ms p95 latency at ~2,500 daily requests and measurable impact (30% faster investigations, +17% retrieval relevance). Demonstrates strong security and rollout discipline (RBAC/RLS/JWT, redaction/audits, shadow mode, dual writes, canaries) and a focus on reducing hallucination risk via grounded guardrails and confidence-based fallbacks.

View profile
Ketan Verma - Junior Applied AI Engineer specializing in data pipelines and ML systems in College Station, TX

Ketan Verma

Screened

Junior Applied AI Engineer specializing in data pipelines and ML systems

College Station, TX2y exp
ElysiTexas A&M University

Built an end-to-end wafer-data anomaly detection and reporting system at Samsung using PySpark, Random Forest models, SQL, and Grafana to help engineers track faults and take corrective action. Also has strong UX prototyping and validation practices in Figma plus hands-on front-end/full-stack experience (HTML/CSS/TypeScript), including a student project recognized as best design out of 25 teams, and early-stage startup experience pivoting a product based on user interviews into a real-time in-context feedback overlay.

View profile
Abhishek Soni - Mid-level Full-Stack Developer specializing in React and scalable web applications in Mumbai, India

Abhishek Soni

Screened

Mid-level Full-Stack Developer specializing in React and scalable web applications

Mumbai, India3y exp
Taurus TechnologiesDr. A. P. J. Abdul Kalam Technical University

Backend/data engineer with hands-on production experience across FastAPI microservices and AWS data platforms. Has delivered serverless and Glue/EMR-based ETL pipelines with strong observability (Prometheus/Grafana/Sentry, CloudWatch/SNS), schema-evolution resilience, and measurable SQL performance wins (5 min to <30 sec). Open to onsite meetings in the Bethesda, MD area and flexible on remote arrangements.

View profile
Nisarg Shah - Junior Machine Learning Engineer specializing in geospatial analytics and computer vision in Tempe, Arizona

Nisarg Shah

Screened

Junior Machine Learning Engineer specializing in geospatial analytics and computer vision

Tempe, Arizona1y exp
Arizona State UniversityArizona State University

Built and evolved a geospatial ETL + API platform that processes pixel-wise satellite imagery in PostgreSQL/PostGIS into low-latency farm-level time-series metrics for an interactive dashboard, using precomputed hotspot analysis to reduce latency by 75–80%. Experienced in FastAPI-style API contract design (OpenAPI), caching, server-side filtering/compression, and production-minded security patterns (RBAC, session-derived authorization, password hashing) with disciplined rollback/versioning practices.

View profile
Pranita Agrawal - Mid-level Software Engineer specializing in Java microservices and AWS in California, USA

Mid-level Software Engineer specializing in Java microservices and AWS

California, USA5y exp
City of ModestoWayne State University

TypeScript backend/full-stack engineer who owned an internal business workflow platform end-to-end in production, including API/data design, relational DB integration, and enterprise integrations. Has hands-on experience operating workflow processing services with Kafka-style event-driven patterns, idempotency, exponential backoff retries, dead-letter queues, and strong observability, plus API design with OpenAPI/Swagger and token-based auth.

View profile
Prasanth Sai - Mid-level Data Engineer specializing in cloud lakehouse/warehouse pipelines

Prasanth Sai

Screened

Mid-level Data Engineer specializing in cloud lakehouse/warehouse pipelines

4y exp
Wells FargoChristian Brothers University

Data engineer with HCA Healthcare experience building and operating end-to-end AWS-based pipelines for clinical and operational reporting (50–100 GB/day), serving curated data into Redshift/Snowflake for Power BI/Tableau. Emphasizes production reliability (Airflow SLAs/retries/alerting, logging/observability) and strong data quality controls (reconciliations, schema/null/duplicate checks), and has shipped versioned REST APIs to expose warehouse data to downstream systems.

View profile
Shashank R - Senior Data Engineer specializing in cloud data platforms and real-time analytics in Las Vegas, NV

Shashank R

Screened

Senior Data Engineer specializing in cloud data platforms and real-time analytics

Las Vegas, NV6y exp
Credit One BankUniversity of North Texas

Data engineer (Credit One) who built and owned real-time financial transaction and credit risk/fraud data systems end-to-end on AWS + Snowflake. Delivered high-scale pipelines (150k events/hour; ~2TB/week), raised data accuracy to 99%, and cut Snowflake costs 42% while adding strong observability, schema-drift handling, and production-grade APIs/documentation.

View profile
EP

Mid-level Data Engineer specializing in cloud lakehouse platforms and ETL/ELT

Charlotte, NC4y exp
AccentureUniversity of North Carolina at Charlotte

Accenture data engineer who greenfielded a supply-chain lakehouse platform, building an end-to-end medallion/Delta pipeline ingesting ~1.4TB/day from 17+ ERP/WMS/TMS/shipment sources. Delivered Gold datasets to Redshift/Synapse/Databricks SQL powering Power BI/Tableau with a 99.5% SLA, while cutting runtime 30% and cloud costs 16% through Spark/Delta optimizations and robust data quality controls.

View profile
Ashrita Mishra - Mid-level Data Analyst specializing in analytics, ETL, and cloud data platforms in Jersey City, NJ

Mid-level Data Analyst specializing in analytics, ETL, and cloud data platforms

Jersey City, NJ4y exp
CitigroupPace University

Data analyst with 4 years of experience spanning banking and retail/marketing analytics. Has hands-on experience building churn analytics pipelines in SQL and Python, optimizing large-query performance, and turning stakeholder-aligned metrics into recurring dashboards and business actions.

View profile
Shashwat Negi - Mid-level Software Engineer specializing in AI/ML and full-stack systems in San Jose, CA

Shashwat Negi

Screened

Mid-level Software Engineer specializing in AI/ML and full-stack systems

San Jose, CA3y exp
InfrrdUniversity of Wisconsin–Madison

Data Scientist (2–3 years) at ZS Associates who has built and productionized agentic LLM systems, including a LangGraph-based multi-LLM prompt-optimization pipeline for entity extraction deployed as a Spring Boot microservice via Jenkins. Also built an Insightmate.ai chatbot and improved its RAG accuracy by diagnosing vector retrieval issues and implementing HyDE query expansion, while partnering with sales and pharma stakeholders to drive adoption (e.g., Zimmer Biomet platform migration into a multi-year partnership).

View profile
YA

Mid-level Data Analyst specializing in BI, analytics automation, and cloud data platforms

Charlotte, NC4y exp
SkyWest AirlinesUniversity of North Carolina at Charlotte

Analytics professional with hands-on experience building SQL/Python pipelines, customer ID mapping logic, and self-serve BI dashboards across marketing/CRM and regulated aviation reporting environments. Particularly strong in turning messy multi-source data into trusted reporting assets, with repeated claims of major efficiency gains, faster decision-making, and high-confidence stakeholder adoption.

View profile
SS

Sagar Sidhwa

Screened

Senior AI/ML Engineer specializing in LLMs, MLOps, and predictive analytics

Jamestown, NY6y exp
CumminsBinghamton University

ML/AI engineer with hands-on experience building production MLOps systems for predictive maintenance and demand forecasting, including deployment, monitoring, and iterative retraining. Also shipped a RAG-based employee onboarding chatbot integrated with ServiceNow APIs and reports business impact of roughly $300k/month in reduced stockout and overstock costs.

View profile
Lokesh Jain - Senior AI/ML Engineer specializing in supply chain and healthcare systems in Bentonville, AR

Lokesh Jain

Screened

Senior AI/ML Engineer specializing in supply chain and healthcare systems

Bentonville, AR6y exp
Forman TechnologyUniversity at Buffalo

Built and deployed AcademiQ Ai, a production LLM-based teaching assistant using GPT/BERT with RAG (LangChain + Pinecone) to handle large student notes and generate adaptive explanations/quizzes. Demonstrated measurable retrieval-quality gains (18% precision improvement, 22% less irrelevant context) by tuning similarity thresholds and chunking based on user satisfaction signals. Also orchestrated terabyte-scale, real-time demand forecasting pipelines using Airflow and Kubeflow on GCP with strong monitoring, shadow deployment, and feedback-loop practices.

View profile
MJ

Meet Jhaveri

Screened

Mid-level Data Scientist specializing in AI/ML, LLMs, and healthcare analytics

California, USA3y exp
Johnson & JohnsonCalifornia State University, Fullerton

Built and shipped enterprise AI products including a conversational SQL analytics platform and a production RAG system at Johnson & Johnson. Combines full-stack engineering with LLM systems expertise, and has delivered measurable impact at scale, including 48% lower retrieval latency and 37% better response relevance across 12M+ records.

View profile
SL

Mid-level Full-Stack Python Developer specializing in LLM/GenAI for Banking & Healthcare

Kentwood, MI4y exp
Fifth Third BankUniversity of Central Missouri
View profile
Gopi Bhoyar - Mid-level Software Engineer specializing in FinTech data platforms and full-stack analytics in Whippany, NJ

Mid-level Software Engineer specializing in FinTech data platforms and full-stack analytics

Whippany, NJ4y exp
BarclaysNJIT
View profile
Adithya Raj Melayikandy - Junior Full-Stack Developer specializing in MERN and AI/ML systems in Mumbai, India

Junior Full-Stack Developer specializing in MERN and AI/ML systems

Mumbai, India1y exp
TCSUniversity at Buffalo
View profile
Gilbert Chaplin - Senior Data Engineer specializing in cloud data platforms and BI reporting in Austin, TX

Senior Data Engineer specializing in cloud data platforms and BI reporting

Austin, TX15y exp
CrowdStreetChicago State University
View profile
PremalParagbhai Shah - Junior Full-Stack & ML Engineer specializing in MLOps and time-series prediction in College Park, MD

Junior Full-Stack & ML Engineer specializing in MLOps and time-series prediction

College Park, MD2y exp
Project DAWNUniversity of Maryland, College Park
View profile
ST

Entry-level Machine Learning Engineer specializing in healthcare and analytics

Baltimore, MD1y exp
LifeBridge HealthGeorge Washington University
View profile
Jaisurya Kolluru - Mid-level Data Engineer specializing in cloud data pipelines and healthcare analytics in Virginia, USA

Mid-level Data Engineer specializing in cloud data pipelines and healthcare analytics

Virginia, USA3y exp
CVS HealthGeorge Mason University
View profile
Prajwal Manohar - Mid-level Software Engineer specializing in cloud-native microservices and AI systems in Bloomington, IN

Mid-level Software Engineer specializing in cloud-native microservices and AI systems

Bloomington, IN4y exp
Indiana University School of Public HealthIndiana University Bloomington
View profile
Xin Ning - Junior Software Engineer specializing in FinTech and full-stack systems

Junior Software Engineer specializing in FinTech and full-stack systems

3y exp
BNY MellonRensselaer Polytechnic Institute
View profile
KA

Senior Machine Learning Engineer / Data Scientist specializing in LLMs, RAG, and MLOps

6y exp
KrogerJawaharlal Nehru Technological University, Hyderabad
View profile

Need someone specific?

AI Search