Vetted PySpark Professionals

Pre-screened and vetted.

UK

Junior AI Engineer specializing in OCR and tax document automation

Irvine, CA2y exp
Clarkson UniversityClarkson University
View profile
HT

Mid-level AI/ML Engineer specializing in LLMs, RAG, and agentic AI systems

Brooklyn, NY3y exp
CARA SYSTEMSNortheastern University
View profile
HK

Mid-level Data Scientist / Software Engineer specializing in AI automation and cloud microservices

Remote4y exp
ScanAvertNJIT
View profile
BS

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG systems

CA, USA5y exp
DXC TechnologyCalifornia State University, Long Beach
View profile
NK

Junior Data Scientist specializing in cybersecurity and AI/ML

Saint Paul, Minnesota3y exp
ONCODEAUniversity of St. Thomas
View profile
PK

Mid-Level Machine Learning Engineer specializing in LLMs and RAG systems

Ohio, USA3y exp
Leaf HomeSan José State University
View profile
RP

Intern Data Scientist / ML Engineer specializing in predictive modeling and data pipelines

Hyderabad, India1y exp
National Remote Sensing CentreMontclair State University
View profile
NS

Mid-level AI/ML Engineer specializing in MLOps, streaming data, and NLP/CV

USA4y exp
CGIUniversity of Central Missouri
View profile
DU

Mid-level Data Analyst specializing in marketing analytics and machine learning

Columbus, Ohio4y exp
ElevateMeStevens Institute of Technology
View profile
AG

Mid-level AI/ML Engineer specializing in MLOps and fraud detection

USA4y exp
Northern TrustLewis University
View profile
SM

Mid-level AI/ML Engineer specializing in GenAI, RAG, and multi-agent LLM systems

Boston, MA4y exp
PredictaBio InnovationsKhoury College of Computer Sciences (Northeastern University)
View profile
NJ

Senior AI/ML Engineer specializing in Generative AI and LLMOps

Washington, DC10y exp
Clarion Tech
View profile
Pranava Reddy Kothapally - Junior Data Engineer specializing in Azure, CRM data pipelines, and marketing personalization in Hyderabad, India

Pranava Reddy Kothapally

Screened ReferencesStrong rec.

Junior Data Engineer specializing in Azure, CRM data pipelines, and marketing personalization

Hyderabad, India2y exp
TechwaveCleveland State University

LLM/AI engineer who has deployed production RAG conversational analytics and Text-to-SQL systems over Snowflake and curated data marts, emphasizing enterprise-grade guardrails for accuracy, security, and cost. Notable for a structured approach to reducing hallucinations (curated metric/table registry, SQL validation, RBAC, and citation-backed responses) and for building resilient, observable multi-step agent workflows using LangChain/LlamaIndex and Airflow.

View profile
Snigdha Reddy Podduturi - Junior Data & AI Engineer specializing in cloud AI and analytics in Remote

Snigdha Reddy Podduturi

Screened ReferencesStrong rec.

Junior Data & AI Engineer specializing in cloud AI and analytics

Remote3y exp
Lightning MindsUniversity of Massachusetts Lowell

Built production AI backend systems in healthcare and e-commerce, including a healthcare agent that automated clinical workflows like medication refills, immunizations, and scheduling using FHIR APIs and cloud-native infrastructure. Strong in end-to-end backend ownership, LLM orchestration, and adding guardrails/validation for high-stakes and customer-facing AI workflows.

View profile
HS

Helly Shah

Screened ReferencesModerate rec.

Junior Data Analyst specializing in business analytics and machine learning

New York, NY2y exp
Handshake AI Solutions, LLCBaruch College (CUNY)

Analytics-focused candidate with hands-on project experience in SQL data preparation and Python-based churn modeling. They demonstrated a practical approach to turning messy multi-source data into reporting tables, validating data quality rigorously, and translating churn insights into targeted retention strategies.

View profile
AS

Ashish Shah

Screened

Mid-level Data Engineer / Software Engineer specializing in streaming and cloud data platforms

Arlington, TX3y exp
The University of Texas at ArlingtonUniversity of Texas at Arlington

Backend engineer with deep Kafka/FastAPI microservices experience who redesigned a notification pipeline to cut end-to-end latency from ~5s to ~3s (including custom partition assignment and consumer tuning). Led a high-stakes ClickUp-to-Oracle migration of 1M+ records using idempotent ETL, reconciliation, and shadow deployment to achieve >99% integrity with zero downtime, and has hands-on production security implementation with Django/DRF (JWT + RBAC).

View profile
SS

Mid-level AI Engineer and Data Scientist specializing in LLM agents and RAG systems

Palo Alto, CA5y exp
LemmataUniversity at Buffalo

Built a production-grade LLM evaluation and regression system that stress-tests models across hundreds of iterations, combining LLM-as-judge, semantic similarity, statistical metrics, and rule-based checks, with results delivered via stakeholder-friendly HTML reports and dashboards. Experienced orchestrating multi-agent RAG workflows using LangChain/LangGraph and event-driven GenAI pipelines in n8n integrating OCR, speech-to-text, and external APIs, with strong emphasis on reliability, observability, and explainable failures.

View profile
SS

Mid-level Data Scientist specializing in Generative AI and LLMOps

Dover, USA4y exp
Visual TechnologiesUniversity of Houston

Built a production-grade, semi-automated document recognition and classification system for large volumes of scanned PDFs, starting from little/no labeled data and handling highly variable scan quality. Deployed on AWS using SageMaker + Docker and orchestrated on EKS with a microservices design that scales CPU-heavy OCR separately from GPU inference, with strong reliability controls (validation, fallbacks, retries, readiness probes).

View profile
SV

Mid-Level Full-Stack Software Engineer specializing in cloud-native apps and ML services

Bowling Green, OH4y exp
Senecio Software IncBowling Green State University

Software engineer who deployed and stabilized a real-time analytics platform at Senecio Software, focusing on production reliability, observability, and performance under load. Experienced debugging issues spanning distributed services and networking (e.g., tracing timeouts to packet loss from misconfiguration) and extending Python (FastAPI/Django) APIs for customer-specific analytics features in a configurable, maintainable way.

View profile
AK

Ajith Kumar

Screened

Mid-level AI Data Engineer specializing in GenAI, RAG, and cloud data pipelines

Irving, TX5y exp
Mouri TechGeorge Mason University

LLM/agentic AI builder who deployed a production ITSM automation agent on Google ADK integrating ServiceNow and FreshService, with strong safety guardrails (human-approval gating and runbook-only command execution) and rigorous evaluation (500 synthetic tickets; 80%+ false-positive reduction). Also partnered with finance to deliver an AI agent that automated invoice/SOW retrieval and monthly reporting to account managers, reducing manual back-and-forth.

View profile
LS

Mid-level AI Engineer specializing in Generative AI and LLM systems

Grand Ledge, MI3y exp
ChainSysUniversity of Michigan-Dearborn

Built and deployed a production-grade, multi-agent Text-to-SQL assistant that lets non-technical stakeholders query large enterprise databases in natural language. Uses Pinecone-based schema retrieval + LLM reasoning (Gemini/Claude/GPT) with a dedicated validation agent (schema/syntax checks and safe dry runs) to reduce hallucinations and improve reliability, while optimizing latency and cost via async execution and embedding caching.

View profile
NT

Mid-level AI Engineer specializing in ML, LLM applications, and data automation

Atlanta, GA4y exp
Exus Renewables North AmericaGeorgia State University

Data/ML practitioner who has built a production RAG-based knowledge assistant integrated into Microsoft 365/internal dashboards to help employees query internal documents in plain English. Experienced orchestrating and hardening ETL pipelines with Airflow and Azure Data Factory (validation, retries, monitoring) and running end-to-end model evaluation and production performance tracking via Power BI.

View profile
VS

VIJAY SAGI

Screened

Mid-level Data Engineer specializing in cloud-native batch and streaming pipelines

Prosper, TX5y exp
ACL DigitalTrine University

Data/ML platform engineer with ~6 years in financial services and enterprise data platforms, building regulated fraud/credit-risk pipelines on AWS (Airflow, EMR/Spark, MLflow) and an Azure lakehouse ingesting 50+ sources and serving ~100M records/day. Also led an early-stage deployment of a RAG-based internal AI search tool using AWS Bedrock and LangChain with automated evaluation to validate LLM accuracy.

View profile

Need someone specific?

AI Search