Vetted PySpark Professionals

Pre-screened and vetted.

AS

Mid-level sales and data professional specializing in FinTech, telecom, and insurance

Woodbridge, NJ3y exp
Plymouth Rock AssuranceRowan University
View profile
SA

Principal Cloud & Data Architect specializing in AI-enabled AWS platforms

Austin, TX20y exp
AI20LABSEastern Mediterranean University
View profile
IM

Mid-level AI/ML Engineer specializing in financial risk, NLP, and MLOps

Norman, OK6y exp
Northern TrustUniversity of Oklahoma
View profile
MJ

Mid-level Backend Engineer specializing in distributed data systems

Pune, India4y exp
TCSSan Jose State University
View profile
VC

Mid-level Data Scientist specializing in industrial IoT, predictive analytics, and generative AI

Ruston, LA5y exp
Grambling State UniversityLouisiana Tech University

ML/NLP engineer with Industrial IoT experience who built an end-to-end anomaly detection and GenAI explanation system: AWS (S3, PySpark, EC2/Lambda) pipelines feeding dashboards, plus transformer-embedding vector search to connect anomalies to noisy maintenance notes and past events. Demonstrated measurable impact (15% lift in defect detection; ~35% reduction in manual review; 35% fewer preprocessing errors) and strong productionization practices (orchestration, monitoring, rollback, data-quality controls).

View profile
AA

Alexis Abbott

Screened

Senior Python Developer specializing in AWS, microservices, and data pipelines

Boston, MA6y exp
SumatoSoftPenn State University

Backend/data engineer with strong AWS production experience spanning serverless APIs and containerized workers (Lambda, API Gateway, ECS) plus data pipelines (Glue, S3, Athena/Redshift). Has modernized legacy SAS/cron batch systems into Python/AWS with parallel-run parity validation and low-risk cutovers, and has owned ETL incidents end-to-end (CloudWatch detection, backfills, and preventative controls). Targeting $130k–$150k base and strongly prefers remote, with occasional Bethesda onsite acceptable.

View profile
DS

Damon Summers

Screened

Senior Backend Software Engineer specializing in AWS cloud-native data platforms

Columbus, OH10y exp
Highcode TechUniversity of Maryland, College Park

AWS-focused Python backend/data engineer who builds production analytics APIs and ETL pipelines using API Gateway, Lambda, Step Functions, ECS, Glue, S3, and RDS. Strong in operational reliability and performance tuning (including SQL indexing/partitioning) and has modernized legacy SAS statistical processing into validated Python services with phased rollouts and stakeholder sign-off.

View profile
DM

Mid-level Data Scientist specializing in GenAI, RAG, and forecasting

New Jersey, USA4y exp
University at BuffaloUniversity at Buffalo

ML/NLP engineer focused on large-scale data linking for e-commerce-style catalogs and customer records, combining transformer embeddings (BERT/Sentence-BERT), NER, and FAISS-based vector search. Has delivered measurable lifts (e.g., +30% matching accuracy, Precision@10 62%→84%) and built production-grade, scalable pipelines in Airflow/PySpark with strong data quality and schema-drift handling.

View profile
MP

Mahesh Ponnam

Screened

Mid-level Data Scientist specializing in credit risk, fraud detection, and ESG analytics

PA, USA4y exp
Northern TrustWilmington University

AI/LLM practitioner who has deployed production chatbots across e-commerce, HRMS, and real estate, focusing on retrieval-first workflows for factual tasks like product and property search. Optimized intent understanding and significantly improved latency by using lightweight embeddings and tuning the inference pipeline on Groq (Llama 3.3), while applying modular orchestration and measurable production evaluation.

View profile
DC

Junior Data Engineer specializing in data pipelines and streaming ingestion

CT, United States3y exp
KoyetechArizona State University

Backend/data platform engineer who owned a near-real-time patient feedback ingestion system, building a FastAPI + Kafka service with Snowflake/Airflow orchestration. Demonstrates strong production Kubernetes/GitOps practices on AWS EKS (Helm, Argo CD, Sealed Secrets) and solved real-time data integrity issues via idempotent processing with Redis.

View profile
BB

Mid-Level Data Engineer specializing in cloud data pipelines and big data platforms

Newark, NJ3y exp
Horizon Blue Cross Blue Shield of NJUniversity of Memphis

Data engineer with ~4 years of experience building Python-based data ingestion/processing services and real-time streaming pipelines (Kafka/PubSub + Spark Structured Streaming). Has deployed containerized data applications on Kubernetes with GitLab CI/Jenkins pipelines and applied GitOps to cut deployment time ~40% while reducing config drift. Also supported a legacy on-prem data warehouse/backend migration to GCP using phased migration and parallel validation to meet strict reliability/SLA needs.

View profile
HD

Mid-level Data Engineer specializing in cloud data pipelines and analytics engineering

Boston, MA5y exp
AltaPotentiaNortheastern University

Built and deployed a production LLM-powered demand and churn forecasting system for an e-commerce client, combining open-source LLMs (LLaMA/Mistral) and Sentence-BERT embeddings to generate business-friendly explanations of forecast drivers. Strong focus on data quality and model trust (validation, baselines, segmented monitoring) and production reliability via Airflow-orchestrated pipelines with readiness checks, retries, and ongoing drift/A-B testing.

View profile
RK

Mid-level Software Engineer specializing in cloud, DevOps, and distributed systems

California, USA4y exp
University of Illinois ChicagoUniversity of Illinois Chicago
View profile
Satya Dineswara Reddy - Mid-level MLOps/ML Engineer specializing in LLMs and financial risk modeling in United States

Mid-level MLOps/ML Engineer specializing in LLMs and financial risk modeling

United States4y exp
Northern TrustIllinois Institute of Technology
View profile
Vikas Venkannagari - Mid-level Data Scientist specializing in Generative AI, RAG systems, and MLOps in Remote, USA

Mid-level Data Scientist specializing in Generative AI, RAG systems, and MLOps

Remote, USA5y exp
Enigma TechnologiesUniversity of Maryland, Baltimore County
View profile
Esteban Rios - Senior Cloud Software Engineer specializing in AWS microservices and DevOps in Harrison, NJ

Senior Cloud Software Engineer specializing in AWS microservices and DevOps

Harrison, NJ13y exp
Cox AutomotiveNJIT
View profile
AP

Mid-level AI/ML Data Engineer specializing in secure ML pipelines and AI governance

Plano, Texas4y exp
InfosoftUniversity of Texas at Dallas
View profile
JP

Mid-level Software Engineer specializing in AI and cloud-native data platforms

Overland Park, KS4y exp
APFMUniversity of Missouri
View profile
CM

Mid-level Data Scientist specializing in ML, NLP/LLMs, and MLOps

5y exp
CBRETexas A&M University-Corpus Christi
View profile
LS

Senior Full-Stack Software Engineer specializing in AWS, .NET, and data/telemetry platforms

Cincinnati, Ohio15y exp
Workhorse GroupBowling Green State University
View profile
SP

Mid-level Data Engineer specializing in FinTech and AI-ready data platforms

Illinois, USA3y exp
Northern TrustLindsey Wilson College
View profile
B`

Mid-level Machine Learning Engineer specializing in LLMs, Generative AI, and MLOps

Albany, NY4y exp
Northern TrustUniversity at Albany
View profile

Need someone specific?

AI Search