Vetted Apache Spark Professionals

Pre-screened and vetted.

Chandan Chalumuri - Mid-level Data Scientist specializing in ML, NLP, and Generative AI in Tempe, AZ

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Tempe, AZ4y exp
MetLifeArizona State University

Data engineering / ML practitioner with experience at MetLife building transformer-based sentiment analysis over large unstructured datasets and productionizing pipelines with Airflow/PySpark/Hadoop (reported 52% efficiency gain). Also implemented embedding-based semantic search using Pinecone/Weaviate to improve retrieval relevance and enable RAG for customer support and document matching use cases.

View profile
Sharanya Rao - Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for finance and healthcare in Remote, USA

Sharanya Rao

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for finance and healthcare

Remote, USA3y exp
Ally FinancialUniversity of Maryland, Baltimore County

Built an AI lending assistant (RAG + DeBERTa) used by credit analysts to retrieve policies and past loan decisions, tackling real production issues like hallucinations, document quality, and sub-second latency. Deployed a modular, Dockerized AWS architecture (ECS/EMR + load balancer) with load testing, caching/precomputed embeddings, and CloudWatch monitoring, and used Airflow to automate scheduled data/embedding/vector DB refresh pipelines with retries and alerts.

View profile
Mike Gardiner - Technology Executive / Engineering Director specializing in AI-driven platform transformation in Lehi, UT

Mike Gardiner

Screened

Technology Executive / Engineering Director specializing in AI-driven platform transformation

Lehi, UT12y exp
VivintWeber State University

Built a 0-to-1 iOS mobile gardening application that helps users plan, track, and harvest crops with pest control guidance, weather, and climate-zone-based planting date recommendations. Demonstrated strong customer discovery and MVP-first product execution, including a major data challenge: compiling US climate zone data for every ZIP code from widely dispersed public sources into an app-ready database.

View profile
srilekha pothula - Mid-level Data Engineer specializing in cloud data pipelines for healthcare and financial services in Bloomfield, CT

Mid-level Data Engineer specializing in cloud data pipelines for healthcare and financial services

Bloomfield, CT4y exp
CignaPace University

Data engineer with ~4 years of experience (Cigna) building and operating Azure Data Factory pipelines for healthcare claims/member/provider data at 2–3M records/day. Emphasizes reliability and downstream safety via schema/data-quality validation, quarantine workflows, idempotent processing, and backfills; also improved runtime ~20% through SQL optimization and served curated datasets through versioned views and well-documented, analyst-friendly interfaces.

View profile
AA

Agna Antony

Screened

Mid-level Data Engineer specializing in cloud-native healthcare and enterprise data platforms

Michigan, USA5y exp
MedStar HealthAPJ Abdul Kalam Technological University

Data Engineer (TCS) who owned an end-to-end CRM analytics pipeline for Bayer’s eSalesWeb integration, ingesting from Salesforce APIs/databases/S3 and serving analytics-ready datasets via PostgreSQL/S3 for Tableau. Drove measurable outcomes: ~60% reduction in manual data-quality effort, ~30% lower latency through SQL optimization, and ~35% improved stability via monitoring, retries, and idempotent processing.

View profile
FM

Senior AI/ML Engineer specializing in healthcare AI and MLOps

Mansfield, TX16y exp
McKessonSam Houston State University

Healthcare AI engineer with hands-on ownership of production ML and LLM systems at McKesson, spanning clinical risk prediction and RAG-based documentation tools. Stands out for combining deep clinical-data experience, HIPAA-aware deployment practices, and measurable impact through reduced readmissions, clinician workflow gains, and 20% to 30% faster ML delivery for engineering teams.

View profile
Phani K - Mid-level AI/ML Engineer specializing in GenAI, NLP, and healthcare-financial ML in Terre Haute, IN

Phani K

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and healthcare-financial ML

Terre Haute, IN4y exp
UnitedHealth GroupIndiana State University

ML/AI engineer with hands-on experience shipping healthcare AI systems, including an oncology risk prediction platform and RAG-based clinical decision support tools. Stands out for combining clinical domain context with strong production engineering across Spark, FastAPI, AWS SageMaker, monitoring, evaluation, and safety guardrails.

View profile
Apoorv Bankey - Mid-level Backend Engineer specializing in distributed systems and FinTech in New York City, NY

Apoorv Bankey

Screened

Mid-level Backend Engineer specializing in distributed systems and FinTech

New York City, NY6y exp
Rutgers UniversityRutgers University

Engineer who uses AI and multi-agent workflows as a force multiplier while keeping architecture, security, scalability, and production quality under human control. Shared a concrete example of accelerating a backend-heavy SaaS email ingestion platform with authentication, role-based APIs, database models, and deployment setup using agent-style development and review.

View profile
AR

Amruth Reddy

Screened

Mid-level Software Engineer specializing in Python backend and AI applications

Irving, TX3y exp
CGIBoston University

ML engineer at CGI who built demand forecasting models end-to-end, from feature engineering and training through AWS deployment. Stands out for a production-first mindset and strong skepticism of AI-generated code, including catching a Copilot-generated SQL query that would have caused a costly full table scan in production.

View profile
SR

Sandeep Reddy

Screened

Mid-level Software Engineer specializing in full-stack cloud-native systems

Remote, USA4y exp
WalmartWebster University

Full-stack engineer with hands-on experience building real-time analytics and logistics platforms across modern JavaScript and Java stacks. They combine strong production ownership and database optimization skills with architectural leadership, including redesigning bottlenecks with SQS/Lambda and driving a monolith-to-microservices migration on Kubernetes that cut deployment time by 50%.

View profile
YN

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Michigan, USA3y exp
Ally FinancialUniversity of Michigan-Dearborn

GenAI/ML engineer with production experience at Cognizant and Ally Financial, building end-to-end LLM/RAG systems and ML pipelines. Delivered a domain chatbot trained from 90k tickets and 45k docs, improving intent accuracy (65%→83%), scaling to 800+ concurrent users with 99.2% uptime and sub-150ms latency, and driving +14% customer satisfaction. Strong in Azure ML + DevOps CI/CD, Dockerized deployments, and explainable/PII-safe modeling using SHAP/LIME to satisfy stakeholder trust and GDPR needs.

View profile
KP

Mid-level Full-Stack Java Developer specializing in cloud-native microservices and React

5y exp
Northern TrustCentral Michigan University

Full-stack engineer who owned enterprise workflow platforms end-to-end at Northern Trust and Elevance Health—building NestJS/Java Spring Boot APIs, React UIs, and cloud deployments on GCP Cloud Run. Strong in data-heavy applications (hundreds of thousands of records) with proven production performance tuning (indexing/query rewrites, Cloud Run concurrency/min instances) and secure RBAC via Azure AD.

View profile
HT

Mid-level Machine Learning Engineer specializing in LLMs, agentic AI, and risk/fraud modeling

San Francisco, CA3y exp
The Research Foundation for SUNYUniversity at Buffalo

Built and productionized an agentic LLM workflow during a summer internship to transform unstructured clinical reports into analytics-ready structured data, using a LangChain multi-agent design plus an LLM-as-a-judge layer to control quality in a regulated setting. Also has experience orchestrating ML pipelines at Piramal Capital using AWS Step Functions/EventBridge/CloudWatch, with strong emphasis on observability, evaluation rigor, and measurable impact (80–90% reduction in manual data entry).

View profile
TG

Executive Technology Leader (CTO/CIO) specializing in AI/ML, cloud modernization, and FinTech

Santa Monica, CA11y exp
Web3AdvisorsUniversity of Phoenix

Engineering/technology leader (CTO-style) with experience scaling orgs and running distributed teams across four continents for over a decade. Led a high-stakes modernization of a securities trading platform at Wedbush—migrating from monolith to microservices on AWS with zero-downtime constraints—driving 45% execution performance improvement and enabling 25% market share growth. Emphasizes business-aligned roadmaps, build-vs-buy rigor, and scalable engineering practices/culture.

View profile
SK

Mid-level Data Scientist specializing in real-time fraud detection and MLOps

San Francisco, CA5y exp
Charles SchwabCUNY Graduate Center

ML/NLP engineer with experience at Charles Schwab building an NLP + graph (Neo4j) entity-resolution system to unify fragmented user/device/transaction data and improve downstream model quality and analyst querying. Has applied embeddings (SentenceTransformers + FAISS) with domain fine-tuning to boost hard-case matching recall by ~12% while maintaining precision, and has a track record of hardening scalable Python/Spark pipelines and productionizing fraud models via A/B tests and shadow-mode monitoring.

View profile
AB

Senior Data & Platform Engineer specializing in cloud-native streaming and distributed systems

USA10y exp
JPMorgan ChaseNew York Institute of Technology

Financial data engineer who has built and operated high-volume batch + streaming pipelines (200–300 GB/day; 5–10k events/sec) using AWS, Spark/Delta, Airflow, Kafka, and Snowflake, with strong emphasis on data quality and reliability. Demonstrated measurable impact via 99.9% SLA adherence, major reductions in bad records/nulls, MTTR improvements, and significant latency/runtime/query performance gains; also built a distributed web-scraping system processing 5–10M records/day with anti-bot and schema-drift defenses.

View profile
MS

Mid-level Data Engineer specializing in multi-cloud data platforms for healthcare and finance

USA6y exp
CignaUniversity of Cincinnati

Data engineer with Cigna experience building and operating an end-to-end AWS-based healthcare claims pipeline processing ~2TB/day, using Glue/Kafka/PySpark/SQL into Redshift. Strong focus on data quality and reliability (schema validation, monitoring/alerting, retries/checkpointing/backfills), reporting improved accuracy (~99%) and reduced latency, plus experience serving real-time Kafka/Spark data to downstream analytics with documented data contracts.

View profile
KR

Mid-Level Backend Engineer specializing in SaaS, FinTech, and AI document intelligence

San Francisco, CA3y exp
IntraEdgeNYU

Full-stack engineer who built an AI-driven document analysis and processing workflow end-to-end, including large-document ingestion, queued async processing, and low-latency retrieval for user-facing flows. Demonstrated practical performance tuning (moving heavy work off request path, polling, caching) and Postgres optimization validated with EXPLAIN ANALYZE, plus durable workflow resilience via retries and dead-letter queues.

View profile
AG

Mid-level Data Engineer specializing in cloud ETL and real-time streaming

New York, NY6y exp
PNCRochester Institute of Technology

Data engineer focused on AWS + Spark/Databricks pipelines, including an end-to-end nightly loan-data ingestion flow (~2.2M records) from Postgres/S3 through Glue and Databricks into a DWH with layered validation and alerting. Also built real-time streaming with Kafka + Spark Structured Streaming and a master’s project streaming Reddit data for sentiment analysis under ambiguous requirements and tight budget constraints.

View profile
Subhash Krishnamoorthy - Executive Technology Leader specializing in digital transformation, headless e-commerce, and cloud architecture in Chesterfield, VA

Executive Technology Leader specializing in digital transformation, headless e-commerce, and cloud architecture

Chesterfield, VA25y exp
Hamilton BeachUniversity of Phoenix

Technology leader focused on business-aligned roadmaps and integration-heavy ecommerce platforms. Recently delivered an on-time launch for lutusooking.com (a premium Hamilton Beach brand) by coordinating UX/UI, component-based middleware, BigCommerce, Algolia search, personalization/recommendations, payments, and supply chain integrations, and later improved scalability via a Jitterbit iPaaS approach proven during Black Friday/Cyber Monday traffic.

View profile
Muaaz Syed - Mid-level AI/ML Engineer specializing in NLP and conversational AI in Richardson, TX

Muaaz Syed

Screened

Mid-level AI/ML Engineer specializing in NLP and conversational AI

Richardson, TX4y exp
CVS HealthUniversity of Texas at Dallas

ML/NLP engineer focused on real-time IT ops analytics, building a predictive maintenance/anomaly detection platform end-to-end (multi-source ETL, streaming, modeling, and production deployment on GCP/Vertex AI). Uses deep learning (LSTMs, autoencoders/VAEs) plus embeddings (SentenceBERT) and vector search to improve incident correlation and search, citing ~40% reduction in duplicate alert noise.

View profile
Ramcharan Reddy - Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices in Texas, USA

Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices

Texas, USA6y exp
Morgan StanleyUniversity of Central Missouri

Backend engineer focused on AI-enabled systems, having built a production-style RAG pipeline (vector search + LLM) exposed via Python/Flask endpoints with strong observability and hallucination-reduction techniques. Demonstrates deep performance work in PostgreSQL/SQLAlchemy (5x faster analytics queries) and high-throughput optimization using Celery + Redis (800ms to 120ms latency, 3x throughput), plus schema-per-tenant multi-tenancy with tenant-aware middleware and logging.

View profile

Need someone specific?

AI Search