Vetted Apache Spark Professionals

Pre-screened and vetted.

Sai Addala

Screened

Mid-level AI/ML Engineer specializing in financial risk, fraud analytics, and forecasting

USA | 4y exp
Northern Trust | Syracuse University

Built and productionized an LLM-powered financial intelligence and forecasting platform at Northern Trust using a RAG architecture (LangChain + Hugging Face + FAISS) with end-to-end MLOps (Docker/Kubernetes, Airflow, MLflow). Emphasized regulatory-grade explainability (SHAP/Power BI) and hallucination control (retrieval-only grounding), achieving ~30% forecasting accuracy improvement and ~65% reduction in analyst research time, with sub-second inference and 95% uptime on EKS/AKS.

Billy Y

Screened

Junior Software Engineer specializing in Full-Stack and GenAI/LLM applications

San Jose, CA | 2y exp
Zymebalanz | Boston University

LLM/RAG practitioner building clinician-facing AI search and Q&A inside EHR workflows, focused on trust, latency, and safety (grounded answers with citations, PHI controls, encryption/audit logs). Demonstrated real-time incident response for production LLM systems (e.g., fixing a metadata-filter deployment regression to prevent irrelevant results/cross-patient leakage) and strong demo/enablement skills for mixed technical and clinical stakeholders; also shipped a multi-model RAG tool at OrbeX Labs with upload/search/audit features for day-to-day adoption.

JC

Mid-level Machine Learning Engineer specializing in LLMs, NLP, and MLOps

USA | 5y exp
McKesson | SUNY

Built a production LLM-RAG system at McKesson to let internal healthcare operations teams query large volumes of unstructured operational documents via natural language with source-backed answers, designed with HIPAA/FHIR compliance in mind. Demonstrated strong production engineering across hallucination mitigation, retrieval quality tuning, and latency/scalability optimization, using LangChain/LangGraph and Airflow plus rigorous evaluation/monitoring practices.

SK

Mid-level ML Engineer specializing in NLP and Generative AI

Houston, TX | 4y exp
Epic Systems | University of Central Missouri

Healthcare AI/ML engineer with Epic experience who built and deployed a HIPAA-compliant GPT-4 RAG clinical assistant over large medical document sets, emphasizing privacy controls and low-latency performance. Also automated end-to-end retraining and deployment of patient risk models using orchestration/CI-CD (Jenkins, SageMaker, MLflow), cutting deployment time from hours to minutes while improving reliability.

PK

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices and data analytics

Oklahoma City, USA | 5y exp
Wells Fargo | Oklahoma City University

Software engineer with experience at Wipro Technologies and Wells Fargo building React-based SPAs, reusable component libraries, and developer documentation. Demonstrated strong performance engineering (React.memo, list virtualization, code splitting) with reported >50% rendering-time improvement, plus hands-on production support by diagnosing API outages via monitoring/logs and implementing traffic/server fixes. Comfortable leading workstreams in fast-changing environments using Kanban and tight stakeholder feedback loops.

Snehal Chavan

Screened

Mid-Level Software Engineer specializing in backend systems and cloud infrastructure

California, USA | 4y exp
California State University, Fullerton | California State University, Fullerton

iOS full-stack/mobile engineer who built a SwiftUI (MVVM) barcode-scanner app using VisionKit for on-device, low-latency recognition, focusing on responsiveness via continuous scanning and debouncing. Also has Capgemini customer-facing support experience resolving restored-file access/permissions issues and receiving positive CSAT feedback.

Dimple Galla

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics

Lawrence, KS | 4y exp
Paycom | University of Kansas

Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.

Sumit Sahu

Screened

Mid-level Machine Learning Engineer specializing in computer vision and MLOps on GCP

Atlanta, GA | 4y exp
NCR Voyix | University of Georgia

ML/AI engineer who deployed a real-time, edge-based computer-vision pipeline for produce recognition in retail self-checkout to reduce shrink. Demonstrates strong end-to-end production chops: multi-camera data calibration/sync, ranking-based modeling for fine-grained classes, latency-focused optimization, and continuous A/B testing/monitoring with guardrails. Experienced with ML orchestration (Kubeflow Pipelines, Airflow) and CI/CD via GitHub Actions, and collaborates closely with store operations to make interventions usable in the checkout flow.

ND

Staff Software Engineer/Architect specializing in Java microservices and multi-cloud (AWS/Azure)

California, USA | 19y exp
NTT DATA | University of Hyderabad

Backend/platform engineer with State Farm experience modernizing and scaling an enterprise consolidated payment data platform and event-driven pipelines. Built cloud-native payment architecture (ECS->EKS) handling millions of financial transactions/day and high-volume telemetry (~100M events/day), with strong schema governance (Avro + schema registry) and production operations/incident mitigation driven by observability.

Hansitha P

Screened

Mid-level Data Engineer specializing in scalable ETL/ELT and real-time streaming pipelines

USA | 4y exp
CVS Health | University of Cincinnati

Built and shipped a production LLM-powered customer support agent for an EV charging platform using RAG plus internal APIs, automating session/payment issues and ticket routing. Emphasizes production readiness via guardrails, schema validation, state-machine orchestration, monitoring, and continuous evals, delivering a reported 35–40% reduction in support tickets and improved customer satisfaction.

Andrew Clayman

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote | 8y exp
Appstem | University of Southampton

Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.

Meghana Nandivada

Junior Machine Learning Engineer specializing in production ML systems and MLOps

2y exp
TCS | Stevens Institute of Technology

ML/AI engineer (TCS) who built and productionized a customer segmentation and personalized-offer recommendation pipeline end-to-end (data cleaning/feature engineering/clustering through Flask API deployment in Docker with monitoring). Emphasizes reliability and operational rigor via validation checks, periodic retraining, model/API versioning, and latency optimization, and has experience translating marketing KPIs into usable dashboards for non-technical teams.

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL | 13y exp
Optum | University of Texas at Dallas

ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.

Manali Patil

Screened

Senior Software Engineer & Engineering Manager specializing in cloud backend and manufacturing MES

Santa Clara, CA | 9y exp
Halo Industries | University of San Francisco

Customer-facing engineer who led recurring midnight ERP data-feed/B2B integrations from prototype to production, building reusable APIs and using Hangfire for job scheduling. Known for tight weekly customer iteration, strong documentation and test coverage (80%+), and cross-functional problem-solving with Operations/Quality/NPI to resolve data-collection and manufacturing-process constraints; has 2 customers live on the integration.

Varun Kothapalli

Mid-level AI/Machine Learning Engineer specializing in Generative AI, NLP, and MLOps

Saint Louis, MO | 6y exp
Equifax | Webster University

Built a production LLM/RAG document analysis system for large financial documents (credit reports/PDFs) to help business analysts extract insights faster. Implemented end-to-end pipeline orchestration with LangChain, vector search (e.g., FAISS), and hallucination controls (context grounding, similarity thresholds, and no-answer fallback), delivered as a Dockerized Python API.

Sreedivya Nagalli

Junior AI/ML Engineer specializing in deep learning and full-stack ML applications

2y exp
Amrita Vishwa Vidyapeetham | University at Buffalo

Built and operated a production-used RAG-based AI study planner (GPT-4 + FAISS) that handled 250+ concurrent users, with real-world reliability engineering (caching, fallbacks, schema validation, Redis state, monitoring). Also has healthcare data integration experience at Medinet Analytics, standardizing messy EHR/practice-management data with canonical schemas, idempotency hashing, and compliance-grade audit trails.

Somasekhar G

Screened

Mid-level Data Engineer specializing in cloud big data and streaming pipelines

California, USA | 4y exp
Smarc Solutions Inc | University of Colorado Boulder

Data engineer focused on large-scale financial data platforms, with hands-on ownership of an AWS + Databricks + Snowflake pipeline processing ~2TB/day. Strong in data quality (Great Expectations), schema drift automation, and production reliability (99.9%), plus measurable performance/cost wins (4h→1.2h, ~25% cost reduction). Also built an async Python crawling/ingestion framework with anti-bot mitigation, retries, and Airflow-driven backfills.

Varun Sharma

Screened

Mid-level AI Builder and Data Engineer specializing in GenAI and data pipelines

Remote, USA | 4y exp
Modern Streaming | Drexel University

Full-stack AI product engineer who personally built ViGenAir, a multimodal system that turns long-form video into ads using FastAPI, React, and agentic scoring. Stands out for handling complex 50GB+ media pipelines, re-architecting systems to eliminate OOM failures, and making opaque AI workflows usable through interactive visual UX that improved trust, speed, and retention.

Harshitha Ayenugula

Mid Software Engineer specializing in backend and FinTech systems

New Jersey, USA | 4y exp
Community Dreams Foundation | University at Buffalo

Full-stack engineer with strong ownership of complex web products, including building a real-time collaborative editor end-to-end using React, Spring Boot, WebSockets, Yjs CRDT, PostgreSQL, Redis, and Docker. Stands out for combining product delivery with production reliability and performance work, including reducing QA defects by ~25%, improving internal tool load times to under 2 seconds, and resolving latency issues in live systems.

CC

Executive product leader specializing in AI, SaaS platforms, and monetization

Seattle, WA | 14y exp
Submittable | Florida State University

Senior product leader who helped transform Submittable from a single-program grant tool into a multi-program impact platform, driving ARR from $20M to $70M+ while improving retention and margins. Particularly strong in enterprise platform strategy and human-centered AI, with a clear philosophy of using AI to augment expert judgment rather than replace it.

DB

Mid-level Full-Stack Java Developer specializing in microservices and cloud-native systems

Kansas | 5y exp
Cardinal Health | University of Central Missouri

Senior full-stack engineer with strong healthcare domain experience who has shipped an Azure OpenAI RAG-based patient medication support chatbot to production, driving ~10K queries/month and a reported 38% reduction in call center volume. Also builds polished real-time React/TypeScript pharmacy tooling and operates large-scale Python/Spark ETL pipelines (~12M records/day) with strong API design, observability, and cloud deployment experience across Azure/Kubernetes and AWS.

Sahil Chaubal

Screened

Senior AI/ML Engineer specializing in financial risk, fraud detection, and GenAI analytics

USA | 7y exp
Northern Trust | Syracuse University

AI/ML engineer with experience at Northern Trust and Persistent Systems building production LLM + RAG systems for regulated financial use cases, including liquidity forecasting, anomaly detection, and credit scoring. Emphasizes compliance-first design with explainability (SHAP), traceability (MLflow), and hallucination controls (FAISS + citation-grounded prompting), and has delivered drift-triggered retraining pipelines using Airflow and Kubernetes while translating model outputs into business-ready marketing segments.

TK

Mid-level AI/ML Engineer specializing in healthcare imaging and GenAI/LLM systems

New York, USA | 6y exp
UnitedHealthcare | Auburn University at Montgomery

Built and deployed a production LLM/RAG clinical document understanding and summarization system for healthcare, focused on reducing manual review time while meeting strict accuracy, latency, and compliance needs. Demonstrates strong MLOps/orchestration depth (Airflow, Kubernetes, Azure ML Pipelines) and a rigorous approach to hallucination mitigation through layered, source-grounded safeguards and stakeholder-driven requirements with physicians/compliance teams.

Jimmy Dani

Screened

Mid-level AI Researcher specializing in privacy-preserving ML and applied cryptography

College Station, TX | 6y exp
Texas A&M University | Texas A&M University

Graduate researcher who builds production-grade AI systems spanning LLM security evaluation and on-device RAG. Created HoneyLearner, a self-learning attack framework using GPT-4-class models as structured black-box attackers against honeywords defenses, with rigorous metrics and reproducible orchestration (Airflow/Spark/Kafka/Docker). Also partnered with agriculture scientists at Texas A&M–Corpus Christi to deliver UAV + 3D point-cloud crop-stress maps that cut time-to-insight ~40% and enabled ~30% earlier interventions.
