Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

Sai Addala

Screened

Mid-level AI/ML Engineer specializing in financial risk, fraud analytics, and forecasting

USA4y exp

Northern TrustSyracuse University

“Built and productionized an LLM-powered financial intelligence and forecasting platform at Northern Trust using a RAG architecture (LangChain + Hugging Face + FAISS) with end-to-end MLOps (Docker/Kubernetes, Airflow, MLflow). Emphasized regulatory-grade explainability (SHAP/Power BI) and hallucination control (retrieval-only grounding), achieving ~30% forecasting accuracy improvement and ~65% reduction in analyst research time, with sub-second inference and 95% uptime on EKS/AKS.”

Python NumPy Pandas JSON SQL PostgreSQL+116

View profile

Billy Y

Screened

Junior Software Engineer specializing in Full-Stack and GenAI/LLM applications

San Jose, CA2y exp

ZymebalanzBoston University

“LLM/RAG practitioner building clinician-facing AI search and Q&A inside EHR workflows, focused on trust, latency, and safety (grounded answers with citations, PHI controls, encryption/audit logs). Demonstrated real-time incident response for production LLM systems (e.g., fixing a metadata-filter deployment regression to prevent irrelevant results/cross-patient leakage) and strong demo/enablement skills for mixed technical and clinical stakeholders; also shipped a multi-model RAG tool at OrbeX Labs with upload/search/audit features for day-to-day adoption.”

Python C++Java C HTML JavaScript+174

View profile

Jahnavi Chakka

Screened

Mid-level Machine Learning Engineer specializing in LLMs, NLP, and MLOps

USA5y exp

McKessonSUNY

“Built a production LLM-RAG system at McKesson to let internal healthcare operations teams query large volumes of unstructured operational documents via natural language with source-backed answers, designed with HIPAA/FHIR compliance in mind. Demonstrated strong production engineering across hallucination mitigation, retrieval quality tuning, and latency/scalability optimization, using LangChain/LangGraph and Airflow plus rigorous evaluation/monitoring practices.”

A/B Testing Agile Amazon ECS Amazon EKS Amazon SageMaker Algorithms+125

View profile

SaiGanesh Konagalla

Screened

Mid-level ML Engineer specializing in NLP and Generative AI

Houston, TX4y exp

Epic SystemsUniversity of Central Missouri

“Healthcare AI/ML engineer with Epic experience who built and deployed a HIPAA-compliant GPT-4 RAG clinical assistant over large medical document sets, emphasizing privacy controls and low-latency performance. Also automated end-to-end retraining and deployment of patient risk models using orchestration/CI-CD (Jenkins, SageMaker, MLflow), cutting deployment time from hours to minutes while improving reliability.”

Python NumPy Pandas SciPy Scikit-learn Seaborn+186

View profile

Pavithra Kandavel

Screened

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices and data analytics

Oklahoma City, USA5y exp

Wells FargoOklahoma City University

“Software engineer with experience at Wipro Technologies and Wells Fargo building React-based SPAs, reusable component libraries, and developer documentation. Demonstrated strong performance engineering (React.memo, list virtualization, code splitting) with reported >50% rendering-time improvement, plus hands-on production support by diagnosing API outages via monitoring/logs and implementing traffic/server fixes. Comfortable leading workstreams in fast-changing environments using Kanban and tight stakeholder feedback loops.”

Java Python TypeScript JavaScript SQL Angular+144

View profile

Snehal Chavan

Screened

Mid-Level Software Engineer specializing in backend systems and cloud infrastructure

California, USA4y exp

California State University, FullertonCalifornia State University, Fullerton

“iOS full-stack/mobile engineer who built a SwiftUI (MVVM) barcode-scanner app using VisionKit for on-device, low-latency recognition, focusing on responsiveness via continuous scanning and debouncing. Also has Capgemini customer-facing support experience resolving restored-file access/permissions issues and receiving positive CSAT feedback.”

C++Python Swift Flask MySQL SQL+59

View profile

Dimple Galla

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics

Lawrence, KS4y exp

PaycomUniversity of Kansas

“Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.”

A/B Testing Amazon EC2 Apache Kafka Apache Spark AWS AWS Glue+163

View profile

Sumit Sahu

Screened

Mid-level Machine Learning Engineer specializing in computer vision and MLOps on GCP

Atlanta, GA4y exp

NCR VoyixUniversity of Georgia

“ML/AI engineer who deployed a real-time, edge-based computer-vision pipeline for produce recognition in retail self-checkout to reduce shrink. Demonstrates strong end-to-end production chops: multi-camera data calibration/sync, ranking-based modeling for fine-grained classes, latency-focused optimization, and continuous A/B testing/monitoring with guardrails. Experienced with ML orchestration (Kubeflow Pipelines, Airflow) and CI/CD via GitHub Actions, and collaborates closely with store operations to make interventions usable in the checkout flow.”

Python C++Rust SQL Java PyTorch+100

View profile

Narendar Dheeravath

Screened

Staff Software Engineer/Architect specializing in Java microservices and multi-cloud (AWS/Azure)

California, USA19y exp

NTT DATAUniversity of Hyderabad

“Backend/platform engineer with State Farm experience modernizing and scaling an enterprise consolidated payment data platform and event-driven pipelines. Built cloud-native payment architecture (ECS->EKS) handling millions of financial transactions/day and high-volume telemetry (~100M events/day), with strong schema governance (Avro + schema registry) and production operations/incident mitigation driven by observability.”

Agile Amazon CloudWatch Amazon DynamoDB Amazon EC2 Amazon ECS Amazon EKS+262

View profile

Hansitha P

Screened

Mid-level Data Engineer specializing in scalable ETL/ELT and real-time streaming pipelines

USA4y exp

CVS HealthUniversity of Cincinnati

“Built and shipped a production LLM-powered customer support agent for an EV charging platform using RAG plus internal APIs, automating session/payment issues and ticket routing. Emphasizes production readiness via guardrails, schema validation, state-machine orchestration, monitoring, and continuous evals, delivering a reported 35–40% reduction in support tickets and improved customer satisfaction.”

Python SQL ETL Apache Spark PySpark Apache Kafka+76

View profile

Andrew Clayman

Screened

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote8y exp

AppstemUniversity of Southampton

“Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.”

Python C++SQL Docker Flask CI/CD+133

View profile

Meghana Nandivada

Screened

Junior Machine Learning Engineer specializing in production ML systems and MLOps

2y exp

TCSStevens Institute of Technology

“ML/AI engineer (TCS) who built and productionized a customer segmentation and personalized-offer recommendation pipeline end-to-end (data cleaning/feature engineering/clustering through Flask API deployment in Docker with monitoring). Emphasizes reliability and operational rigor via validation checks, periodic retraining, model/API versioning, and latency optimization, and has experience translating marketing KPIs into usable dashboards for non-technical teams.”

Python SQL Java Scala Machine Learning MLOps+99

View profile

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL13y exp

OptumUniversity of Texas at Dallas

“ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.”

Python R SQL MATLAB C C#+157

View profile

Manali Patil

Screened

Senior Software Engineer & Engineering Manager specializing in cloud backend and manufacturing MES

Santa Clara, CA9y exp

Halo IndustriesUniversity of San Francisco

“Customer-facing engineer who led recurring midnight ERP data-feed/B2B integrations from prototype to production, building reusable APIs and using Hangfire for job scheduling. Known for tight weekly customer iteration, strong documentation and test coverage (80%+), and cross-functional problem-solving with Operations/Quality/NPI to resolve data-collection and manufacturing-process constraints; has 2 customers live on the integration.”

Apache Spark AWS AWS IAM AWS Lambda C#C+++107

View profile

Varun Kothapalli

Screened

Mid-level AI/Machine Learning Engineer specializing in Generative AI, NLP, and MLOps

Saint Louis, MO6y exp

EquifaxWebster University

“Built a production LLM/RAG document analysis system for large financial documents (credit reports/PDFs) to help business analysts extract insights faster. Implemented end-to-end pipeline orchestration with LangChain, vector search (e.g., FAISS), and hallucination controls (context grounding, similarity thresholds, and no-answer fallback), delivered as a Dockerized Python API.”

Artificial Intelligence Machine Learning Deep Learning Supervised Learning Feature Engineering Model Evaluation+89

View profile

Sreedivya Nagalli

Screened

Junior AI/ML Engineer specializing in deep learning and full-stack ML applications

2y exp

Amrita Vishwa VidyapeethamUniversity at Buffalo

“Built and operated a production-used RAG-based AI study planner (GPT-4 + FAISS) that handled 250+ concurrent users, with real-world reliability engineering (caching, fallbacks, schema validation, Redis state, monitoring). Also has healthcare data integration experience at Medinet Analytics, standardizing messy EHR/practice-management data with canonical schemas, idempotency hashing, and compliance-grade audit trails.”

Python SQL MATLAB Java C++Object-Oriented Programming (OOP)+114

View profile

somasekhar G

Screened

Mid-level Data Engineer specializing in cloud big data and streaming pipelines

California, USA4y exp

Smarc Solutions IncUniversity of Colorado Boulder

“Data engineer focused on large-scale financial data platforms, with hands-on ownership of an AWS + Databricks + Snowflake pipeline processing ~2TB/day. Strong in data quality (Great Expectations), schema drift automation, and production reliability (99.9%), plus measurable performance/cost wins (4h→1.2h, ~25% cost reduction). Also built an async Python crawling/ingestion framework with anti-bot mitigation, retries, and Airflow-driven backfills.”

AWS AWS Lambda Amazon Kinesis AWS Step Functions Amazon EKS AWS IAM+93

View profile

Varun Sharma

Screened

Mid-level AI Builder and Data Engineer specializing in GenAI and data pipelines

Remote, USA4y exp

Modern StreamingDrexel University

“Full-stack AI product engineer who personally built ViGenAir, a multimodal system that turns long-form video into ads using FastAPI, React, and agentic scoring. Stands out for handling complex 50GB+ media pipelines, re-architecting systems to eliminate OOM failures, and making opaque AI workflows usable through interactive visual UX that improved trust, speed, and retention.”

Large Language Models Agentic AI LangGraph LangChain Hugging Face TensorFlow+117

View profile

Harshitha Ayenugula

Screened

Mid Software Engineer specializing in backend and FinTech systems

New Jersey, USA4y exp

Community Dreams FoundationUniversity at Buffalo

“Full-stack engineer with strong ownership of complex web products, including building a real-time collaborative editor end-to-end using React, Spring Boot, WebSockets, Yjs CRDT, PostgreSQL, Redis, and Docker. Stands out for combining product delivery with production reliability and performance work, including reducing QA defects by ~25%, improving internal tool load times to under 2 seconds, and resolving latency issues in live systems.”

Java TypeScript JavaScript Python C SQL+125

View profile

Claudius Christian

Screened

Executive product leader specializing in AI, SaaS platforms, and monetization

Seattle, WA14y exp

SubmittableFlorida State University

“Senior product leader who helped transform Submittable from a single-program grant tool into a multi-program impact platform, driving ARR from $20M to $70M+ while improving retention and margins. Particularly strong in enterprise platform strategy and human-centered AI, with a clear philosophy of using AI to augment expert judgment rather than replace it.”

Workflow automation Data analytics Product strategy A/B testing Snowflake Figma+574

View profile

Dinesh Battula

Screened

Mid-level Full-Stack Java Developer specializing in microservices and cloud-native systems

Kansas, null5y exp

Cardinal HealthUniversity of Central Missouri

“Senior full-stack engineer with strong healthcare domain experience who has shipped an Azure OpenAI RAG-based patient medication support chatbot to production, driving ~10K queries/month and a reported 38% reduction in call center volume. Also builds polished real-time React/TypeScript pharmacy tooling and operates large-scale Python/Spark ETL pipelines (~12M records/day) with strong API design, observability, and cloud deployment experience across Azure/Kubernetes and AWS.”

SDLC Agile Scrum Kanban Microservices Architecture Java+136

View profile

Sahil Chaubal

Screened

Senior AI/ML Engineer specializing in financial risk, fraud detection, and GenAI analytics

USA7y exp

Northern TrustSyracuse University

“AI/ML engineer with experience at Northern Trust and Persistent Systems building production LLM + RAG systems for regulated financial use cases, including liquidity forecasting, anomaly detection, and credit scoring. Emphasizes compliance-first design with explainability (SHAP), traceability (MLflow), and hallucination controls (FAISS + citation-grounded prompting), and has delivered drift-triggered retraining pipelines using Airflow and Kubernetes while translating model outputs into business-ready marketing segments.”

Python R SQL PostgreSQL MySQL Microsoft SQL Server+114

View profile

Tadigotla Kumar Reddy

Screened

Mid-level AI/ML Engineer specializing in healthcare imaging and GenAI/LLM systems

New York, USA6y exp

UnitedHealthcareAuburn University at Montgomery

“Built and deployed a production LLM/RAG clinical document understanding and summarization system for healthcare, focused on reducing manual review time while meeting strict accuracy, latency, and compliance needs. Demonstrates strong MLOps/orchestration depth (Airflow, Kubernetes, Azure ML Pipelines) and a rigorous approach to hallucination mitigation through layered, source-grounded safeguards and stakeholder-driven requirements with physicians/compliance teams.”

Python SQL R Java JavaScript Bash+157

View profile

Jimmy Dani

Screened

Mid-level AI Researcher specializing in privacy-preserving ML and applied cryptography

College Station, TX6y exp

Texas A&M UniversityTexas A&M University

“Graduate researcher who builds production-grade AI systems spanning LLM security evaluation and on-device RAG. Created HoneyLearner, a self-learning attack framework using GPT-4-class models as structured black-box attackers against honeywords defenses, with rigorous metrics and reproducible orchestration (Airflow/Spark/Kafka/Docker). Also partnered with agriculture scientists at Texas A&M–Corpus Christi to deliver UAV + 3D point-cloud crop-stress maps that cut time-to-insight ~40% and enabled ~30% earlier interventions.”

Python C C++Java SQL Bash+74

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?