
Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark, Python, Docker, SQL, AWS, CI/CD

SUJAY Kanakamedala

Screened

Mid-level AI Developer & Machine Learning Engineer specializing in LLM and MLOps systems

Champaign, IL · 5y exp

Centene · Eastern Illinois University

“Built and deployed an enterprise RAG application at Centene to help clinical teams retrieve insights from large internal policy document sets, cutting manual research by 30–40%. Implemented custom domain-adapted embeddings (SageMaker + BERT transfer learning) and hybrid retrieval (BM25 + Pinecone) to drive a 22% relevance lift, and ran the system in production on AWS EKS with CI/CD, MLflow, and Prometheus monitoring (99% uptime, ~40% latency reduction).”

A/B Testing, Agile, Agentic AI, Apache Kafka, Apache Spark, AWS (+145 more)


Likhitha Gandi

Screened

Junior Business Analytics & SAP BASIS professional specializing in AI and predictive modeling

Denton, TX · 3y exp

University of North Texas · University of North Texas

“Built and deployed a production LLM-powered email assistant (“wood flow”) for a local pet resort to automate after-hours inbound email handling, including email categorization and context-aware auto-responses. Uses n8n for orchestration and applies CRISP-DM, load/edge-case testing, and RAG-based context retrieval, and has experience presenting AI solutions with budgeting and ROI to a non-technical founder.”

Python, Pandas, NumPy, Scikit-Learn, SQL, R (+77 more)


Pandraju Gamanapriya

Screened

Mid-level Data Scientist specializing in healthcare ML and GenAI

San Marcos, TX · 4y exp

UnitedHealth Group · Texas State University

“Healthcare data/NLP practitioner with experience at UnitedHealthcare building production ML systems that connect unstructured call center transcripts and medical notes to structured claims data. Has delivered measurable impact (25% classification accuracy lift; ~30% relevance improvement) using classical NLP, embeddings (Sentence-BERT + FAISS), and AWS SageMaker deployments with robust validation and drift monitoring.”

Agile, Anomaly Detection, API Integration, AWS, AWS Glue, Bash (+106 more)


Leelakarthik Devisetty

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps

Atlanta, GA · 3y exp

AIG · Kennesaw State University

“Data professional with ~4 years of experience, most recently at AIG (insurance), building ML/NLP systems for fraud detection and policy automation using transformers, CNNs, and clustering/anomaly detection. Also developed a RAG-based knowledge retrieval system, iterating across embedding models and moving to production based on precision and latency SLAs, then containerizing and deploying with SageMaker and CI/CD.”

AWS, AWS Lambda, BERT, BigQuery, CI/CD, Claude (+143 more)


Rakesh Thota

Screened

Mid-level Data Engineer specializing in multi-cloud real-time data pipelines

California, USA · 5y exp

Molina Healthcare · University at Buffalo

“Data engineer with healthcare/clinical trial domain experience who owned a 100TB+/month AWS pipeline end-to-end (Glue/S3/Redshift/Airflow) and drove measurable outcomes (20% lower latency, 99.9% reliability, 40% less manual reporting). Also built production data services and API-based ingestion on GCP (Cloud Run/Functions/BigQuery) with strong validation, versioning, and safe migration practices, and launched an early-stage RAG solution (LangChain + GPT-4) for researchers.”

Python, SQL, Java, PySpark, Apache Spark, Apache Kafka (+136 more)


Gopichand Muppaneni

Screened

Mid-level Data Engineer specializing in Azure, Spark, and scalable ETL/ELT pipelines

Charleston, IL · 4y exp

Eastern Illinois University · Eastern Illinois University

“Data engineer with banking FP&A experience who led an end-to-end migration of 10+ TB from Teradata to Azure (ADF + Data Lake + Databricks/PySpark + Synapse). Emphasizes reliability (multi-stage validation, monitoring/alerts) and performance (Spark tuning, incremental loads, autoscaling), reporting ~99.5% pipeline reliability while supporting downstream consumers with stable schemas and clear change management.”

Python, SQL, PySpark, ETL, Data Pipelines, Data Modeling (+47 more)


Lerone Pieters

Screened

Senior Data Engineer specializing in cloud data platforms and real-time analytics

Remote, USA · 10y exp

Scale Media · New York City College of Technology (CUNY)

“Data/analytics engineer focused on finance and e-commerce integrations, building end-to-end pipelines and services across Odoo, QuickBooks, Snowflake, and Tableau. Replaced a costly third-party Walmart connector with a serverless AWS Lambda pipeline deployed via Terraform/GitHub and monitored with CloudWatch/Datadog, and shipped a bi-directional Odoo↔QuickBooks invoice sync with distributed locking plus Slack-based finance approvals.”

Python, SQL, Java, Scala, Go, JavaScript (+110 more)


Saiprasad Barkam

Screened

Mid-level Data Engineer specializing in cloud ETL and streaming data pipelines

Detroit, MI · 5y exp

Harmonecare · Auburn University at Montgomery

“Data engineer in healthcare/clinical data platforms (HarmonCare) who built and operated an end-to-end lakehouse pipeline ingesting HL7/FHIR at ~2–3M records/day on AWS (Glue/Lambda/S3/Spark) and serving trusted datasets in Snowflake. Implemented strong validation/reconciliation gates and a data quality framework that reduced discrepancies ~40%, plus CI/CD (GitHub Actions/Terraform) and monitoring (Airflow/CloudWatch).”

Python, SQL, PySpark, Scala, Shell scripting, Apache Spark (+89 more)


Anay Dongre

Screened

Junior Machine Learning Engineer specializing in GenAI and LLM fine-tuning

Pomona, California · 1y exp

Aerolift.AI · Cal Poly Pomona

“Robotics software engineer focused on hard real-time autonomy for legged robots, building a quadruped navigation stack that combines vision SLAM with MPC and maintains a deterministic 500Hz control loop. Deep performance optimization experience across CUDA (sub-2ms perception latency), ROS 2/DDS real-time tuning, and motion planning (cut 500ms spikes to sub-5ms). Also designed distributed ROS 2 + Zenoh communications between quadrupeds and aerial drones and validated robustness under lossy wireless conditions.”

AWS, AI Agents, Apache Spark, C++, CI/CD, Computer Vision (+118 more)


Sachin Dulla

Screened

Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps

Kentwood, MI · 3y exp

Fifth Third Bank · California State University, San Bernardino

“Built and deployed a domain-specific LLM chatbot for research/support, cutting manual effort by ~50%. Demonstrates strong applied LLM engineering: RAG, prompt grounding with citations and fallbacks, embedding/top-k tuning, and production monitoring (confidence, latency, feedback loops). Experienced orchestrating agent workflows with LangChain-style pipelines and continuous evaluation to maintain reliability.”

Amazon EC2, Amazon EKS, AWS, AWS Lambda, Azure Machine Learning, BERT (+93 more)


Shrinivas Bhusannavar

Screened

Mid-level AI Engineer specializing in agentic LLM systems and RAG platforms

San Jose, CA · 5y exp

SquareShift · San José State University

“Built and shipped Serrano AI, a multi-tenant SaaS conversational AI platform that automates Odoo ERP workflows and lets ops/finance/supply-chain teams query ERP data in natural language. Implemented a multi-agent architecture (LangChain/LangGraph/CrewAI) with hybrid RAG over ERP schemas, deployed on Heroku/Vercel with production observability, cutting reporting time by ~80% while addressing hallucinations, latency, and schema complexity.”

AI Agents, Apache Hadoop, Apache Kafka, Apache Spark, AWS, AWS Lambda (+154 more)


Bhavishyasai Chigurupati

Screened

Mid-level Data/ML Engineer specializing in Generative AI and cloud data platforms

Overland Park, KS · 5y exp

Cigna · University of Central Missouri

“Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.”

SDLC, Agile, Waterfall, Python, SQL, R (+179 more)


Arunima Mishra

Screened

Senior Technical Product Lead specializing in Data Governance and MDM SaaS platforms

Bengaluru, India · 7y exp

Infor · Manipal University Jaipur

“Technical/product lead at Albanero (acquired by Infor in 2024; now at Infor) who built a Data Mesh-focused “Governance as a Product” module from early persona-based policies through a highly configurable multi-ERP governance platform (MDM, multi-source mastering, match/merge, automated review workflows). Also troubleshoots agentic/LLM workflows in production using auditability, guardrails, monitoring, and real-time validation—fixing a P0 false-positive security flagging issue and contributing to significant deal/adoption growth (~50%) after V2 launch.”

Risk management, Data governance, Cross-functional leadership, Java, Spring Boot, Microservices (+92 more)


SaiSindhu Beeravolu

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and healthcare AI

Illinois, USA · 4y exp

UnitedHealth Group · University of Maryland, Baltimore County

“Healthcare ML/AI engineer with production experience at UnitedHealth Group, including an end-to-end readmission prediction system built on 50M+ patient records that improved accuracy by 18% and reduced preventable readmissions by 12%. Also shipped a clinically grounded LLM/RAG referral generator with human-in-the-loop safety controls, showing strong depth in regulated, high-stakes AI systems.”

Machine Learning, Artificial Intelligence, Supervised Learning, Feature Engineering, Model Evaluation, Hyperparameter Tuning (+100 more)


Almas Fathimah

Screened

Senior AI Engineer specializing in Generative AI and ML platforms

USA · 7y exp

Entera · University of Maryland, Baltimore County

“Built and owned a production RAG-based conversational AI system at Entera for real estate analysis, taking it from experimentation through AWS deployment, monitoring, and iterative improvement. Demonstrates strong practical judgment in retrieval design, LLM safety, and scalable Python service architecture, with measurable impact including 30-40% reduction in manual analysis time and roughly 30% better response accuracy.”

Python, SQL, Java, C++, JavaScript, Bash (+164 more)


Sreeraj Chintham

Screened

Mid-level Software Engineer specializing in Python backend and AI/GenAI

Jersey City, NJ · 4y exp

PTC · St. Francis College

“Backend/infrastructure-focused engineer building AI-agent products for small businesses, including a customer-service agent platform with intent routing, RAG over Pinecone, and external booking API integration. Has shipped Python/FastAPI services with JWT auth, versioned APIs, Docker deployments to AWS EC2 via GitHub Actions, and production monitoring with Prometheus/Grafana.”

Python, SQL, FastAPI, Flask, Django, REST APIs (+150 more)


Raghav Bajaj

Screened

Junior Software Engineer specializing in backend systems and ML applications

Modesto, CA · 2y exp

Center for Human Services (CHS) · UC Riverside

“Full-stack engineer with hands-on experience building and shipping production web products across AI, frontend, backend, and DevOps. Notably built an end-to-end resume-job matching platform during an internship that processed 1000+ resumes/day and cut recruiter screening effort by 60%, and later shipped an internal operations dashboard at CHS with measurable performance gains.”

Python, Java, TypeScript, SQL, C++, Spring Boot (+130 more)


Baliram Maurya

Screened

Senior Full-Stack Engineer specializing in Java microservices and Healthcare IT

USA · 11y exp

Bluethink

“Backend engineer with hands-on experience modernizing healthcare platforms in a startup-like team of 8-12 across engineering, QA, DevOps, and product. They personally drove scalable Java/Spring Boot microservices for healthcare workflows, including FHIR integrations, real-time data pipelines, and resilient integrations with legacy and third-party systems.”

Java, JavaScript, TypeScript, SQL, Shell Scripting, PowerShell (+185 more)


Saketh Kota

Screened

Mid-level Data Scientist / ML Engineer specializing in Generative AI, RAG, and MLOps

Irving, TX · 4y exp

U.S. Bank

“Built and productionized a RAG-based LLM research assistant for biomedical and regulatory document search using Mixtral 7B on SageMaker, LangChain, and Milvus, cutting research time by ~40%. Has hands-on multi-cloud MLOps experience across AWS/Azure/GCP with Kubeflow/Airflow/Composer plus Terraform + ArgoCD, and applies rigorous evaluation/monitoring (latency, accuracy, hallucinations). Also partnered with a non-technical PM to deliver an insurance policy Q&A chatbot that reduced customer response time by 30%+.”

Agile, A/B Testing, Amazon SageMaker, API Development, Argo CD, AWS (+185 more)
