Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

Mukesh Rajmohan

Screened

Mid-level Data Engineer specializing in AWS/Azure pipelines and streaming analytics

VA, USA5y exp

UnitedHealth GroupGeorge Mason University

“Data engineer with experience across healthcare and geospatial risk systems, owning end-to-end pipelines from ingestion through serving on AWS/Azure stacks. Built HIPAA-compliant data quality gates and CDC for millions of daily claims, and also delivered a real-time wildfire risk platform with 20-minute refresh cycles and a 60% data accuracy lift. Strong in streaming (Kafka), Spark performance tuning, and production-grade orchestration/CI/CD (Airflow, Docker, Jenkins, GitHub Actions, Terraform).”

Python SQL Java AWS Amazon S3 AWS Lambda+95

View profile

Akhil Reddy Edla

Screened

Senior Data Engineer specializing in cloud data platforms and automated data quality

Houston, TX4y exp

CenterPoint EnergyUniversity of Central Missouri

“Data engineer at CenterPoint Energy who built and operated multiple production-grade GCP data systems: a daily Snowflake→BigQuery replication framework (150+ tables) with Monte Carlo/Atlan-driven observability and schema-drift protection, plus a FastAPI metrics service for pipeline health. Demonstrated measurable impact (40% faster dashboard queries, 70% less manual refresh work, zero data loss) and strong operational rigor (scaling Cloud Run jobs, SAP SLT reconciliation, quarantine patterns, CI/CD via GitHub Actions + Terraform).”

Apache Airflow Apache Kafka Apache Spark API Development AWS AWS Glue+116

View profile

Rushir Bhavsar

Screened

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp

Cadence Design SystemsArizona State University

“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”

AI Agents Angular Apache Spark AWS AWS CloudFormation AWS Lambda+121

View profile

Sri Lalitha

Screened

Senior Full-Stack Java Engineer specializing in cloud-native microservices and FinTech

California, USA6y exp

JoydropJawaharlal Nehru Technological University

“Backend engineer who owned a Python task management API with JWT auth, async notifications, and performance work (DB optimization/caching) to handle high volumes. Led an on-prem to Azure private cloud migration at Morgan Stanley using GitOps and IaC (Terraform/ARM) with phased rollout and rollback planning. Also built a Kafka real-time streaming pipeline with exactly-once/idempotent consumers and Prometheus/Grafana monitoring.”

Java Kotlin JavaScript TypeScript Node.js SQL+138

View profile

Yijun Chen

Screened

Senior Full-Stack Software Developer specializing in IoT and cloud systems

Toronto, ON4y exp

PulsenicsUniversity of Toronto

“Frontend-focused engineer who built a full movie recommendation system from concept to production, comparing classic collaborative filtering with LLM-based recommendation approaches on AWS. Emphasizes scalable architecture, strict TypeScript data contracts, and high-quality Next.js/React UI patterns (defensive states, scoped state management, performance optimization) with disciplined QA and feature-flagged rollouts.”

Agile Apache Hadoop Apache Kafka Apache Spark Azure Data Factory Azure DevOps+82

View profile

Sana Khan

Screened

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech

Oklahoma, USA4y exp

Capital OneOklahoma Christian University

“ML/LLM engineer who has deployed a production LLM-powered assistant for intent classification and query routing (order recommendation/support deflection), combining BERT fine-tuning with an embedding-based retrieval layer and optimizing for low-latency inference. Experienced with end-to-end reliability practices—Airflow-orchestrated ETL, data validation/alerting, MLflow experiment tracking, and iterative improvements driven by user feedback and monitoring.”

Python SQL NumPy Pandas Bash PySpark+97

View profile

Chandra Shekar Akkandra

Screened

Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services

Newark, CA5y exp

JPMorgan ChaseUniversity of Missouri-Kansas City

“Finance-domain ML/LLM engineer who has shipped production systems including a RAG-based financial insights assistant with a custom post-generation validation layer that verifies atomic claims against retrieved source text to prevent hallucinations in compliance-critical workflows. Also built large-scale MLOps automation on AWS using Kubeflow + MLflow + CI/CD for fraud detection and credit risk models processing 500M+ transactions/day with a 99.99% uptime goal, and partnered closely with JP Morgan risk/compliance stakeholders on NLP-driven compliance monitoring.”

A/B Testing Amazon DynamoDB Amazon EC2 Amazon ECS Amazon EKS Amazon Kinesis+136

View profile

Joseph Winston

Screened

Executive Systems Architect specializing in distributed edge-to-cloud and real-time data platforms

Houston, TX14y exp

Stasis Drilling SolutionsLuther Seminary

“Has worked across multiple startup stages from pre-funding through Series D and emphasizes rigorous idea validation through direct conversations with both end users and purchasing decision-makers. Interested in applying NLP to automate summarization/abstracting of highly technical articles, with a balanced view of entrepreneurship that prioritizes health and family.”

API Design Apache Cassandra Apache Kafka Apache Spark AWS AWS Lambda+112

View profile

Chakravarthy V P

Screened

Executive AI Consultant/CTO specializing in Agentic AI, GenAI, and cloud-native data platforms

Texas, USA21y exp

C4ScaleIndira Gandhi National Open University

“Bootstrapped founder and CTO of C4Scale, a 2.5-year-old services-led company delivering MVP-to-scale product/platform builds for high-value clients across 5+ countries (10+ projects). Strong fit for roles blending scalable SaaS platform engineering, technical org leadership, and practical AI adoption, with clear awareness of the operational and GTM challenges of scaling into enterprise.”

Agentic AI Generative AI Distributed Systems Cloud-Native Architecture AWS Kubernetes+48

View profile

Annie Suzan

Screened

Mid Software Engineer specializing in machine learning and real-time data systems

Remote, USA3y exp

ThoughtWorksArizona State University

“Hands-on implementation-focused candidate with experience owning cloud deployments and putting LLM/RAG workflows into production. They stand out for combining customer-facing deployment ownership with practical AI systems work, including retrieval tuning, hallucination mitigation, production incident response, and document-processing pipelines for messy real-world inputs.”

Python Java JavaScript SQL Bash React+121

View profile

Nidhish Rao Bairineni

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and MLOps

5y exp

Wells FargoSouthern Methodist University

“Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.”

Artificial Intelligence Machine Learning Generative AI Large Language Models OpenAI Claude+173

View profile

Rohit Vibhu Channananjundarya

Screened

Mid-level Software Engineer specializing in distributed systems and full-stack platforms

Chicago, IL6y exp

ExpediaUniversity of Illinois Chicago

“Engineer who treats AI as a force multiplier rather than a replacement for judgment, with hands-on experience using tools like Claude Code, Cursor, Copilot, and Codex across planning, coding, testing, and review. Particularly notable for building a multi-agent PR review system that automated summarization, risk scanning, schema validation, and test suggestions, helping the team shift reviewer time toward architecture and business logic.”

Full Stack Development Agile Java Kotlin Spring Boot GraphQL+104

View profile

Hilary Lutz

Screened

Intern IT and cybersecurity professional with data and Python skills

Philadelphia, PA5y exp

ProsciaBryn Mawr College

“Internship experience at Arkema and Proscia focused on improving onboarding and internal automation workflows. Built SQL-based processes for computer onboarding and security compliance checks, redesigned cybersecurity onboarding for different departments, and created templated setup instructions with GitHub-based review safeguards.”

SQL Python Data Validation Data Analysis Cross-Functional Collaboration Data Pipelines+54

View profile

Teja Paladagu

Screened

Mid-level Software Engineer specializing in full-stack and AI-powered cloud applications

Dearborn, MI7y exp

FordRutgers University–New Brunswick

“Currently building a DBC (Digital Birth Certificate) agentic AI system to speed root cause investigation for quality issues at their company. They bring hands-on experience designing and leading multi-agent workflows, including orchestrator/root-agent patterns, evaluation agents, clarification agents, and practical guardrails for hallucination, bias, and rate-limit management.”

Java Python SQL JavaScript TypeScript Spring Boot+116

View profile

Adit Shah

Screened

Mid AI/ML Engineer specializing in computer vision, NLP, and LLM systems

USA4y exp

Omnic.AINortheastern University

“AI/full-stack engineer in gaming analytics who joined Omnic.ai at a 2-person stage, helped grow with the company, and built both backend and frontend for real-time gameplay analysis products. He combines computer vision production experience with LLM/RAG systems work, and has already led 4 employees while shipping 12 models in a fast-moving startup environment.”

Python SQL Data Structures Algorithms REST APIs FastAPI+143

View profile

Nithin Raghava Aitha

Screened

Mid Software Engineer specializing in FinTech and ML-powered backend systems

Arlington, VA4y exp

Global PaymentsGeorge Washington University

“Backend-leaning full-stack engineer who has shipped real-time, customer-facing dashboards and ticketing/payment features at Freshworks and Global Payments. Strong in Python API design (Django/Flask/FastAPI) and React/TypeScript UIs, with hands-on experience scaling PostgreSQL for high transaction volumes and operating services on AWS, including incident response and HIPAA-aligned security controls.”

Python Java JavaScript C#TypeScript HTML+148

View profile

Keerthana Priya

Screened

Mid-level Data Analytics & ML Engineer specializing in NLP, LLMs, and cloud data platforms

Dallas, TX5y exp

MattelKennesaw State University

“At KPMG, built and productionized a secure RAG-based LLM assistant that lets business and risk stakeholders query data warehouses in natural language, reducing dependence on data engineers for ad-hoc analysis. Demonstrates strong production rigor (Airflow orchestration, CI/CD, containerization), retrieval/embedding tuning (rechunking, semantic abstraction for structured data), and reliability controls (confidence thresholds, refusal behavior, monitoring and canary evals).”

SQL Python R PySpark Apache Spark Pandas+123

View profile

Sharath Bandi

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal generation

Saint Louis, Missouri4y exp

LSEGAvila University

“Open-source JavaScript contributor focused on performance and maintainability in data visualization libraries—refactored legacy ES5 into modular ES6, added tests/docs, and delivered ~30% faster load times with positive community adoption. Also optimized a React dashboard (~40% load-time reduction) and took ownership in an ambiguous AI product initiative by setting milestones, standing up an initial ML pipeline, and shipping a prototype in ~6 weeks that became the basis for production.”

A/B Testing Apache Airflow Apache Hadoop Apache Hive Apache Kafka Apache Spark+225

View profile

Sravan Kumar Jajam

Screened

Mid-level Data Scientist / ML Engineer specializing in streaming ML systems for healthcare and IoT

Urbandale, IA4y exp

John DeereAuburn University at Montgomery

“ML/GenAI engineer with production experience building an LLM-powered governance layer that summarizes verified drift/performance signals into validation reports and release notes, designed for regulated environments with de-identification and non-blocking fallbacks. Strong Airflow-based orchestration background across healthcare and finance, integrating Databricks/Spark and MLflow for scalable retraining/monitoring. Demonstrated ability to partner with non-technical healthcare operations teams to deliver actionable risk-scoring outputs via dashboards and automated reporting.”

Python R SQL Bash Pandas NumPy+127

View profile

Sowmya Sree

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

Dallas, TX5y exp

Bank of AmericaUniversity of North Texas

“Built production LLM systems including a real-time customer feedback analysis and workflow automation platform using RAG and multi-agent orchestration with confidence-based human escalation, addressing privacy and legacy integration challenges. Also automated ML operations with Airflow/Kubernetes (e.g., daily churn model retraining) cutting retraining time to under 30 minutes, and demonstrates a rigorous testing/monitoring approach plus strong non-technical stakeholder collaboration.”

Python Java Spring Boot JavaScript R Bash+148

View profile

Hanish Kukkala

Screened

Mid-level Data Scientist specializing in Generative AI and NLP

USA6y exp

CVS HealthUniversity of Central Missouri

“ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).”

A/B Testing Apache Hadoop Apache Hive Apache Kafka Apache Spark AWS+170

View profile

Vamshi Arempula

Screened

Senior AI/ML Engineer specializing in Generative AI, RAG, and agentic systems

6y exp

Wellmark Blue Cross and Blue ShieldIndiana Wesleyan University

“GenAI/LLM ML engineer (currently at Webprobo) building an enterprise GenAI platform with document intelligence and automation on AWS and blockchain. Has hands-on experience with RAG, LLM evaluation tooling, and orchestrating production LLM workflows with Apache Airflow, plus deep exposure to reliability challenges in globally distributed/edge deployments. Also partnered with business/marketing stakeholders at a banking client to deliver an AI-driven customer retention insights solution.”

A/B Testing Agile Amazon API Gateway Amazon Bedrock Amazon CloudWatch Amazon Redshift+212

View profile

Mayur Komaravelly

Screened

Senior Data Analyst specializing in data pipelines, web scraping, and legal data enrichment

Illinois, USA5y exp

The HartfordIndiana Wesleyan University

“Data engineer focused on reliable, scalable analytics pipelines and external data collection. Has owned end-to-end pipelines processing 5–10M records/day, serving Snowflake data marts to Power BI/Tableau, and reports ~99% reliability through strong validation/monitoring. Also shipped versioned REST APIs for curated data with query optimization and caching.”

Apache Airflow Apache Kafka Apache Spark Ansible API Design AWS Glue+140

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?