Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

SREEJA REDDY Konda

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and predictive analytics

Kentwood, MI6y exp

Fifth Third BankUniversity of Central Missouri

“AI/ML Engineer at Fifth Third Bank who has shipped production fraud detection and risk analysis systems combining ML models with LLM-powered insights/explanations, including real-time monitoring, drift detection, and automated retraining under regulatory explainability constraints. Also built a hybrid-retrieval internal knowledge-base QA system (+20% top-5 relevance) and delivered a customer support chatbot that reduced first response time by 30% through strong stakeholder collaboration.”

Python SQL R Java Scala Scikit-learn+102

View profile

Bhavana Anna

Screened

Mid-level AI/ML Engineer specializing in fraud detection and Generative AI (RAG)

USA5y exp

USAAKennesaw State University

“AI/ML engineer who has shipped production LLM and ML systems, including a RAG pipeline that ingested ~500k insurance/client documents to help adjusters answer questions faster and more consistently. Experienced in handling messy real-world document formats, tuning retrieval/chunking, and reducing latency via vector search optimization, precomputed embeddings, and caching. Also built orchestrated fraud-detection deployment workflows using AWS Step Functions and SageMaker, and partners closely with non-technical operations teams on NLP automation.”

AWS AWS CloudFormation AWS Lambda BERT CI/CD Claude+82

View profile

sai Pavan

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines

5y exp

American Family InsuranceGeorge Mason University

“Built a production, real-time insurance claims document-understanding and fraud-detection pipeline using TensorFlow + fine-tuned BERT, deployed on AWS (SageMaker/Lambda/API Gateway) with automated retraining via MLflow and Jenkins. Addressed noisy documents and latency using augmentation and model distillation (3x faster), cutting claims ops manual review by ~50% and reducing fraudulent payouts.”

A/B Testing AI Agents Amazon API Gateway Amazon EC2 Amazon Kinesis Amazon Redshift+157

View profile

Allen Saunders

Screened

Senior DevOps/Solutions Engineer specializing in CI/CD, cloud platforms, and API integrations

San Francisco, California11y exp

SpiderOakSan Francisco State University

“Solutions Architect with 5+ years leading pre- and post-sales engagements, focused on taking complex tooling from test/prototype to secure production through a structured discovery-to-deployment approach. Experienced in LLM workflow troubleshooting using tools like Langfuse/Gopher and in developer enablement via concise, hands-on workshops (e.g., Jenkins on Kubernetes at scale). Has navigated internal and external blockers to drive adoption and keep enterprise deals moving (including a Jenkins sale to Love's).”

Python Rust SQL TensorFlow Apache Spark PySpark+107

View profile

Sai Bandaru

Screened

Mid-level Machine Learning Engineer specializing in fraud detection and LLM systems

Boston, MA6y exp

FiVerityNortheastern University

“At FiVerity, built and deployed a production LLM/RAG-based Information Gathering Tool for credit union fraud analysts that generates auditable investigation summaries from verified evidence. Focused on high-stakes constraints—hallucination prevention, cross-entity leakage controls, compliance/PII-safe monitoring, and latency—while also shipping customer-facing agentic workflows using CrewAI and LangGraph in close partnership with fraud and compliance stakeholders.”

Python PyTorch Hugging Face Transformers LoRA Scikit-learn XGBoost+105

View profile

Uttam Kumar

Screened

Intern AI Engineer specializing in LLM agents, RAG, and scalable cloud deployment

Atlanta, GA2y exp

GPT IntegratorsArizona State University

“AI/LLM engineer at GPT integrators who built a production multi-agent enterprise workflow integration system, tackling hard problems in agent orchestration, layered memory, and custom RAG over enterprise/user data. Also built an education-focused agent solution integrating with Canvas, Zoom, and email to automate classroom admin tasks, and is currently applying agentic AI to insurance underwriting workflows in collaboration with underwriters.”

Amazon DynamoDB Amazon EC2 Amazon S3 Apache Spark AWS AWS Lambda+114

View profile

Phanideep P

Screened

Senior Data Engineer specializing in cloud lakehouse and streaming data platforms

5y exp

Cadence BankWright State University

“Data platform/data engineer with cross-industry experience in banking and healthcare, building cloud-native lakehouse architectures across AWS/Azure/GCP. Has owned high-volume (millions of records; TB/day) pipelines with strong data quality automation (dbt/Great Expectations), observability (Grafana/Prometheus), and real-time streaming (Kafka/Spark) for fraud monitoring; also delivered an early-stage migration from SQL Server to BigQuery with 40% batch latency reduction.”

Python SQL Apache Spark PySpark Snowflake Databricks+126

View profile

Rahul Ganesan

Screened

Intern AI Engineer specializing in LLM systems, RAG, and cloud data pipelines

Washington, PA0y exp

Frazier Simplex Machine CompanyUniversity of Colorado Boulder

“Built and deployed a production Dockerized multimodal (voice+text) LLM agent for knowledge management that retrieves from Notion and documents and falls back to Tavily-powered web search with citations when internal notes are missing. Emphasizes production reliability via model-switching fallbacks, caching, strict structured outputs (Pydantic/JSON schema), and MCP-based orchestration with state-aware gating and monitoring to reduce redundant tool calls and improve success rates.”

Amazon RDS Apache Kafka Apache Spark AWS BigQuery CI/CD+99

View profile

Rajeshwar Peri

Screened

Mid-level Data Analyst specializing in healthcare and financial analytics

Chicago, IL5y exp

Elevance HealthIndiana Wesleyan University

“Healthcare analytics candidate with hands-on experience turning messy claims and CRM data into validated reporting tables, automating monthly reporting in Python/Airflow, and operationalizing churn metrics in SQL and Tableau. They appear especially strong in stakeholder-aligned metric design and delivered a reported ~10% churn reduction through cohort analysis, segmentation, and at-risk member targeting.”

Python SQL R dbt Pandas NumPy+93

View profile

Atharva Rajesh Patil

Screened

Junior Software Engineer specializing in backend systems and cloud-native applications

Texas, USA2y exp

AmdocsUniversity of Texas at Arlington

“Engineer with hands-on experience owning customer deployments for ordering and billing systems at Amdocs, including performance tuning, CI/CD improvements, and post-launch stabilization that delivered about 50% faster execution time. Also built and debugged an LLM-powered task prioritization app using Gemini, Streamlit, Python, and MongoDB, with a strong focus on prompt reliability, validation, and handling inconsistent real-world inputs.”

Java Python C++C#Go SQL+117

View profile

Srija Komaraneni

Screened

Mid-level AI Software Engineer specializing in backend systems and FinTech AI

USA4y exp

PNCConcordia University, St. Paul

“Data engineering/software development candidate who built a stock market pipeline and uses that project to demonstrate strong architectural thinking across Kafka, Spark, and Airflow. They stand out for a pragmatic approach to AI: using tools like Copilot, ChatGPT, LangChain, and AutoGen to accelerate development while maintaining human oversight, testing, and system-level decision making.”

Java Python C++JavaScript SQL Spring Boot+138

View profile

Abhinandan Saha

Screened

Mid-Level Software & Infrastructure Engineer specializing in cloud, distributed systems, and AI

Madison, WI5y exp

University of Wisconsin-MadisonUniversity of Wisconsin–Madison

“Backend/data engineer who helped evolve Bitnimbus LLC’s Kafka-as-a-service MVP from a monolith into an event-driven distributed system, using careful design, parallel rollouts, and idempotent event handling to maintain correctness. Also built production-grade API and database security (JWT scopes, rate limiting, explicit Postgres policies/RLS-style controls) and improved Prometheus monitoring by eliminating false outages via heartbeat metrics and windowed aggregation.”

Ansible Apache Cassandra Apache Hadoop Apache Kafka Apache Spark AWS+89

View profile

Manohar Vallabi

Screened

Mid-Level Software Engineer specializing in Java/Spring microservices and cloud event-driven systems

California, US5y exp

LTIMindtreeCalifornia State University, Long Beach

“LLM/agentic-systems practitioner who has repeatedly taken LLM-driven pricing/decision services from prototype to production using pilots, guardrails, observability, and staged rollouts. Demonstrates strong real-time incident troubleshooting (dependency timeouts, cached fallbacks) and post-incident hardening (isolation/async/alerts), and also supports go-to-market via developer workshops, technical demos, and sales-aligned POCs.”

Java Spring Boot Spring MVC GraphQL Microservices Architecture Hibernate+169

View profile

Meet Doshi

Screened

Mid-level Data Engineer specializing in cloud data platforms and AI/ML analytics

Chicago, IL4y exp

EDNANortheastern University

“Backend/data engineer in healthcare who built an AWS-based clinical analytics platform from scratch (DynamoDB/S3/Airflow/dbt) with sub-second clinician query goals, 99.9% uptime, and HIPAA-grade controls (KMS encryption, IAM RBAC, audit trails). Also modernized ML delivery by replacing a manual 4-hour deployment with a 30-minute Docker/GitHub Actions CI/CD pipeline using parallel runs, parity testing, and rollback, and caught critical EHR data edge cases (date formats/timezones) that could have impacted patient care.”

Python PySpark SQL R Java Scala+120

View profile

Meghavardhan Ketireddi

Screened

Mid-level AI & Machine Learning Engineer specializing in Generative AI and MLOps

USA6y exp

Northern TrustUniversity of North Texas

“Built a production GPT-4/LangChain/Pinecone RAG “AI Copilot” at Northern Trust to automate financial report generation and analyst Q&A over internal structured (SQL warehouse) and unstructured policy data. Focused on real-world production challenges—grounding and latency—achieving major speed gains (seconds to milliseconds) via MiniLM embedding optimization and Redis caching, and implemented rigorous testing/evaluation with MLflow-backed metrics while aligning compliance and finance stakeholders for deployment.”

Python SQL Bash Java TypeScript PyTorch+127

View profile

Hariom Vyas

Screened

Senior Laboratory Technician specializing in clinical diagnostics and quality compliance

Los Angeles, CA8y exp

Innovative Health DiagnosticsCalifornia State University Channel Islands

“Forward-deployed, full-stack/platform engineer who owns production features end-to-end across frontend, backend, data, and infrastructure (AWS serverless, Terraform, React). Has modernized critical fintech/payment systems (zero-downtime monolith-to-microservices with Kafka event sourcing) and productionized AI-native support workflows (LLM + RAG on Pinecone) with measurable gains in latency, incidents, CSAT, and support efficiency.”

Documentation Inventory management Time management Apache Spark Automated Testing AWS+233

View profile

Alekhya Parimala Koppolu

Screened

Mid-level AI/ML Software Engineer specializing in data pipelines, BI dashboards, and computer vision

Wichita, Kansas3y exp

Friends UniversityFriends University

“Graduate Assistant Intern at Friends University who built and deployed a GenAI-driven requirement understanding system that automates extraction and semantic grouping of technical requirements from large unstructured documents. Demonstrates strong LLM engineering rigor (golden datasets, regression testing, post-processing validation) and production-minded delivery using LangChain/LlamaIndex orchestration, FastAPI microservices, Docker, and cloud deployment.”

Python SQL R Java C C+++119

View profile

Sai Charan C

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS

CT, USA3y exp

HCLTechUniversity of New Haven

“Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.”

A/B Testing Agile Amazon CloudWatch Amazon DynamoDB Amazon S3 Apache Airflow+139

View profile

KHUSHBU KAKDIYA

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and cloud MLOps

California, USA6y exp

CVS HealthCleveland State University

“Built and deployed a production LLM/RAG system at CVS to automate clinical documents, addressing PHI compliance, retrieval accuracy, and latency; achieved a 35–40% reduction in review effort through chunking and FP16/INT8 optimization. Also has experience translating AI outputs into actionable insights for non-technical stakeholders (sports analysts).”

Python SQL PySpark R Bash Scikit-learn+114

View profile

Muraliprasath Ponnuswamy

Screened

Mid-level Full-Stack Software Engineer specializing in Java/Spring microservices and cloud

Frisco, TX5y exp

ISOArizona State University

“Backend-focused engineer with experience owning a production e-commerce platform end-to-end (TypeScript/Node/Express, React, MongoDB, Redis) including RBAC and contract-based API development. Also worked at Infosys on a large healthcare management system built with Spring Boot microservices, using Kafka for messaging/retries, circuit breakers for resilience, and OpenTelemetry/Swagger for observability and API documentation.”

Java SQL Spring Boot Spring MVC Microservices Hibernate+114

View profile

HemaSri Perumalla

Screened

Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics

Reston, VA4y exp

TruistUniversity of Central Missouri

“ML/AI engineer with production experience in high-scale banking fraud detection at Truist, building an end-to-end pipeline (Airflow/AWS Glue/Snowflake, PyTorch/sklearn) with automated retraining and Kubernetes-based deployment; delivered measurable gains (22% fewer false positives, 15% higher recall) and reduced manual ops ~40%. Also partnered with clinicians at Kellton to deploy an LLM system for summarizing/classifying clinical notes, improving review time and decision speed.”

A/B Testing Agile Apache Kafka Apache Spark AWS Glue AWS Lambda+108

View profile

Alexander Conn

Screened

Principal Data Scientist specializing in cybersecurity ML and MLOps

New York, NY15y exp

Beyond IdentityIowa State University

“ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).”

Machine Learning Artificial Intelligence Supervised Learning Deep Learning Computer Vision Reinforcement Learning+118

View profile

Ramya Konda

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and generative AI

Remote, USA5y exp

HumanaUniversity of New Haven

“AI/LLM engineer at Humana who built and deployed a HIPAA-aware RAG system for clinical record retrieval, cutting search time dramatically and improving retrieval efficiency by 30%. Experienced with Spark-scale data preprocessing, QLoRA fine-tuning, LangChain orchestration, and MLflow+SageMaker integration, with a strong testing/evaluation discipline (A/B tests, human eval) to hit 95%+ accuracy and production latency targets.”

Python R SQL PostgreSQL BigQuery Snowflake+108

View profile

Manvi Panjwani

Screened

Mid-level Machine Learning Engineer specializing in cloud, governance automation, and distributed systems

San Francisco, CA4y exp

SoftmaxClark University

“Governance engineer intern at GSK who built policy-as-code automation using Open Policy Agent/Rego integrated into GitHub CI/CD and Terraform workflows. Also built and shipped a voice-enabled expense tracking app using speech-to-text + LLM structured extraction with strong validation, retries, and semantic guardrails, and designed the supporting PostgreSQL data model with performance-focused indexing.”

Python Java C C++SQL HTML+97

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?