Vetted Apache Spark Professionals

Pre-screened and vetted.

SREEJA REDDY Konda - Mid-level AI/ML Engineer specializing in NLP, MLOps, and predictive analytics in Kentwood, MI

Mid-level AI/ML Engineer specializing in NLP, MLOps, and predictive analytics

Kentwood, MI6y exp
Fifth Third BankUniversity of Central Missouri

AI/ML Engineer at Fifth Third Bank who has shipped production fraud detection and risk analysis systems combining ML models with LLM-powered insights/explanations, including real-time monitoring, drift detection, and automated retraining under regulatory explainability constraints. Also built a hybrid-retrieval internal knowledge-base QA system (+20% top-5 relevance) and delivered a customer support chatbot that reduced first response time by 30% through strong stakeholder collaboration.

View profile
Bhavana Anna - Mid-level AI/ML Engineer specializing in fraud detection and Generative AI (RAG) in USA

Bhavana Anna

Screened

Mid-level AI/ML Engineer specializing in fraud detection and Generative AI (RAG)

USA5y exp
USAAKennesaw State University

AI/ML engineer who has shipped production LLM and ML systems, including a RAG pipeline that ingested ~500k insurance/client documents to help adjusters answer questions faster and more consistently. Experienced in handling messy real-world document formats, tuning retrieval/chunking, and reducing latency via vector search optimization, precomputed embeddings, and caching. Also built orchestrated fraud-detection deployment workflows using AWS Step Functions and SageMaker, and partners closely with non-technical operations teams on NLP automation.

View profile
sai Pavan - Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines

sai Pavan

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines

5y exp
American Family InsuranceGeorge Mason University

Built a production, real-time insurance claims document-understanding and fraud-detection pipeline using TensorFlow + fine-tuned BERT, deployed on AWS (SageMaker/Lambda/API Gateway) with automated retraining via MLflow and Jenkins. Addressed noisy documents and latency using augmentation and model distillation (3x faster), cutting claims ops manual review by ~50% and reducing fraudulent payouts.

View profile
Allen Saunders - Senior DevOps/Solutions Engineer specializing in CI/CD, cloud platforms, and API integrations in San Francisco, California

Senior DevOps/Solutions Engineer specializing in CI/CD, cloud platforms, and API integrations

San Francisco, California11y exp
SpiderOakSan Francisco State University

Solutions Architect with 5+ years leading pre- and post-sales engagements, focused on taking complex tooling from test/prototype to secure production through a structured discovery-to-deployment approach. Experienced in LLM workflow troubleshooting using tools like Langfuse/Gopher and in developer enablement via concise, hands-on workshops (e.g., Jenkins on Kubernetes at scale). Has navigated internal and external blockers to drive adoption and keep enterprise deals moving (including a Jenkins sale to Love's).

View profile
Sai Bandaru - Mid-level Machine Learning Engineer specializing in fraud detection and LLM systems in Boston, MA

Sai Bandaru

Screened

Mid-level Machine Learning Engineer specializing in fraud detection and LLM systems

Boston, MA6y exp
FiVerityNortheastern University

At FiVerity, built and deployed a production LLM/RAG-based Information Gathering Tool for credit union fraud analysts that generates auditable investigation summaries from verified evidence. Focused on high-stakes constraints—hallucination prevention, cross-entity leakage controls, compliance/PII-safe monitoring, and latency—while also shipping customer-facing agentic workflows using CrewAI and LangGraph in close partnership with fraud and compliance stakeholders.

View profile
Uttam Kumar - Intern AI Engineer specializing in LLM agents, RAG, and scalable cloud deployment in Atlanta, GA

Uttam Kumar

Screened

Intern AI Engineer specializing in LLM agents, RAG, and scalable cloud deployment

Atlanta, GA2y exp
GPT IntegratorsArizona State University

AI/LLM engineer at GPT integrators who built a production multi-agent enterprise workflow integration system, tackling hard problems in agent orchestration, layered memory, and custom RAG over enterprise/user data. Also built an education-focused agent solution integrating with Canvas, Zoom, and email to automate classroom admin tasks, and is currently applying agentic AI to insurance underwriting workflows in collaboration with underwriters.

View profile
Phanideep P - Senior Data Engineer specializing in cloud lakehouse and streaming data platforms

Phanideep P

Screened

Senior Data Engineer specializing in cloud lakehouse and streaming data platforms

5y exp
Cadence BankWright State University

Data platform/data engineer with cross-industry experience in banking and healthcare, building cloud-native lakehouse architectures across AWS/Azure/GCP. Has owned high-volume (millions of records; TB/day) pipelines with strong data quality automation (dbt/Great Expectations), observability (Grafana/Prometheus), and real-time streaming (Kafka/Spark) for fraud monitoring; also delivered an early-stage migration from SQL Server to BigQuery with 40% batch latency reduction.

View profile
Rahul Ganesan - Intern AI Engineer specializing in LLM systems, RAG, and cloud data pipelines in Washington, PA

Rahul Ganesan

Screened

Intern AI Engineer specializing in LLM systems, RAG, and cloud data pipelines

Washington, PA0y exp
Frazier Simplex Machine CompanyUniversity of Colorado Boulder

Built and deployed a production Dockerized multimodal (voice+text) LLM agent for knowledge management that retrieves from Notion and documents and falls back to Tavily-powered web search with citations when internal notes are missing. Emphasizes production reliability via model-switching fallbacks, caching, strict structured outputs (Pydantic/JSON schema), and MCP-based orchestration with state-aware gating and monitoring to reduce redundant tool calls and improve success rates.

View profile
Rajeshwar Peri - Mid-level Data Analyst specializing in healthcare and financial analytics in Chicago, IL

Mid-level Data Analyst specializing in healthcare and financial analytics

Chicago, IL5y exp
Elevance HealthIndiana Wesleyan University

Healthcare analytics candidate with hands-on experience turning messy claims and CRM data into validated reporting tables, automating monthly reporting in Python/Airflow, and operationalizing churn metrics in SQL and Tableau. They appear especially strong in stakeholder-aligned metric design and delivered a reported ~10% churn reduction through cohort analysis, segmentation, and at-risk member targeting.

View profile
AR

Junior Software Engineer specializing in backend systems and cloud-native applications

Texas, USA2y exp
AmdocsUniversity of Texas at Arlington

Engineer with hands-on experience owning customer deployments for ordering and billing systems at Amdocs, including performance tuning, CI/CD improvements, and post-launch stabilization that delivered about 50% faster execution time. Also built and debugged an LLM-powered task prioritization app using Gemini, Streamlit, Python, and MongoDB, with a strong focus on prompt reliability, validation, and handling inconsistent real-world inputs.

View profile
SK

Mid-level AI Software Engineer specializing in backend systems and FinTech AI

USA4y exp
PNCConcordia University, St. Paul

Data engineering/software development candidate who built a stock market pipeline and uses that project to demonstrate strong architectural thinking across Kafka, Spark, and Airflow. They stand out for a pragmatic approach to AI: using tools like Copilot, ChatGPT, LangChain, and AutoGen to accelerate development while maintaining human oversight, testing, and system-level decision making.

View profile
AS

Mid-Level Software & Infrastructure Engineer specializing in cloud, distributed systems, and AI

Madison, WI5y exp
University of Wisconsin-MadisonUniversity of Wisconsin–Madison

Backend/data engineer who helped evolve Bitnimbus LLC’s Kafka-as-a-service MVP from a monolith into an event-driven distributed system, using careful design, parallel rollouts, and idempotent event handling to maintain correctness. Also built production-grade API and database security (JWT scopes, rate limiting, explicit Postgres policies/RLS-style controls) and improved Prometheus monitoring by eliminating false outages via heartbeat metrics and windowed aggregation.

View profile
MV

Mid-Level Software Engineer specializing in Java/Spring microservices and cloud event-driven systems

California, US5y exp
LTIMindtreeCalifornia State University, Long Beach

LLM/agentic-systems practitioner who has repeatedly taken LLM-driven pricing/decision services from prototype to production using pilots, guardrails, observability, and staged rollouts. Demonstrates strong real-time incident troubleshooting (dependency timeouts, cached fallbacks) and post-incident hardening (isolation/async/alerts), and also supports go-to-market via developer workshops, technical demos, and sales-aligned POCs.

View profile
MD

Meet Doshi

Screened

Mid-level Data Engineer specializing in cloud data platforms and AI/ML analytics

Chicago, IL4y exp
EDNANortheastern University

Backend/data engineer in healthcare who built an AWS-based clinical analytics platform from scratch (DynamoDB/S3/Airflow/dbt) with sub-second clinician query goals, 99.9% uptime, and HIPAA-grade controls (KMS encryption, IAM RBAC, audit trails). Also modernized ML delivery by replacing a manual 4-hour deployment with a 30-minute Docker/GitHub Actions CI/CD pipeline using parallel runs, parity testing, and rollback, and caught critical EHR data edge cases (date formats/timezones) that could have impacted patient care.

View profile
MK

Mid-level AI & Machine Learning Engineer specializing in Generative AI and MLOps

USA6y exp
Northern TrustUniversity of North Texas

Built a production GPT-4/LangChain/Pinecone RAG “AI Copilot” at Northern Trust to automate financial report generation and analyst Q&A over internal structured (SQL warehouse) and unstructured policy data. Focused on real-world production challenges—grounding and latency—achieving major speed gains (seconds to milliseconds) via MiniLM embedding optimization and Redis caching, and implemented rigorous testing/evaluation with MLflow-backed metrics while aligning compliance and finance stakeholders for deployment.

View profile
HV

Hariom Vyas

Screened

Senior Laboratory Technician specializing in clinical diagnostics and quality compliance

Los Angeles, CA8y exp
Innovative Health DiagnosticsCalifornia State University Channel Islands

Forward-deployed, full-stack/platform engineer who owns production features end-to-end across frontend, backend, data, and infrastructure (AWS serverless, Terraform, React). Has modernized critical fintech/payment systems (zero-downtime monolith-to-microservices with Kafka event sourcing) and productionized AI-native support workflows (LLM + RAG on Pinecone) with measurable gains in latency, incidents, CSAT, and support efficiency.

View profile
AP

Mid-level AI/ML Software Engineer specializing in data pipelines, BI dashboards, and computer vision

Wichita, Kansas3y exp
Friends UniversityFriends University

Graduate Assistant Intern at Friends University who built and deployed a GenAI-driven requirement understanding system that automates extraction and semantic grouping of technical requirements from large unstructured documents. Demonstrates strong LLM engineering rigor (golden datasets, regression testing, post-processing validation) and production-minded delivery using LangChain/LlamaIndex orchestration, FastAPI microservices, Docker, and cloud deployment.

View profile
SC

Sai Charan C

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS

CT, USA3y exp
HCLTechUniversity of New Haven

Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.

View profile
KK

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and cloud MLOps

California, USA6y exp
CVS HealthCleveland State University

Built and deployed a production LLM/RAG system at CVS to automate clinical documents, addressing PHI compliance, retrieval accuracy, and latency; achieved a 35–40% reduction in review effort through chunking and FP16/INT8 optimization. Also has experience translating AI outputs into actionable insights for non-technical stakeholders (sports analysts).

View profile
MP

Mid-level Full-Stack Software Engineer specializing in Java/Spring microservices and cloud

Frisco, TX5y exp
ISOArizona State University

Backend-focused engineer with experience owning a production e-commerce platform end-to-end (TypeScript/Node/Express, React, MongoDB, Redis) including RBAC and contract-based API development. Also worked at Infosys on a large healthcare management system built with Spring Boot microservices, using Kafka for messaging/retries, circuit breakers for resilience, and OpenTelemetry/Swagger for observability and API documentation.

View profile
HP

Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics

Reston, VA4y exp
TruistUniversity of Central Missouri

ML/AI engineer with production experience in high-scale banking fraud detection at Truist, building an end-to-end pipeline (Airflow/AWS Glue/Snowflake, PyTorch/sklearn) with automated retraining and Kubernetes-based deployment; delivered measurable gains (22% fewer false positives, 15% higher recall) and reduced manual ops ~40%. Also partnered with clinicians at Kellton to deploy an LLM system for summarizing/classifying clinical notes, improving review time and decision speed.

View profile
AC

Principal Data Scientist specializing in cybersecurity ML and MLOps

New York, NY15y exp
Beyond IdentityIowa State University

ML/NLP engineer (Beyond Identity) who built production semantic search and entity-resolution systems over internal security documentation, using LDA + BERT embeddings with FAISS/Pinecone to cut search time by 30%. Also scaled a real-time anomaly detection pipeline to millions of events/day with Spark and AWS Lambda, with strong emphasis on measurable validation (Precision@k, MRR, F1, ARI).

View profile
Ramya Konda - Mid-level AI/ML Engineer specializing in healthcare ML and generative AI in Remote, USA

Ramya Konda

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and generative AI

Remote, USA5y exp
HumanaUniversity of New Haven

AI/LLM engineer at Humana who built and deployed a HIPAA-aware RAG system for clinical record retrieval, cutting search time dramatically and improving retrieval efficiency by 30%. Experienced with Spark-scale data preprocessing, QLoRA fine-tuning, LangChain orchestration, and MLflow+SageMaker integration, with a strong testing/evaluation discipline (A/B tests, human eval) to hit 95%+ accuracy and production latency targets.

View profile
Manvi Panjwani - Mid-level Machine Learning Engineer specializing in cloud, governance automation, and distributed systems in San Francisco, CA

Mid-level Machine Learning Engineer specializing in cloud, governance automation, and distributed systems

San Francisco, CA4y exp
SoftmaxClark University

Governance engineer intern at GSK who built policy-as-code automation using Open Policy Agent/Rego integrated into GitHub CI/CD and Terraform workflows. Also built and shipped a voice-enabled expense tracking app using speech-to-text + LLM structured extraction with strong validation, retries, and semantic guardrails, and designed the supporting PostgreSQL data model with performance-focused indexing.

View profile

Need someone specific?

AI Search