Vetted Apache Spark Professionals

Pre-screened and vetted.

Mohan Naik Megavath - Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms in Remote, USA

Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms

Remote, USA4y exp
TruistElmhurst University

Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.

View profile
KP

Mid-level Data Engineer specializing in capital markets post-trade data platforms

Whippany, NJ3y exp
BarclaysUniversity of Connecticut

Data/streaming engineer in capital markets who led an end-to-end trade settlement data product (Kafka→MongoDB→data lake) with rigorous data-quality logic and ~$175K first-year operational impact. Also built a low-latency Go-based CME market data engine feeding SOFR curve generation, using MSK on EKS with performance tuning (idempotency, compression, partitioning) to achieve sub-100ms delivery.

View profile
MOUNIKA SAI MEKALA - Junior Data Analyst specializing in financial and operational analytics in Kansas, USA

Junior Data Analyst specializing in financial and operational analytics

Kansas, USA3y exp
KPMGUniversity of Central Missouri

Analytics professional with experience at KPMG turning messy operational and financial data from SQL Server and AWS S3 into clean reporting datasets and automated Python workflows. They combine SQL, Python, Power BI, and experimentation methods to deliver stakeholder-aligned KPI dashboards and marketing performance insights with a strong focus on data integrity and reproducibility.

View profile
AB

Junior Software Engineer specializing in full-stack, data engineering, and mobile apps

Seattle, WA3y exp
AmazonArizona State University

Built production LLM agents at Hivenue and Amazon, spanning consumer booking automation and internal data-query/reporting workflows. Stands out for combining conversational UX with strong reliability engineering—strict tool use, state machines, schema validation, idempotency, and evaluation pipelines—and can point to measurable impact including a 21% reduction in time to book and a 12% conversion lift.

View profile
SAITEJA MALLEMPUDI - Senior Data Scientist and AI/ML Engineer specializing in GenAI and cloud ML in Chicago, IL

Senior Data Scientist and AI/ML Engineer specializing in GenAI and cloud ML

Chicago, IL6y exp
BMOLewis University

ML/AI engineer with hands-on experience owning systems from experimentation through deployment and monitoring, including a Bank of Montreal project that improved timely interventions by 12%. Also brings GenAI/RAG experience with evaluation and safety guardrails, plus clinical NLP pipeline work extracting medication data from notes for patient risk prediction.

View profile
Akhila Kannegari - Mid-level AI/ML Engineer specializing in FinTech and retail ML systems in Alabama, USA

Mid-level AI/ML Engineer specializing in FinTech and retail ML systems

Alabama, USA4y exp
Wells FargoAuburn University at Montgomery

ML-focused candidate with strong Wells Fargo experience building production fraud systems and internal GenAI tools for fraud analysts. Stands out for measurable impact in fraud detection—raising recall from 71% to 88%—while also demonstrating hands-on depth across streaming infrastructure, MLOps, LLM/RAG implementation, and Python service architecture.

View profile
ST

Senior Software Engineer specializing in backend systems and data platforms

Texas, USA5y exp
WalmartNew England College

Software developer who uses AI pragmatically across the full stack to accelerate coding, testing, debugging, and documentation while maintaining strong human oversight. Stands out for treating AI output like any other code source—reviewing for architecture fit, security risks, performance, and standards before integration—and for coordinating multiple AI tools across backend, frontend, and test workflows.

View profile
Naveena Musku - Mid-level AI/ML Engineer specializing in agentic AI and LLM systems

Naveena Musku

Screened

Mid-level AI/ML Engineer specializing in agentic AI and LLM systems

5y exp
Western UnionJawaharlal Nehru Technological University

Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.

View profile
YL

Yaoxin Liu

Screened

Intern Software Engineer specializing in backend and full-stack systems

New York, NY1y exp
SevenRoomsNYU

Built and iterated an end-to-end virtual waiting room for a real-time ticketing prototype, making concrete architecture tradeoffs (polling + Redis Pub/Sub) and improving performance post-launch with Redis caching (+30% throughput, -15% p99 latency). Also has hands-on experience building Spark/HDFS ETL pipelines with strong reliability/observability patterns and running disciplined NLP model evaluation loops on review-rating classification.

View profile
GK

Gregory Kline

Screened

Principal Distributed Systems Engineer specializing in healthcare, defense, and finance platforms

Pittsburgh, PA25y exp
ArcadiaGrove City College

Engineer with experience in small, high-pressure innovation environments and enterprise healthcare platforms, spanning distributed systems, search, and database optimization. At RJ Lee Group, he helped pivot an Air Force document-processing platform from Pig/MapReduce to Apache Storm, enabling near-real-time results, and also built a full-stack natural-language search application that cut analyst investigations from months to weeks or days.

View profile
NP

Nihari Puli

Screened

Mid-level AI/ML Engineer specializing in LLM systems and agentic workflows

4y exp
OptumUniversity of Cincinnati

Built an agentic medical coding system at Optum that combined LangGraph, LangChain RAG, Azure OpenAI, pgvector, and TypeScript to automate routine clinical coding while escalating risky cases to humans. The system automated about 40% of routine cases at roughly 92% accuracy, with strong production evals and observability using MLflow, Ragas, and DeepEval.

View profile
NT

Mid-level Software Engineer specializing in full-stack cloud-native systems

New York, NY7y exp
Dune SecurityNYU

Backend/platform engineer from Dune Security with strong experience turning messy, fragmented workflows into reusable production systems. They’ve built a shared database abstraction layer, integrated multiple enterprise security platforms into a unified workflow, and shipped AWS Bedrock-powered security insight features with guardrails and human review.

View profile
NP

Neel Patel

Screened

Mid-level Python Backend Engineer specializing in cloud-native AI and observability systems

USA4y exp
ComcastUniversity at Buffalo

Backend/AI engineer who has shipped an LLM-powered enterprise support-ticket agent at Comcast, building a production-grade microservices pipeline (FastAPI, SQS, Redis) with strong observability (OpenTelemetry/Splunk/Prometheus/Grafana) and reliability patterns (async, caching, circuit breakers, idempotency). Demonstrated quantified impact at scale—processing 10k+ tickets/day while improving response SLAs and routing accuracy through evaluation and human feedback loops.

View profile
BS

Senior Software Engineer specializing in distributed systems and FinTech

Washington, USA6y exp
Principal Financial GroupTrine University

Data/analytics-focused engineer who builds end-to-end KPI reporting and validation products used daily by plant leads and leadership to track yield, downtime, and defects. Combines Python/SQL + Power BI data pipelines with strong data-quality practices (automated validation, monitoring/alerts) and has experience designing scalable frontend architecture in TypeScript/React and working in distributed/microservices-style data systems.

View profile
RG

Senior Full-Stack Developer specializing in Python, cloud microservices, and AI/ML

Oviedo, Florida11y exp
FocustAppsSt. Francis University

Backend/data engineer with hands-on production experience across GCP and AWS: built FastAPI microservices on Cloud Run and delivered AWS Lambda + ECS Fargate systems with Terraform/GitHub Actions. Strong in data engineering (Glue/Spark, S3/Redshift) and modernization (SAS to Python/SQL), with proven reliability and incident ownership—including cutting a 20+ minute reporting query to under 2 minutes.

View profile
SS

Mid-level AI Engineer specializing in LLMs, RAG, and content automation

Los Angeles, CA3y exp
Cloud9USC

AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.

View profile
DL

Senior Python Developer specializing in data engineering, MLOps, and cloud platforms

Dallas, TX13y exp
CBREAnna University

Backend/data engineer with production experience building secure Django/DRF APIs (JWT RS256 + rotating refresh tokens), background processing with Celery, and strong reliability practices (timeouts, retries/backoff, structured logging, audit trails). Has delivered AWS solutions spanning Lambda + ECS with IaC/CI-CD and built Glue/PySpark ETL pipelines with schema evolution and data-quality quarantine patterns; also modernized a legacy SAS pipeline to Python/PySpark with parallel-run parity validation and phased rollout.

View profile
AS

Anuj Shah

Screened

Senior Data Analyst specializing in cloud data platforms, experimentation, and predictive analytics

GA, USA9y exp
UnitedHealth GroupNorthwestern Polytechnic University

Healthcare data/ML practitioner with experience at UnitedHealth Group building production ETL and streaming pipelines (Python, BigQuery, Kafka) that unify EHR, IoT device, and lab data for patient risk prediction. Also implemented embedding-based semantic search/linking for noisy clinical notes via domain adaptation and rigorous validation with clinical stakeholders; previously built churn prediction at DirecTV using XGBoost.

View profile
KK

Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps

Remote, United States6y exp
AccentureEastern Illinois University

LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).

View profile
RJ

Ramesh Jasti

Screened

Mid-level AI/ML & MLOps Engineer specializing in cloud AI infrastructure and GenAI

San Jose, USA5y exp
HPEWestern Illinois University

At HPE, led and deployed an enterprise-grade LLM document intelligence platform for an insurance client, automating extraction from highly variable PDFs/scans/emails and raising field accuracy from 74% to 93%. Built a LangChain/Pinecone/OpenSearch RAG framework to cut hallucinations by 37% and operationalized LangSmith evals in CI, driving a 41% triage accuracy lift and >33% fewer incorrect resolutions while partnering closely with claims operations via HITL workflows.

View profile
PB

Junior Data Scientist / ML Engineer specializing in LLMs and Computer Vision

Tempe, Arizona2y exp
Arizona State UniversityArizona State University

Currently working in CoRAL Lab, built and deployed IntegrityShield—a document-layer PDF watermarking system that keeps assessments visually identical while disrupting LLM-based solving; validated in a real classroom where it helped catch 12 AI-cheating cases. Also built MALDOC, a modular red-teaming platform for document-processing AI agents using LangGraph to run reproducible, deterministic adversarial trials across OCR/text/vision routes.

View profile
KE

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA4y exp
S&P GlobalUniversity of Central Missouri

Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.

View profile
AK

Ansh Krishna

Screened

Intern Data Scientist specializing in ML systems and LLM-powered analytics

Noida, India1y exp
Data Security Council of IndiaUSC

Built an autonomous decision analytics LLM agent for end-to-end tabular binary classification, using RAG (FAISS) to retain context across multi-step queries. Deployed as a FastAPI service with production-style reliability features (schema-aware validation, fallbacks, retries, structured outputs) plus offline/online evaluation and monitoring to reduce analysis time and improve consistency versus stateless approaches.

View profile
OL

Mid-level Data Engineer specializing in cloud data pipelines and streaming

Charlotte, NC5y exp
Wells FargoUniversity of North Texas

Data engineer with experience at Wells Fargo and Accenture owning end-to-end production pipelines processing hundreds of millions of transactional/risk records daily. Strong focus on data quality and reliability (reconciliation checks, schema drift detection, CloudWatch alerting) plus Spark performance tuning and idempotent backfills using Delta Lake/merge logic across AWS (S3/EMR/Databricks/Redshift) and Azure (ADF/Azure DevOps/Azure Monitor).

View profile

Need someone specific?

AI Search