Vetted Apache Spark Professionals

Pre-screened and vetted.

Bhanu Prakash Reddy Dakilli - Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing in Framingham, MA

Mid-level Data Engineer specializing in Azure ETL/ELT and data warehousing

Framingham, MA4y exp
Bank of AmericaNew England College

Data engineer who has owned end-to-end production pipelines for customer transaction data (~2–5 GB/day) using Python/PySpark/SQL and Airflow, delivering major reliability and speed gains (70% faster reporting; 60–70% fewer data issues). Also built a daily external web-scraping system with anti-bot handling and safe, idempotent Airflow-driven backfills, plus a Python data API optimized with indexing/caching and tested for correctness.

View profile
DM

Mid-level Data Engineer specializing in real-time analytics and regulated domains

NC, USA5y exp
JPMorgan ChaseSaint Louis University

Data platform engineer focused on large-scale, real-time fraud systems, with hands-on ownership of streaming architectures using Kafka, Spark, Snowflake, and Databricks. Stands out for combining performance tuning and platform automation with LLM/RAG-based enrichment, delivering measurable gains in latency, fraud accuracy, false positives, and analyst decision speed.

View profile
Harrishkumar Loganathan - Mid AI/Machine Learning Engineer specializing in FinTech and Generative AI in Remote, USA

Mid AI/Machine Learning Engineer specializing in FinTech and Generative AI

Remote, USA3y exp
SocureArizona State University

AI/ML engineer with hands-on ownership of enterprise LLM deployments at Freshworks, including a large-scale RAG chatbot serving 15,000+ users across six departments. Stands out for combining deep production engineering skills—AWS microservices, Kubernetes, observability, retrieval quality, and faithfulness evaluation—with strong cross-functional stakeholder leadership and prior large-scale fraud data pipeline experience at Socure.

View profile
Saisureshreddy Challa - Mid-level Data Scientist specializing in AI/ML, LLMs, and domain analytics in California, USA

Mid-level Data Scientist specializing in AI/ML, LLMs, and domain analytics

California, USA6y exp
BlackRockNortheastern University

BlackRock AI/ML engineer who built and owned a production LLM document intelligence system for regulatory and investment analysis end-to-end. They combined RAG, multi-agent validation, strong evaluation/monitoring, and reusable Python services to process 50K+ documents, cut review time 40-50%, and improve decision accuracy by about 25%.

View profile
AJ

Mid-level AI/ML Engineer specializing in generative AI, NLP, and MLOps

San Jose, CA4y exp
ServiceNowUniversity of North Carolina at Charlotte

ML/AI engineer with hands-on ownership of production GenAI and computer vision systems, spanning experimentation, deployment, monitoring, and iterative optimization. Stands out for shipping an enterprise RAG platform that cut manual review by 50% and a defect detection pipeline that reduced report generation from 15 minutes to under 1 second while maintaining high uptime and strong operational discipline.

View profile
DD

Drew Dunn

Screened

Senior AI Engineer specializing in generative AI and production ML systems

Aledo, TX14y exp
Elevance HealthTexas Tech University

ML/AI engineer with hands-on ownership of production computer vision, speech, and legal RAG systems. Notably improved a key-duplication CV pipeline enough to unblock commercial launch and remove specialist manual measurement, and also shipped a live Quran recitation detection feature for a product with 1M+ users.

View profile
AG

Mid-level Full-Stack Java Developer specializing in FinTech

New York, NY5y exp
JPMorgan ChaseKent State University

Built a production AI-powered insights platform for marketing teams analyzing large-scale social and news data, combining Java microservices, Kafka, Spark, React, and LLM-based retrieval workflows. Stands out for shipping customer-facing AI features with measurable gains in accuracy and latency, plus solid reliability practices for high-volume backend systems.

View profile
RS

Mid-level Software Engineer specializing in cloud-native backend and AI systems

Long Beach, CA4y exp
JPMorgan ChaseCalifornia State University, Long Beach

Candidate takes a disciplined, developer-in-the-loop approach to AI-assisted coding, using AI primarily for brainstorming, suggestions, and optimization while retaining full ownership of architecture and final code decisions. They also actively stay current on AI developments through research papers, communities, and emerging tools.

View profile
NC

Naveen Chava

Screened

Mid-level Software Engineer specializing in Generative AI and FinTech systems

Chicago, IL4y exp
PayPalDePaul University

Candidate brings practical GenAI engineering experience with a disciplined approach to AI-assisted development. They have designed lightweight multi-agent workflows for a RAG-based support copilot, including retrieval, relevance validation, response generation, and groundedness checks to reduce hallucinations.

View profile
Wei-Hsien Wang - Entry-level AI Engineer specializing in full-stack generative AI systems in San Jose, CA

Entry-level AI Engineer specializing in full-stack generative AI systems

San Jose, CA1y exp
AzazieUC San Diego

AI/full-stack product engineer who has shipped both user-facing and internal LLM products, from a photo-to-music recommendation app to an experimentation agent at Azazie. Stands out for combining modern app development with production-grade agent and GraphRAG systems, including a 500k+ email analysis platform and measurable impact like 3x experiment velocity, 75% setup-time reduction, and 65% faster task discovery.

View profile
AB

Director-level Product Leader specializing in FinTech and enterprise finance platforms

Charlotte, NC19y exp
Wells FargoIEC College of Engineering and Technology

Senior product and technology leader with 23+ years of experience driving modernization in complex enterprise finance and operations environments. He stands out for turning legacy, paper-based or fragmented systems into scalable digital products—cutting a warranty claims process from 30 days to near-instant and using AI to improve service efficiency and reduce testing effort by 30%+. Strong C-suite-facing operator who bridges strategy, architecture, UX, and organizational change.

View profile
PS

Pooja Shindd

Screened

Mid-level Full-Stack Software Engineer specializing in scalable web and AI systems

Illinois, USA4y exp
University of Illinois Chicago Technology SolutionsUniversity of Illinois Chicago

Full-stack engineer who has built both a TypeScript-based HR/payroll platform and a production agentic AI support system end to end. Stands out for combining strong product judgment with deep LLM systems thinking: RAG architecture, confidence-based routing, evals, observability, and human-in-the-loop design in a greenfield environment.

View profile
RM

Junior Full-Stack Software Engineer specializing in React and AI-powered applications

Bloomington, IN4y exp
Indiana UniversityIndiana University Bloomington

Full-stack/AI-focused builder who shipped a production Career Advisor app using LLMs + RAG + vector DB (React/Node/MongoDB/Claude API) and grew it to 2000+ users, handling real deployment issues and CI/CD on Vercel/Render. Also developing an AI-powered iOS “3D World Explorer” (text-to-3D) and has cloud experience across Azure and AWS (S3/SageMaker/EC2).

View profile
SB

Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing

Boston, USA3y exp
Fidelity InvestmentsNortheastern University

Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.

View profile
JS

Jash Shah

Screened

Mid-level Data Scientist specializing in LLMs, MLOps, and predictive analytics in healthcare and finance

New Jersey, USA4y exp
Johnson & JohnsonStevens Institute of Technology

Built and deployed a production LLM/RAG clinical decision support system that enables real-time semantic search over unstructured EHR notes and delivers patient risk insights. Strong in healthcare-grade MLOps and compliance (HIPAA, PHI handling, encryption, RBAC, audit logs) and scaled embedding/retrieval pipelines using Spark/Databricks and Airflow. Partnered with clinicians via Power BI dashboards and explainability, contributing to an 18% reduction in patient readmissions.

View profile
SM

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Connecticut, USA5y exp
PfizerUniversity of New Haven

Built and deployed an enterprise GenAI knowledge assistant over thousands of internal PDFs/reports using a RAG stack (GPT-4 + Hugging Face embeddings + vector DB) to reduce manual search and SME escalations. Uses LangGraph/LangChain to orchestrate modular agent workflows with relevance filtering and fallback handling, and applies rigorous evaluation (golden datasets, edge cases, A/B tests) with production monitoring metrics.

View profile
AS

Aisha Sartaj

Screened

Mid-level AI Engineer specializing in LLM systems, RAG, and MLOps

Remote3y exp
ILMAscentUCLA

Built an LLM multi-agent “ingredient safety” analyzer for cosmetics that cuts consumer research time from ~20+ minutes to minutes, using LangGraph orchestration, hybrid retrieval (Qdrant + Tavily), and safety-focused critic validation (false rejections reduced ~30%→~8%). Also has research-internship experience building computer-vision pipelines to classify emerald color/clarity by translating gem-expert heuristics into quantitative model features.

View profile
AS

Avijit Saha

Screened

Junior Software Engineer specializing in cloud-native microservices and AI/ML observability

Bedford, TX3y exp
JPMorgan ChaseUniversity of the Cumberlands

Engineer with banking and industrial/IoT experience who has deployed a payment-processing microservice with zero downtime, handling Protobuf schema evolution and sensitive data migration via dual-write/checksum techniques. Demonstrates strong cross-stack troubleshooting (pinpointed intermittent distributed timeouts to a failing ToR switch port) and customer-facing Python ETL customization using plugin-based parsers and Pydantic validation, plus hands-on monitoring/alerting improvements with operators.

View profile
BC

Bhuvan Chandi

Screened

Mid-level Data Engineer specializing in AI/ML data platforms

NY, NY6y exp
BlackRockWebster University

Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.

View profile
SK

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA4y exp
ServiceNowRivier University

Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.

View profile
KF

Kevin Fang

Screened

Intern Software Engineer specializing in full-stack and data systems

Beverly Hills, CA1y exp
Alo YogaUC Irvine

Software developer with healthcare operations experience at Epic Systems (Referrals & Authorizations), delivering customer-facing tooling to speed manual insurance authorization/denial documentation and support future automation. Also supported an HRIS migration to Workday at Aloe Yoga, solving legacy ID interoperability via scripting and mapping, and demonstrates strong production debugging and test-driven maintainability practices.

View profile
MS

Min-Han Shih

Screened

Junior Machine Learning Engineer specializing in speech and multimodal AI

Taipei, Taiwan2y exp
FurboUSC

New grad who has shipped a production vision-language recommendation feature for a pet camera/mobile app, including building a tagged video dataset with human annotators and optimizing inference by FPS downsampling under device compute limits. Also built a multimodal MLLM benchmark using an LLM-as-judge (GPT-5-thinking) with a feedback loop, validated against human scoring, and measured post-feedback quality gains (12% average score improvement).

View profile
RK

Rohit Khoja

Screened

Mid-level Full-Stack Engineer specializing in cloud microservices and NLP/LLM systems

Tempe, AZ4y exp
CitigroupArizona State University

Full-stack engineer with 3+ years using Java/Spring Boot (Citi) and React, who built a production observability dashboard monitoring 53 microservices across 17 clusters with real-time health/latency tracing and significant performance improvements (cut load time from ~10s). Also designed a serverless AWS face-recognition system (Lambda/S3/SQS) built to handle burst traffic (~1000 concurrent requests), demonstrating strength in scalable, event-driven architectures.

View profile
SS

Mid-level Data Engineer specializing in real-time pipelines and cloud analytics

Chicago, IL5y exp
JPMorgan ChaseUniversity of South Dakota

Researcher from the University of South Dakota who built a production medical RAG system to help interpret model predictions by retrieving relevant clinical notes and medical literature, overcoming retrieval accuracy and imaging-dataset challenges through semantic chunking and metadata-driven indexing. Also has hands-on orchestration experience with Airflow and Azure Data Factory, plus a pragmatic approach to LLM evaluation and stakeholder-driven iteration.

View profile

Need someone specific?

AI Search