Vetted Apache Spark Professionals

Pre-screened and vetted.

AH

Ansh Harjai

Screened

Junior Software Engineer specializing in AI, RAG systems, and backend development

Brooklyn, NY1y exp
New York UniversityNYU

Built an NYU software engineering capstone called “Smart Cash AI,” a multi-agent LLM-powered web app that curates offline-ready podcasts/articles/videos/news based on user preferences and commute schedules. Architected agent orchestration (discovery/downloader/summarizer), real-time progress via WebSockets, and an ETL normalization layer across RSS/YouTube and other sources with GUID-based deduplication, retries, and failure isolation to keep the system predictable.

View profile
Hamidreza Lotfalizadeh - Mid-level AI/ML Engineer specializing in LLM agents, RAG, and ML systems in Bay Area, CA

Mid-level AI/ML Engineer specializing in LLM agents, RAG, and ML systems

Bay Area, CA6y exp
Inertia SystemsPurdue University

At Inertia Systems, built a production LLM-powered ingestion pipeline that converts heterogeneous sources (PDF/JSON/IFC/SQL and financial tables) into standardized text and uses GraphRAG to construct a knowledge graph with verified dependency relationships. Also has hands-on HPC orchestration experience with SLURM, including creating a custom wrapper process manager to improve resource utilization under restrictive scheduling policies.

View profile
Sreelekha Vuppala - Mid-level Data Scientist specializing in Generative AI, MLOps, and cloud data platforms in USA

Mid-level Data Scientist specializing in Generative AI, MLOps, and cloud data platforms

USA4y exp
CitiusTechArizona State University

GenAI/ML engineer (CitiusTech) who has deployed production RAG systems for compliance/operations document Q&A, using Pinecone + FastAPI microservices on Kubernetes with strong monitoring and guardrails. Also built a GenAI-powered incident triage/routing solution in collaboration with non-technical stakeholders, achieving 35% faster response times and 40% fewer misclassified tickets, and has hands-on orchestration experience with Airflow and AutoSys.

View profile
Harini Vinu - Intern Software Engineer specializing in cloud, big data, and test automation in New York, United States

Harini Vinu

Screened

Intern Software Engineer specializing in cloud, big data, and test automation

New York, United States1y exp
QualitestNYU

Internship experience at Qualitest building and deploying an LLM-powered test automation system that reduced manual test creation and improved efficiency (~40%). Demonstrates strong production engineering for LLM systems (timeouts/retries/monitoring/caching, prompt optimization, batching) and has scaled workflows to 100+ concurrent jobs; also has orchestration experience with AWS Step Functions and Kubernetes.

View profile
Fangjian Xiong - Junior Machine Learning Engineer specializing in NLP and biomedical entity extraction in Boston, MA

Junior Machine Learning Engineer specializing in NLP and biomedical entity extraction

Boston, MA2y exp
Northeastern UniversityNortheastern University

Built and deployed a production LLM-powered biomedical knowledge extraction pipeline that processed millions of papers to identify tools/techniques and produce a unified knowledge graph via active learning NER (Prodigy + spaCy transformers) and entity linking (Bio-tools/Wikidata). Addressed hard NLP engineering challenges like WordPiece span-offset alignment and scaled inference over ~1.5M documents using batching/caching, containerized services, async workers, and orchestration with Prefect/Airflow.

View profile
Daniel Jin - Intern Site Reliability Engineer specializing in Kubernetes, AWS, and observability in New York, NY

Daniel Jin

Screened

Intern Site Reliability Engineer specializing in Kubernetes, AWS, and observability

New York, NY1y exp
Woori America BankNYU

Backend/data engineering candidate specializing in Python/Flask services and ML-enabled systems, deploying containerized workloads on AWS ECS/EKS with strong observability (Prometheus/Grafana) and PostgreSQL performance tuning. Built multi-tenant architectures with row- and schema-level isolation and optimized a Kubernetes-based Airflow + Spark nightly ETL pipeline for an e-commerce client, improving performance by 250%+ and reliably beating morning reporting deadlines; also contributed to Apache Airflow (SQLAlchemy/PostgreSQL area).

View profile
Sai Nekkanti - Mid-level Data Scientist / ML Engineer specializing in secure GenAI and financial compliance in Mount Laurel, NJ

Sai Nekkanti

Screened

Mid-level Data Scientist / ML Engineer specializing in secure GenAI and financial compliance

Mount Laurel, NJ4y exp
MetLifeRowan University

Built a production "sentinel insight engine" to tame information overload from millions of product reviews and support transcripts, combining Azure OpenAI (GPT-3.5) zero-shot classification with a fine-tuned T5 summarizer to generate weekly actionable product insights. Demonstrated strong MLOps/production engineering by adding drift monitoring with embedding-based detection, integrating REST with legacy SOAP/queue-based CRM via FastAPI middleware, and scaling reliably on Kubernetes with HPA.

View profile
Nishad Kane - Mid-level Data Scientist & AI Engineer specializing in RAG, agentic AI, and production ML

Nishad Kane

Screened

Mid-level Data Scientist & AI Engineer specializing in RAG, agentic AI, and production ML

5y exp
Xtrium AIArizona State University

AI/data engineer who built a production LLM-powered schema drift detection system (LangChain/LangGraph) to catch semantic data changes before they break downstream analytics/ML. Deployed on AWS with Docker/S3 and implemented an LLM-as-a-judge evaluation framework to improve trust, reduce hallucinations, and control false positives/alert fatigue. Collaborated with non-technical risk/business analytics stakeholders at EY by delivering human-readable drift explanations that improved confidence in financial analytics dashboards.

View profile
Deeresh Gajjala - Senior Software Engineer specializing in Java/Spring Boot microservices and AWS payments systems in Dallas, TX

Senior Software Engineer specializing in Java/Spring Boot microservices and AWS payments systems

Dallas, TX6y exp
American ExpressUniversity of Central Missouri

Senior software engineer with Amazon experience who owned end-to-end improvements to a real-time payment authorization service, rebuilding it as a reactive Spring WebFlux microservice with saga orchestration and Kafka event streaming, deployed on AWS EKS with strong observability. Also built React+TypeScript and Node/Express full-stack workflow apps (onboarding, campaign management, admin review) and has experience shipping quickly in ambiguous startup environments while maintaining reliability and data correctness.

View profile
Revanth Goli - Senior Data & Backend Engineer specializing in cloud data pipelines and LLM/RAG systems in Morrisville, NC

Revanth Goli

Screened

Senior Data & Backend Engineer specializing in cloud data pipelines and LLM/RAG systems

Morrisville, NC6y exp
Syneos HealthUniversity of Alabama at Birmingham

Data engineer with end-to-end ownership of large-scale retail and clinical data ingestion/processing on AWS, including real-time streaming and batch pipelines. Delivered measurable outcomes: 20M daily transactions processed, latency cut from 4 hours to 5 minutes, ~70% fewer failures, and 120+ pipelines running at 99.8% reliability with full audit compliance.

View profile
BK

Mid-level Data Engineer specializing in big data pipelines and real-time streaming

Dallas, TX6y exp
Johnson & JohnsonUniversity of North Texas

Data engineer who has owned end-to-end production pipelines processing a few million records/day, using Python/Airflow/SQL/PySpark with Snowflake serving to BI (Power BI). Built resilient external web data collection systems (anti-bot, schema-change detection, backfills) and shipped versioned REST APIs for internal consumers, improving pipeline success rates to 99% through monitoring, retries, and idempotent design.

View profile
SV

Mid-Level Data Engineer specializing in cloud data platforms and governed analytics

5y exp
OptumUniversity of Central Missouri

Data engineer with Optum experience building end-to-end healthcare data pipelines for HL7/FHIR, processing millions of records daily across Kafka streaming and Databricks/Spark batch. Strong focus on data quality (schema enforcement/validations), reliability (Airflow monitoring/alerts), and analytics-ready serving in Snowflake powering Power BI/Tableau, with CI/CD via Git and Jenkins.

View profile
DP

Dhruv Pandoh

Screened

Junior Full-Stack Software Engineer specializing in AI, FinTech, and e-commerce

New York, USA2y exp
MIO PartnersNYU

Built both traditional internal tooling and LLM-powered systems during an internship, including a React/Python/AWS calculator onboarding platform and a production-style ROS2 RAG assistant over 10K+ documents. Stands out for combining full-stack delivery, stakeholder coordination, and practical AI reliability work like retrieval tuning, source-grounded answers, and low-confidence fallbacks.

View profile
PS

Polam Srija

Screened

Mid-level AI/ML Engineer specializing in Generative AI and FinTech

Texas, USA3y exp
Fidelity InvestmentsUniversity of North Carolina at Charlotte

AI Engineer with hands-on ownership of a production multi-agent RAG platform in financial services, spanning experimentation, architecture, deployment, monitoring, and iterative optimization. Stands out for measurable impact: 35% retrieval relevance improvement and nearly 50% reduction in manual operational analysis effort, plus strong experience making enterprise LLM systems safer and more reliable in production.

View profile
Supreet Purthpli - Mid-level AI/ML Software Engineer specializing in cloud-native MLOps and FinTech in San Francisco, CA

Mid-level AI/ML Software Engineer specializing in cloud-native MLOps and FinTech

San Francisco, CA4y exp
JPMorgan ChaseUniversity of Kansas

Software engineer with JPMorgan Chase experience delivering end-to-end fintech features (Next.js/React/Node/Postgres on AWS) and measurable performance gains. Built and productionized an AI-native credit decisioning workflow combining LLMs, vector retrieval, and a rules engine with strong governance (bias checks, auditability, human-in-loop), improving precision and cutting underwriting turnaround time by 40%.

View profile
Dikshith Pulakanti - Intern AI Engineer specializing in agentic LLM systems in Singapore, Singapore

Intern AI Engineer specializing in agentic LLM systems

Singapore, Singapore0y exp
National University of SingaporeNortheastern University

Built multiple AI-heavy backend systems from scratch, including FORESIGHT, a personal financial intelligence platform running daily on live bank accounts with zero manual intervention, and JobPilot, an autonomous job application agent spanning Workday, Greenhouse, Lever, and custom forms. Stands out for combining strong systems design with applied ML pragmatism, reproducibility, and unusually candid reflection on security, scalability, and observability tradeoffs.

View profile
Sai Divya Mulukala - Mid-level Full-Stack Software Engineer specializing in FinTech and distributed systems in USA

Mid-level Full-Stack Software Engineer specializing in FinTech and distributed systems

USA5y exp
WalmartUniversity at Buffalo

Full-stack engineer with experience building operational dashboards at Walmart and improving digital banking experiences at Bank of America. Stands out for tracing performance issues across frontend, APIs, and backend services, including cutting response times from 1.2s to 700ms and resolving duplicate event-processing problems in distributed systems.

View profile
AG

Senior Full-Stack Developer specializing in FinTech and cloud-native platforms

6y exp
PrudentialTexas A&M University-Corpus Christi

Fullstack engineer from Prudential who built a workflow automation platform for internal service reps, combining Angular/React frontends with NestJS, GraphQL, Kafka, MongoDB, and Redis. Stands out for translating ambiguous business problems into scalable metadata-driven systems, validating architecture through hands-on POCs, and delivering a measurable 40% reduction in transaction handling time.

View profile
AS

Mid-level Full-Stack AI Engineer specializing in enterprise automation and FinTech

USA6y exp
CitigroupUniversity of Texas at Dallas

Built and owned Citigroup's ASTRA AI-powered test case generation platform end to end, from full-stack product experience to multi-agent LLM orchestration and RAG infrastructure. Drove test coverage from 40% to 95%, cut generation time from hours to minutes, and scaled the feature to 300+ daily users across 32 enterprise projects with sponsorship from Citi's CIO and Head of Engineering Excellence.

View profile
MB

Mid-level Python Developer specializing in FinTech and banking platforms

USA3y exp
IntuitUniversity of Bridgeport

Built and owned an AI-powered real-time financial fraud detection and monitoring platform end-to-end, spanning product decisions, backend architecture, frontend dashboards, deployment, and production support. Their work scaled to 120M transactions/day and materially improved fraud detection accuracy from 78% to 94%, showing rare breadth across distributed systems, observability, and React-based operational analytics.

View profile
IP

Intern Data Scientist specializing in machine learning and predictive modeling

Irvine, CA2y exp
Trilemma FoundationUC Irvine

Built across data, backend, analytics, and visualization-heavy applications, including a nonprofit financial forecasting app, large-scale insurance model analysis at Mercury Insurance, and a publicly deployed soccer analytics dashboard. Stands out for combining machine learning, large-dataset SQL work, and practical production improvements like cutting dashboard load times to under two seconds and refactoring codebases for smoother team handoff.

View profile
AT

Junior Data Scientist / Big Data Engineer specializing in ML, LLMs, and analytics platforms

Tempe, Arizona3y exp
Arizona State UniversityArizona State University

Backend/data platform engineer who led a major redesign of a hybrid streaming+batch analytics platform processing 10+ TB/day (Airflow/Hive/BigQuery) with strong data-quality automation. Also built a production RAG PDF assistant with concrete mitigations for hallucinations and prompt injection (re-ranking, grounding, verifier step) and has deep experience executing low-risk migrations (dual-write, blue-green, rapid rollback) and implementing JWT-based row-level security.

View profile
SJ

Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics

Missouri, USA4y exp
KPMGUniversity of Central Missouri

Built and deployed a production LLM-powered calorie-counting chatbot that turns plain-English meal descriptions into normalized food entities, quantities, and calorie estimates using a hybrid transformer + rule-engine pipeline. Emphasizes reliability with schema/constraint guardrails, confidence-based routing (including embedding similarity search fallbacks), and strong observability/metrics (hallucination rate, calibration, latency, cost). Partnered closely with nutritionists to encode domain standards into mappings and validation logic.

View profile
JL

Junior Machine Learning Engineer specializing in LLMs, NLP, and computer vision

Bengaluru, Karnataka2y exp
PwCArizona State University

Built a production, agentic multi-agent pharmaceutical intelligence system for US oncology (breast cancer) conference/news intelligence, automating MSL-style information gathering and summarization for pharma and healthcare stakeholders. Uses CrewAI + LangChain orchestration, custom scraping across ~15 pharma newsrooms, and a grounding-score evaluation approach (sentence transformers/cosine similarity) to mitigate hallucinations.

View profile

Need someone specific?

AI Search