Vetted Apache Spark Professionals

Pre-screened and vetted.

SS

Shivam Sah

Screened

Mid-level Backend Software Engineer specializing in distributed microservices

New York, United States4y exp
ActiveViamNortheastern University

Internship at ActiveVM where they tackled large-scale Spring Boot 2→3/library migrations across hundreds of downstream products by combining OpenRewrite (AST-based recipes) with an LLM/RAG-based classifier that routed risky files to human experts. Reported ~70% reduction in manual effort and 90%+ accuracy after testing across multiple branches and cutovers; also built a CTR-driven book recommendation capstone showcased at the Google office in Cambridge.

View profile
HV

Junior Software Engineer specializing in Full-Stack and ML for FinTech

Hyderabad, Telangana1y exp
Volksoft TechnologiesUSC

Full-stack engineer with fintech trading-platform experience who shipped and operated a real-time portfolio P&L/performance feature end-to-end (React + Node/WebSockets + MongoDB) on AWS, including significant performance tuning under peak trading load. Also built a Spark-based trading analytics pipeline with idempotency and reconciliation for auditability, and has a personal React/TS + Node/Express project (Artsy) with JWT auth and schema-evolution practices.

View profile
SN

Mid-level Software Engineer specializing in data engineering on GCP

Bentonville, AR3y exp
WalmartLehigh University

Data engineer with hands-on experience migrating a legacy/mainframe-fed loader onto GCP, orchestrating daily SFTP-to-GCS ingestion, Spark/Scala transformations, and loading into Cassandra/Solr/OpenSearch with API- and BigQuery-based validation. Also built a Java Spring Boot service that extracts from Hive and produces Excel outputs, emphasizing testing, logging/alerts, and CI setup.

View profile
NK

Senior Data Engineer specializing in Palantir Foundry and Snowflake for regulated industries

USA5y exp
American ExpressUniversity of Massachusetts Boston

Data engineer focused on high-volume transaction pipelines (2M+ per day) using Snowflake/Snowpipe, Spark/PySpark, Kafka, and Airflow, with a strong emphasis on schema/data-quality enforcement and reliability improvements. Also built a greenfield compliance-focused RAG solution, using CloudWatch monitoring and adding ingestion validation to prevent malformed OCR documents from degrading search quality.

View profile
MG

Senior Data Engineer specializing in cloud data platforms and real-time streaming

6y exp
HCA HealthcareWright State University

Data engineer in healthcare (HCA) who owned end-to-end Azure-based pipelines at very large scale (50M+ daily claims/patient records). Strong focus on reliability: schema-drift fail-fast validation, quarantine layers, and Python/SQL data quality checks that reduced issues ~25%, plus performance tuning in Databricks/PySpark and versioned serving in Synapse for downstream consumers.

View profile
Nikitha Kommidi - Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps

Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps

6y exp
CitibankUniversity of Texas at Arlington

Built a production real-time fraud detection and customer-support automation platform at Citibank, tackling extreme class imbalance (reported ~1:5000) and strict latency constraints. Combines hands-on MLOps (Airflow, Kubernetes, MLflow; Snowflake/Spark/S3 integrations; CI/CD model promotion) with cross-functional delivery to Risk & Compliance focused on interpretability and reducing false positives.

View profile
Kasireddy Kumar reddy - Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems in Missouri, USA

Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems

Missouri, USA6y exp
CenteneUniversity of Central Missouri

Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.

View profile
Hsi-Chun Wang - Mid-level Data Scientist specializing in LLM development and scalable ML pipelines in Remote

Hsi-Chun Wang

Screened

Mid-level Data Scientist specializing in LLM development and scalable ML pipelines

Remote4y exp
GearFactory.aiUniversity of Maryland, College Park

Built and deployed production LLM pipelines for evidence-based scoring in two domains: biomedical literature mining (scoring ~2700 drug compounds vs gene targets/mechanisms) and long-horizon news analytics (35 years of Chinese articles). Emphasizes reliability at scale (retries/checkpointing/validation), rigorous empirical model benchmarking (GPT-4o/mini/5), and translating results into stakeholder-friendly visual narratives.

View profile
SK

Mid-level Data Analyst and Data Engineer specializing in healthcare and financial analytics

3y exp
UnitedHealth GroupUniversity of North Texas

Analytics professional with healthcare and operations experience who turns messy enterprise data from platforms like Teradata, GCP, SQL Server, and Snowflake into trusted reporting layers and reproducible analysis workflows. They combine SQL, Python, PySpark, Power BI, and Tableau to improve reporting accuracy and performance, including a 30% dashboard refresh improvement and 20-25% accuracy gains in healthcare reporting.

View profile
Gnaneswar Markat - Mid-level Software Engineer specializing in cloud-native distributed systems in Charlotte, NC

Mid-level Software Engineer specializing in cloud-native distributed systems

Charlotte, NC4y exp
Bank of AmericaUniversity at Buffalo

Full-stack engineer with Bank of America experience building and owning a customer portfolio dashboard end-to-end, from requirements through launch and ongoing iteration. They combine React/Spring Boot/PostgreSQL implementation with strong performance tuning, real-time data handling, and UX improvements, and cite adoption by roughly 12,000 active internal users.

View profile
Pavan Punna - Mid-level AI/ML Engineer specializing in LLMs, MLOps, and healthcare-fintech AI in Dallas, TX

Pavan Punna

Screened

Mid-level AI/ML Engineer specializing in LLMs, MLOps, and healthcare-fintech AI

Dallas, TX5y exp
Federal Soft SystemsConcordia University

Built and owned a production GPT-4 RAG assistant for clinical and enterprise query resolution, taking it from initial experiment to deployment, monitoring, and iterative improvement. Their work cut resolution time from 45 minutes to under 2 minutes, achieved roughly 95% accuracy, and scaled to thousands of additional monthly queries while emphasizing safety and trust in a sensitive clinical domain.

View profile
premkumar narla - Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise ML systems in Chicago, IL

Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise ML systems

Chicago, IL5y exp
Morgan StanleyEastern Illinois University

ML/AI engineer with hands-on experience at Morgan Stanley building production fraud detection and enterprise RAG systems. Stands out for owning systems end-to-end—from experimentation and deployment to monitoring and iteration—and for delivering measurable impact, including an 18% reduction in fraud false positives, 40% lower inference latency, and internal tooling that reduced model deployment time from days to hours.

View profile
SR

Mid-level Generative AI Engineer specializing in LLMs and enterprise AI

Texas, USA5y exp
PNCUniversity of Texas at Arlington

Built and owned an enterprise LLM/RAG document intelligence platform for PNC Financial Services in a compliance-heavy environment, focused on grounded answers over internal finance and policy documents. Stands out for combining GenAI product delivery with production engineering discipline, delivering 60% faster document review and materially better answer quality while creating reusable FastAPI-based AI services for multiple teams.

View profile
NA

Naiya Adatiya

Screened

Mid-level Software Engineer specializing in backend systems and cloud-native microservices

Vermont, USA4y exp
Vermont Information ProcessingNortheastern University

Engineer with a process-driven approach to AI-assisted software development, focused on orchestrating where AI adds value while maintaining human review and verification. Has applied this in backend work such as an S3-based invoice pipeline and used multi-agent workflows to speed up large API refactors across many endpoints.

View profile
Siva Harini Sri Janaki Raman - Mid-level Data Engineer specializing in cloud data platforms in Dallas, TX

Mid-level Data Engineer specializing in cloud data platforms

Dallas, TX3y exp
CVS HealthTexas Tech University

Built an AI-powered internal support assistant at CVS Health using GPT-4, LangChain, and Pinecone, applying RAG, validation, and monitoring to reduce repetitive support tickets while protecting sensitive healthcare data. Stands out for a pragmatic approach to AI engineering: using multi-agent and LLM workflows to accelerate development while keeping systems constrained, observable, and production-friendly.

View profile
VT

Mid-level AI/ML Engineer specializing in Generative AI and agentic systems

4y exp
WalmartUniversity of Central Missouri

Backend/platform engineer who has owned a Python/FastAPI results API and deployed it on Kubernetes with Helm and GitHub Actions-driven CI/CD. Demonstrates strong production operations mindset across performance tuning, monitoring, safe rollouts/rollbacks, and phased migrations, plus hands-on Kafka streaming experience focused on ordering and idempotency.

View profile
Vineet Jujjavarapu - Mid-level Software Engineer specializing in cloud-native data platforms in College Park, MD

Mid-level Software Engineer specializing in cloud-native data platforms

College Park, MD3y exp
University of Maryland, College ParkUniversity of Maryland, College Park

Software engineer with hands-on experience using AI coding assistants and LangChain-based agent workflows in RAG/LLM projects. Stands out for combining practical multi-agent experimentation with strong grounding in system design, distributed systems, and production-minded validation of AI-generated outputs.

View profile
CP

Director of Software Engineering specializing in AI, data platforms, and cloud architecture

Washington, DC29y exp
ZipRecruiterAmerican University

Veteran software engineering leader who started as an early internet engineer in the mid-1990s and has since grown into Director/VP-level leadership across legacy web platforms, logistics systems, and modern data engineering. Particularly compelling for companies needing a hands-on leader who can modernize complex Perl/UNIX monoliths, manage large cross-functional teams, and deliver operational systems in warehouse, marketplace, and reverse-logistics environments.

View profile
AG

Aditee Gaddam

Screened

Mid-level Full-Stack Software Engineer specializing in cloud and AI systems

United States3y exp
SyscoGeorge Mason University

Built internal product features at Sysco's Collab Cafe across React/TypeScript frontend and Spring Boot/PostgreSQL backend, including a full project invite flow and an early AI-style project matching capability. Stands out for owning features end-to-end, improving React dashboard performance with profiling and component refactoring, and making pragmatic 0→1 tradeoffs to ship quickly.

View profile
ET

Evan Teague

Screened

Senior Software Engineer specializing in backend and data platforms

Bethesda, MD10y exp
Spatial Data LogicUniversity of Virginia

Series A startup engineer with broad full-stack ownership across backend, data, and frontend, including a real-time ingestion platform that scaled to 10x higher daily volume without downtime while cutting latency from minutes to seconds. Brings strong fintech and B2B SaaS experience building auditable, high-throughput systems for analysts, operations, and compliance teams in regulated environments.

View profile
Priyadarshini Vykuntapu - Mid-level Software Developer specializing in full-stack systems for FinTech and industrial platforms in USA

Mid-level Software Developer specializing in full-stack systems for FinTech and industrial platforms

USA3y exp
HoneywellUniversity at Buffalo

Enterprise full-stack engineer with experience at Honeywell and Wells Fargo, spanning real-time telemetry dashboards and digital banking systems. Stands out for owning production systems end to end, improving performance in high-scale environments, and driving architectural modernization that reduced release times and improved reliability.

View profile
RK

Rohith kollu

Screened

Senior Software Engineer specializing in backend microservices, cloud, and full-stack systems

Dallas, TX7y exp
CiscoIndiana Wesleyan University

Backend/platform engineer who has built and scaled production Java/Spring Boot + Kafka services on AWS/Kubernetes (1M+ msgs/day) and led reliability/performance fixes that restored SLAs (25–30% latency improvement; 99.9% uptime). Also shipped an AI customer-support chatbot end-to-end using retrieval + guardrails and rigorous evaluation/observability, improving resolution time 40% and satisfaction 25%, with a strong plan/execute/verify approach to agentic workflow reliability.

View profile
SK

Mid-Level Software Engineer specializing in FinTech microservices and AI automation

New York City, United States3y exp
Bank of AmericaNJIT

Backend engineer with experience evolving a real-time transaction and rewards processing platform from a tightly coupled architecture into domain-based microservices. Uses REST plus Kafka for synchronous vs. asynchronous workflows, and builds Python/FastAPI APIs with Pydantic contracts, Docker/Kubernetes deployments, and JWT/OAuth-based security; has also supported analytics/dashboard use cases (Power BI).

View profile
PK

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA4y exp
GE HealthCareFranklin University

LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.

View profile

Need someone specific?

AI Search