Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache Spark Python Docker SQL AWS CI/CD

Shivam Sah

Screened

Mid-level Backend Software Engineer specializing in distributed microservices

New York, United States4y exp

ActiveViamNortheastern University

“Internship at ActiveVM where they tackled large-scale Spring Boot 2→3/library migrations across hundreds of downstream products by combining OpenRewrite (AST-based recipes) with an LLM/RAG-based classifier that routed risky files to human experts. Reported ~70% reduction in manual effort and 90%+ accuracy after testing across multiple branches and cutovers; also built a CTR-driven book recommendation capstone showcased at the Google office in Cambridge.”

Java Python Go SQL JavaScript TypeScript+96

View profile

Hruday Vuppala

Screened

Junior Software Engineer specializing in Full-Stack and ML for FinTech

Hyderabad, Telangana1y exp

Volksoft TechnologiesUSC

“Full-stack engineer with fintech trading-platform experience who shipped and operated a real-time portfolio P&L/performance feature end-to-end (React + Node/WebSockets + MongoDB) on AWS, including significant performance tuning under peak trading load. Also built a Spark-based trading analytics pipeline with idempotency and reconciliation for auditability, and has a personal React/TS + Node/Express project (Artsy) with JWT auth and schema-evolution practices.”

Python JavaScript TypeScript C C++SQL+92

View profile

Steven Nazaroff

Screened

Mid-level Software Engineer specializing in data engineering on GCP

Bentonville, AR3y exp

WalmartLehigh University

“Data engineer with hands-on experience migrating a legacy/mainframe-fed loader onto GCP, orchestrating daily SFTP-to-GCS ingestion, Spark/Scala transformations, and loading into Cassandra/Solr/OpenSearch with API- and BigQuery-based validation. Also built a Java Spring Boot service that extracts from Hive and produces Excel outputs, emphasizing testing, logging/alerts, and CI setup.”

Python Scala Java SQL CSS JavaScript+56

View profile

Neimisha Konda

Screened

Senior Data Engineer specializing in Palantir Foundry and Snowflake for regulated industries

USA5y exp

American ExpressUniversity of Massachusetts Boston

“Data engineer focused on high-volume transaction pipelines (2M+ per day) using Snowflake/Snowpipe, Spark/PySpark, Kafka, and Airflow, with a strong emphasis on schema/data-quality enforcement and reliability improvements. Also built a greenfield compliance-focused RAG solution, using CloudWatch monitoring and adding ingestion validation to prevent malformed OCR documents from degrading search quality.”

Snowflake SQL PostgreSQL MySQL NoSQL Apache Spark+109

View profile

Manasa Gunreddy

Screened

Senior Data Engineer specializing in cloud data platforms and real-time streaming

6y exp

HCA HealthcareWright State University

“Data engineer in healthcare (HCA) who owned end-to-end Azure-based pipelines at very large scale (50M+ daily claims/patient records). Strong focus on reliability: schema-drift fail-fast validation, quarantine layers, and Python/SQL data quality checks that reduced issues ~25%, plus performance tuning in Databricks/PySpark and versioned serving in Synapse for downstream consumers.”

AWS AWS CloudFormation AWS CodePipeline AWS Glue AWS IAM AWS Lambda+136

View profile

Nikitha Kommidi

Screened

Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps

6y exp

CitibankUniversity of Texas at Arlington

“Built a production real-time fraud detection and customer-support automation platform at Citibank, tackling extreme class imbalance (reported ~1:5000) and strict latency constraints. Combines hands-on MLOps (Airflow, Kubernetes, MLflow; Snowflake/Spark/S3 integrations; CI/CD model promotion) with cross-functional delivery to Risk & Compliance focused on interpretability and reducing false positives.”

Python SQL Bash C JavaScript PHP+154

View profile

Kasireddy Kumar reddy

Screened

Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems

Missouri, USA6y exp

CenteneUniversity of Central Missouri

“Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.”

A/B Testing Agile Apache Airflow Apache Kafka Azure Blob Storage BigQuery+137

View profile

Hsi-Chun Wang

Screened

Mid-level Data Scientist specializing in LLM development and scalable ML pipelines

Remote4y exp

GearFactory.aiUniversity of Maryland, College Park

“Built and deployed production LLM pipelines for evidence-based scoring in two domains: biomedical literature mining (scoring ~2700 drug compounds vs gene targets/mechanisms) and long-horizon news analytics (35 years of Chinese articles). Emphasizes reliability at scale (retries/checkpointing/validation), rigorous empirical model benchmarking (GPT-4o/mini/5), and translating results into stakeholder-friendly visual narratives.”

A/B Testing AWS AWS IAM AWS Lambda Classification Clustering+80

View profile

Sreenija Karnati

Screened

Mid-level Data Analyst and Data Engineer specializing in healthcare and financial analytics

3y exp

UnitedHealth GroupUniversity of North Texas

“Analytics professional with healthcare and operations experience who turns messy enterprise data from platforms like Teradata, GCP, SQL Server, and Snowflake into trusted reporting layers and reproducible analysis workflows. They combine SQL, Python, PySpark, Power BI, and Tableau to improve reporting accuracy and performance, including a 30% dashboard refresh improvement and 20-25% accuracy gains in healthcare reporting.”

SQL Python Power BI Snowflake Databricks Business Intelligence+89

View profile

Gnaneswar Markat

Screened

Mid-level Software Engineer specializing in cloud-native distributed systems

Charlotte, NC4y exp

Bank of AmericaUniversity at Buffalo

“Full-stack engineer with Bank of America experience building and owning a customer portfolio dashboard end-to-end, from requirements through launch and ongoing iteration. They combine React/Spring Boot/PostgreSQL implementation with strong performance tuning, real-time data handling, and UX improvements, and cite adoption by roughly 12,000 active internal users.”

Java Python C++C#JavaScript TypeScript+143

View profile

Pavan Punna

Screened

Mid-level AI/ML Engineer specializing in LLMs, MLOps, and healthcare-fintech AI

Dallas, TX5y exp

Federal Soft SystemsConcordia University

“Built and owned a production GPT-4 RAG assistant for clinical and enterprise query resolution, taking it from initial experiment to deployment, monitoring, and iterative improvement. Their work cut resolution time from 45 minutes to under 2 minutes, achieved roughly 95% accuracy, and scaled to thousands of additional monthly queries while emphasizing safety and trust in a sensitive clinical domain.”

Python SQL Java Scala Bash PyTorch+124

View profile

premkumar narla

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise ML systems

Chicago, IL5y exp

Morgan StanleyEastern Illinois University

“ML/AI engineer with hands-on experience at Morgan Stanley building production fraud detection and enterprise RAG systems. Stands out for owning systems end-to-end—from experimentation and deployment to monitoring and iteration—and for delivering measurable impact, including an 18% reduction in fraud false positives, 40% lower inference latency, and internal tooling that reduced model deployment time from days to hours.”

Python SQL Bash JavaScript PyTorch TensorFlow+79

View profile

Sujeeth Rajkumar

Screened

Mid-level Generative AI Engineer specializing in LLMs and enterprise AI

Texas, USA5y exp

PNCUniversity of Texas at Arlington

“Built and owned an enterprise LLM/RAG document intelligence platform for PNC Financial Services in a compliance-heavy environment, focused on grounded answers over internal finance and policy documents. Stands out for combining GenAI product delivery with production engineering discipline, delivering 60% faster document review and materially better answer quality while creating reusable FastAPI-based AI services for multiple teams.”

Generative AI GPT BERT LLaMA LangChain FAISS+197

View profile

Naiya Adatiya

Screened

Mid-level Software Engineer specializing in backend systems and cloud-native microservices

Vermont, USA4y exp

Vermont Information ProcessingNortheastern University

“Engineer with a process-driven approach to AI-assisted software development, focused on orchestrating where AI adds value while maintaining human review and verification. Has applied this in backend work such as an S3-based invoice pipeline and used multi-agent workflows to speed up large API refactors across many endpoints.”

JavaScript Python C++TypeScript SQL Shell Scripting+124

View profile

Siva Harini Sri Janaki Raman

Screened

Mid-level Data Engineer specializing in cloud data platforms

Dallas, TX3y exp

CVS HealthTexas Tech University

“Built an AI-powered internal support assistant at CVS Health using GPT-4, LangChain, and Pinecone, applying RAG, validation, and monitoring to reduce repetitive support tickets while protecting sensitive healthcare data. Stands out for a pragmatic approach to AI engineering: using multi-agent and LLM workflows to accelerate development while keeping systems constrained, observable, and production-friendly.”

Python SQL R AWS Amazon S3 AWS Glue+110

View profile

vedavathi thumula

Screened

Mid-level AI/ML Engineer specializing in Generative AI and agentic systems

4y exp

WalmartUniversity of Central Missouri

“Backend/platform engineer who has owned a Python/FastAPI results API and deployed it on Kubernetes with Helm and GitHub Actions-driven CI/CD. Demonstrates strong production operations mindset across performance tuning, monitoring, safe rollouts/rollbacks, and phased migrations, plus hands-on Kafka streaming experience focused on ordering and idempotency.”

Python SQL Bash JavaScript Scikit-learn XGBoost+249

View profile

Vineet Jujjavarapu

Screened

Mid-level Software Engineer specializing in cloud-native data platforms

College Park, MD3y exp

University of Maryland, College ParkUniversity of Maryland, College Park

“Software engineer with hands-on experience using AI coding assistants and LangChain-based agent workflows in RAG/LLM projects. Stands out for combining practical multi-agent experimentation with strong grounding in system design, distributed systems, and production-minded validation of AI-generated outputs.”

Python Java C++SQL Design Patterns Apache Spark+93

View profile

Christopher Prokop

Screened

Director of Software Engineering specializing in AI, data platforms, and cloud architecture

Washington, DC29y exp

ZipRecruiterAmerican University

“Veteran software engineering leader who started as an early internet engineer in the mid-1990s and has since grown into Director/VP-level leadership across legacy web platforms, logistics systems, and modern data engineering. Particularly compelling for companies needing a hands-on leader who can modernize complex Perl/UNIX monoliths, manage large cross-functional teams, and deliver operational systems in warehouse, marketplace, and reverse-logistics environments.”

Agentic AI Claude AWS Terraform Kubernetes Amazon S3+78

View profile

Aditee Gaddam

Screened

Mid-level Full-Stack Software Engineer specializing in cloud and AI systems

United States3y exp

SyscoGeorge Mason University

“Built internal product features at Sysco's Collab Cafe across React/TypeScript frontend and Spring Boot/PostgreSQL backend, including a full project invite flow and an early AI-style project matching capability. Stands out for owning features end-to-end, improving React dashboard performance with profiling and component refactoring, and making pragmatic 0→1 tradeoffs to ship quickly.”

Java JavaScript Python TypeScript C C#+201

View profile

Evan Teague

Screened

Senior Software Engineer specializing in backend and data platforms

Bethesda, MD10y exp

Spatial Data LogicUniversity of Virginia

“Series A startup engineer with broad full-stack ownership across backend, data, and frontend, including a real-time ingestion platform that scaled to 10x higher daily volume without downtime while cutting latency from minutes to seconds. Brings strong fintech and B2B SaaS experience building auditable, high-throughput systems for analysts, operations, and compliance teams in regulated environments.”

Python Go JavaScript TypeScript SQL Bash+159

View profile

Priyadarshini Vykuntapu

Screened

Mid-level Software Developer specializing in full-stack systems for FinTech and industrial platforms

USA3y exp

HoneywellUniversity at Buffalo

“Enterprise full-stack engineer with experience at Honeywell and Wells Fargo, spanning real-time telemetry dashboards and digital banking systems. Stands out for owning production systems end to end, improving performance in high-scale environments, and driving architectural modernization that reduced release times and improved reliability.”

C C++Python Java JavaScript SQL+103

View profile

Rohith kollu

Screened

Senior Software Engineer specializing in backend microservices, cloud, and full-stack systems

Dallas, TX7y exp

CiscoIndiana Wesleyan University

“Backend/platform engineer who has built and scaled production Java/Spring Boot + Kafka services on AWS/Kubernetes (1M+ msgs/day) and led reliability/performance fixes that restored SLAs (25–30% latency improvement; 99.9% uptime). Also shipped an AI customer-support chatbot end-to-end using retrieval + guardrails and rigorous evaluation/observability, improving resolution time 40% and satisfaction 25%, with a strong plan/execute/verify approach to agentic workflow reliability.”

Amazon CloudFront Amazon CloudWatch Amazon EC2 Amazon RDS Amazon S3 Apache Hadoop+154

View profile

SaiRahulCharan Kotepalli

Screened

Mid-Level Software Engineer specializing in FinTech microservices and AI automation

New York City, United States3y exp

Bank of AmericaNJIT

“Backend engineer with experience evolving a real-time transaction and rewards processing platform from a tightly coupled architecture into domain-based microservices. Uses REST plus Kafka for synchronous vs. asynchronous workflows, and builds Python/FastAPI APIs with Pydantic contracts, Docker/Kubernetes deployments, and JWT/OAuth-based security; has also supported analytics/dashboard use cases (Power BI).”

Java Python JavaScript R SQL C#+110

View profile

PHANINDRA KETHAMUKKALA

Screened

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA4y exp

GE HealthCareFranklin University

“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”

A/B Testing Agile AI Agents Apache Kafka Apache Spark AWS Glue+170

View profile

Software Engineers Machine Learning Engineers Data Scientists Data Engineers Software Developers AI Engineers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?