Reval Logo
Home Browse Talent Skilled in Apache Spark

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache SparkPythonDockerSQLAWSCI/CD
RG

Raja Gurugubelli

Screened

Mid-level GenAI Engineer specializing in production RAG and LLM fine-tuning

San Jose, California5y exp
eBayTexas Tech University

“LLM engineer who built a production seller-support RAG system at eBay using hybrid retrieval (BM25 + Pinecone vectors) with Cohere reranking, LangGraph orchestration, and citation-grounded answers. Strong focus on reliability: semantic/structure-aware chunking, automated Ragas-based evaluation with nightly regressions, and production observability (LangSmith) plus drift monitoring (Arize). Also implemented a multi-agent fraud pipeline with AutoGen using JSON-schema contracts and explicit termination conditions.”

PythonSQLBashGPT-4LoRALangChain+130
View profile
DB

Dharmik Bhingradiya

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS

TX, USA5y exp
BlackRockTexas A&M University-Kingsville

“AI engineer who built a production RAG-based internal analyst tool at BlackRock, fine-tuning an LLM on proprietary financial data and adding four layers of guardrails (input/retrieval/generation/output) to improve grounding and reduce hallucinations. Implemented a LangChain-based multi-agent orchestration (7 major agents) deployed on AWS ECS, with reliability measured via internal human evaluation, LLM-as-judge, and RLHF/drift monitoring.”

PythonSQLRJavaC++Machine Learning+90
View profile
GB

Ganesh Bandi

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and MLOps

USA6y exp
Capital OneUniversity of North Texas

“LLM engineer who has deployed production RAG systems for regulated document QA (PDFs/knowledge bases), emphasizing grounded answers with citations, RBAC, monitoring, and continuous feedback. Demonstrates deep practical expertise in retrieval quality (semantic chunking, hybrid BM25+embeddings, re-ranking), reliability (guardrails, deterministic workflows), and measurable evaluation (golden sets, log replay, A/B tests) while partnering closely with compliance/operations stakeholders.”

A/B TestingAgileAmazon EKSAmazon S3Anomaly DetectionApache Spark+128
View profile
DM

Durga Mahesh Boppani

Screened

Mid-level Backend Software Engineer specializing in distributed cloud-native systems

Gainesville, FL4y exp
Silicon AssuranceUniversity of Florida

“Backend/AI workflow engineer who built production-grade orchestration systems for hardware security verification at Silicon Assurance (Nextflow/Python/Postgres) and a multi-agent LLM-driven regulatory code checking system at the University of Florida. Emphasizes reliability: strict plan/execute/verify boundaries, queue-based isolation, and strong observability/auditability with Prometheus/Grafana and persisted prompts/tool calls.”

PythonJavaCC++JavaScriptSQL+117
View profile
VR

Vineeth Reddy Vallapureddy

Screened

Mid-level Full-Stack Software Engineer specializing in backend microservices and enterprise AI tools

Redwood City, California5y exp
C3 AIUniversity at Buffalo

“Backend/platform engineer with experience across C3.ai (supply chain demand planning) and Amdocs (telecom), working on large-scale data systems and microservices. Has driven first-time adoption experiments of Snowflake + Spark to handle billion-record workloads, built Jenkins-to-Kubernetes delivery pipelines with Nexus artifact management, and implemented Kafka streaming between microservices with HA and retry/error-handling patterns.”

AWSBackend DevelopmentCC++CI/CDDebugging+117
View profile
AF

Allan Farinas

Screened

Senior Full-Stack Software Engineer specializing in Python and AWS

West Covina, CA11y exp
CareRevCal Poly Pomona

“Backend/data engineer who has built production Python microservices (FastAPI) and AWS-native platforms for event ingestion and analytics, combining ECS/Fargate + Lambda with CloudFormation-driven environments and strong secrets/IAM practices. Experienced modernizing legacy logic with parallel-run parity validation and safe phased cutovers, and has demonstrated measurable SQL tuning wins (20–30s down to 1–2s) plus incident ownership in Glue/Step Functions ETL pipelines.”

PythonJavaScriptSQLAWSAWS LambdaAmazon API Gateway+193
View profile
JH

John Hoffman

Screened

Senior Data Engineer specializing in Databricks, Spark, and AWS for government healthcare data systems

Windsor Mill, MD12y exp
GDITUniversity of Virginia

“Python/AWS engineer focused on batch-processing and data workflows, including building reusable S3/boto3 utilities with reliability features and IAM-based auth. Has led low-risk legacy modernizations using parity testing plus a month of parallel production runs, and has owned production issues end-to-end (including fixing a client-side Excel macro) while contributing to significant AWS cost reductions (~$10k/month).”

PythonSQLBashDatabricksApache SparkPySpark+66
View profile
MS

Mohan Shri Harsha Guntu

Screened

Mid-level Data Scientist / Machine Learning Engineer specializing in fraud, risk, and MLOps

Remote, MO7y exp
Northern TrustWebster University

“AI/ML practitioner with Northern Trust experience who has shipped production LLM systems (internal support assistant) using RAG, vector databases, orchestration (LangChain/custom pipelines), and rigorous monitoring/feedback loops. Also built AI-driven fraud detection/risk monitoring solutions in a regulated financial environment, emphasizing explainability (SHAP), audit readiness, and stakeholder trust through dashboards and clear communication.”

PythonRSQLPandasNumPyScikit-learn+137
View profile
GB

Geetha Bommareddy

Screened

Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services

USA5y exp
JPMorgan ChaseTrine University

“At JP Morgan Chase, built and deployed a production LLM-powered RAG knowledge assistant to help fraud investigators and risk analysts quickly navigate regulatory updates and internal policies, reducing investigation delays and compliance risk. Strong focus on secure retrieval (RBAC filtering), reliability (layered testing + observability), and production constraints (latency/SLOs), with Airflow-orchestrated, auditable ML pipelines.”

Amazon EC2Amazon EKSAmazon RedshiftAmazon S3Amazon SageMakerAnomaly Detection+159
View profile
NP

Nikita Prasad

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and scalable data pipelines

Remote, USA5y exp
JPMorgan ChaseUniversity of Dayton

“Built and shipped a production LLM-powered personalized client engagement assistant in the financial domain, balancing real-time recommendations with strict privacy/compliance requirements. Demonstrates strong MLOps/LLMOps depth (Airflow + MLflow, containerized microservices, drift monitoring) and a privacy-by-design approach validated in collaboration with risk and compliance teams.”

PythonPandasspaCyRSQLPySpark+199
View profile
SA

Shiva Adusumilli

Screened

Mid-level Software Engineer specializing in AI agents, backend systems, and data engineering

4y exp
AmazonGeorgia State University

“Amazon engineer who built a production AI agent platform (Python/AWS Strands on Bedrock) that lets teams create tool-using, multi-agent workflows—e.g., agents that auto-triage and resolve customer support tickets by reading internal documentation and collaborating with a research agent. Previously worked in Deloitte on IAM using Ping Identity/Ping DaVinci orchestration, and applies orchestration thinking plus structured evaluation (LLM-as-judge, surveys, automated tests) to improve agent reliability.”

PythonC++JavaJavaScriptTypeScriptMySQL+82
View profile
LS

Likhith Sai Kumar Pasupuleti

Screened

Mid-level Software Engineer specializing in cloud-native microservices and workflow automation

TX, USA5y exp
ServiceNowCalifornia State University, Long Beach

“Enterprise platform engineer/product owner who led end-to-end delivery of customer-facing ServiceNow Service Catalog/workflow solutions, emphasizing reliability, security, and fast iteration. Built React/TypeScript portals with Node.js and Spring Boot backends, and improved microservices reliability at scale using Kafka, monitoring, and robust retry/timeout patterns.”

JavaPythonSQLCC++R+154
View profile
SL

Samuel Luther

Screened

Senior Software Engineer specializing in full-stack systems, data pipelines, and ML

Seattle, WA8y exp
ExponentGeorgia Tech

“Built and productionized an autonomous research agent (AutoGPT) in a Docker/Kubernetes environment with Pinecone-based long-term memory and custom Python tools for analysis, visualization, and report drafting. Implemented layered guardrails (prompt templates, automated validation, self-critique loops, and monitoring) and achieved ~25% reduction in manual report generation time while scaling the workflow to support multiple concurrent users.”

PythonC#JavaJavaScriptTypeScriptGo+116
View profile
PD

Pooja Dokuri

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and cloud MLOps

Remote, USA4y exp
UnitedHealth GroupEast Texas A&M University

“Built and deployed a production LLM + vector search clinical decision support system at UnitedHealth Group, retrieving medical evidence and patient context in real time for prior authorization and risk scoring. Strong in end-to-end RAG architecture (Hugging Face embeddings, Pinecone/FAISS, SageMaker, Redis) plus orchestration (Airflow/Kubeflow) and rigorous evaluation/monitoring, with demonstrated ability to align solutions with clinical operations stakeholders.”

PythonPandasNumPyPySparkScikit-learnSQL+133
View profile
UJ

Utkarsh Joshi

Screened

Senior Data Scientist specializing in ML, NLP, and GenAI analytics

Remote, US7y exp
University of MinnesotaUniversity of Minnesota

“Built and deployed an LLM-powered analytics assistant enabling business users to ask questions in plain English and receive validated Spark SQL executed in Databricks, with a Streamlit/Flask UI. Addressed strict client schema-privacy constraints by implementing a RAG strategy and ultimately leveraging AWS Bedrock and fine-tuned reference docs. Also has production ML pipeline experience using Docker + Airflow and AWS (S3/ECS/EC2) for financial classification models.”

PythonPandasNumPyScikit-learnRSQL+107
View profile
SK

Sharath Kumar

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps

Remote, USA5y exp
HPWilmington University

“AI/ML engineer with HP experience building and productionizing an LLM-powered document intelligence platform (LangChain + Pinecone) to deliver semantic search and contextual Q&A across millions of enterprise support documents. Demonstrates strong MLOps and scaling expertise (Airflow, Kubernetes autoscaling, Triton GPU inference, monitoring with Prometheus/W&B) plus a structured approach to evaluation (A/B tests, shadow deployments, failover) and effective collaboration with non-technical stakeholders.”

PythonSQLPostgreSQLBigQuerySnowflakeBash+142
View profile
NT

Niteesha Thottempudi

Screened

Mid-level Software Engineer specializing in cloud-native microservices and data platforms

Downingtown, PA5y exp
Pike SolutionsNYU

“Backend engineer with experience at Comcast and in healthcare/pharmacy automation (PrimeRx), building Python/Flask services that orchestrate large-scale batch workflows (Airflow) and high-throughput event processing (Kafka). Demonstrated measurable performance wins (cut provisioning latency to ~150–200ms) and strong multi-tenant isolation strategies (Postgres RLS, partitioning), plus practical integration of ML model outputs into production systems with validation and fallback controls.”

PythonJavaCC++JavaScriptHTML+113
View profile
HK

Harini Kv

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Dallas, TX7y exp
EquinixFitchburg State University

“GenAI/data engineering practitioner with production experience across Equinix, Optum, and Citibank—built an Azure OpenAI (GPT-4) + LangChain document intelligence platform processing 1.5M+ docs/month and a HIPAA-compliant Airflow healthcare pipeline handling 5M+ claims/day. Also delivered a real-time fraud detection + explainability system using LightGBM and a fine-tuned T5 NLG component, improving fraud accuracy by 15%+ while partnering closely with compliance stakeholders.”

PythonSQLPySparkBashJavaJavaScript+169
View profile
ES

Edin Samuel Joselyn Chandrakumar

Screened

Senior Engineering Manager specializing in cloud platforms and risk systems

16y exp
Capital OneGovernment College of Technology, Coimbatore

“Engineering leader who proposed and delivered a new API-based document management platform to replace a vendor-dependent system, improving latency by ~1s and availability to 99.9% while migrating legacy data. Also drove Python-based automation of ~12 workflows via third-party API integrations and led an SSO/auth integration focused on backward compatibility and high login success rates.”

A/B TestingAgileAmazon CloudWatchAmazon DynamoDBAmazon ECSAmazon RDS+88
View profile
DA

Divyam Agrawal

Screened

Mid-level Machine Learning Engineer specializing in LLMs and NLP classification systems

Seattle, WA4y exp
Affinity SolutionsUniversity of Washington

“Internship experience building a production RAG+LLM pipeline to map messy card transaction descriptions to merchant brands, including a custom modified-ROUGE evaluation approach for weak/variant ground truth. Improved scalability and cost by moving from a managed LLM endpoint (e.g., Bedrock) to self-hosted vLLM, and orchestrated massive embedding backfills (5,000+ files, 10B+ rows) using an Airflow-triggered SQS + ECS worker architecture with robust retry/DLQ handling.”

A/B TestingAPI DesignAWSAWS CloudFormationAWS LambdaAuto-scaling+110
View profile
MB

Mahesh Babu

Screened

Mid-level Full-Stack Developer specializing in cloud-native FinTech systems

New York, NY4y exp
Goldman SachsClemson University

“Built a lightweight internal JavaScript analytics tracker capturing user interactions (clicks, page views, custom events) with debounced batching, automatic session tracking, and offline event caching via a localStorage-backed append-only queue. Demonstrates practical performance optimization mindset (profiling, memoization/caching) and React performance tuning.”

AgileAmazon EC2Amazon EKSAmazon RDSAmazon S3Angular+97
View profile
DD

Dhyey Desai

Screened

Intern AI/ML Engineer specializing in RAG, multimodal AI, and LLM systems

Los Angeles, California0y exp
NalaUSC

“Built and shipped 'PetPulse,' a production AI pet-health note system that records voice notes, transcribes them, converts transcripts into structured symptom/event data, and supports grounded Q&A over a user’s notes and vet PDFs. Demonstrates full-stack LLM product execution (FastAPI + GPT-4 + Firebase), with concrete reliability/performance work (async endpoints, caching, RAG/embeddings, function calling) and user-centered iteration with a non-technical product stakeholder.”

Apache HadoopBERTCCachingData VisualizationDatabricks+87
View profile
SS

Siva Sai Kumar Mogalluru

Screened

Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare

Remote, USA4y exp
EYUniversity of South Florida

“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”

A/B TestingAgileAnomaly DetectionApache AirflowApache SparkAzure DevOps+138
View profile
SK

SaiTeasmitha Kaja

Screened

Mid-level Full-Stack Software Engineer specializing in Java/Spring Boot and cloud microservices

Houston, TX4y exp
HPEUniversity of Houston

“Backend-focused Python/Flask engineer who has built authentication/profile services with clean modular architecture (blueprints + service layer) and tuned SQLAlchemy/Postgres for scale using indexing, query rewrites, and pagination. Has production-style integration experience for AI/ML via TensorFlow Serving and OpenAI APIs (batching, rate limiting, caching), plus multi-tenant data isolation and high-throughput background processing with Celery/Redis and idempotent jobs.”

AgileAngularApache TomcatAPI GatewayArgo CDAudit Logging+168
View profile
1...363738...119

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersAI EngineersEngineeringAI & Machine LearningData & AnalyticsEducation

Need someone specific?

AI Search