Reval Logo
Home Browse Talent Skilled in Apache Spark

Vetted Apache Spark Professionals

Pre-screened and vetted.

Apache SparkPythonDockerSQLAWSCI/CD
DP

DHYAN PATEL

Screened

Mid-level AI Engineer specializing in NLP and production ML systems

Tempe, AZ3y exp
MindSparkArizona State University

“AI/LLM engineer who has shipped production RAG chatbots using LangChain/OpenAI with FAISS and FastAPI, focusing on real-world constraints like context windows, concurrency, and latency (reported ~40% latency reduction and <2s average response). Experienced orchestrating AI pipelines with Celery and fault-tolerant long-running workflows with Temporal, and has applied NLP model tradeoff testing (Word2Vec vs BERT) to drive measurable accuracy gains.”

AgileAlertingAmazon API GatewayAmazon DynamoDBAmazon EC2Amazon S3+125
View profile
BK

Bhargavi Karuku

Screened

Mid-level AI Engineer specializing in ML, NLP, and Generative AI

Atlanta, GA4y exp
CGIUniversity of New Haven

“AI/LLM engineer with production experience building an LLM-powered investment recommendation system using RAG and chatbots, deployed via Docker/CI/CD and scaled on Kubernetes. Demonstrated measurable performance wins (sub-200ms latency) through QLoRA fine-tuning and TensorRT INT8/INT4 quantization, plus strong MLOps/orchestration background (Airflow ETL + scoring, MLflow monitoring) and stakeholder-facing delivery using demos and Tableau dashboards.”

A/B TestingAgileAWSAzure Machine LearningBigQueryClaude+129
View profile
SG

Srinivasan GomadamRamesh

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and production inference

Redmond, WA7y exp
Quadrant TechnologiesUniversity of Texas at Dallas

“AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.”

PythonPandasNumPySciPyScikit-learnTensorFlow+100
View profile
VM

Venkata Morla

Screened

Mid-level Full-Stack Software Engineer specializing in cloud-native microservices

USA4y exp
State FarmUniversity of Bridgeport

“DevOps engineer (State Farm) with hands-on ownership of Python backend services and data pipelines, deploying microservices and workers on Kubernetes using GitOps (Argo CD). Has led complex cloud-to-on-prem/hybrid migrations with staged cutovers and rollback planning, and built Kafka-based real-time streaming pipelines with schema governance, autoscaling, and strong observability.”

PythonJavaJavaScriptC#TypeScriptAngular+118
View profile
BP

Bharat Potluri

Screened

Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems

Fort Worth, Texas8y exp
Ingram MicroUniversity of North Texas

“LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.”

PythonRSQLJavaC#HTML+134
View profile
JP

Jhansi Priya

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and agentic workflows

Remote, null6y exp
fundae software IncUniversity of Dayton

“Applied AI/ML engineer with hands-on production experience building a RAG-based AI assistant for pharmaceutical maintenance troubleshooting using LangChain + FAISS/Pinecone, including a custom normalization layer to handle inconsistent terminology and duplicate document revisions. Also built Airflow-orchestrated pipelines for document ingestion/embeddings and predictive maintenance workflows (SCADA ETL, drift-based retraining), and partnered closely with production supervisors/quality engineers via Power BI dashboards and real-time alerts.”

AgileApache KafkaApache SparkAWSAWS GlueAWS Lambda+129
View profile
AB

Akshara Bhukya

Screened

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

Remote4y exp
KGS Technology GroupStevens Institute of Technology

“LLM/RAG engineer who has built and shipped production assistants, including a RAG-based teaching assistant (Marvel AI) using LangChain/LlamaIndex/ChromaDB with OpenAI embeddings and Redis vector search, achieving ~30% accuracy gains and ~35% latency reduction. Also deployed FastAPI services on Google Cloud Run with observability and prompt-level monitoring, and partnered with non-technical ops stakeholders to deliver an internal policy-document RAG assistant.”

PythonRC++SQLScikit-learnPandas+112
View profile
UD

Urvashi Dhingra

Screened

Mid-Level Software Engineer specializing in full-stack, cloud, and data platforms

Remote, NY4y exp
Global Mobile Software LLCRochester Institute of Technology

“Backend/full-stack engineer who has owned production TypeScript systems in both fintech-style transaction/rewards flows and HIPAA-regulated healthcare platforms. Deep focus on correctness and reliability (idempotency, retries/DLQs, reconciliation, observability) plus strong infra automation (Docker/Terraform/CI-CD) and measurable performance wins (40% query improvement, 90% test coverage).”

PythonGoJavaC++C#JavaScript+116
View profile
MD

Mrudula Devaguptapu

Screened

Mid-Level Software/AI Engineer specializing in backend systems, data pipelines, and RAG automation

United States3y exp
360DMMC ConsultingSaint Louis University

“Backend engineer with experience modernizing high-traffic subscription and payment systems (TCS) by moving to event-driven Spring Boot microservices with Kafka, adding idempotency/state management to eliminate duplicate processing. Built and scaled FastAPI services for AI automation workflows (360DMMC) with versioned contracts, JWT security, and strong observability, and has led live refactors using feature flags, parallel runs, and data reconciliation.”

API GatewayApache KafkaApache SparkAWSAWS IAMAWS Lambda+104
View profile
NR

Nagendra Reddy Palugulla

Screened

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

Florida, United States4y exp
Community Dreams FoundationUniversity of Houston

“Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.”

PythonPyTorchTensorFlowApache SparkScikit-learnHTML+119
View profile
DS

Durga Samhitha Muvva

Screened

Junior AI Engineer specializing in LLMs, RAG, and MLOps

San Jose, California2y exp
ReferU.AISan José State University

“At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.”

PythonSQLJavaNumPyPandasSciPy+110
View profile
PT

Phani Tarun Munukuntla

Screened

Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps

New York, USA2y exp
University at BuffaloUniversity at Buffalo

“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”

PythonPySparkApache AirflowJavaJavaScriptSQL+121
View profile
SK

Satish Kumar Reddy

Screened

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

Remote, NJ5y exp
Tungsten AutomationPace University

“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”

PythonRJavaC++SQLPostgreSQL+142
View profile
GA

Gopichand Amaraneni

Screened

Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems

USA4y exp
CitiusTechNorthwest Missouri State University

“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”

PythonNumPyPandasJSONSQLPostgreSQL+151
View profile
AP

Ayushi Patel

Screened

Mid-level Software Engineer specializing in cloud data platforms and serverless ETL

Redmond, WA6y exp
HCLTechIllinois Institute of Technology

“Data/ML engineer from HCLTech who modernized enterprise data by linking fragmented financial and supply-chain data across SAP/SQL Server/Snowflake using NLP entity linking and embeddings (FAISS). Delivered measurable impact including ~40% reduction in manual error-log triage and entity-linking accuracy improvements from ~86% to ~93%, with results surfaced in Power BI for real-time analytics.”

PythonTypeScriptJavaScriptSQLShell ScriptingNode.js+105
View profile
YM

Yang MA

Screened

Junior Backend Software Engineer specializing in search, data systems, and LLM applications

New York, NY2y exp
Bevel HealthUniversity of Pittsburgh

“Built a contract and customer documentation retrieval solution for Urban Studio, designing a RAG + Elasticsearch hybrid search stack (RRF + cross-encoder reranking) with a strong emphasis on chunking/data quality and hallucination reduction. Experienced in diagnosing LLM workflow issues via observability traces and tailoring technical demos to developer concerns like reliability and high concurrency.”

PythonTypeScriptGoSQLJavaScriptNext.js+102
View profile
MC

Meghana Chowdary Borra

Screened

Junior Machine Learning Engineer specializing in predictive modeling and GenAI RAG systems

Buffalo, New York2y exp
AFAD AgencyUniversity at Buffalo

“LLM engineer who built and deployed an emotionally intelligent AAC communication system using an emotion-aware RAG pipeline (Empathetic Dialogues + GoEmotions) and a PEFT-adapted model. Experienced with LangChain/LangGraph and custom Python orchestration, focusing on reliability (guards, schema validation, fallbacks), latency optimization, and rigorous evaluation (automatic metrics + human-in-the-loop), with a reported 18% user satisfaction improvement.”

A/B TestingCI/CDDeep LearningFeature EngineeringGitHub ActionsLSTM+122
View profile
AP

Aneri Patel

Screened

Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval

Washington, D.C.2y exp
Enquire AI, Inc.George Washington University

“Backend engineer with legal-tech and AI workflow experience: built JurisAI, an end-to-end legal research system using OCR + embeddings + Pinecone vector search to deliver citation-grounded LLM answers with safe failure modes (~90% recall@K). Also led a GW Law metadata migration into Caspio with batch validation and parallel rollout, and has strong FastAPI/GCP production reliability and observability practices.”

PythonTypeScriptSQLRJavaMachine Learning+133
View profile
TC

Tamanna Choithani

Screened

Intern Full-Stack Software Engineer specializing in web apps and applied AI

Bay Area, USA1y exp
BottlelyArizona State University

“Full-stack engineer who built an AI-based inventory/procurement query system at Botlily/Botlerly using Flask and Google Sheets as a live knowledge base, overcoming Sheets latency with caching and structured in-memory models. Demonstrated strong LLM product engineering (40% accuracy improvement via preprocessing/prompting) and customer-driven iteration with bar/restaurant owners, evolving the tool into a more comprehensive inventory management and forecasting solution.”

PythonCC++JavaGoJavaScript+123
View profile
TM

Tanay Mehendale

Screened

Junior Data Engineer specializing in LLM agents and RAG pipelines

San Jose, CA3y exp
Texas A&M UniversityTexas A&M University

“Built and deployed “ApartmentFinder AI,” a multi-agent system using Google ADK, Gemini, and Google Maps MCP to automate apartment shortlisting and commute-time analysis, cutting a 45–70 minute user workflow down to ~30 seconds. Also has strong delivery/process chops from serving as an SDLC Release Coordinator, managing 52+ releases and reducing SDLC issues by 84%.”

AgileAmazon EC2Amazon RDSAmazon RedshiftAmazon S3Anomaly Detection+86
View profile
LM

Lakshmi Meghana

Screened

Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP

Bristol, PA4y exp
DermanutureStevens Institute of Technology

“Built and deployed a transformer-based clinical document classification system that processes unstructured clinical notes in a HIPAA-compliant healthcare setting, served via FastAPI on AWS and integrated into an Airflow/S3 pipeline. Demonstrates strong end-to-end MLOps skills (data quality remediation, low-latency inference optimization, monitoring with MLflow/CloudWatch) and effective collaboration with clinicians to drive adoption.”

PythonC++RSQLBashPyTorch+112
View profile
PG

Prabhdeep Gandhi

Screened

Mid-level Software Engineer specializing in real-time IoT and event-driven platforms

5y exp
Eagl TechnologySavitribai Phule Pune University

“Founding engineer at a startup building LLM/agentic workflows for public-safety customers, with hands-on experience delivering a hybrid on-prem + secure cloud solution to meet strict compliance needs. Implemented OpenTelemetry observability for multimodal agentic systems behind closed networks and used the resulting traces to optimize prompting/token usage for customer-specific security integrations. Regularly runs technical workshops and supports pre/post-sales by translating integration feedback into product roadmap decisions.”

GoPythonC++JavaJavaScriptTypeScript+101
View profile
KV

Krishna vamsi Dhulipalla

Screened

Mid-level Software & ML Engineer specializing in agentic LLM systems and ML infrastructure

Remote4y exp
Cloud Systems LLCVirginia Tech

“Built and deployed an LLM-to-SQL automation system in a closed/internal environment, using a retriever–reranker–validator architecture on Kubernetes with strong security controls (semantic + rule-based validation and RBAC), achieving 99% uptime and cutting manual query time ~40%. Also worked on genomic sequence classification and semantic search workflows, orchestrating data prep with Airflow, tracking/deploying with MLflow, and optimizing distributed multi-GPU training on a university Kubernetes cluster.”

A/B TestingApache KafkaApache SparkAWSBERTBigQuery+119
View profile
HP

hetvi patel

Screened

Mid-level Software/Data Engineer specializing in cloud ETL pipelines and data infrastructure

New Jersey5y exp
Plore AIAvila University

“Backend/data engineer who built a production analytics data service (Python/FastAPI on AWS/Postgres with PySpark ETL) handling millions of records per day and drove major latency improvements (10–15s to <2s) via indexing, Redis caching, and shifting aggregations into ETL. Also shipped an LLM-based natural-language-to-SQL assistant end-to-end with strong guardrails (schema restrictions, read-only validation, RBAC, masking) and designed a multi-step agent workflow with verification and fallback logic.”

AgileAngularAPI DesignApache HadoopApache SparkAutomation+128
View profile
1...104105106...118

Related

Machine Learning EngineersSoftware EngineersData ScientistsData EngineersSoftware DevelopersAI EngineersEngineeringAI & Machine LearningData & AnalyticsEducation

Need someone specific?

AI Search