Reval Logo
Home Browse Talent Skilled in PySpark

Vetted PySpark Professionals

Pre-screened and vetted.

PySparkPythonDockerSQLCI/CDAWS
TB

Tejas Belakavadi Kemparaju

Screened

Mid-Level Software Engineer specializing in backend, microservices, and ML systems

Newark, NJ3y exp
Exito InfynitesNJIT

“Primary designer/implementer/maintainer of an open-source JavaScript library for programmatic SSML generation and validation in text-to-speech pipelines. Focused on safety-by-default APIs with vendor-specific extension adapters, strong backward compatibility/deprecation practices, and measurable performance gains by removing redundant validation stages. Emphasizes developer experience through example-driven documentation and systematic community issue triage.”

JavaPythonJavaScriptTypeScriptCC+++87
View profile
DS

Deva Sai Kumar Bheesetti

Screened

Mid-level Full-Stack Engineer specializing in data automation, cloud & AI

Lowell, MA5y exp
University of Massachusetts LowellUniversity of Massachusetts Lowell

“JavaScript engineer who effectively "maintains" an internal open-source-style React/Node.js shared library used by multiple teams—owning API stability, semantic versioning, CI/testing, logging, and documentation. Demonstrates strong cross-team debugging and change-management skills (schema-driven refactors, feature flags, validation layers) to ship new features without breaking existing workflows, plus a profiling/benchmarking-driven approach to performance.”

PythonJavaJavaScriptFastAPIFlaskReact+99
View profile
MS

Martin Stidom

Screened

Senior Full-Stack Engineer specializing in scalable React/Next.js platforms

Austin, TX11y exp
NexaTech SolutionsASA College

“Backend/data engineer with strong production experience across Python microservices (FastAPI) and AWS serverless/data platforms (Lambda, API Gateway, Glue, Redshift). Demonstrates reliability and incident ownership (rate limits, retries/circuit breakers, monitoring) and has delivered measurable SQL performance gains (12–15s to <800ms, ~60% CPU reduction). Seeking fully remote work and not open to relocation/onsite meetings.”

AgileAngularAWSAWS LambdaBashBigQuery+157
View profile
TP

Tapan Patel

Screened

Junior Machine Learning Engineer specializing in MLOps and real-time systems

Gujarat, India1y exp
Macrosoft CreationsNortheastern University

“Built and shipped a production GPT-4 + RAG customer support chatbot that materially improved support operations (response time 4 hours to <3 minutes; ~65% tier-1 ticket automation). Demonstrates strong end-to-end LLM engineering across retrieval (Sentence Transformers/Pinecone), safety (multi-layer moderation), cost/latency optimization (caching/streaming, Celery/Redis), and rigorous evaluation/monitoring (shadow deploys, Datadog, 500+ test cases), plus proven stakeholder buy-in leading to 80% adoption.”

A/B TestingAmazon EC2Amazon S3AWS LambdaApache AirflowApache Cassandra+94
View profile
TM

Trinath Manikanta Batta

Screened

Junior AI/ML Engineer specializing in healthcare and financial risk modeling

Bristol, PA3y exp
DermanutureUniversity of South Florida

“Built and productionized a clinical NLP + patient risk stratification platform at Dermanture, combining Spark/PySpark pipelines with BERT/BioBERT for entity extraction and text classification and downstream risk models in TensorFlow/scikit-learn. Experienced running regulated, auditable ML workflows with Airflow and AWS SageMaker, emphasizing data validation (Great Expectations), drift monitoring, and explainability (SHAP) to drive clinician trust and adoption.”

A/B TestingAgileAnomaly DetectionAPI DevelopmentAWS GlueAWS Lambda+95
View profile
VP

Varshitha Pendyala

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and agentic systems

Houston, TX5y exp
Asuitech SolutionsUniversity of Houston

“Built a production "Mini RAG Assistant" for internal document Q&A, focusing on grounded answers (anti-hallucination), retrieval quality, and latency/cost optimization. Uses LangChain/LangGraph for orchestration and applies a metrics-driven evaluation loop (including reranking and semantic chunking improvements) while collaborating closely with product stakeholders.”

AgileAmazon ECSAmazon RedshiftAmazon S3Apache HadoopApache Kafka+164
View profile
YP

Yashwanth P

Screened

Mid-level AI/ML Engineer specializing in Agentic AI and Generative AI

USA6y exp
DoubleneGeorge Mason University

“Built and deployed a production LLM-powered RAG knowledge system to unify operational/policy information across PDFs, wikis, and databases, emphasizing auditability and low-latency/cost performance. Improved answer relevance at scale by moving from pure vector search to hybrid retrieval with metadata filtering and reranking, and partnered closely with healthcare operations/compliance to define acceptance criteria and human-in-the-loop guardrails.”

A/B TestingAgileAnomaly DetectionApache SparkAWSAWS Glue+129
View profile
VM

Vaishnavi M

Screened

Mid-level AI/ML Engineer specializing in MLOps and Generative AI

5y exp
Liberty MutualUniversity of Maryland, Baltimore County

“At Liberty Mutual, built a production underwriting decision assistant combining LLM reasoning with quantitative models and strong auditability. Implemented a claims-based response verification pipeline that cut hallucinations from 18% to 3% and materially improved user trust/validation scores. Experienced orchestrating ML/LLM workflows end-to-end with Airflow, Kubeflow Pipelines, and Jenkins, including SLA-focused pipeline hardening.”

A/B TestingApache AirflowApache KafkaApache SparkAWSAWS Lambda+143
View profile
HP

Harsh Patel

Screened

Senior Data Scientist specializing in LLM applications, RAG systems, and production ML

New York, NY6y exp
Fulcrum AnalyticsUniversity of Maryland, Robert H. Smith School of Business

“Senior Data Scientist in consulting who has built production RAG systems for insurance/annuity document search at large scale (100K+ PDF pages), emphasizing grounded answers, guardrails, and low-latency retrieval. Experienced in end-to-end MLOps for LLM apps—monitoring, evaluation sets, drift handling, and safe rollouts—and in orchestrating complex pipelines with Prefect/Airflow and deploying services on Kubernetes.”

PythonNumPyPandasScikit-learnTensorFlowPyTorch+105
View profile
SV

Satya VM

Screened

Mid-level GenAI/Data Engineer specializing in LLMs, RAG systems, and fraud detection

Ruston, LA7y exp
Origin BankOsmania University

“ML/NLP engineer with banking domain experience who built a GenAI-powered fraud detection and risk intelligence system at Origin Bank, combining RAG (LangChain + FAISS), fine-tuned BERT NER, and GPT-4/Sentence-BERT embeddings. Delivered measurable impact (25% higher fraud detection accuracy, 40% less manual review) and emphasizes production-grade pipelines on AWS SageMaker/Airflow with strong data validation and scalable PySpark processing.”

Generative AILarge Language Models (LLMs)Retrieval-Augmented Generation (RAG)Sentiment analysisMachine LearningDeep Learning+173
View profile
ST

Shreya Thakur

Screened

Mid-level Software Engineer specializing in Python backend and LLM/ML systems

New York, USA4y exp
Saayam for AllUniversity at Buffalo

“Backend/AI engineer who has shipped production LLM systems end-to-end, including an AI request-routing service (FastAPI + BART MNLI + OpenAI/Gemini) that improved accuracy ~25% after launch via eval-driven prompt/category iteration. Also built an enterprise document intelligence/RAG platform on Azure (Blob/SharePoint/Teams ingestion, OCR/NLP chunking, embeddings in Azure Cognitive Search) with PII guardrails (Presidio), confidence gating, and scalable event-driven pipelines handling millions of documents.”

PythonJavaCC++FastAPIFlask+136
View profile
PC

Purva Chakravarti

Screened

Senior Full-Stack Software Engineer specializing in scalable web apps, cloud, and blockchain/AI

Chino, California11y exp
MPRISECalifornia State University, Fullerton

“Full-stack engineer with strong production ownership across React/TypeScript, Node.js, and AWS (EC2/ECS/RDS/CloudWatch), including CI/CD, observability, and incident response. Delivered a secure RBAC workflow module end-to-end and achieved measurable gains (~30–40% latency reduction, ~50% error reduction) that lowered infra/ops costs. Comfortable in high-ambiguity startup environments—shipped a payment module within 2 days of joining with no documentation.”

Full-stack developmentAPI developmentREST APIsSOAPMVCMicroservices+237
View profile
CK

Charith Kandula

Screened

Mid-level Conversational AI Engineer specializing in enterprise chatbots and workflow automation

Miami, FL4y exp
Lid VizionUniversity of South Dakota

“Built a production LLM/RAG document extraction and game/quiz content workflow using LLaMA 2, LangChain/LangGraph, and FAISS, achieving ~94% accuracy and reducing turnaround from hours to minutes. Demonstrates strong applied MLOps/orchestration (CI/CD, MLflow, Databricks/PySpark), robust handling of noisy/variable document layouts (layout chunking + OCR fallbacks), and practical reliability practices (human-in-the-loop routing, drift monitoring, A/B testing).”

A/B TestingAnalyticsAPI DevelopmentAudit LoggingAWSCI/CD+241
View profile
NH

Nicholas Homme

Screened

Senior Full-Stack Developer specializing in React, Node.js, and AWS

Los Angeles, CA9y exp
SmartiStackUniversity of South Florida

“Backend/data engineer with hands-on production experience across Python/Flask microservices and AWS serverless/data platforms (Lambda, DynamoDB, S3, Glue/PySpark). Demonstrated strong reliability and operations mindset (JWT/RBAC, retries/timeouts/circuit breakers, CloudWatch/SNS alerting) and measurable performance wins (SQL report runtime cut from 10 minutes to 30 seconds). Seeking ~$150k base and cannot travel for onsite meetings for the next 5–6 months due to family medical constraints.”

A/B TestingAlgorithmsAngularJSApache KafkaAPI DesignAPI Testing+358
View profile
NK

Narayana Koushik kancharla

Screened

Intern Data Scientist specializing in Generative AI and NLP

United States2y exp
HCLTechUniversity of New Haven

“Backend/AI engineer with internship experience building an AI-powered financial insights platform (FastAPI, Redis, BigQuery) and prior HCL experience leading a monolith-to-microservices refactor (Flask, Kafka) using blue-green deployments. Demonstrates strong performance/security focus (OAuth/JWT/RBAC, encryption) and measurable impact on latency, downtime, and ML model reliability; MVP was submitted to Google’s accelerator program.”

A/B TestingApache KafkaApache HiveApache SparkBERTBigQuery+132
View profile
HC

Harsh Chauhan

Screened

Junior AI Engineer specializing in Generative AI, RAG, and NLP

Remote, US3y exp
TickerIndiana University Bloomington

“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”

A/B TestingAPI IntegrationAWSCI/CDCC+++120
View profile
BK

Bhargavi Karuku

Screened

Mid-level AI Engineer specializing in ML, NLP, and Generative AI

Atlanta, GA4y exp
CGIUniversity of New Haven

“AI/LLM engineer with production experience building an LLM-powered investment recommendation system using RAG and chatbots, deployed via Docker/CI/CD and scaled on Kubernetes. Demonstrated measurable performance wins (sub-200ms latency) through QLoRA fine-tuning and TensorRT INT8/INT4 quantization, plus strong MLOps/orchestration background (Airflow ETL + scoring, MLflow monitoring) and stakeholder-facing delivery using demos and Tableau dashboards.”

A/B TestingAgileAWSAzure Machine LearningBigQueryClaude+129
View profile
MP

Mehul Parmar

Screened

Mid-level Data Scientist specializing in insurance, healthcare, and cloud analytics

Somerset, NJ4y exp
P&F SolutionsLong Island University

“Built a production-style LLM document summarization/generation workflow that mitigates token limits and reduces hallucinations using semantic chunking, FAISS-based embedding retrieval (top-k via cosine similarity), and section-wise generation. Orchestrated the end-to-end pipeline with AWS Step Functions and aligned outputs with sales stakeholders through demos, visuals, and documentation.”

PythonRSQLSupervised LearningUnsupervised LearningClassification+98
View profile
JP

Jhansi Priya

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and agentic workflows

Remote, null6y exp
fundae software IncUniversity of Dayton

“Applied AI/ML engineer with hands-on production experience building a RAG-based AI assistant for pharmaceutical maintenance troubleshooting using LangChain + FAISS/Pinecone, including a custom normalization layer to handle inconsistent terminology and duplicate document revisions. Also built Airflow-orchestrated pipelines for document ingestion/embeddings and predictive maintenance workflows (SCADA ETL, drift-based retraining), and partnered closely with production supervisors/quality engineers via Power BI dashboards and real-time alerts.”

AgileApache KafkaApache SparkAWSAWS GlueAWS Lambda+129
View profile
AF

Anisha Fernandes

Screened

Mid-Level Software Engineer specializing in FinTech and LLM-powered data products

Los Angeles, California3y exp
California State University, Long BeachCalifornia State University, Long Beach

“Full-stack engineer with payments/settlement domain experience who modernized a payment tracking workflow from REST to GraphQL and delivered a production payment status dashboard using Next.js App Router + TypeScript. Strong in performance and reliability work (Postgres indexing/Explain Analyze, Redis caching, Datadog observability) and in durable event-driven processing with Kafka (DLQs, idempotency, reconciliation, event replay).”

PythonJavaTypeScriptHTMLCSSTailwind CSS+112
View profile
UD

Urvashi Dhingra

Screened

Mid-Level Software Engineer specializing in full-stack, cloud, and data platforms

Remote, NY4y exp
Global Mobile Software LLCRochester Institute of Technology

“Backend/full-stack engineer who has owned production TypeScript systems in both fintech-style transaction/rewards flows and HIPAA-regulated healthcare platforms. Deep focus on correctness and reliability (idempotency, retries/DLQs, reconciliation, observability) plus strong infra automation (Docker/Terraform/CI-CD) and measurable performance wins (40% query improvement, 90% test coverage).”

PythonGoJavaC++C#JavaScript+116
View profile
SM

Sakshi More

Screened

Mid-level Full-Stack Software Engineer specializing in cloud, data science, and ML systems

Texas, USA4y exp
Granite ConstructionUniversity of Texas at San Antonio

“Backend/data engineer focused on AWS-based, low-latency event processing for market data and social-signal sentiment systems. Has led a monolith-to-event-driven migration with feature-flagged incremental rollout, and emphasizes production-grade security (OAuth2/JWT, secrets management, Supabase RLS) and data integrity (deduplication/idempotency) under high-volume spike conditions.”

AgileAmazon CloudWatchAmazon DynamoDBAmazon ECSAmazon RDSAnsible+128
View profile
VS

Vikram Sandigaru

Screened

Mid-level AI Engineer specializing in AI agents, RAG pipelines, and LLM evaluation

Boston, US3y exp
FounderWayNortheastern University

“Built and shipped production LLM systems at Founderbay, including a low-latency voice agent and a graph-based multi-agent research assistant. Strong focus on reliability in real workflows—hybrid SERP + full-site scraping RAG, grounding guardrails, validation checkpoints, and transcript-driven evaluation—plus performance tuning with async FastAPI, Redis caching, and containerization. Also partnered with a non-technical ops lead to automate post-call follow-ups via call summarization, field extraction, and tool-triggered actions.”

A/B TestingAWSCI/CDData ValidationDatabricksDebugging+85
View profile
PT

Phani Tarun Munukuntla

Screened

Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps

New York, USA2y exp
University at BuffaloUniversity at Buffalo

“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”

PythonPySparkApache AirflowJavaJavaScriptSQL+121
View profile
1...676869...78

Related

Machine Learning EngineersData ScientistsSoftware EngineersData EngineersAI EngineersData AnalystsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search