Reval Logo
Home Browse Talent Skilled in PySpark

Vetted PySpark Professionals

Pre-screened and vetted.

PySparkPythonDockerSQLCI/CDAWS
VS

Venkatesh Sanaboina

Screened

Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps

Tampa, FL9y exp
VerizonJawaharlal Nehru Technological University

“Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.”

A/B TestingAgileAmazon RedshiftAmazon S3Amazon SageMakerAnomaly Detection+168
View profile
MP

Meghana P

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and NLP

Illinois, USA5y exp
State FarmSaint Louis University

“AI/ML engineer with forensic analytics and healthcare claims experience (Optum), building production LLM/RAG systems to surface context-driven fraud patterns from unstructured claim notes and explain risk to investigators. Strong in large-scale retrieval performance tuning, legacy API integration with reliability patterns (SQS, circuit breakers), and MLOps orchestration on Airflow/Kubernetes with rigorous testing, monitoring, and stakeholder-friendly interpretability.”

A/B TestingApache SparkAWSAWS LambdaAzure Data FactoryAzure Functions+125
View profile
SM

Sahithi Mogudala

Screened

Mid-level Full-Stack Software Developer specializing in cloud-native microservices

WI, USA3y exp
Cardinal HealthAnderson University

“Full-stack engineer with enterprise experience at Metasystems Inc. (and Qualcomm) building high-traffic, security-sensitive systems—owned a secure transaction processing module end-to-end using Java/Spring Boot, Python/Django, and React. Strong AWS production operations (EKS/ECS/Lambda/RDS/DynamoDB) with IaC (Terraform/CloudFormation), observability, and reliability patterns; also delivered resilient ETL/integration pipelines with idempotency/retries/backfills and achieved a 50% deployment-time reduction through CI/CD and modular refactoring.”

AjaxAmazon CloudFrontAmazon CloudWatchAmazon DynamoDBAmazon EC2Amazon ECS+284
View profile
HS

Harsha Sikha

Screened

Mid-level AI/ML Engineer specializing in Generative AI and data engineering

Armonk, New York4y exp
IBMSaint Peter's University

“IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.”

A/B TestingAgileAnomaly DetectionAPI DevelopmentApache HadoopApache Hive+157
View profile
YL

Yun-Hao Lee

Screened

Junior Machine Learning Engineer specializing in LLM deployment and computer vision

Dallas, TX2y exp
Lab for Intelligent Storage and ComputingUniversity of Texas at Dallas

“Robotics/AI candidate who built an AI-driven landmark location tool during a summer internship at Mobile Drive, combining YOLOv5 object detection with OpenStreetMap-based geolocation to handle dense, cluttered urban environments. Also researched deploying LLM-based agents on constrained hardware using quantization plus LoRA/continuous learning, improving accuracy from ~80% to ~92%, with an emphasis on production logging for reliability.”

PythonCC++RSQLJava+91
View profile
SR

Srikanth Reddy

Screened

Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics

Plainsboro, NJ7y exp
State StreetWilmington University

“Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.”

A/B TestingAgileAmazon BedrockAmazon CloudWatchAmazon EC2Amazon RDS+178
View profile
AS

Ashok Sai Doredla

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

United States5y exp
CVS HealthUniversity of Maryland, Baltimore County

“At CVS Health, the candidate productionized a RAG-based LLM solution in a regulated healthcare setting, emphasizing reliable data pipelines, LoRA fine-tuning, monitoring, safety guardrails, and A/B testing. They have hands-on experience troubleshooting real-time RAG failures (e.g., chunking/embedding issues) and regularly lead developer-focused demos/workshops while translating technical architecture into business value for stakeholders.”

A/B TestingAsynchronous ProcessingAWSAWS LambdaAzure Blob StorageAzure Functions+142
View profile
HC

Harsha Chimirala

Screened

Mid-level Data Engineer specializing in cloud data platforms and scalable ETL pipelines

USA, USA3y exp
HCLTechUniversity of New Haven

“Data engineer (~4 years) with full-stack delivery experience (Next.js App Router/TypeScript + React) building a real-time operations monitoring dashboard backed by Kafka and orchestrated data pipelines. Strong production focus: Airflow + CloudWatch monitoring, automated Python/SQL validation (99.5% accuracy), and CI/CD with Jenkins/Docker; has delivered measurable improvements in latency, pipeline reliability, and query performance (Postgres/Redshift).”

PythonSQLPySparkScalaBashApache Spark+80
View profile
TK

Tharun Kshathriya Sangaraju

Screened

Mid-level AI Engineer specializing in LLM orchestration, RAG, and multi-agent systems

Houston, TX4y exp
University of HoustonUniversity of Houston

“Research Assistant at the University of Houston who built and live-deployed a production RAG system for 1000+ research documents, using hybrid retrieval (dense+BM25+RRF) with cross-encoder reranking and RAGAS-based evaluation; reported 66% MRR, 0.85+ faithfulness, and 68% lower LLM inference costs. Also built a deployed LangGraph multi-agent research system (Researcher/Critic/Writer) with tool integrations (Tavily, arXiv) and dual memory (ChromaDB + Neo4j), plus freelance automation work delivering a WhatsApp chatbot and n8n workflows for a wholesale clothing business.”

API IntegrationApache AirflowApache HadoopApache KafkaApache SparkChromaDB+118
View profile
SC

Sai Charan Reddy Kothakapu

Screened

Mid-level Full-Stack Developer specializing in React/Node, GraphQL, and Databricks lakehouse

Dallas, TX6y exp
Southern Glazer's Wine & SpiritsWebster University

“Full-stack engineer currently at Southern Glazer’s who built and owned a real-time commercial finance expense analytics dashboard end-to-end (Next.js App Router + TypeScript), including post-launch monitoring, data quality checks, and stakeholder-driven iteration. Strong data/analytics backend experience (Postgres modeling and Databricks Delta Lake pipelines) with demonstrated performance wins—e.g., cutting a key reconciliation query from 8–12s to <400ms and improving frontend load time ~40% with a 25% bounce-rate drop at Verizon.”

ReactNext.jsJavaScriptTypeScriptTailwind CSSRedux+99
View profile
SS

Sai Swetha Bodlapati

Screened

Senior Data Engineer specializing in Spark, Kafka, and Databricks Lakehouse platforms

Dallas, TX5y exp
Fidelity InvestmentsNorthwest Missouri State University

“Data engineer at Fidelity who built and operated a real-time financial transactions lakehouse on AWS/Databricks, processing millions of records daily with Kafka streaming. Demonstrated strong reliability and data quality practices (watermarking, idempotent Delta writes, validation/reconciliation, observability) and delivered measurable improvements (~30% faster jobs and ~30% fewer data issues) while enabling trusted gold-layer analytics for downstream teams.”

PythonJavaSQLApache SparkPySparkApache Kafka+110
View profile
HJ

Harikiran Jangam

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG systems

California, USA3y exp
McKessonCalifornia Lutheran University

“Backend engineer who built and evolved a PHI-compliant RAG system (FastAPI + LangChain + embeddings/FAISS) for internal document search and summarization, delivering <400ms p95 latency at ~2,500 daily requests and measurable impact (30% faster investigations, +17% retrieval relevance). Demonstrates strong security and rollout discipline (RBAC/RLS/JWT, redaction/audits, shadow mode, dual writes, canaries) and a focus on reducing hallucination risk via grounded guardrails and confidence-based fallbacks.”

Amazon BedrockApache AirflowApache KafkaApache SparkAWSAWS Lambda+119
View profile
NS

Nisarg Shah

Screened

Junior Machine Learning Engineer specializing in geospatial analytics and computer vision

Tempe, Arizona1y exp
Arizona State UniversityArizona State University

“Built and evolved a geospatial ETL + API platform that processes pixel-wise satellite imagery in PostgreSQL/PostGIS into low-latency farm-level time-series metrics for an interactive dashboard, using precomputed hotspot analysis to reduce latency by 75–80%. Experienced in FastAPI-style API contract design (OpenAPI), caching, server-side filtering/compression, and production-minded security patterns (RBAC, session-derived authorization, password hashing) with disciplined rollback/versioning practices.”

PythonJavaJavaScriptTypeScriptReactSQL+102
View profile
KV

Ketan Verma

Screened

Junior Applied AI Engineer specializing in data pipelines and ML systems

College Station, TX2y exp
ElysiTexas A&M University

“Built an end-to-end wafer-data anomaly detection and reporting system at Samsung using PySpark, Random Forest models, SQL, and Grafana to help engineers track faults and take corrective action. Also has strong UX prototyping and validation practices in Figma plus hands-on front-end/full-stack experience (HTML/CSS/TypeScript), including a student project recognized as best design out of 25 teams, and early-stage startup experience pivoting a product based on user interviews into a real-time in-context feedback overlay.”

PythonSQLC++JavaGitPySpark+59
View profile
RE

Roshan Erukulla

Screened

Mid-level AI/ML Engineer specializing in NLP and Generative AI

Indiana, USA6y exp
Elevance HealthIndiana University Indianapolis

“Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.”

A/B TestingAgileAmazon EC2Amazon ECSAmazon S3Apache Airflow+148
View profile
AS

Abhishek Soni

Screened

Mid-level Full-Stack Developer specializing in React and scalable web applications

Mumbai, India3y exp
Taurus TechnologiesDr. A. P. J. Abdul Kalam Technical University

“Backend/data engineer with hands-on production experience across FastAPI microservices and AWS data platforms. Has delivered serverless and Glue/EMR-based ETL pipelines with strong observability (Prometheus/Grafana/Sentry, CloudWatch/SNS), schema-evolution resilience, and measurable SQL performance wins (5 min to <30 sec). Open to onsite meetings in the Bethesda, MD area and flexible on remote arrangements.”

JavaScriptTypeScriptPythonJavaC++C+80
View profile
JW

Joseph Wonesh

Screened

Senior Full-Stack Software Engineer specializing in modern web apps and cloud platforms

Los Angeles, CA11y exp
SmartiStackUniversity of Florida

“Backend/data engineer focused on production-grade Python microservices and AWS platforms, including a hybrid Lambda + ECS Fargate architecture managed with Terraform and CI/CD. Has hands-on reliability experience (JWT/OAuth, timeouts, retries, centralized error classification) and built AWS Glue/PySpark ETL pipelines consolidating PostgreSQL/RDS, MongoDB, and S3 sources into curated partitioned Parquet datasets. Demonstrated measurable SQL tuning impact (8 minutes to 25 seconds) and disciplined legacy-to-modern migrations with parity validation and UAT sign-off.”

A/B TestingAgileAlgorithmsAmazon CloudFrontAmazon DynamoDBAmazon EC2+273
View profile
LJ

Lokesh Jain

Screened

Senior Data Engineer specializing in cloud data platforms and ML pipelines

5y exp
WayfairUniversity at Buffalo

“Built and deployed AcademiQ Ai, a production LLM-based teaching assistant using GPT/BERT with RAG (LangChain + Pinecone) to handle large student notes and generate adaptive explanations/quizzes. Demonstrated measurable retrieval-quality gains (18% precision improvement, 22% less irrelevant context) by tuning similarity thresholds and chunking based on user satisfaction signals. Also orchestrated terabyte-scale, real-time demand forecasting pipelines using Airflow and Kubeflow on GCP with strong monitoring, shadow deployment, and feedback-loop practices.”

A/B TestingAgileAngularApache HadoopApache KafkaAWS+91
View profile
NR

Nandini Reinthala

Screened

Mid-Level Full-Stack Python Developer specializing in AI and data platforms

Dallas, TX5y exp
Fannie MaeUniversity of Central Missouri

“Full-stack engineer who builds TypeScript/React SPAs on Python (Flask/FastAPI) backends and has hands-on experience integrating AI components (Azure OpenAI, LangChain, vector databases) into user workflows. Has built internal AI-enabled dashboards/search tools for analysts and business users, emphasizing typed API contracts, CI/CD-driven quality, and microservices reliability patterns (monitoring, retries, idempotency) at scale.”

AgileAJAXAmazon CloudFrontAmazon EC2Amazon EMRAmazon RDS+146
View profile
AK

Ajay Kumar Devireddy

Screened

Mid-level AI/ML Engineer specializing in healthcare NLP and MLOps

USA4y exp
CignaTexas Tech University

“ML/AI engineer with healthcare payer experience (Signal Healthcare, Cigna) who has shipped production fraud/claims prediction systems using Python/TensorFlow and exposed them via FastAPI/Flask microservices integrated with EHR and Salesforce. Emphasizes operational reliability and trust—Airflow-orchestrated pipelines with data quality gates plus SHAP-based interpretability, A/B testing, and drift/debug workflows—backed by reported outcomes of 22% lower false payouts and 17% higher model accuracy.”

A/B TestingAgileApache AirflowApache KafkaApache SparkAudit Logging+134
View profile
MD

Mukesh Dontaraboina

Screened

Mid-level Full-Stack Developer specializing in web platforms and cloud (AWS)

United States4y exp
Lincoln FinancialCalifornia State University, Long Beach

“Full-stack engineer with financial services experience (Lincoln Financial) who owned a customer-facing financial portal end-to-end using TypeScript/React and Node/Express. Has hands-on microservices and RabbitMQ event-driven workflows, addressing scale issues like retries/duplicates with idempotency and traceable logging, and built an internal real-time ops/support dashboard to improve monitoring and incident response.”

PythonCC++JavaJavaScriptTypeScript+154
View profile
OR

OBUL REDDY LEKKALA

Screened

Mid-level Data Scientist specializing in predictive modeling, NLP/LLMs, and RAG search systems

Des Moines, IA6y exp
CDS GlobalUniversity of Massachusetts

“Built production LLM/RAG platforms for financial services to enable natural-language Q&A over large policy/compliance document sets stored in Snowflake and SharePoint. Strong in MLOps and orchestration (Airflow, ADF, Step Functions, MLflow) and in solving real production issues like stale embeddings and model performance, including an incremental Snowflake Streams sync that cut processing time from hours to minutes.”

A/B TestingAmazon CloudWatchAnomaly DetectionAWSAWS CodePipelineAWS Glue+124
View profile
RA

Rahul Alle

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

USA4y exp
CVS HealthAnderson University

“Built a production internal LLM/RAG assistant at CVS Health to cut time spent searching long policy and clinical guideline PDFs, combining fine-tuned BERT/GPT models with FAISS retrieval and a FastAPI service on AWS. Demonstrates strong real-world reliability work (document cleanup, hallucination controls, monitoring/drift tracking with MLflow) and close collaboration with non-technical clinical operations teams via demos and feedback-driven iteration.”

A/B TestingAmazon KinesisAmazon RedshiftAmazon S3AutomationAWS+136
View profile
TN

Tejaswini Narayana

Screened

Mid-level Data Scientist & AI/ML Engineer specializing in GenAI and cloud ML

Harrison, NJ5y exp
State FarmMonroe University

“GenAI/LLM engineer who recently built a production compliance assistant at State Farm for KYC/AML and regulatory teams, using AWS Bedrock + LangChain with Textract/Lambda pipelines to extract fields, tag risk, and summarize long documents. Implemented RAG, strict structured outputs, and human-in-the-loop guardrails, and reports automating ~80% of documentation work while reducing review time by ~40%.”

SDLCAgileWaterfallPythonCC+++149
View profile
1...464748...79

Related

Machine Learning EngineersData ScientistsSoftware EngineersData EngineersAI EngineersData AnalystsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search