Reval Logo
Home Browse Talent Skilled in PySpark

Vetted PySpark Professionals

Pre-screened and vetted.

PySparkPythonDockerSQLCI/CDAWS
AM

Aarushi Mahajan

Screened

Junior AI/ML Engineer specializing in LLMs, RAG, and information retrieval

Boston, MA2y exp
University of Massachusetts AmherstUniversity of Massachusetts Amherst

“Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.”

PythonSQLCC++JavaTypeScript+116
View profile
SR

SREEJA REDDY Konda

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and predictive analytics

Kentwood, MI6y exp
Fifth Third BankUniversity of Central Missouri

“AI/ML Engineer at Fifth Third Bank who has shipped production fraud detection and risk analysis systems combining ML models with LLM-powered insights/explanations, including real-time monitoring, drift detection, and automated retraining under regulatory explainability constraints. Also built a hybrid-retrieval internal knowledge-base QA system (+20% top-5 relevance) and delivered a customer support chatbot that reduced first response time by 30% through strong stakeholder collaboration.”

PythonSQLRJavaScalaScikit-learn+102
View profile
NK

Nagaraju Kanubuddi

Screened

Mid-level AI/ML Engineer specializing in fraud detection, recommender systems, and forecasting

Remote, USA4y exp
CitigroupUniversity of Dayton

“ML engineer/data scientist who built and deployed a real-time fraud detection platform at Citi on AWS SageMaker, processing 3M+ daily transactions and improving fraud response by 28%. Combines unsupervised anomaly detection (autoencoders) with ensemble models (XGBoost/Random Forest) plus Airflow/Step Functions orchestration, drift monitoring, and explainability (SHAP) to keep models reliable and compliant in production.”

PythonpandasspaCyRSQLPySpark+172
View profile
BA

Bhavana Anna

Screened

Mid-level AI/ML Engineer specializing in fraud detection and Generative AI (RAG)

USA5y exp
USAAKennesaw State University

“AI/ML engineer who has shipped production LLM and ML systems, including a RAG pipeline that ingested ~500k insurance/client documents to help adjusters answer questions faster and more consistently. Experienced in handling messy real-world document formats, tuning retrieval/chunking, and reducing latency via vector search optimization, precomputed embeddings, and caching. Also built orchestrated fraud-detection deployment workflows using AWS Step Functions and SageMaker, and partners closely with non-technical operations teams on NLP automation.”

AWSAWS CloudFormationAWS LambdaBERTCI/CDClaude+82
View profile
SP

shubham patil

Screened

Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics

New York, NY4y exp
Syracuse UniversitySyracuse University

“Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).”

A/B TestingAnomaly DetectionAPI DevelopmentAWSAzure Machine LearningCI/CD+91
View profile
AS

Allen Saunders

Screened

Senior DevOps/Solutions Engineer specializing in CI/CD, cloud platforms, and API integrations

San Francisco, California11y exp
SpiderOakSan Francisco State University

“Solutions Architect with 5+ years leading pre- and post-sales engagements, focused on taking complex tooling from test/prototype to secure production through a structured discovery-to-deployment approach. Experienced in LLM workflow troubleshooting using tools like Langfuse/Gopher and in developer enablement via concise, hands-on workshops (e.g., Jenkins on Kubernetes at scale). Has navigated internal and external blockers to drive adoption and keep enterprise deals moving (including a Jenkins sale to Love's).”

PythonSQLTensorFlowApache SparkPySparkCelery+107
View profile
MY

Mounika Yalamanchili

Screened

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

USA4y exp
State StreetWebster University

“Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.”

A/B TestingAnomaly DetectionAWS CloudFormationAWS LambdaAzure DevOpsAzure Machine Learning+198
View profile
JP

Jay Patel

Screened

Mid-level AI/ML Engineer specializing in NLP, Document AI, and MLOps

USA6y exp
State StreetPace University

“ML/LLM engineer with production experience building a RAG-based LLM support assistant (FastAPI, Redis, Kafka) with multi-layer validation and human-in-the-loop feedback loops to improve accuracy over time. Has orchestration and MLOps depth using Airflow and Kubeflow on Kubernetes (autoscaling, alerting, monitoring) and delivered measurable ops impact (40% ticket efficiency improvement) by partnering closely with customer support teams.”

PythonRSQLPyTorchTensorFlowscikit-learn+106
View profile
SP

sai Pavan

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines

5y exp
American Family InsuranceGeorge Mason University

“Built a production, real-time insurance claims document-understanding and fraud-detection pipeline using TensorFlow + fine-tuned BERT, deployed on AWS (SageMaker/Lambda/API Gateway) with automated retraining via MLflow and Jenkins. Addressed noisy documents and latency using augmentation and model distillation (3x faster), cutting claims ops manual review by ~50% and reducing fraudulent payouts.”

A/B TestingAmazon API GatewayAmazon EC2Amazon KinesisAmazon RedshiftAmazon S3+157
View profile
SJ

Shanmukha Jwalith Kristam

Screened

Mid-level Data Scientist / ML Engineer specializing in MLOps and Generative AI

Alexandria, Virginia3y exp
Schizophrenia & Psychosis Action AllianceStony Brook University

“Built and deployed an AI agent to help patients navigate complex housing information by scraping and normalizing unstructured data across all 50 U.S. states, then layering a LangChain RAG system with MMR re-ranking to reduce hallucinations. Experienced in orchestrating multi-agent workflows (LangGraph/CrewAI) and production reliability practices (Pydantic-validated outputs, LLM-as-judge evals, tracing). Also delivered stakeholder-facing explainability via SHAP dashboards for a loan-approval predictive model at Welspot.”

RPythonNumPypandasscikit-learnPyTorch+130
View profile
RR

Rajeev Reddy

Screened

Mid-level AI/ML Engineer specializing in NLP and production ML on cloud

4y exp
The HartfordFlorida Atlantic University

“ML engineer/data scientist who deployed a production credit risk + insurance claims triage platform at Hartford Financial, combining XGBoost default prediction with BERT-based document classification. Demonstrated strong MLOps by cutting inference latency to sub-500ms and building drift monitoring plus automated retraining/deployment pipelines (MLflow, CloudWatch, GitHub Actions, SageMaker) with human-in-the-loop review and SHAP-based explainability for underwriting adoption.”

A/B TestingAgileAmazon EC2Amazon RedshiftAmazon S3Anomaly Detection+115
View profile
TB

Teja Babu Mandaloju

Screened

Mid-level Data Scientist/MLOps Engineer specializing in NLP, GenAI, and cloud ML platforms

Chicago, USA5y exp
VosynUniversity of North Texas

“AI/ML engineer who led production deployment of a multimodal (text/video/image) RAG system on GCP using Gemini 2.5 + Vertex AI Vector Search, scaling to 10M+ documents with sub-second latency and +40% retrieval accuracy. Strong MLOps/orchestration background (Kubernetes, CI/CD, Airflow, MLflow) with proven impact on reliability (75% fewer incidents) and deployment speed (92% faster), plus experience delivering explainable ML (XGBoost + SHAP + Tableau) to non-technical retail stakeholders.”

PythonRSQLMATLABC#Scikit-learn+166
View profile
DG

Divya Ganapala

Screened

Mid-level Data Scientist specializing in cloud ML, MLOps, and predictive analytics

Dallas, TX4y exp
UnitedHealth GroupJawaharlal Nehru Technological University, Hyderabad

“NLP/ML engineer with hands-on healthcare and support-ticket text experience, building clinical-note structuring and semantic linking systems using spaCy, BERT clinical embeddings, and FAISS. Emphasizes production-grade delivery (Airflow/Databricks, PySpark, Docker, AWS/FastAPI/Lambda) and rigorous validation via clinician-labeled datasets, retrieval metrics, and user feedback.”

PythonRSQLPySparkPandasNumPy+155
View profile
SB

Shreyansh Bhalani

Screened

Mid-level Full-Stack & ML Engineer specializing in AI SaaS, MLOps, and cloud infrastructure

Edison, NJ3y exp
AffirmoAINYU

“Built and shipped an AI-powered driver ranking/assignment system at AffirmoAI using LLM intent classification + RAG over pgvector/Postgres, served via FastAPI with a React UI that explains scores. Drove measurable improvements through optimization and iteration (latency down to <800ms, adoption 60%→90%+) and implemented rigorous eval loops with dispatcher ground truth plus cold-start handling for new drivers.”

PythonJavaScriptTypeScriptSQLJavaC+++120
View profile
MD

Meet Doshi

Screened

Mid-level Data Engineer specializing in cloud data platforms and AI/ML analytics

Chicago, IL4y exp
EDNANortheastern University

“Backend/data engineer in healthcare who built an AWS-based clinical analytics platform from scratch (DynamoDB/S3/Airflow/dbt) with sub-second clinician query goals, 99.9% uptime, and HIPAA-grade controls (KMS encryption, IAM RBAC, audit trails). Also modernized ML delivery by replacing a manual 4-hour deployment with a 30-minute Docker/GitHub Actions CI/CD pipeline using parallel runs, parity testing, and rollback, and caught critical EHR data edge cases (date formats/timezones) that could have impacted patient care.”

PythonPySparkSQLRJavaScala+120
View profile
HK

Harsha KeladiGanapathi

Screened

Intern Data Scientist specializing in robotics localization and SLAM

Lexington, KY1y exp
InfineonUniversity of New Haven

“Robotics/embodied-AI practitioner who built a TurtleBot3 LiDAR-fingerprint localization pipeline end-to-end (autonomous data collection + multi-head NN) achieving ~30 cm error in a 10x10 m space. Also has industry experience at Infineon building large-scale production data/AI pipelines and rapidly fixing a deployed recommendation system by correcting upstream data normalization, improving accuracy by 20%+.”

BashCC++Deep LearningGazeboGit+143
View profile
MW

Mark Wlodawski

Screened

Senior Software Engineer specializing in Python microservices, cloud platforms, and ML-powered APIs

Orlando, Florida10y exp
CognizantUniversity of Memphis

“Backend/data engineer focused on AWS-native Python systems: built a FastAPI microservice on ECS/Fargate serving real-time analytics at millions of daily requests with strong reliability (OAuth2/JWT, retries/timeouts, correlation IDs) and autoscaling. Also delivered Glue/PySpark ETL pipelines to curated S3 Parquet/Athena with schema evolution + data quality controls, owned Airflow pipeline incidents, and has a track record of measurable performance and cost optimizations (e.g., ~80%+ query latency reduction; reduced logging/NAT/Fargate spend).”

PythonTypeScriptSQLBashJSONFastAPI+183
View profile
RP

Raj Patel

Screened

Junior Machine Learning Engineer specializing in LLMs and RAG systems

Remote, USA1y exp
EmotionallNYU Tandon School of Engineering

“Production-focused applied ML/LLM engineer who has deployed an LLM-powered RAG assistant and improved reliability through rigorous retrieval evaluation (recall/MRR), reranking, and guardrails that prevent confident wrong answers. Experienced running containerized ML/LLM services on Kubernetes (including AWS-managed layers) with CI/CD and observability, and has delivered a real-time predictive maintenance system using streaming sensor data and time-series anomaly detection in close partnership with maintenance teams.”

PythonJavaTensorFlowPyTorchScikit-LearnLarge Language Models (LLMs)+86
View profile
CK

Chaitanya Kalagara

Screened

Mid-level Machine Learning Engineer specializing in LLMs, GenAI, and Computer Vision

Boston, MA3y exp
Camp4 TherapeuticsNortheastern University

“LLM/agent engineer who built a production multi-agent research automation system using LangGraph (planner, retriever with FAISS, supervisor, evaluator) with structured outputs and citation tracking for traceable reports. Emphasizes reliability and operations—LangSmith-based observability, multi-level testing, hallucination mitigation, and latency/cost controls—plus prior experience as a Computer Vision Software Engineer at Deepsight AI Labs working directly with non-technical customers.”

A/B TestingAmazon EC2Amazon S3Amazon SageMakerAWSAWS Lambda+87
View profile
HK

Haritha Kuraparthi

Screened

Mid-level Full-Stack Developer specializing in cloud data engineering and analytics

West Haven, CT4y exp
BlackbaudUniversity of Bridgeport

“Software developer with hands-on experience owning customer-facing work end-to-end (requirements, implementation, testing, and feedback-driven iteration) using Python and React.js. Also described remodeling an internal legacy page/tool to improve performance and accuracy, and has exposure to microservices and RabbitMQ plus ETL-based system work.”

PythonNumPyPandasJavaScriptNode.jsJava+81
View profile
SC

Sai Charan C

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS

CT, USA3y exp
HCLTechUniversity of New Haven

“Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.”

A/B TestingAgileAmazon CloudWatchAmazon DynamoDBAmazon S3Apache Airflow+139
View profile
AP

Alekhya Parimala Koppolu

Screened

Mid-level AI/ML Software Engineer specializing in data pipelines, BI dashboards, and computer vision

Wichita, Kansas3y exp
Friends UniversityFriends University

“Graduate Assistant Intern at Friends University who built and deployed a GenAI-driven requirement understanding system that automates extraction and semantic grouping of technical requirements from large unstructured documents. Demonstrates strong LLM engineering rigor (golden datasets, regression testing, post-processing validation) and production-minded delivery using LangChain/LlamaIndex orchestration, FastAPI microservices, Docker, and cloud deployment.”

PythonSQLRJavaCC+++119
View profile
SS

Swati Swati

Screened

Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps

Florida, United States5y exp
Voltihost LLCStony Brook University

“AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.”

PythonRC++JavaShell ScriptingBash+172
View profile
KK

KHUSHBU KAKDIYA

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and cloud MLOps

California, USA6y exp
CVS HealthCleveland State University

“Built and deployed a production LLM/RAG system at CVS to automate clinical documents, addressing PHI compliance, retrieval accuracy, and latency; achieved a 35–40% reduction in review effort through chunking and FP16/INT8 optimization. Also has experience translating AI outputs into actionable insights for non-technical stakeholders (sports analysts).”

PythonSQLPySparkRBashScikit-learn+114
View profile
1...555657...79

Related

Machine Learning EngineersData ScientistsSoftware EngineersData EngineersAI EngineersData AnalystsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search