Reval Logo
Home Browse Talent Skilled in Data Cleaning

Vetted Data Cleaning Professionals

Pre-screened and vetted.

Data CleaningPythonSQLDockerAWSpandas
SK

Sai Karthik

Senior Data Scientist specializing in ML, fraud risk, and Generative AI (RAG/LLMs)

Dallas, Texas5y exp
JPMorgan ChaseUniversity of North Texas
PythonSQLPostgreSQLMySQLGitREST APIs+96
View profile
ND

Noah Dcruz

Junior Research Data Scientist specializing in healthcare analytics and real-world evidence

Boston, MA2y exp
Mass General BrighamNortheastern University
PythonNumPyPandasMatplotlibScikit-learnSciPy+67
View profile
NM

Naveen Malavath

Mid-level Data Scientist / ML Engineer specializing in LLMs and predictive analytics

4y exp
New York Life
PythonSQLRBashGitGitHub+108
View profile
PV

Pavani Vankayala

Senior RPA Developer/Analyst specializing in UiPath automation and enterprise integrations

14y exp
Disney
Change ManagementMentoringCode ReviewsPerformance TestingTest Case DesignAPI Integration+73
View profile
PK

Pavan Kalyan

Screened

Mid-level AI Engineer specializing in GenAI agents and RAG for IT operations

4y exp
DeloitteUniversity of North Texas

“Built and operates a production LLM agent for enterprise IT operations that triages and drafts resolutions for high-volume ServiceNow tickets using LangChain + RAG (Pinecone/pgvector) and AWS Bedrock/OpenAI. Emphasizes reliability with schema-validated stages, offline eval datasets from real tickets, and CloudWatch-driven monitoring/guardrails; system scales to 40K+ tickets/month and cut resolution time ~28%.”

PythonSQLJavaScriptGenerative AIMachine LearningOpenAI API+89
View profile
NC

Nightvid Cole

Screened ReferencesStrong rec.

Senior Computer Vision & Sensor Algorithms Engineer specializing in imaging systems

Saratoga, CA7y exp
Early-Stage StartupUniversity of Maryland, College Park

“Robotics/remote-sensing software engineer who built and validated multisensor image-processing and spectral chemical-detection pipelines (RX anomaly detection, ACE), including calibration protocols with a motorized shutter and rigorous data QC. Uses white-box NumPy simulators to debug SLAM/registration issues before translating logic to C++, and partnered with hardware teams to solve temperature-driven signal variation via combined software calibration and improved thermal management.”

MATLABPythonOpenCVSciPyNumPyPandas+90
View profile
AC

Alicia Chaney

Screened ReferencesStrong rec.

Senior Talent Acquisition & Talent Development Leader specializing in early-career and workforce transformation

Remote, TX9y exp
VisaSouthern New Hampshire University

“Talent/Recruiting Operations leader who has managed teams up to 6 and specializes in fixing broken interview pipelines through workflow standardization, scheduling automation, and ATS analytics. Built real-time dashboards in Lever/ATS to track aging, time-in-stage, and req health, and drove a 20% reduction in time to interview while improving candidate experience and hiring manager visibility. Experienced leading cross-functional tool/workflow implementations with HRIS and IT for high-volume early-career hiring.”

Workforce PlanningPerformance ManagementAutomationOnboardingCoachingProgram Management+130
View profile
YR

Yaswanth Reddy Seelam

Mid-level AI/ML Developer specializing in FinTech fraud detection and GenAI assistants

MO, USA4y exp
Edward JonesUniversity of Central Missouri
A/B TestingAnomaly DetectionApache HadoopApache SparkAWSCI/CD+70
View profile
RR

Rishika Reddy

Mid-level Data Scientist specializing in financial ML, NLP, and MLOps

San Diego, CA5y exp
Morgan StanleySan Diego State University
A/B TestingAgileAmazon S3Anomaly DetectionApache AirflowApache Kafka+135
View profile
JB

Jayeetra Bhattacharjee

Screened ReferencesStrong rec.

Mid-level AI/ML Engineer specializing in LLMs, NLP, and analytics automation

Bristol, UK4y exp
TCSUniversity of Bristol

“AI/ML Engineer (TCS) who built and deployed a production LLM-powered audit transaction validation service to reduce manual review of unstructured transaction records and comments. Implemented a LangChain/Python pipeline for extraction/normalization and discrepancy detection, with strong production reliability practices (decision logging, dashboards, labeled eval sets) and a human-in-the-loop auditor feedback loop to improve precision/recall under strict data-sensitivity and near-real-time constraints.”

AWSAnomaly DetectionAuthenticationAutomationBusiness IntelligenceCI/CD+121
View profile
JL

Joseph Lin

Screened ReferencesModerate rec.

Intern Software Engineer specializing in full-stack development and applied AI

New York, NY0y exp
Real Value CapitalNYU

“Internship experience building an end-to-end medical AI pipeline that extracts and normalizes messy medical PDFs, fine-tunes BioBERT to classify tumor-related statements (including negation/ambiguity handling), and integrates image-model outputs (MedSAM/GroundingDINO) for tumor localization and classification. Also worked on an LLM/RAG system to draft IPO prospectuses using retrieved regulatory/financial sources (including SEC EDGAR) with structured prompts to reduce hallucinations.”

AlgorithmsAmazon EC2AWSAuthenticationAuthorizationChromaDB+123
View profile
VK

Vamsi Koppala

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

Barrington, IL4y exp
ComericaTexas Tech University

“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”

AgileApache SparkAzure Blob StorageBashBERTBitbucket+178
View profile
HS

HIMANSHU SHARMA

Screened

Mid-level AI Solutions Engineer specializing in enterprise GenAI and automation

Orlando, FL6y exp
Kore.aiUniversity of South Florida

“Built and shipped multiple production LLM/agentic systems, including an agentic RAG NL-to-SQL analytics app that cut manual reporting from 9 hours/week to 15 minutes by grounding on schema-aware retrieval and robust fallback/monitoring. Also implemented a LangChain supervisor-orchestrated enterprise IT automation agent that routes requests for search, identity validation, and action execution, and created a RAG search tool spanning Jira/Confluence/SharePoint for operations stakeholders.”

PythonPyTorchTensorFlowScikit-learnHugging Face TransformersSQL+121
View profile
PV

Prithviraju Venkataraman

Screened

Mid-level AI/ML Engineer specializing in MLOps, NLP, and Computer Vision

Long Beach, CA5y exp
Dell TechnologiesCal State Long Beach

“Built and deployed a production LLM-powered text extraction/classification system that converts messy unstructured reports into searchable insights, running on AWS SageMaker with automated retraining and monitoring. Strong in orchestration (Step Functions/Kubernetes/Airflow patterns) and reliability practices (gold datasets, prompt/tool unit tests, shadow/canary/A-B testing, guardrails/rollback), and has experience translating non-technical stakeholder needs into an NLP workflow plus dashboard.”

PythonRTensorFlowPyTorchScikit-learnKeras+110
View profile
AK

AnilKumar Kanakadandila

Screened

Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps

San Francisco Bay Area, CA5y exp
VerizonCalifornia State University

“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”

PythonPandasNumPyScikit-learnSQLMySQL+105
View profile
CB

Cary Burdick

Screened

Senior Data Scientist specializing in data engineering and analytics

Chicago, IL6y exp
USDAAuburn University

“Data/NLP practitioner with experience in both financial services (Truist) and government (USDA), including an NLP-driven analysis of EU regulations to anticipate US regulatory focus and a major redesign/cleaning of complex pathogen lab-test public datasets. Built production data-quality pipelines with Dagster, Pandera, and Azure Synapse, and is comfortable validating hypotheses with historical backtesting and SME-driven quality controls.”

PythonPySparkPandasNumPyRSciPy+53
View profile
SP

Sagar Patel

Screened

Mid-level Full-Stack Python Developer & Data Engineer specializing in ETL and web platforms

Arizona, United States6y exp
GoDaddyCampbellsville University

“Backend engineer who led major modernization efforts at GoDaddy, migrating legacy Perl services to Python/FastAPI with an incremental rollout strategy, containerization (Docker/Kubernetes), and CI/CD (Jenkins/GitHub Actions). Strong focus on secure, reliable API design (JWT, RBAC, PostgreSQL row-level security), rigorous testing, and data integrity—plus experience hardening an automated web-scraping pipeline against changing site structures and downtime.”

PythonSQLJavaScriptDjangoFlaskFastAPI+73
View profile
KP

Kavya Paluvai

Screened

Mid-level Data Scientist specializing in fraud detection and healthcare ML

North Carolina, USA4y exp
Wells FargoUniversity of North Carolina at Charlotte

“Applied NLP/ML in healthcare and financial services, including fine-tuning BERT on unstructured EHR text and building embedding-based similarity search for clinical concepts. Also redesigned a Wells Fargo fraud detection data pipeline using modular Python + AWS Glue/Step Functions, cutting runtime ~40% with improved monitoring and reliability.”

A/B TestingAWSAWS GlueAWS LambdaAWS Step FunctionsAzure DevOps+117
View profile
SR

Sahithi Reddy

Screened

Mid-level Machine Learning Engineer specializing in LLM-powered products

Dallas, TX4y exp
VerizonUniversity of Massachusetts Dartmouth

“Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.”

Machine LearningArtificial IntelligenceDeep LearningPyTorchTensorFlowKeras+110
View profile
PK

PHANINDRA KETHAMUKKALA

Screened

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA4y exp
GE HealthCareFranklin University

“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”

A/B TestingAgileApache KafkaApache SparkAWS GlueAWS Lambda+170
View profile
AS

Aditya Sairam

Screened

Mid-Level Software Engineer specializing in cloud data platforms and AI search

Troy, MI6y exp
Robotics Technologies LLCCleveland State University

“Open-source JavaScript contributor focused on data visualization, extending Chart.js/React with custom plugins for real-time streaming dashboards. Designed an end-to-end telemetry pipeline using Apache Kafka and Azure Cosmos DB, optimizing partitioning, batching, caching, and client throttling to keep latency low and support thousands of concurrent users. Demonstrates strong ownership in fast-changing environments, including building full-stack AI applications and ingestion/ETL pipelines at Robotics Technologies LLC.”

Apache KafkaAWSAWS LambdaAzure FunctionsC#Cloud Computing+89
View profile
PV

PAVAN VARMA PENMETHSA

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

New York City, NY6y exp
AvanadeUniversity of North Texas

“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”

Machine LearningGenerative AILarge Language Models (LLMs)Prompt EngineeringRetrieval-Augmented Generation (RAG)Embeddings+131
View profile
YT

Yaswanth Thota Thota

Screened

Mid-level Data Analyst specializing in financial risk and healthcare analytics

AZ, USA4y exp
Wells FargoArizona State University

“AI/ML engineer focused on real-time, production-grade LLM systems, with a robotics-adjacent mindset around latency/accuracy tradeoffs and modular pipelines. Built a scalable RAG-based assistant orchestrated as microservices on Kubernetes with Kafka async messaging, ONNX/quantization optimizations, and monitoring (Prometheus/Grafana), citing a ~35% hallucination reduction; has also experimented with ROS Noetic/Gazebo to understand ROS concepts.”

A/B TestingAgileAmazon RedshiftApache AirflowApache KafkaCross-functional Collaboration+117
View profile
BB

BHARATH BHOOTHPUR

Screened

Mid-level Data Analyst specializing in healthcare and finance analytics

New Jersey, USA5y exp
Omada HealthRowan University

“Built an end-to-end Alexa smart-home IoT application controlling a Wi-Fi bulb, including ESP32 firmware (MQTT) and an AWS serverless backend (IoT Core/Device Shadow, Lambda, DynamoDB) with a REST API. Demonstrates strong real-time scalability patterns (streaming ingestion, stateless processing, partition-key design) and full-stack delivery with Spring Boot + React (JWT auth, CORS, data-heavy dashboards).”

PythonSQLRNumPyPandasMatplotlib+113
View profile
1...121314...39

Related

Machine Learning EngineersData ScientistsSoftware EngineersData AnalystsData EngineersResearch AssistantsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search