Reval Logo
Home Browse Talent Skilled in Data Cleaning

Vetted Data Cleaning Professionals

Pre-screened and vetted.

Data CleaningPythonSQLDockerAWSpandas
SV

Siva Vakati

Mid-level Marketing Analyst specializing in digital marketing analytics and omnichannel attribution

USA4y exp
NetflixKent State University
A/B TestingDashboardingData CleaningData ModelingData ValidationData Visualization+194
View profile
AN

Anuj Naik

Mid-level AI/ML Engineer specializing in production ML, LLMs, and MLOps

Remote, USA4y exp
StripeCalifornia State University
Amazon EC2Amazon S3AWSAnomaly DetectionAPI DevelopmentCI/CD+67
View profile
HL

Haochen LI

Intern Machine Learning Engineer specializing in Generative AI and LLM systems

Hong Kong, China1y exp
Blue InsuranceDuke University
Amazon SageMakerApache HadoopApache SparkAPI IntegrationArtificial IntelligenceAWS+61
View profile
SK

Sudhher Kumar

Mid-level AI & Machine Learning Engineer specializing in computer vision and MLOps

United States6y exp
NVIDIAUniversity of Massachusetts Lowell
PythonNumPyPandasScikit-learnPyTorchTensorFlow+106
View profile
VS

Varun Singh

Senior Digital Analyst specializing in marketing analytics, personalization, and MarTech

Dallas, TX7y exp
AT&TSouthern Methodist University
A/B TestingAgileAnomaly DetectionBigQueryCampaign ManagementChatGPT+129
View profile
VR

Venkat Ram

Staff Machine Learning Engineer specializing in Generative AI, MLOps, and Computer Vision

18y exp
SyndioUniversity of Nevada, Las Vegas
Apache HadoopApache HiveApache SparkAutomationAWSBigQuery+106
View profile
WD

William Donaldson

Senior Data Scientist/Statistician specializing in SAS automation and data quality

Cambria, WI14y exp
Wells FargoUniversity of Wisconsin–Madison
Database DesignData AnalyticsData QualityData CleaningETLAutomation+27
View profile
SS

Sahithi S

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and Generative AI

Texas, USA6y exp
NVIDIAKennesaw State University

“Built and deployed a production generative AI chatbot at NVIDIA using LangChain + GPT-3 integrated with internal data sources, cutting response time nearly in half and improving CSAT by ~12 points. Also delivered LLM-driven QA tools by fine-tuning Hugging Face transformer models and deploying via an AWS-based pipeline (Lambda/Glue/S3) with orchestration (Airflow/Step Functions), CI/CD, Kubernetes, and monitoring (MLflow/Splunk/Power BI).”

PythonSQLJavaSpring BootFastAPIFlask+108
View profile
EM

Eliza Miller

Screened

Senior Global Trade Consultant specializing in customs compliance and duty mitigation

New York, NY4y exp
EYIndiana University Kelley School of Business

“Consulting project manager in global trade/compliance and regulatory reporting, leading cross-functional initiatives for global clients (tax, customs, supply chain, IT, finance) including US tariff exposure mitigation and EU carbon reporting. Built repeatable automations in Alteryx for US Customs Reconciliation filings (replacing large Excel models) and drives executive alignment through concise, decision-ready briefs and strong governance.”

Adobe IllustratorAdobe PhotoshopProject managementRegulatory complianceTeam managementWorkflow automation+56
View profile
RR

Rushi Reddy Lambu

Screened

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

Remote, USA5y exp
McKinsey & CompanyUniversity of North Texas

“GenAI/LLM engineer and architect who built and deployed a production generative AI financial forecasting and scenario analysis platform at McKinsey, leveraging Claude (Anthropic), LangChain, Airflow, MLflow, and AWS SageMaker. Demonstrates strong LLMOps/MLOps rigor (monitoring, drift detection, automated retraining) and deep experience implementing global privacy controls (GDPR, differential privacy, audit trails) while partnering closely with finance executives and legal/IT stakeholders.”

PythonSQLRJavaC++Bash+192
View profile
AS

Abhinav Sharma

Screened

Mid-level Technical Consultant specializing in Appian delivery and data/AI workflow automation

Mclean, VA5y exp
AppianUniversity of Illinois Urbana-Champaign

“Appian consultant/engineer focused on insurance and financial services modernization and AI-enabled workflows. Built and productionized an AI-driven insurance submission intake system (email ingestion, classification/extraction, HITL review) cutting processing time from 2+ hours to under 10 minutes, and delivered semantic smart search with guardrails and UAT-driven ranking improvements. Also partnered with a global bank CTO org, running sessions with 200+ senior leaders to automate regulatory/board metric reporting via platform integrations and attestation.”

AgileAutomationDashboardingData analysisData cleaningData governance+53
View profile
SD

Shreyas Darade

Screened

Mid-level Data Scientist specializing in business intelligence and machine learning

Pittsburgh, PA2y exp
Armada PartnersCarnegie Mellon University

“Internship experience building a production LLM-powered podcast operations agent that automated lead intake (HubSpot), guest research, scheduling (Calendly), meeting-summary evaluation (Gemini), and human approval via Slack bot—while retaining rejected candidates for future outreach. Also contributed to ideation of a multi-agent orchestration framework with parsing and task routing, and emphasized reliability via structured prompts, HITL feedback, and prompt-based test sets.”

A/B TestingAnalyticsBusiness IntelligenceClassificationClusteringData Analytics+84
View profile
ML

Mengyu Liu

Screened

Senior Data Scientist specializing in GenAI agents and causal inference

Remote, USA10y exp
HumanaUniversity of Miami

“Built and deployed a production healthcare medical review agent that automates call-transcript summarization and medication reconciliation using a hybrid deterministic + LangGraph-orchestrated LLM workflow. Demonstrates strong reliability engineering (guardrails, schema validation, confidence thresholds, golden/adversarial eval, Langfuse monitoring) in a regulated environment, delivering 60% lower latency and 70%+ efficiency gains while partnering closely with care managers and operations.”

PythonRSQLNumPyPandasMatplotlib+129
View profile
SS

Sumanth Salluri

Screened

Mid-level Business Data Analyst specializing in Financial Services and Healthcare analytics

USA4y exp
VisaGeorge Mason University

“Full-stack engineer (~4 years) who has owned and shipped customer-facing SaaS onboarding and a role-based real-time analytics dashboard using TypeScript/React with a modular backend. Experienced in microservices with RabbitMQ and strong observability practices (correlation IDs, structured logging, queue metrics), and built an internal deployment tracker integrated with CI/CD that replaced manual spreadsheet/Slack processes.”

PythonSQLRHTMLCSSJavaScript+118
View profile
CD

CHAKRESH DODDI

Screened

Mid-Level Software Developer specializing in Java microservices and cloud-native systems

St. Louis, MO5y exp
EpsilonSaint Louis University

“Backend engineer focused on cloud/distributed systems, deploying Java 17/Spring Boot microservices on AWS EKS with RDS and Kafka. Demonstrated strong production readiness work (DB lock mitigation, Kafka idempotency, gradual rollouts) and delivered a major latency improvement (~400ms to ~100ms). Also has proven cross-layer troubleshooting skills, isolating intermittent API timeouts to a specific Kubernetes node’s network interface issue, and partners closely with ops teams to build dashboards and workflow automation (including Python scripts).”

JavaSQLJavaScriptTypeScriptSpring BootSpring Cloud+82
View profile
ZI

Zufeshan Imran

Screened

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA10y exp
SOTER AIUC San Diego

“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”

Machine LearningDeep LearningGenerative AITransformersLarge Language Models (LLMs)LLM fine-tuning+120
View profile
SD

Sai Dinesh Pusapati

Screened

Senior AI/ML Engineer specializing in GenAI agents and LLM workflows

San Francisco, CA6y exp
Scale AIBelhaven University

“LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.”

PythonTypeScriptJavaC#JavaScriptR+103
View profile
VK

Vedant Kharwal

Screened

Intern AI/ML Engineer specializing in Generative AI and applied machine learning

Mumbai, India1y exp
LTIMindtreeBoston University

“New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.”

AlgorithmsAngularAPI DevelopmentArtificial IntelligenceAuthenticationBlender+93
View profile
AN

Abhay Naik

Screened

Mid-level Data Engineer specializing in cloud-native analytics and enterprise integrations

Remote3y exp
The GrooveUC Berkeley

“Built and productionized an LLM-powered clinical assistant at a healthcare startup, re-architecting a prototype into a robust RAG system on AWS with guardrails, citations, monitoring, and automated tests for clinical reliability. Works closely with clinicians to convert workflow feedback into evaluation criteria and iterative system improvements, and has hands-on experience debugging agentic systems in real time (including during live client demos).”

AWSAmazon S3Amazon EKSAmazon EC2Amazon ECSAWS IAM+91
View profile
WM

Will McEntee

Screened

Mid-level Operations & Analytics Professional specializing in logistics and sports data

Anaheim, CA4y exp
AmazonGeorgetown University

“Lifelong basketball player with extensive exposure to elite Southern California high school basketball (Servite/Trinity League) and familiarity with college recruiting through close connections, who applies a structured PFF-style evaluation lens to scouting. Comfortable identifying talent via film and in-person viewing and proactively engaging prospects through social media outreach; also brings experience working demanding overnight/on-call schedules from Amazon last-mile logistics.”

Microsoft ExcelPower BISalesforceSQLPythonR+52
View profile
SR

Sandeep Reddy Karumudi

Screened

Mid-level Data & Business Analyst specializing in analytics engineering and BI

6y exp
AdobeUniversity of Wisconsin–Madison

“Data/analytics professional with experience across manufacturing and enterprise environments (Wisconsin School of Business project with CNH Industrial; roles/projects at Ascensia Technologies, S&C, and Adobe). Has hands-on work combining warranty/lifecycle tables with technician free-text notes using TF-IDF + tree models (XGBoost/Random Forest), and deep experience in entity resolution/reconciliation across mismatched financial systems using Python/SQL and fuzzy matching, with production-grade pipeline practices in Azure Data Factory/Databricks.”

PythonPandasNumPyscikit-learnRSQL+119
View profile
CS

Cassandra Sullivan

Screened

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA5y exp
Aurora AIUniversity of Chicago

“ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.”

A/B TestingAutomationClassificationDashboardingData CleaningData Visualization+109
View profile
1...345...39

Related

Machine Learning EngineersData ScientistsSoftware EngineersData AnalystsResearch AssistantsData EngineersAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search