Reval Logo
Home Browse Talent Skilled in Data Cleaning

Vetted Data Cleaning Professionals

Pre-screened and vetted.

Data CleaningPythonSQLDockerAWSpandas
YP

Yash Pise

Screened

Mid-level Data Scientist specializing in Generative AI, LLMOps, and clinical data pipelines

5y exp
NovartisStevens Institute of Technology

“LLM/RAG engineer who has built and deployed corporate-scale systems at Novartis and Johnson & Johnson, including a healthcare AI agent that generates day-to-day treatment schedules. Recently handled a high-stakes safety incident (LLM suggesting overdose) by tightening model instructions and validating with ~200 test prompts, and has strong end-to-end data/embedding/vector DB pipeline experience (PySpark, FAISS, Pinecone) plus SME-in-the-loop evaluation (RLHF).”

PythonRJavaScriptMySQLPostgreSQLNumPy+88
View profile
PN

Priyanka Nelluri

Screened

Mid-level SAP MM/SD Functional Consultant specializing in P2P, O2C, and inventory management

Atlanta, GA5y exp
Community Dreams FoundationNortheastern University

“Procurement/sourcing professional with deep end-to-end ownership of recurring program supply and services sourcing (materials plus logistics/fulfillment), focused on reducing unit and landed costs while improving OTIF and tightening SAP purchasing controls. Demonstrates structured supplier vetting (capability, capacity, execution reliability), strong negotiation on commercial terms and SLAs, and proactive mitigation of international trade/freight/duty volatility through risk-adjusted sourcing strategies.”

Inventory ManagementVendor ManagementContract NegotiationIntegration TestingUnit TestingUser Acceptance Testing (UAT)+108
View profile
MS

Mahan Santosh Satya Sai Ashish Bandaru

Screened

Mid-level Software Engineer specializing in FinTech full-stack and AI applications

Remote, USA3y exp
JPMorgan ChaseArizona State University

“Built and productionized an NLP-powered customer support assistant at JPMorgan Chase for digital banking, focused on reducing response time for repetitive client queries. Strong in real-world AI deployment challenges—sensitive data handling, low-latency FastAPI services, and AWS/Kubernetes operations with CI/CD—plus a metrics- and guardrails-driven approach to reliable AI workflows.”

ReactReduxNext.jsTailwind CSSBootstrapMaterial UI+117
View profile
SS

Siva Sai Kumar Mogalluru

Screened

Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare

Remote, USA4y exp
EYUniversity of South Florida

“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”

A/B TestingAgileAnomaly DetectionApache AirflowApache SparkAzure DevOps+138
View profile
SK

Siddhardha Kanamatha

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA4y exp
ServiceNowValparaiso University

“ServiceNow engineer who built and launched a production LLM-powered ticket resolution/knowledge assistant using RAG (LangChain + Hugging Face embeddings + vector search) integrated into internal support dashboards via REST APIs. Optimized the system from ~6–8s to ~2–3s latency while improving usability with concise, cited answers and guardrails (grounding + similarity thresholds), delivering ~30–35% reduction in manual ticket investigation effort.”

PythonSQLRJavaMachine LearningDeep Learning+93
View profile
RT

Ramya Thottempudi

Screened

Mid-level Full-Stack Software Engineer specializing in Java/Spring microservices and React

Mobile, AL4y exp
UberLindsey Wilson College

“Uber engineer who has owned internal products end-to-end across backend (Spring Boot microservices, MySQL) and frontend (React), including performance optimization and secure JWT-based auth. Also shipped a production internal RAG/embeddings LLM support assistant over policy docs and support tickets, with guardrails (confidence thresholds, human review) and an evaluation loop that directly reduced hallucinations.”

Amazon CloudWatchApache KafkaApache TomcatAPI GatewayAutomated TestingAWS+128
View profile
HZ

Haoyue Zhang

Screened

Intern Digital Marketer specializing in SEO/SEM and marketing analytics

Washington, DC1y exp
EarthNutriJohns Hopkins University

“Growth marketer from earthnutre who led a zero-to-one conversion and traffic-quality improvement project combining SEO, landing page/CRO, and paid-media measurement. Used SEMrush-driven high-intent keyword restructuring plus phased page optimizations with control comparisons, then tied results to paid performance—reporting ~15% lower CPA and ~30% MoM ROAS lift. Also runs a modular, data-driven creative iteration process across Meta/TikTok/YouTube and provides performance-based guidance to creators/editors.”

A/B TestingCampaign ManagementCRMData AnalyticsData CleaningData Visualization+67
View profile
AB

Angelo Bianchi

Screened

Senior HR Business Partner specializing in organizational design, labor relations, and people analytics

New York, NY16y exp
Liberty Coca-Cola BeveragesUniversity of Miami

“HR/people analytics and transformation leader with experience at Royal Caribbean and Baptist Hospital, combining a graduate degree in business analytics with hands-on org design and change execution. Built HR analytics capabilities (predictive/causal approaches, data quality, storytelling) and partnered directly with a CFO to run a 2+ year finance reorg across a 26,000-employee, 14-hospital system, including writing 124 job descriptions and implementing a CoE/business partnering model.”

Change ManagementCoachingWorkforce PlanningProcess ImprovementStakeholder ManagementMicrosoft Office+84
View profile
KL

Kangjie Lu

Screened

Intern Full-Stack Software Engineer specializing in data pipelines and AI/ML systems

Beijing, China1y exp
Shanghai Wanwu Zhiyun Industrial Technology Co., Ltd.Carnegie Mellon University

“Software engineer with experience building a Vue.js/TypeScript internal component library (with Jest testing standards) and improving JS runtime performance via profiling, code splitting, and lazy loading. Also led documentation and community support for a Python ML utility library, diagnosing metric-calculation bugs for imbalanced datasets and driving large reductions in support inquiries through targeted docs, tests, and rapid hotfixes in a startup environment.”

AgileAPI TestingAWSBackend DevelopmentCC+++111
View profile
UK

Uday Kumar gattu

Screened

Mid-level Generative AI Engineer specializing in LLM agents and RAG systems

4y exp
Capital OneLindsey Wilson College

“Built and deployed a production LLM/RAG knowledge assistant integrating internal docs, wikis, and ticket histories to reduce tribal-knowledge dependency and repetitive questions. Emphasizes reliability via grounding + a validation layer, and achieved major latency gains (>50%) through vector index optimization, caching, quantization, and selective re-validation. Comfortable orchestrating end-to-end LLM/data workflows with Airflow, Prefect, and Dagster, including monitoring and alerting.”

A/B TestingAmazon CloudWatchAmazon DynamoDBAmazon EKSAmazon RedshiftAmazon S3+129
View profile
BG

Bernard Griffin

Screened

Senior Data Scientist / ML Engineer specializing in cloud ML pipelines and GenAI

Baltimore, MD17y exp
IntelIllinois Institute of Technology

“ML/NLP practitioner with experience building a transformer-failure prediction system that combines sensor signals with unstructured maintenance comments using LLM-based extraction and similarity validation. Strong emphasis on production readiness—data leakage controls, SQL-driven data quality tiers, and rigorous bias/fairness validation (including contract/spec evaluation across diverse company profiles).”

A/B TestingAmazon BedrockAmazon EC2Amazon EMRAmazon KinesisAmazon Redshift+130
View profile
SM

Subhasmita Maharana

Screened

Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps

New York, NY6y exp
CitigroupKent State University

“Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).”

A/B TestingApache AirflowAzure Machine LearningBERTCI/CDClustering+170
View profile
SB

Sathyavarthan Balachandar

Screened

Mid-level Data Engineer specializing in scalable pipelines, Spark, and cloud data warehousing

Boston, USA3y exp
Fidelity InvestmentsNortheastern University

“Backend/data platform engineer who recently owned an end-to-end large-scale financial data platform delivering real-time decision support for finance and operations. Has hands-on experience modernizing legacy batch pipelines into AWS cloud-native ELT with parallel-run cutovers, strong data quality controls (dbt-style tests, reconciliation), and measurable improvements in runtime, cost, and SLA compliance. Also builds scalable, secure FastAPI microservices using Docker, ALB-based horizontal scaling, Redis caching, and managed auth with Cognito/Supabase plus Postgres RLS.”

PythonSQLGoApache SparkPySparkDatabricks+125
View profile
SN

Siddharth Nandakumar

Screened

Intern Full-Stack Software Engineer specializing in AI/ML and AWS cloud platforms

Birmingham, AL1y exp
Yuva BiosciencesTufts University

“Full-stack engineer who built an LLM-powered productivity web app (LifeOS) end-to-end with TypeScript/Next.js, Prisma, and Postgres, emphasizing fast iteration with stable API contracts and an isolated AI service boundary. Also built a security/compliance login-verification workflow at Medpace used within an internal admin portal for thousands of employees, and has AWS experience orchestrating batch GPU workloads with robust retry/idempotency patterns.”

AgileAlgorithmsAmazon BedrockAmazon EC2Amazon SageMakerAngular+89
View profile
SM

SUSENDRANATH MUSANI

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Connecticut, USA5y exp
PfizerUniversity of New Haven

“Built and deployed an enterprise GenAI knowledge assistant over thousands of internal PDFs/reports using a RAG stack (GPT-4 + Hugging Face embeddings + vector DB) to reduce manual search and SME escalations. Uses LangGraph/LangChain to orchestrate modular agent workflows with relevance filtering and fallback handling, and applies rigorous evaluation (golden datasets, edge cases, A/B tests) with production monitoring metrics.”

A/B TestingAgileApache KafkaApache SparkAWS LambdaBERT+103
View profile
SV

Sathwik Varikoti

Screened

Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI

Remote5y exp
InfosysUniversity at Buffalo

“GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.”

A/B TestingAmazon BedrockAmazon EC2Amazon S3AWS GlueAWS IAM+99
View profile
MT

Mihir Trivedi

Screened

Junior Machine Learning & Quant Research Engineer specializing in low-latency data and trading systems

New York, NY3y exp
Astera HoldingsColumbia University

“Applied ML to physical EV fleet systems at ST Labs, building a real-time CNN-LSTM fault prediction pipeline from streaming vehicle telemetry and addressing live data alignment issues via resampling/interpolation and buffered inference. Also developed a V2G/G2V energy transfer algorithm to automate charging/discharging for profit optimization, and made high-impact low-latency pipeline decisions at Astera Holdings using profiling, replay testing, and live A/B validation.”

AWS GlueBigQueryC++CUDAData CleaningData Engineering+109
View profile
RK

Ramtin Khorrami

Screened

Principal Software Engineer specializing in AI/ML and cloud-native backend systems

New York, NY16y exp
McKinsey & CompanyNJIT

“McKinsey data/ML practitioner who led production deployment of an entity resolution + semantic search platform for unstructured finance and healthcare data, integrating with legacy systems under HIPAA constraints. Deep hands-on stack across transformers (spaCy/HF BERT), embeddings + FAISS, and production MLOps/workflow tooling (Airflow, Docker, CI/CD, Prometheus/Grafana), with reported gains of +30% decision speed and +25% search relevance.”

PythonSQLRRubyJavaJavaScript+124
View profile
BB

Belal Beydoun

Screened

Intern Full-Stack Software Engineer specializing in AI and data analytics

Detroit, MI2y exp
DTE EnergyUniversity of Michigan

“Software engineer focused on real-time, low-latency AI pipelines: built an end-to-end mobile-to-backend image classification system using React Native/Expo, Node.js, gRPC, MySQL, and Google Vision AI, optimizing throughput and latency. Also integrated an AI model into a real-time field workflow at DTE via Node.js + Azure Databricks, adding data cleaning/validation and safe fallback logic for reliability in operations.”

PythonCC++SQLJavaJavaScript+57
View profile
SR

Sai Raja Ramya Bhavana Thota

Screened

Senior Data Scientist specializing in machine learning and customer analytics

Illinois, USA7y exp
Northern TrustBradley University

“Data/ML practitioner with experience applying NLP and classical ML to large-scale customer data (2B+ records) for segmentation, prediction, and survey-text classification, delivering measurable business impact (~18% engagement efficiency). Has hands-on entity resolution across multi-source datasets and has built embedding-based semantic search using SentenceBERT + a vector database with domain fine-tuning (~20% relevance improvement), plus production workflow experience with Spark/Airflow and cloud tooling (AWS/Azure).”

A/B TestingAnalyticsAzure Machine LearningBashBigQueryC+195
View profile
IS

Irfan Shaik

Screened

Mid-level AI Software Engineer specializing in risk and fraud detection

Los Angeles, California4y exp
VisaGeorge Mason University

“AI/software engineer with experience at Visa building a real-time transaction fraud/risk scoring microservice in the card authorization path (Python, Kafka, Kubernetes on AWS) with strict 120–150ms latency constraints and reason-code outputs for downstream decisioning. Owns ML backend end-to-end (data/feature engineering, model training, deployment) and has demonstrated production reliability work including latency spike mitigation, SLO-based observability, drift monitoring, and safe fallbacks to rule-based decisions.”

PythonPandasNumPyScikit-learnTensorFlowKeras+109
View profile
DM

Deepthi Mundarinti

Screened

Mid-level Generative AI Engineer specializing in decision intelligence and RAG for regulated enterprises

5y exp
JPMorgan ChaseSaint Louis University

“Healthcare GenAI engineer who built a HIPAA-compliant, auditable RAG-based claims decision support system at Molina Healthcare, processing 3M claims and delivering major impact (48% faster manual reviews, 43% higher decision accuracy). Deep hands-on experience with LangChain orchestration, vector search (ChromaDB/FAISS), embedding fine-tuning, and safety controls (confidence scoring, rule validation, human-in-the-loop escalation) for clinical workflows.”

Generative AIGPT-4OpenAI APIPrompt EngineeringRetrieval-Augmented Generation (RAG)Machine Learning+96
View profile
RA

Ravali Aleti

Screened

Senior Python Developer specializing in AWS backend APIs and enterprise authentication

Philadelphia, US7y exp
ComcastUniversity of Bridgeport

“Backend/data engineer focused on AWS-based Python services and data pipelines: built a Django/DRF user management/auth platform deployed with serverless AWS (Lambda/API Gateway) and event-driven workflows (Step Functions/EventBridge), with CloudFormation + Jenkins for automated delivery and Secrets Manager/Parameter Store for secure config. Also delivered AWS Glue ETL from S3 to RDS with schema evolution controls and incident-driven improvements, and has demonstrated measurable SQL tuning impact (minutes-to-seconds).”

PythonJavaScriptSQLDjangoFlaskPandas+93
View profile
1...789...39

Related

Machine Learning EngineersData ScientistsSoftware EngineersData AnalystsData EngineersResearch AssistantsAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search