Vetted Data Cleaning Professionals

Pre-screened and vetted.

CL

Senior Machine Learning Engineer specializing in recommender systems, search, and NLP/GenAI

Jersey City, NJ | 10y exp
Instagram | Stanford University
SS

Principal Data Scientist & AI/ML Engineer specializing in LLMs, recommender systems, and MLOps

San Francisco, CA | 11y exp
Slack | Stanford University
ZM

Executive AI Founder & CEO specializing in LLM agents, data engineering, and regulated healthcare markets

New York, NY | 12y exp
Pure Global | Carnegie Mellon University
RW

Senior AI Engineer specializing in LLMs, RAG, and production ML systems

San Francisco, CA | 11y exp
OpenAI | University of North Texas
BS

Senior Research Scientist specializing in physics, machine learning, and scientific computing

Berkeley, CA | 8y exp
University of California, Berkeley | Harvard University

Research-oriented ML engineer/scientist with deep experience applying generative models, adaptive optimization, and HPC infrastructure to complex physics analyses. Built reusable Python-based tools that replaced expensive Monte Carlo workflows, integrated them across HTCondor/SLURM environments, and cut analysis timelines in half while supporting broader team adoption and training.

WL

Mid-Level Software Engineer specializing in backend systems, payments tokenization, and ML

New York, NY | 6y exp
EasyRent Realty | University of Michigan
Denis Siminiuc

Junior Research Assistant specializing in LLMs, NLP, and data systems

Cambridge, MA | 3y exp
MIT CSAIL | MIT

Software-focused candidate who built a data monitoring pipeline during a hedge fund internship, integrating real databases and an email API to notify teams when data was ready. Comfortable working through legacy/scrappy code and uses LLMs to accelerate comprehension and delivery, with an emphasis on thorough testing and clear communication with stakeholders/customers.

Jian Guo

Screened

Executive Data & AI Leader specializing in enterprise data platforms and analytics

NYC, NY | 25y exp
Bank of China | NYU

Early-stage founder building a service business for small clinics, with one client so far. Identified the opportunity by helping a family member, then validated needs through direct client conversations; uses AI (including AI agents) for content generation and plans deeper workflow automation to scale cost-effectively.

Vishanth Iyer

Screened

Senior AI/ML Engineer specializing in LLMs, multimodal AI, and scalable MLOps

San Jose, CA | 10y exp
NVIDIA | Santa Clara University

ML/NLP engineer with experience at NVIDIA and Cruise building production-grade AI systems across genomics/biomedical research and autonomous vehicle data. Has delivered multimodal LLM pipelines, large-scale entity resolution, and hybrid semantic search (BERT embeddings + FAISS + Elasticsearch), with measurable impact (≈40% accuracy/retrieval gains; ≈30% data consistency improvement) and strong MLOps practices (Kubernetes, CI/CD, MLflow, Prometheus/Grafana).

NR

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

Dallas, TX | 6y exp
OpenAI | University of Texas at Dallas

Noelle Keto

Screened

Intern/Student Software Engineer specializing in full-stack development, AI/ML, and quantitative finance

Cambridge, MA | 0y exp
Barclays | Harvard University

Software engineering intern who built an internal AI-agent automation using the Gemini API to reduce manual CRM data entry, iterating prompts closely with analysts to address precision concerns. Also worked on a medical image-diagnostics LLM project involving fine-tuning and benchmarking multiple model approaches, and has quant/sales-trading experience building automated pricers for complex options and persuading sales teams to adopt them with ROI-focused metrics.

CY

Staff Software Engineer specializing in distributed systems and platform architecture

Aldie, VA | 15y exp
Provi | University of Maryland, College Park

Built a production LLM-powered data ingestion workflow at Provi, an online alcohol marketplace, to clean and match millions of distributor inventory items against a product catalog. Their experience is strongest in applying LLMs to real-world, large-scale data operations with AWS Glue, S3, batching, API integration, human review, and drift detection.

CL

Staff Data Analytics Lead / Data Scientist specializing in manufacturing process control

Bellefonte, PA | 24y exp
Intel | Penn State University

Intel veteran who applied multiple linear regression and time-series drift analysis to semiconductor lithography overlay/metrology data, feeding model outputs into automated process control. Comfortable working across Python, VBA, and JMP/JSL, with a pragmatic approach to validation (RMSE + trend visualization) and data quality via close coordination with measurement/metrology teams.

Ronald Nap

Screened

Intern Machine Learning & AI Engineer specializing in computer vision and ML systems

San Jose, CA | 2y exp
AMD | UC Berkeley

Robotics/ML engineer with internship experience at Valeo building a deep-learning prototype to replace parts of a legacy SLAM backend for autonomous parking, focused on making models run reliably in real time on embedded hardware (quantization/distillation + TensorRT). Also brings strong MLOps/deployment experience (Docker, Kubernetes on AWS EKS, CI via GitHub Actions) and has supported patent filing by explaining the technical approach to legal stakeholders.

Lequan Wang

Screened

Intern Applied Scientist / ML Engineer specializing in NLP and conversational AI

Seattle, WA | 0y exp
Amazon | UC Irvine

LLM/Conversational AI engineer who built a production multi-turn dialogue system using LoRA fine-tuning on LLaMA, cutting training compute/memory by 90%+ while maintaining low-latency inference via quantization and streaming generation. Experienced in orchestrating end-to-end ML workflows with Prefect/Airflow/Kubeflow (including hyperparameter sweeps and W&B tracking) and improving agent reliability through benchmark-driven testing, shadow-mode rollouts, and stakeholder-informed guardrails.

Pratham Thukral

Mid-level Software Engineer specializing in distributed systems on AWS

Seattle, WA | 3y exp
Amazon | University of Waterloo

Data/infra engineer with AWS DynamoDB experience who has shipped reliability-critical systems (Global Tables replica repair protocol) and customer-facing service rollouts using canary/percentage-based deployments, strong observability, and rollback strategies. Also built end-to-end Airflow pipelines producing weekly automated reports over ~10TB of advertising segment data, with rigorous week-over-week data quality validation.

Yuhan Gao

Intern Software Engineer specializing in AI agents, RAG, and full-stack web development

Pittsburgh, PA | 1y exp
Amazon | Carnegie Mellon University
JM

Senior Software Engineer specializing in AI/ML tooling and data platforms

Old Greenwich, CT | 13y exp
Scale AI | Bryant University
AP

Mid-level AI/ML Engineer specializing in NLP/LLMs and production ML systems

4y exp
Anthropic | George Mason University
DP

Mid-Level Software Engineer specializing in backend, cloud, and CI/CD automation

IL, USA | 3y exp
Google | DePaul University
LC

Senior AI/ML Engineer & Data Scientist specializing in NLP, entity resolution, and knowledge graphs

Remote | 8y exp
PlayStation | University of Virginia
JT

Junior Software Engineer specializing in full-stack web and data platforms

Claremont, CA | 2y exp
Microsoft | Harvey Mudd College
AL

Junior Financial Engineering Analyst specializing in portfolio analytics and data science

San Francisco, CA | 2y exp
BlackRock | Princeton University
