Vetted Data Cleaning Professionals

Pre-screened and vetted.

NN

Mid-level Data Engineer specializing in real-time streaming and cloud data platforms

Green Bay, WI · 5y exp
Stripe · New England College

SD

Mid-level Analytics Engineer specializing in dbt, SQL transformation, and Snowflake

USA · 5y exp
Salesforce · Bowling Green State University

SK

Mid-Level Software Engineer specializing in backend microservices and cloud automation

New York, NY · 4y exp
Uber · Harrisburg University of Science and Technology

SP

Mid-level Business Analyst specializing in finance, data analytics, and AI infrastructure

Chicago, IL · 5y exp
Google · Arizona State University

SP

Entry-level Software Engineer specializing in data pipelines and applied AI

Santa Cruz, CA · 2y exp
Amazon · UC Santa Cruz

TM

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Austin, TX · 11y exp
Accenture

Sahithi S

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and Generative AI

Texas, USA · 6y exp
NVIDIA · Kennesaw State University

Built and deployed a production generative AI chatbot at NVIDIA using LangChain + GPT-3 integrated with internal data sources, cutting response time nearly in half and improving CSAT by ~12 points. Also delivered LLM-driven QA tools by fine-tuning Hugging Face transformer models and deploying via an AWS-based pipeline (Lambda/Glue/S3) with orchestration (Airflow/Step Functions), CI/CD, Kubernetes, and monitoring (MLflow/Splunk/Power BI).


Eliza Miller

Screened

Senior Global Trade Consultant specializing in customs compliance and duty mitigation

New York, NY · 4y exp
EY · Indiana University Kelley School of Business

Consulting project manager in global trade/compliance and regulatory reporting, leading cross-functional initiatives for global clients (tax, customs, supply chain, IT, finance), including US tariff exposure mitigation and EU carbon reporting. Built repeatable automations in Alteryx for US Customs Reconciliation filings, replacing large Excel models. Drives executive alignment through concise, decision-ready briefs and strong governance.

AS

Mid-level Technical Consultant specializing in Appian delivery and data/AI workflow automation

McLean, VA · 5y exp
Appian · University of Illinois Urbana-Champaign

Appian consultant/engineer focused on insurance and financial services modernization and AI-enabled workflows. Built and productionized an AI-driven insurance submission intake system (email ingestion, classification/extraction, HITL review) cutting processing time from 2+ hours to under 10 minutes, and delivered semantic smart search with guardrails and UAT-driven ranking improvements. Also partnered with a global bank CTO org, running sessions with 200+ senior leaders to automate regulatory/board metric reporting via platform integrations and attestation.


Surya Vardhan

Screened

Junior Operations Data Analyst specializing in KPI dashboards and SLA reporting

Seattle, WA · 1y exp
Amazon · Central Michigan University

Manufacturing/quality-focused professional with experience at Sikorsky Aerospace supporting aircraft parts production for clients such as Boeing and Cobham. Drove data-driven process improvements (cleaned/visualized production data) and redesigned material usage to cut delays by ~20% and reduce waste, while coordinating across production, inspection, QC, and delivery readiness.

HK

Mid-level Full-Stack Software Engineer specializing in cloud and data platforms

Boston, MA · 5y exp
Northeastern University · Penn State University

Full-stack engineer with experience spanning Amazon IMDb and Northeastern’s NeuroJSON portal, combining consumer product work with complex scientific data applications. Built IMDb’s streaming providers feature (described as the company’s most impactful feature of 2023) and has hands-on experience with React/Angular, GraphQL, AWS, Python services, and production monitoring.

PP

Senior Backend Software Engineer specializing in cloud, microservices, and AI systems

Richardson, TX · 8y exp
The University of Texas at Dallas · University of Texas at Dallas

Built an AI-powered job outreach application for his own job search and took it from idea to production use, owning architecture, FastAPI backend, retrieval/generation pipeline, frontend workflow, deployment, and iteration. Especially compelling for teams needing a pragmatic full-stack engineer who can turn LLM-based product ideas into usable, maintainable tools with measurable workflow impact.

SD

Mid-level Data Scientist specializing in business intelligence and machine learning

Pittsburgh, PA · 2y exp
Armada Partners · Carnegie Mellon University

Internship experience building a production LLM-powered podcast operations agent that automated lead intake (HubSpot), guest research, scheduling (Calendly), meeting-summary evaluation (Gemini), and human approval via a Slack bot, while retaining rejected candidates for future outreach. Also contributed to the ideation of a multi-agent orchestration framework with parsing and task routing, and emphasized reliability through structured prompts, HITL feedback, and prompt-based test sets.


Mengyu Liu

Screened

Senior Data Scientist specializing in GenAI agents and causal inference

Remote, USA · 10y exp
Humana · University of Miami

Built and deployed a production healthcare medical review agent that automates call-transcript summarization and medication reconciliation using a hybrid deterministic + LangGraph-orchestrated LLM workflow. Demonstrates strong reliability engineering (guardrails, schema validation, confidence thresholds, golden/adversarial eval, Langfuse monitoring) in a regulated environment, delivering 60% lower latency and 70%+ efficiency gains while partnering closely with care managers and operations.

SS

Mid-level Business Data Analyst specializing in Financial Services and Healthcare analytics

USA · 4y exp
Visa · George Mason University

Full-stack engineer (~4 years) who has owned and shipped customer-facing SaaS onboarding and a role-based real-time analytics dashboard using TypeScript/React with a modular backend. Experienced in microservices with RabbitMQ and strong observability practices (correlation IDs, structured logging, queue metrics), and built an internal deployment tracker integrated with CI/CD that replaced manual spreadsheet/Slack processes.

CD

Mid-Level Software Developer specializing in Java microservices and cloud-native systems

St. Louis, MO · 5y exp
Epsilon · Saint Louis University

Backend engineer focused on cloud/distributed systems, deploying Java 17/Spring Boot microservices on AWS EKS with RDS and Kafka. Demonstrated strong production readiness work (DB lock mitigation, Kafka idempotency, gradual rollouts) and delivered a major latency improvement (~400ms to ~100ms). Also has proven cross-layer troubleshooting skills, isolating intermittent API timeouts to a specific Kubernetes node’s network interface issue, and partners closely with ops teams to build dashboards and workflow automation (including Python scripts).

ZI

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA · 10y exp
SOTER AI · UC San Diego

Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.

Sai Dinesh Pusapati

Senior AI/ML Engineer specializing in GenAI agents and LLM workflows

San Francisco, CA · 6y exp
Scale AI · Belhaven University

LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.


Abhay Naik

Screened

Mid-level Data Engineer specializing in cloud-native analytics and enterprise integrations

Remote · 3y exp
The Groove · UC Berkeley

Built and productionized an LLM-powered clinical assistant at a healthcare startup, re-architecting a prototype into a robust RAG system on AWS with guardrails, citations, monitoring, and automated tests for clinical reliability. Works closely with clinicians to convert workflow feedback into evaluation criteria and iterative system improvements, and has hands-on experience debugging agentic systems in real time (including during live client demos).


Will McEntee

Screened

Mid-level Operations & Analytics Professional specializing in logistics and sports data

Anaheim, CA · 4y exp
Amazon · Georgetown University

Lifelong basketball player with extensive exposure to elite Southern California high school basketball (Servite/Trinity League) and familiarity with college recruiting through close connections, who applies a structured PFF-style evaluation lens to scouting. Comfortable identifying talent via film and in-person viewing and proactively engaging prospects through social media outreach; also brings experience working demanding overnight/on-call schedules from Amazon last-mile logistics.

SR

Mid-level Data & Business Analyst specializing in analytics engineering and BI

6y exp
Adobe · University of Wisconsin–Madison

Data/analytics professional with experience across manufacturing and enterprise environments (Wisconsin School of Business project with CNH Industrial; roles/projects at Ascensia Technologies, S&C, and Adobe). Has hands-on work combining warranty/lifecycle tables with technician free-text notes using TF-IDF + tree models (XGBoost/Random Forest), and deep experience in entity resolution/reconciliation across mismatched financial systems using Python/SQL and fuzzy matching, with production-grade pipeline practices in Azure Data Factory/Databricks.

CS

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA · 5y exp
Aurora AI · University of Chicago

ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.

Vedant Kharwal

Intern AI/ML Engineer specializing in Generative AI and applied machine learning

Mumbai, India · 1y exp
LTIMindtree · Boston University

New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.
