Vetted Data Validation Professionals

Pre-screened and vetted.

AW

Senior Software Engineer specializing in AI-powered compliance platforms

Dallas, TX · 14y exp
Meta · University of Chicago

Avani Agarwal

Screened References · Strong rec.

Senior Software Engineer specializing in real-time C++ systems and low-latency telemetry

San Diego, CA · 3y exp
Smallboard.com · University of Texas at Austin

LLM/agentic systems practitioner who partners directly with customers to productionize prototypes end-to-end—defining business-aligned metrics, building evaluation datasets, and shipping monitored, cost-bounded inference APIs on AWS Lambda. Notably delivered a vehicle damage classification system that cut manual review by 40% and stabilized agent workflows by instrumenting state transitions to uncover and fix a race-condition-driven skipped tool call.


Vishanth Iyer

Screened

Senior AI/ML Engineer specializing in LLMs, multimodal AI, and scalable MLOps

San Jose, CA · 10y exp
NVIDIA · Santa Clara University

ML/NLP engineer with experience at NVIDIA and Cruise building production-grade AI systems across genomics/biomedical research and autonomous vehicle data. Has delivered multimodal LLM pipelines, large-scale entity resolution, and hybrid semantic search (BERT embeddings + FAISS + Elasticsearch), with measurable impact (≈40% accuracy/retrieval gains; ≈30% data consistency improvement) and strong MLOps practices (Kubernetes, CI/CD, MLflow, Prometheus/Grafana).

SI

Entry-level Software Engineer specializing in large-scale infrastructure and distributed systems

Menlo Park, CA · 1y exp
Meta · USC

Early-career full-stack engineer with internship experience at Bloomberg and Capital One, shipping both data-heavy product features and AI-powered internal tools. Stands out for owning end-to-end user-facing systems, ranging from a spreadsheet-to-dashboard automation that cut workflows from hours to seconds, to a BERT-based support chatbot that improved reliability by ~25%.

Sri Yogesh Dorbala

Engineering Manager / Tech Lead specializing in large-scale distributed systems

Seattle, WA · 8y exp
DoorDash · Purdue University

Software engineer focused on personalization and data/ML infrastructure who built a GenAI/LLM-driven carousel ranking system end-to-end, delivering a reported 6–7% order-rate lift. Also designed large-scale personalization ETL (15PB for ~100M users) and created a custom Airflow operator to integrate with Databricks under enterprise version constraints, with hands-on on-call and data-quality reliability improvements.


Jesus Lujan

Screened

Senior Full-Stack Engineer specializing in FinTech compliance platforms

United States · 15y exp
Google · Mount Ida College

Software engineer with recent hands-on experience building a startup-style internal AML/KYC compliance platform at Google for financial services clients. They combine full-stack product work with backend/data systems expertise, delivering measurable impact including 44% higher screening throughput, 94% fewer false positives, and 33% faster client onboarding.

ZY

Entry-level Software Engineer specializing in backend distributed systems and logistics platforms

Bellevue, WA · 1y exp
Amazon · UCLA

SS

Senior Machine Learning Engineer specializing in LLMs and scalable MLOps

San Francisco, CA · 7y exp
Meta · Indiana University

MJ

Senior Software Engineer specializing in backend systems, data platforms, and FinTech

Remote, USA · 13y exp
Stripe · University of Tennessee, Knoxville

SN

Senior Data Engineer specializing in cloud data platforms and real-time streaming

California, USA · 7y exp
Netflix · Campbellsville University

SD

Mid-level AI/ML Engineer specializing in LLM infrastructure and FinTech ML platforms

St. Louis, MO · 6y exp
Anthropic · Saint Louis University

SC

Mid-level AI/ML Engineer specializing in Generative AI, LLM alignment, and RAG

CA · 6y exp
Scale AI · University of Texas at Arlington

Built and productionized a real-time enterprise RAG pipeline to improve factual accuracy and reduce LLM hallucinations by grounding responses in constantly changing internal knowledge bases (policies, manuals, FAQs). Experienced in orchestrating end-to-end ML workflows (Airflow/Kubernetes), handling messy multi-format data with schema enforcement (Pydantic/Hydra), and maintaining freshness via streaming incremental embeddings plus batch refresh. Also delivers applied ML solutions with non-technical teams (marketing/CRM) for segmentation and personalized engagement.


Daming Jiang

Screened

Intern Software/AI Engineer specializing in LLM fine-tuning and agentic RAG systems

0y exp
AT&T · Cornell University

Built and shipped an end-to-end LLM agent during an AT&T internship to automate network troubleshooting, with production-style reliability safeguards (timeouts/retries/fallbacks) and structured, state-machine orchestration; project won 3rd place in AT&T’s nationwide intern innovation challenge and was demoed to leadership. Also handled messy multi-partner data at Tencent by implementing schema validation/normalization, confidence-threshold fallbacks, and idempotent Python/ORM-based pipelines.

SK

Mid-Level Software Engineer specializing in data pipelines, observability, and analytics

San Francisco, CA · 2y exp
Meta · Arizona State University

Meta engineer who improved a critical revenue estimation dataset pipeline that was arriving ~6 days late—diagnosed via raw logs/lineage, redesigned legacy scans to only process the needed window, and shipped validation plus freshness/lag dashboards. Delivered ~50% latency reduction (to ~3 days) and regained adoption by running old/new pipelines in parallel with gated cutover and evidence-based customer communication. Applies incident-response rigor to real-time LLM/agentic workflow debugging and regularly runs developer demos/workshops.


Ahmed Sadaqat

Screened

Senior Machine Learning Engineer specializing in production ML and predictive analytics

Los Angeles, CA · 7y exp
Code Genix · UC Berkeley

ML/AI engineering leader who has owned end-to-end production systems from experimentation through deployment, monitoring, and iteration at meaningful scale. They describe running a 1M+ records/day prediction platform with 99.9% availability, shipping a RAG-based conversational AI feature for 50,000 active users, and consistently improving precision, latency, reliability, and cost with measurable business impact.

JO

Director-level Engineering Leader specializing in AI platforms and FinTech systems

San Francisco, CA · 27y exp
EarthXCG · Cal Poly San Luis Obispo

Fintech and AI product engineer who has owned major production rollouts, including Lending Club's banking-arm launch, and has since built LLM-powered decision systems for finance and climate use cases. Particularly strong in combining stakeholder management with pragmatic architecture choices like observability, deterministic pipeline design, RAG, and document-to-structured-data workflows.

DB

Junior Full-Stack Software Engineer specializing in scalable web platforms and AI integration

New York, NY · 2y exp
Amazon · Georgia Tech

Frontend engineer from Amazon Advertising who owned a sophisticated React/TypeScript ad creative builder used by advertisers and ad ops teams. Stands out for combining deep browser-level debugging with product-minded UX improvements that reduced support escalations and made complex multi-placement ad configuration faster and more reliable for power users.

AC

Senior Data Engineer specializing in cloud data platforms and analytics pipelines

Seattle, WA · 11y exp
Confluent · IIT Kanpur

Data engineer focused on building and operating reliable Airflow-orchestrated pipelines into BigQuery, including daily billing ingestion (~1GB/day) and ad platform (Facebook/LinkedIn) data collection. Implemented end-to-end data quality checks plus org-wide incident response automation integrating PagerDuty, Slack, and Jira, and has experience executing large backfills (4–5TB) via time-window batching.

Pratham Thukral

Mid-level Software Engineer specializing in distributed systems on AWS

Seattle, WA · 3y exp
Amazon · University of Waterloo

Data/infra engineer with AWS DynamoDB experience who has shipped reliability-critical systems (Global Tables replica repair protocol) and customer-facing service rollouts using canary/percentage-based deployments, strong observability, and rollback strategies. Also built end-to-end Airflow pipelines producing weekly automated reports over ~10TB of advertising segment data, with rigorous week-over-week data quality validation.


Neil Moon

Screened

Mid-level Full-Stack Software Engineer specializing in SaaS and backend systems

San Francisco, CA · 6y exp
Dokai · California State University, East Bay

Early-stage full-stack engineer who built Dokai's core web spreadsheet product and key AI features with just a two-engineer team. They combine strong product ownership with practical LLM integration experience, including reducing onboarding from a week to five minutes and solving difficult reliability and memory issues in production.

LT

Entry-level Data Analyst specializing in product, customer, and AI-driven analytics

New York, NY · 1y exp
Vendelux · University of Chicago

MK

Senior Machine Learning Engineer specializing in multimodal AI and biomedical data

Remote, USA · 8y exp
Google · University of Illinois Urbana-Champaign

CS

Senior Software Engineer specializing in distributed systems and data infrastructure

California, USA · 8y exp
Amazon · USC
