Vetted Data Validation Professionals

Pre-screened and vetted.

MJ

Senior Software Engineer specializing in backend systems, data platforms, and FinTech

Remote, USA · 13y exp
Stripe · University of Tennessee, Knoxville

SN

Senior Data Engineer specializing in cloud data platforms and real-time streaming

California, USA · 7y exp
Netflix · Campbellsville University

SD

Mid-level AI/ML Engineer specializing in LLM infrastructure and FinTech ML platforms

St. Louis, MO · 6y exp
Anthropic · Saint Louis University

SC

Mid-level AI/ML Engineer specializing in Generative AI, LLM alignment, and RAG

CA · 6y exp
Scale AI · University of Texas at Arlington

Built and productionized a real-time enterprise RAG pipeline to improve factual accuracy and reduce LLM hallucinations by grounding responses in constantly changing internal knowledge bases (policies, manuals, FAQs). Experienced in orchestrating end-to-end ML workflows (Airflow/Kubernetes), handling messy multi-format data with schema enforcement (Pydantic/Hydra), and maintaining freshness via streaming incremental embeddings plus batch refresh. Also delivers applied ML solutions with non-technical teams (marketing/CRM) for segmentation and personalized engagement.

DJ

Daming Jiang

Screened

Intern Software/AI Engineer specializing in LLM fine-tuning and agentic RAG systems

0y exp
AT&T · Cornell University

Built and shipped an end-to-end LLM agent during an AT&T internship to automate network troubleshooting, with production-style reliability safeguards (timeouts/retries/fallbacks) and structured, state-machine orchestration; project won 3rd place in AT&T’s nationwide intern innovation challenge and was demoed to leadership. Also handled messy multi-partner data at Tencent by implementing schema validation/normalization, confidence-threshold fallbacks, and idempotent Python/ORM-based pipelines.

SK

Mid-Level Software Engineer specializing in data pipelines, observability, and analytics

San Francisco, CA · 2y exp
Meta · Arizona State University

Meta engineer who improved a critical revenue estimation dataset pipeline that was arriving ~6 days late: diagnosed the delay via raw logs/lineage, redesigned legacy scans to process only the needed window, and shipped validation plus freshness/lag dashboards. Delivered ~50% latency reduction (to ~3 days) and regained adoption by running old/new pipelines in parallel with gated cutover and evidence-based customer communication. Applies incident-response rigor to real-time LLM/agentic workflow debugging and regularly runs developer demos/workshops.


Ahmed Sadaqat

Screened

Senior Machine Learning Engineer specializing in production ML and predictive analytics

Los Angeles, CA · 7y exp
Code Genix · UC Berkeley

ML/AI engineering leader who has owned end-to-end production systems from experimentation through deployment, monitoring, and iteration at meaningful scale. They describe running a 1M+ records/day prediction platform with 99.9% availability, shipping a RAG-based conversational AI feature for 50,000 active users, and consistently improving precision, latency, reliability, and cost with measurable business impact.

AC

Senior Data Engineer specializing in cloud data platforms and analytics pipelines

Seattle, WA · 11y exp
Confluent · IIT Kanpur

Data engineer focused on building and operating reliable Airflow-orchestrated pipelines into BigQuery, including daily billing ingestion (~1GB/day) and ad platform (Facebook/LinkedIn) data collection. Implemented end-to-end data quality checks plus org-wide incident response automation integrating PagerDuty, Slack, and Jira, and has experience executing large backfills (4–5TB) via time-window batching.

Pratham Thukral

Mid-level Software Engineer specializing in distributed systems on AWS

Seattle, WA · 3y exp
Amazon · University of Waterloo

Data/infra engineer with AWS DynamoDB experience who has shipped reliability-critical systems (Global Tables replica repair protocol) and customer-facing service rollouts using canary/percentage-based deployments, strong observability, and rollback strategies. Also built end-to-end Airflow pipelines producing weekly automated reports over ~10TB of advertising segment data, with rigorous week-over-week data quality validation.

ZY

Entry-Level Software Development Engineer specializing in distributed systems and logistics orchestration

Bellevue, WA · 1y exp
Amazon · UCLA

LC

Senior AI/ML Engineer & Data Scientist specializing in NLP, entity resolution, and knowledge graphs

Remote · 8y exp
PlayStation · University of Virginia

SK

Mid-level Data Engineer specializing in AI/ML and cloud data platforms

Redmond, WA · 6y exp
Netflix · George Mason University

DY

Intern Software Engineer specializing in full-stack web and mobile development

Culver City, CA · 0y exp
Amazon · Princeton University

JW

Junior Data Analyst specializing in experimentation, data quality, and ML analytics

Los Angeles, CA · 2y exp
Apple · Cornell University

PK

Mid-level AI/ML Engineer specializing in RAG, NLP, and MLOps

Dallas, TX · 5y exp
Meta · University of North Texas

Bocheng Wan

Screened

Mid-level Software Engineer specializing in consumer robotics and machine learning

Sunnyvale, CA · 4y exp
Amazon · Rice University

Developer with a structured AI-assisted coding workflow centered on algorithmic rigor, type-safe scaffolding, and spec-driven modular design. Used Claude Code to help build an Android SDK for IPC, quickly ramping up on AIDL and compressing delivery including unit tests from several weeks to two weeks.

HK

Harish Kasu

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

San Francisco, CA · 5y exp
NVIDIA · Texas A&M University-Kingsville

AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.

GU

Engineering executive specializing in production ML systems and enterprise SaaS

San Francisco, CA · 26y exp
FLYR · Carnegie Mellon University

Engineering/data platform leader from FLYR (airline ML forecasting and automated pricing) who built scalable ingestion/ETL and a canonical data model to onboard airlines with highly heterogeneous source systems. Created a golden-metrics layer for airline KPIs and implemented monitoring/backfill capabilities, cutting onboarding time by 50%+ while improving SLA performance and controlling cloud/ML training costs through stronger data quality gates.


Vinay Ramrupe

Screened

Mid AI/ML Engineer specializing in LLM and enterprise generative AI

San Francisco, CA · 5y exp
Databricks · Cleveland State University

ML/AI engineer focused on taking LLM systems from experimentation to reliable production, including enterprise copilot and RAG-based knowledge retrieval use cases. Stands out for combining data pipelines, model training, inference optimization, automated evaluation, and safety guardrails, with cited impact including 20% throughput gains and 30% less manual evaluation effort.

BM

Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps

San Francisco, CA · 6y exp
Scale AI · Saint Louis University

MD

Mid-level Software Engineer specializing in backend, ML platforms, and FinTech

California, USA · 5y exp
Meta · Saint Louis University

SM

Senior Full-Stack Engineer specializing in Unity/C# and AI-driven VR/mobile healthcare systems

Cleveland, TN · 14y exp
Avegen Health · ITT Technical Institute

Manaswini Gogineni

Screened · References · Strong rec.

Mid-Level Software Engineer specializing in cloud infrastructure and full-stack web development

San Francisco, CA · 2y exp
Cisco · University of Wisconsin–Madison

Backend engineer at Electric Hydrogen who built a serverless device-log ingestion and processing platform in Python/Flask, scaling throughput (4x peak ingestion) while keeping sub-300ms API latency. Strong in Postgres/SQLAlchemy performance (partitioning, materialized views) and production ML integration (ONNX model served via FastAPI microservice with async batch inference, Redis feature caching, and drift monitoring via S3/Lambda). Experienced designing secure multi-tenant systems with schema-per-tenant isolation and KMS-backed encryption.

JQ

Jolie Qiu

Screened

Mid-Level Software Engineer specializing in AWS data infrastructure and pipeline automation

5y exp
Amazon · USC

AWS-focused software engineer who built a self-serve ETL pipeline scheduling service for non-engineers, including automated CloudFormation-based onboarding that cut setup time from 2–3 weeks to ~5 minutes. Strong in production reliability and customer-facing data platforms (EMR/DynamoDB/Lambda), with examples spanning pagination at scale, cross-table consistency, and phased rollouts to improve Parquet log SLAs.
