Vetted Data Engineers in the Bay Area

Pre-screened and vetted in the Bay Area.

Nagarjuna Kanamarlapudi - Director-level Data Platform Engineering Leader specializing in data governance and quality in Sunnyvale, CA

Director-level Data Platform Engineering Leader specializing in data governance and quality

Sunnyvale, CA15y exp
LinkedInBITS Pilani
View profile
Amit Khanna - Director-level Software Engineering Leader specializing in AI, Data Platforms, and Ads/FinTech in Menlo Park, California

Director-level Software Engineering Leader specializing in AI, Data Platforms, and Ads/FinTech

Menlo Park, California22y exp
MetaDr. A.P.J. Abdul Kalam Technical University
View profile
SR

Senior Machine Learning & GenAI Engineer specializing in LLM systems and data pipelines

San Francisco, CA7y exp
DatabricksIndiana Tech
View profile
SA

Mid-level Data Engineer specializing in cloud-native big data pipelines and analytics

San Jose, CA5y exp
CorsairSan José State University
View profile
MS

Senior Data Engineer specializing in payments and financial data platforms

San Jose, CA8y exp
TikTokNortheastern University
View profile
Emma Nguyen - Mid-level Software Engineer specializing in backend microservices, data pipelines, and QA in Foster City, CA

Mid-level Software Engineer specializing in backend microservices, data pipelines, and QA

Foster City, CA5y exp
AccentureUniversity of Pennsylvania
View profile
FL

Mid-level Software Engineer specializing in data platforms and full-stack systems

Palo Alto, CA4y exp
xAICase Western Reserve University
View profile
SS

Mid-level Data Engineer specializing in cloud lakehouse platforms (Azure/AWS/Snowflake)

San Francisco, CA5y exp
StripeEastern Illinois University
View profile
HO

Mid-level Machine Learning & Data Engineer specializing in MLOps and cloud data platforms

San Francisco, CA4y exp
Blue River TechnologyUC Berkeley
View profile
CC

Mid-level Data Engineer specializing in analytics engineering, ML forecasting, and modern data stacks

Cupertino, CA4y exp
AppleNortheastern University
View profile
AB

Principal Big Data & Software Engineer specializing in Spark/Scala and GCP data platforms

San Jose, CA13y exp
VerizonUniversity at Buffalo
View profile
BS

Senior Data Scientist specializing in LLMs, NLP, and anomaly detection

Foster City, CA9y exp
VisaUniversity at Buffalo
View profile
ET

Edwin Tse

Screened

Junior Data Engineer specializing in BI, governed metrics, and workflow automation

Berkeley, CA3y exp
EnvoyXUC San Diego

Built and shipped LLM/OCR/NLP-driven document-intelligence workflows in operational environments (EnvoyX and UPS), emphasizing production readiness via explicit state-machine orchestration, confidence gates, and human-in-the-loop review. Demonstrated strong business impact in customs brokerage/document ingestion: 50% fewer customs rejects, 30% higher throughput, SLA adherence improved from 71% to 96%, and platform reliability reaching 99.6% with 78% fewer bad-data incidents.

View profile
YY

Yinghai Yu

Screened

Mid-level Data Engineer specializing in cloud data platforms and AI/ML pipelines

San Mateo, CA6y exp
Bubbles and BooksGeorgia Tech

Data-engineering-oriented candidate with hands-on experience building an agentic AI product and operational automation workflows. They described automating inventory-to-ERP discrepancy reconciliation with anomaly detection and daily reporting, and also have practical scraping/automation experience dealing with Cloudflare-protected sites using Selenium and Puppeteer.

View profile
SG

Principal AI/ML Architect & Senior Data Scientist specializing in financial fraud and risk

San Jose, CA13y exp
NexusUC Davis
View profile
TH

Mid-level Data Engineer specializing in ML, OCR, and cloud-native pipelines

San Jose, CA4y exp
ZscalerStony Brook University
View profile
MM

Mid-level Data Engineer specializing in ML-driven pipelines and cloud microservices

San Jose, CA6y exp
HPEUC San Diego
View profile
VD

Principal Data Engineer specializing in petabyte-scale Spark pipelines on GCP

San Jose, CA19y exp
VerizonKakatiya University
View profile
SK

Mid-level AI/ML Engineer specializing in MLOps, real-time data platforms, and generative AI

Santa Clara, CA5y exp
Applied MaterialsUniversity of Central Oklahoma
View profile
ST

Senior AI/ML Engineer specializing in Generative AI and MLOps

Newark, CA11y exp
Lucid MotorsDeen Dayal Upadhyay Gorakhpur University
View profile
BH

Bryan Holland

Screened ReferencesStrong rec.

Executive AI Product & Controls Engineering Leader specializing in agentic video editing and EV systems

SF Bay Area, CA11y exp
MAGICSEVEN AIUniversity of Michigan

Startup builder (MagicSeven) who designed and implemented a browser-based, agentic video editor end-to-end, including an AWS event-driven multimodal LLM “indexing” pipeline and an orchestration LLM agent for searching and manipulating footage. Demonstrates deep video file/codec knowledge plus practical production hardening of LLM workflows (format validation, plan/execute, S3-based state for debuggability).

View profile
DW

David Wisdom

Screened

Mid-level Data & Machine Learning Engineer specializing in production ML and data platforms

San Francisco, CA5y exp
Spice DataWilliam & Mary

Built and deployed a production LLM system that scraped Google Maps menu photos, extracted structured prices via OpenAI, and cross-validated them against website-scraped data to automate data-quality verification at scale (replacing costly manual contractor checks). Demonstrates strong reliability instincts—precision-first prompting, output gating with image-quality metadata, and fuzzy matching/RAG techniques—plus solid orchestration (Dagster/Airflow) and observability (Sentry, Prometheus/Grafana).

View profile
RK

Mid-level AI/ML Engineer specializing in MLOps and LLM-powered applications

Mountain View, CA5y exp
IntuitUniversity of Central Missouri

AI/ML engineer with production experience building a RAG-based internal analytics assistant (Databricks + ADF ingestion, Pinecone vector store, LangChain orchestration) deployed via Docker on AWS SageMaker with CI/CD and MLflow. Strong focus on real-world constraints—latency/cost optimization (LoRA ~60% compute reduction), hallucination control with citation grounding, and enterprise security/governance. Previously at Intuit, delivered an interpretable churn prediction system (PySpark/Databricks, Airflow/Azure ML) that improved retention targeting ~12%.

View profile

Need someone specific?

AI Search