Vetted Data Engineers in the Bay Area

Pre-screened and vetted in the Bay Area.

SV

Senior Software Engineer specializing in backend, data, and cloud systems

Pleasanton, CA · 4y exp
Avathon · University of Texas at Austin

Bryan Holland

Screened · References · Strong rec.

Executive AI Product & Controls Engineering Leader specializing in agentic video editing and EV systems

SF Bay Area, CA · 11y exp
MAGICSEVEN AI · University of Michigan

Startup builder (MagicSeven) who designed and implemented a browser-based, agentic video editor end-to-end, including an AWS event-driven multimodal LLM “indexing” pipeline and an orchestration LLM agent for searching and manipulating footage. Demonstrates deep video file/codec knowledge plus practical production hardening of LLM workflows (format validation, plan/execute, S3-based state for debuggability).


David Wisdom

Screened

Mid-level Data & Machine Learning Engineer specializing in production ML and data platforms

San Francisco, CA · 5y exp
Spice Data · William & Mary

Built and deployed a production LLM system that scraped Google Maps menu photos, extracted structured prices via OpenAI, and cross-validated them against website-scraped data to automate data-quality verification at scale (replacing costly manual contractor checks). Demonstrates strong reliability instincts—precision-first prompting, output gating with image-quality metadata, and fuzzy matching/RAG techniques—plus solid orchestration (Dagster/Airflow) and observability (Sentry, Prometheus/Grafana).

RK

Mid-level AI/ML Engineer specializing in MLOps and LLM-powered applications

Mountain View, CA · 5y exp
Intuit · University of Central Missouri

AI/ML engineer with production experience building a RAG-based internal analytics assistant (Databricks + ADF ingestion, Pinecone vector store, LangChain orchestration) deployed via Docker on AWS SageMaker with CI/CD and MLflow. Strong focus on real-world constraints—latency/cost optimization (LoRA ~60% compute reduction), hallucination control with citation grounding, and enterprise security/governance. Previously at Intuit, delivered an interpretable churn prediction system (PySpark/Databricks, Airflow/Azure ML) that improved retention targeting ~12%.

AS

Mid-level Software/Data Engineer specializing in AI-driven data platforms and cloud ETL

Sunnyvale, CA · 4y exp
Aspen Aerogels · UC Riverside
PK

Mid-level Data Engineer specializing in cloud-native ETL and real-time data pipelines

San Jose, CA · 6y exp
Gala Games · San José State University
AK

Senior Data Engineer specializing in legal data pipelines and AI-native metadata systems

Sunnyvale, CA · 12y exp
Walmart · University of Central Missouri
DT

Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection

San Jose, CA · 4y exp
Cisco · Lindsey Wilson College
Suraj Thangellapally

Junior Software Engineer specializing in machine learning and data science

San Jose, CA · 2y exp
dataAnnotation · UC Irvine

Python backend engineer who built a personal LLM-powered AI code review tool that parses code into context-preserving diff chunks and uses the OpenAI API to analyze and summarize changes. Has hands-on Kubernetes deployment experience (replicas, rolling updates, ConfigMaps/Secrets, health probes) and follows GitOps-style, declarative CI/CD workflows; also has experience designing streaming/event-style processing with attention to reliability and observability.

Tejas Kolpek

Screened

Mid-level Solutions Architect/Engineer specializing in AI and data integrations

Mountain View, CA · 5y exp
IpserLab · University at Buffalo

Solutions Engineer specializing in taking LLM copilots from demo to production, with a strong emphasis on enterprise security (RBAC/OAuth), grounded RAG behavior (cite-or-refuse), and operational readiness (eval loops, logging, runbooks). Experienced in real-time diagnosis of agentic/LLM workflow failures and in partnering with Sales/CS to run integration-first POCs that clear security and reliability concerns and accelerate rollout.

Yun-Chi Pang

Junior Software Engineer specializing in backend systems and data engineering

Santa Clara, CA · 1y exp
Apache Kafka · Northeastern University
Mengke Han

Mid-level Data Scientist specializing in ML, RAG chatbots, and analytics

San Jose, CA · 3y exp
Beshton Software Inc · Northeastern University
KR

Mid-level Data Engineer specializing in cloud data platforms and real-time streaming

San Francisco, CA · 5y exp
Humana · San Francisco State University
SK

Senior Data Engineer specializing in ETL, cloud data platforms, and AI/NLP solutions

South San Francisco, CA · 5y exp
Quantori · Northeastern University
CR

Junior Data Engineer specializing in Azure, ETL, and applied ML

San Francisco, CA · 2y exp
Cognizant · University of Colorado Boulder
CL

Mid-level Full-Stack Engineer specializing in scalable web platforms and data systems

Palo Alto, CA · 5y exp
Rhombus Power · University of Illinois Urbana-Champaign
PS

Junior Software Engineer specializing in AI data platforms on Azure

Fremont, CA · 2y exp
Sovereign InfoServices · George Mason University
NH

Senior Full-Stack Engineer specializing in infrastructure, telemetry, and enterprise collaboration systems

Fremont, CA · 10y exp
EOS IT Solutions · UC Riverside
Shabari Vignesh

Mid-level Data Engineer specializing in cloud data platforms and AI agents

Santa Clara, CA · 6y exp
Swirepay · San José State University

Data/Backend engineer who has owned end-to-end merchant analytics systems on AWS: orchestrated multi-source ingestion (FISERV/Shopify/Clover) with Step Functions/Lambda, enforced strong data quality gates, and served curated datasets via Redshift and a FastAPI layer. Also built an early-stage Merchant Insights AI agent that converts natural language questions into SQL using OpenAI models, with full CI/CD and observability.

Lakshmi Priya Ramisetty

Mid-level ML & Data Engineer specializing in GenAI, graph modeling, and fraud/risk analytics

Redwood City, CA · 5y exp
BlueArc · Yeshiva University

Built a production AI fraud/risk scoring platform at BlueArc that ingests web business/product/site data, generates text+image embeddings, and connects entities in a graph to detect reuse patterns and links to known bad actors. Optimized for scale with incremental graph re-scoring and delivered investigator-friendly explainability by surfacing the exact signals/relationships behind each score; orchestrated workflows with Airflow and GCP event-driven components (Pub/Sub, Dataflow, Cloud Run) and has recent LLM workflow orchestration experience (retrieval, prompting, scoring).

MB

Mid-level AI and Data Engineer specializing in cloud data platforms and LLM products

San Francisco, CA · 5y exp
EarlyApply.io · Northeastern University
Matineh Kashani

Senior Support Engineer specializing in data-driven production systems

San Francisco, CA · 10y exp
InfluxData · California State University, San Marcos
VR

Senior Data Engineer specializing in cloud data platforms, ETL pipelines, and analytics

Mountain View, CA · 7y exp
V-Soft · University at Buffalo
RK

Mid-level Data Analyst specializing in machine learning and analytics

San Jose, CA · 4y exp
San José State University · San José State University