Vetted Data Engineers

Pre-screened and vetted.

PJ

Senior AI Engineer specializing in LLMs, RAG, and scalable data platforms

USA · 5y exp
Programmers.ai · University of Pennsylvania

HO

Mid-level Machine Learning & Data Engineer specializing in MLOps and cloud data platforms

San Francisco, CA · 4y exp
Blue River Technology · UC Berkeley

DS

Senior Data Engineer specializing in cloud data platforms and scalable ETL pipelines

Rosharon, TX · 11y exp
AssistRx · University of Texas at Austin

BK

Mid-level Data Engineer specializing in real-time streaming and ML feature pipelines

Atlanta, GA · 6y exp
Lyft · Grand Valley State University

SM

Mid-level AI/ML Engineer specializing in Generative AI and enterprise machine learning

New York, NY · 4y exp
Broadcom · University of Central Missouri

SR

Mid-level Data Engineer specializing in cloud lakehouse and streaming analytics

Remote, US · 4y exp
Ramp · University of Colorado Boulder

TP

Senior Data Engineer specializing in cloud-scale data pipelines and legal data systems

Austin, TX · 7y exp
NVIDIA · University of Central Missouri

BS

Senior Machine Learning Engineer specializing in computer vision and healthcare AI

Chicago, IL · 16y exp
ServiceNow · Northeastern Illinois University

KA

Senior Data Engineer specializing in cloud lakehouse platforms and healthcare data

Remote · 13y exp
Deloitte · University of Michigan

JH

Mid-Level Software Engineer specializing in data infrastructure and LLM applications

Remote · 3y exp
H60 Consulting · University of Illinois Urbana-Champaign

VS

Mid-level AI Data Engineer specializing in real-time streaming and LLM-powered fraud analytics

California, USA · 6y exp
PayPal · California State University, East Bay

CC

Mid-level Data Engineer specializing in analytics engineering, ML forecasting, and modern data stacks

Cupertino, CA · 4y exp
Apple · Northeastern University

NN

Mid-level Data Engineer specializing in real-time streaming and cloud data platforms

Green Bay, WI · 5y exp
Stripe · New England College

VA

Senior Data Engineer specializing in cloud-scale pipelines and legal data utilities

Austin, TX · 6y exp
IBM · University of North Texas

TM

Senior Data Engineer specializing in cloud data platforms and big data pipelines

Austin, TX · 11y exp
Accenture

VD

Vismay Devjee

Screened · References · Moderate rec.

Mid-level GenAI Engineer specializing in AI agents, RAG, and LLM evaluation

Boston, MA · 2y exp
Fidelity Investments · Northeastern University

Asset Management Risk professional at Fidelity Investments who built and productionized an agentic RAG platform that lets compliance staff and analysts query 10,000+ fund documents and get cited answers in seconds. Implemented structure-aware semantic chunking (AWS Textract), hierarchical retrieval, and hybrid search to raise accuracy from 68% to 94%, and built an evaluation framework tracking accuracy, latency, cost, and hallucinations, saving 40+ hours/month with zero critical production failures.

SK

Sahithi K

Screened

Mid-level Data Engineer specializing in cloud data platforms and streaming pipelines

Boston, MA · 4y exp
Moderna · University of Massachusetts Dartmouth

Data engineer with experience at Moderna and Block owning high-volume (≈10TB/day) production pipelines on AWS, using Kafka/S3/Glue/dbt/Snowflake with strong data quality and observability practices (schema validation, anomaly detection, CloudWatch monitoring). Also built external financial API ingestion with Airflow retries, throttling/token rotation, and schema versioning, and helped stand up an early-stage biomedical data platform with CI/CD and incident debugging.

LM

Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines

Austin, TX · 5y exp
eBay · Texas Tech University

Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.

Travoy Spelling

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX · 10y exp
Tredence · University of Texas at Austin

ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).

Saiteja Gaddam

Mid-Level Data Engineer specializing in cloud data platforms and streaming analytics

3y exp
Intuit · University at Buffalo

Data engineer (Intuit) who owned an end-to-end telemetry and subscription analytics platform processing ~22M events/day, built on Kinesis/S3/Glue/Spark/Airflow/Redshift. Strong focus on reliability and data quality (schema drift controls, quarantine layers, idempotent reruns) and performance tuning, achieving a reporting latency reduction from ~15 minutes to under 4 minutes while enabling revenue and churn analytics for business teams.

VS

Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps

5y exp
Capital One · University of the Cumberlands

AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.

NV

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States · 1y exp
WorkUp · USC

ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).
