Vetted dbt Professionals

Pre-screened and vetted.

AA

Senior AI/ML Engineer specializing in GenAI, LLMs, NLP, and MLOps

Manhattan, NY · 10y exp
AssemblyAI
SC

Mid-level Data Engineer specializing in cloud data platforms and real-time streaming

5y exp
Vertisage Technologies · Carnegie Mellon University

Worked on onboarding a Middle East logistics client processing thousands of invoices/month, building a production-ready pipeline that routes known vendor PDFs to deterministic regex parsers via Tax ID matching and falls back to LlamaParse for unknown layouts. Added financial consistency validation plus human-in-the-loop review and logging/metrics to continuously reduce LLM usage and improve template coverage.

MM

Principal Applied Scientist specializing in ML systems and Generative AI

Tampa, FL · 11y exp
Oracle · University of South Florida

Built and owned an end-to-end agentic RAG chatbot platform for Baptist Health that helped clinicians access policy and clinical documents faster, reducing manual lookup by 80% and delivering about $2M in annual savings. Brings strong healthcare GenAI production experience, including HIPAA-aligned governance, PHI redaction, observability, evaluation, and scalable Python/Kubernetes deployment practices.

MK

Mid-level Data Analyst specializing in retention, churn, and customer analytics

Chicago, IL · 5y exp
Optum · Northern Illinois University

Analytics professional with experience across healthcare and fintech, including building SQL/Python data pipelines at Optum and owning a fraud detection initiative at Razorpay. Stands out for combining messy-data cleanup, reproducible analytics workflows, and stakeholder-driven metric design, with a reported 25% improvement in fraud detection while keeping false positives under control.

Ambikadevi Damodaran

Principal Engineer specializing in GenAI/LLM platforms and enterprise modernization

Hayward, California · 19y exp
Realtor.com · Texas State University

Built a production LLM-driven personalization microservice for realtor.com using LangChain/LangSmith with an MCP tool layer and RAG to generate schema-constrained ranked listings in real time, replacing a rule-based engine and improving engagement/lead conversion. Also owned an ambiguous cross-channel identity initiative, implementing an identity graph via Twilio Unify with required SDK and data-warehouse integration.

HS

Senior Data Engineer specializing in multi-cloud data platforms and streaming pipelines

4y exp
Northern Trust · University of Texas at Arlington

Data platform engineer with hands-on ownership of high-volume financial data pipelines (millions of transactions/day) on Azure (ADF, Databricks, Delta Lake, Synapse), emphasizing schema-drift protection and automated data-quality gates. Also built resilient web scraping pipelines with anti-bot and backfill strategies, and shipped a versioned FastAPI + Redis data API with autoscaling, testing, and CI/CD via GitHub Actions.

PC

Intern AI/Data Science Engineer specializing in LLM agents, data engineering, and predictive analytics

Overland Park, Kansas · 1y exp
Novel Capital · USC
EO

Senior Software Engineer specializing in distributed systems and cloud infrastructure

U.S.A. · 12y exp
Elastic · University of Georgia
BS

Senior Data Scientist specializing in LLMs, NLP, and anomaly detection

Foster City, CA · 9y exp
Visa · University at Buffalo
PP

Senior Data Engineer specializing in Cloud Data Platforms and Generative AI

Brooklyn, NY · 11y exp
JPMorgan Chase · Osmania University
DP

Junior Data Analyst specializing in finance, supply chain, and GTM analytics

New York, NY · 2y exp
Authentic Brands Group · NYU
KP

Mid-level Data Engineer specializing in GCP, Spark, and healthcare analytics

New York, NY · 3y exp
CVS Health · Columbia University
TW

Timothy Wong

Screened

Mid-level Data Engineer specializing in experimentation, analytics, and AI-driven product experiences

4y exp
ZoomInfo · University of Texas at Austin

Built production LLM automations using the Claude API, including a sales enablement workflow that summarizes playbooks and incorporates sales call metadata into strategic one-pagers. Experienced in orchestrating and scheduling data pipelines with SnapLogic, Airflow, and Databricks, and in scaling LLM API calls via parallel/batch processing. Also partnered with HR to deliver prompt-tuned, automated Slack messaging aligned to business tone and acceptance criteria.

ET

Edwin Tse

Screened

Junior Data Engineer specializing in BI, governed metrics, and workflow automation

Berkeley, CA · 3y exp
EnvoyX · UC San Diego

Built and shipped LLM/OCR/NLP-driven document-intelligence workflows in operational environments (EnvoyX and UPS), emphasizing production readiness via explicit state-machine orchestration, confidence gates, and human-in-the-loop review. Demonstrated strong business impact in customs brokerage/document ingestion: 50% fewer customs rejects, 30% higher throughput, SLA adherence improved from 71% to 96%, and platform reliability reaching 99.6% with 78% fewer bad-data incidents.

JS

Intern Software Engineer specializing in edge AI deployment and distributed systems

San Francisco, CA · 1y exp
Zetic AI · San José State University

Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).

FM

Junior ML research engineer specializing in evaluation platforms and applied machine learning

New York, NY · 3y exp
Arthur AI · Emory University

ML/LLM infrastructure engineer who built and shipped a production internal evaluation + failure-analysis agent (Arthur AI / R3AI context) that orchestrated end-to-end benchmarks with deterministic lineage, regression detection, and root-cause reporting at 5,000+ benchmarks/week. Also built backend observability and data validation systems for analytics pipelines at FullStory processing ~3.4B weekly events, emphasizing schema validation, quarantine fallbacks, and idempotent operations.

NP

Nikita Prasad

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and scalable data pipelines

Remote, USA · 5y exp
JPMorgan Chase · University of Dayton

Built and shipped a production LLM-powered personalized client engagement assistant in the financial domain, balancing real-time recommendations with strict privacy/compliance requirements. Demonstrates strong MLOps/LLMOps depth (Airflow + MLflow, containerized microservices, drift monitoring) and a privacy-by-design approach validated in collaboration with risk and compliance teams.

HK

Harini Kv

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Dallas, TX · 7y exp
Equinix · Fitchburg State University

GenAI/data engineering practitioner with production experience across Equinix, Optum, and Citibank. Built an Azure OpenAI (GPT-4) + LangChain document intelligence platform processing 1.5M+ docs/month and a HIPAA-compliant Airflow healthcare pipeline handling 5M+ claims/day. Also delivered a real-time fraud detection + explainability system using LightGBM and a fine-tuned T5 NLG component, improving fraud detection accuracy by 15%+ while partnering closely with compliance stakeholders.

SG

Mid-level Data Engineer specializing in streaming and cloud data platforms for financial services

Edison, NJ · 3y exp
Morgan Stanley · Pace University

Data engineering-focused candidate (internship/project experience) who built end-to-end pipelines processing a few million transactional records/day for fraud detection and reporting, using Airflow, Python/SQL, and PySpark with strong emphasis on data quality gates, idempotency, and monitoring. Also implemented an external web/API data collection system with anti-bot tactics and schema-change quarantine, and shipped a versioned Flask API to serve curated warehouse data.

Thomas To

Screened

Mid-level Full-Stack Engineer specializing in AI/ML data platforms for biotech and FinTech

Emeryville, CA · 6y exp
Canventa Life Sciences · UC Davis

AI/ML full-stack practitioner in a small-scale manufacturing/lab operations environment who deployed a production ML system to improve blood cell order fulfillment by predicting yield/success from donor characteristics. Experienced building custom multi-agent orchestration (Python, LangChain/LangGraph, MCP) and balancing reliability, data quality constraints, and token/ROI economics while communicating tradeoffs to VP-level business stakeholders.

Prasanna Chelliboyina

Mid-level Machine Learning Engineer specializing in forecasting, NLP, and GenAI

United States · 6y exp
Walgreens · Syracuse University

GenAI/ML engineer with production experience building multilingual LLM systems (English/Spanish) and RAG-based clinical documentation summarization at Walgreens, combining prompt engineering, structured output validation, and rigorous evaluation (ROUGE + pharmacist review). Also orchestrated end-to-end ML pipelines for demand forecasting using Apache Airflow, PySpark, and MLflow with scheduled retraining and production monitoring.

AC

Mid-level AI/ML Engineer specializing in LLM systems, MLOps, and Healthcare AI

Remote, USA · 5y exp
CVS Health · University of Missouri-Kansas City

Built and shipped a production-grade agentic RAG system at CVS Health for patient adherence and medication recommendations, processing 20k+ patient records/day. Strong focus on real-world reliability: hybrid retrieval tuned with re-ranking (<400ms latency), strict JSON/schema validation and tool guardrails, and monitoring/drift detection that reduced MTTD from 6 days to 18 hours while improving recommendation accuracy (+8%) and cutting escalations (~23%).

