Vetted dbt Professionals

Pre-screened and vetted.

dbt, Python, SQL, Apache Airflow, Snowflake, AWS

Anurag Reddy

Screened

Mid-level Data Scientist specializing in ML, MLOps, and Generative AI

TX, USA | 5y exp

Caterpillar | University of Illinois Chicago

“ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.”

A/B Testing, Agile, Anomaly Detection, Ansible, Apache Airflow, Apache Hadoop, +138

Cary Burdick

Screened

Senior Data Scientist specializing in data engineering and analytics

Chicago, IL | 6y exp

USDA | Auburn University

“Data/NLP practitioner with experience in both financial services (Truist) and government (USDA), including an NLP-driven analysis of EU regulations to anticipate US regulatory focus and a major redesign/cleaning of complex pathogen lab-test public datasets. Built production data-quality pipelines with Dagster, Pandera, and Azure Synapse, and is comfortable validating hypotheses with historical backtesting and SME-driven quality controls.”

Python, PySpark, Pandas, NumPy, R, SciPy, +53

Hassan Abrar

Screened

Mid-level Analytics Professional specializing in marketing and business intelligence

Frisco, TX | 5y exp

TIAA | Purdue University

“Analytics professional at TIAA with hands-on experience combining SQL, Python, and statistical modeling to unify complex marketing, product, finance, and customer datasets. Has worked on advisor-tool adoption analysis, 10-year wealth diagnostics, forecasting, cohort analysis, and escalation-risk modeling, with findings used by marketing and contact-center stakeholders.”

Python, SQL, Snowflake, dbt, Git, Tableau, +75

Rushabh Thakkar

Screened

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and LLMs

New York City, NY | 3y exp

Wayfair | Stevens Institute of Technology

“Wayfair ML/AI engineer who has shipped and operated production LLM systems for both internal analytics and customer-facing assistants. Stands out for combining strong RAG/retrieval engineering with production-grade platform work—improving trust, reducing latency by ~30%, and cutting ad hoc reporting demand by ~50%.”

Machine Learning, Deep Learning, Natural Language Processing, Computer Vision, Large Language Models, Python, +168

Bharath Kumar Talasila

Screened

Mid-level Full-Stack Developer specializing in cloud-native enterprise platforms

4y exp

Cigna | Auburn University at Montgomery

“Built Nexthire-AI, shipping an end-to-end LLM-powered resume–job description matching product (React + Node.js) using embeddings and retrieval to generate match scores and skill-gap recommendations. Improved post-launch engagement by making feedback cleaner and more actionable, and added production guardrails (validation, timeouts, fallbacks) to handle messy resume formats and AI API instability.”

Angular, TypeScript, JavaScript, Bootstrap, Responsive Design, Java, +144

Siva Harini Sri Janaki Raman

Screened

Mid-level Data Engineer specializing in cloud data platforms

Dallas, TX | 3y exp

CVS Health | Texas Tech University

“Built an AI-powered internal support assistant at CVS Health using GPT-4, LangChain, and Pinecone, applying RAG, validation, and monitoring to reduce repetitive support tickets while protecting sensitive healthcare data. Stands out for a pragmatic approach to AI engineering: using multi-agent and LLM workflows to accelerate development while keeping systems constrained, observable, and production-friendly.”

Python, SQL, R, AWS, Amazon S3, AWS Glue, +110

Gordon Ng

Screened

Mid-Level Software Engineer specializing in AI/ML and distributed systems

Brooklyn, NY | 3y exp

Optum | Boston University

“Software engineer with production experience building a serverless monolith and multi-layer video pipeline at easyML, plus hands-on integration of multiple LLM providers (Grok/Claude/OpenAI) into a full-stack app. Interested in robotics via computer vision (OpenCV/OpenMMLab), with a strong real-time systems mindset around SLOs, latency, determinism, and reliability; also has low-level OS experience writing a keyboard device driver.”

Apache Kafka, AWS, AWS Lambda, CI/CD, Cloud Computing, C++, +77

Naga Yanala

Screened

Mid-level Data Engineer specializing in cloud data pipelines and analytics platforms

Texas, USA | 5y exp

Molina Healthcare | Southeast Missouri State University

“Data engineer with healthcare and enterprise experience (Molina Healthcare, Dell Technologies) building and operating high-volume batch + streaming pipelines across AWS and Azure. Strong focus on data quality (schema validation, fail-fast checks), reliability (monitoring/alerts, retries), and performance tuning (Spark/partitioning), with measurable runtime reduction and improved downstream trust.”

Python, SQL, PySpark, Bash, ETL, Data pipelines, +85

Sai Kavyusha Ponnagant

Screened

Mid-level Data Engineer specializing in cloud data pipelines and financial services warehousing

Chicago, IL | 4y exp

Charles Schwab | DePaul University

“Data engineer (Charles Schwab) who took ownership of an unstable, ambiguous nightly financial data pipeline and rebuilt it into a reliable, incremental AWS Glue/Airflow/Redshift system feeding Power BI. Created a custom Python data-quality framework with hard-stop gating and schema drift detection, improving integrity (99.9%), cutting runtime (~20%), and reducing incidents/tickets (35% fewer schema-related dashboard incidents; 30% fewer investigations).”

Python, SQL, Amazon S3, AWS Glue, Amazon Redshift, Amazon Athena, +73

UMESH KAMISETTY

Screened

Mid-level Data Engineer specializing in cloud lakehouse and streaming platforms

Seattle, WA | 5y exp

First United Bank | Cleveland State University

“Data engineer focused on building production-grade pipelines on AWS (Kafka/Kinesis/Glue/S3) through to curated serving layers in Snowflake and Delta Lake. Emphasizes automated data quality validation (PySpark + CI/CD), modular dbt transformations for analytics (customer spending, risk metrics), and operational reliability with CloudWatch and DLQs; data consumed by BI tools and ML pipelines for fraud detection and risk analytics.”

Python, PySpark, SQL, Shell Scripting, AWS, Amazon S3, +146

Harshitha Parupalli

Screened

Mid-level Data Engineer specializing in multi-cloud real-time and batch data pipelines

Jersey City, NJ | 4y exp

Elevance Health | NJIT

“Data engineer with healthcare domain experience who owned 100M+ record pipelines end-to-end (Kafka/Kinesis/ADF → PySpark/dbt validation → Spark SQL transforms → Snowflake/Power BI serving). Built production-grade reliability practices (Airflow orchestration, CloudWatch/Grafana monitoring, pytest + contract/regression tests, idempotent ingestion/backfills) and delivered measurable improvements: 35% lower latency and 40% better query performance.”

Python, SQL, Shell Scripting, R, Scala, Java, +160

Mohan Naik Megavath

Screened

Mid-level Data Engineer specializing in real-time pipelines and cloud data platforms

Remote, USA | 4y exp

Truist | Elmhurst University

“Backend engineer with hands-on experience building secure Python/Flask services (sessions, JWT, RBAC) and optimizing PostgreSQL/SQLAlchemy performance, including custom SQL using CTEs/window functions profiled via EXPLAIN ANALYZE. Also integrates LLM features via OpenAI/Azure into backend systems and improves scalability with RabbitMQ-driven async processing, caching, and multi-tenant data isolation patterns.”

Amazon Athena, Amazon DynamoDB, Amazon EC2, Amazon Redshift, Amazon S3, AngularJS, +137

MOUNIKA SAI MEKALA

Screened

Junior Data Analyst specializing in financial and operational analytics

Kansas, USA | 3y exp

KPMG | University of Central Missouri

“Analytics professional with experience at KPMG turning messy operational and financial data from SQL Server and AWS S3 into clean reporting datasets and automated Python workflows. They combine SQL, Python, Power BI, and experimentation methods to deliver stakeholder-aligned KPI dashboards and marketing performance insights with a strong focus on data integrity and reproducibility.”

SQL, Python, Pandas, NumPy, SciPy, R, +103

Anuj Shah

Screened

Senior Data Analyst specializing in cloud data platforms, experimentation, and predictive analytics

GA, USA | 9y exp

UnitedHealth Group | Northwestern Polytechnic University

“Healthcare data/ML practitioner with experience at UnitedHealth Group building production ETL and streaming pipelines (Python, BigQuery, Kafka) that unify EHR, IoT device, and lab data for patient risk prediction. Also implemented embedding-based semantic search/linking for noisy clinical notes via domain adaptation and rigorous validation with clinical stakeholders; previously built churn prediction at DirecTV using XGBoost.”

Python, SQL, R, Apache Spark, PySpark, Apache Kafka, +111

Mukesh Rajmohan

Screened

Mid-level Data Engineer specializing in AWS/Azure pipelines and streaming analytics

VA, USA | 5y exp

UnitedHealth Group | George Mason University

“Data engineer with experience across healthcare and geospatial risk systems, owning end-to-end pipelines from ingestion through serving on AWS/Azure stacks. Built HIPAA-compliant data quality gates and CDC for millions of daily claims, and also delivered a real-time wildfire risk platform with 20-minute refresh cycles and a 60% data accuracy lift. Strong in streaming (Kafka), Spark performance tuning, and production-grade orchestration/CI/CD (Airflow, Docker, Jenkins, GitHub Actions, Terraform).”

Python, SQL, Java, AWS, Amazon S3, AWS Lambda, +95

Jax Diagana

Screened

Senior AI Engineer specializing in forward-deployed voice agents and incident-response automation

San Francisco, CA | 7y exp

Anaplan | University of St. Thomas

“FDE at Bland.ai and founder of Fi (incident-response agent) who routinely takes LLM/agentic concepts from prototype to production. Has hands-on experience reverse-engineering undocumented systems to deliver integrations, building LLM testbeds for voice-agent reliability, and rapidly shipping RAG/semantic search solutions (e.g., Confluence runbooks) after deep customer discovery with DevOps/SRE teams.”

A/B Testing, AI Agents, Automation, Confluence, Data Science, dbt, +66

Abdul Mohammed

Screened

Mid-level Data Analyst specializing in healthcare and financial analytics

USA | 3y exp

Cardinal Health | Indiana Tech

“Built and productionized an LLM-powered clinical documentation and insights pipeline at Cardinal Health using LangChain + GPT-4 with RAG to summarize long clinical notes, extract medication/dosage entities, and generate structured SQL-ready outputs for downstream analytics. Emphasizes clinical reliability via labeled benchmarking (precision/recall/F1), shadow deployments, clinician human-in-the-loop review, and ongoing monitoring/orchestration with Airflow, Lambda, S3, Postgres, and Power BI.”

SQL, Python, R, HTML, JSON, Microsoft Excel, +105

Sana Khan

Screened

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech

Oklahoma, USA | 4y exp

Capital One | Oklahoma Christian University

“ML/LLM engineer who has deployed a production LLM-powered assistant for intent classification and query routing (order recommendation/support deflection), combining BERT fine-tuning with an embedding-based retrieval layer and optimizing for low-latency inference. Experienced with end-to-end reliability practices—Airflow-orchestrated ETL, data validation/alerting, MLflow experiment tracking, and iterative improvements driven by user feedback and monitoring.”

Python, SQL, NumPy, Pandas, Bash, PySpark, +97

Annie Suzan

Screened

Mid-level Software Engineer specializing in machine learning and real-time data systems

Remote, USA | 3y exp

ThoughtWorks | Arizona State University

“Hands-on implementation-focused candidate with experience owning cloud deployments and putting LLM/RAG workflows into production. They stand out for combining customer-facing deployment ownership with practical AI systems work, including retrieval tuning, hallucination mitigation, production incident response, and document-processing pipelines for messy real-world inputs.”

Python, Java, JavaScript, SQL, Bash, React, +121

Eugene Nelepko

Screened

Executive growth leader specializing in AI-powered SaaS, marketplaces, and e-commerce

San Francisco, CA | 13y exp

Magnit | RSTU

“Growth leader with strong zero-to-one and systems-building experience across e-commerce and retail media. Most notably, they proposed and launched a new retail media division from scratch, presold demand before product build, and scaled it to $1.5M ARR with 85% margin, while also building data-driven lifecycle and acquisition systems that materially improved activation and CAC efficiency.”

A/B testing, Snowflake, Looker, Computer vision, Predictive analytics, Data analytics, +122

Sailaja Lokasani

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and healthcare analytics

Dallas, TX | 5y exp

Lightbeam Health Solutions | Syracuse University

“Healthcare-focused data engineer/ML practitioner with experience at Lightbeam Health Solutions and Humana building production entity-resolution and semantic similarity pipelines across EMR, lab, and claims data. Uses NLP/ML (spaCy, scikit-learn, BioBERT/LightGBM) plus Snowflake/Airflow and vector search (Pinecone) to improve linkage accuracy (reported 90%) and semantic match quality (reported +12–15%), while reducing manual cleanup by 40%+.”

Apache Airflow, AWS, AWS Glue, AWS Lambda, Agile, C++, +134

Vamshi Arempula

Screened

Senior AI/ML Engineer specializing in Generative AI, RAG, and agentic systems

6y exp

Wellmark Blue Cross and Blue Shield | Indiana Wesleyan University

“GenAI/LLM ML engineer (currently at Webprobo) building an enterprise GenAI platform with document intelligence and automation on AWS and blockchain. Has hands-on experience with RAG, LLM evaluation tooling, and orchestrating production LLM workflows with Apache Airflow, plus deep exposure to reliability challenges in globally distributed/edge deployments. Also partnered with business/marketing stakeholders at a banking client to deliver an AI-driven customer retention insights solution.”

A/B Testing, Agile, Amazon API Gateway, Amazon Bedrock, Amazon CloudWatch, Amazon Redshift, +212

Shashank Garg

Screened

Engineering leader specializing in FinTech ML/AI platforms

San Francisco, CA | 12y exp

TravelBank | San José State University

“Engineering Manager/player-coach leading Data Infrastructure, ML/DS, and AI Engineering pods who recently shipped multiple production agentic GenAI features. Built privacy-preserving LLM workflows (PII redaction via Microsoft Presidio) and drove an AI expense-approval agent from ambiguous ask to GA, cutting approval time from ~2.5 days to <4 hours with >85% accuracy. Also owned a major LLM cost overrun incident and implemented cost observability plus circuit breakers to prevent runaway agent loops.”

Leadership, Team Building, Agile, Generative AI, MLOps, LangGraph, +102

Mayur Komaravelly

Screened

Senior Data Analyst specializing in data pipelines, web scraping, and legal data enrichment

Illinois, USA | 5y exp

The Hartford | Indiana Wesleyan University

“Data engineer focused on reliable, scalable analytics pipelines and external data collection. Has owned end-to-end pipelines processing 5–10M records/day, serving Snowflake data marts to Power BI/Tableau, and reports ~99% reliability through strong validation/monitoring. Also shipped versioned REST APIs for curated data with query optimization and caching.”

Apache Airflow, Apache Kafka, Apache Spark, Ansible, API Design, AWS Glue, +140
