
Vetted dbt Professionals

Pre-screened and vetted.

dbt, Python, SQL, Apache Airflow, Snowflake, AWS

Gowri Shankar Ananthula

Screened

Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG

Auburn Hills, MI | 4y exp

Stellantis | University of Cincinnati

“ML/NLP practitioner who built a retrieval-augmented generation (RAG) system for large financial and operational document sets using Sentence-Transformers (all-mpnet-base-v2) and a vector DB (e.g., Pinecone), with a strong focus on retrieval evaluation and chunking strategy optimization. Experienced in entity resolution (rules + embedding similarity with type-specific thresholds) and in productionizing scalable Python data workflows using Airflow/Dagster and Spark.”

Python, SQL, R, Pandas, NumPy, SciPy, +177 more


Ishwar Girase

Screened

Mid-level AI/ML Engineer specializing in LLMs, GenAI, and NLP

Hampton, NJ | 6y exp

Unum | University of Texas at Dallas

“AI/ML Engineer who built a production RAG-based LLM system for insurance policy documents, turning thousands of messy PDFs into a searchable index using LangChain, Azure AI Search vectors, hybrid retrieval, and FastAPI. Strong focus on evaluation (MRR/precision@k/recall@k, RAGAS) and performance optimization (vLLM), with prior clinical NLP experience using BERT-based NER validated on ground-truth datasets.”

A/B Testing, AWS, AWS Lambda, BERT, Business Intelligence, C++, +169 more


Hemanth Dantu

Screened

Senior Software Engineer specializing in data pipelines and legal data systems

8y exp

Angi | University of Missouri-Kansas City

“Data/analytics engineer who owned Angi’s service-request funnel event pipeline end-to-end, routing events server-side to bypass ad blockers and recovering ~15% lost tracking at millions of events/day. Built Snowflake/dbt reporting tables powering Looker dashboards, with strong emphasis on validation, monitoring/alerting, and safe schema evolution. Also shipped a reusable flow state management backend service with TTL storage, CI/CD, and developer-friendly APIs.”

API Design, Angular, Audit Logging, AWS, CI/CD, Containerization, +77 more


Parvinder Singh

Screened

Mid-level Data Engineer specializing in AWS lakehouse platforms and scalable ETL/ELT

Texas, USA | 4y exp

Humana | University of Texas at Dallas

“Data engineer focused on reliable, production-grade pipelines and data services: has owned end-to-end ingestion-to-serving workflows processing millions of records/day, using Airflow, Python/SQL, and PySpark. Demonstrates strong operational rigor (monitoring, retries, idempotency, backfills) and measurable outcomes (98% stability, ~30% faster processing), plus experience exposing curated warehouse data via versioned REST APIs.”

Data Engineering, Data Pipelines, AWS, Databricks, Snowflake, ETL, +88 more


Chandan Chalumuri

Screened

Mid-level Data Scientist specializing in ML, NLP, and Generative AI

Tempe, AZ | 4y exp

MetLife | Arizona State University

“Data engineering / ML practitioner with experience at MetLife building transformer-based sentiment analysis over large unstructured datasets and productionizing pipelines with Airflow/PySpark/Hadoop (reported 52% efficiency gain). Also implemented embedding-based semantic search using Pinecone/Weaviate to improve retrieval relevance and enable RAG for customer support and document matching use cases.”

A/B Testing, Agile, Apache Airflow, Apache Hadoop, Apache Kafka, Apache Spark, +170 more


Vineetha Pulipati

Screened

Mid-level Software Engineer specializing in backend microservices and cloud data pipelines

MO, USA | 4y exp

Morgan Stanley | Webster University

“Backend engineer with Morgan Stanley experience building and owning an end-to-end Python FastAPI microservice for high-volume market data used by trading and risk systems. Strong in performance tuning and reliability (PySpark, Redis caching, async APIs), real-time streaming with Kafka, and production operations (Docker/Kubernetes, GitOps-style CI/CD, monitoring). Has led cloud/on-prem migration work across AWS and Azure, including fixing Azure Synapse performance issues via query and pipeline redesign.”

Python, SQL, Bash, Shell Scripting, TypeScript, C++, +129 more


Erik Moyer

Screened

Director-level Data Science & Analytics Leader specializing in cloud data platforms and AI/ML

Dallas, TX | 13y exp

Enumerate | Florida State University

“Candidate states they are very familiar with the venture capital/studio/accelerator landscape and expresses strong willingness to pursue entrepreneurship "at all costs," but did not provide details on a current startup, business plan, fundraising, or prior accelerator/VC involvement during the interview.”

Python, SQL, R, JavaScript, Java, Ruby, +88 more


Srilekha Pothula

Screened

Mid-level Data Engineer specializing in cloud data pipelines for healthcare and financial services

Bloomfield, CT | 4y exp

Cigna | Pace University

“Data engineer with ~4 years of experience (Cigna) building and operating Azure Data Factory pipelines for healthcare claims/member/provider data at 2–3M records/day. Emphasizes reliability and downstream safety via schema/data-quality validation, quarantine workflows, idempotent processing, and backfills; also improved runtime ~20% through SQL optimization and served curated datasets through versioned views and well-documented, analyst-friendly interfaces.”

Apache Airflow, Apache Kafka, Apache Spark, AWS, AWS Glue, AWS Lambda, +71 more


Madhupal Singu

Screened

Mid-level Data Engineer specializing in multi-cloud data platforms for healthcare and finance

USA | 6y exp

Cigna | University of Cincinnati

“Data engineer with Cigna experience building and operating an end-to-end AWS-based healthcare claims pipeline processing ~2TB/day, using Glue/Kafka/PySpark/SQL into Redshift. Strong focus on data quality and reliability (schema validation, monitoring/alerting, retries/checkpointing/backfills), reporting improved accuracy (~99%) and reduced latency, plus experience serving real-time Kafka/Spark data to downstream analytics with documented data contracts.”

Python, Pandas, PySpark, SQL, Scala, Java, +88 more


Harideep Balusa

Screened

Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems

USA | 6y exp

Freddie Mac | University of Wisconsin

“Built and productionized Azure-based LLM/RAG systems for regulatory/compliance use cases, including automating analyst research and compliance report generation across large unstructured document sets. Demonstrates strong practical depth in hallucination mitigation, hybrid retrieval tuning (BM25 + embeddings), and production MLOps (Databricks, Cognitive Search, AKS, Airflow/MLflow), plus proven ability to deliver auditable, explainable solutions with non-technical compliance teams.”

Python, R, SQL, Scala, Machine Learning, Deep Learning, +125 more


Hema Edavalapati

Screened

Mid-level AI/ML Engineer specializing in cloud data engineering and GenAI

Florida, USA | 6y exp

LexisNexis | University of South Florida

“AI/LLM engineer with production experience in legal tech: built a GPT-4 + LangChain RAG summarization system at Govpanel that reduced legal case-file review time by 50%+. Previously at LexisNexis, orchestrated end-to-end Airflow data/AI pipelines processing 5M+ legal documents daily, improving ETL runtime by 35% with robust validation, monitoring, and SLAs.”

SQL, SQL query optimization, Python, Pandas, NumPy, PySpark, +159 more


Brian Mar

Screened

Senior Data Engineer specializing in data infrastructure and marketing/CRM analytics

San Mateo, CA | 8y exp

Full Circle Insights | UC Davis

“Salesforce-focused implementation/solutions engineer from Full Circle Insights who owned end-to-end campaign attribution and reporting deployments for multiple customers at once (3–5 concurrently), including sandbox testing, KPI monitoring, and rollback-safe migrations from legacy reporting. Also builds personal multi-agent workflows and uses Claude Code to rapidly scaffold data/analytics scripts like an advertising optimization parser over CSV/XLSX inputs.”

Data Engineering, Data Modeling, ETL, dbt, Snowflake, Apache Airflow, +85 more


Tejaswi Aravelli

Screened

Junior Machine Learning Engineer specializing in Generative AI and analytics automation

Bengaluru, India | 2y exp

Accenture | University of Alabama at Birmingham

“AI/LLM engineer who built a production intelligent support system using RAG over a vectorized documentation library, addressing real-world issues like lost-in-the-middle context failures and doc freshness via automated GitHub-driven re-embedding pipelines. Emphasizes rigorous agent evaluation (component/E2E/ops) and prefers lightweight, decoupled workflow automation using message brokers (Redis/RabbitMQ) over heavyweight orchestration frameworks.”

Python, SQL, R, Java, TensorFlow, Keras, +100 more


Varun Gattamaneni

Screened

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ | 5y exp

HCLTech | Rowan University

“Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.”

Python, Pandas, NumPy, R, SQL, Bash, +150 more


Venkatesh Sanaboina

Screened

Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps

Tampa, FL | 9y exp

Verizon | Jawaharlal Nehru Technological University

“Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.”

A/B Testing, Agile, Amazon Redshift, Amazon S3, Amazon SageMaker, Anomaly Detection, +168 more


Harsha Sikha

Screened

Mid-level AI/ML Engineer specializing in Generative AI and data engineering

Armonk, New York | 4y exp

IBM | Saint Peter's University

“IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.”

A/B Testing, Agile, Anomaly Detection, API Development, Apache Hadoop, Apache Hive, +157 more


Tharun Kshathriya Sangaraju

Screened

Mid-level AI Engineer specializing in LLM orchestration, RAG, and multi-agent systems

Houston, TX | 4y exp

University of Houston | University of Houston

“Research Assistant at the University of Houston who built and live-deployed a production RAG system for 1000+ research documents, using hybrid retrieval (dense+BM25+RRF) with cross-encoder reranking and RAGAS-based evaluation; reported 66% MRR, 0.85+ faithfulness, and 68% lower LLM inference costs. Also built a deployed LangGraph multi-agent research system (Researcher/Critic/Writer) with tool integrations (Tavily, arXiv) and dual memory (ChromaDB + Neo4j), plus freelance automation work delivering a WhatsApp chatbot and n8n workflows for a wholesale clothing business.”

Agentic AI, AI Agents, API Integration, Apache Airflow, Apache Hadoop, Apache Kafka, +118 more


Sai Harshith Varma Pericherla

Screened

Mid-level Data Engineer specializing in cloud ETL/ELT and lakehouse architecture

Jersey City, NJ | 4y exp

State Street | University of New Haven

“Data engineer focused on sales/marketing analytics pipelines, owning ingestion from CRMs/ad platforms through warehouse serving and dashboards at ~hundreds of thousands of records/day. Built reliability-focused systems including dbt/SQL/Python data quality gates with alerting, a resilient web-scraping pipeline (retries/backoff, anti-bot tactics, schema-change detection, backfills), and a versioned internal REST API with caching and strong developer usability.”

SQL, Python, Pandas, NumPy, Scikit-learn, Java, +151 more


Sheshikanth Pothuganti

Screened

Mid-level Data Engineer specializing in real-time streaming and cloud data platforms

New York, NY | 4y exp

Wells Fargo | University of Birmingham

“Data engineer with Wells Fargo experience owning an end-to-end lakehouse ETL pipeline on Databricks/Azure Data Factory, processing ~480GB daily and implementing robust data quality/reconciliation across 40+ tables to reach ~99.3% reliability. Strong in performance optimization (cut runtime 5.5h→3.8h), CI/CD and monitoring, and resilient external/API ingestion with retries, schema validation, and backfills.”

Python, SQL, Java, Scala, R, PostgreSQL, +122 more


Aishwarya Thorat

Screened

Intern Data Scientist specializing in ML engineering and LLM agentic workflows

San Francisco, CA | 6y exp

Contentstack | San José State University

“Built an agentic, multi-step LLM system that generates full-stack code for API integrations using LangChain orchestration, Pinecone/SentenceBERT RAG, and a human-in-the-loop feedback loop for iterative code refinement. Also collaborated with non-technical content writers and PMs during a Contentstack internship to deliver a Slack-based AI workflow that generates and brand-checks articles with one-click approvals.”

A/B Testing, Agentic AI, Amazon Redshift, Amazon S3, API Integration, AWS, +129 more


Shanmukh Gudapati

Screened

Mid-level Data Analyst specializing in business intelligence and cloud data platforms

Stamford, CT | 4y exp

Franklin Templeton | University of Bridgeport

“Healthcare analytics professional with TCS/Humana experience turning messy claims and eligibility data into reliable reporting assets using SQL and Python. They combine strong data engineering and analytics execution with stakeholder management, including automating monthly claims reporting from half a day to under 5 minutes and driving a provider outreach effort that reduced claim rejection rates by about 20%.”

Data Analytics, Data Modeling, SQL, Python, Pandas, NumPy, +102 more


Hinal Kuvadiya

Screened

Mid-level Data Analyst specializing in cloud ETL, BI, and machine learning

Texas, 75223 | 5y exp

UnitedHealth Group | University of Texas at Arlington

“Data/ML practitioner with experience at UnitedHealth Group building a fraud claims detection solution combining structured claims data and unstructured notes, validated with compliance stakeholders to improve actionable accuracy. Also applied embeddings, vector databases, and fine-tuned language models in a Bank of America capstone to detect threats/anomalies in financial documents, with production-minded Python ETL workflows using Airflow.”

A/B Testing, Apache Airflow, Apache Spark, AWS Glue, AWS Lambda, Business Intelligence, +118 more


Surya Vamshi Sriperambudooru

Screened

Mid-level AI Engineer specializing in healthcare claims analytics and RAG copilots

Remote, US | 4y exp

Codoxo | University of Texas at Dallas

“Built a production "appeals co-pilot" for a healthcare claims appeals team, combining an XGBoost/logistic ranking model with a Python/LangChain RAG stack (FAISS + Mistral 7B) to surface high-probability appeal wins and speed policy-grounded drafting. Emphasizes reliability and trust: hybrid retrieval with metadata routing, citation/eval scripts, guardrails, and an explainability layer that non-technical stakeholders could understand and override.”

A/B Testing, Amazon EC2, Apache Airflow, Apache Kafka, AWS, Confluence, +118 more


Prasanth Sai

Screened

Mid-level Data Engineer specializing in cloud lakehouse/warehouse pipelines

4y exp

Wells Fargo | Christian Brothers University

“Data engineer with HCA Healthcare experience building and operating end-to-end AWS-based pipelines for clinical and operational reporting (50–100 GB/day), serving curated data into Redshift/Snowflake for Power BI/Tableau. Emphasizes production reliability (Airflow SLAs/retries/alerting, logging/observability) and strong data quality controls (reconciliations, schema/null/duplicate checks), and has shipped versioned REST APIs to expose warehouse data to downstream systems.”

Amazon EC2, Amazon EKS, Amazon Kinesis, Amazon Redshift, Amazon S3, Ansible, +98 more


