Vetted Data Engineers

Pre-screened and vetted.

Rahul Vemuri

Screened

Mid-level Data Engineer specializing in AI/ML, RAG systems, and cloud data pipelines

Malvern, PA | 4y exp
PQ Corporation | Penn State Great Valley School of Graduate Professional Studies

Built a production lead-generation system in which AI agents research the web for relevant leads, with RAG-based contact enrichment and shortlisting aligned to existing CRM data, so sales reps can focus more on selling. Also has hands-on AWS data orchestration experience (Glue, Step Functions) moving raw data into Redshift, and evaluates agent performance with human-in-the-loop review plus BLEU and perplexity metrics.

Ayushi Patel

Screened

Mid-level Software Engineer specializing in cloud data platforms and serverless ETL

Redmond, WA | 6y exp
HCLTech | Illinois Institute of Technology

Data/ML engineer from HCLTech who modernized enterprise data by linking fragmented financial and supply-chain data across SAP/SQL Server/Snowflake using NLP entity linking and embeddings (FAISS). Delivered measurable impact including ~40% reduction in manual error-log triage and entity-linking accuracy improvements from ~86% to ~93%, with results surfaced in Power BI for real-time analytics.

SM

Mid-level Full-Stack Engineer specializing in cloud-native FinTech analytics

McKinney, TX | 5y exp
Martingale Solution Group | University of Texas at Dallas

Full-stack/ML-leaning engineer who has shipped production-grade real-time analytics and an internal AI support assistant using RAG over enterprise documentation. Demonstrates strong systems thinking across scalability, reliability, observability, and LLM safety/evaluation (thresholded retrieval, RBAC, response validation, regression-gated evals), with concrete iteration based on performance metrics and user feedback.

JJ

Mid-level Data Engineer specializing in cloud data platforms and real-time pipelines

Denton, TX | 5y exp
Real Dynamics | University of North Texas

Data engineer who has owned production pipelines end-to-end, from Kafka/Airflow ingestion through SQL/Python validation and dbt transformations into Redshift/BI. Also built and operated a large-scale distributed web scraping platform (50–100 sites daily, ~5–10M records/day) with Kubernetes, Kafka queues, robust retries/DLQ, anti-bot measures, and backfill-safe raw HTML storage.

Vikram Sandigaru

Mid-level AI Engineer specializing in AI agents, RAG pipelines, and LLM evaluation

Boston, US | 3y exp
FounderWay | Northeastern University

Built and shipped production LLM systems at Founderbay, including a low-latency voice agent and a graph-based multi-agent research assistant. Strong focus on reliability in real workflows (hybrid SERP + full-site scraping RAG, grounding guardrails, validation checkpoints, and transcript-driven evaluation), plus performance tuning with async FastAPI, Redis caching, and containerization. Also partnered with a non-technical ops lead to automate post-call follow-ups via call summarization, field extraction, and tool-triggered actions.

MP

Mid-level Data Engineer specializing in FinTech data platforms

California, USA | 4y exp
Alloy | University of Massachusetts Dartmouth

Backend-focused engineer with experience at Ramp, Easebuzz, and George Mason University, spanning data pipelines, workflow automation, and production reliability. Stands out for quantifiable performance gains, strong debugging instincts in distributed job systems, and translating ambiguous finance operations processes into measurable automation outcomes.

Yang MA

Screened

Junior Backend Software Engineer specializing in search, data systems, and LLM applications

New York, NY | 3y exp
Bevel Health | University of Pittsburgh

Built and deployed a full-stack web product for international football fans visiting the U.S. for FIFA, owning everything from crawling and aggregating event data to frontend, backend, deployment, and maintenance. Particularly strong in data-heavy product work, using LLMs, Google Maps API, and SQL/RPC patterns to improve data quality, speed implementation, and support a polished user experience.

KV

Mid-level Software & ML Engineer specializing in agentic LLM systems and ML infrastructure

Remote | 4y exp
Cloud Systems LLC | Virginia Tech

Built and deployed an LLM-to-SQL automation system in a closed/internal environment, using a retriever–reranker–validator architecture on Kubernetes with strong security controls (semantic + rule-based validation and RBAC), achieving 99% uptime and cutting manual query time ~40%. Also worked on genomic sequence classification and semantic search workflows, orchestrating data prep with Airflow, tracking/deploying with MLflow, and optimizing distributed multi-GPU training on a university Kubernetes cluster.

Hetvi Patel

Screened

Mid-level Software/Data Engineer specializing in cloud ETL pipelines and data infrastructure

New Jersey | 5y exp
Plore AI | Avila University

Backend/data engineer who built a production analytics data service (Python/FastAPI on AWS/Postgres with PySpark ETL) handling millions of records per day and drove major latency improvements (10–15s to <2s) via indexing, Redis caching, and shifting aggregations into ETL. Also shipped an LLM-based natural-language-to-SQL assistant end-to-end with strong guardrails (schema restrictions, read-only validation, RBAC, masking) and designed a multi-step agent workflow with verification and fallback logic.

HC

Mid-level Data Engineer specializing in cloud data platforms and ETL automation

Atlanta, GA | 4y exp
Blue Diamond Technologies | University of Texas at Arlington

Data engineer who has owned high-volume production pipelines end-to-end (200–300 GB/day) on AWS, implementing strong data quality/observability and achieving 99.9% reliability while cutting data issues ~33%. Also built a large-scale external data collection system ingesting millions of records/day with anti-bot/rate-limit handling and backfill tooling, and shipped a versioned REST service exposing curated Snowflake data to downstream teams.

Vignesh Samarasam

Junior Data Engineer specializing in IoT analytics and AWS data pipelines

Bangalore, India | 2y exp
HiPer Automotive | Arizona State University
Pooja Dolas

Intern AI/Software Engineer specializing in backend systems, cloud infrastructure, and GenAI

San Francisco, CA | 3y exp
QOVAI | University of San Francisco
Kiran Ranganalli

Junior Data Engineer specializing in cloud data pipelines and warehousing

San Francisco, CA | 2y exp
San Francisco State University | San Francisco State University
MM

Mid-level Machine Learning Engineer specializing in production ML, MLOps, and LLM retrieval systems

Dallas, TX | 6y exp
Nashville Analytics | University of Colorado Denver
RK

Mid-level AI Software Engineer specializing in ML services and agentic workflows

Austin, TX | 6y exp
Karncy Ventures Inc | University of Texas at Dallas
MB

Mid-level Data Engineer specializing in cloud ETL and big data pipelines

TX, USA | 4y exp
DXC Technology | University of Texas at Dallas
SM

Senior Data Engineer specializing in Azure/AWS lakehouse and real-time analytics

Plano, TX | 7y exp
IDWTeam | Texas A&M University
SY

Mid-level Data Engineer specializing in Microsoft Fabric and Azure Lakehouse platforms

3y exp
Algobrainz LLC | Northwest Missouri State University
VB

Senior Data Scientist specializing in recommendation systems and forecasting

Virginia, USA | 18y exp
IT Excel | Wayne State University
AR

Mid-level Forward Deployed Engineer specializing in LLM agents and RAG/CAG systems

San Francisco, CA | 4y exp
MoolAI | San Francisco State University
PJ

Mid-level Software Engineer specializing in full-stack web and data engineering

San Jose, CA | 4y exp
MyAscend AI | Arizona State University
SR

Junior Data Scientist/Data Engineer specializing in ML, analytics, and cloud data pipelines

Monroe, NJ | 3y exp
SPR Software Systems | NJIT
RC

Mid-level Data Engineer specializing in cloud data pipelines and analytics platforms

Cincinnati, OH | 5y exp
Medpace | Northeastern University
BP

Junior Data Analyst specializing in BI, analytics, and energy data

Boston, MA | 3y exp
TrueLight | Worcester Polytechnic Institute