Vetted Data Pipelines Professionals

Pre-screened and vetted.

Naga Renuka Kandi

Junior Software Engineer specializing in cloud, full-stack development, and Generative AI

Remote, USA | 2y exp
Handshake AI Lab | Northeastern University

Built and shipped a production Chrome extension (Promptly) that lets users select text on any webpage and transform it in place (rewrite/shorten/translate) using on-device AI plus external LLMs. Implemented a custom lightweight orchestration layer for prompt chaining, context flow, and output validation, and tackled tricky browser Selection API issues to preserve formatting while keeping the UX simple and fast.

Sai Nekkanti

Screened

Mid-level Data Scientist / ML Engineer specializing in secure GenAI and financial compliance

Mount Laurel, NJ | 4y exp
MetLife | Rowan University

Built a production "sentinel insight engine" to tame information overload from millions of product reviews and support transcripts, combining Azure OpenAI (GPT-3.5) zero-shot classification with a fine-tuned T5 summarizer to generate weekly actionable product insights. Demonstrated strong MLOps/production engineering by adding drift monitoring with embedding-based detection, integrating REST with legacy SOAP/queue-based CRM via FastAPI middleware, and scaling reliably on Kubernetes with HPA.

Revanth Goli

Screened

Senior Data & Backend Engineer specializing in cloud data pipelines and LLM/RAG systems

Morrisville, NC | 6y exp
Syneos Health | University of Alabama at Birmingham

Data engineer with end-to-end ownership of large-scale retail and clinical data ingestion/processing on AWS, including real-time streaming and batch pipelines. Delivered measurable outcomes: 20M daily transactions processed, latency cut from 4 hours to 5 minutes, ~70% fewer failures, and 120+ pipelines running at 99.8% reliability with full audit compliance.

BK

Mid-level Data Engineer specializing in big data pipelines and real-time streaming

Dallas, TX | 6y exp
Johnson & Johnson | University of North Texas

Data engineer who has owned end-to-end production pipelines processing a few million records/day, using Python/Airflow/SQL/PySpark with Snowflake serving to BI (Power BI). Built resilient external web data collection systems (anti-bot, schema-change detection, backfills) and shipped versioned REST APIs for internal consumers, improving pipeline success rates to 99% through monitoring, retries, and idempotent design.

SV

Mid-Level Data Engineer specializing in cloud data platforms and governed analytics

5y exp
Optum | University of Central Missouri

Data engineer with Optum experience building end-to-end healthcare data pipelines for HL7/FHIR, processing millions of records daily across Kafka streaming and Databricks/Spark batch. Strong focus on data quality (schema enforcement/validations), reliability (Airflow monitoring/alerts), and analytics-ready serving in Snowflake powering Power BI/Tableau, with CI/CD via Git and Jenkins.

Wilson Harron

Screened

Director-level AI/ML & Computer Vision Engineer specializing in robotics and multimodal AI

Los Angeles, CA | 15y exp
silvr.ai | University of Guelph

Candidate is not currently pursuing entrepreneurship (no business plan and no capital raised) and is not familiar with the VC/accelerator landscape. They show pragmatic, problem-first thinking about evaluating startup ideas—prioritizing real customer pain points and the quality of the founding team—and are open to working for others rather than founding "at all costs."

Jiaji Chen

Screened

Junior Full-Stack Software Engineer specializing in AI-powered applications

Montebello, CA | 2y exp
Top Connect, Inc. | University of Michigan

Built and owns the full ProteinMenus AI pipeline end-to-end, spanning the iOS client, FastAPI backend, Gemini integration, Firestore, and Cloud Run deployment. Strongest signal is full-stack product ownership in an AI-driven consumer workflow, including monetization logic via an atomic credit system and architecture choices optimized for fast iteration after launch.

SK

Mid-level Full-Stack Python Developer specializing in cloud, data engineering, and AI/ML

Washington, USA | 4y exp
Fannie Mae | St. Francis College

Full-stack Python developer who actively integrates AI coding assistants into day-to-day engineering work, including code generation, debugging, testing, and documentation. Has also coordinated multi-agent workflows across backend, frontend, testing, and code review, showing an applied, productivity-focused approach to AI-enabled software delivery.

LV

Junior Machine Learning Engineer specializing in LLMs and applied AI

Boston, MA | 2y exp
Wave Life Sciences | Northeastern University

AI/full-stack engineer with experience spanning startup product building at Twinly, enterprise analytics at Zoho, and high-stakes life sciences ML at Wave Life Sciences. Stands out for combining React/TypeScript + FastAPI product execution with rigorous AI evaluation, retrieval optimization, and human-in-the-loop design, delivering measurable outcomes like 75% fewer analytics requests, 20% fewer failed experiments, and MVP delivery 3 weeks early.

VB

Entry Data Scientist specializing in data engineering and automotive analytics

Bangalore, India | 1y exp
Tata Elxsi | University of Cincinnati

Frontend-focused candidate with hands-on experience building React and TypeScript dashboards for searching, filtering, and analyzing large datasets in real time. Demonstrates practical performance tuning skills using React DevTools, memoization, debouncing, and pagination, and has also built a Mapbox-based location data dashboard with interactive markers and popups.

SR

Executive technology leader specializing in healthcare SaaS and regulated cloud platforms

Raleigh, NC | 25y exp
Clinisys | Sathaye College

Engineering/technology leader who stays hands-on while driving executive-level roadmap execution, with deep experience modernizing cloud-based LIMS/LIS platforms and building AI-driven lab analytics. Led a monolith-to-microservices cloud migration with containerization and CI/CD, and delivered a reported 30% reduction in lab turnaround time while strengthening compliance.

Nithyashree Raghunathan

Mid-level Full-Stack AI Engineer specializing in agentic systems

Santa Clara, CA | 5y exp
Meta | Penn State Great Valley

QA/data pipeline engineer with hands-on AI product building experience, spanning enterprise AWS migration testing for Belgian postal services and personal multi-agent systems in fintech and recruiting. Stands out for combining rigorous validation and production stability work with modern LLM orchestration, guardrails, and messy-document normalization workflows.

Manali Shetye

Screened

Mid-level Software Engineer specializing in AI platforms and enterprise full-stack systems

Fremont, CA | 5y exp
Trend Micro | University of Texas at Arlington

Full-stack product engineer who has built both operational systems and enterprise AI copilots in production. They owned an AI-powered inventory platform end-to-end, driving a 45% drop in stock issues, and also shipped a Microsoft Teams-based HR/IT copilot using RAG and workflow automation that reduced repetitive support queries by roughly 30%.

Murali Marupudi

Mid-level Backend Engineer specializing in Python microservices and scalable cloud systems

Jersey City, NJ | 4y exp
BlackRock | Pace University

Backend engineer focused on high-throughput Python/Flask systems on AWS, with strong scaling and performance tuning experience (e.g., PostgreSQL join reduced from ~3s to <200ms; background aggregation cut from 10 minutes to <90 seconds with 8x throughput). Has also integrated ML model serving into production APIs (churn prediction) using Celery/Redis batching and AWS Lambda/S3, and designed secure multi-tenant architectures with PostgreSQL schema isolation and row-level security.

IP

Intern Data Scientist specializing in machine learning and predictive modeling

Irvine, CA | 2y exp
Trilemma Foundation | UC Irvine

Has built data, backend, analytics, and visualization-heavy applications, including a nonprofit financial forecasting app, large-scale insurance model analysis at Mercury Insurance, and a publicly deployed soccer analytics dashboard. Stands out for combining machine learning, large-dataset SQL work, and practical production improvements like cutting dashboard load times to under two seconds and refactoring codebases for smoother team handoff.

MK

Junior Data Engineer / Analyst specializing in AI/ML data infrastructure

Houston, Texas | 1y exp
CallAgent AI | University of Texas at Austin

Built and deployed a compliance-sensitive LLM pipeline that extracts rebate logic from hospital–supplier medical contracts, using multi-layer redaction (regex/NER/dictionary), schema-validated structured outputs, and secure placeholder reinsertion. Hosted models on Amazon Bedrock to avoid retraining on sensitive data and improved both accuracy and cost by splitting the workflow into a lightweight section classifier plus a fine-tuned extraction model, orchestrated with LangChain and evaluated via layered, test-driven agent assessments.

KP

Senior AI Engineer specializing in Generative AI and RAG applications

8y exp
Keurig Dr Pepper | George Mason University

AI engineer who has shipped production LLM systems across customer service and marketing use cases—building a RAG app on Azure OpenAI and speeding retrieval with Redis caching tied to Okta sessions. Also implemented a LangGraph multi-agent workflow that pulls image context from Figma to generate structured HTML marketing emails, adding a verification agent to improve image-selection accuracy while optimizing solution cost for business stakeholders.

JL

Junior Machine Learning Engineer specializing in LLMs, NLP, and computer vision

Bengaluru, Karnataka | 2y exp
PwC | Arizona State University

Built a production multi-agent pharmaceutical intelligence system covering US oncology (breast cancer) conferences and news, automating MSL-style information gathering and summarization for pharma and healthcare stakeholders. Uses CrewAI + LangChain orchestration, custom scraping across ~15 pharma newsrooms, and a grounding-score evaluation approach (sentence transformers/cosine similarity) to mitigate hallucinations.

NM

Mid-level Data Scientist/ML Engineer specializing in healthcare AI and MLOps

USA | 4y exp
CVS Health | University at Buffalo

Designed and deployed an enterprise LLM-powered clinical/pharmacy policy knowledge assistant at CVS Health, replacing manual searches across PDFs/Word/SharePoint with a HIPAA-compliant RAG system. Built end-to-end ingestion and orchestration (Airflow + Azure ML/Data Lake + vector index) with PHI masking, versioned re-embedding, and production monitoring (Prometheus/Grafana), and partnered closely with clinicians/compliance to ensure policy-grounded, auditable answers.

NJ

Mid-level Data & AI Engineer specializing in healthcare data pipelines and MLOps

FL, USA | 4y exp
Humana | Florida State University

Built and deployed a production LLM-powered clinical note summarization system used by care managers to speed review of 5–20 page unstructured medical records. Implemented safety-focused validation (prompt constraints, rule-based and section-level checks, human-in-the-loop) to reduce hallucinations while maintaining low latency and meeting privacy/regulatory constraints, integrating via APIs into existing clinical tools.

Ramiz Qudsi

Screened

Principal Data Scientist & Software Engineer specializing in space mission data systems

Boston, MA | 13y exp
Boston University | University of Delaware

Space/heliophysics ML engineer who built a PyTorch GRU model to propagate solar wind from L1 to the magnetopause with probabilistic outputs for uncertainty quantification, achieving ~25% better CRPS than standard approaches. Also developed production-grade Python ETL and an open-source telemetry processing package for a mission (LEXI), using Docker and GitHub Actions CI/CD and iterating with scientist/engineer stakeholders.

Jason Meno

Screened

Senior Full-Stack Software Engineer specializing in digital health and AI

San Francisco, CA | 7y exp
Feeling Great | Purdue University

ML practitioner with hands-on experience in healthcare time-series modeling (CGM-based blood glucose prediction) including a novel ICA-based blind source separation approach and robust data-cleaning for noisy, missing sensor data. Also built an embeddings + LLM-powered podcast recommendation workflow using YouTube transcript scraping and Vellum AI document indexing, with a strong emphasis on production-grade engineering practices (TDD, monitoring) and realistic rolling validation for forecasting.

PM

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in Financial Services

Austin, TX | 5y exp
Charles Schwab | University of Central Missouri

ML/LLM engineer at Charles Schwab who built a production loan-advisor chatbot integrated with internal knowledge and loan-calculator APIs, adding strict numeric validation to prevent rate hallucinations and optimizing context to control costs. Also runs ~40 Airflow DAGs orchestrating retraining/ETL/drift monitoring with an automated Snowflake→SageMaker→auto-deploy pipeline, and uses rigorous testing plus canary rollouts tied to business metrics and compliance constraints.

NK

Senior Data Scientist / ML Engineer specializing in NLP, anomaly detection, and cloud ML platforms

Remote, CA | 10y exp
Emotionall | NMIMS University

ML/NLP practitioner who built customer-feedback topic modeling (NMF + TF-IDF) to diagnose chatbot-to-agent handovers and drove product/ops changes that reduced operational costs by 20%. Also developed LSTM-based intent recognition using Word2Vec/GloVe embeddings for semantic linking, and deployed an LSTM autoencoder for fraud anomaly detection that cut false positives by 25% while capturing 15% more fraud in A/B testing.
