Vetted Data Pipelines Professionals

Pre-screened and vetted.

Sahithi S

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and Generative AI

Texas, USA · 6y exp
NVIDIA · Kennesaw State University

Built and deployed a production generative AI chatbot at NVIDIA using LangChain + GPT-3 integrated with internal data sources, cutting response time nearly in half and improving CSAT by ~12 points. Also delivered LLM-driven QA tools by fine-tuning Hugging Face transformer models and deploying via an AWS-based pipeline (Lambda/Glue/S3) with orchestration (Airflow/Step Functions), CI/CD, Kubernetes, and monitoring (MLflow/Splunk/Power BI).

Kunj Golwala

Screened

Junior Robotics Perception Engineer specializing in autonomous navigation and robot learning

College Park, MD · 2y exp
GAMMA Lab · University of Maryland, College Park

Robotics software/perception engineer with production AMR experience at Symbotic, building a real-time SKU case re-identification pipeline used in high-volume Walmart/Target warehouse operations. Strong in ROS2 + Docker deployments on Jetson (TensorRT quantization) and system-level performance debugging, including cutting inference latency from ~13s to ~2s through architecture changes. Also has lab experience integrating SLAM/MPPI/behavior trees for rule-compliant navigation and distributed perception-to-UR5e manipulation systems (MoveIt/ros_control) with multi-camera sensing and 3D reconstruction.

LT

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA · 5y exp
City and County of San Francisco · San Francisco State University

Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).

Yuqi Lei

Screened

Mid-level Software Engineer specializing in financial data platforms and quantitative research tooling

New York City, NY · 3y exp
Bloomberg · Washington University in St. Louis

Owned and built Bloomberg’s end-to-end bitemporal dividend & dividend-forecast data platform powering BQL for 400k+ terminal users. Architected real-time Kafka ingestion (5k–10k msgs/sec) across 100k+ tickers with strong correctness guarantees (PIT/bitemporal time-travel, immutable history to avoid look-ahead bias) and achieved sub-100ms p95 query latency through indexing and caching, deployed with Kubernetes + DLQ and robust monitoring.

JA

Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps

McKinney, TX · 6y exp
Globe Life · Texas A&M University

Built a production LLM/RAG-based “model excellence scoring” system at Uber to automatically evaluate hundreds of ML models, standardizing quality assessment and cutting evaluation time from days to minutes on GCP. Also delivered an NLP document classification solution for insurance claims at Globe Life, partnering closely with compliance/operations and improving routing accuracy from ~85% manual to 93% with the model.

LM

Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines

Austin, TX · 5y exp
eBay · Texas Tech University

Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS · 10y exp
Turing · Mississippi State University

Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.

Aaron Li

Screened

Junior AI/ML Engineer specializing in production LLM systems and RAG

Atlanta, GA · 2y exp
Georgia Institute of Technology · University of Chicago

LLM/document AI engineer who owned a production-grade contract extraction pipeline at CORAMA.AI, ingesting PDFs and dynamic JavaScript sites from 1,000+ government sources. Built a hybrid deterministic+LLM system with two-phase prompting, Pydantic guardrails, confidence scoring, and human-in-the-loop review—cutting error rates from ~35% to <5% and processing 50k+ documents at ~95% accuracy. Also built clinician-in-the-loop orchestration in research, reducing manual labeling time from 3–4 hours to ~50 minutes.

Akanksha Agrawal

Mid-Level Full-Stack Software Engineer specializing in event-driven data platforms

Bangalore, India · 5y exp
SAP · University of Illinois Urbana-Champaign

Backend engineer with SAP experience modernizing a legacy Flask/PostgreSQL product master data platform into a modular, stateless, containerized service with Kafka-based background processing and improved observability. Also has hands-on academic/side-project experience operationalizing ML (NLP retrieval with TF-IDF/BERT via FastAPI and CV lane-edge detection inference APIs using PyTorch).

Vismay Patel

Screened

Senior AI & Machine Learning Engineer specializing in NLP, GenAI, and MLOps

Berkeley, CA · 7y exp
Kaiser Permanente · San Francisco State University

ML/GenAI practitioner with healthcare domain depth who built and deployed a production cervical-cancer EMR classification system using a hybrid rules + medical BERT approach, optimized for high recall under severe class imbalance and PHI constraints. Experienced running end-to-end production ML/LLM pipelines with Apache Airflow (validation, promotion/rollback, monitoring, retraining) and partnering closely with clinicians to calibrate thresholds and implement human-in-the-loop review.

HK

Mid-level Full-Stack Software Engineer specializing in cloud and data platforms

Boston, MA · 5y exp
Northeastern University · Penn State University

Full-stack engineer with experience spanning Amazon IMDb and Northeastern’s NeuroJSON portal, combining consumer product work with complex scientific data applications. Built IMDb’s streaming providers feature—described as the company’s most impactful feature of 2023—and has hands-on experience with React/Angular, GraphQL, AWS, Python services, and production monitoring.

SA

Mid-level Full-Stack Engineer specializing in AI-driven data platforms

Santa Barbara, CA · 5y exp
Uber · University of Alabama at Birmingham

Full-stack engineer with 5+ years of experience who built real-time data visualization and analytics systems at Uber, spanning React/TypeScript frontends, Node/GraphQL services, Kafka pipelines, and PostgreSQL. Particularly compelling for teams needing a hands-on builder who can turn ambiguous customer needs into scalable products, and who has also applied RAG with LangChain/OpenAI over 1.8M support files to surface actionable insights.

Jonas Shuai

Screened

Mid-level Full-Stack Software Engineer specializing in cloud, microservices, and React/Java

Menlo Park, CA · 3y exp
Mainspring Energy · University of San Francisco

Software engineer with experience at PayPal and JPMC building large-scale onboarding/account setup systems using React/TypeScript with Spring Boot/Node microservices and Kafka. Also built an Ignition-based SCADA monitoring tool at Mainspring Energy that became the default for manufacturing/test engineers by aggregating real-time telemetry and historical test data.

Sneha Patil

Screened

Mid-level Financial Analyst specializing in FP&A, forecasting, and regulatory reporting

New York, NY · 5y exp
JPMorgan Chase · University of Texas at Arlington

Backend-focused software engineer (4+ years) across e-commerce, banking, and healthcare who owned mission-critical checkout/order management end-to-end and improved peak-traffic success rates via resiliency patterns (timeouts/retries/caching) and data-driven iteration. Also built and shipped real-time operational dashboards (React/TypeScript + Spring Boot) using WebSockets and event-stream integrations, with strong experience in Kafka/RabbitMQ-style messaging at scale.

JJ

Intern Generative AI Engineer specializing in RAG and multi-agent systems

Chicago, IL · 2y exp
NeuraFlash · University of Chicago

Built and deployed a production RAG-based multi-agent chatbot during an internship to help consultants answer client questions and guide users through new IT systems with step-by-step instructions. Demonstrates hands-on experience with LangGraph/LangChain/Google ADK, unstructured document parsing and chunking for RAG, and a reliability-first approach to agent workflows (metrics, fallbacks, human-in-the-loop, guardrails).

YP

Mid-level AI/ML Engineer specializing in Databricks, MLOps, and real-time fraud detection

The Colony, TX · 4y exp
Databricks · University of North Texas

ML/LLM engineer building production, real-time fraud detection for financial transactions using a two-tier architecture (fast ML + GPT) to deliver both low-latency decisions and analyst-friendly risk explanations. Experienced orchestrating end-to-end retraining, drift monitoring, and automated model promotion with Databricks Jobs/Workflows and MLflow, and partnering closely with fraud analysts to tune alerts, thresholds, and dashboards.

ZI

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA · 10y exp
SOTER AI · UC San Diego

Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.

Rohit Kumar

Screened

Mid-level Data Engineer specializing in large-scale analytics platforms

San Jose, CA · 5y exp
Nutanix · USC

Data/Backend engineer with experience at Naukri building large-scale analytics products over a 130M+ user base, including Spark/Airflow pipelines and Kafka-based clickstream validation with Confluent Schema Registry. Also built an audience segmentation backend (Athena/S3 + Spring Boot APIs) for non-technical internal teams and recently shipped a GenAI customer data audit system (FastAPI/Postgres/Llama) that cut sales-planning validation from ~3 months to ~1 week.

Pratima Singh

Screened

Senior Full-Stack Software Engineer specializing in FinTech, cloud microservices, and blockchain

Tempe, AZ · 10y exp
Arizona State University · Arizona State University

Python/ML engineer with strong DevOps depth: built an end-to-end regime-aware stock prediction system (custom fine-tuned FinBERT sentiment + technical/macro features) delivering a 12% accuracy lift. Also implemented Kubernetes/Helm + Jenkins/GitHub Actions pipelines (including GitOps-style workflows for multi-cloud Hyperledger Besu) and improved deployment speed/stability by ~50% while addressing race conditions and image drift.

Alex Vo

Screened

Staff Backend Software Engineer specializing in telemetry pipelines and observability

San Jose, CA · 3y exp
VMware · UC Irvine

Backend engineer from VMware focused on proprietary enterprise systems (monitoring tools, data pipelines, and APIs). Drove a ClickHouse migration POC (local to remote host) using a dual-write/cutover approach and source-level debugging across Node/driver differences during a Node 12→20 upgrade, and delivered measurable performance gains (~20% CPU/memory improvement) through batching and streaming ingestion.

Sai Dinesh Pusapati

Senior AI/ML Engineer specializing in GenAI agents and LLM workflows

San Francisco, CA · 6y exp
Scale AI · Belhaven University

LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.

SP

Mid-level Software Engineer specializing in machine learning and full-stack AI systems

Seattle, WA · 4y exp
SakuraMedTech · University of Washington

Built production-grade Python systems in a medical/imaging context, including an image feature extraction and survival prediction microservice with strong testing, validation, and observability practices. Also developed a Playwright-based autonomous job application agent that handled dynamic UIs and anti-bot challenges with stealth tooling, proxies, and human-in-the-loop escalation.

Akhil Kunala

Screened

Mid-level Software Engineer specializing in backend systems and cloud-native FinTech

Seattle, WA · 5y exp
Amazon · University of North Texas

Amazon engineer with 5+ years of experience who built an AI-assisted log investigation and triage workflow that cut debugging time by about 30% during on-call incidents. Combines observability tooling like CloudWatch and Splunk with Python, prompt engineering, and RAG-based diagnostics, and has practical experience orchestrating agentic AI workflows with a strong human-in-the-loop reliability focus.

Sanjay Santhanam

Mid-level AI Software Engineer specializing in LLMs and FinTech data systems

San Jose, CA · 4y exp
Scry AI · Westcliff University

Backend/AI systems engineer focused on productionizing agentic document-processing workflows for large financial PDFs. They describe owning deployments end-to-end, combining Python, Redis, LLM function calling, RAG/ReAct-style orchestration, and strong reliability practices to deliver 80% faster processing, reduce parsing errors from 12% to ~1%, and sustain 99.9% uptime in high-concurrency environments.
