Vetted Data Pipelines Professionals

Pre-screened and vetted.

Data Pipelines, Python, Docker, SQL, AWS, CI/CD

Sahithi S

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and Generative AI

Texas, USA | 6y exp

NVIDIA | Kennesaw State University

“Built and deployed a production generative AI chatbot at NVIDIA using LangChain + GPT-3 integrated with internal data sources, cutting response time nearly in half and improving CSAT by ~12 points. Also delivered LLM-driven QA tools by fine-tuning Hugging Face transformer models and deploying via an AWS-based pipeline (Lambda/Glue/S3) with orchestration (Airflow/Step Functions), CI/CD, Kubernetes, and monitoring (MLflow/Splunk/Power BI).”

Python, SQL, Java, Spring Boot, FastAPI, Flask (+108 more)

Kunj Golwala

Screened

Junior Robotics Perception Engineer specializing in autonomous navigation and robot learning

College Park, MD | 2y exp

GAMMA Lab | University of Maryland, College Park

“Robotics software/perception engineer with production AMR experience at Symbotic, building a real-time SKU case re-identification pipeline used in high-volume Walmart/Target warehouse operations. Strong in ROS2 + Docker deployments on Jetson (TensorRT quantization) and system-level performance debugging, including cutting inference latency from ~13s to ~2s through architecture changes. Also has lab experience integrating SLAM/MPPI/behavior trees for rule-compliant navigation and distributed perception-to-UR5e manipulation systems (MoveIt/ros_control) with multi-camera sensing and 3D reconstruction.”

Python, C++, MATLAB, R, TypeScript, Reinforcement Learning (+127 more)

Leela Tikkisetty

Screened

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA | 5y exp

City and County of San Francisco | San Francisco State University

“Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).”

A/B Testing, Agile, Amazon Bedrock, Amazon EKS, Amazon Redshift, Authentication (+198 more)

Yuqi Lei

Screened

Mid-level Software Engineer specializing in financial data platforms and quantitative research tooling

New York City, NY | 3y exp

Bloomberg | Washington University in St. Louis

“Owned and built Bloomberg’s end-to-end bitemporal dividend & dividend-forecast data platform powering BQL for 400k+ terminal users. Architected real-time Kafka ingestion (5k–10k msgs/sec) across 100k+ tickers with strong correctness guarantees (PIT/bitemporal time-travel, immutable history to avoid look-ahead bias) and achieved sub-100ms p95 query latency through indexing and caching, deployed with Kubernetes + DLQ and robust monitoring.”

Python, SQL, Java, JavaScript, C++, Pandas (+60 more)

Jisvitha Athaluri

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps

McKinney, TX | 6y exp

Globe Life | Texas A&M University

“Built a production LLM/RAG-based ‘model excellence scoring’ system at Uber to automatically evaluate hundreds of ML models, standardizing quality assessment and cutting evaluation time from days to minutes on GCP. Also delivered an NLP document classification solution for insurance claims at Globe Life, partnering closely with compliance/operations and improving routing accuracy from ~85% with manual routing to 93% with the model.”

A/B Testing, Apache Spark, BERT, ChromaDB, Data Engineering, Data Pipelines (+90 more)

Lalithya Manasa Patri

Screened

Senior Data Engineer specializing in cloud ETL and real-time streaming pipelines

Austin, TX | 5y exp

eBay | Texas Tech University

“Data engineer with eBay experience owning end-to-end pipelines for real-time order and user behavior analytics at 10M+ records/day. Strong in PySpark/SQL transformations, Airflow reliability patterns, and production observability (CloudWatch), with measurable outcomes including improved data quality and 30–40% query performance gains. Also built Python data APIs for analytics/ML consumers with versioning and backward compatibility.”

Python, SQL, Java, Scala, R, Apache Spark (+97 more)

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS | 10y exp

Turing | Mississippi State University

“Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.”

Python, R, SQL, Pandas, NumPy, Scikit-learn (+132 more)

Aaron Li

Screened

Junior AI/ML Engineer specializing in production LLM systems and RAG

Atlanta, GA | 2y exp

Georgia Institute of Technology | University of Chicago

“LLM/document AI engineer who owned a production-grade contract extraction pipeline at CORAMA.AI, ingesting PDFs and dynamic JavaScript sites from 1,000+ government sources. Built a hybrid deterministic+LLM system with two-phase prompting, Pydantic guardrails, confidence scoring, and human-in-the-loop review—cutting error rates from ~35% to <5% and processing 50k+ documents at ~95% accuracy. Also built clinician-in-the-loop orchestration in research, reducing manual labeling time from 3–4 hours to ~50 minutes.”

Machine Learning, LLM Integration, Large Language Models (LLMs), OpenAI API, Prompt Engineering, Web Scraping (+93 more)

Akanksha Agrawal

Screened

Mid-level Full-Stack Software Engineer specializing in event-driven data platforms

Bangalore, India | 5y exp

SAP | University of Illinois Urbana-Champaign

“Backend engineer with SAP experience modernizing a legacy Flask/PostgreSQL product master data platform into a modular, stateless, containerized service with Kafka-based background processing and improved observability. Also has hands-on academic/side-project experience operationalizing ML (NLP retrieval with TF-IDF/BERT via FastAPI and CV lane-edge detection inference APIs using PyTorch).”

Agile, Angular, Apache Cassandra, API Design, AWS, AWS Lambda (+110 more)

Vismay Patel

Screened

Senior AI & Machine Learning Engineer specializing in NLP, GenAI, and MLOps

Berkeley, CA | 7y exp

Kaiser Permanente | San Francisco State University

“ML/GenAI practitioner with healthcare domain depth who built and deployed a production cervical-cancer EMR classification system using a hybrid rules + medical BERT approach, optimized for high recall under severe class imbalance and PHI constraints. Experienced running end-to-end production ML/LLM pipelines with Apache Airflow (validation, promotion/rollback, monitoring, retraining) and partnering closely with clinicians to calibrate thresholds and implement human-in-the-loop review.”

Python, SQL, Java, Go, JavaScript, REST APIs (+121 more)

Hari Kiran Reddy Rommala

Screened

Mid-level Full-Stack Software Engineer specializing in cloud and data platforms

Boston, MA | 5y exp

Northeastern University | Penn State University

“Full-stack engineer with experience spanning Amazon IMDb and Northeastern’s NeuroJSON portal, combining consumer product work with complex scientific data applications. Built IMDb’s streaming providers feature—described as the company’s most impactful feature of 2023—and has hands-on experience with React/Angular, GraphQL, AWS, Python services, and production monitoring.”

React, TypeScript, SQL, PostgreSQL, Docker, Kubernetes (+283 more)

Sai Anuhya Bandi

Screened

Mid-level Full-Stack Engineer specializing in AI-driven data platforms

Santa Barbara, CA | 5y exp

Uber | University of Alabama at Birmingham

“Full-stack engineer with 5+ years of experience who built real-time data visualization and analytics systems at Uber, spanning React/TypeScript frontends, Node/GraphQL services, Kafka pipelines, and PostgreSQL. Particularly compelling for teams needing a hands-on builder who can turn ambiguous customer needs into scalable products, and who has also applied RAG with LangChain/OpenAI over 1.8M support files to surface actionable insights.”

TypeScript, JavaScript, Python, Java, SQL, React (+232 more)

Jonas Shuai

Screened

Mid-level Full-Stack Software Engineer specializing in cloud, microservices, and React/Java

Menlo Park, CA | 3y exp

Mainspring Energy | University of San Francisco

“Software engineer with experience at PayPal and JPMC building large-scale onboarding/account setup systems using React/TypeScript with Spring Boot/Node microservices and Kafka. Also built an Ignition-based SCADA monitoring tool at Mainspring Energy that became the default for manufacturing/test engineers by aggregating real-time telemetry and historical test data.”

Agile, Argo CD, AWS, AWS Lambda, Bootstrap, Blue/green deployment (+118 more)

Sneha Patil

Screened

Mid-level Financial Analyst specializing in FP&A, forecasting, and regulatory reporting

New York, NY | 5y exp

JPMorgan Chase | University of Texas at Arlington

“Backend-focused software engineer (4+ years) across e-commerce, banking, and healthcare who owned mission-critical checkout/order management end-to-end and improved peak-traffic success rates via resiliency patterns (timeouts/retries/caching) and data-driven iteration. Also built and shipped real-time operational dashboards (React/TypeScript + Spring Boot) using WebSockets and event-stream integrations, with strong experience in Kafka/RabbitMQ-style messaging at scale.”

Financial Modeling, Forecasting, Budgeting, Risk Management, Predictive Analytics, Data Analysis (+81 more)

John Joji Melel

Screened

Intern Generative AI Engineer specializing in RAG and multi-agent systems

Chicago, IL | 2y exp

NeuraFlash | University of Chicago

“Built and deployed a production RAG-based multi-agent chatbot during an internship to help consultants answer client questions and guide users through new IT systems with step-by-step instructions. Demonstrates hands-on experience with LangGraph/LangChain/Google ADK, unstructured document parsing and chunking for RAG, and a reliability-first approach to agent workflows (metrics, fallbacks, human-in-the-loop, guardrails).”

Python, SQL, R, C++, Kubernetes, Docker (+87 more)

Yeshwanth Pulapa

Screened

Mid-level AI/ML Engineer specializing in Databricks, MLOps, and real-time fraud detection

The Colony, TX | 4y exp

Databricks | University of North Texas

“ML/LLM engineer building production, real-time fraud detection for financial transactions using a two-tier architecture (fast ML + GPT) to deliver both low-latency decisions and analyst-friendly risk explanations. Experienced orchestrating end-to-end retraining, drift monitoring, and automated model promotion with Databricks Jobs/Workflows and MLflow, and partnering closely with fraud analysts to tune alerts, thresholds, and dashboards.”

A/B Testing, Apache Airflow, Apache Kafka, Apache Spark, AWS, AWS Lambda (+93 more)

Zufeshan Imran

Screened

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA | 10y exp

SOTER AI | UC San Diego

“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”

Machine Learning, Deep Learning, Generative AI, Transformers, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) (+120 more)

Rohit Kumar

Screened

Mid-level Data Engineer specializing in large-scale analytics platforms

San Jose, CA | 5y exp

Nutanix | USC

“Data/Backend engineer with experience at Naukri building large-scale analytics products over a 130M+ user base, including Spark/Airflow pipelines and Kafka-based clickstream validation with Confluent Schema Registry. Also built an audience segmentation backend (Athena/S3 + Spring Boot APIs) for non-technical internal teams and recently shipped a GenAI customer data audit system (FastAPI/Postgres/Llama) that cut sales-planning validation from ~3 months to ~1 week.”

Algorithms, Amazon S3, Apache Hadoop, Apache Hive, Apache Kafka, Apache Spark (+95 more)

Pratima Singh

Screened

Senior Full-Stack Software Engineer specializing in FinTech, cloud microservices, and blockchain

Tempe, AZ | 10y exp

Arizona State University | Arizona State University

“Python/ML engineer with strong DevOps depth: built an end-to-end regime-aware stock prediction system (custom fine-tuned FinBERT sentiment + technical/macro features) delivering a 12% accuracy lift. Also implemented Kubernetes/Helm + Jenkins/GitHub Actions pipelines (including GitOps-style workflows for multi-cloud Hyperledger Besu) and improved deployment speed/stability by ~50% while addressing race conditions and image drift.”

Agile, API Development, Authentication, AWS, AWS Lambda, C++ (+158 more)

Alex Vo

Screened

Staff Backend Software Engineer specializing in telemetry pipelines and observability

San Jose, CA | 3y exp

VMware | UC Irvine

“Backend engineer from VMware focused on proprietary enterprise systems (monitoring tools, data pipelines, and APIs). Drove a ClickHouse migration POC (local to remote host) using a dual-write/cutover approach and source-level debugging across Node/driver differences during a Node 12→20 upgrade, and delivered measurable performance gains (~20% CPU/memory improvement) through batching and streaming ingestion.”

Backend Development, Node.js, TypeScript, SQL, REST APIs, API Design (+60 more)

Sai Dinesh Pusapati

Screened

Senior AI/ML Engineer specializing in GenAI agents and LLM workflows

San Francisco, CA | 6y exp

Scale AI | Belhaven University

“LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.”

Python, TypeScript, Java, C#, JavaScript, R (+103 more)

Srinivasa Pavan Kancharla

Screened

Mid-level Software Engineer specializing in machine learning and full-stack AI systems

Seattle, WA | 4y exp

SakuraMedTech | University of Washington

“Built production-grade Python systems in a medical/imaging context, including an image feature extraction and survival prediction microservice with strong testing, validation, and observability practices. Also developed a Playwright-based autonomous job application agent that handled dynamic UIs and anti-bot challenges with stealth tooling, proxies, and human-in-the-loop escalation.”

Python, JavaScript, Java, C++, Kubernetes, AWS (+108 more)

Akhil Kunala

Screened

Mid-level Software Engineer specializing in backend systems and cloud-native FinTech

Seattle, WA | 5y exp

Amazon | University of North Texas

“Amazon engineer with 5+ years of experience who built an AI-assisted log investigation and triage workflow that cut debugging time by about 30% during on-call incidents. Combines observability tooling like CloudWatch and Splunk with Python, prompt engineering, and RAG-based diagnostics, and has practical experience orchestrating agentic AI workflows with a strong human-in-the-loop reliability focus.”

Java, Python, TypeScript, JavaScript, SQL, Spring Boot (+101 more)

Sanjay Santhanam

Screened

Mid-level AI Software Engineer specializing in LLMs and FinTech data systems

San Jose, CA | 4y exp

Scry AI | Westcliff University

“Backend/AI systems engineer focused on productionizing agentic document-processing workflows for large financial PDFs. They describe owning deployments end-to-end, combining Python, Redis, LLM function calling, RAG/ReAct-style orchestration, and strong reliability practices to deliver 80% faster processing, reduce parsing errors from 12% to ~1%, and sustain 99.9% uptime in high-concurrency environments.”

Python, JavaScript, SQL, Java, Large Language Models, Retrieval-Augmented Generation (+168 more)
