Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Data Pipelines Professionals

Pre-screened and vetted.

Data Pipelines Python Docker SQL AWS CI/CD

Manish Vemula

Screened

Mid-level Machine Learning Engineer specializing in real-time pipelines and NLP/GenAI

TX, USA4y exp

DiscoverCentral Michigan University

“ML/MLOps practitioner from Discover Financial who built and deployed a real-time AI fraud detection platform (LSTM + VAE) on AWS SageMaker with Docker/FastAPI and Jenkins-driven CI/CD. Demonstrated measurable impact (30% accuracy lift, 25% fewer false alerts) and deep expertise in class-imbalance mitigation, drift monitoring, and orchestration (Airflow/Kubeflow), plus strong stakeholder adoption via Power BI dashboards for fraud/compliance teams.”

Agile Anomaly Detection API Integration AWS Lambda Azure Machine Learning CI/CD+101

View profile

Dimple Galla

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics

Lawrence, KS4y exp

PaycomUniversity of Kansas

“Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.”

A/B Testing Amazon EC2 Apache Kafka Apache Spark AWS AWS Glue+163

View profile

Tejaswini Waghmare

Screened

Senior Data Analytics & Data Science professional specializing in Financial Services

4y exp

InfosysGeorgia State University

“Worked on large financial analytics datasets combining complaint text, transaction logs, and demographics; built end-to-end NLP/ML pipelines (TF-IDF + Random Forest) and data integration in BigQuery with Tableau reporting, citing ~95–98% accuracy. Also implemented entity resolution with fuzzy matching and semantic linking using BERT sentence-transformer embeddings stored in FAISS, including fine-tuning on labeled pairs to improve search/linking relevance.”

SQL XML MySQL Python R BigQuery+109

View profile

Amie Gibson

Screened

Senior Geospatial Developer specializing in GIS automation, elevation/LiDAR, and AI-enabled apps

Sand Springs, OK27y exp

FEMAFlorida Institute of Technology

“Built and monetized an object-identification app end-to-end (FastAPI backend, HTML/JS frontend, SQLite→Postgres, auth, and an iOS wrapper via Capacitor/Xcode with Apple privacy/policy compliance). Also productionized an AI-native geospatial metadata/QA assistant using LLM+RAG plus deterministic Python validation, measuring impact via time-to-first-pass review and rework rate, and has experience modernizing legacy GIS workflows and delivering across USDA/FEMA-style teams with disciplined Jira-based execution.”

Agile API Integration AWS Bash C#C+++111

View profile

Pooja Miryala

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for banking and healthcare

Ohio, USA4y exp

Fifth Third BankYoungstown State University

“Deployed a real-time LLM-driven call center summarization and agent-assist platform at Fifth Third Bank, combining transformer models (BERT/GPT) with FastAPI inference on AKS and vector storage (ChromaDB/PostgreSQL). Emphasizes production-grade reliability (autoscaling, CI/CD, monitoring) and measurable evaluation (A/B testing), and translates model outputs into business-facing Power BI insights for call center leadership.”

A/B Testing Agile Amazon ECS Amazon SageMaker Amazon S3 Anomaly Detection+123

View profile

Ponugoti Sushma

Screened

Mid-level Machine Learning Engineer specializing in IoT, edge AI, and enterprise ML

Texas, USA5y exp

AllstateTexas A&M University-Corpus Christi

“Built and productionized an LLM/RAG question-answering service over technical documentation, focusing on retrieval quality (reranking + IR metrics), latency, and scaling. Experienced orchestrating end-to-end ETL/ML workflows with Airflow/Prefect/AWS Step Functions and improving reliability via parallelism, retries, and shadow testing. Also delivered an explainable healthcare risk-flagging classifier with a stakeholder-friendly dashboard for a non-technical program manager.”

Python C C++TensorFlow PyTorch Scikit-learn+134

View profile

Akhil Bharadwaj Mateti

Screened

Mid-level Software Engineer specializing in Data Science and Machine Learning

Arlington, Virginia4y exp

ElevateMeGeorge Washington University

“Robotics/AV perception engineer who built a semantic-segmentation road detection system and integrated it into a ROS-based real-time pipeline (ROS bag camera feed to live monitor) achieving ~12 FPS. Strong in practical deployment work: solved multi-library versioning issues (ROS/OpenCV/TensorFlow), containerized the stack with Docker, and optimized inference by shifting runtime to C++ for large latency gains on NVIDIA hardware.”

Python R SQL C C++HTML+69

View profile

Andrew Clayman

Screened

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote8y exp

AppstemUniversity of Southampton

“Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.”

Python C++SQL Docker Flask CI/CD+133

View profile

Meghana Nandivada

Screened

Junior Machine Learning Engineer specializing in production ML systems and MLOps

2y exp

TCSStevens Institute of Technology

“ML/AI engineer (TCS) who built and productionized a customer segmentation and personalized-offer recommendation pipeline end-to-end (data cleaning/feature engineering/clustering through Flask API deployment in Docker with monitoring). Emphasizes reliability and operational rigor via validation checks, periodic retraining, model/API versioning, and latency optimization, and has experience translating marketing KPIs into usable dashboards for non-technical teams.”

Python SQL Java Scala Machine Learning MLOps+99

View profile

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL13y exp

OptumUniversity of Texas at Dallas

“ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.”

Python R SQL MATLAB C C#+157

View profile

Serge Ahranovich

Screened

Executive CTO / Platform Architect specializing in IoT, telematics, and EV charging infrastructure

Los Angeles, CA20y exp

TimeTickBelarusian State University of Informatics and Radioelectronics

“Founder of TimeTick (timetick.io), an AI-powered diagnostics platform for IoT combining device simulation, automated testing, and real-time monitoring—initially focused on EV charger diagnostics. Former VP of Engineering with a track record of building IoT systems from scratch and applying AI to detect protocol-failure patterns that drive downtime; currently supporting existing customers and converting pilots (with leads like Siemens and ABB) into paid subscriptions.”

Agile API Design AWS CI/CD Cross-Functional Collaboration Data Pipelines+94

View profile

Patrick Seeman

Screened

Mid-level Data Scientist and Game Tech Leader specializing in ML, healthcare analytics, and Unity

Manila, Philippines5y exp

GridLock GamesJohn Carroll University

“Data scientist at Cleveland Clinic Taussig Cancer Institute who led a production automation to convert unstructured (and sometimes image-based) pathology reports into structured data for government reporting. Built an on-prem LangGraph + Ollama pipeline with OCR (Tesseract), spell-checking, confidence scoring, and human-audited guardrails to mitigate hallucinations and improve reliability under PHI constraints.”

Agile Analytics C#CSS Data Pipelines Data Science+48

View profile

Adrian Lawrence

Screened

Executive Product & Technology Leader specializing in AI, analytics, and regulated industries

Atlanta, GA14y exp

Vitalis VenturesGeorgia Tech

“Serial startup product/technology leader who previously exited a company to Green Street and has accelerator experience via Notre Dame’s IDEA Center. Now pursuing a commercial real estate analytics concept focused on deep demand analysis for better capital allocation, with a provisional patent filed and experience supporting VC funds as an operating partner on product vision and strategy.”

Product Strategy Product Development Product Management Go-to-Market Strategy Market Research Team Building+72

View profile

Abheesht Roy

Screened

Junior Software Engineer specializing in AI and distributed systems

San Francisco, CA2y exp

Agent-Techs AIArizona State University

“Built and shipped a production LLM-driven data harmonization/record-matching pipeline for pharmaceutical datasets, combining normalization, embeddings/vector search, and an LLM validation step. Emphasizes production reliability via guardrails, confidence thresholds, idempotent/retryable stages, and human-in-the-loop fallbacks, with monitoring focused on manual review and error rates to reduce false positives.”

Python Linux FAISS Vector Search Data Pipelines Auto-scaling+171

View profile

Bharath Simha Reddy Kothapeta

Screened

Full-Stack Software Engineer specializing in Java, React, and AWS

Plano, TX3y exp

Progress SolutionsNorthwest Missouri State University

“Backend-focused Python engineer who builds modular Flask services on AWS and specializes in performance/scalability work across data-heavy APIs. Has concrete wins in query optimization (1.5s to <200ms) and high-throughput async processing (Celery+Redis, ~40% throughput gain), plus experience serving scikit-learn text classification models via containerized REST services and designing multi-tenant data isolation strategies.”

Agile Amazon CloudWatch Amazon EC2 Amazon ECS Amazon Redshift Amazon S3+117

View profile

Shashank Bijarapu

Screened

Mid-level AI/ML & Data Engineer specializing in MLOps and cloud data pipelines

Remote, USA4y exp

MerkleUniversity of North Carolina at Charlotte

“AI/ML engineer (Merkle) with hands-on experience deploying RAG-based LLM applications and real-time recommendation engines into production. Strong in cloud/on-prem architectures, GPU autoscaling, caching, and network optimization—delivered measurable latency reductions (40–70%) and improved retrieval relevance by systematically benchmarking chunking/embedding configurations and validating pipelines via CI/CD.”

Python SQL R Java Bash Scikit-learn+103

View profile

Pandraju Gamanapriya

Screened

Mid-level Data Scientist specializing in healthcare ML and GenAI

San Marcos, TX4y exp

UnitedHealth GroupTexas State University

“Healthcare data/NLP practitioner with experience at UnitedHealthcare building production ML systems that connect unstructured call center transcripts and medical notes to structured claims data. Has delivered measurable impact (25% classification accuracy lift; ~30% relevance improvement) using classical NLP, embeddings (Sentence-BERT + FAISS), and AWS SageMaker deployments with robust validation and drift monitoring.”

Agile Anomaly Detection API Integration AWS AWS Glue Bash+106

View profile

Shiva Maddoju

Screened

Mid-level Full-Stack Java Engineer specializing in cloud-native, event-driven systems

Chicago, IL4y exp

United AirlinesTrine University

“Backend engineer with airline operations domain experience who modernized flight-ops systems from batch updates to real-time streaming on AWS (Kafka + Spring Boot microservices), improving latency and stability through metric-driven tuning and idempotency. Also shipped a production LLM decision-support component using RAG over operational logs and internal procedures, with strong guardrails and an evaluation/regression loop to reduce hallucinations and enforce grounding.”

Java Python TypeScript Go Spring Boot Microservices Architecture+77

View profile

Gopichand Muppaneni

Screened

Mid-level Data Engineer specializing in Azure, Spark, and scalable ETL/ELT pipelines

Charleston, IL4y exp

Eastern Illinois UniversityEastern Illinois University

“Data engineer with banking FP&A experience who led an end-to-end migration of 10+ TB from Teradata to Azure (ADF + Data Lake + Databricks/PySpark + Synapse). Emphasizes reliability (multi-stage validation, monitoring/alerts) and performance (Spark tuning, incremental loads, autoscaling), reporting ~99.5% pipeline reliability while supporting downstream consumers with stable schemas and clear change management.”

Python SQL PySpark ETL Data Pipelines Data Modeling+47

View profile

Saiprasad Barkam

Screened

Mid-level Data Engineer specializing in cloud ETL and streaming data pipelines

Detroit, MI5y exp

HarmonecareAuburn University at Montgomery

“Data engineer in healthcare/clinical data platforms (HarmonCare) who built and operated an end-to-end lakehouse pipeline ingesting HL7/FHIR at ~2–3M records/day on AWS (Glue/Lambda/S3/Spark) and serving trusted datasets in Snowflake. Implemented strong validation/reconciliation gates and a data quality framework that reduced discrepancies ~40%, plus CI/CD (GitHub Actions/Terraform) and monitoring (Airflow/CloudWatch).”

Python SQL PySpark Scala Shell scripting Apache Spark+89

View profile

Eric Guzman

Screened

Senior Solutions Architect specializing in MLOps and AI platform operations

New York, NY7y exp

AccentureCity College of New York (CUNY)

“Audio/music editor and mixer with Symphony Space promotional work (e.g., Uptown Showdown, Selected Shorts), focused on shaping emotion and pacing through tempo automation, tension-building harmonic choices, and precise cut-to-music timing. Pro Tools certified (Institute of Audio Research) with hands-on mixing workflows across Logic, Reason, and Cubase, and experience iterating based on commercial/producer feedback.”

Alerting Automation Azure Blob Storage Change Management CI/CD Data Pipelines+111

View profile

Ambuk Rehani

Screened

Mid-level AI/Backend Engineer specializing in RAG and data platforms

Dallas, TX7y exp

EABArizona State University

“Built and shipped a production LLM-powered financial Q&A interface that extracts precise numeric data from PDFs using a hybrid AWS Textract + LLM normalization pipeline, with confidence gating and guardrails to prevent unreliable answers. Experienced with LangChain-based RAG orchestration (chunking, memory, structured outputs) and collaborated closely with PMs/analysts on IRS Form 990 extraction requirements.”

Algorithms AWS Databricks Dashboard Development Data Pipelines Database Indexing+66

View profile

Erik Arriaga

Screened

Mid-level Data Engineer specializing in cloud data pipelines and machine learning

Austin, TX4y exp

Corner LeagueCalifornia State University, Long Beach

“Experience spans college-built AWS-hosted Python/Flask web apps and enterprise data work at General Motors, including PostgreSQL query optimization on millions of records and multi-tenant-style data isolation using group-based, column-level permission grants. Also built an AWS-hosted meat price prediction dashboard using Dash/Plotly and ran large nightly data pipelines orchestrated with Apache Airflow.”

Python SQL Java PySpark Pandas NumPy+59

View profile

Bhavishyasai Chigurupati

Screened

Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms

Overland Park, KS5y exp

CignaUniversity of Central Missouri

“Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.”

SDLC Agile Waterfall Python SQL R+179

View profile

Software Engineers Machine Learning Engineers Software Developers Data Scientists Data Engineers Full Stack Developers Engineering AI & Machine Learning Data & Analytics Executive & Leadership

Need someone specific?

AI Search

Related

Need someone specific?