Vetted Data Pipelines Professionals

Pre-screened and vetted.

MV

Manish Vemula

Screened

Mid-level Machine Learning Engineer specializing in real-time pipelines and NLP/GenAI

TX, USA4y exp
DiscoverCentral Michigan University

ML/MLOps practitioner from Discover Financial who built and deployed a real-time AI fraud detection platform (LSTM + VAE) on AWS SageMaker with Docker/FastAPI and Jenkins-driven CI/CD. Demonstrated measurable impact (30% accuracy lift, 25% fewer false alerts) and deep expertise in class-imbalance mitigation, drift monitoring, and orchestration (Airflow/Kubeflow), plus strong stakeholder adoption via Power BI dashboards for fraud/compliance teams.

View profile
DG

Dimple Galla

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in RAG, MLOps, and real-time analytics

Lawrence, KS4y exp
PaycomUniversity of Kansas

Software/ML engineer who built a production automated job-finding and cold-email personalization system for Fortune 500 outreach, using JobSpy for dynamic scraping, LangChain orchestration, and LLM+vector DB semantic search with grounding/relevance metrics and guardrails. Also delivered a predictive investment analytics platform for financial advisors, communicating results via Tableau dashboards and portfolio KPIs like Sharpe ratio and drawdowns.

View profile
TW

Senior Data Analytics & Data Science professional specializing in Financial Services

4y exp
InfosysGeorgia State University

Worked on large financial analytics datasets combining complaint text, transaction logs, and demographics; built end-to-end NLP/ML pipelines (TF-IDF + Random Forest) and data integration in BigQuery with Tableau reporting, citing ~95–98% accuracy. Also implemented entity resolution with fuzzy matching and semantic linking using BERT sentence-transformer embeddings stored in FAISS, including fine-tuning on labeled pairs to improve search/linking relevance.

View profile
AG

Amie Gibson

Screened

Senior Geospatial Developer specializing in GIS automation, elevation/LiDAR, and AI-enabled apps

Sand Springs, OK27y exp
FEMAFlorida Institute of Technology

Built and monetized an object-identification app end-to-end (FastAPI backend, HTML/JS frontend, SQLite→Postgres, auth, and an iOS wrapper via Capacitor/Xcode with Apple privacy/policy compliance). Also productionized an AI-native geospatial metadata/QA assistant using LLM+RAG plus deterministic Python validation, measuring impact via time-to-first-pass review and rework rate, and has experience modernizing legacy GIS workflows and delivering across USDA/FEMA-style teams with disciplined Jira-based execution.

View profile
Pooja Miryala - Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for banking and healthcare in Ohio, USA

Pooja Miryala

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for banking and healthcare

Ohio, USA4y exp
Fifth Third BankYoungstown State University

Deployed a real-time LLM-driven call center summarization and agent-assist platform at Fifth Third Bank, combining transformer models (BERT/GPT) with FastAPI inference on AKS and vector storage (ChromaDB/PostgreSQL). Emphasizes production-grade reliability (autoscaling, CI/CD, monitoring) and measurable evaluation (A/B testing), and translates model outputs into business-facing Power BI insights for call center leadership.

View profile
Ponugoti Sushma - Mid-level Machine Learning Engineer specializing in IoT, edge AI, and enterprise ML in Texas, USA

Mid-level Machine Learning Engineer specializing in IoT, edge AI, and enterprise ML

Texas, USA5y exp
AllstateTexas A&M University-Corpus Christi

Built and productionized an LLM/RAG question-answering service over technical documentation, focusing on retrieval quality (reranking + IR metrics), latency, and scaling. Experienced orchestrating end-to-end ETL/ML workflows with Airflow/Prefect/AWS Step Functions and improving reliability via parallelism, retries, and shadow testing. Also delivered an explainable healthcare risk-flagging classifier with a stakeholder-friendly dashboard for a non-technical program manager.

View profile
Akhil Bharadwaj Mateti - Mid-level Software Engineer specializing in Data Science and Machine Learning in Arlington, Virginia

Mid-level Software Engineer specializing in Data Science and Machine Learning

Arlington, Virginia4y exp
ElevateMeGeorge Washington University

Robotics/AV perception engineer who built a semantic-segmentation road detection system and integrated it into a ROS-based real-time pipeline (ROS bag camera feed to live monitor) achieving ~12 FPS. Strong in practical deployment work: solved multi-library versioning issues (ROS/OpenCV/TensorFlow), containerized the stack with Docker, and optimized inference by shifting runtime to C++ for large latency gains on NVIDIA hardware.

View profile
Andrew Clayman - Senior Data Scientist specializing in ML, NLP, and production AI systems in Remote

Senior Data Scientist specializing in ML, NLP, and production AI systems

Remote8y exp
AppstemUniversity of Southampton

Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.

View profile
Meghana Nandivada - Junior Machine Learning Engineer specializing in production ML systems and MLOps

Junior Machine Learning Engineer specializing in production ML systems and MLOps

2y exp
TCSStevens Institute of Technology

ML/AI engineer (TCS) who built and productionized a customer segmentation and personalized-offer recommendation pipeline end-to-end (data cleaning/feature engineering/clustering through Flask API deployment in Docker with monitoring). Emphasizes reliability and operational rigor via validation checks, periodic retraining, model/API versioning, and latency optimization, and has experience translating marketing KPIs into usable dashboards for non-technical teams.

View profile
Dhairya Desai - Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics in Chicago, IL

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL13y exp
OptumUniversity of Texas at Dallas

ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.

View profile
Serge Ahranovich - Executive CTO / Platform Architect specializing in IoT, telematics, and EV charging infrastructure in Los Angeles, CA

Executive CTO / Platform Architect specializing in IoT, telematics, and EV charging infrastructure

Los Angeles, CA20y exp
TimeTickBelarusian State University of Informatics and Radioelectronics

Founder of TimeTick (timetick.io), an AI-powered diagnostics platform for IoT combining device simulation, automated testing, and real-time monitoring—initially focused on EV charger diagnostics. Former VP of Engineering with a track record of building IoT systems from scratch and applying AI to detect protocol-failure patterns that drive downtime; currently supporting existing customers and converting pilots (with leads like Siemens and ABB) into paid subscriptions.

View profile
Patrick Seeman - Mid-level Data Scientist and Game Tech Leader specializing in ML, healthcare analytics, and Unity in Manila, Philippines

Mid-level Data Scientist and Game Tech Leader specializing in ML, healthcare analytics, and Unity

Manila, Philippines5y exp
GridLock GamesJohn Carroll University

Data scientist at Cleveland Clinic Taussig Cancer Institute who led a production automation to convert unstructured (and sometimes image-based) pathology reports into structured data for government reporting. Built an on-prem LangGraph + Ollama pipeline with OCR (Tesseract), spell-checking, confidence scoring, and human-audited guardrails to mitigate hallucinations and improve reliability under PHI constraints.

View profile
Adrian Lawrence - Executive Product & Technology Leader specializing in AI, analytics, and regulated industries in Atlanta, GA

Executive Product & Technology Leader specializing in AI, analytics, and regulated industries

Atlanta, GA14y exp
Vitalis VenturesGeorgia Tech

Serial startup product/technology leader who previously exited a company to Green Street and has accelerator experience via Notre Dame’s IDEA Center. Now pursuing a commercial real estate analytics concept focused on deep demand analysis for better capital allocation, with a provisional patent filed and experience supporting VC funds as an operating partner on product vision and strategy.

View profile
AR

Abheesht Roy

Screened

Junior Software Engineer specializing in AI and distributed systems

San Francisco, CA2y exp
Agent-Techs AIArizona State University

Built and shipped a production LLM-driven data harmonization/record-matching pipeline for pharmaceutical datasets, combining normalization, embeddings/vector search, and an LLM validation step. Emphasizes production reliability via guardrails, confidence thresholds, idempotent/retryable stages, and human-in-the-loop fallbacks, with monitoring focused on manual review and error rates to reduce false positives.

View profile
BS

Full-Stack Software Engineer specializing in Java, React, and AWS

Plano, TX3y exp
Progress SolutionsNorthwest Missouri State University

Backend-focused Python engineer who builds modular Flask services on AWS and specializes in performance/scalability work across data-heavy APIs. Has concrete wins in query optimization (1.5s to <200ms) and high-throughput async processing (Celery+Redis, ~40% throughput gain), plus experience serving scikit-learn text classification models via containerized REST services and designing multi-tenant data isolation strategies.

View profile
SB

Mid-level AI/ML & Data Engineer specializing in MLOps and cloud data pipelines

Remote, USA4y exp
MerkleUniversity of North Carolina at Charlotte

AI/ML engineer (Merkle) with hands-on experience deploying RAG-based LLM applications and real-time recommendation engines into production. Strong in cloud/on-prem architectures, GPU autoscaling, caching, and network optimization—delivered measurable latency reductions (40–70%) and improved retrieval relevance by systematically benchmarking chunking/embedding configurations and validating pipelines via CI/CD.

View profile
PG

Mid-level Data Scientist specializing in healthcare ML and GenAI

San Marcos, TX4y exp
UnitedHealth GroupTexas State University

Healthcare data/NLP practitioner with experience at UnitedHealthcare building production ML systems that connect unstructured call center transcripts and medical notes to structured claims data. Has delivered measurable impact (25% classification accuracy lift; ~30% relevance improvement) using classical NLP, embeddings (Sentence-BERT + FAISS), and AWS SageMaker deployments with robust validation and drift monitoring.

View profile
SM

Shiva Maddoju

Screened

Mid-level Full-Stack Java Engineer specializing in cloud-native, event-driven systems

Chicago, IL4y exp
United AirlinesTrine University

Backend engineer with airline operations domain experience who modernized flight-ops systems from batch updates to real-time streaming on AWS (Kafka + Spring Boot microservices), improving latency and stability through metric-driven tuning and idempotency. Also shipped a production LLM decision-support component using RAG over operational logs and internal procedures, with strong guardrails and an evaluation/regression loop to reduce hallucinations and enforce grounding.

View profile
GM

Mid-level Data Engineer specializing in Azure, Spark, and scalable ETL/ELT pipelines

Charleston, IL4y exp
Eastern Illinois UniversityEastern Illinois University

Data engineer with banking FP&A experience who led an end-to-end migration of 10+ TB from Teradata to Azure (ADF + Data Lake + Databricks/PySpark + Synapse). Emphasizes reliability (multi-stage validation, monitoring/alerts) and performance (Spark tuning, incremental loads, autoscaling), reporting ~99.5% pipeline reliability while supporting downstream consumers with stable schemas and clear change management.

View profile
SB

Mid-level Data Engineer specializing in cloud ETL and streaming data pipelines

Detroit, MI5y exp
HarmonecareAuburn University at Montgomery

Data engineer in healthcare/clinical data platforms (HarmonCare) who built and operated an end-to-end lakehouse pipeline ingesting HL7/FHIR at ~2–3M records/day on AWS (Glue/Lambda/S3/Spark) and serving trusted datasets in Snowflake. Implemented strong validation/reconciliation gates and a data quality framework that reduced discrepancies ~40%, plus CI/CD (GitHub Actions/Terraform) and monitoring (Airflow/CloudWatch).

View profile
Eric Guzman - Senior Solutions Architect specializing in MLOps and AI platform operations in New York, NY

Eric Guzman

Screened

Senior Solutions Architect specializing in MLOps and AI platform operations

New York, NY7y exp
AccentureCity College of New York (CUNY)

Audio/music editor and mixer with Symphony Space promotional work (e.g., Uptown Showdown, Selected Shorts), focused on shaping emotion and pacing through tempo automation, tension-building harmonic choices, and precise cut-to-music timing. Pro Tools certified (Institute of Audio Research) with hands-on mixing workflows across Logic, Reason, and Cubase, and experience iterating based on commercial/producer feedback.

View profile
Ambuk Rehani - Mid-level AI/Backend Engineer specializing in RAG and data platforms in Dallas, TX

Ambuk Rehani

Screened

Mid-level AI/Backend Engineer specializing in RAG and data platforms

Dallas, TX7y exp
EABArizona State University

Built and shipped a production LLM-powered financial Q&A interface that extracts precise numeric data from PDFs using a hybrid AWS Textract + LLM normalization pipeline, with confidence gating and guardrails to prevent unreliable answers. Experienced with LangChain-based RAG orchestration (chunking, memory, structured outputs) and collaborated closely with PMs/analysts on IRS Form 990 extraction requirements.

View profile
Erik Arriaga - Mid-level Data Engineer specializing in cloud data pipelines and machine learning in Austin, TX

Erik Arriaga

Screened

Mid-level Data Engineer specializing in cloud data pipelines and machine learning

Austin, TX4y exp
Corner LeagueCalifornia State University, Long Beach

Experience spans college-built AWS-hosted Python/Flask web apps and enterprise data work at General Motors, including PostgreSQL query optimization on millions of records and multi-tenant-style data isolation using group-based, column-level permission grants. Also built an AWS-hosted meat price prediction dashboard using Dash/Plotly and ran large nightly data pipelines orchestrated with Apache Airflow.

View profile
Bhavishyasai Chigurupati - Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms in Overland Park, KS

Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms

Overland Park, KS5y exp
CignaUniversity of Central Missouri

Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.

View profile

Need someone specific?

AI Search