Vetted Data Preprocessing Professionals

Pre-screened and vetted.

Rohith Sadanala - Mid-level Machine Learning Engineer specializing in Generative AI and MLOps in Missouri, USA

Mid-level Machine Learning Engineer specializing in Generative AI and MLOps

Missouri, USA3y exp
AirbnbUniversity of South Florida

LLM/agent engineer who has shipped production RAG chatbots in sustainability-focused domains, including a packaging recommendation assistant that standardized messy user inputs and used Pinecone-backed retrieval over product/regulatory data. Experienced orchestrating end-to-end ML workflows with Airflow and AWS Step Functions/Lambda, emphasizing reliability (property-based testing, circuit breakers, OpenTelemetry) and measurable performance (latency/cost). Partnered closely with non-technical leadership to ship 3 weeks early, driving adoption by 150+ businesses and ~20% reported waste reduction.

View profile
Byron Pineda - Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps in Pascagoula, MS

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS10y exp
TuringMississippi State University

Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.

View profile
Sarath Dunga - Mid-level Full-Stack Developer specializing in cloud microservices and AI/ML integration in Remote, USA

Sarath Dunga

Screened

Mid-level Full-Stack Developer specializing in cloud microservices and AI/ML integration

Remote, USA4y exp
eBayArizona State University

Full-stack engineer (~3 years) with eBay production experience building and operating high-scale, event-driven Python microservices for order processing and AI-powered recommendations (Kafka/Redis/FastAPI on AWS with Prometheus/Grafana). Also delivered polished React+TypeScript analytics dashboards and designed high-concurrency PostgreSQL schemas with significant latency reductions. Recently built AI-agent orchestration and an interactive node-based requirements dashboard for Siemens Polarion via MCP servers, improving user interaction by ~17.8%+.

View profile
HK

Mid-level Full-Stack Software Engineer specializing in cloud and data platforms

Boston, MA5y exp
Northeastern UniversityPenn State University

Full-stack engineer with experience spanning Amazon IMDb and Northeastern’s NeuroJSON portal, combining consumer product work with complex scientific data applications. Built IMDb’s streaming providers feature—described as the company’s most impactful feature of 2023—and has hands-on experience with React/Angular, GraphQL, AWS, Python services, and production monitoring.

View profile
TY

Timothy Yeav

Screened

Senior AI/ML Engineer specializing in Generative AI and FinTech

Bronx, NY8y exp
InsitroNew York City College of Technology (CUNY)

Built end-to-end LLM/RAG systems for biological data and scientific literature analysis in a drug discovery setting, helping researchers explore disease insights and treatment hypotheses faster. Combines applied GenAI product work with strong production engineering, including monitoring, retrieval optimization, reusable Python services, and scalable deployment on AWS/Kubeflow.

View profile
RV

Rucha Visal

Screened

Mid-Level Software Development Engineer specializing in distributed systems and full-stack web apps

Seattle, USA4y exp
AmazonUniversity of North Carolina at Charlotte

Software engineer who owned customer-facing, high-traffic TypeScript/React + TypeScript backend systems end-to-end, emphasizing safe velocity through feature flags, staged rollouts, observability, and rollback-ready incremental delivery. Reports shipping more frequently with fewer production incidents and faster recovery due to these guardrails.

View profile
SS

Mid-level Business Data Analyst specializing in Financial Services and Healthcare analytics

USA4y exp
VisaGeorge Mason University

Full-stack engineer (~4 years) who has owned and shipped customer-facing SaaS onboarding and a role-based real-time analytics dashboard using TypeScript/React with a modular backend. Experienced in microservices with RabbitMQ and strong observability practices (correlation IDs, structured logging, queue metrics), and built an internal deployment tracker integrated with CI/CD that replaced manual spreadsheet/Slack processes.

View profile
NV

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States1y exp
WorkUpUSC

ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).

View profile
ZI

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA10y exp
SOTER AIUC San Diego

Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.

View profile
SR

Sriraksha Rao

Screened

Junior Software Engineer specializing in AI systems and distributed backend platforms

San Diego, CA3y exp
Relevance LabsUC San Diego

Built end-to-end AI features across both fitness and insurance domains, including a full-stack personalized workout recommendation system and a production RAG-based insurance QA assistant at Relevance Labs. Stands out for combining backend/distributed systems skills with practical LLM architecture, evaluation, and risk-aware human-in-the-loop design; notably reduced unnecessary LLM calls by 40% while improving latency and answer reliability.

View profile
PK

Junior Software Engineer specializing in full-stack systems and distributed log analytics

Miami, FL1y exp
NeocisCarnegie Mellon University

CMU candidate with hands-on experience taking LLM concepts from research prototypes toward production-ready designs (structured outputs, guardrails, failure-scenario evaluation). Also partnered with sales/customer teams at Mazecare to drive adoption with Dontia Alliance (largest dental clinic chain in Singapore) and engaged Singapore government stakeholders, bridging clinical workflow needs with IT security/integration concerns.

View profile
HC

Intern Software Engineer specializing in ML/NLP and LLM applications

Boulder, CO0y exp
SplunkUniversity of Colorado Boulder

Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.

View profile
RM

Rakesh Munaga

Screened

Mid-level Full-Stack Engineer specializing in AI and FinTech platforms

TX, USA4y exp
JPMorgan ChaseUniversity of Texas at Arlington

Full-stack engineer building real-time internal banking operations dashboards (Java/Spring Boot microservices + React/TypeScript) with Kafka-based streaming and post-launch performance optimizations. Also shipped a production internal AI support assistant using RAG (Confluence/PDF/support docs ingestion, embeddings + vector DB retrieval) with guardrails, evaluation loops, and observability to reduce hallucinations and prevent regressions.

View profile
Sirisha Maddikunta - Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions in O Fallon, MO

Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions

O Fallon, MO6y exp
MastercardUniversity of Texas at Arlington

Built and owned an end-to-end LLM-powered fraud investigation assistant that automated case summaries and risk analysis, cutting analyst investigation/documentation time by 40%. Stands out for translating RAG concepts into a production-grade internal platform with strong evaluation, monitoring, and reusable Python service architecture that improved both analyst trust and engineering velocity.

View profile
BD

Mid-level Full-Stack Software Engineer specializing in FinTech microservices

California, USA4y exp
PayPalCalifornia State University, Long Beach

Robotics software engineer who has built end-to-end pipelines spanning backend/data processing through model interfaces and hardware integration. Has hands-on ROS2 experience building Python nodes and debugging real-time behavior via profiling, publish-rate tuning, and latency fixes, plus experience standardizing multi-robot communication with QoS adjustments. Uses Gazebo simulation and Docker/CI/CD to catch integration issues early and speed iteration.

View profile
RV

Mid-Level Full-Stack Developer specializing in FinTech

Remote, USA4y exp
IntuitMississippi State University

Backend-heavy full-stack engineer with experience at Intuit (TurboTax Live) and Paytm payments, building and scaling Java/Spring Boot microservices for high-traffic transaction systems. Has hands-on wins improving peak-load performance using Redis/disk caching and Kafka event-driven patterns, plus React/Redux work for web app integration and strong monitoring practices with ELK.

View profile
Niyaz Nurbhasha - Mid-level Machine Learning Engineer specializing in computer vision and LLM pipelines

Mid-level Machine Learning Engineer specializing in computer vision and LLM pipelines

4y exp
BlueHaloDuke University

ML/LLM engineer who built production systems to speed up artist content-creation workflows, including a fine-tuned image captioning model paired with a RAG layer over image embeddings/captions to improve consistency across changing domains. Experienced orchestrating multi-tool agents with LangChain/LangGraph (planning + critic/reflection) and setting up practical monitoring (caption rejection rate) plus evaluation sets for tool-calling accuracy, output quality, and latency.

View profile
Vidhi Upadhyay - Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems in Remote

Senior Software Engineer specializing in AI/ML, computer vision, and cloud-native systems

Remote8y exp
Saayam for AllCarnegie Mellon University

Independently built a production-grade, containerized enterprise agentic AI platform (stateful orchestration + RAG) focused on real-world reliability—guardrails, citation-based outputs, reranking, query rewriting, and evaluation harnesses to reduce hallucinations. Hands-on with OpenAI SDK, CrewAI, and LangGraph, and has delivered AI solutions for non-technical NGO stakeholders via demos and practical POCs.

View profile
AC

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and predictive analytics

New Jersey, USA5y exp
JPMorgan ChaseStevens Institute of Technology

GenAI/LLM engineer who architected and deployed a production RAG “research assistant” for JPMorgan Chase’s regulatory compliance team, focused on safety-critical behavior (mandatory citations, refusal when evidence is missing). Deep hands-on experience with LlamaIndex, Pinecone, Hugging Face embeddings, LangGraph agent workflows, and metric-driven evaluation (golden sets, TruLens), including a reported 28% relevancy lift via cross-encoder re-ranking.

View profile
SM

SHREY MATHUR

Screened

Mid-level Machine Learning Engineer specializing in LLMs and AI products

Sunnyvale, CA6y exp
TCSUCLA

Applied ML/LLM engineer currently building AppleCare’s production chat recommender, owning the full lifecycle from transcript cleaning and fine-tuning through distributed deployment, monitoring, and iterative improvement. Their work delivered >10% copy-count improvement, 5% lower modification rate, 60% cost reduction, and $1.1M profitability in 2025, and they also created a reasoning-data generation approach that enabled a reasoning model and a judge model that cut eval time by over 99%.

View profile
TS

Tianai Shi

Screened

Intern Full-Stack Software Engineer specializing in test analytics platforms

La Jolla, CA2y exp
NutanixUC San Diego

Software engineer intern at Nutanix who independently shipped and maintained an internal smoke-test/failure-analysis dashboard, integrating failure data from multiple upstream systems (e.g., Jira, Jenkins, CircleCI) via REST APIs. Also has prior data-science experience building Postgres-based asset management analytics with automated reporting and indexing for faster time-series retrieval.

View profile
KT

Mid-level Data Scientist specializing in machine learning and generative AI

Saint Louis, MO5y exp
DoorDashSaint Louis University

ML/LLM engineer who has shipped a production transformer-based document understanding system on AWS, owning the full pipeline from domain fine-tuning to Dockerized CI/CD deployment. Demonstrates strong production rigor—latency optimization (distillation/quantization, async batching, autoscaling), orchestration with Airflow/Step Functions/Azure Data Factory, and monitoring/drift detection—plus experience translating ops stakeholder needs into adopted AI automation via dashboards.

View profile
YY

Yuxi Yang

Screened

Intern Embedded Software Engineer specializing in autonomous driving and applied computer vision

null1y exp
iFLYTEKJohns Hopkins University

Autonomous driving engineer from iFLYTEK who shipped 5+ middleware modules for vehicles across three models, with deep experience in reliability, IPC performance, and real-world system hardening. Stands out for translating flaky production behavior into measurable signals—resolving 30+ faults, cutting backlog 39%, improving latency 20%, and supporting 500+ hours of road testing with 99%+ reliability.

View profile

Need someone specific?

AI Search