Vetted Model Evaluation Professionals

Pre-screened and vetted.

SK

Mid-level Data Scientist / ML Engineer specializing in streaming ML systems for healthcare and IoT

Urbandale, IA | 4y exp
John Deere | Auburn University at Montgomery

ML/GenAI engineer with production experience building an LLM-powered governance layer that summarizes verified drift/performance signals into validation reports and release notes, designed for regulated environments with de-identification and non-blocking fallbacks. Strong Airflow-based orchestration background across healthcare and finance, integrating Databricks/Spark and MLflow for scalable retraining/monitoring. Demonstrated ability to partner with non-technical healthcare operations teams to deliver actionable risk-scoring outputs via dashboards and automated reporting.

RR

Mid-level Data Scientist & Machine Learning Engineer specializing in fraud and forecasting

USA | 5y exp
JPMorgan Chase | University of Texas at Dallas

ML/LLM practitioner who has shipped production RAG systems (summarization + Q&A) and end-to-end Airflow-orchestrated demand forecasting pipelines at NEON IT. Strong focus on reliability—uses evaluation scripts, retrieval/chunking tuning, validation/retries/alerts, and stakeholder-driven iteration to make AI workflows consistent and usable.

Sowmya Sree

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

Dallas, TX | 5y exp
Bank of America | University of North Texas

Built production LLM systems including a real-time customer feedback analysis and workflow automation platform using RAG and multi-agent orchestration with confidence-based human escalation, addressing privacy and legacy integration challenges. Also automated ML operations with Airflow/Kubernetes (e.g., daily churn model retraining) cutting retraining time to under 30 minutes, and demonstrates a rigorous testing/monitoring approach plus strong non-technical stakeholder collaboration.

Shashank Garg

Screened

Engineering leader specializing in FinTech ML/AI platforms

San Francisco, CA | 12y exp
TravelBank | San José State University

Engineering Manager/player-coach leading Data Infrastructure, ML/DS, and AI Engineering pods who recently shipped multiple production agentic GenAI features. Built privacy-preserving LLM workflows (PII redaction via Microsoft Presidio) and drove an AI expense-approval agent from ambiguous ask to GA, cutting approval time from ~2.5 days to <4 hours with >85% accuracy. Also owned a major LLM cost overrun incident and implemented cost observability plus circuit breakers to prevent runaway agent loops.

Sampada Shelke

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and applied research

La Jolla, CA | 3y exp
Statistical Visual Computing Lab | UC San Diego

New grad SDE (AI/ML) who built and deployed an LLM-based chatbot framework used across technology, military, and banking contexts, focusing on model selection tradeoffs (latency vs accuracy) through prototyping and benchmarking. Also built a multi-agent "eaterybot" using PyAutoGen/AutoGen with a manager agent orchestrating specialized agents, and emphasizes rigorous testing with adversarial/edge-case datasets and hallucination checks.

Rishitha Reddy Katamareddy

Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems

USA | 4y exp
Optum | University at Buffalo

Built and deployed a production agentic LLM knowledge assistant that answers complex questions over internal documents, APIs, and databases using a RAG architecture (FAISS/Pinecone) and LangChain/LangGraph orchestration. Emphasizes production-grade reliability and hallucination control through grounding, confidence thresholds, validation, retries/fallbacks, and full observability (logging/metrics/traces) with continuous evaluation and feedback loops.

Hamidreza Lotfalizadeh

Mid-level AI/ML Engineer specializing in LLM agents, RAG, and ML systems

Bay Area, CA | 6y exp
Inertia Systems | Purdue University

At Inertia Systems, built a production LLM-powered ingestion pipeline that converts heterogeneous sources (PDF/JSON/IFC/SQL and financial tables) into standardized text and uses GraphRAG to construct a knowledge graph with verified dependency relationships. Also has hands-on HPC orchestration experience with SLURM, including creating a custom wrapper process manager to improve resource utilization under restrictive scheduling policies.

Anirudh Raghavan

Entry-level Computer Vision/Autonomy Engineer specializing in perception and object detection

West Lafayette, IN | 0y exp
Purdue University | Purdue University

Robotics software engineer with hands-on ROS2 + Autoware perception experience, focused on building benchmarking infrastructure for object detection models inside a real-time autonomous driving stack. Strong in evaluation rigor (synchronization, deterministic playback, format standardization) and practical ROS2 debugging/validation workflows using RViz and Gazebo.

Aravind Mohan

Screened

Junior Software Engineer specializing in AI agents and backend systems

Seattle, WA | 5y exp
Biostate AI | University at Buffalo

Backend/AI workflow engineer who built a production event-personalization service (FastAPI + AWS Lambda) and solved real-world reliability/latency issues with deterministic routing, caching, and query/index optimization. Also built an end-to-end Gmail-based job application tracking agent using a lightweight RAG pipeline with Gemini, strong guardrails (Pydantic schemas, confidence thresholds), and offline regression tests to prevent drift and hallucination-driven data corruption.

Supreet Purthpli

Mid-level AI/ML Software Engineer specializing in cloud-native MLOps and FinTech

San Francisco, CA | 4y exp
JPMorgan Chase | University of Kansas

Software engineer with JPMorgan Chase experience delivering end-to-end fintech features (Next.js/React/Node/Postgres on AWS) and measurable performance gains. Built and productionized an AI-native credit decisioning workflow combining LLMs, vector retrieval, and a rules engine with strong governance (bias checks, auditability, human-in-loop), improving precision and cutting underwriting turnaround time by 40%.

SJ

Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics

Missouri, USA | 4y exp
KPMG | University of Central Missouri

Built and deployed a production LLM-powered calorie-counting chatbot that turns plain-English meal descriptions into normalized food entities, quantities, and calorie estimates using a hybrid transformer + rule-engine pipeline. Emphasizes reliability with schema/constraint guardrails, confidence-based routing (including embedding similarity search fallbacks), and strong observability/metrics (hallucination rate, calibration, latency, cost). Partnered closely with nutritionists to encode domain standards into mappings and validation logic.

SP

Mid-level Machine Learning Engineer specializing in LLM systems and healthcare data automation

California, USA | 2y exp
Prime Healthcare | USC

React performance-focused engineer who contributed performance patches back to an open-source context+reducer state helper after profiling and fixing excessive re-renders in an enterprise project management platform at Easley Dunn Productions. Also built an end-to-end LLM-driven pipeline at Prime Healthcare to normalize millions of supply-chain records, reducing defects by 80% and saving 160+ hours/month.

Jason Meno

Screened

Senior Full-Stack Software Engineer specializing in digital health and AI

San Francisco, CA | 7y exp
Feeling Great | Purdue University

ML practitioner with hands-on experience in healthcare time-series modeling (CGM-based blood glucose prediction) including a novel ICA-based blind source separation approach and robust data-cleaning for noisy, missing sensor data. Also built an embeddings + LLM-powered podcast recommendation workflow using YouTube transcript scraping and Vellum AI document indexing, with a strong emphasis on production-grade engineering practices (TDD, monitoring) and realistic rolling validation for forecasting.

SM

Mid-level AI/ML Engineer specializing in GenAI agents, RAG pipelines, and MLOps

USA | 6y exp
UnitedHealthcare | Kent State University

AI/ML engineer who built a production RAG-based internal document intelligence assistant (LangChain + Pinecone) to let employees query enterprise reports in natural language. Demonstrated hands-on pipeline orchestration with Apache Airflow and tackled real production issues like retrieval grounding and latency using tuning, caching, and token optimization, while partnering closely with non-technical business stakeholders through iterative demos.

Vinay Maruthi

Screened

Mid-level Software Engineer specializing in LLM agents and ERP-integrated workflow automation

New York, NY | 4y exp
Deloitte | University of Central Missouri

Built and shipped a production LLM-powered agent that automated purchasing and inventory operations by integrating with live ERP data and returning structured, machine-readable outputs usable by downstream systems. Emphasizes real-world reliability through orchestration, strict schemas/validation, confidence-based fallbacks with human handoff, and monitoring/evaluation feedback loops to reduce silent failures and make issues observable.

NS

Mid-level ML Data Engineer specializing in MLOps and scalable healthcare data pipelines

Boston, MA | 5y exp
Cigna | Northeastern University

Data/ML platform engineer with healthcare (Cigna) experience owning an end-to-end pipeline spanning Airflow + Debezium CDC ingestion, PySpark/SQL transformations, rigorous data quality gates, and feature-store/API serving for ML training and inference. Worked at 10+ TB scale and cites a ~30% latency reduction plus stronger reliability via idempotent design, monitoring, and backfill-safe reprocessing; also built pragmatic early-stage data pipelines at Frankenbuild Ventures.

Prateek Pravanjan

Junior Machine Learning Engineer specializing in LLM evaluation and GenAI pipelines

Remote | 1y exp
Mercor | Stevens Institute of Technology

LLM/agent engineer who built a production LangGraph multi-agent orchestrator connecting GitHub and APM/observability signals with a chain-of-verification loop for root-cause analysis. Emphasizes pragmatic architecture (start simple with state summaries), performance tuning (async LLM calls, Docker), and rigorous evaluation (LLM-as-judge, adversarial testing, hallucination/instruction adherence metrics, tool-call tracing) while iterating with non-technical stakeholders via A/B testing.

Saniya Shinde

Screened

Mid-level Data Scientist specializing in NLP, LLMs, and RAG systems

Washington, DC | 4y exp
World Bank | George Washington University

Built and deployed a production-style vision-language pipeline that generates structured medical reports from chest X-rays using BioViLT embeddings, an image-text alignment module, and BiGPT fine-tuned with LoRA, delivered via Streamlit and hosted on AWS EC2. Also has collaboration experience presenting EDA findings, feature importance, and model performance to Ford managers while working with vehicle parts data at Bimcon.

Ashwini Ramesh Kumar

Junior AI Software Engineer specializing in LLMs, RAG, and agent workflows

Remote | 1y exp
UMass Chan Medical School | University of Massachusetts Amherst

Backend/ML-leaning engineer who built a content-based event recommender for FlowMingle using embeddings + HNSW vector search on Google Cloud, with Firebase as the backend and a managed recommendation lifecycle (15 recs/user, daily async generation, weekly deletion) now serving 1500+ users. Also led a cost-driven migration of ConvAI services to Azure AI using parallel request testing from a Unity client, with post-migration monitoring via logs and model evals; contributed to a Massachusetts law-enforcement conversation analysis system by expanding ingestion to PDF/TXT/Excel and multi-file inputs.

Yuchen Wang

Screened

Intern Software Engineer specializing in full-stack development and AI/ML

New York, NY | 1y exp
AdasEco | NYU

Built and maintains an AI Finance Tracker end-to-end as a solo full-stack product owner, from Figma designs and React frontend to Flask APIs, Firestore, auth, deployment, and AI insights. Stands out for combining product instinct with pragmatic engineering decisions like pre-aggregating financial data to control LLM costs and adding OCR receipt scanning based on real user feedback.

AM

Senior Machine Learning Engineer specializing in conversational AI and healthcare ML

Chicago, IL | 5y exp
Optum | University of Illinois Chicago

ML/AI engineer focused on taking LLM products from experiment to production, with hands-on ownership of a RAG-based customer support system that improved response quality by 35% and cut latency by 30%. Stands out for combining product impact with production rigor across retrieval tuning, safety guardrails, monitoring, and reusable Python/FastAPI services that accelerated adoption across teams.

VS

Senior AI/ML Engineer specializing in Generative AI and agentic systems

Texas, USA | 5y exp
Bank of America | Wichita State University

Built and deployed an agentic RAG assistant in production to automate enterprise knowledge search and multi-step workflows with tool calling, tackling real-world issues like hallucinations, retrieval accuracy, and latency. Demonstrates strong LLMOps and orchestration depth (MLflow, Airflow, LangGraph/LangChain/LlamaIndex) plus a metrics-driven approach to agent testing/evaluation and cross-functional delivery with business stakeholders.

Naim San

Screened

Senior AI/ML Engineer specializing in Python, RAG systems, and LLM fine-tuning

United States | 8y exp
Mechanize

Built and owned an end-to-end RAG-based AI support platform at Mechanize (FastAPI/LangChain/Pinecone/React) with rigorous evals and guardrails, driving 45% fewer support tickets and ~$280K annual savings. Also led a high-risk legacy modernization at Argo AI, incrementally extracting a monolithic Django backend using Strangler Fig + feature flags while supporting 10K+ concurrent users.
