Vetted Model Evaluation Professionals

Pre-screened and vetted.

AA

Intern Software Engineer specializing in backend systems and data engineering

1y exp
Pocket FM · BITS Pilani, Hyderabad

Backend/AI engineer who has built and shipped two products: Know Founder (Python/SQL/AWS) scaling to 2,000+ users in the first month, and Unifr (unifr.online), an AI search visibility engine that queries multiple LLMs and turns responses into structured brand insights. Strong in production reliability/performance (Redis caching, indexing, precomputation) and in designing agentic workflows with guardrails, validation, retries, and human escalation.

SR

Sahithi Reddy

Screened

Mid-level Machine Learning Engineer specializing in LLM-powered products

Dallas, TX · 4y exp
Verizon · University of Massachusetts Dartmouth

Verizon engineer who productionized an LLM-based personalization capability for a customer-facing digital platform, owning the path from success metrics through scalable APIs, A/B validation, and post-launch monitoring (latency/accuracy/drift). Experienced in diagnosing and fixing real-time LLM/RAG workflow issues under peak load, and in enabling adoption via tailored technical demos/workshops and sales support materials.

NM

Naveena Musku

Screened

Senior AI/ML Engineer specializing in Agentic AI and LLM automation

8y exp
Western Union · Jawaharlal Nehru Technological University

Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.

EG

Esha Gangam

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA · 4y exp
Deloitte · University at Albany

GenAI/ML engineer from Deloitte who built and shipped a production RAG-based internal search assistant for support teams, delivering quantified operational gains (20% effort reduction, 35% faster manual lookup). Experienced in enterprise-grade LLM reliability (grounding/hallucination control), compliance/security constraints, and rapid release cycles using CI/CD, MLflow, and orchestration tools (Airflow, Databricks Jobs, LangChain).

PK

Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI

USA · 4y exp
GE HealthCare · Franklin University

LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.

HW

Hans Walker

Screened

Junior Machine Learning Engineer specializing in generative AI and computer vision

Boston, MA · 2y exp
Cuebric · USC

AI engineer who deployed a production LLM-powered safety system for an education platform, combining rule-based checks, multi-LLM verification, and selective context (prompt+image vs image-only) to prevent explicit prompts/images from getting through. Strong focus on reliability via benchmarking, trace-based failure analysis, and continuous improvement driven by stakeholder feedback and manual review.

KS

Mid-level AI/ML Engineer specializing in Generative AI and LLMOps

USA · 6y exp
UnitedHealth Group · Kent State University

Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.

MR

Mid-level AI/ML Engineer specializing in enterprise ML, MLOps, and Generative AI

Springfield, Missouri · 5y exp
O'Reilly Auto Parts · Saint Louis University

ML/LLM engineer who has shipped production RAG systems (LangChain + HF Transformers + FAISS) with hybrid retrieval and cross-encoder re-ranking, deployed via FastAPI/Docker/Kubernetes and monitored with MLflow. Also partnered with wealth advisors at Edward Jones to deliver a client retention model with SHAP-driven explanations and a dashboard that improved trust and adoption and reduced high-value client churn.

KP

Krisha Patel

Screened

Entry-Level Software Engineer specializing in AI/ML and Full-Stack Development

United States · 0y exp
Target · University at Albany

Backend engineer who built an NL-to-SQL system at Target, using a multi-step LLM pipeline with vector-store schema retrieval and SQL validation to safely answer business questions. Strong in production FastAPI systems (async, Pydantic, Docker/Uvicorn, load balancing) and security (OAuth2/JWT, scopes, and database row-level security), with experience migrating Flask apps to FastAPI + PostgreSQL using strangler/feature-flagged canary rollouts.

AA

Junior Backend Software Engineer specializing in conversational AI and cloud APIs

Bangalore, India · 1y exp
Harman · USC

Backend/ML-focused software engineer who built and evolved a Python/FastAPI backend for a large-scale conversational AI platform, decoupling API and inference services to improve stability and deployment velocity. Experienced in production hardening (timeouts/fallbacks/monitoring), secure multi-tenant systems (JWT/RBAC/RLS), and low-risk migrations using shadow deployments and incremental traffic ramp-ups.

SP

Soham Patil

Screened

Junior Cloud & AI/ML Engineer specializing in AWS GovCloud and MLOps

Washington, DC · 2y exp
IBM · Texas Tech University

Robotics software engineer with hands-on ROS 2 autonomy experience on an obstacle-avoiding quadrotor (ROS 2 + Gazebo + PX4 + Nav2/SLAM), including custom work to extend Nav2 into a 3D aerial domain and output PX4 trajectory setpoints. Also built cost-saving ML infrastructure (PostgreSQL + AWS data-cleaning pipeline) and improved object detection accuracy by 40% using CUDA/PyTorch, with strong containerization and CI/CD practices (Docker + Kubernetes, aggressive version pinning) to prevent environment drift.

JH

Joshua Hewitt

Screened

Senior Software Engineer specializing in Generative AI product development

San Francisco, USA · 9y exp
Padlet · University of Sydney

AI product builder at Padlet who shipped multiple production LLM features for education workflows, including an AI document generator (AI Recipes) and a RAG-enabled in-product chat assistant. Built an AI microservice layer (LangChain) to swap model providers easily and created automated + human-in-the-loop evaluation systems (including ~100-test runs) to iterate on prompts and quality.

AM

Mid-level Data Analyst specializing in healthcare and financial analytics

USA · 3y exp
Cardinal Health · Indiana Tech

Built and productionized an LLM-powered clinical documentation and insights pipeline at Cardinal Health using LangChain + GPT-4 with RAG to summarize long clinical notes, extract medication/dosage entities, and generate structured SQL-ready outputs for downstream analytics. Emphasizes clinical reliability via labeled benchmarking (precision/recall/F1), shadow deployments, clinician human-in-the-loop review, and ongoing monitoring/orchestration with Airflow, Lambda, S3, Postgres, and Power BI.

VH

Mid-level ML/AI Engineer specializing in NLP, RAG pipelines, and financial risk & fraud systems

USA · 3y exp
Finta · University at Buffalo

Built and shipped LLM/RAG systems in finance and startup settings, including a Goldman Sachs document intelligence platform that indexed ~8TB of regulatory filings and delivered cited, conversational answers with <2s latency—cutting compliance research by ~4.5 hours per batch. Also developed LangChain-based agent workflows at Finta to automate CRM enrichment and investor lookup with strong testing, tracing (LangSmith), privacy guardrails, and auditability.

AM

Arya Mane

Screened

Junior Full-Stack & AI/ML Engineer specializing in LLMs and multimodal document processing

Dallas, Texas · 1y exp
Receptro.AI · University of Texas at Dallas

Built a production RAG-based NBA player scouting assistant that embeds player profiles into FAISS, orchestrates retrieval and LLM recommendations with LangChain, and surfaces results via embedded Tableau dashboards. Demonstrates strong focus on evaluation/monitoring (batch tests, LLM-as-judge, latency/failure/token metrics) and has experience translating non-technical founder goals into DAPT + fine-tuning plans on curated data.

SK

Sana Khan

Screened

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech

Oklahoma, USA · 4y exp
Capital One · Oklahoma Christian University

ML/LLM engineer who has deployed a production LLM-powered assistant for intent classification and query routing (order recommendation/support deflection), combining BERT fine-tuning with an embedding-based retrieval layer and optimizing for low-latency inference. Experienced with end-to-end reliability practices—Airflow-orchestrated ETL, data validation/alerting, MLflow experiment tracking, and iterative improvements driven by user feedback and monitoring.

DA

Executive AI Platform & Product Leader specializing in commercialization and multimodal AI

29y exp
InferLink · University of Texas at Dallas

Entrepreneur building an applied-AI tool for geological resource exploration (critical minerals, oil & gas) that overlays proprietary and public data from reports/logs/maps to generate evidence-based greenfield profiling insights. Has spent ~2 years on industry research, built a POC, validated demand with purchasing signals, and developed partnerships/network including USGS, DARPA, and ESRI.

AR

Entry-level Computer Vision/Autonomy Engineer specializing in perception and object detection

West Lafayette, IN · 0y exp
Purdue University · Purdue University

Robotics software engineer with hands-on ROS2 + Autoware perception experience, focused on building benchmarking infrastructure for object detection models inside a real-time autonomous driving stack. Strong in evaluation rigor (synchronization, deterministic playback, format standardization) and practical ROS2 debugging/validation workflows using RViz and Gazebo.

RR

Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems

USA · 4y exp
Optum · University at Buffalo

Built and deployed a production agentic LLM knowledge assistant that answers complex questions over internal documents, APIs, and databases using a RAG architecture (FAISS/Pinecone) and LangChain/LangGraph orchestration. Emphasizes production-grade reliability and hallucination control through grounding, confidence thresholds, validation, retries/fallbacks, and full observability (logging/metrics/traces) with continuous evaluation and feedback loops.

HL

Mid-level AI/ML Engineer specializing in LLM agents, RAG, and ML systems

Bay Area, CA · 6y exp
Inertia Systems · Purdue University

At Inertia Systems, built a production LLM-powered ingestion pipeline that converts heterogeneous sources (PDF/JSON/IFC/SQL and financial tables) into standardized text and uses GraphRAG to construct a knowledge graph with verified dependency relationships. Also has hands-on HPC orchestration experience with SLURM, including creating a custom wrapper process manager to improve resource utilization under restrictive scheduling policies.

MB

Manav Bhasin

Screened

Junior Full-Stack Machine Learning Engineer specializing in production ML systems

San Jose, CA · 2y exp
AgroFocal Technologies Inc · San José State University

Software engineer who owned end-to-end delivery of customer-facing agricultural forecast reporting (crop yield/health) and iterated quickly via rigorous edge-case testing and customer feedback. Also built an internal ML training platform (TypeScript/React + Flask/Python + MongoDB) used by every developer, with architecture designed to stay responsive under heavy compute load.

SK

Mid-level Data Scientist / ML Engineer specializing in streaming ML systems for healthcare and IoT

Urbandale, IA · 4y exp
John Deere · Auburn University at Montgomery

ML/GenAI engineer with production experience building an LLM-powered governance layer that summarizes verified drift/performance signals into validation reports and release notes, designed for regulated environments with de-identification and non-blocking fallbacks. Strong Airflow-based orchestration background across healthcare and finance, integrating Databricks/Spark and MLflow for scalable retraining/monitoring. Demonstrated ability to partner with non-technical healthcare operations teams to deliver actionable risk-scoring outputs via dashboards and automated reporting.

RR

Mid-level Data Scientist & Machine Learning Engineer specializing in fraud and forecasting

USA · 5y exp
JPMorgan Chase · University of Texas at Dallas

ML/LLM practitioner who has shipped production RAG systems (summarization + Q&A) and end-to-end Airflow-orchestrated demand forecasting pipelines at NEON IT. Strong focus on reliability—uses evaluation scripts, retrieval/chunking tuning, validation/retries/alerts, and stakeholder-driven iteration to make AI workflows consistent and usable.
