Vetted Model Evaluation Professionals

Pre-screened and vetted.

AK

Staff AI Systems Engineer specializing in multi-agent and distributed platforms

San Francisco Bay Area, CA18y exp
Reddit
View profile
AS

Arnav Singh

Screened

Junior Software Engineer specializing in full-stack web, cloud data, and applied ML

Hanover, NH2y exp
PlayStationDartmouth College

Backend engineer who evolved the X-Ray gaming analytics platform, leading a zero-downtime MongoDB→AWS DocumentDB migration with dual-write, checksum-based validation, and Kubernetes canary rollouts while maintaining real-time monitoring for millions of concurrent sessions. Strong in FastAPI/Python API scaling and performance tuning (cut latency from ~2s to <150ms and reduced DB load 90%) plus production-grade auth/RLS security patterns (JWT, Supabase Auth, PostgreSQL RLS).

View profile
Sourabh Jain - Director of Software Engineering specializing in enterprise Data, ML & AI platforms in Bay Area, CA

Sourabh Jain

Screened

Director of Software Engineering specializing in enterprise Data, ML & AI platforms

Bay Area, CA23y exp
RSA SecurityShri G. S. Institute of Technology and Science

Former Walmart Director of Software Engineering who left in March 2025 to build products for clients. Recently delivered an LLM/RAG-based UNSPSC classification solution for an MRO client using a multi-stage retrieval + web search + prompt-engineering workflow, and has led large-scale retail forecasting initiatives and high-severity cloud-migration incidents end-to-end.

View profile
JK

Mid-level Software Engineer specializing in backend, cloud, and AI systems

Seattle, WA4y exp
AmazonSaint Louis University

Engineer with hands-on experience across backend, full-stack, cloud, and AI/ML systems, with particular depth in Python, FastAPI, AWS Bedrock, SageMaker, and RAG-based architectures. Stands out for treating AI and agents as accelerators within disciplined production engineering, emphasizing guardrails, observability, latency/cost monitoring, and scalable system design.

View profile
BK

Balpreet Kaur

Screened

Junior Machine Learning Engineer specializing in LLMs and data pipelines

Amherst, MA2y exp
Google DeepMindUniversity of Massachusetts Amherst

Research Extern at Google DeepMind and former AWS Software Development Engineer Intern with a strong focus on practical, trustworthy AI engineering. Built a multi-agent RAG system for personalized news headline generation using a fine-tuned Flan-T5 model, parallel critic agents, FAISS retrieval, and style embeddings, while also leading a 3-person team on the project.

View profile
Likhitha Bethi - Mid-level Software Engineer specializing in backend systems, distributed systems, and applied AI in Stony Brook, NY

Mid-level Software Engineer specializing in backend systems, distributed systems, and applied AI

Stony Brook, NY4y exp
Stony Brook UniversityStony Brook University

Goldman Sachs engineer who owned end-to-end features for an internal onboarding and case management platform, spanning React/TypeScript UI, a GraphQL gateway, and Node + Spring WebFlux microservices. Built and operated a Kafka-based ingestion and search pipeline with DLQs, retries, idempotency, and strong observability, and improved developer experience via backward-compatible GraphQL API design and schema-driven documentation.

View profile
CW

Mid-level Robotics & Autonomy Engineer specializing in MPC, RL, and GPU-accelerated optimization

4y exp
Georgia Institute of TechnologyUC Berkeley

Robotics software engineer from Ati Motors who brought a Linear MPC approach (based on Kuhne et al.) into production, rebuilding parts of the planning stack to eliminate oscillations and safely double AMR speed from 0.8 m/s to 1.6 m/s. Also delivered an end-to-end point-cloud detection pipeline (PointPillars) including synthetic data generation in Isaac Sim and TensorRT deployment for real-time human/trolley detection, with a strong focus on production reliability via iterative hardening and nightly SIL.

View profile
HR

Mid-level Data Analytics professional specializing in BI, data engineering, and applied AI

California, USA6y exp
AmazonSan Jose State University

Built GenMedX, a multi-module clinical AI system for emergency department decision support spanning triage prediction, diagnosis, medication Q&A, and visit summarization. Stands out for combining medical LLM fine-tuning, RAG, and rigorous evaluation/monitoring to drive a major triage recall improvement from 38.5% to 76.6%, with a strong focus on safety, edge-case detection, and production reliability.

View profile
KD

Junior ML Engineer specializing in Generative AI and LLM applications

Thousand Oaks, California3y exp
NVIDIACalifornia Lutheran University

Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.

View profile
AY

Arwen Yang

Screened

Staff Applied Scientist specializing in multimodal LLM safety, robustness, and retrieval

Los Altos, CA8y exp
LibrAIUniversity of Melbourne

Built a production LLM-driven archival assistant that turns large, low-quality scanned handwritten files (120+ pages) into structured datasets, overcoming context-window and hierarchy challenges with a two-phase LLM + rules pipeline and reaching 98.1% accuracy (Gemini-2.5 Flash). Also orchestrated a large human-in-the-loop effort with 78 archivists, producing 2,400 high-quality annotations in 4 days via detailed rubrics and support.

View profile
VM

Vishal Mittal

Screened

Director-level Engineering Manager specializing in cloud security platforms and AI-driven automation

Fremont, CA18y exp
Palo Alto NetworksStanford University

Senior engineering leader in the Bay Area with experience spanning VMware, Hortonworks/Cloudera, Barracuda, and Palo Alto Networks, including leading open-source work (Apache Knox) and architecting large-scale security platforms. Has driven disaster recovery and cloud security products, designed Python microservices for Microsoft 365 security, and scaled teams (3x) while formalizing enterprise readiness practices with automated documentation using Notebook LLM.

View profile
Daniel Luzzatto - Junior Machine Learning Engineer specializing in LLMs, computer vision, and robotics in Tirat Carmel, Israel

Junior Machine Learning Engineer specializing in LLMs, computer vision, and robotics

Tirat Carmel, Israel1y exp
FusmobileUCLA

Built and deployed an agentic, multimodal LLM system that automates privacy redaction pipelines (audio/video/tabular) using LangChain orchestration and a closed-loop self-correction design. Personally implemented and performance-optimized core CV tooling (face blurring with tracking/Kalman filter) achieving >100 FPS on CPU, and validated reliability with golden-dataset benchmarking across 100+ privacy intents and measurable redaction metrics.

View profile
Yash Jajoo - Senior Software Engineer specializing in AI and FinTech platforms in New York City, NY

Yash Jajoo

Screened

Senior Software Engineer specializing in AI and FinTech platforms

New York City, NY8y exp
Walter AINew York University

Built a production LLM pipeline at Walter AI that scans massive user inboxes, identifies financial newsletters, and extracts trading strategies into structured JSON for downstream paper-trading workflows. Stands out for combining agent architecture with strong production discipline—cutting scan time from 20 to 5 minutes, reducing LLM costs by 90%, and achieving 3-second P99 latency while handling messy, inconsistent email data at scale.

View profile
Robert Davis - Staff Software Engineer specializing in backend and distributed systems in Remote, US

Robert Davis

Screened

Staff Software Engineer specializing in backend and distributed systems

Remote, US19y exp
InovalonGeorgia Tech

Backend engineer who co-launched SkyKick’s Office 365 SharePoint/Exchange backup product, built the MVP, and then architected and led its design for 9 years. Stands out for high-scale systems expertise, including an algorithmic redesign that cut cloud costs by an order of magnitude, plus earlier experience integrating speech recognition systems in noisy real-world customer environments.

View profile
LN

Mid-level Data Science AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems

USA3y exp
Samsara

Built a production RAG-based "knowledge copilot" for support/ops using LangChain/LangGraph, implementing the full pipeline (ingestion, chunking, embeddings, vector DB retrieval/rerank, guarded generation with citations) and operating it as monitored microservices with CI/CD. Also designed an event-driven, streaming backend for real-time inventory ordering predictions that reduced stockouts by 25%, and has hands-on incident response experience stabilizing LLM API latency/5xx spikes using Datadog/APM and resilience patterns.

View profile
AW

Senior QA Analyst specializing in VR and game testing

Burlingame, CA9y exp
MetaAcademy of Art University
View profile
RZ

Intern Machine Learning Researcher specializing in LLM and GNN security

Salt Lake City, UT0y exp
SamsungUniversity of Utah
View profile
Rohit Dhadvai - Mid-level AI/ML Engineer specializing in fraud detection and customer lifetime value modeling in Remote, USA

Mid-level AI/ML Engineer specializing in fraud detection and customer lifetime value modeling

Remote, USA4y exp
StripeGeorge Mason University
View profile
Piyush Jadhav - Junior Software Engineer specializing in DevOps and full-stack web development in Carlsbad, CA

Junior Software Engineer specializing in DevOps and full-stack web development

Carlsbad, CA1y exp
ViasatUC Santa Barbara
View profile
AM

Junior Robotics Engineer specializing in perception, control, and mechatronic prototyping

Peoria, IL3y exp
CaterpillarUniversity of Pennsylvania
View profile
Irtaza Syed - Senior Full-Stack Software Engineer specializing in cloud platforms and AI evaluation in Remote, California

Senior Full-Stack Software Engineer specializing in cloud platforms and AI evaluation

Remote, California9y exp
MetaKarlsruhe University of Applied Sciences
View profile
Julia Yoon - Junior Software Engineer specializing in distributed systems and AI evaluation in Pittsburgh, PA

Junior Software Engineer specializing in distributed systems and AI evaluation

Pittsburgh, PA2y exp
Carnegie Mellon UniversityCarnegie Mellon University
View profile

Need someone specific?

AI Search