Vetted Model Evaluation Professionals

Pre-screened and vetted.

Kartikeya Tiwari

Senior Software Engineer specializing in AI systems and platform engineering

Bangalore, India | 6y exp
Coral | Swami Keshvanand Institute of Technology, Management & Gramothan, Jaipur

Backend/AI engineer with experience owning production systems in fintech and product startups, including a predictive scaling platform that cut AWS spend by 40% and an ambiguous social-intelligence feature that doubled MRR from $50K to $100K. Also building AI search and document-processing workflows, with reported 99.7% extraction accuracy and hands-on use of both classical forecasting and modern LLM stacks.

MB

Senior AI/Machine Learning Engineer specializing in production ML and IoT platforms

Winterville, NC | 17y exp
Freelance | East Carolina University

Backend/cloud engineer who built an AWS serverless IoT system that computes Bluetooth beacon locations from telemetry using heavy scientific Python (NumPy/SciPy/pandas) packaged as Dockerized Lambda, integrated with Java microservices and scheduled batch orchestration. Has deep AWS delivery experience (CI/CD with Code* tools, CloudFormation, cost controls) and has led high-severity incident response including CloudTrail forensics and infrastructure recovery after a compromised-keys crypto-mining attack.

VG

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ | 5y exp
HCLTech | Rowan University

Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.

TA

Junior Machine Learning Engineer specializing in Generative AI and analytics automation

Bengaluru, India | 2y exp
Accenture | University of Alabama at Birmingham

AI/LLM engineer who built a production intelligent support system using RAG over a vectorized documentation library, addressing real-world issues like lost-in-the-middle context failures and doc freshness via automated GitHub-driven re-embedding pipelines. Emphasizes rigorous agent evaluation (component/E2E/ops) and prefers lightweight, decoupled workflow automation using message brokers (Redis/RabbitMQ) over heavyweight orchestration frameworks.

PS

Mid-level QA Engineer specializing in AI/ML model validation and data quality

USA | 7y exp
Accenture | Clarkson University

ML practitioner with a QA background who has built end-to-end ML pipelines for a health risk prediction use case (lifestyle + demographics), emphasizing robustness through strict data validation, leakage prevention, and cross-validation. Collaborated with a dietician to sanity-check predictions and refine feature interpretation for real-world practicality; has not yet deployed LLM/AI systems to production and has no hands-on orchestration framework experience but is willing to learn.

Alex D'Souza

Screened

Junior Machine Learning Researcher specializing in healthcare AI and security

Davis, CA | 2y exp
University of California, Davis | UC Davis

Research-focused AI/ML candidate who built an fMRI-based classifier to predict schizophrenia treatment effectiveness under small-dataset constraints. Demonstrated pragmatic model selection by moving from a complex GNN to graph-summary feature engineering with logistic regression, significantly improving accuracy and AUC; primarily works in Google Colab with script-based workflows.

Srikanth Reddy

Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics

Plainsboro, NJ | 7y exp
State Street | Wilmington University

Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.

Michael Chaves

Senior Creative Technologist & Full-Stack UX Engineer specializing in Generative AI and XR

Los Altos, CA | 12y exp
Astrocade AI | San José State University

Design engineer/product designer who built an end-to-end creator + review/moderation system for a UGC platform, spanning automated checks, human QA, final review, and creator feedback. Comfortable working directly with HTML/CSS/TypeScript and component systems, using prototyping and field observation to reduce reviewer hesitation, improve consistency, and prevent creator errors upstream.

Rayyan Alam

Screened

Junior Robotics & Machine Learning Engineer specializing in autonomy and RAG systems

Arlington, VA | 1y exp
Manitou Research Inc. | University of Virginia

New-grad robotics software engineer with hands-on ROS 2 autonomy experience (Nav2, SLAM Toolbox, AMCL) and a strong track record debugging real-world instability (QoS, lifecycle timing, sensor dropouts). Built an HRI speech system on a Stretch 3 robot with deterministic, context-aware templates to manipulate trust/competence/emotion conditions, and integrated an LLM high-level planner that outputs PDDL for classical task planning and replanning.

Surya Mahesh

Screened

Mid-level Software Engineer specializing in backend and real-time automotive systems

India | 3y exp
Bosch Global Software | Cal State Long Beach

Hands-on ML practitioner who built and deployed an end-to-end phishing email classifier (CLI + simple web app), achieving 98% accuracy and reducing manual security triage. Emphasizes production reliability through input validation, graceful failure modes, monitoring/logging, and iterative error analysis, with experience hardening pipelines against messy backend/database data using fallbacks and idempotent processing.

MR

Mid-level AI/ML Engineer specializing in LLMs, RAG, and time-series forecasting

California, USA | 4y exp
Northern Trust | University of Massachusetts

ML/AI engineer with hands-on ownership of production recommendation and RAG systems at Northern Trust. They combine transformer modeling, latency optimization, cloud deployment, and monitoring with measurable business impact, including 14% accuracy gains, 12% engagement improvement, and 19% better query relevance.

SB

Senior AI/ML Engineer specializing in Generative AI, NLP, and regulated industries

Illinois, USA | 7y exp
Northern Trust | University of New Haven

Built end-to-end ML and GenAI systems at Northern Trust, including a production RAG-based document intelligence platform for financial reports and contracts. Stands out for combining strong MLOps execution with practical product judgment—improving forecast accuracy by 22%, document review accuracy by 38%, and cutting deployment time by 45% while keeping latency and reliability production-ready.

Adnane Lokman

Screened

Senior software engineer specializing in AI/ML and LLM platform delivery

Remote | 8y exp
UKG | University of Florida

ML/AI engineer with strong production ownership across predictive ML and Generative AI systems. They’ve delivered measurable business impact through real-time churn/drop-off prediction, RAG-based document QA, and scalable LLM optimization, with a consistent focus on reliability, safety, latency, and developer productivity.

Nisarg Shah

Screened

Entry-level Full-Stack Engineer specializing in distributed systems and ML platforms

Tempe, AZ | 1y exp
Arizona State University | Arizona State University

Early-career/new-grad candidate who built TrendScout AI, an evidence-first market intelligence agent that ingests messy news, extracts entities/events, builds a Neo4j knowledge graph, and answers questions via RAG with citations. Achieved ~95% retrieval relevance by combining ChromaDB semantic search with graph-based retrieval and validating outputs through human evaluation and guardrails to prevent hallucinations.

Prasanth Goli

Screened

Mid-level Data Scientist specializing in Generative AI and LLM production systems

United States | 5y exp
AT&T | Western Illinois University

Built and deployed a production LLM-powered workflow assistant that automated internal marketing/production business tasks (document summarization, repeated Q&A, status updates). Demonstrates end-to-end applied LLM engineering: modular RAG architecture, hallucination/latency mitigation, automated evals to prevent prompt regressions, and Azure-based orchestration (Functions/Logic Apps) with monitoring and controlled rollouts.

RE

Mid-level AI/ML Engineer specializing in NLP and Generative AI

Indiana, USA | 6y exp
Elevance Health | Indiana University Indianapolis

Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.

Madhuri Naik

Screened

Mid-level Data Scientist specializing in predictive analytics and LLM-powered data pipelines

Buffalo, NY | 3y exp
University at Buffalo | University at Buffalo

Early-career engineer from BNP Paribas who drove a large-scale observability modernization—selecting and implementing Prometheus/Grafana for a 2000+ server estate, then productionizing it on Kubernetes via Docker/Jenkins. Known for hands-on demos, strong documentation/templates, and pragmatic troubleshooting (including custom Python metrics) that improved visibility and cut debugging time by ~60%.

Sameer Shaik

Screened

Senior AI Engineer specializing in Generative AI, NLP, and applied deep learning

Chicago, IL | 8y exp
Live Nation | DePaul University

Built a production multi-agent LLM system at Live Nation on Databricks (LangGraph/LangChain) that let venue/event teams ask questions in Slack, auto-generated optimized route schedules, and produced inventory/stocking recommendations from historical SQL data and venue trends. Improved reliability by tightening prompts with strict JSON schemas, providing sample questions/SQL, and adding guardrails plus synthetic/edge-case testing, while iterating with event managers and senior VPs via prototypes and feedback loops.

Liam Huynh

Screened

Junior Software Engineer specializing in backend and full-stack development

San Francisco, CA | 2y exp
Handshake | University of Missouri-Kansas City

Backend Python engineer who owned an AI-driven healthcare staffing matching service, rebuilding the model inference/data pipeline to eliminate blocking bottlenecks and cutting API latency by ~33%. Experienced running Python services on Kubernetes with GitOps/ArgoCD, and has executed a cloud-to-on-prem rollout under tight resource and tooling constraints while also building event-driven streaming updates via a message broker.

Vasanthi N.

Screened

Senior AI/ML Engineer and Data Scientist specializing in Generative AI and MLOps

Los Angeles, CA | 9y exp
Pacific Community Bank | Aurora University

ML/NLP practitioner focused on financial-services document intelligence and compliance workflows—built an end-to-end pipeline to classify documents and extract financial entities from loan applications, emails, and statements stored in S3/internal databases. Strong in entity resolution/record linkage and in productionizing pipelines with GitHub Actions CI/CD, testing, data validation, and Docker, plus semantic search using OpenAI embeddings and a vector database.

Yash Tobre

Screened

Mid-level AI/ML Engineer specializing in computer vision, NLP/LLMs, and MLOps

Bentonville, AR | 4y exp
Dynetics | University of Texas at Arlington

ML/AI engineer with defense and commercial analytics experience: deployed a real-time aerial object detection system at Dynetics (YOLOv5 + TorchServe in Docker on AWS EC2) with drift-triggered retraining and 99.5% uptime, tackling ambiguous targets and weather degradation. Previously at Fractal Analytics, built and explained a churn prediction model for marketing stakeholders using SHAP and delivered it via a Flask API into dashboards, driving a reported 22% attrition reduction.

HK

Mid-level Data Analyst specializing in cloud ETL, BI, and machine learning

Texas, 75223 | 5y exp
UnitedHealth Group | University of Texas at Arlington

Data/ML practitioner with experience at UnitedHealth Group building a fraud claims detection solution combining structured claims data and unstructured notes, validated with compliance stakeholders to improve actionable accuracy. Also applied embeddings, vector databases, and fine-tuned language models in a Bank of America capstone to detect threats/anomalies in financial documents, with production-minded Python ETL workflows using Airflow.

Guerby Mertil

Screened

Senior Software Engineer specializing in cloud-native microservices and AI-enabled platforms

Jacksonville, Florida | 14y exp
Fanatics | West Virginia State University

Infrastructure/operations engineer with hands-on production IBM Power/AIX (AIX 7.x, VIOS, HMC) and PowerHA/HACMP clustering experience, including DLPAR changes, failover testing, and incident recovery. Also delivers modern cloud DevOps work—GitHub Actions CI/CD for Docker-to-Kubernetes on AWS and Terraform-based provisioning of core AWS infrastructure (VPC/EKS/RDS/IAM) with controlled rollouts and drift checks.

Rupak Chand

Screened

Junior ML Data Associate specializing in AI training data and LLM prompt evaluation

Connecticut | 2y exp
Amazon | Sacred Heart University

Applied ML/embodied AI practitioner who built an on-device gesture-control system for smart-home lights using Raspberry Pi + camera, focusing on privacy-preserving real-time inference and hardware-constrained optimization (async pipeline + TF Lite INT8). Also made a high-impact architecture decision for an ML content evaluation/QA pipeline processing millions of annotated text samples weekly, reducing batch runtime from ~6 hours to ~40 minutes while lowering compute cost.
