Vetted Model Evaluation Professionals

Pre-screened and vetted.

Bala Venkateswarlu K - Mid-level Data Scientist specializing in Generative AI, NLP, and MLOps in USA

Mid-level Data Scientist specializing in Generative AI, NLP, and MLOps

USA5y exp
MetLifeHarrisburg University of Science and Technology

Built and deployed an LLM-powered claims-document summarization system (insurance domain) that cut agent review time from 4–5 minutes to under 2 minutes and saved 1,200+ hours per quarter. Hands-on across orchestration and production infrastructure (Airflow retraining DAGs, Kubernetes, SageMaker endpoints, FastAPI) and recent RAG workflows using n8n + Pinecone, with a strong focus on reliability, cost, and explainability for non-technical stakeholders.

View profile
SM

Sathvika Meka

Screened

Mid-level Data Analyst specializing in BI, analytics, and healthcare data

Remote, USA4y exp
CVS HealthUniversity of South Florida

Analytics professional at Optum with hands-on experience turning messy healthcare claims data from SQL, Excel, and CRM systems into validated reporting datasets and Power BI dashboards. They also built reproducible Python workflows for claims analysis and owned an end-to-end project focused on improving claims processing efficiency through metric design, segmentation, and stakeholder-driven operational improvements.

View profile
RK

Senior AI/ML Engineer specializing in LLMs, generative AI, and applied research

Boca Raton, FL10y exp
ModMedFlorida Atlantic University

Research-heavy ML/AI candidate with a PhD/publications background who translated LLM evaluation and clinical summarization techniques into production at ModMed. They owned an end-to-end healthcare GenAI pipeline that cut clinician documentation time from ~22 minutes to ~7-8 minutes, reduced token costs by ~30%, and built an internal evaluation framework later adopted by multiple teams.

View profile
John Chance - Senior Machine Learning Engineer specializing in conversational AI and healthcare ML in Greenwood, LA

John Chance

Screened

Senior Machine Learning Engineer specializing in conversational AI and healthcare ML

Greenwood, LA9y exp
Elevance HealthMedaille University

ML/AI engineer with hands-on ownership of both classical recommender systems and safety-sensitive LLM agent platforms. They combine production MLOps depth with behavioral health domain experience, including clinical safety validation, explainability, and multi-agent orchestration, and cite measurable impact in both business metrics and latency reduction.

View profile
SN

Mid-level AI/ML Engineer specializing in GenAI, NLP, and financial systems

Texas, USA5y exp
CitibankConcordia University, St. Paul

GenAI/ML engineer with hands-on experience building production financial intelligence and document summarization systems at Citibank. Stands out for combining LLM fine-tuning, hybrid RAG, multi-agent workflows, and strong MLOps/observability practices to deliver measurable business impact, including 60% faster analyst retrieval, 31% higher precision, and 99%+ uptime.

View profile
VD

Vimala Devi

Screened

Mid-level AI & Machine Learning Engineer specializing in FinTech

Texas, USA4y exp
The HartfordUniversity of Houston

ML/AI engineer with hands-on experience building production systems in financial services, including a real-time underwriting analytics platform at Hartford Financial Services. Stands out for combining classic ML, low-latency API deployment, monitoring, and emerging LLM/RAG design patterns, with measurable impact including 20% better decision accuracy, sub-200ms latency, and 5M+ records processed daily.

View profile
kartikeya tiwari - Senior Software Engineer specializing in AI systems and platform engineering in Bangalore, India

Senior Software Engineer specializing in AI systems and platform engineering

Bangalore, India6y exp
CoralSwami Keshvanand Institute of Technology, Management & Gramothan, Jaipur

Backend/AI engineer with experience owning production systems in fintech and product startups, including a predictive scaling platform that cut AWS spend by 40% and an ambiguous social-intelligence feature that doubled MRR from $50K to $100K. Also building AI search and document-processing workflows, with reported 99.7% extraction accuracy and hands-on use of both classical forecasting and modern LLM stacks.

View profile
YM

Junior AI Engineer specializing in LLM systems and applied machine learning

San Francisco, CA2y exp
LangChainUniversity of the Pacific

Yogesh is an AI/full-stack engineer from LangChain who says he was the sole developer and core maintainer of OpenSWE/OpenSpeed, an asynchronous coding agent in LangSmith Cloud that turns requests from Slack, Linear, and GitHub into reviewable PRs. He emphasizes production-grade agent infrastructure: event-driven workflow design, typed run states, observability, retries, and latency improvements via pre-warmed sandboxes.

View profile
TA

Junior Machine Learning Engineer specializing in Generative AI and analytics automation

Bengaluru, India2y exp
AccentureUniversity of Alabama at Birmingham

AI/LLM engineer who built a production intelligent support system using RAG over a vectorized documentation library, addressing real-world issues like lost-in-the-middle context failures and doc freshness via automated GitHub-driven re-embedding pipelines. Emphasizes rigorous agent evaluation (component/E2E/ops) and prefers lightweight, decoupled workflow automation using message brokers (Redis/RabbitMQ) over heavyweight orchestration frameworks.

View profile
VG

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ5y exp
HCLTechRowan University

Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.

View profile
PS

Mid-level QA Engineer specializing in AI/ML model validation and data quality

USA7y exp
AccentureClarkson University

ML practitioner with a QA background who has built end-to-end ML pipelines for a health risk prediction use case (lifestyle + demographics), emphasizing robustness through strict data validation, leakage prevention, and cross-validation. Collaborated with a dietician to sanity-check predictions and refine feature interpretation for real-world practicality; has not yet deployed LLM/AI systems to production and has no hands-on orchestration framework experience but is willing to learn.

View profile
Rayyan Alam - Junior Robotics & Machine Learning Engineer specializing in autonomy and RAG systems in Arlington, VA

Rayyan Alam

Screened

Junior Robotics & Machine Learning Engineer specializing in autonomy and RAG systems

Arlington, VA1y exp
Manitou Research Inc.University of Virginia

New-grad robotics software engineer with hands-on ROS 2 autonomy experience (Nav2, SLAM Toolbox, AMCL) and a strong track record debugging real-world instability (QoS, lifecycle timing, sensor dropouts). Built an HRI speech system on a Stretch 3 robot with deterministic, context-aware templates to manipulate trust/competence/emotion conditions, and integrated an LLM high-level planner that outputs PDDL for classical task planning and replanning.

View profile
Michael Chaves - Senior Creative Technologist & Full-Stack UX Engineer specializing in Generative AI and XR in Los Altos, CA

Senior Creative Technologist & Full-Stack UX Engineer specializing in Generative AI and XR

Los Altos, CA12y exp
Astrocade AISan José State University

Design engineer/product designer who built an end-to-end creator + review/moderation system for a UGC platform, spanning automated checks, human QA, final review, and creator feedback. Comfortable working directly with HTML/CSS/TypeScript and component systems, using prototyping and field observation to reduce reviewer hesitation, improve consistency, and prevent creator errors upstream.

View profile
Alex D'Souza - Junior Machine Learning Researcher specializing in healthcare AI and security in Davis, CA

Alex D'Souza

Screened

Junior Machine Learning Researcher specializing in healthcare AI and security

Davis, CA2y exp
University of California, DavisUC Davis

Research-focused AI/ML candidate who built an fMRI-based classifier to predict schizophrenia treatment effectiveness under small-dataset constraints. Demonstrated pragmatic model selection by moving from a complex GNN to graph-summary feature engineering with logistic regression, significantly improving accuracy and AUC; primarily works in Google Colab with script-based workflows.

View profile
Srikanth Reddy - Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics in Plainsboro, NJ

Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics

Plainsboro, NJ7y exp
State StreetWilmington University

Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.

View profile
SM

Surya Mahesh

Screened

Mid-level Software Engineer specializing in backend and real-time automotive systems

India3y exp
Bosch Global SoftwareCal State Long Beach

Hands-on ML practitioner who built and deployed an end-to-end phishing email classifier (CLI + simple web app), achieving 98% accuracy and reducing manual security triage. Emphasizes production reliability through input validation, graceful failure modes, monitoring/logging, and iterative error analysis, with experience hardening pipelines against messy backend/database data using fallbacks and idempotent processing.

View profile
MR

Mid-level AI/ML Engineer specializing in LLMs, RAG, and time-series forecasting

California, USA4y exp
Northern TrustUniversity of Massachusetts

ML/AI engineer with hands-on ownership of production recommendation and RAG systems at Northern Trust. They combine transformer modeling, latency optimization, cloud deployment, and monitoring with measurable business impact, including 14% accuracy gains, 12% engagement improvement, and 19% better query relevance.

View profile
SB

Senior AI/ML Engineer specializing in Generative AI, NLP, and regulated industries

Illinois, USA7y exp
Northern TrustUniversity of New Haven

Built end-to-end ML and GenAI systems at Northern Trust, including a production RAG-based document intelligence platform for financial reports and contracts. Stands out for combining strong MLOps execution with practical product judgment—improving forecast accuracy by 22%, document review accuracy by 38%, and cutting deployment time by 45% while keeping latency and reliability production-ready.

View profile
AL

Adnane Lokman

Screened

Senior software engineer specializing in AI/ML and LLM platform delivery

Remote8y exp
UKGUniversity of Florida

ML/AI engineer with strong production ownership across predictive ML and Generative AI systems. They’ve delivered measurable business impact through real-time churn/drop-off prediction, RAG-based document QA, and scalable LLM optimization, with a consistent focus on reliability, safety, latency, and developer productivity.

View profile
RF

Mid-level Software Engineer specializing in backend microservices and AI-integrated platforms

Dallas, TX4y exp
Blue Cross Blue ShieldUniversity of Texas at Arlington

Full-stack engineer with experience spanning AI-powered product features and healthcare fraud detection systems. Has built end-to-end LLM-enabled applications, customer-facing recommendation systems at scale, and operational platforms that improved real-time investigations and flagged over 1,200 high-risk cases quarterly.

View profile
NS

Nisarg Shah

Screened

Junior Software Engineer specializing in data, systems, and AI engineering

Arizona, USA2y exp
Arizona State UniversityArizona State University

Early-career/new-grad candidate who built TrendScout AI, an evidence-first market intelligence agent that ingests messy news, extracts entities/events, builds a Neo4j knowledge graph, and answers questions via RAG with citations. Achieved ~95% retrieval relevance by combining ChromaDB semantic search with graph-based retrieval and validating outputs through human evaluation and guardrails to prevent hallucinations.

View profile
SR

Senior Full-Stack Software Engineer specializing in AI agents and data platforms

Remote7y exp
AT&TCalifornia State University, Los Angeles

Full-stack and AI-focused builder who has shipped both customer-facing personalization at AT&T and internal LLM-powered automation/agent systems in startup environments. Stands out for combining TypeScript-heavy engineering rigor with practical AI orchestration, evaluation, and measurable business impact—from reducing support escalation through personalization to saving 10-11 hours per week by automating fragmented operational workflows.

View profile
PG

Prasanth Goli

Screened

Mid-level Data Scientist specializing in Generative AI and LLM production systems

United States5y exp
AT&TWestern Illinois University

Built and deployed a production LLM-powered workflow assistant that automated internal marketing/production business tasks (document summarization, repeated Q&A, status updates). Demonstrates end-to-end applied LLM engineering: modular RAG architecture, hallucination/latency mitigation, automated evals to prevent prompt regressions, and Azure-based orchestration (Functions/Logic Apps) with monitoring and controlled rollouts.

View profile
RE

Mid-level AI/ML Engineer specializing in NLP and Generative AI

Indiana, USA6y exp
Elevance HealthIndiana University Indianapolis

Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.

View profile

Need someone specific?

AI Search