Vetted Model Evaluation Professionals

Pre-screened and vetted.

Sai Charan Kolla - Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS in TX, USA

TX, USA - 5y exp
BlackRock - Texas A&M University-Kingsville

LLM engineer who built a production document intelligence/RAG pipeline to extract structured data from thousands of unstructured PDFs, cutting manual review time by 60%. Experienced with LangChain and Airflow orchestration plus rigorous evaluation (labeled datasets, prompt testing, HITL review, monitoring) to improve accuracy and reduce hallucinations while partnering closely with non-technical operations stakeholders.

ST

Director-level telecom standards leader specializing in 3GPP, 5G/6G architecture, and AI-driven networks

Chicago Area, IL - 23y exp
SIXTH-LOGIC ADVISORY - Oklahoma State University

Telecommunications strategist with more than two decades of experience spanning 3G through emerging 6G, focused on shaping global standards and advising senior leadership. Created a global AI-enabled autonomous systems framework in NGMN, has multiple patents in telecom, and is particularly differentiated by his ability to represent company interests in 3GPP while driving interoperable, multi-vendor 6G ecosystems.

BN

Mid-level Machine Learning Engineer specializing in AI/LLM systems

New York, NY - 5y exp
ServiceNow - University at Buffalo

ML/LLM systems engineer who has owned AI support automation products end-to-end, including ServiceNow-integrated incident routing, RAG-based resolution suggestion systems, and production stabilization. Stands out for combining hands-on platform work across PySpark, AWS Glue, FastAPI, Kubernetes, and Pinecone with measurable operational impact, including 30-35% MTTR reduction and 25-30% improvement in first-touch resolution.

Chaitanya Prasad Reddy Narala - Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems in USA

USA - 4y exp
ServiceNow - Saint Louis University

Senior AI/ML engineer focused on production LLM systems, combining RAG, fine-tuning, distributed training, and AI safety to ship scalable real-time moderation and conversational AI platforms. Stands out for pairing deep AWS/Kubernetes MLOps expertise with measurable impact: 40% lower latency/cost, 30-50% fewer hallucinations, and major reliability gains through observability and automation.

AA

Anton Artemov

Screened

Senior AI Product Manager specializing in pricing, experimentation, and data products

New York City, NY - 12y exp
VicRoads - Moscow Technical University of Communication and Informatics

Product leader at VicRoads who rebuilt the Custom Plates digital portfolio, including an AI-powered pricing and recommendation platform that drove more than 20% revenue uplift. Stands out for combining commercial product thinking with explainable AI, regulatory compliance, and legacy-system modernization in a government-related environment.

SL

Mid-level AI/ML Engineer specializing in LLMs and Generative AI

Missouri, USA - 3y exp
Scale AI - Saint Louis University

Full-stack engineer with hands-on ownership of both a marketplace discovery product and a zero-to-one AI document intelligence assistant. Brings a strong blend of React/Node/MongoDB product engineering and production RAG/LLM system design, with clear experience turning technically complex AI capabilities into usable experiences for non-technical users.

DV

Mid-level AI/ML Engineer specializing in MLOps, NLP/LLMs, and computer vision

Remote, USA - 4y exp
Barclays - Yeshiva University

Built and shipped a production LLM/RAG risk-case summarization and triage system used by fraud/compliance analysts, with strong grounding controls (evidence-cited outputs and refusal on low confidence). Demonstrates end-to-end ownership across retrieval quality, Airflow-orchestrated indexing pipelines, and compliance-grade privacy (PII redaction, RBAC, encrypted redacted logging, and auditable prompt/model versioning) plus a tight feedback loop with non-technical domain experts.

SD

Sai Dev

Screened

Mid-level AI/ML Engineer specializing in MLOps, computer vision, and NLP

Newark, CA - 4y exp
Lucid Motors - Cleveland State University

GenAI/ML engineer from Lucid Motors who built and productionized an LLM-powered RAG diagnostic assistant for manufacturing and maintenance teams, deployed on AWS with Docker/Kubernetes and MLflow. Demonstrates end-to-end ownership from retrieval/prompt design to scalability, monitoring, and workflow integration via APIs, plus production ML pipeline orchestration with Kubeflow (Spark/Kafka + TensorFlow) for predictive maintenance use cases.

AA

Mid-Level Full-Stack Python Engineer specializing in cloud APIs and data/ML platforms

Bentonville, AR - 4y exp
Walmart - University of Central Missouri

Backend engineer at Goldman Sachs who deployed internal LLM-powered utilities to summarize operational logs/tickets, with a strong emphasis on data sensitivity and reliability. Built deterministic workflows with template-based prompts, confidence checks, and rule-based fallbacks, and used monitoring plus failure-rate metrics to tune performance; also has hands-on Temporal orchestration experience for resilient async backend jobs.

HK

Mid-level Data/ML Engineer specializing in NLP, GenAI, and scalable data pipelines

5y exp
Abbott - Clarkson University

AI/ML engineer with production experience building LLM-powered document intelligence and customer support systems in healthcare/insurance, emphasizing high-accuracy RAG, long-document processing, and robust monitoring/fallback mechanisms. Also automates and scales ML lifecycle workflows using Apache Airflow and Kubeflow, and partners closely with non-technical operations stakeholders to drive adoption.

YX

Yuan Xu

Screened

Junior Machine Learning Engineer specializing in multimodal AI and audio deepfake detection

Berkeley, California - 3y exp
Scam AI - Carnegie Mellon University

Internship experience building production-oriented AI systems, including a real-time voice scam/spoof detector (RawNet + AASIST) hardened for noisy audio via aggressive augmentation and Zoom-based noise simulation, evaluated with EER on clean and wild datasets. Also built an LLM-driven UI automation agent using Playwright for apps like Linear/Notion with modular tool design, unit tests, and replayable scripted scenarios, and has AWS Step Functions experience orchestrating Lambda/Cognito workflows.

HG

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

NJ, USA - 4y exp
Red Hat - Oklahoma Christian University

Red Hat ML/LLM engineer who designed and deployed a production LLM-powered customer support automation system using RAG, improving latency by 30% via PEFT and vector search optimization. Built security and governance into retrieval (access-level filtering, encrypted Pinecone/ChromaDB) and delivered SHAP-based explainability via a dashboard for non-technical stakeholders. Experienced orchestrating distributed ML/RAG pipelines across AWS SageMaker and OpenShift with Airflow/Prefect, plus multi-agent workflows using CrewAI and LangGraph.

SG

Mid-level AI/ML Engineer specializing in GenAI, LLMs, RAG, and MLOps

St. Louis, MO - 5y exp
Centene - Saint Louis University

Built and deployed a production LLM-powered RAG document intelligence/Q&A system for healthcare prior authorization, reducing manual medical document review time and improving decision efficiency. Strong in end-to-end LLM application engineering (LangChain/LangGraph), retrieval quality improvements (hybrid search, embedding tuning, chunking strategies), and rigorous evaluation/monitoring for reliability.

Mahesh Kumar Duvvuri - Senior Full-Stack Software Engineer specializing in microservices and cloud-native systems in New York City, NY

New York City, NY - 4y exp
JPMorgan Chase - University of Dayton

Backend/infra engineer with experience across Nestle, J.P. Morgan, and Capgemini, combining ML systems work (YOLOv8/PyTorch object detection with TFLite edge deployment) with production-grade cloud/Kubernetes operations. Has delivered measurable impact via AWS migrations (25% cost reduction, 99.9% availability), microservice modernization (35% faster processing), and low-latency Kafka streaming for financial dashboards (<100ms) using DLQs and idempotent consumers.

Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp
Wells Fargo - University of North Texas

Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.

DD

Drew Dunn

Screened

Senior AI Engineer specializing in generative AI and production ML systems

Aledo, TX - 14y exp
Elevance Health - Texas Tech University

ML/AI engineer with hands-on ownership of production computer vision, speech, and legal RAG systems. Notably improved a key-duplication CV pipeline enough to unblock commercial launch and remove specialist manual measurement, and also shipped a live Quran recitation detection feature for a product with 1M+ users.

SJ

Shuhan Jhang

Screened

Mid-level software engineer specializing in backend systems, AI, and semiconductor data platforms

San Jose, CA - 4y exp
Vibie AI - Northeastern University

Built and shipped an end-to-end autonomous telemetry and log-triage product that combined LLM-based anomaly analysis, strict typed validation, and a React observability UI. Particularly compelling is their focus on making non-deterministic AI reliable in production at scale (500,000 daily requests and 99.9% uptime) while also translating complex AI output into a usable experience for non-technical teams during live outages.

AG

Amit Gaur

Screened

Mid-level AI Engineer specializing in LLMs and production ML systems

Long Beach, CA - 4y exp
California State University, Long Beach - California State University, Long Beach

Engineering leader with hands-on AI/ML systems experience spanning production inference infrastructure and consumer-facing LLM products. At Jio, they led a 17-person AI features team and delivered measurable execution gains, including 40% faster deployments and 35% lower prediction latency, while also building an end-to-end RAG-based meal recommendation product using OpenAI and Gemini.

BC

Bhuvan Chandi

Screened

Mid-level Data Engineer specializing in AI/ML data platforms

NY, NY - 6y exp
BlackRock - Webster University

Built and productionized an LLM-powered PDF document Q&A system to eliminate manual searching through long documents, focusing on scalability and answer reliability. Implemented semantic chunking (using headings/paragraphs/tables), overlap, and preprocessing/quality checks to reduce hallucinations, and orchestrated the end-to-end pipeline with Airflow using retries, alerts, and parallel tasks.

MS

Min-Han Shih

Screened

Junior Machine Learning Engineer specializing in speech and multimodal AI

Taipei, Taiwan - 2y exp
Furbo - USC

New grad who has shipped a production vision-language recommendation feature for a pet camera/mobile app, including building a tagged video dataset with human annotators and optimizing inference by FPS downsampling under device compute limits. Also built a multimodal MLLM benchmark using an LLM-as-judge (GPT-5-thinking) with a feedback loop, validated against human scoring, and measured post-feedback quality gains (12% average score improvement).

IS

Irfan Shaik

Screened

Mid-level AI Software Engineer specializing in risk and fraud detection

Los Angeles, California - 4y exp
Visa - George Mason University

AI/software engineer with experience at Visa building a real-time transaction fraud/risk scoring microservice in the card authorization path (Python, Kafka, Kubernetes on AWS) with strict 120–150ms latency constraints and reason-code outputs for downstream decisioning. Owns ML backend end-to-end (data/feature engineering, model training, deployment) and has demonstrated production reliability work including latency spike mitigation, SLO-based observability, drift monitoring, and safe fallbacks to rule-based decisions.

RH

Rahul Hatkar

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps

San Francisco, CA - 6y exp
Scale AI - Webster University

AI/ML engineer who has shipped production AI systems end-to-end, including an automated multi-channel (Gmail/WhatsApp/voice) candidate interviewing workflow and an enterprise RAG knowledge search platform. Demonstrates strong production rigor (monitoring, A/B tests, guardrails, schema validation, shadow testing) with quantified impact: ~60–70% reduction in interview evaluation time and ~20–30% relevance gains in RAG retrieval.
