Vetted LlamaIndex Professionals

Pre-screened and vetted.

SG

Mid-level AI/ML Engineer specializing in LLMs, search ranking, and multimodal ML

San Francisco, CA5y exp
NVIDIAUniversity of North Texas
View profile
AC

Staff Full-Stack Engineer specializing in cloud microservices and AI-enabled platforms

San Antonio, TX11y exp
OptumUniversity of Houston
View profile
MW

Senior Agentic AI & Backend Engineer specializing in LLM platforms and multi-agent systems

10y exp
MoveworksRutgers University
View profile
TM

Senior AI/ML Software Engineer specializing in LLMs, NLP, and scalable ML platforms

Round Rock, TX15y exp
MetaMontclair State University
View profile
JY

Intern AI/ML Engineer specializing in LLM agents, RAG, and low-latency systems

Cambridge, MA2y exp
MIT CSAILUC Berkeley
View profile
KS

Senior AI/ML Engineer specializing in LLMs, RAG, and multimodal recommendation systems

CA6y exp
PerplexityVirginia Tech
View profile
JL

Executive AI Architect specializing in enterprise GenAI and LLM platforms

18y exp
Cogrithm.comUniversity of Colorado Boulder
View profile
HK

Harish Kasu

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

San Francisco, CA5y exp
NVIDIATexas A&M University-Kingsville

AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.

View profile
EL

Senior Full-Stack Software Engineer specializing in Telehealth and FinTech

Santa Clara, CA11y exp
AmazonUCLA
View profile
KK

Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure

San Francisco, CA6y exp
PerplexityStevens Institute of Technology
View profile
SP

Senior AI/ML Engineer specializing in GenAI, agentic systems, and healthcare AI

Bonham, TX12y exp
AnthropicTexas Tech University
View profile
YF

Mid-Level Software Development Engineer specializing in AWS serverless and ML/GenAI

Irvine, CA5y exp
AmazonUniversity of Chicago
View profile
Nikhil Reddy - Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms in San Francisco, CA

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp
NVIDIASaint Louis University

Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.

View profile
AR

Anagha Ram

Screened

Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search

Los Altos, CA2y exp
Columbia UniversityCornell University

Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.

View profile
MJ

Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems

Mesquite, TX11y exp
AmazonUniversity of Texas at Dallas

ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.

View profile
AA

Senior Full-Stack Python Developer specializing in cloud-native RAG and microservices

NY, USA6y exp
Google DeepMindUniversity of Saint Francis
View profile
SR

Mid-level Machine Learning Engineer specializing in LLMs and RAG systems

San Francisco, CA5y exp
Scale AIUniversity of New Haven
View profile
RG

Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems

San Francisco, CA5y exp
NVIDIAArizona State University
View profile
SL

Staff Software Engineer specializing in FinTech, AI/ML, and cloud microservices

Remote14y exp
CitigroupUniversity of Pennsylvania
View profile
DK

Senior Customer Success & Technical Account Leader specializing in AI/ML infrastructure

Pleasanton, CA19y exp
Lightning AIUC Berkeley
View profile
VP

Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps

Mountain View, CA5y exp
MetaUniversity of North Carolina at Charlotte
View profile
AC

Director of AI/ML Engineering specializing in MLOps, data platforms, and 3D computer vision

Teaneck, NJ10y exp
AetrexColumbia University

Backend/data engineer focused on production ML/LLM systems: built a real-time FastAPI inference API on Kubernetes with strong reliability patterns (timeouts, idempotent retries, centralized error handling). Delivered AWS platforms using EKS + Lambda with GitHub Actions/Helm CI/CD and built Glue-based ETL from S3/Kafka into Snowflake with schema evolution and data-quality controls; also modernized legacy analytics/recommendation workflows into Python services with safe, feature-flagged cutovers.

View profile
Osvaldo Calles - Senior Software Engineer specializing in developer tools, cloud automation, and generative AI in Redmond, WA

Senior Software Engineer specializing in developer tools, cloud automation, and generative AI

Redmond, WA13y exp
AmdocsUniversidad Autónoma de Guadalajara

Built and deployed a production chatbot on osvaldocalles.com and iterated through real-world LLM engineering issues: model quota/cost tradeoffs (migrating to Nova Pro), RAG accuracy via semantic chunking, AWS IAM/guardrail/security pitfalls, and Lambda/API Gateway streaming constraints (prefers JS for streaming layer). Experienced with agent orchestration using Strands SDK (AWS-focused) and LangGraph (Vercel/container deployments), plus evaluation pipelines using LLM-as-evaluator, dashboards, and staged model rollouts.

View profile

Need someone specific?

AI Search