Vetted Llama Professionals

Pre-screened and vetted.

WT

Executive engineering leader specializing in AI platforms and Healthcare IT

Salem, OR21y exp
Adoreal Inc.University of Maryland, College Park

Engineering executive and former CTO with a rare blend of enterprise healthcare AI leadership and consumer AI product building for neurodiverse users. Led Adoreal’s U.S. expansion, scaled a multidisciplinary org by about 60%, modernized platform architecture with Kubernetes and CI/CD, and consistently ties engineering and AI decisions to trust, onboarding efficiency, and revenue outcomes.

View profile
YP

Yash Pise

Screened

Mid-level Data Scientist specializing in Generative AI, LLMOps, and clinical data pipelines

5y exp
NovartisStevens Institute of Technology

LLM/RAG engineer who has built and deployed corporate-scale systems at Novartis and Johnson & Johnson, including a healthcare AI agent that generates day-to-day treatment schedules. Recently handled a high-stakes safety incident (LLM suggesting overdose) by tightening model instructions and validating with ~200 test prompts, and has strong end-to-end data/embedding/vector DB pipeline experience (PySpark, FAISS, Pinecone) plus SME-in-the-loop evaluation (RLHF).

View profile
SK

Mid-level GenAI/ML Engineer specializing in LLM agents and RAG for Financial Services & Healthcare

5y exp
Bank of AmericaVirginia Commonwealth University

Built and deployed a production GenAI internal support agent at Bank of America (“Ask GPS/AskGPT”) using RAG on Azure, focused on reducing escalations and improving response quality for repetitive knowledge-based queries. Demonstrates strong production LLM engineering: custom LangChain orchestration, retrieval tuning to reduce hallucinations, rigorous offline/online evaluation, and model benchmarking with dynamic routing (e.g., GPT-4 vs Claude).

View profile
SK

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA4y exp
ServiceNowValparaiso University

ServiceNow engineer who built and launched a production LLM-powered ticket resolution/knowledge assistant using RAG (LangChain + Hugging Face embeddings + vector search) integrated into internal support dashboards via REST APIs. Optimized the system from ~6–8s to ~2–3s latency while improving usability with concise, cited answers and guardrails (grounding + similarity thresholds), delivering ~30–35% reduction in manual ticket investigation effort.

View profile
Sachin Komati - Mid-level AI/ML Engineer specializing in GenAI, RAG, and healthcare ML in Florida, USA

Sachin Komati

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG, and healthcare ML

Florida, USA5y exp
BlackRockFlorida International University

Built an end-to-end GenAI/RAG platform for financial compliance and research at BlackRock, focused on safe, auditable answers in a highly regulated environment. Combines strong LLM engineering depth with production platform skills and delivered clear business impact, including reducing research/compliance turnaround from hours to seconds, improving retrieval relevance by 22%, and cutting inference costs by 75%.

View profile
MG

Mid-level Software Development Engineer specializing in cloud-native AI/ML systems

California, USA4y exp
ServiceNowCal State Long Beach

AI/ML-focused engineer with practical experience building RAG-based and multi-agent systems, including architectures for retrieval, reasoning, context processing, and response generation. Stands out for combining LLM productivity gains with disciplined software engineering practices like validation, monitoring, and reproducibility.

View profile
Anshika Bajpai - Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps in Bloomington, IN

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

Bloomington, IN4y exp
Indiana UniversityIndiana University Bloomington

Engineer with impactful experience at Palo Alto Networks and Optum, focused on production automation and AI-powered internal tools. Built and owned an end-to-end RAG knowledge system adopted by 1000+ internal users with roughly 75% faster response times, and also transformed a legacy Optum coverage-feed workflow from 500+ minutes to under 3 minutes through data standardization and microservices refactoring.

View profile
BP

Bhargav Patel

Screened

Mid-level Full-Stack Developer specializing in AI-driven FinTech systems

Remote, USA3y exp
BarclaysRowan University

Built and shipped AI-powered fraud investigation and risk dashboard features at Barclays, combining full-stack engineering with RAG-based LLM systems in a compliance-sensitive environment. Stands out for pairing explainable AI architecture with measurable business impact: improving fraud detection accuracy by 18% while reducing analyst investigation time by 14%.

View profile
VM

Senior Data Scientist specializing in GenAI, LLMs and RAG

Dallas, TX5y exp
Texas InstrumentsTrine University

Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.

View profile
JV

Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP

Iselin, NJ5y exp
Wells FargoSt. Francis College

Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.

View profile
SG

Mid-Level Software Engineer specializing in secure cloud microservices and FinTech

Remote, USA4y exp
BrexSyracuse University

Built and owned major parts of a real-time distributed AI fraud-detection pipeline (ingestion, inference microservice integration, and automated action layer), optimizing latency and observability and reducing false positives by ~35%. Understands ROS/ROS2 concepts (nodes/topics/services) and planned hands-on ramp-up via ROS2 pub/sub exercises and Gazebo simulation, but has not worked on physical robots or ROS in production.

View profile
HK

Mid-level Data/ML Engineer specializing in NLP, GenAI, and scalable data pipelines

5y exp
AbbottClarkson University

AI/ML engineer with production experience building LLM-powered document intelligence and customer support systems in healthcare/insurance, emphasizing high-accuracy RAG, long-document processing, and robust monitoring/fallback mechanisms. Also automates and scales ML lifecycle workflows using Apache Airflow and Kubeflow, and partners closely with non-technical operations stakeholders to drive adoption.

View profile
HG

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

NJ, USA4y exp
Red HatOklahoma Christian University

Red Hat ML/LLM engineer who designed and deployed a production LLM-powered customer support automation system using RAG, improving latency by 30% via PEFT and vector search optimization. Built security and governance into retrieval (access-level filtering, encrypted Pinecone/ChromaDB) and delivered SHAP-based explainability via a dashboard for non-technical stakeholders. Experienced orchestrating distributed ML/RAG pipelines across AWS SageMaker and OpenShift with Airflow/Prefect, plus multi-agent workflows using CrewAI and LangGraph.

View profile
SM

Mid-level Data Scientist specializing in NLP/LLMs, time series forecasting, and MLOps

New York, NY6y exp
CitigroupKent State University

Data/ML practitioner with hands-on experience building NLP systems from prototype to production: delivered a Twitter sentiment classifier with robust preprocessing, SVM modeling, and Power BI reporting, and built entity-resolution pipelines for messy multi-source customer data (reporting ~95% improvement in unique entity identification). Also implemented semantic linking/search using SBERT embeddings with FAISS vector retrieval and domain fine-tuning (reported ~15% precision lift), and applies production workflow best practices (Airflow/Prefect, Docker, Azure ML/Databricks, Great Expectations).

View profile
Abhinav Gupta - Junior Machine Learning Engineer specializing in LLMs and applied data science

Abhinav Gupta

Screened

Junior Machine Learning Engineer specializing in LLMs and applied data science

2y exp
EsriUSC

Built and shipped multiple production AI systems, including Auto DocGen (LLM-generated OpenAPI docs kept in sync via AST diffs, schema-constrained generation, and CI/CD on Render) and a multimodal sign-language recognition pipeline at USC orchestrated with FastAPI, MediaPipe, and PyTorch. Also partnered with Esri’s non-technical community team to fine-tune an LLaMA-based spam classifier with a review UI, cutting moderation time by 70%.

View profile
Rui Cheng - Mid-level Software Engineer specializing in autonomous driving simulation and 3D mapping

Rui Cheng

Screened

Mid-level Software Engineer specializing in autonomous driving simulation and 3D mapping

5y exp
SimForge AIHuazhong University of Science and Technology

Founding software engineer who built an autonomous-vehicle 3D digital twin using Unreal Engine 5 and CARLA, owning core simulator logic (traffic/scenarios/weather) and a ROS 2-based pipeline to record synchronized multi-sensor data (RGB/depth/segmentation/LiDAR/IMU/GPS). Also implemented distributed synchronization patterns (server + client prediction) using FastAPI and WebSockets; seeking roles with H1B transfer and targeting ~$110k.

View profile
Sanjana Duvva - Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

Sanjana Duvva

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMOps, and MLOps

5y exp
Wells FargoUniversity of North Texas

Built and deployed an AWS-based LLM/RAG ticket triage and knowledge retrieval system (Pinecone/FAISS + Step Functions + MLflow) that cut support resolution time by 20%. Demonstrates strong production focus on hallucination reduction, PII security, and low-latency orchestration, with measurable evaluation improvements (e.g., ~25% grounding accuracy gain via re-ranking) and proven collaboration with support operations stakeholders.

View profile
Sri Harshitha Yannam - Junior Software Engineer specializing in AI/ML and cloud platforms in Austin, TX

Junior Software Engineer specializing in AI/ML and cloud platforms

Austin, TX2y exp
AmazonUniversity of Wisconsin–Milwaukee

LLM/agent engineer who shipped a production "Memory Assistant" at HydroX AI, building a LangChain/LlamaIndex RAG memory pipeline on ChromaDB/FAISS with robust fallbacks (BERT/BART), prompt-injection mitigation, and 99.9% uptime monitoring. Also built a multi-step customer support agent using Rasa + OpenAI Assistants API with structured tool calling, guardrails, and human-in-the-loop escalation, and has experience hardening agents against messy ERP data via Pydantic validation, idempotency, and transactional outbox patterns.

View profile
Wei-Hsien Wang - Entry-level AI Engineer specializing in full-stack generative AI systems in San Jose, CA

Entry-level AI Engineer specializing in full-stack generative AI systems

San Jose, CA1y exp
AzazieUC San Diego

AI/full-stack product engineer who has shipped both user-facing and internal LLM products, from a photo-to-music recommendation app to an experimentation agent at Azazie. Stands out for combining modern app development with production-grade agent and GraphRAG systems, including a 500k+ email analysis platform and measurable impact like 3x experiment velocity, 75% setup-time reduction, and 65% faster task discovery.

View profile
SM

Mid-level AI/ML Engineer specializing in GenAI, NLP, and MLOps

Connecticut, USA5y exp
PfizerUniversity of New Haven

Built and deployed an enterprise GenAI knowledge assistant over thousands of internal PDFs/reports using a RAG stack (GPT-4 + Hugging Face embeddings + vector DB) to reduce manual search and SME escalations. Uses LangGraph/LangChain to orchestrate modular agent workflows with relevance filtering and fallback handling, and applies rigorous evaluation (golden datasets, edge cases, A/B tests) with production monitoring metrics.

View profile
SS

Junior Software Engineer specializing in ML, distributed systems, and LLM applications

Austin, TX1y exp
ZondaUC San Diego

Interned at Zonda where he built an AI-driven semantic search solution over ~280M housing/builder records. Iterated from local LLMs via llama.cpp quantization to a vector-embedding retrieval system, then boosted semantic accuracy with a custom spaCy NER layer and re-ranking, optimizing for latency through precomputation. Collaborated with economics-focused stakeholders to reduce manual document/paperwork time by enabling natural-language search over internal data.

View profile
MT

Mihir Trivedi

Screened

Junior Machine Learning & Quant Research Engineer specializing in low-latency data and trading systems

New York, NY3y exp
Astera HoldingsColumbia University

Applied ML to physical EV fleet systems at ST Labs, building a real-time CNN-LSTM fault prediction pipeline from streaming vehicle telemetry and addressing live data alignment issues via resampling/interpolation and buffered inference. Also developed a V2G/G2V energy transfer algorithm to automate charging/discharging for profit optimization, and made high-impact low-latency pipeline decisions at Astera Holdings using profiling, replay testing, and live A/B validation.

View profile
SA

Mid-level Software Engineer specializing in cloud-native microservices and AI-powered web applications

Remote, USA5y exp
BigCommerceArizona State University

Backend engineer who built and owned an AI-powered SMS survey platform for a nonprofit serving at-risk communities (internet-limited users), using Cloudflare Workers + Twilio and a state-machine survey engine. Scaled it to ~10k active users with near-zero downtime, added English/Spanish support, and iteratively improved LLM behavior (Claude 3.7 Sonnet) to handle nuanced, real-world SMS responses reliably.

View profile
AP

Mid-level Machine Learning Engineer specializing in fraud detection and LLM applications

Charlotte, NC5y exp
Bank of AmericaUniversity of North Carolina at Charlotte

Unreal Engine UI engineer focused on scalable, production-ready UI architecture (C++/Slate/UMG/CommonUI) with strong designer enablement via decoupled, interface-driven patterns and MVVM. Demonstrated measurable performance wins: replaced 200+ per-frame Blueprint bindings to cut UI prepass/paint from 4.2ms to 0.5ms and reduced VRAM by ~120MB using texture streaming proxies.

View profile

Need someone specific?

AI Search