Vetted Llama Professionals

Pre-screened and vetted.

SV

Mid-level Generative AI Engineer specializing in LLMs and RAG systems

5y exp
Summit Design and Technology
Northwest Missouri State University

Built and shipped a production RAG-based enterprise knowledge assistant to replace slow/inaccurate search across millions of documents, using LangChain orchestration with GPT-4/LLaMA and vector databases. Strong focus on production constraints (latency, hallucination control, and cost), addressed through hybrid retrieval, guardrails, LLM-as-judge validation, and model routing. Experienced in translating non-technical stakeholder pain points into measurable outcomes.

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS
3y exp
HCA Healthcare
VNR Vignana Jyothi Institute of Engineering and Technology

Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost: used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.

Daniel Berhane Araya

Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance

Fairfax, VA
9y exp
George Mason University
George Mason University

AI/LLM engineer with published work who built FinVet, a production financial misinformation detection system using multi-pipeline RAG, confidence-based voting, and evidence-backed outputs (F1 0.85, +37% vs baseline). Also built NexusForest-MCP, a Dockerized Model Context Protocol server exposing structured global deforestation/carbon data via SQL tools for reliable LLM tool use. Previously delivered borrower risk-rating (PD) models at BMO Financial Group that were validated and integrated into an enterprise credit system through close collaboration with credit officers and portfolio managers.

Hritvik Gupta

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI

San Francisco, CA
3y exp
Penn Medicine
UC Riverside

Built and scaled an AI-powered voice/chat patient engagement platform at Penn Medicine from early prototype into production clinical workflows, focusing on latency, edge cases, and user trust. Strong in LLM reliability engineering (structured prompts, validation/fallbacks), real-time troubleshooting with observability, and cross-functional enablement through pilots, demos, and sales/customer partnership.

VG

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ
5y exp
HCLTech
Rowan University

Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.

MY

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

6y exp
Elevance Health
MLR Institute of Technology

Built a production multi-agent orchestration platform to automate healthcare claims and HR workflows, combining LangChain/CrewAI/AutoGPT with RAG (FAISS/Pinecone) and fine-tuned open-source LLMs (LLaMA/Mistral/Falcon) in private Azure ML environments to meet HIPAA requirements. Emphasizes rigorous agent evaluation/observability (trajectory eval, adversarial testing, LLM-as-judge, drift monitoring) and reports measurable outcomes including 35% faster claims processing and 40% fewer chatbot errors.

MB

Mid-level AI Researcher specializing in multimodal LLMs and human-centered AI

Pittsburgh, PA
7y exp
University of Pittsburgh
University of Pittsburgh

Has production deployment experience delivering computer-vision systems on AWS (Docker + S3) including a GDPR-focused face/license-plate obfuscation pipeline and a semantic-segmentation project aimed at reducing annotation time. Worked closely with DevOps and frontend teams and partnered with CEO/CMO to present an AI-driven annotation workflow to non-technical VC stakeholders.

Yun-Hao Lee

Screened

Junior Machine Learning Engineer specializing in LLM deployment and computer vision

Dallas, TX
2y exp
Lab for Intelligent Storage and Computing
University of Texas at Dallas

Robotics/AI candidate who built an AI-driven landmark location tool during a summer internship at Mobile Drive, combining YOLOv5 object detection with OpenStreetMap-based geolocation to handle dense, cluttered urban environments. Also researched deploying LLM-based agents on constrained hardware using quantization plus LoRA/continuous learning, improving accuracy from ~80% to ~92%, with an emphasis on production logging for reliability.

Srikanth Reddy

Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics

Plainsboro, NJ
7y exp
State Street
Wilmington University

Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.

Sai Venkata Sathwik Golla

Mid-level Backend & Applied ML Engineer specializing in LLM systems and scalable APIs

Palo Alto, CA
3y exp
University at Buffalo
University at Buffalo

Backend engineer who significantly evolved an internal analytics/reporting platform (Python API + Postgres) powering self-service dashboards for product/business teams, focusing on reliability under heavy concurrent load and fast query performance. Demonstrates strong production engineering practices across API design (FastAPI), observability, incremental rollouts with feature flags, and data security using JWT/RBAC plus Postgres row-level security.

MR

Mid-level AI/ML Engineer specializing in LLMs, RAG, and time-series forecasting

California, USA
4y exp
Northern Trust
University of Massachusetts

ML/AI engineer with hands-on ownership of production recommendation and RAG systems at Northern Trust. They combine transformer modeling, latency optimization, cloud deployment, and monitoring with measurable business impact, including 14% accuracy gains, 12% engagement improvement, and 19% better query relevance.

Shivam Soni

Screened

Mid-Level Full-Stack Software Developer specializing in cloud-native microservices and AI/ML

Remote, USA
3y exp
Fidelity Investments
Arizona State University

Backend engineer who optimized an AI-driven portfolio analytics/insights platform at Fidelity, addressing latency and traffic growth by moving services toward microservices, improving service communication, and tuning API/DB performance. Experienced scaling Python/FastAPI services with Docker + Kubernetes autoscaling, and strengthening security/privacy for sensitive client portfolio data used in LLM-based reporting.

Goda Kodati

Screened

Mid-level Software Engineer specializing in Java/Spring backend and event-driven systems

Sunnyvale, CA
4y exp
Optum
University of North Carolina at Charlotte

Backend engineer from Optum who built and optimized a real-time, Kafka-driven healthcare claims processing platform handling 1M+ claims/month. Strong in reliability, state management, and observability for distributed systems, plus production deployment automation with Docker/Kubernetes and CI/CD; no direct ROS/robotics simulator experience yet but frames work in robotics-adjacent real-time principles.

RE

Mid-level AI/ML Engineer specializing in NLP and Generative AI

Indiana, USA
6y exp
Elevance Health
Indiana University Indianapolis

Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.

Sameer Shaik

Screened

Senior AI Engineer specializing in Generative AI, NLP, and applied deep learning

Chicago, IL
8y exp
Live Nation
DePaul University

Built a production multi-agent LLM system at Live Nation on Databricks (LangGraph/LangChain) that let venue/event teams ask questions in Slack, auto-generated optimized route schedules, and produced inventory/stocking recommendations from historical SQL data and venue trends. Improved reliability by tightening prompts with strict JSON schemas, providing sample questions/SQL, and adding guardrails plus synthetic/edge-case testing, while iterating with event managers and senior VPs via prototypes and feedback loops.

Yash Tobre

Screened

Mid-level AI/ML Engineer specializing in computer vision, NLP/LLMs, and MLOps

Bentonville, AR
4y exp
Dynetics
University of Texas at Arlington

ML/AI engineer with defense and commercial analytics experience: deployed a real-time aerial object detection system at Dynetics (YOLOv5 + TorchServe in Docker on AWS EC2) with drift-triggered retraining and 99.5% uptime, tackling ambiguous targets and weather degradation. Previously at Fractal Analytics, built and explained a churn prediction model for marketing stakeholders using SHAP and delivered it via a Flask API into dashboards, driving a reported 22% attrition reduction.

Karthik O

Screened

Mid-level AI Software Engineer specializing in LLM systems and cloud APIs

Kansas, USA
3y exp
Deloitte
University of Central Missouri

Built and productionized an LLM-powered support/knowledge pipeline using embeddings and retrieval (RAG) to deliver more grounded, higher-quality responses while reducing manual effort. Focused on real-world reliability and performance: added structured validation/guardrails, optimized vector search and context size for latency/scale, and monitored failure patterns in production. Experienced with orchestration via LangChain for LLM workflows and Airflow for production data/ML pipelines, and iterates closely with operations stakeholders through demos and feedback.

DP

Mid-level AI/ML Engineer specializing in LLMs, RAG, and enterprise MLOps

Baltimore, MD
4y exp
CVS Health
University of Maryland, Baltimore County

Backend engineer who built an AI-driven "Smart Feedback Analyzer" API (Flask → FastAPI) that processes user feedback with NLP (Hugging Face + OpenAI) and returns structured insights. Demonstrates strong production-minded architecture: stateless services, Cloud Run + Docker deployment, Redis/Celery background processing, and Postgres/SQLAlchemy performance tuning (EXPLAIN ANALYZE, indexing, N+1 fixes), plus multi-tenant data isolation via JWT/API-key derived tenant IDs.

Bhanu Gummadi

Screened

Mid-level Backend Software Engineer specializing in cloud-native microservices and FinTech

Bellevue, WA
4y exp
Mastercard
University of Central Missouri

Backend-focused engineer with Mastercard experience building and operating high-volume transaction-processing microservices. Has owned customer-facing banking services end-to-end and built an internal on-call analytics tool that centralized logs/metrics with real-time filtering to speed root-cause analysis and reduce incident investigation time.

Sagar Sidhwa

Screened

Senior AI/ML Engineer specializing in LLMs, MLOps, and predictive analytics

Jamestown, NY
6y exp
Cummins
Binghamton University

ML/AI engineer with hands-on experience building production MLOps systems for predictive maintenance and demand forecasting, including deployment, monitoring, and iterative retraining. Also shipped a RAG-based employee onboarding chatbot integrated with ServiceNow APIs and reports business impact of roughly $300k/month in reduced stockout and overstock costs.

AC

Principal Software Engineer specializing in enterprise AI platforms

Richardson, TX
12y exp
CBRE
University of Texas at Dallas

Built a production-grade LLM document processing and workflow orchestration platform at CBRE for internal operations teams, handling highly variable long-form documents with a reusable architecture involving 50+ coordinated LLM calls per request. Stands out for treating agentic systems like distributed backend infrastructure, with strong emphasis on evaluation, observability, reliability, and vendor-agnostic orchestration across Bedrock, Vertex AI, and OpenAI.

Saipraneeth Ketireddi

Mid-level AI/ML Engineer specializing in LLM automation and healthcare analytics

Dallas, TX
4y exp
LinkedIn
University of Texas at Dallas

Full-stack AI engineer who has repeatedly taken ambiguous automation and agentic products from prototype to production, including a BRD automation platform that cut manual processing by 70% and a healthcare RAG assistant with long-term memory. Stands out for combining backend/AI orchestration depth with strong product instincts around trust, observability, security, and non-technical user experience.

EC

Entry-level Full-Stack Engineer specializing in web, mobile, and AI security

Merced, CA
0y exp
Rogue’s Gallery
UC Merced

SG

Mid-level AI/ML Engineer specializing in NLP, GenAI, and AWS MLOps

St. Louis, MO
5y exp
S&P Global
University of Central Missouri
