Vetted Llama Professionals

Pre-screened and vetted.

RD

Mid-level Data Science & AI Engineer specializing in LLMs and cloud ML platforms

Los Angeles, CA · 6y exp
UpHealth · DePaul University

Built and deployed an LLM-powered mental health therapy assistant at UpHealth that segments users by stress level and delivers personalized, non-medical guidance. Implemented healthcare-focused safety guardrails (secondary LLM output filtering) and a multi-agent router workflow validated via statistical tests and therapist review, then scaled training/inference on AWS (EC2/Lambda/DynamoDB) with Kubernetes.

SP

Shubham Patil

Screened

Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics

New York, NY · 4y exp
Syracuse University · Syracuse University

Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).

MY

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

USA · 4y exp
State Street · Webster University

Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.

AS

Arju Singh

Screened

Mid-level Machine Learning Engineer specializing in LLM apps, RAG pipelines, and MLOps

2y exp
Pervaziv AI · Indiana University Bloomington

Software engineer with connected-car/automotive production experience who owned an end-to-end remote door lock/unlock feature and introduced unit testing (GTest) plus rig/simulator validation. Also built and productionized an AI-native AWS cloud cost assistant (Lex + GPT-based LLM + Lambda + RAG/vector DB) with guardrails and achieved 94% evaluation accuracy. Helped replace a third-party solution with an in-house build, saving the company ~€9M.

Teja Babu Mandaloju

Mid-level Data Scientist/MLOps Engineer specializing in NLP, GenAI, and cloud ML platforms

Chicago, USA · 5y exp
Vosyn · University of North Texas

AI/ML engineer who led production deployment of a multimodal (text/video/image) RAG system on GCP using Gemini 2.5 + Vertex AI Vector Search, scaling to 10M+ documents with sub-second latency and +40% retrieval accuracy. Strong MLOps/orchestration background (Kubernetes, CI/CD, Airflow, MLflow) with proven impact on reliability (75% fewer incidents) and deployment speed (92% faster), plus experience delivering explainable ML (XGBoost + SHAP + Tableau) to non-technical retail stakeholders.

Bhavana Anna

Screened

Mid-level AI/ML Engineer specializing in fraud detection and Generative AI (RAG)

USA · 5y exp
USAA · Kennesaw State University

AI/ML engineer who has shipped production LLM and ML systems, including a RAG pipeline that ingested ~500k insurance/client documents to help adjusters answer questions faster and more consistently. Experienced in handling messy real-world document formats, tuning retrieval/chunking, and reducing latency via vector search optimization, precomputed embeddings, and caching. Also built orchestrated fraud-detection deployment workflows using AWS Step Functions and SageMaker, and partners closely with non-technical operations teams on NLP automation.

SC

Sai Charan C

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS

CT, USA · 3y exp
HCLTech · University of New Haven

Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.

Cameron Shapoorian

Mid-level Test Automation & AI Integration Engineer

3y exp
Bland AI · University of Colorado Boulder

Forward-deployed/solutions-oriented engineer with experience shipping enterprise LLM voice-agent workflows from prototype to production, including variable extraction and API integrations. Demonstrated strong real-time troubleshooting via logs/RCA (e.g., fixing multilingual language-switching by tuning temperature and improving context), and has led technical workshops while partnering with sales/solutions teams to drive customer adoption.

Monisha Nettem

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA · 5y exp
M&T Bank · Kennesaw State University

AI/ML engineer with banking domain experience (M&T Bank) who built a production credit-risk prediction and reporting platform combining ML models (XGBoost/TensorFlow) with a RAG pipeline (LangChain + GPT-4) over compliance documents. Delivered measurable impact (≈20% better risk detection/precision, 50% less manual reporting) and productionized workflows on Vertex AI/Kubeflow with CI/CD and monitoring; also implemented embedding-based semantic search using FAISS/Pinecone.

Sreyas Gangji

Screened

Mid-level Software Engineer specializing in AI/ML backend systems

Chicago, IL · 4y exp
ZS · DePaul University

AI/data engineer at ZS Associates focused on production-grade agentic systems, FastAPI microservices, and cloud-native ETL/RAG pipelines at significant scale. They’ve built multi-agent validation and diagnostic workflows inspired by their Copilot/KUBEPILOT AI work, supporting 500K+ records per day while improving ML inference performance by ~30% and cutting manual troubleshooting by 60%.

SC

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

Atlanta, GA · 4y exp
Universal Health Services · University of New Haven

Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.

KG

Mid-level Generative AI Engineer specializing in LLM agents and RAG

Chesterfield, MO · 4y exp
Reinsurance Group of America · University of Central Missouri

GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.

NB

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL · 4y exp
Lumen · University of South Florida

AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.

SP

Saran Palle

Screened

Mid-level Applied AI Engineer specializing in agentic LLM workflows

North Carolina · 4y exp
Acentrik Technology Solutions · University at Buffalo

AI engineer with production experience building a LangGraph-based, stateful multi-agent system at MetLife to automate complex insurance claims adjudication, integrating document discovery, Azure Document Intelligence OCR/extraction, and health data analysis. Strong in agent orchestration and production deployment (Docker + FastAPI REST APIs), with a structured approach to reliability, evaluation, and stakeholder-driven requirements.

DP

Drashti Patel

Screened

Junior Software Engineer and ML Researcher specializing in full-stack and applied deep learning

Indiana, USA · 3y exp
Purdue University · Purdue University

LLM engineer who built a production-style educational questionnaire generation system (MCQs/fill-in-the-blanks/short answers) using Hugging Face models (BERT/T5) and implemented grounding, decoding tuning, and post-generation validation to control hallucinations and quality. Also developed a "tech care" assistant chatbot with a custom Python orchestration/router layer (intent classification, context management, multi-step flows) and a structured testing/evaluation approach including expert review and automated checks.

Pravalika Kuppireddy

Mid-level AI/ML Engineer specializing in Generative AI and intelligent automation

4y exp
University of Michigan-Dearborn · University of Michigan-Dearborn

LLM engineer who built and productionized a system to classify GitHub commits (performance vs non-performance) using zero-/few-shot approaches over commit messages and diffs, working at ~5M-record scale on multi-node NVIDIA GPUs. Experienced orchestrating end-to-end LLM pipelines with Airflow and GitHub Actions, and emphasizes reliability via testing, guardrails, and observability while collaborating closely with non-technical product stakeholders.

AK

Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps

KS, USA · 4y exp
Black & Veatch · University of Central Missouri

Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.

Pooja Miryala

Screened

Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for banking and healthcare

Ohio, USA · 4y exp
Fifth Third Bank · Youngstown State University

Deployed a real-time LLM-driven call center summarization and agent-assist platform at Fifth Third Bank, combining transformer models (BERT/GPT) with FastAPI inference on AKS and vector storage (ChromaDB/PostgreSQL). Emphasizes production-grade reliability (autoscaling, CI/CD, monitoring) and measurable evaluation (A/B testing), and translates model outputs into business-facing Power BI insights for call center leadership.

Sai Somapalli

Screened

Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants

USA · 6y exp
Stellar AI Solutions · Campbellsville University

GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.

LD

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps

Atlanta, GA · 3y exp
AIG · Kennesaw State University

Data professional with ~4 years of experience, most recently at AIG (insurance), building ML/NLP systems for fraud detection and policy automation using transformers, CNNs, and clustering/anomaly detection. Also developed a RAG-based knowledge retrieval system, iterating across embedding models and moving to production based on precision and latency SLAs, then containerizing and deploying with SageMaker and CI/CD.

Bhavishyasai Chigurupati

Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms

Overland Park, KS · 5y exp
Cigna · University of Central Missouri

Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.

Saketh Kota

Screened

Mid-level Data Scientist / ML Engineer specializing in Generative AI, RAG, and MLOps

Irving, TX · 4y exp
U.S. Bank

Built and productionized a RAG-based LLM research assistant for biomedical and regulatory document search using Mixtral 7B on SageMaker, LangChain, and Milvus, cutting research time by ~40%. Has hands-on multi-cloud MLOps experience across AWS/Azure/GCP with Kubeflow/Airflow/Composer plus Terraform + ArgoCD, and applies rigorous evaluation/monitoring (latency, accuracy, hallucinations). Also partnered with a non-technical PM to deliver an insurance policy Q&A chatbot that reduced customer response time by 30%+.

LD

Mid-level Data Scientist & AI/ML Engineer specializing in GenAI, NLP, and predictive modeling

Remote, USA · 5y exp
Centene · Adelphi University

Sathvik Vadlapatla

Mid-level AI/ML Engineer specializing in Generative AI, RAG pipelines, and NLP

Chattanooga, TN · 4y exp
Unum · Stevens Institute of Technology