Vetted Llama Professionals

Pre-screened and vetted.

PK

Mid-level AI/ML Engineer specializing in NLP, GenAI, and MLOps in healthcare and finance

USA5y exp
CVS HealthUniversity of Houston

AI/ML engineer with CVS Health experience deploying production LLM systems in regulated healthcare settings, including a large-scale RAG solution (1M+ documents) built for compliance-grade, auditable policy/regulatory Q&A with strong anti-hallucination controls. Also delivered an NLP summarization system for physician notes/case narratives by partnering closely with non-technical care operations stakeholders and iterating via prototypes, dashboards, and feedback loops.

View profile
GS

Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG

Auburn Hills, MI4y exp
StellantisUniversity of Cincinnati

ML/NLP practitioner who built a retrieval-augmented generation (RAG) system for large financial and operational document sets using Sentence-Transformers (all-mpnet-base-v2) and a vector DB (e.g., Pinecone), with a strong focus on retrieval evaluation and chunking strategy optimization. Experienced in entity resolution (rules + embedding similarity with type-specific thresholds) and in productionizing scalable Python data workflows using Airflow/Dagster and Spark.

View profile
RP

Ruudra Patel

Screened

Junior Data Scientist specializing in ML, LLMs, and RAG applications

Atlanta, GA3y exp
Georgia State UniversityGeorgia State University

University hackathon finalist (2nd place) who built CareerSpark, a production-style multi-agent career guidance app in 24 hours using a hierarchical debate architecture with a moderator/judge agent. Has startup internship experience at LiveSpheres AI using LangChain for multi-LLM orchestration, and demonstrates a structured approach to testing/evaluation (golden sets, integration sims, latency/accuracy KPIs) plus strong non-technical stakeholder communication.

View profile
AR

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

3y exp
State FarmCleveland State University

Built a secure, on-prem/private GPT assistant to replace manual SharePoint-style search across thousands of policies/SOPs/engineering docs, using a production RAG stack (LangChain/LangGraph, FAISS/Chroma, PyMuPDF+OCR, vLLM). Implemented layout-aware ingestion (including table-to-JSON) and a multi-agent retrieval/generation/verification workflow with strong observability and compliance guardrails, delivering ~70% reduction in search time.

View profile
MK

Mansoor Khan

Screened

Mid-level Conversational AI Developer specializing in enterprise chatbots and RAG

WI, USA6y exp
LivePersonConcordia University Wisconsin

ML/AI practitioner with hands-on experience deploying models to production and optimizing for low-latency inference using pruning/quantization, with deployments on AWS SageMaker and Azure ML. Has orchestrated end-to-end ML pipelines with Airflow and Kubeflow (ingestion through evaluation) and emphasizes reproducibility via containerization and version-controlled artifacts, while effectively partnering with non-technical stakeholders using dashboards and business-aligned metrics.

View profile
Apurva Banka - Mid-level Full-Stack & AI Engineer specializing in cloud, data platforms, and LLM automation in Houston, TX

Apurva Banka

Screened

Mid-level Full-Stack & AI Engineer specializing in cloud, data platforms, and LLM automation

Houston, TX5y exp
Jay Logistics & Trade LLCUniversity at Buffalo

Software engineer/product builder who has owned an agentic affiliate lead-gen platform end-to-end (Django + React/TypeScript) and deployed it on Kubernetes in anticipation of 10x user growth from ~5K DAUs. Also has healthcare claims microservices experience using Kafka, including hands-on performance tuning to address consumer lag and broker pressure, and built an internal downtime alerting tool adopted across the organization.

View profile
Manpreet Kour - Senior Data Scientist specializing in Generative AI and NLP in Seattle, USA

Manpreet Kour

Screened

Senior Data Scientist specializing in Generative AI and NLP

Seattle, USA6y exp
SOTIDr. B. R. Ambedkar National Institute of Technology, Jalandhar

ML/NLP engineer with recent Scotiabank experience building production-grade indexing automation over large-scale emails and customer databases, combining LLM fine-tuning (Mistral, XLM-R) with fuzzy matching to exceed 95% accuracy under strict banking constraints. Also built a RAG-based chat agent using Gecko embeddings, Vertex AI Search, Gemini, and cross-encoder reranking, and delivered a text-to-SQL chatbot at SOTI through iterative fine-tuning and benchmark-driven experimentation.

View profile
Phani K - Mid-level AI/ML Engineer specializing in GenAI, NLP, and healthcare-financial ML in Terre Haute, IN

Phani K

Screened

Mid-level AI/ML Engineer specializing in GenAI, NLP, and healthcare-financial ML

Terre Haute, IN4y exp
UnitedHealth GroupIndiana State University

ML/AI engineer with hands-on experience shipping healthcare AI systems, including an oncology risk prediction platform and RAG-based clinical decision support tools. Stands out for combining clinical domain context with strong production engineering across Spark, FastAPI, AWS SageMaker, monitoring, evaluation, and safety guardrails.

View profile
RP

Senior Full-Stack Engineer specializing in AI, backend systems, and supply chain platforms

Lawrenceville, NJ6y exp
Rutgers UniversityRutgers University

Full-stack engineer with hands-on experience spanning React/TypeScript frontends, Cloudflare serverless RAG systems, SQL-heavy backend redesigns, and computer vision workflows. He has shipped practical automation and reliability improvements with measurable impact, including cutting a video-validation reporting process from a week to 2 days and fixing a memory-heavy shipment system before Black Friday to support 30K+ orders successfully.

View profile
VM

Mid-level Machine Learning & Full-Stack Engineer specializing in GenAI platforms

San Francisco, CA5y exp
WellDhanNortheastern University

LLM/agent builder who has shipped production AI systems in the wellness space, including an LLM-powered food tracking product used by 5000+ users and a voice/call-routing onboarding workflow using LangGraph/LangChain with LiveKit and Twilio. Strong focus on practical reliability work: latency reduction, retrieval/embedding tuning, and CI-driven evaluation with simulations and metrics.

View profile
Muaaz Syed - Mid-level AI/ML Engineer specializing in NLP and conversational AI in Richardson, TX

Muaaz Syed

Screened

Mid-level AI/ML Engineer specializing in NLP and conversational AI

Richardson, TX4y exp
CVS HealthUniversity of Texas at Dallas

ML/NLP engineer focused on real-time IT ops analytics, building a predictive maintenance/anomaly detection platform end-to-end (multi-source ETL, streaming, modeling, and production deployment on GCP/Vertex AI). Uses deep learning (LSTMs, autoencoders/VAEs) plus embeddings (SentenceBERT) and vector search to improve incident correlation and search, citing ~40% reduction in duplicate alert noise.

View profile
Harideep Balusa - Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems in USA

Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems

USA6y exp
Freddie MacUniversity of Wisconsin

Built and productionized Azure-based LLM/RAG systems for regulatory/compliance use cases, including automating analyst research and compliance report generation across large unstructured document sets. Demonstrates strong practical depth in hallucination mitigation, hybrid retrieval tuning (BM25 + embeddings), and production MLOps (Databricks, Cognitive Search, AKS, Airflow/MLflow), plus proven ability to deliver auditable, explainable solutions with non-technical compliance teams.

View profile
SV

Mid-level Generative AI Engineer specializing in LLMs and RAG systems

5y exp
Summit Design and TechnologyNorthwest Missouri State University

Built and shipped a production RAG-based enterprise knowledge assistant to replace slow/inaccurate search across millions of documents, using LangChain orchestration with GPT-4/LLaMA and vector databases. Strong focus on production constraints—latency, hallucination control, and cost—using hybrid retrieval, guardrails, LLM-as-judge validation, and model routing, and has experience translating non-technical stakeholder pain points into measurable outcomes.

View profile
SV

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS3y exp
HCA HealthcareVNR Vignana Jyothi Institute of Engineering and Technology

Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.

View profile
Hritvik Gupta - Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI in San Francisco, CA

Hritvik Gupta

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI

San Francisco, CA3y exp
Penn MedicineUC Riverside

Built and scaled an AI-powered voice/chat patient engagement platform at Penn Medicine from early prototype into production clinical workflows, focusing on latency, edge cases, and user trust. Strong in LLM reliability engineering (structured prompts, validation/fallbacks), real-time troubleshooting with observability, and cross-functional enablement through pilots, demos, and sales/customer partnership.

View profile
Daniel Berhane Araya - Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance in Fairfax, VA

Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance

Fairfax, VA9y exp
George Mason UniversityGeorge Mason University

AI/LLM engineer with published work who built FinVet, a production financial misinformation detection system using multi-pipeline RAG, confidence-based voting, and evidence-backed outputs (F1 0.85, +37% vs baseline). Also built NexusForest-MCP, a Dockerized Model Context Protocol server exposing structured global deforestation/carbon data via SQL tools for reliable LLM tool use. Previously delivered borrower risk-rating (PD) models at BMO Financial Group that were validated and integrated into an enterprise credit system through close collaboration with credit officers and portfolio managers.

View profile
VG

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ5y exp
HCLTechRowan University

Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.

View profile
MY

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

6y exp
Elevance HealthMLR Institute of Technology

Built a production multi-agent orchestration platform to automate healthcare claims and HR workflows, combining LangChain/CrewAI/AutoGPT with RAG (FAISS/Pinecone) and fine-tuned open-source LLMs (LLaMA/Mistral/Falcon) in private Azure ML environments to meet HIPAA requirements. Emphasizes rigorous agent evaluation/observability (trajectory eval, adversarial testing, LLM-as-judge, drift monitoring) and reports measurable outcomes including 35% faster claims processing and 40% fewer chatbot errors.

View profile
MB

Mid-level AI Researcher specializing in multimodal LLMs and human-centered AI

Pittsburgh, PA7y exp
University of PittsburghUniversity of Pittsburgh

Has production deployment experience delivering computer-vision systems on AWS (Docker + S3) including a GDPR-focused face/license-plate obfuscation pipeline and a semantic-segmentation project aimed at reducing annotation time. Worked closely with DevOps and frontend teams and partnered with CEO/CMO to present an AI-driven annotation workflow to non-technical VC stakeholders.

View profile
YL

Yun-Hao Lee

Screened

Junior Machine Learning Engineer specializing in LLM deployment and computer vision

Dallas, TX2y exp
Lab for Intelligent Storage and ComputingUniversity of Texas at Dallas

Robotics/AI candidate who built an AI-driven landmark location tool during a summer internship at Mobile Drive, combining YOLOv5 object detection with OpenStreetMap-based geolocation to handle dense, cluttered urban environments. Also researched deploying LLM-based agents on constrained hardware using quantization plus LoRA/continuous learning, improving accuracy from ~80% to ~92%, with an emphasis on production logging for reliability.

View profile
Sai Venkata Sathwik Golla - Mid-level Backend & Applied ML Engineer specializing in LLM systems and scalable APIs in Palo Alto, CA

Mid-level Backend & Applied ML Engineer specializing in LLM systems and scalable APIs

Palo Alto, CA3y exp
University at BuffaloUniversity at Buffalo

Backend engineer who significantly evolved an internal analytics/reporting platform (Python API + Postgres) powering self-service dashboards for product/business teams, focusing on reliability under heavy concurrent load and fast query performance. Demonstrates strong production engineering practices across API design (FastAPI), observability, incremental rollouts with feature flags, and data security using JWT/RBAC plus Postgres row-level security.

View profile
Srikanth Reddy - Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics in Plainsboro, NJ

Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics

Plainsboro, NJ7y exp
State StreetWilmington University

Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.

View profile
MR

Mid-level AI/ML Engineer specializing in LLMs, RAG, and time-series forecasting

California, USA4y exp
Northern TrustUniversity of Massachusetts

ML/AI engineer with hands-on ownership of production recommendation and RAG systems at Northern Trust. They combine transformer modeling, latency optimization, cloud deployment, and monitoring with measurable business impact, including 14% accuracy gains, 12% engagement improvement, and 19% better query relevance.

View profile
SS

Shivam Soni

Screened

Mid-Level Full-Stack Software Developer specializing in cloud-native microservices and AI/ML

Remote, USA3y exp
Fidelity InvestmentsArizona State University

Backend engineer who optimized an AI-driven portfolio analytics/insights platform at Fidelity, addressing latency and traffic growth by moving services toward microservices, improving service communication, and tuning API/DB performance. Experienced scaling Python/FastAPI services with Docker + Kubernetes autoscaling, and strengthening security/privacy for sensitive client portfolio data used in LLM-based reporting.

View profile

Need someone specific?

AI Search