Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Llama Professionals

Pre-screened and vetted.

Llama Python Docker SQL CI/CD AWS

Sai Vivek Reddy Gankidi

Screened

Mid-level Generative AI Engineer specializing in LLMs and RAG systems

5y exp

Summit Design and TechnologyNorthwest Missouri State University

“Built and shipped a production RAG-based enterprise knowledge assistant to replace slow/inaccurate search across millions of documents, using LangChain orchestration with GPT-4/LLaMA and vector databases. Strong focus on production constraints—latency, hallucination control, and cost—using hybrid retrieval, guardrails, LLM-as-judge validation, and model routing, and has experience translating non-technical stakeholder pain points into measurable outcomes.”

Python PyTorch TensorFlow Keras Hugging Face Transformers+82

View profile

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS3y exp

HCA HealthcareVNR Vignana Jyothi Institute of Engineering and Technology

“Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.”

Agile AWS Azure DevOps Azure Functions Azure Machine Learning Bash+131

View profile

Daniel Berhane Araya

Screened

Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance

Fairfax, VA9y exp

George Mason UniversityGeorge Mason University

“AI/LLM engineer with published work who built FinVet, a production financial misinformation detection system using multi-pipeline RAG, confidence-based voting, and evidence-backed outputs (F1 0.85, +37% vs baseline). Also built NexusForest-MCP, a Dockerized Model Context Protocol server exposing structured global deforestation/carbon data via SQL tools for reliable LLM tool use. Previously delivered borrower risk-rating (PD) models at BMO Financial Group that were validated and integrated into an enterprise credit system through close collaboration with credit officers and portfolio managers.”

Python NumPy Pandas SQL PostgreSQL SQLite+112

View profile

Hritvik Gupta

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI

San Francisco, CA3y exp

Penn MedicineUC Riverside

“Built and scaled an AI-powered voice/chat patient engagement platform at Penn Medicine from early prototype into production clinical workflows, focusing on latency, edge cases, and user trust. Strong in LLM reliability engineering (structured prompts, validation/fallbacks), real-time troubleshooting with observability, and cross-functional enablement through pilots, demos, and sales/customer partnership.”

AWS AWS Lambda C++CI/CD Communication Data Engineering+78

View profile

Varun Gattamaneni

Screened

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ5y exp

HCLTechRowan University

“Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.”

Python Pandas NumPy R SQL Bash+150

View profile

Manish Yamsani

Screened

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

6y exp

Elevance HealthMLR Institute of Technology

“Built a production multi-agent orchestration platform to automate healthcare claims and HR workflows, combining LangChain/CrewAI/AutoGPT with RAG (FAISS/Pinecone) and fine-tuned open-source LLMs (LLaMA/Mistral/Falcon) in private Azure ML environments to meet HIPAA requirements. Emphasizes rigorous agent evaluation/observability (trajectory eval, adversarial testing, LLM-as-judge, drift monitoring) and reports measurable outcomes including 35% faster claims processing and 40% fewer chatbot errors.”

Agentic AI Anomaly Detection API Integration AWS AWS Glue AWS Lambda+116

View profile

Maneesh Bilalpur

Screened

Mid-level AI Researcher specializing in multimodal LLMs and human-centered AI

Pittsburgh, PA7y exp

University of PittsburghUniversity of Pittsburgh

“Has production deployment experience delivering computer-vision systems on AWS (Docker + S3) including a GDPR-focused face/license-plate obfuscation pipeline and a semantic-segmentation project aimed at reducing annotation time. Worked closely with DevOps and frontend teams and partnered with CEO/CMO to present an AI-driven annotation workflow to non-technical VC stakeholders.”

Large Language Models (LLMs)Deep Learning Transformers Computer Vision Natural Language Processing Data Science+60

View profile

Yun-Hao Lee

Screened

Junior Machine Learning Engineer specializing in LLM deployment and computer vision

Dallas, TX2y exp

Lab for Intelligent Storage and ComputingUniversity of Texas at Dallas

“Robotics/AI candidate who built an AI-driven landmark location tool during a summer internship at Mobile Drive, combining YOLOv5 object detection with OpenStreetMap-based geolocation to handle dense, cluttered urban environments. Also researched deploying LLM-based agents on constrained hardware using quantization plus LoRA/continuous learning, improving accuracy from ~80% to ~92%, with an emphasis on production logging for reliability.”

Python C C++R SQL Java+91

View profile

Srikanth Reddy

Screened

Mid-level AI/ML Engineer specializing in GenAI and financial risk & compliance analytics

Plainsboro, NJ7y exp

State StreetWilmington University

“Built and deployed a production LLM-powered financial risk and compliance platform to reduce manual trade exception handling and speed up insights from regulatory documents. Implemented a LangChain multi-agent workflow with structured/unstructured data integration (Redshift + vector DB) and emphasized hallucination reduction for regulatory safety using Amazon Bedrock. Strong MLOps/orchestration background across Kubernetes, Airflow, Jenkins, and monitoring/testing with MLflow, Evidently AI, and PyTest.”

A/B Testing Agile Amazon Bedrock Amazon CloudWatch Amazon EC2 Amazon RDS+178

View profile

Sai Venkata Sathwik Golla

Screened

Mid-level Backend & Applied ML Engineer specializing in LLM systems and scalable APIs

Palo Alto, CA3y exp

University at BuffaloUniversity at Buffalo

“Backend engineer who significantly evolved an internal analytics/reporting platform (Python API + Postgres) powering self-service dashboards for product/business teams, focusing on reliability under heavy concurrent load and fast query performance. Demonstrates strong production engineering practices across API design (FastAPI), observability, incremental rollouts with feature flags, and data security using JWT/RBAC plus Postgres row-level security.”

Python SQL JavaScript C++React PyTorch+85

View profile

Madhu Ramakrishnappa

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and time-series forecasting

California, USA4y exp

Northern TrustUniversity of Massachusetts

“ML/AI engineer with hands-on ownership of production recommendation and RAG systems at Northern Trust. They combine transformer modeling, latency optimization, cloud deployment, and monitoring with measurable business impact, including 14% accuracy gains, 12% engagement improvement, and 19% better query relevance.”

Python Bash SQL TypeScript R JSON+125

View profile

Shivam Soni

Screened

Mid-Level Full-Stack Software Developer specializing in cloud-native microservices and AI/ML

Remote, USA3y exp

Fidelity InvestmentsArizona State University

“Backend engineer who optimized an AI-driven portfolio analytics/insights platform at Fidelity, addressing latency and traffic growth by moving services toward microservices, improving service communication, and tuning API/DB performance. Experienced scaling Python/FastAPI services with Docker + Kubernetes autoscaling, and strengthening security/privacy for sensitive client portfolio data used in LLM-based reporting.”

Java Python JavaScript TypeScript Go gRPC+166

View profile

Goda Kodati

Screened

Mid-level Software Engineer specializing in Java/Spring backend and event-driven systems

Sunnyvale, CA4y exp

OptumUniversity of North Carolina at Charlotte

“Backend engineer from Optum who built and optimized a real-time, Kafka-driven healthcare claims processing platform handling 1M+ claims/month. Strong in reliability, state management, and observability for distributed systems, plus production deployment automation with Docker/Kubernetes and CI/CD; no direct ROS/robotics simulator experience yet but frames work in robotics-adjacent real-time principles.”

Java Python JavaScript TypeScript SQL C#+84

View profile

Roshan Erukulla

Screened

Mid-level AI/ML Engineer specializing in NLP and Generative AI

Indiana, USA6y exp

Elevance HealthIndiana University Indianapolis

“Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.”

A/B Testing Agile Amazon EC2 Amazon ECS Amazon S3 Apache Airflow+148

View profile

Sameer Shaik

Screened

Senior AI Engineer specializing in Generative AI, NLP, and applied deep learning

Chicago, IL8y exp

Live NationDePaul University

“Built a production multi-agent LLM system at Live Nation on Databricks (LangGraph/LangChain) that let venue/event teams ask questions in Slack, auto-generated optimized route schedules, and produced inventory/stocking recommendations from historical SQL data and venue trends. Improved reliability by tightening prompts with strict JSON schemas, providing sample questions/SQL, and adding guardrails plus synthetic/edge-case testing, while iterating with event managers and senior VPs via prototypes and feedback loops.”

A/B Testing Azure Blob Storage Azure Functions CI/CD Classification Clustering+143

View profile

Yash Tobre

Screened

Mid-level AI/ML Engineer specializing in computer vision, NLP/LLMs, and MLOps

Bentonville, AR4y exp

DyneticsUniversity of Texas at Arlington

“ML/AI engineer with defense and commercial analytics experience: deployed a real-time aerial object detection system at Dynetics (YOLOv5 + TorchServe in Docker on AWS EC2) with drift-triggered retraining and 99.5% uptime, tackling ambiguous targets and weather degradation. Previously at Fractal Analytics, built and explained a churn prediction model for marketing stakeholders using SHAP and delivered it via a Flask API into dashboards, driving a reported 22% attrition reduction.”

Python MATLAB SQL PyTorch TensorFlow Keras+98

View profile

Karthik O

Screened

Mid-level AI Software Engineer specializing in LLM systems and cloud APIs

Kansas, USA3y exp

DeloitteUniversity of Central Missouri

“Built and productionized an LLM-powered support/knowledge pipeline using embeddings and retrieval (RAG) to deliver more grounded, higher-quality responses while reducing manual effort. Focused on real-world reliability and performance—adding structured validation/guardrails, optimizing vector search and context size for latency/scale, and monitoring failure patterns in production. Experienced with orchestration via LangChain for LLM workflows and Airflow for production data/ML pipelines, and iterates closely with operations stakeholders through demos and feedback.”

Python JavaScript TypeScript Java SQL Git+112

View profile

DEDEEPYA PALAKURTHI

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and enterprise MLOps

Baltimore, MD4y exp

CVS HealthUniversity of Maryland, Baltimore County

“Backend engineer who built an AI-driven "Smart Feedback Analyzer" API (Flask → FastAPI) that processes user feedback with NLP (Hugging Face + OpenAI) and returns structured insights. Demonstrates strong production-minded architecture: stateless services, Cloud Run + Docker deployment, Redis/Celery background processing, and Postgres/SQLAlchemy performance tuning (EXPLAIN ANALYZE, indexing, N+1 fixes), plus multi-tenant data isolation via JWT/API-key derived tenant IDs.”

Python SQL Java Scala FastAPI REST APIs+220

View profile

Bhanu Gummadi

Screened

Mid-level Backend Software Engineer specializing in cloud-native microservices and FinTech

Bellevue, WA4y exp

MastercardUniversity of Central Missouri

“Backend-focused engineer with Mastercard experience building and operating high-volume transaction-processing microservices. Has owned customer-facing banking services end-to-end and built an internal on-call analytics tool that centralized logs/metrics with real-time filtering to speed root-cause analysis and reduce incident investigation time.”

Java Python C++C#Spring Boot Flask+86

View profile

Sagar Sidhwa

Screened

Senior AI/ML Engineer specializing in LLMs, MLOps, and predictive analytics

Jamestown, NY6y exp

CumminsBinghamton University

“ML/AI engineer with hands-on experience building production MLOps systems for predictive maintenance and demand forecasting, including deployment, monitoring, and iterative retraining. Also shipped a RAG-based employee onboarding chatbot integrated with ServiceNow APIs and reports business impact of roughly $300k/month in reduced stockout and overstock costs.”

Python SQL NoSQL JavaScript TypeScript C+210

View profile

Angela Churchwell

Screened

Principal Software Engineer specializing in enterprise AI platforms

Richardson, TX12y exp

CBREUniversity of Texas at Dallas

“Built a production-grade LLM document processing and workflow orchestration platform at CBRE for internal operations teams, handling highly variable long-form documents with a reusable architecture involving 50+ coordinated LLM calls per request. Stands out for treating agentic systems like distributed backend infrastructure, with strong emphasis on evaluation, observability, reliability, and vendor-agnostic orchestration across Bedrock, Vertex AI, and OpenAI.”

Prompt Engineering Semantic Search Embeddings OpenAI Vertex AI Llama+99

View profile

Saipraneeth Ketireddi

Screened

Mid-level AI/ML Engineer specializing in LLM automation and healthcare analytics

Dallas, TX4y exp

LinkedInUniversity of Texas at Dallas

“Full-stack AI engineer who has repeatedly taken ambiguous automation and agentic products from prototype to production, including a BRD automation platform that cut manual processing by 70% and a healthcare RAG assistant with long-term memory. Stands out for combining backend/AI orchestration depth with strong product instincts around trust, observability, security, and non-technical user experience.”

Python SQL C++Java C#Streamlit+149

View profile