Vetted Model Deployment Professionals

Pre-screened and vetted.

Meng Yang - Staff Software Engineer specializing in distributed systems, cloud platforms, and IoT in Columbus, OH

Meng Yang

Screened ReferencesStrong rec.

Staff Software Engineer specializing in distributed systems, cloud platforms, and IoT

Columbus, OH21y exp
M2M Technologies, Inc.California State University, San Bernardino

CTO/Chief Architect who rebuilt an IoT platform from a fragile legacy stack into an AWS-based, multi-tenant cloud-native system supporting 50k+ connected devices and 10M+ monthly events, then layered in real-time data pipelines and ML anomaly detection. Known for tightly aligning roadmaps and OKRs to business KPIs (onboarding speed, uptime, velocity) and for scaling teams into domain-focused pods; previously led a shift from LAMP to event-driven Node.js microservices using MQTT and message queues.

View profile
VA

VENU ANUPATI

Screened

Mid-level AI/ML Engineer specializing in Generative AI and LLM systems

San Jose, CA6y exp
Dignity HealthSan Jose State University

Senior AI/ML engineer with hands-on experience building production LLM systems in healthcare, including RAG-based clinical question answering and end-to-end MLOps on Vertex AI and Kubernetes. They combine strong platform engineering with applied GenAI work, citing a 35% improvement in factual accuracy and a 30% boost in internal team productivity through modular Python services and CI/CD.

View profile
BS

Bilal Sadaqat

Screened

Senior Machine Learning Engineer specializing in NLP, LLMs, and AI systems

Palo Alto, CA8y exp
Buzz SolsUniversity of Montana

AI/ML engineer with hands-on experience building a healthcare-focused generative AI application end-to-end, from architecture and data design through deployment, monitoring, and iterative improvement. Particularly strong in multi-agent LLM systems, fine-tuning, and safety guardrails, with measurable impact including a 20% accuracy lift to 91% and 10% latency improvement in a nutrition recommendation pipeline.

View profile
Sri Mounika Jammalamadaka - Mid-level AI/ML Engineer specializing in GenAI, LLMs, and data platforms in Fairfax, VA

Mid-level AI/ML Engineer specializing in GenAI, LLMs, and data platforms

Fairfax, VA6y exp
DewberrySan Jose State University

Built and helped deploy a production RAG-based LLM assistant for HVAC anomaly diagnostics, partnering closely with field engineers and operations teams to make AI outputs trustworthy in real workflows. Stands out for practical post-launch optimization work—improving retrieval quality, reducing hallucinations, and stabilizing non-deterministic behavior—which contributed to roughly a 40% reduction in diagnosis time.

View profile
AC

Mid-level ML Engineer specializing in LLMs, RAG, and real-time AI systems

USA5y exp
The Moore Law GroupUniversity at Buffalo

AI engineer focused on production-grade LLM systems rather than prompt-only solutions, with hands-on experience building citation-grounded RAG products and multi-agent workflows. Most notably built a financial document intelligence system for SEC filings and contracts that achieved ~92% recall@5, cut latency below 2 seconds, reduced hallucinations, and turned analyst research from hours into seconds.

View profile
LL

Mid-level AI/ML Engineer & Data Scientist specializing in NLP and Generative AI

Overland Park, KS5y exp
CenteneUniversity of Central Missouri

Built and deployed an agentic RAG platform at Centene Health to support healthcare claims and complaints workflows (Q&A for claims agents, executive complaint summarization, and compliance triage/classification). Experienced in LangChain/LangGraph orchestration, production deployment on AWS with FastAPI/Docker/Kubernetes, and implementing HIPAA-compliant guardrails to reduce hallucinations and ensure explainable outputs.

View profile
AS

Junior AI/Software Engineer specializing in LLM agents, RAG, and full-stack ML systems

Austin, TX2y exp
Gauntlet AIVirginia Tech

Backend engineer who built an Emergency Alert System with Virginia Tech for the City of Alexandria, focusing on real-time ingestion, secure dashboards, and AI-assisted prioritization. Emphasizes high-stakes reliability with guardrails (hybrid rules+LLM, confidence-based fallbacks), scalable async processing, and defense-in-depth security (JWT/RBAC plus database row-level security).

View profile
ST

Shreya Thakur

Screened

Mid-level Software Engineer specializing in Python backend and LLM/ML systems

New York, USA4y exp
Saayam for AllUniversity at Buffalo

Backend/AI engineer who has shipped production LLM systems end-to-end, including an AI request-routing service (FastAPI + BART MNLI + OpenAI/Gemini) that improved accuracy ~25% after launch via eval-driven prompt/category iteration. Also built an enterprise document intelligence/RAG platform on Azure (Blob/SharePoint/Teams ingestion, OCR/NLP chunking, embeddings in Azure Cognitive Search) with PII guardrails (Presidio), confidence gating, and scalable event-driven pipelines handling millions of documents.

View profile
Vaishnavi M - Mid-level AI/ML Engineer specializing in MLOps and Generative AI

Vaishnavi M

Screened

Mid-level AI/ML Engineer specializing in MLOps and Generative AI

5y exp
Liberty MutualUniversity of Maryland, Baltimore County

At Liberty Mutual, built a production underwriting decision assistant combining LLM reasoning with quantitative models and strong auditability. Implemented a claims-based response verification pipeline that cut hallucinations from 18% to 3% and materially improved user trust/validation scores. Experienced orchestrating ML/LLM workflows end-to-end with Airflow, Kubeflow Pipelines, and Jenkins, including SLA-focused pipeline hardening.

View profile
prashanth Jamalapurapu - Mid-level AI/ML Engineer specializing in data engineering, LLM/RAG pipelines, and recommender systems

Mid-level AI/ML Engineer specializing in data engineering, LLM/RAG pipelines, and recommender systems

5y exp
FriendzySaint Louis University

Research assistant at St. Louis University who built and deployed a production document-intelligence RAG system (Python/TensorFlow, vector DB, FastAPI) on AWS, focusing on grounding to reduce hallucinations and latency optimization via caching/async/batching. Also developed a personalized recommendation system for the Frenzy social platform and partnered closely with product/UX to define metrics and iterate on hybrid recommenders and cold-start handling.

View profile
MM

Manisha M

Screened

Senior AI/ML Engineer specializing in Generative AI and MLOps

Hollywood, FL7y exp
First Commonwealth BankJawaharlal Nehru Technological University

ML engineer with hands-on experience building banking AI systems end-to-end, including a customer-targeting model that improved campaign response rates by about 10%. Also shipped a RAG-based banking FAQ/support feature with safety guardrails and production optimizations around retrieval quality, latency, and cost, plus reusable Python services that reduced duplicate work for other engineers.

View profile
VM

Entry-Level Data Scientist specializing in ML, Azure, and LLM applications

Gainesville, Florida1y exp
University of FloridaUniversity of Florida

ML/computer-vision practitioner who shipped a CycleGAN-based bilingual handwriting translation demo (English↔Telugu) for low-resource scripts using unpaired datasets, focusing on preserving handwriting style and real-time deployment via Gradio. Also delivered a medical imaging pipeline by fine-tuning ResNet-50 and ViT-B/16 for pneumonia detection, emphasizing reproducibility, measurable evaluation, and stakeholder-friendly iteration.

View profile
BP

Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems

Fort Worth, Texas8y exp
Ingram MicroUniversity of North Texas

LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.

View profile
JP

Jhansi Priya

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and agentic workflows

Remote, null6y exp
fundae software IncUniversity of Dayton

Applied AI/ML engineer with hands-on production experience building a RAG-based AI assistant for pharmaceutical maintenance troubleshooting using LangChain + FAISS/Pinecone, including a custom normalization layer to handle inconsistent terminology and duplicate document revisions. Also built Airflow-orchestrated pipelines for document ingestion/embeddings and predictive maintenance workflows (SCADA ETL, drift-based retraining), and partnered closely with production supervisors/quality engineers via Power BI dashboards and real-time alerts.

View profile
Satish Kumar Reddy - Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps in Remote, NJ

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

Remote, NJ5y exp
Tungsten AutomationPace University

Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.

View profile
Nagendra Reddy Palugulla - Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps in Florida, United States

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

Florida, United States4y exp
Community Dreams FoundationUniversity of Houston

Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.

View profile
Yash Amre - Intern Data Scientist specializing in machine learning and NLP in California, USA

Yash Amre

Screened

Intern Data Scientist specializing in machine learning and NLP

California, USA1y exp
LexTrack AIUniversity of Colorado Boulder

Analytics-focused early-career candidate with internship experience owning reporting and system performance analysis projects end to end. They combine SQL data preparation, Python automation, and dashboard delivery with measurable impact, including roughly 50% less manual reporting and about 20% better forecast accuracy.

View profile
Keeravani Chekuri - Mid-level AI/ML Engineer specializing in LLM systems and MLOps in Boston, MA

Mid-level AI/ML Engineer specializing in LLM systems and MLOps

Boston, MA3y exp
Nexoraschool.aiUniversity of Massachusetts

Built and deployed an AI tutoring assistant end-to-end at Nexora School, spanning discovery with school districts, multi-agent LangGraph/RAG architecture, AWS Bedrock migration, and post-launch stabilization. Stands out for combining hands-on LLM systems engineering with strong educator-facing trust building, FERPA-driven architecture decisions, and disciplined production practices around evals, logging, and messy document ingestion.

View profile
Krishna K - Junior Machine Learning Engineer specializing in multimodal systems and LLMs in Jersey City, NJ

Krishna K

Screened

Junior Machine Learning Engineer specializing in multimodal systems and LLMs

Jersey City, NJ2y exp
JerseySTEMUniversity at Buffalo

Built and productionized a domain-specific LLM-powered RAG knowledge assistant at JerseyStem for answering questions over large internal document corpora, owning the full stack from FAISS retrieval and LoRA/QLoRA fine-tuning to AWS autoscaling GPU deployment. Drove measurable gains (28% accuracy lift, 25% latency reduction) and improved reliability through hybrid retrieval, grounded decoding, preference-model reranking, and Airflow-orchestrated pipelines (35% faster runtime), while partnering closely with non-technical stakeholders to define success metrics and ensure adoption.

View profile
Lakshmi Meghana - Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP in Bristol, PA

Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP

Bristol, PA4y exp
DermanutureStevens Institute of Technology

Built and deployed a transformer-based clinical document classification system that processes unstructured clinical notes in a HIPAA-compliant healthcare setting, served via FastAPI on AWS and integrated into an Airflow/S3 pipeline. Demonstrates strong end-to-end MLOps skills (data quality remediation, low-latency inference optimization, monitoring with MLflow/CloudWatch) and effective collaboration with clinicians to drive adoption.

View profile
Arunim Samudra - Mid-Level Software Engineer specializing in LLM applications, RAG, and OCR automation in Austin, TX

Mid-Level Software Engineer specializing in LLM applications, RAG, and OCR automation

Austin, TX3y exp
Trellis CompanyTexas A&M University

At Trellis, built and shipped a production multi-agent, authenticated GenAI chatbot for sensitive financial account inquiries (loan/payment lookups), using dynamic model routing to control latency and cost while improving accuracy. Implemented prompt-injection defenses (Meta Prompt Guard), RAG with LangChain, and LLM-as-a-judge evaluation; the system cut manual support call volume by 40%+ and was refined through close collaboration with QA-driven user testing.

View profile
HS

Human Sagheer

Screened

Senior AI/ML Engineer specializing in Agentic AI, RAG, and LLM systems

Chicago, IL8y exp
Origami RiskAir University

ML engineer with hands-on experience building production AI systems spanning agentic AI, RAG, LLM automation, fraud detection, and predictive analytics. At Origami Risk, they designed and implemented an enterprise RAG platform end to end using LangChain, LangGraph, vector search, and AWS Bedrock to improve internal knowledge retrieval, reduce manual effort, and raise response quality across teams.

View profile
Aryan Karadia - Intern Full-Stack Software Engineer specializing in web apps and edge ML in Calgary, AB

Intern Full-Stack Software Engineer specializing in web apps and edge ML

Calgary, AB1y exp
Engineered AirUniversity of Calgary
View profile

Need someone specific?

AI Search