Vetted Model Deployment Professionals

Pre-screened and vetted.

Anagha Rumade

Screened

Senior Applied AI/ML Engineer specializing in GenAI, LLMs, RAG and agents

Palo Alto, California · 9y exp
JPMorgan Chase · Stevens Institute of Technology

Applied AI/ML Engineer at JPMorgan Chase who led a banker-facing LLM chatbot from an OpenAI-API POC to a production RAG workflow, including hallucination mitigation, automated evaluation in SageMaker, and operational monitoring with Dynatrace. Also delivers external technical education, including a hands-on Grace Hopper Celebration 2025 workshop on LangChain/LangGraph agentic workflows.

Piyush Modi

Screened

Intern Software Engineer specializing in backend systems, cloud infrastructure, and ML/LLM tooling

Buffalo, New York · 2y exp
Juniper Networks · University at Buffalo

Infrastructure-leaning engineer who has built real-time ML systems end-to-end: a Jetson-deployed adaptive Whisper ASR service (Flask + WebSockets, React/TS UI) and a high-throughput Postgres schema for live transcription. Also delivered customer-facing AI billing/OCR improvements for a dental startup (Dentite), boosting OCR performance by 38%, and has experience instrumenting open-source ML deployment stacks to add infrastructure visibility.

Saisureshreddy Challa

Mid-level Data Scientist specializing in AI/ML, LLMs, and domain analytics

California, USA · 6y exp
BlackRock · Northeastern University

BlackRock AI/ML engineer who built and owned a production LLM document intelligence system for regulatory and investment analysis end-to-end. They combined RAG, multi-agent validation, strong evaluation/monitoring, and reusable Python services to process 50K+ documents, cut review time 40-50%, and improve decision accuracy by about 25%.

AJ

Mid-level AI/ML Engineer specializing in generative AI, NLP, and MLOps

San Jose, CA · 4y exp
ServiceNow · University of North Carolina at Charlotte

ML/AI engineer with hands-on ownership of production GenAI and computer vision systems, spanning experimentation, deployment, monitoring, and iterative optimization. Stands out for shipping an enterprise RAG platform that cut manual review by 50% and a defect detection pipeline that reduced report generation from 15 minutes to under 1 second while maintaining high uptime and strong operational discipline.

Drew Dunn

Screened

Senior AI Engineer specializing in generative AI and production ML systems

Aledo, TX · 14y exp
Elevance Health · Texas Tech University

ML/AI engineer with hands-on ownership of production computer vision, speech, and legal RAG systems. Notably improved a key-duplication CV pipeline enough to unblock commercial launch and remove specialist manual measurement, and also shipped a live Quran recitation detection feature for a product with 1M+ users.

SK

Mid-level Machine Learning Engineer specializing in NLP and cloud MLOps

CT, USA · 4y exp
ServiceNow · Rivier University

Built and deployed a production LLM-powered internal documentation assistant using embeddings, a vector database, and a RAG pipeline to reduce time spent searching PDFs/manuals. Experienced in orchestrating end-to-end LLM workflows with Airflow/LangChain, improving reliability via monitoring/error handling, and driving measurable quality through retrieval and hallucination-focused evaluation metrics.

Min-Han Shih

Screened

Junior Machine Learning Engineer specializing in speech and multimodal AI

Taipei, Taiwan · 2y exp
Furbo · USC

New grad who has shipped a production vision-language recommendation feature for a pet camera/mobile app, including building a tagged video dataset with human annotators and optimizing inference by FPS downsampling under device compute limits. Also built a multimodal MLLM benchmark using an LLM-as-judge (GPT-5-thinking) with a feedback loop, validated against human scoring, and measured post-feedback quality gains (12% average score improvement).

SV

Mid-level AI/ML Engineer specializing in Generative AI and Conversational AI

Remote · 5y exp
Infosys · University at Buffalo

GenAI Engineer at Infosys who built and deployed a production multi-agent RAG system for a top-tier bank, scaling to ~50,000 queries/day with 99.9% uptime. Drove measurable gains (45% accuracy improvement, 30% API cost reduction) through open-source LLM fine-tuning, Pinecone indexing/retrieval optimization, and AWS-based MLOps/monitoring, and has experience enabling adoption via developer workshops and customer-facing collaboration.

GJ

Mid-level Machine Learning Engineer specializing in MLOps, NLP, and Computer Vision

USA · 5y exp
Walmart · University of New Haven

ML/AI engineer with production experience across retail and healthcare: built a real-time computer-vision shelf monitoring system at Walmart and optimized edge inference latency by ~30% using TensorRT/ONNX and pruning. Also partnered with CVS Health clinical/pharmacy teams to deliver a medication-adherence predictive model, using Streamlit explainability dashboards and achieving an 18% adherence improvement.

Irfan Shaik

Screened

Mid-level AI Software Engineer specializing in risk and fraud detection

Los Angeles, California · 4y exp
Visa · George Mason University

AI/software engineer with experience at Visa building a real-time transaction fraud/risk scoring microservice in the card authorization path (Python, Kafka, Kubernetes on AWS) with strict 120–150ms latency constraints and reason-code outputs for downstream decisioning. Owns ML backend end-to-end (data/feature engineering, model training, deployment) and has demonstrated production reliability work including latency spike mitigation, SLO-based observability, drift monitoring, and safe fallbacks to rule-based decisions.

Rahul Hatkar

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps

San Francisco, CA · 6y exp
Scale AI · Webster University

AI/ML engineer who has shipped production AI systems end-to-end, including an automated multi-channel (Gmail/WhatsApp/voice) candidate interviewing workflow and an enterprise RAG knowledge search platform. Demonstrates strong production rigor (monitoring, A/B tests, guardrails, schema validation, shadow testing) with quantified impact: ~60–70% reduction in interview evaluation time and ~20–30% relevance gains in RAG retrieval.

Pavan Kalyan Padala

Mid-level Data Scientist specializing in predictive and generative AI

Daytona Beach, Florida · 4y exp
2725 Hospitality LLC · Yeshiva University

AI/ML engineer with production LLM experience in regulated financial services (J.P. Morgan Chase), building a customer response engine to automate first-contact resolution while addressing privacy, bias, compliance, and scale. Strong MLOps/orchestration background (Airflow, Docker/Kubernetes, AWS Step Functions, Azure ML/SageMaker) plus proven ability to integrate with legacy systems and drive stakeholder adoption through dashboards, auditability, and training.

Junhui Huang

Screened

Intern Machine Learning Engineer specializing in LLMs, MLOps, and NLP

Providence, RI · 1y exp
Harvard University · Brown University

Built and deployed a production LLM-driven Dungeons & Dragons game where the model acts as a dungeon master, adding a structured combat system and a macro-state tree to ensure campaigns converge to a clear ending. Fine-tuned Gemini 2.5 Flash on Vertex AI and deployed on GCP with Kubernetes, using RAG over DnD rules/spells plus multi-agent orchestration (intent-based routing between narrative and combat agents) to reduce hallucinations and improve reliability.

Suloni Praveen

Entry-Level Software Engineer specializing in data engineering and ML systems

Los Angeles, CA · 0y exp
Easley-Dunn Productions · USC

Built an end-to-end Next.js/TypeScript LLM-based scientific PDF analyzer using local Ollama/Llama inference to prioritize privacy and cost, producing structured research artifacts (e.g., authors/methods/findings) with ~92% extraction accuracy. At Qualtrics, helped replace a batch pipeline with a real-time, low-latency ML inference service (Python/Go on Kubernetes) using Redis caching, Grafana-based observability, and graceful fallbacks to protect UX during failures.

SP

Junior AI/ML Software Engineer specializing in LLMs and data-intensive systems

New York, NY · 3y exp
NYU Langone Health · NYU

AI/backend engineer who has owned production applied-ML systems end to end, including a Jitsi meeting intelligence platform with custom RoBERTa boundary detection, LLM summarization, and automated retraining from user feedback. Also has healthcare AI experience building a diabetes medication titration system with strict validation, drift monitoring, and safety guardrails, showing both product speed and high-stakes engineering rigor.

Mounya Bonuga

Screened

Mid-level AI/ML Engineer specializing in multimodal AI and recommendation systems

USA · 4y exp
Goldman Sachs · University of Central Oklahoma

ML/AI engineer with hands-on ownership of a production LLM/RAG system at Goldman Sachs, focused on workflow automation and large-scale document search for operational teams. They combine strong MLOps and backend engineering skills with practical GenAI evaluation and safety practices, and cite measurable impact including 22% better task guidance accuracy and sub-second search across millions of records.

ST

Mid-Level AI Engineer specializing in NLP, computer vision, and LLM applications

Austin, TX · 3y exp
BookedBy · University of Maryland, Baltimore County

LLM/RAG practitioner who productionized an LLM-driven customer communication and transaction understanding system at PayPal, emphasizing privacy/compliance guardrails and large-scale data normalization. Experienced in real-time debugging of hallucinations via retrieval pipeline tuning and in leading hands-on developer workshops and sales-aligned POCs to drive adoption.

Arjun Sharma

Screened

Staff Data Scientist specializing in AI/ML engineering and MLOps

Austin, TX · 10y exp
Accenture · Texas State University

ML/NLP engineer with experience at Flatiron Health building a production NLP platform that processed millions of clinical notes, using BERT/BiLSTM-CRF and spaCy to extract and normalize entities from noisy EMR text with oncologist-in-the-loop validation. Also built scalable retail ML workflows (Spark + Kubernetes + feature store caching) and applied vector databases plus contrastive-learning fine-tuning to improve retrieval relevance and recommendations.

Arnold Durazo

Screened

Senior Full-Stack Engineer specializing in AI/LLM and cloud-native SaaS

Austin, TX · 9y exp
Oracle · Cal Poly Pomona

Software engineer with strong end-to-end ownership across frontend, backend, data, and infrastructure, including real-time systems (Kafka/Postgres) and observability (Datadog). Built and productionized an AI-native RAG support assistant (OpenAI embeddings + Pinecone) with prompt/guardrail design, achieving 48% agent adoption and 30% faster responses. Experienced in legacy modernization and reliability work using feature flags, event/transaction replay, and rapid embedded delivery.

Sandeep Athota

Mid-level AI/ML Engineer specializing in cloud MLOps and production ML systems

Texas, USA · 4y exp
JPMorgan Chase · Kennesaw State University

AI/ML engineer at J.P. Morgan Chase who deployed a production financial-risk prediction platform combining CNN/LSTM/gradient boosting on AWS SageMaker, with automated drift-triggered retraining and governance-grade fairness testing. Leveraged SageMaker Clarify plus SMOTE and LLM-generated synthetic data to improve minority-group F1 by 0.12, and communicated results to non-technical risk/ops teams via Power BI dashboards.

Shram Kadia

Screened

Mid-level Software Engineer specializing in backend systems, cloud-native apps, and AI platforms

Santa Clara, CA · 4y exp
ServiceNow · North Carolina State University

Backend/full-stack engineer who has owned production systems end-to-end, including a Dockerized Node.js/TypeScript probabilistic fault-tree analysis service for nuclear safety research deployed on AWS. Also built and operated a FastAPI-based RAG pipeline over 200+ PDFs using FAISS, focusing on low-latency, idempotent workflows and strong observability; experienced with API design and Playwright E2E automation across React/Angular projects.

Barbara Christina Cruze

Senior Business Analytics Consultant specializing in BI, data engineering, and predictive analytics

Dallas, TX · 8y exp
Infosys · University of North Texas

Healthcare analytics candidate with hands-on experience turning messy claims, enrollment, and reference data into trusted SQL reporting layers and reproducible Python workflows. They emphasize metric standardization, stakeholder alignment, and operational impact, including ~40% reduction in manual reporting effort and improved forecasting/resource prioritization through high-risk patient segmentation.
