Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

VD

Mid-level Software Engineer specializing in AWS, full-stack development, and AI data systems

Seattle, Washington - 3y exp
Amazon - Arizona State University

Backend engineer who built a Python-based data profiling/statistics platform processing up to 50M rows and ~300 metrics, using a DAG execution model, multithreading, and smart caching to cut processing time by up to 70%. Also improved PostgreSQL query performance from 12s to 2s via indexing/query rewrites, integrated an LLM (LangChain + OpenAI) for explainable “chat with the pipeline” functionality, and designed an AWS EC2+SQS architecture for scalable, isolated per-user processing.

Vyas Cholayil

Screened

Mid-Level Software Engineer specializing in Python automation, DevOps, and microservices

Raleigh, NC - 5y exp
Ansys - North Carolina State University

Backend-focused engineer who built an internal wiki LLM chatbot end-to-end using FastAPI, Kubernetes, and ChromaDB vector search, including frontend integration. Also has strong DevOps/migration experience—automating large work-item and repo migrations (Jira/FogBugz/ADO on-prem to cloud) via Python scripts, JSON mappings, REST APIs, and validation test suites.

LT

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA - 5y exp
City and County of San Francisco - San Francisco State University

Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).

SK

Mid-level AI/ML Engineer specializing in healthcare NLP, real-time risk systems, and ML platforms

Massachusetts, USA - 5y exp
Johnson & Johnson - Rivier University

LLM-focused customer-facing engineer who repeatedly takes document Q&A and agentic prototypes into secure, monitored production systems. Experienced in reducing hallucinations via RAG + guardrails, diagnosing retrieval/embedding issues in real time, and partnering with sales to run metrics-driven PoCs that overcome accuracy/security objections and drive adoption.

CR

Mid-Level Software Engineer specializing in distributed systems and cloud-native platforms

Austin, TX - 5y exp
AMD - Northeastern University

Backend/AI engineer who built and scaled an internal AMD semiconductor manufacturing microservice platform (SMR), reworking a synchronous lot-request workflow into an event-driven RabbitMQ/Celery/FastAPI pipeline. Diagnosed and fixed peak-load reliability issues using deep observability and Kubernetes autoscaling, cutting notification latency back to sub-second and reducing duplicates via idempotency/DLQs. Also shipped an LLM-powered natural-language search with schema-constrained JSON outputs and guardrails, plus a plan-execute-verify Jira bug-resolution agent that can propose fixes and raise PRs under restricted permissions.

JA

Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps

McKinney, TX - 6y exp
Globe Life - Texas A&M University

Built a production LLM/RAG-based “model excellence scoring” system at Uber to automatically evaluate hundreds of ML models, standardizing quality assessment and cutting evaluation time from days to minutes on GCP. Also delivered an NLP document classification solution for insurance claims at Globe Life, partnering closely with compliance/operations and improving routing accuracy from ~85% under manual routing to 93% with the model.

Travoy Spelling

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX - 10y exp
Tredence - University of Texas at Austin

ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).

Rohith Sadanala

Mid-level Machine Learning Engineer specializing in Generative AI and MLOps

Missouri, USA - 3y exp
Airbnb - University of South Florida

LLM/agent engineer who has shipped production RAG chatbots in sustainability-focused domains, including a packaging recommendation assistant that standardized messy user inputs and used Pinecone-backed retrieval over product/regulatory data. Experienced orchestrating end-to-end ML workflows with Airflow and AWS Step Functions/Lambda, emphasizing reliability (property-based testing, circuit breakers, OpenTelemetry) and measurable performance (latency/cost). Partnered closely with non-technical leadership to ship 3 weeks early, driving adoption by 150+ businesses and ~20% reported waste reduction.

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS - 10y exp
Turing - Mississippi State University

Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.

Aaron Li

Screened

Junior AI/ML Engineer specializing in production LLM systems and RAG

Atlanta, GA - 2y exp
Georgia Institute of Technology - University of Chicago

LLM/document AI engineer who owned a production-grade contract extraction pipeline at CORAMA.AI, ingesting PDFs and dynamic JavaScript sites from 1,000+ government sources. Built a hybrid deterministic+LLM system with two-phase prompting, Pydantic guardrails, confidence scoring, and human-in-the-loop review—cutting error rates from ~35% to <5% and processing 50k+ documents at ~95% accuracy. Also built clinician-in-the-loop orchestration in research, reducing manual labeling time from 3–4 hours to ~50 minutes.

Vismay Patel

Screened

Senior AI & Machine Learning Engineer specializing in NLP, GenAI, and MLOps

Berkeley, CA - 7y exp
Kaiser Permanente - San Francisco State University

ML/GenAI practitioner with healthcare domain depth who built and deployed a production cervical-cancer EMR classification system using a hybrid rules + medical BERT approach, optimized for high recall under severe class imbalance and PHI constraints. Experienced running end-to-end production ML/LLM pipelines with Apache Airflow (validation, promotion/rollback, monitoring, retraining) and partnering closely with clinicians to calibrate thresholds and implement human-in-the-loop review.

ChinmaySanjay Kawle

Junior Software Engineer specializing in cloud developer tools and backend APIs

Seattle, WA - 2y exp
Amazon Web Services - University of Illinois Chicago

Summer intern on the AWS Lambda tooling team who shipped Finch support in the AWS SAM CLI, adding OS/runtime detection and robust fallback behavior to preserve Docker compatibility across developer environments. Also built an end-to-end RAG system for querying arXiv quantitative finance papers using Postgres/pgvector with two-stage retrieval, citation-grounded prompting, and rigorous evaluation loops driven by IR metrics and user feedback.

Ranganayak Meravath

Mid-level Generative AI Engineer specializing in RAG, agentic copilots, and regulated AI

5y exp
LPL Financial - University of North Texas

Senior engineer who built and productionized an Azure-based Enterprise AI Copilot for financial/compliance teams, focused on grounded, auditable answers with citations to reduce hallucinations in regulated workflows. Experienced designing multi-step agent orchestration and improving reliability through targeted iterations (e.g., fixing chunking/parsing to materially improve citation accuracy), plus building defensive pipelines for messy ERP/operational finance data.

Devika Gade

Screened

Mid-level Full-Stack Developer specializing in FinTech and cloud-native applications

Remote, USA - 4y exp
Plaid - Christian Brothers University

Full stack developer with strong implementation ownership across cloud deployments, integrations, and AI-powered support automation. They have put LLM/RAG workflows into production with measurable impact—cutting first response time by nearly 40%—and show unusual depth in debugging non-deterministic AI incidents, improving observability, and turning messy document inputs into reliable API-driven pipelines.

Krishi Jain

Screened

Junior Implementation Manager / Solution Engineer specializing in AI, ERP integrations, and predictive maintenance

Chicago, IL - 2y exp
Continuum AI - Westcliff University

LLM/agentic workflow practitioner (Continuum AI) who productionized an LLM system for manufacturing RMA intake and warranty claims by moving from a brittle prompt to a modular pipeline with RAG, function-calling extraction, deterministic validation, and strong observability. Also diagnosed and fixed an agentic ticket-triage misrouting issue by tracing failures to retrieval timeouts, adding guardrails/fallbacks, and implementing retries plus continuous evaluation—bringing misroutes near zero while creating a repeatable debugging playbook.

CS

Mid-level Applied AI Engineer specializing in LLM infrastructure and model optimization

San Jose, CA - 3y exp
AMD - USC

LLM engineer who has deployed privacy-preserving, real-time workplace risk monitoring over massive enterprise chat/email streams, tackling latency, hallucinations, and extreme class imbalance with model benchmarking, RAG + fine-tuning, and a pre-filter alerting layer. Also built an agentic legal contract drafting system (Jurisagent) using LangGraph/LangChain with deterministic multi-agent control flow, structured outputs, and reliability-focused evaluation/telemetry.

YK

Junior AI/ML Engineer specializing in applied LLMs, security, and reinforcement learning

New York, USA - 2y exp
New York University - NYU

Built and shipped a production LLM-powered investor research feature for a fintech product, focused on grounded answers and minimizing hallucinations. Implemented retrieval-quality and evidence-coverage gating with clear refusal fallbacks, and evaluated systems with regression tests and metrics such as correct-refusal rate, hallucination rate, and latency. Comfortable orchestrating workflows with LangChain or custom Python depending on production needs.

VS

Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps

5y exp
Capital One - University of the Cumberlands

AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.

LK

Junior Full-Stack & Data Engineer specializing in cloud platforms and cybersecurity ML

New York, NY - 2y exp
Accenture - NYU

Built a hackathon "Patient Summary Assistant" backend focused on healthcare workflows, combining RAG-based summarization with HIPAA-minded privacy controls (NER redaction + encryption). Demonstrated strong infra skills by deploying on Kubernetes with Helm/HPA and GitOps (ArgoCD), plus migrating from OpenAI to an on-prem Llama 3 stack (vLLM, quantization, shadow-mode testing) and adding real-time Kafka ingestion for patient vitals/anomaly alerts.

Prachi Jain

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

Remote, US - 6y exp
JPMorgan Chase - University of Massachusetts Amherst

Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.

JJ

Intern Generative AI Engineer specializing in RAG and multi-agent systems

Chicago, IL - 2y exp
NeuraFlash - University of Chicago

Built and deployed a production RAG-based multi-agent chatbot during an internship to help consultants answer client questions and guide users through new IT systems with step-by-step instructions. Demonstrates hands-on experience with LangGraph/LangChain/Google ADK, unstructured document parsing and chunking for RAG, and a reliability-first approach to agent workflows (metrics, fallbacks, human-in-the-loop, guardrails).

YP

Mid-level AI/ML Engineer specializing in Databricks, MLOps, and real-time fraud detection

The Colony, TX - 4y exp
Databricks - University of North Texas

ML/LLM engineer building production, real-time fraud detection for financial transactions using a two-tier architecture (fast ML + GPT) to deliver both low-latency decisions and analyst-friendly risk explanations. Experienced orchestrating end-to-end retraining, drift monitoring, and automated model promotion with Databricks Jobs/Workflows and MLflow, and partnering closely with fraud analysts to tune alerts, thresholds, and dashboards.

NV

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States - 1y exp
WorkUp - USC

ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).

ZI

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA - 10y exp
SOTER AI - UC San Diego

Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.
