Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

KM

Mid-Level AI/ML Software Engineer specializing in agentic LLM systems

Dallas, Texas6y exp
DatatronUniversity of West Florida

Built and deployed a production LLM-powered multi-agent compliance copilot (life sciences/finance) using LangChain/LangGraph + RAG over vector databases, delivered via async FastAPI on Kubernetes. Emphasizes audit-ready, deterministic outputs with schema constraints and citations, plus rigorous evaluation/monitoring; reports 60%+ reduction in manual research time and successful production adoption.

View profile
JT

Jingyi Tian

Screened

Junior Machine Learning Engineer specializing in MLOps and LLM/RAG systems

Houston, TX2y exp
Daxwell, LLCColumbia University

LLM/agentic workflow builder focused on productionizing document-processing systems. Redesigned pipelines with LangGraph + RAG, schema-aware validation, and eval/monitoring loops; known for fast incident diagnosis (restored accuracy from ~70% to >95% same day). Partners closely with sales and stakeholders to deliver tailored demos and drive adoption (reported +40%).

View profile
SS

Mid-Level Full-Stack Software Engineer specializing in API-first microservices and cloud platforms

Arlington, TX4y exp
University of Texas at ArlingtonUniversity of Texas at Arlington

Backend-focused engineer who built a resume processing and job application platform using Python/MongoDB/Streamlit, including OpenAI-powered skill/keyword extraction and recruiter-facing search/filtering. Has hands-on cloud deployment experience on AWS/Azure and executed an on-prem reservation portal migration to Azure using a phased trial-and-cutover approach; also automated CI/CD with Jenkins and GitHub Actions.

View profile
HT

Hassam Tariq

Screened

Mid-Level Software Engineer specializing in Cloud, GenAI, and Federal systems

Arlington, VA
DeloitteUniversity of Maryland, College Park

Cloud-focused engineer experienced deploying and stabilizing complex production systems that span APIs, infrastructure, and automated workflows, with a strong observability and safe-release mindset (feature flags/canaries/rollbacks). Has hands-on, customer-facing incident leadership, including executing DR regional failover during an AWS us-east-1 outage to maintain service and reportedly save a client ~$10M.

View profile
SM

Shravya M

Screened

Senior AI/ML Engineer specializing in NLP, LLMs, and MLOps

Texas, USA6y exp
CVS HealthUniversity of North Texas

LLM/agent workflow engineer with healthcare experience (CVS/CBS Health) who built and deployed a production call-insights platform using Azure OpenAI + LangChain/LangGraph, including sentiment and compliance checks. Demonstrates deep HIPAA/PHI handling (tenant-contained processing, redaction, RBAC/encryption/audit logging) and production rigor (testing, eval sets, validation/retries, autoscaling) to scale to thousands of transcripts.

View profile
NN

Mid-level AI/ML Engineer specializing in Generative AI and MLOps

4y exp
WalgreensUniversity of North Texas

Built and deployed a production Retrieval-Augmented Generation (RAG) platform in a healthcare setting to automate clinical documentation review and summarization, targeting near-real-time, explainable outputs. Emphasizes grounded generation to reduce hallucinations, latency optimizations (chunking/embedding reuse), and PHI-safe workflows with access controls, plus strong orchestration experience using Apache Airflow.

View profile
KA

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and multimodal modeling

Ann Arbor, USA3y exp
University of MichiganUniversity of Michigan

Built and productionized a telecom-focused RAG assistant by LoRA fine-tuning LLaMA-2 and integrating LangChain+FAISS behind a FastAPI service, with dashboards and a human feedback UI for engineers. Demonstrated measurable impact (≈40% faster document lookup, +8–10% retrieval precision) and strong MLOps rigor via Airflow orchestration, CI/CD, and monitoring for drift and failures.

View profile
SK

Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications

Dallas, TX5y exp
Baylor Scott & WhiteUniversity of North Texas

Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).

View profile
BK

Bharath kumar

Screened

Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps

Draper, UT12y exp
ThorneBharathiar University

ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.

View profile
RW

Principal Data Scientist specializing in NLP and Generative AI

Chicago, IL9y exp
Witmer Consulting CorporationGeorgetown University

ML/NLP practitioner with experience building an embedding-based ad matching and search system at Vericast (BERT embeddings + similarity search) to replace a third-party taxonomy approach, evaluated via a human-curated gold standard. Also built a custom NER pipeline at Allstate for auto accident claims calls using a bidirectional LSTM and achieved 90%+ F1, with a strong emphasis on production-grade ML workflows (testing, CI/CD, orchestration, versioning, validation).

View profile
XX

Xinle Xu

Screened

Junior Product Manager specializing in GenAI and global e-commerce

Remote6y exp
ByteDanceNortheastern University

字节跳动实习期间将内部AI重量预测模型从“可用但难上线”的单点能力,改造成可商业化复用的通用API:统一多地区接口与评估口径,设计分层兜底与置信度分级,先灰度上线SEA/JP并推动US/EU落地,结合线上结果进行模型微调。具备LLM/RAG/Agent系统的实战排障方法论,以及面向开发者与售前场景的技术演示与跨团队推进能力。

View profile
CM

Chris Marcus

Screened

Executive CTO & AI Architect specializing in regulated SaaS (InsurTech/Healthcare/FinTech)

Remote15y exp
agentCanvas.aiUniversity of Texas at Austin

Insurance-tech CTO and repeat founder with 10+ years in insurance startups; was employee #4/CTO at Polly (formerly DealerPolicy) and helped scale it from a PowerPoint to 250 employees while raising $180M+. Currently building and selling AgentCanvas.ai—an extensible AI accelerator platform for large insurance agencies—after coding the product end-to-end and now running demos/POCs with prospective buyers.

View profile
HM

Mid-level Backend Software Engineer specializing in AWS cloud and FinTech platforms

Bengaluru, India3y exp
JPMorgan ChaseTexas A&M University

JP Morgan engineer and Texas A&M student web developer who has owned production systems end-to-end, including a real-time ML training workflow that improved internal search relevance by 30%. Experienced with AWS cloud migrations and operating containerized services on ECS with CloudWatch+ELK observability, Terraform infra, and Spinnaker CI/CD; also built event-driven pipelines with RabbitMQ and Elasticsearch at 1M+ record scale.

View profile
JS

Intern Software Engineer specializing in edge AI deployment and distributed systems

San Francisco, CA1y exp
Zetic AISan José State University

Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).

View profile
RA

Junior Software Engineer specializing in distributed systems and cloud-native backend services

Boston, MA2y exp
BoroughUniversity of Michigan

Founding engineer at a civic-tech startup (Barrow) who built and operated a Next.js/TypeScript product with map-based public reporting, including clustering and dynamic geospatial loading to improve UX and performance. Also implemented a location-aware RAG chatbot using Pinecone, web scraping/transcription, caching, and fallback web search, and owned post-launch observability plus scaling decisions (load balancing/horizontal scaling) based on API usage patterns.

View profile
Jincheng Pang - Principal Data Scientist specializing in healthcare analytics and medical imaging AI in Sudbury, MA

Jincheng Pang

Screened

Principal Data Scientist specializing in healthcare analytics and medical imaging AI

Sudbury, MA11y exp
AccessHopeTufts University

Developed an LLM-driven recommendation agent in Azure Databricks to triage oncology patients and trigger second-opinion case creation using medical claims and EHR data. Uses ICD-10/CPT/J-code features in prompts, embeddings + vector DB similarity, and a backtesting framework emphasizing recall to avoid missing clinically relevant cases while supporting business revenue.

View profile
Prateeksha Ranjan - Mid-level Software Engineer specializing in embedded AI and full-stack systems in Irvine, California

Mid-level Software Engineer specializing in embedded AI and full-stack systems

Irvine, California4y exp
SynapticsUC Irvine

Robotics software engineer who built and owned core navigation components for a TurtleBot in ROS/ROS2 and Gazebo, including an RRT-based planner, waypoint-to-velocity motion planning, and PID trajectory tracking. Demonstrates strong real-time debugging skills (control-loop timing under CPU load), costmap/occupancy-grid tuning, and distributed ROS2 communication design using DDS/QoS, plus Docker and CI/CD automation experience from Keysight.

View profile
Prateek Patil - Engineering Leader specializing in Digital Health, AI, and Cloud Platforms in Santa Clara, CA

Prateek Patil

Screened

Engineering Leader specializing in Digital Health, AI, and Cloud Platforms

Santa Clara, CA16y exp
RocheIllinois Institute of Technology

Senior Engineering Manager at Roche leading two Scrum teams building internally shared (“inner-sourced”) tools and libraries for a healthcare enterprise. Has led security/compliance-first architecture decisions (e.g., Python AI modules running inside a Java container) and front-end modularization (Angular monorepo to module federation), with a strong focus on developer experience via automated Swagger/OpenAPI documentation and robust testing/versioning practices.

View profile
Ali Rahmati - Senior Machine Learning Engineer specializing in optimization, LLMs, and on-device AI in Santa Clara, CA

Ali Rahmati

Screened

Senior Machine Learning Engineer specializing in optimization, LLMs, and on-device AI

Santa Clara, CA9y exp
QualcommNorth Carolina State University

Engineer with hands-on experience debugging and hardening a fixed-point implementation for an internal PoC, quickly diagnosing overflow/underflow issues that caused intermittent failures across thousands of runs and delivering a code fix. Comfortable presenting technical solutions with layered slide depth and doing follow-up deep dives for interested stakeholders, though has limited direct customer/sales partnership experience.

View profile
Chia-En Lu - Junior AI/ML Systems Engineer specializing in LLM infrastructure and distributed training

Chia-En Lu

Screened

Junior AI/ML Systems Engineer specializing in LLM infrastructure and distributed training

1y exp
GenseeAIUC San Diego

Built and shipped a production NMT system translating medical documentation for a rare/low-resource language, tackling data scarcity with retrieval-driven pattern matching plus dictionary/grammar- and LLM-based augmentation and validating quality with a linguistic expert. Also develops agentic LLM workflows with LangChain/LangGraph (including a deep-research style system) and has experience aligning medical AI deployments with clinician-defined risk metrics and human-in-the-loop decision making.

View profile
Advitha Bawgi - Junior Full-Stack Software Engineer specializing in cloud-native microservices in India

Advitha Bawgi

Screened

Junior Full-Stack Software Engineer specializing in cloud-native microservices

India1y exp
DAZNArizona State University

Backend engineer with hands-on IoT and AI product work: built a decoupled Raspberry Pi + AWS IoT Core weather monitoring backend and a Dockerized FastAPI LLM service on AWS ECS using OpenAI/HuggingFace with an emerging RAG layer. Also delivered measurable performance gains at DAZN by redesigning event-driven/serverless ingestion (SNS, S3->Lambda->DynamoDB), cutting latency ~30% and boosting throughput ~25% while automating ~90% of manual sync work.

View profile
Niteesh Ganipisetty - Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision in Grand Rapids, MI

Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision

Grand Rapids, MI4y exp
IntuitGrand Valley State University

Built an LLM-powered learning assistant (EduQuizPro/EduCrest Pro) that uses RAG over URLs and PDFs to generate quizzes, notes, and explanations for students/professors. Emphasizes production robustness—implemented dependency fallbacks (FAISS/Sentence Transformers/Gradio), CLI-safe mode, and NumPy-based indexing—along with a custom orchestration layer to keep multi-step AI workflows reliable.

View profile
Sandeep Mekala - Director-level Mobile Engineering Manager specializing in Generative AI and agentic mobile experiences in Remote

Director-level Mobile Engineering Manager specializing in Generative AI and agentic mobile experiences

Remote13y exp
T-MobileSilicon Valley University

iOS player-coach who led end-to-end development of real-time customer support chat and unified notification systems for T-Mobile’s iOS app using SwiftUI, Firebase, WebSockets, and Core Data (including offline handling). Drove measurable reliability/latency gains (~30%) through a major notification refactor and owned a high-severity push-notification incident from rollback through RCA and backward-compatible hotfix, while also scaling team process and people management.

View profile
silin liu - Mid-level AI/ML Engineer specializing in LLM agents, RAG, and enterprise ML systems in New York City, NY

silin liu

Screened

Mid-level AI/ML Engineer specializing in LLM agents, RAG, and enterprise ML systems

New York City, NY5y exp
Metropolitan Transportation AuthorityStevens Institute of Technology

Built a production multi-agent recommendation/RAG system for internal data analysts to speed up weekly report creation by improving document discovery and automating report/SQL generation. Implemented LangGraph-based orchestration with deterministic agent routing, robust error handling (interrupt/resume), and metadata-driven semantic chunking for diverse PDF/document formats, plus monitoring for latency, throughput, and token/cost efficiency.

View profile

Need someone specific?

AI Search