Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

Yashwant Gandham

Junior Machine Learning & Backend Engineer specializing in LLM systems and ML infrastructure

Boulder, CO | 1y exp
NovaChat AI | University of Colorado Boulder

Built and deployed production RAG-based document search/Q&A systems (DocChat and an internship marketing RAG), using a React + FastAPI stack on GCP with docs stored in GCP buckets and retrieval via embeddings/vector DB. Emphasizes cost/performance tradeoffs (reported ~40% cost reduction) and ships via Docker (Railway), with load/API testing using JMeter and Swagger; regularly collaborates with a CEO stakeholder to iterate and push changes to production.


Binaya Sharma

Screened

Senior Software Engineer specializing in full-stack systems, big data, and applied AI

Baton Rouge, LA | 6y exp
365Labs | Louisiana State University

Built and deployed ForensicLLM, a local domain-specific LLaMA-3.1-8B model for digital forensic investigators using RAFT + RAG over 1000+ curated research papers, with citation-aware responses and rigorous evaluation (BERTScore/G-Eval). Deployed via vLLM and Docker and validated through a chatbot survey with 80+ participants; published at DFRWS EU 2025.


Mayank VYAS

Screened

Mid-level AI/ML Engineer specializing in LLM agents, RAG retrieval, and IoT ML systems

Tempe, AZ | 4y exp
Coral Labs | Arizona State University

Built production LLM-driven products including a job-hunt AI (job ranking + resume optimization) and an InterviewAI agentic pipeline using LangChain. Focused on practical deployment concerns like securing OpenAI usage via rate limiting and tiered quotas, and demonstrates an applied approach to choosing models, retrieval methods (RAG), and prompting strategies.

Ram Abhinav Vedant Madabushi

Junior Full-Stack/AI Engineer specializing in web platforms and LLM applications

Palo Alto, CA | 2y exp
FoodSupply.ai | University of Central Florida

Backend engineer from FoodSupply.ai who built and evolved a scalable restaurant/supplier product and order management platform using Node.js and REST APIs. Implemented a hybrid MySQL+MongoDB data architecture, optimized performance with Redis/Prisma, and led a phased migration with feature flags and a temporary sync layer to maintain data consistency. Strong focus on production security (OAuth2, RBAC, row-level security, AWS IAM) and reliability practices (testing with Pytest, Docker/AWS pipelines).


FNU ASHUTOSH

Screened

Mid-level Software Engineer specializing in AI, full-stack systems, and FinTech

Remote | 6y exp
Tridiagonal Solutions | Pace University

Product-minded full-stack engineer with experience in fintech identity verification and industrial analytics, focused on turning repeated operational pain points into reusable platforms. Built real-time KYC/KYB dashboards, secure cross-platform web components, and a multi-tenant workflow engine that cut onboarding from 2 weeks to 1 day while materially improving conversion, reliability, and developer speed.


I Anuj

Screened

Junior Backend/Infrastructure Engineer specializing in AWS distributed systems

Chennai, India | 2y exp
Velvee AI | SASTRA Deemed-to-be University

Backend engineer with 1.7 years of experience plus prior founding experience who has already owned production systems end-to-end in an early-stage environment. Most notably, they rebuilt a failing ingestion pipeline into a stable SQS/Fargate architecture that raised the success rate from 40% to 100%, boosted throughput 10x, and cut processing time by ~75%, while also shipping an LLM-powered fashion search workflow using Vertex AI and Elasticsearch.

MS

Mid-level AI Engineer specializing in LLM systems and data platforms

Jersey City, NJ | 4y exp
WPI Business School | Worcester Polytechnic Institute

AI/backend engineer who independently built and operated an agentic telecom analytics system end-to-end, using LangGraph and Claude to turn natural language into safe SQL in a regulated environment. He combines startup-speed execution with compliance-minded rigor, citing 95%+ NL-to-SQL accuracy, a 30-minute-to-2-minute workflow improvement, and zero-findings support across three regulatory audit cycles.


Shuchi Shah

Screened

Senior Applied AI Engineer specializing in RAG and full-stack systems

San Jose, CA | 13y exp
OpGov.AI | San Diego State University

Backend engineer with experience building an end-to-end civic tech AI platform that ingests city council meeting videos, transcribes them with Whisper, and enables natural-language Q&A via a LangChain/FAISS RAG pipeline. Demonstrated strong systems thinking by tuning retrieval for accuracy/latency/memory (cutting response time ~3s→1s and memory ~500MB→25MB) and by safely migrating an ERP from monolith toward services using dual writes, reconciliation, and idempotency to protect financial workflows.


Alok Patel

Screened

Senior Product Manager specializing in AI-driven SaaS, FinTech, and E-commerce

NC, USA | 13y exp
WhetStones | Kathmandu University

Product leader who built Dhurba, a SaaS commerce and business management platform for small and medium retailers, owning the lifecycle from strategy and discovery through launch and scale. Particularly strong in simplifying complex products for non-technical users, aligning cross-functional teams, and introducing explainable AI features that improve merchant outcomes without removing human control.


Kartik Sharma

Screened

Mid-level Full Stack AI Engineer specializing in LLM and RAG systems

Stockholm, Sweden | 4y exp
Aura | Graphic Era Hill University

Founding engineer and full-stack AI builder who single-handedly created Aura Groups Sweden's Trust and Growth platform across frontend, backend, ETL, and LLM services. Has hands-on experience shipping RAG-based products with OpenAI APIs and using them in real workflows, plus early-stage startup experience at nesoi.ai where they helped get an AI learning platform adopted by teams at Bain and Amazon.


Darshan Shah

Screened

Mid-Level Software Engineer specializing in cloud-native microservices and full-stack development

Holliston, MA | 6y exp
Liberating Technologies | Northeastern University

Full-stack engineer with deep startup experience building products from scratch under ambiguous requirements. Delivered a scalable, admin-configurable notification platform (Spring Boot/Java/Kafka) supporting 50+ notification types across 3 channels for 10k+ users, cutting new notification setup to ~5 minutes. Also built a Tinder-meets-LinkedIn job-swiping app (React/TS + Node/Prisma) and has hands-on AWS production ops (ECS/EKS, RDS, CloudWatch) plus multiple third-party integrations (Stripe, QuickBooks, Twilio).

AA

Senior Full-Stack AI/ML Engineer specializing in MLOps and GenAI

Belmont, Michigan | 10y exp
AvaSure | Capitol Technology University

Senior backend/data engineer who has built and maintained HIPAA-compliant, real-time clinical FastAPI services on AWS, orchestrating ML/LLM and vector DB calls with strong reliability patterns (auth, timeouts/retries, graceful degradation, idempotency). Also delivered AWS IaC/CI-CD (Terraform/Helm/GitHub Actions) across EKS/Lambda/SageMaker and built Glue/Spark ETL with schema evolution and data quality controls, plus demonstrated large SQL performance wins (15 min to <9 sec) and hands-on incident ownership.


Tamir ShemTov

Screened

Entry-Level Computer Vision Research Assistant specializing in medical imaging AI

Los Angeles, CA | 1y exp
Cedars-Sinai | California State University, East Bay

New grad who shipped an LLM-powered writing app (“Write-it”) to production on Azure with CI/CD (GitHub Actions + JFrog) and implemented an unconventional RAG pipeline to prevent repetitive prompts using embeddings and cosine similarity. Also participated in a Luma AI image/video generation hackathon, iterating with artist feedback and improving usability by rewriting non-technical prompts via an LLM.


Smit Panchal

Screened

Mid-level Full-Stack & XR Developer specializing in GenAI and immersive AR/VR systems

3y exp
Community Dreams Foundation | Illinois Institute of Technology

Built and deployed a "personal second brain" product (CloneMind) with an end-to-end RAG pipeline for retrieving information across PDFs, URLs, images, and audio using Next.js/Node.js/Postgres/Supabase/Redis. Demonstrates strong practical depth in retrieval quality tuning, latency reduction via caching, and stateful orchestration with LangChain/LangGraph, plus experience persuading a non-technical professor stakeholder by shipping a working prototype.


Kevin Sheu

Screened

Junior Full-Stack Software Engineer specializing in AI/ML platforms and microservices

2y exp
NCKU | National Cheng Kung University

Graduate-school lab engineer who built and owned the final architecture of a Microservices Hub that integrates REST APIs, issues API keys, monitors 10+ Linux servers, and visualizes service dependencies via a topology graph. Strong in bridging legacy and modern stacks (Dockerized and non-Dockerized services like Apache/screen) using deep Linux/networking knowledge, plus practical real-time audio streaming for STT/TTS and experience mentoring others.

SS

Senior Full-Stack & AI Developer specializing in Python/React, AWS, and LLM/RAG systems

Lahore, Pakistan | 9y exp
Devtor 360 | COMSATS University Islamabad

Backend Python engineer who owned the full backend build of an AI-driven platform for UK golf clubs, including FastAPI microservices, vector search, and a tuned LangChain+Pinecone RAG pipeline focused on cost and hallucination reduction. Experienced deploying Django/FastAPI/Flask stacks on AWS-backed Kubernetes with GitOps/ArgoCD-style delivery, plus executing legacy-to-AWS migrations and building Kafka-based real-time analytics pipelines.

RR

Junior Solutions Engineer specializing in full-stack automation and LLM prompt engineering

San Francisco, CA | 2y exp
SCU - Frugal Innovation Hub | Santa Clara University

Built and productionized an LLM-powered customer support system using a RAG architecture with structured document ingestion, embedding retrieval, and prompt templates for product-specific grounding. Experienced diagnosing live agent/workflow failures (e.g., retrieval regressions after new docs) by refactoring ingestion/chunking and adding grounding constraints plus evaluation benchmarks. Also supports go-to-market by joining discovery calls, shaping MVP workflows into demos/prototypes, and creating post-launch documentation to drive adoption.

VC

Junior Full-Stack Software Engineer specializing in Node.js, React, and REST APIs

Memphis, TN | 2y exp
Northern Arizona University | Northern Arizona University

Full-stack engineer who shipped and owned a production Document Chat feature built with Next.js App Router/TypeScript and a Node/Express RAG backend, including JWT-secured route handlers and streaming responses. Demonstrated strong post-launch ownership by improving latency (~30%) via MongoDB indexing/query optimization and reducing AI costs through caching, backed by profiling with React Profiler and Chrome DevTools.

KC

Intern Full-Stack Engineer specializing in AI-powered products

San Jose, CA | 0y exp
Evovance | Santa Clara University

Software engineer (internship experience) who built and owned an AWS serverless multi-user “challenge” feature end-to-end (UI + REST APIs + DynamoDB + deployment), delivering measurable gains in latency (-30%), debugging time (-50%), and join drop-offs (~-30%). Also productionized a multilingual RAG-based QA system with vector retrieval and guardrails, improving accuracy to ~85% and driving ~20% DAU growth.


Sugathri Gotu

Screened

Mid-Level Full-Stack Software Engineer specializing in FinTech and cloud-native microservices

California, USA | 4y exp
California State University | Cal State Dominguez Hills

Built and shipped a production LLM-powered incident response agent for a microservices platform, automating alert triage and safe remediation recommendations with strong guardrails (RAG grounding, structured JSON outputs, rule-based validation, and human-in-the-loop). Implemented state-machine orchestration (Redis/Kafka), comprehensive eval/monitoring, and an error categorization pipeline that cut hallucination errors ~40% and reduced MTTR ~30%.

CharanKumar Pathakamuri

Entry-Level GenAI/LLM Engineer specializing in agentic systems and RAG

Baltimore, MD | 1y exp
Kanehl Consulting | University of Maryland, Baltimore County

LLM/AI agent engineer with consulting/contract experience (Kanhaiya Consulting LLC) who deployed a production AI agent to automate BIM list workflows end-to-end, from database understanding and data cleaning to automated visualizations/dashboards. Worked around restricted real-time data access by generating synthetic data and improving outputs via supervised fine-tuning, and uses AWS-based LLMOps observability (Opic/OPEC) plus hybrid retrieval (vector+BM25 with reranking) to optimize relevance, latency, and cost.

Vishnu Priyan Sellam Shanmugavel

Mid-Level Applied AI Engineer specializing in LLM services, RAG, and OCR/NLP extraction

Arlington, VA | 4y exp
HealthLab Innovations | Illinois Institute of Technology

Backend/platform engineer who built and evolved a large-scale healthcare document processing system (OCR + LLM orchestration) in Python/FastAPI on Google Cloud (Cloud Run, GCS, Firestore), processing ~1.5M files per batch and tens of millions overall. Emphasizes reliability and operational safety via deterministic IDs, idempotent state machines, strong observability, and self-healing reconciliation, plus disciplined migrations using dual-run validation and incremental rollouts.


Revanth P

Screened

Mid-level DevOps Engineer specializing in AWS, Azure, Kubernetes, and GenAI infrastructure

Walnut Creek, CA | 4y exp
Mechanics Bank | University of Central Missouri

Database/platform engineer with stronger hands-on experience in AWS and Azure than GCP, but able to speak credibly about cloud database architecture, automation, and reliability engineering. They led an on-prem MySQL to RDS/DynamoDB migration, built Terraform/Python-based zero-touch database operations, and described a performance incident where latency dropped from 2s to under 300ms while supporting 2x traffic.


Zohaib Shahid

Screened

Mid-level Data Scientist specializing in Generative AI and LLM solutions

Magdeburg, Germany | 4y exp
DataRopes.ai | Otto von Guericke University Magdeburg

Built and owned a production RAG-based internal knowledge assistant end-to-end, from experimentation through cloud deployment and monitoring. Demonstrated strong practical GenAI judgment by choosing prompt optimization and retrieval tuning over fine-tuning for dynamic data, driving a 40% to 50% reduction in time to answer while improving relevance, lowering hallucinations, and increasing productivity.

