Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

TS

Tamir ShemTov

Screened

Entry-Level Computer Vision Research Assistant specializing in medical imaging AI

Los Angeles, CA1y exp
Cedars-SinaiCalifornia State University, East Bay

New grad who shipped an LLM-powered writing app (“Write-it”) to production on Azure with CI/CD (GitHub Actions + JFrog) and implemented an unconventional RAG pipeline to prevent repetitive prompts using embeddings and cosine similarity. Also participated in a Luma AI image/video generation hackathon, iterating with artist feedback and improving usability by rewriting non-technical prompts via an LLM.

View profile
NT

Junior Backend Engineer specializing in cloud APIs and AI-enabled systems

Raleigh, NC2y exp
NC State UniversityNorth Carolina State University

Built and shipped "OnCall Copilot," a production Slack-based RAG assistant that answers on-call questions from runbooks and postmortems with citations using a FAISS vector index. Emphasizes reliability and measurable performance via strict guardrails ("no evidence, no answer"), evaluation metrics, drift monitoring, and operational hardening with Docker, logging, health checks, and offline fallback.

View profile
SP

Smit Panchal

Screened

Mid-level Full-Stack & XR Developer specializing in GenAI and immersive AR/VR systems

3y exp
Community Dreams FoundationIllinois Institute of Technology

Built and deployed a "personal second brain" product (CloneMind) with an end-to-end RAG pipeline for retrieving information across PDFs, URLs, images, and audio using Next.js/Node.js/Postgres/Supabase/Redis. Demonstrates strong practical depth in retrieval quality tuning, latency reduction via caching, and stateful orchestration with LangChain/LangGraph, plus experience persuading a non-technical professor stakeholder by shipping a working prototype.

View profile
RR

Junior Solutions Engineer specializing in full-stack automation and LLM prompt engineering

San Francisco, CA2y exp
SCU - Frugal Innovation HubSanta Clara University

Built and productionized an LLM-powered customer support system using a RAG architecture with structured document ingestion, embedding retrieval, and prompt templates for product-specific grounding. Experienced diagnosing live agent/workflow failures (e.g., retrieval regressions after new docs) by refactoring ingestion/chunking and adding grounding constraints plus evaluation benchmarks. Also supports go-to-market by joining discovery calls, shaping MVP workflows into demos/prototypes, and creating post-launch documentation to drive adoption.

View profile
VC

Junior Full-Stack Software Engineer specializing in Node.js, React, and REST APIs

Memphis, TN2y exp
Northern Arizona UniversityNorthern Arizona University

Full-stack engineer who shipped and owned a production Document Chat feature built with Next.js App Router/TypeScript and a Node/Express RAG backend, including JWT-secured route handlers and streaming responses. Demonstrated strong post-launch ownership by improving latency (~30%) via MongoDB indexing/query optimization and reducing AI costs through caching, backed by profiling with React Profiler and Chrome DevTools.

View profile
SG

Sugathri Gotu

Screened

Mid-Level Full-Stack Software Engineer specializing in FinTech and cloud-native microservices

California, USA4y exp
California State UniversityCal State Dominguez Hills

Built and shipped a production LLM-powered incident response agent for a microservices platform, automating alert triage and safe remediation recommendations with strong guardrails (RAG grounding, structured JSON outputs, rule-based validation, and human-in-the-loop). Implemented state-machine orchestration (Redis/Kafka), comprehensive eval/monitoring, and an error categorization pipeline that cut hallucination errors ~40% and reduced MTTR ~30%.

View profile
DA

Daniel Adams

Screened

Mid-level Unity Developer specializing in XR and multiplayer VR experiences

Kapa’a, HI4y exp
ZestyVirginia Commonwealth University

Unity mixed-reality developer who shipped ZenPlay, a multiplayer Go app on Meta Quest, integrating a C# rules engine with XR input, Meta avatars, Hathora-hosted matches, and Vivox voice chat (reported ~700 MAU). Also built a production LLM agents backend (LangChain + RAG with Pinecone/ChromaDB + ChatGPT) powering embodied conversational avatars, with a strong focus on streaming voice latency optimization (ElevenLabs TTS) and cross-platform WebXR delivery (Quest/iOS/Android).

View profile
HJ

Mid-level Software Engineer specializing in GenAI and machine learning systems

Hartford, CT4y exp
HartcareSaint Louis University

Backend/AI engineer with deep healthcare experience building production Python microservices that turn raw clinical audio into structured notes and insights. They owned systems end-to-end across architecture, launch, monitoring, and incident response, with measurable impact including 40% lower operating costs, 22% better latency, and 99.9% uptime in a regulated environment.

View profile
SG

Mid-level Full-Stack Software Engineer specializing in AI and RAG systems

Parsippany, NJ4y exp
Agadia SystemsCalifornia State University, East Bay

Backend/AI engineer who built an enterprise RAG chatbot over 40,000+ technical documents, owning the system from ingestion and retrieval design through launch, optimization, and incident prevention. Stands out for treating LLM reliability as a data, retrieval, and observability problem—delivering 90%+ benchmark accuracy, ~50% fewer hallucinations, and major gains in lookup speed and latency.

View profile
VC

Mid-level AI Engineer specializing in GenAI, agentic workflows, and RAG systems

USA6y exp
Federal Home Loan BankIndiana Tech

Built a production multi-agent RAG assistant using LangChain/LangGraph with OpenAI embeddings and FAISS, focusing on retrieval quality and latency (Redis caching, parallel retrieval, precomputed embeddings). Experienced orchestrating ETL/ML pipelines with Airflow and Databricks Workflows, and has delivered an AI assistant for business ops to extract insights from policy/compliance documents through close non-technical stakeholder collaboration.

View profile
PK

Senior AI/Data Engineer specializing in Agentic AI and Advanced RAG on Azure Databricks

United States7y exp
Spark Data SolutionsUniversity of Cincinnati

Built production LLM/agent systems for procurement and contract spend controls, including a proactive contract value leakage detection platform that moved an organization from reactive audits to pre-payment rejection. Combines multi-agent orchestration (Semantic Kernel/LangChain/AutoGen), document AI benchmarking (Textract vs Azure DI), and MLOps/testing (MLflow, QTest/Pytest) with strong security practices (RAG-grounded responses to prevent prompt injection). Integrated anomaly alerts directly into SAP SES workflows and Power BI dashboards, citing ~$38M leakage addressed across large spend environments.

View profile
PK

Prerana Kumsi

Screened

Junior Full-Stack Software Engineer specializing in cloud microservices and .NET/Go

Tempe, AZ3y exp
Arizona State UniversityArizona State University

Product-minded full-stack engineer with hospitality tech experience who owned and scaled a multi-region guest verification/check-in workflow (ID/passport scanning, OCR, and government submissions) and built internal tools that cut manual entry up to 80%. Also built a React/TypeScript + FastAPI RAG “second brain” with async ingestion workers and an event-driven e-folio email microservice hardened with idempotency and retries.

View profile
SB

Intern AI/ML Software Engineer specializing in LLMs, NLP, and multimodal systems

India0y exp
National Small Industries CorporationLawrence Technological University

Built and deployed a production AI-powered personalized learning platform (Django + FastAPI) featuring an LLM+RAG tutoring assistant and automated grading. Demonstrates strong applied LLM reliability engineering (structured JSON outputs with Pydantic validation, hallucination control via FAISS-based RAG thresholds and refusals) plus scalable async microservice design and Airflow-orchestrated ETL across AWS/GCP.

View profile
RV

Mid-level Full-Stack Cloud Engineer specializing in GCP/Azure and AI-powered applications

Montville, NJ4y exp
Comptivia TechnologiesGeorge Mason University

Backend/DevOps-leaning engineer who has owned a Python serverless platform on AWS (Lambda, DynamoDB, Step Functions), including complex multi-step business workflows with transaction-based consistency and robust failure handling. Also supported an on-prem SQL to Azure Data Lake migration by building and monitoring Python + Azure Data Factory ETL pipelines, and led GitOps-style CI/CD automation with GitHub Actions (tests, security scans, automated deployments).

View profile
SK

Junior Full-Stack Software Engineer specializing in AI workflows and LLM integrations

Remote3y exp
LumeoFinanceClemson University

Built and productionized an AI-assisted merchant onboarding automation workflow for Kort Payments, replacing slow manual underwriting document review with structured extraction, cross-document validation, and human-in-the-loop guardrails. Emphasizes reliability via scenario-based testing, repeatability checks, and deep observability (timestamped logs), plus incremental rollout with legacy fallback to prevent regressions.

View profile
BN

Balaji Namala

Screened

Junior AI Engineer specializing in LLM systems, RAG, and scalable cloud AI

Seattle, WA2y exp
Sparrow VisionUniversity of Colorado Denver

Built and shipped production LLM agents for real-time, high-concurrency conversational systems, including a RAG-based pipeline with dynamic multi-provider routing and failover that achieved 99.99% reliability and sub-800ms latency. Also architected a UAV telemetry chatbot with tool-calling (anomaly detection/summarization), strict schema validation, and robust eval/monitoring loops, cutting tool-call errors by 30% and reducing operational costs by 90%.

View profile
Rizwana Shaik - Mid-level Full-Stack Software Engineer specializing in cloud-native apps and AI copilots in Dallas, TX

Rizwana Shaik

Screened

Mid-level Full-Stack Software Engineer specializing in cloud-native apps and AI copilots

Dallas, TX4y exp
Integrated Digital SolutionsUniversity of North Texas

Internship project building and deploying a LLaMA-based, RAG-enabled copilot inside a Professional Services Automation platform, enabling natural-language navigation, text-to-SQL reporting, and project/resource/budget insights across multiple modules. Addressed real production issues like context drift and vague queries with hybrid search, metadata enrichment, and an intent classification/rewriting layer, orchestrated via Apache Airflow—ultimately cutting PMO reporting time by 40%.

View profile
Ramya Jonnala - Principal/GenAI Engineer specializing in LLMs, RAG, and MLOps in Plano, TX

Ramya Jonnala

Screened

Principal/GenAI Engineer specializing in LLMs, RAG, and MLOps

Plano, TX9y exp
AmplifAITexas A&M University-San Antonio

Built a production AI-powered university marking system that automates question generation and grading from PDF course materials using a RAG pipeline (S3 + Pinecone) orchestrated with LangChain/LangGraph and deployed on AWS ECS via Docker/ECR and GitHub Actions CI/CD. Addressed a key real-world LLM challenge—grading consistency—by implementing rubric-based scoring, retrieval re-ranking, and standardized context summarization, validated against human instructors.

View profile
Emmanuel Eruck - Senior Full-Stack Software Engineer specializing in AI and FinTech in Toronto, Canada

Senior Full-Stack Software Engineer specializing in AI and FinTech

Toronto, Canada8y exp
Reddy AIUniversity of Buea

Frontend engineer who led the mentorAi SaaS platform UI end-to-end at IBL.ai, building real-time React/Redux experiences backed by WebSockets and scaling quality with Playwright E2E tests in GitHub CI. Also worked at Paymentology for ~2 years on fintech features (token management, Visa card creation/assignment and funding flows), modernizing an existing React codebase by introducing TypeScript and strengthening CI/coverage with SonarQube to reduce deployment bugs.

View profile
Thota Rajkumar - Junior Cloud & AI Infrastructure Engineer specializing in Agentic AI and AWS in San Jose, CA

Junior Cloud & AI Infrastructure Engineer specializing in Agentic AI and AWS

San Jose, CA2y exp
IpserLabNortheastern University

Built and deployed a production AI career-advice agent designed to combat unreliable/generic LLM guidance by grounding outputs in retrieval-first RAG over resumes/job/hiring data, with multi-step reasoning, structured memory, and evidence-only prompting to reduce hallucinations. Implemented the system with LangChain/Python and deployed on AWS as scalable microservices orchestrated via REST and asynchronous calls, iterating closely with career coaches and students.

View profile
HarshaVardhanRao Vemuganti - Junior AI/ML Engineer specializing in Generative and Agentic AI in Lafayette, Louisiana

Junior AI/ML Engineer specializing in Generative and Agentic AI

Lafayette, Louisiana4y exp
University of Louisiana at LafayetteUniversity of Louisiana at Lafayette

Built and deployed a production-grade LLM agent for credit management and accounts receivable automation, integrating ERP/MySQL data via a RAG pipeline and exposing services through FastAPI with Pydantic-validated outputs on AWS Bedrock. Emphasizes reliability and compliance for financial operations using schema validation and human-in-the-loop review, reporting ~32% reduction in manual work and ~41% improvement in response time/reliability.

View profile
Kishan Kumar Reddy Konreddy - Junior Software Engineer specializing in distributed systems and cloud platforms

Junior Software Engineer specializing in distributed systems and cloud platforms

2y exp
LancesoftSan José State University

Software engineer (Lance Soft Engineering) who built a Java/gRPC real-time request tracking system supporting ~20K simultaneous requests, using Kafka event streaming and PostgreSQL to improve transparency and cut support requests by 35%. Demonstrates strong production operations skills—resolved live latency spikes with Kafka async messaging (+48% throughput) and executed safe migrations using parallel runs, staging validation, and blue-green deployments.

View profile
ROHITH DAMARLA - Mid-level Backend Engineer specializing in high-scale systems and LLM pipelines in Syracuse, NY

Mid-level Backend Engineer specializing in high-scale systems and LLM pipelines

Syracuse, NY6y exp
Syracuse UniversitySyracuse University

Open-source-focused TypeScript/JavaScript engineer who built a lightweight Node.js utility library to standardize LLM-agent message formatting, tool invocation, and safe schema-validated JSON outputs. Emphasizes composable abstractions, real-world performance profiling/benchmarks, and strong community feedback loops (GitHub issues, structured errors, logging hooks). Also did research at Syracuse University on converting natural language into structured JSON with validation layers.

View profile

Need someone specific?

AI Search