Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

RK

Junior Software Engineer specializing in distributed systems and ML platforms

Fullerton, CA
1y exp
California State University, Fullerton
Cal State Fullerton

Built and deployed real-world systems end-to-end across security and healthcare contexts: led a 3-person team delivering a university vehicle tracking system with 30% cost savings and 1-year post-launch monitoring. Also implemented a healthcare RAG chatbot with adaptive query routing that cut LLM costs by 40% while maintaining answer accuracy, and has experience debugging non-deterministic LLM behavior in DevOps pipeline automation.

DB

Mid-level Software Engineer specializing in backend, full-stack, and healthcare IT

Lake Mary, FL
5y exp
Vesta Teleradiology
Northern Arizona University

Software engineer with a pragmatic, production-oriented approach to AI-driven development, using AI to accelerate coding while keeping human oversight on correctness, architecture, and final decisions. Has hands-on experience with agent-style AI workflows and has led the design and coordination of AI-agent systems with a strong emphasis on reliability, performance, and end-to-end execution.

VJ

Mid-level Full-Stack & AI Engineer specializing in LLM applications

6y exp
Our National Conversation
Fitchburg State University

Full-stack engineer who has shipped and operated generative-AI chat/QA features end-to-end, including a RAG-based pipeline with guardrails and cost/latency monitoring in production. Experienced with React/TypeScript + Node/Postgres architectures, Dockerized deployments to AWS (EC2) via GitHub Actions CI/CD, and building reliable ingestion/ETL systems with idempotency, backfills, and reconciliation.

KP

Mid-level AI/ML Software Engineer specializing in GPU-optimized LLM inference and cloud microservices

Seattle, WA
5y exp
DVR Softek
San José State University

Built and deployed a production RAG-based multilingual analytics assistant for healthcare operations, enabling non-technical teams to query claims/EHR and risk metrics with grounded explanations. Demonstrates strong end-to-end LLM system engineering (retrieval tuning, re-ranking, hallucination controls, verification layers) plus workflow orchestration (Airflow/Composer/Step Functions) and stakeholder-driven iteration via prototypes and dashboards.

Shram Kadia

Screened

Junior Software Engineer specializing in ML, RAG systems, and safety-critical risk modeling

San Jose, CA
2y exp
OpenPRA Org
North Carolina State University

Backend/cloud engineer from Resilient Tech with hands-on experience deploying REST APIs and database migrations into a live ERP used by real customers while maintaining 99% uptime. Has debugged intermittent AWS container timeouts down to security group/load balancer misconfigurations, and has extended Python in an ERPNext system to meet GST/e-invoicing compliance requirements with strong customer collaboration.

BP

Intern Data Scientist specializing in GenAI agents, RAG, and ML platforms

Chicago, IL
3y exp
Immerso.ai
Illinois Institute of Technology

LLM/agent systems builder who deployed a production hybrid router for immerso.ai that dynamically selects retrieval vs reasoning vs generative pathways, achieving an 82% factual-accuracy lift. Deep hands-on experience optimizing local Mistral 7B inference (4–5 bit GGUF quantization, KV-cache reuse) and building reliable RAG/agent workflows with LangChain/LangGraph/AutoGen across GCP Cloud Run and AWS (ECS/Lambda).

DA

Junior Full-Stack Software Engineer specializing in cloud-native web apps and AI tooling

California, US
3y exp
EduQuencher
Missouri University of Science and Technology

Software engineer with experience across edtech, live gaming, and an AI document intelligence platform, delivering end-to-end customer-facing features and production backends. Built secure, automated live-session scheduling integrating Zoom and TalentLMS (JWT/RBAC, idempotency, transactions) cutting setup time from ~3 minutes to under 1 minute, and optimized real-time gaming dashboards/APIs with query tuning, caching, and CDN improvements (~60% latency reduction under peak load) on AWS.

Dinal Dholiya

Screened

Mid-level Full-Stack Engineer specializing in AI-powered and cloud-native systems

Remote
4y exp
Zentrais
University at Buffalo

Product-minded engineer who has owned features end-to-end, including a full onboarding redesign that lifted completion ~25% and a production LLM/RAG report-generation system with strong guardrails (schema-constrained JSON, confidence gating, logging) and an automated eval/regression loop built from real user queries. Also built a scalable research data pipeline ingesting messy PDFs/JSON/CSVs with normalization, idempotent reruns, observability, and cost/latency tradeoffs.

BK

Junior AI/ML Engineer specializing in Generative AI, NLP, and MLOps

Lewisville, TX
1y exp
ThinkBig Software Solutions
Texas Tech University

LLM engineer who has deployed a production RAG system (LangChain/FAISS/FastAPI) for enterprise semantic search, addressing real-world latency through LoRA/PEFT fine-tuning and grounding outputs with retrieval. Brings strong MLOps (Docker, AWS EKS, CI/CD, MLflow) plus stakeholder-facing explainability experience using SHAP to align ML-driven financial guidance with non-technical domain experts.

Pranav Mishra

Screened

Junior Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

Charlotte, NC
2y exp
WheelPrice
University of Illinois Chicago

AI/ML engineer who has shipped production systems across computer vision and conversational agents: built a YOLOv8-based wheel fitment pipeline at a Techstars-backed automotive startup, focusing on sub-second latency, monitoring, and robust fallback mechanisms that drove 2–3x page view growth and +5–6k users. Also built a voice-based interview platform orchestrating Deepgram + GPT-4 Mini + OpenAI TTS with FSM-driven reliability, and has hands-on RAG experience (LangChain, hybrid retrieval, cross-encoder reranking, custom pseudo-query generation).

Rethvick Sriram Yugendra Babu

Junior AI/ML Software Engineer specializing in Generative AI and scalable data pipelines

Tucson, AZ
2y exp
University of Arizona
University of Arizona

Built and operated large-scale biodiversity/ecological research platforms, integrating 50+ heterogeneous global datasets into a unified BIEN 3 schema on PostgreSQL/PostGIS and improving data consistency by 35%. Strong production engineering background (Linux monitoring, CI/CD performance gates, Docker on AWS/Azure) plus applied AI work building a Python RAG system (0.90 precision) and halving latency with Elasticsearch.

Taruni Reddy Ampojwala

Mid-level GenAI Engineer specializing in LLM agents and RAG systems

Brooklyn, NY
4y exp
PamTen
Long Island University

Built and deployed a production RAG-based LLM assistant that answers day-to-day operational questions from internal PDFs/SOPs, with strong emphasis on data consistency (metadata versioning, confidence thresholds, conflict handling) and low-latency retrieval at scale. Experienced designing and orchestrating multi-agent LLM workflows (retrieval/validation/generation) and pipeline orchestration for ingestion/embedding/vector-store updates, plus iterative delivery with non-technical operations/business stakeholders.

Yashwant Gandham

Junior Machine Learning & Backend Engineer specializing in LLM systems and ML infrastructure

Boulder, CO
1y exp
NovaChat AI
University of Colorado Boulder

Built and deployed production RAG-based document search/Q&A systems (DocChat and an internship marketing RAG), using a React + FastAPI stack on GCP with docs stored in GCP buckets and retrieval via embeddings/vector DB. Emphasizes cost/performance tradeoffs (reported ~40% cost reduction) and ships via Docker (Railway), with load/API testing using JMeter and Swagger; regularly collaborates with a CEO stakeholder to iterate and push changes to production.

Varun Mahankali

Junior Full-Stack Software Engineer specializing in React, Node.js, AWS, and Generative AI

3y exp
KalvenTech Technologies
University of North Texas

Built and production-deployed a Streamlit-based PDF RAG chatbot using LangChain (FAISS, embeddings, prompt templates) and OpenAI, optimizing Streamlit’s stateless behavior by caching vector DB + chat history to cut latency and API cost. Demonstrates a rigorous evaluation mindset (gold datasets, unit tests, LLM-as-judge, groundedness KPIs) and has experience communicating privacy/accuracy safeguards (RBAC, data masking, citations) to a non-technical client at Kalven Technologies.

Binaya Sharma

Screened

Senior Software Engineer specializing in full-stack systems, big data, and applied AI

Baton Rouge, LA
6y exp
365Labs
Louisiana State University

Built and deployed ForensicLLM, a local domain-specific LLaMA-3.1-8B model for digital forensic investigators using RAFT + RAG over 1000+ curated research papers, with citation-aware responses and rigorous evaluation (BERTScore/G-Eval). Deployed via vLLM and Docker and validated through a chatbot survey with 80+ participants; published at DFRWS EU 2025.

Karan Baid

Screened

Intern Machine Learning Engineer specializing in Generative AI and RAG systems

Jaipur, India
Netgraph Networking Pvt. Ltd.
Vellore Institute of Technology

Early-career AI/LLM builder who created and deployed a multi-agent news analysis agent (Patrakarita) using CrewAI, coordinating researcher/analyst roles to turn noisy article URLs into structured, prioritized outputs (claims, tone, verification questions, opposing views). Strong focus on orchestration debugging and reliability evaluation, including measuring hallucination/redundancy and improving reasoning by refactoring pipeline sequencing.

Srikar Tharala

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

Remote, USA
4y exp
Procial
Central Michigan University

Currently at ProShare and reports building an AI/LLM-powered system deployed to production, aimed at helping with status-related difficulties and reducing misunderstandings across transactions. Also cites prior collaboration at Porsche with marketing teams, focusing on translating marketing goals into technical requirements and communicating solutions clearly to non-technical stakeholders.

Atharva Chavan

Junior Full-Stack Software Engineer specializing in mobile, cloud, and GenAI integration

Syracuse, NY
2y exp
D&D Motor Systems Inc.
Syracuse University

Software engineering intern with hands-on ownership of a Java/Spring Boot order management microservice, including production performance tuning via Redis caching and database indexing driven by API logs/metrics. Also contributed to a production mobile-backend LLM feature using RAG with embeddings over structured data and documents (DB + object storage), with guardrails to keep responses grounded.

Merub SHAIKH

Screened

Junior Software Engineer specializing in full-stack web development and test automation

Chicago, IL
3y exp
Illinois Institute of Technology
Illinois Institute of Technology

Full-stack engineer who built and owned a production workflow/kanban-style drag-and-drop system in Next.js (App Router) with Postgres/Prisma, including reusable component abstractions, Cypress E2E coverage, and post-launch performance/bug ownership. Notable for measurable impact (25% faster UI dev, ~30% query perf improvement) and for leading an incremental Express→NestJS migration that reduced technical debt (~40%) through better structure, docs, and team enablement.

Sampath Achalla

Mid-level Python Full-Stack Engineer specializing in AI microservices and cloud data platforms

USA
3y exp
DoJaGa
Illinois Institute of Technology

Backend-leaning full-stack engineer in fintech/payments who shipped an end-to-end Stripe payments + webhook system for a financial microservices platform, emphasizing ledger accuracy via idempotency, transactional writes, retries, and DLQs. Also delivered a real-time React/TypeScript payment status dashboard informed by user interviews, and reduced production p95 latency by 35% through PostgreSQL tuning and Redis caching on AWS.

Gomathy Selvamuthiah

Junior Data/AI Engineer specializing in MLOps, real-time pipelines, and LLM applications

Portland, US
2y exp
SBD Technologies
Northeastern University

Built an LLM-driven MLOps agent at SBD Technologies that automated an EV-charging prediction workflow end-to-end, integrating with real-time Kafka/FastAPI systems supporting 120K+ chargers at 99.99% event delivery. Addressed frequent schema drift by implementing SQLAlchemy/Flyway validation (60% reduction in drift issues) and deployed as Kubernetes microservices with GitHub Actions CI/CD; also has Airflow-based ingestion/crawling experience into Snowflake and stakeholder-facing delivery via a Fleetcharge PWA.

Shuchi Shah

Screened

Senior Applied AI Engineer specializing in RAG and full-stack systems

San Jose, CA
13y exp
OpGov.AI
San Diego State University

Backend engineer with experience building an end-to-end civic tech AI platform that ingests city council meeting videos, transcribes them with Whisper, and enables natural-language Q&A via a LangChain/FAISS RAG pipeline. Demonstrated strong systems thinking by tuning retrieval for accuracy/latency/memory (cutting response time ~3s→1s and memory ~500MB→25MB) and by safely migrating an ERP from monolith toward services using dual writes, reconciliation, and idempotency to protect financial workflows.

Darshan Shah

Screened

Mid-Level Software Engineer specializing in cloud-native microservices and full-stack development

Holliston, MA
6y exp
Liberating Technologies
Northeastern University

Full-stack engineer with deep startup experience building products from scratch under ambiguous requirements. Delivered a scalable, admin-configurable notification platform (Spring Boot/Java/Kafka) supporting 50+ notification types across 3 channels for 10k+ users, cutting new notification setup to ~5 minutes. Also built a Tinder-meets-LinkedIn job-swiping app (React/TS + Node/Prisma) and has hands-on AWS production ops (ECS/EKS, RDS, CloudWatch) plus multiple third-party integrations (Stripe, QuickBooks, Twilio).

GJ

Junior AI/ML Software Engineer specializing in automation and healthcare imaging

Charlotte, NC
2y exp
Bridge Investment Group
University of North Carolina at Charlotte

Backend-focused engineer who built a Python-based automation system leveraging Gemini AI and prompt-driven PDF field extraction to replace a previously manual third-party workflow. Drove stakeholder alignment around accuracy/acceptance thresholds and added production-minded safeguards like graceful failure handling and backup model contingencies.
