Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

JP

Mid-level Full-Stack Engineer specializing in AI, voice systems, and SaaS

4y exp
Solvr LabsUniversity of Arizona

Built Taskline end-to-end as a solo founder: an AI-native operations platform for tradespeople with web and mobile apps, AI receptionists, invoicing, scheduling, and payments. Particularly interesting for teams seeking a zero-to-one full-stack builder who can turn LLM/agent capabilities into practical, low-friction products for non-technical users.

View profile
SS

Mid-level Full-Stack Engineer specializing in Applied AI

USA4y exp
HomeInheritanceCentral Michigan University

AI/full-stack product engineer with experience shipping agentic systems in both fintech and enterprise compliance contexts, including an Anthropic-powered compliance Q&A tool at American Express and a home-equity assessment experience for seniors. Stands out for combining strong product instincts, typed full-stack implementation, and rigorous LLM evaluation/monitoring practices to improve trust, adoption, and operational efficiency.

View profile
RJ

Rohit Jaiswal

Screened

Mid-level Software Engineer specializing in distributed backend systems for FinTech

New York, NY5y exp
JPMorgan ChaseSyracuse University

Full-stack/backend-leaning engineer with experience spanning fintech platforms, internal AI/RAG assistants, real-time analytics systems, and a zero-to-one academic web platform. Stands out for combining hands-on backend and infrastructure work with product ownership, team guidance, and measurable impact like cutting troubleshooting lookup time from 30 minutes to under 8 minutes and creating reusable UI components adopted across multiple projects.

View profile
NP

Nihari Puli

Screened

Mid-level AI/ML Engineer specializing in LLM systems and agentic workflows

4y exp
OptumUniversity of Cincinnati

Built an agentic medical coding system at Optum that combined LangGraph, LangChain RAG, Azure OpenAI, pgvector, and TypeScript to automate routine clinical coding while escalating risky cases to humans. The system automated about 40% of routine cases at roughly 92% accuracy, with strong production evals and observability using MLflow, Ragas, and DeepEval.

View profile
SS

Mid-level Software Engineer specializing in backend, microservices, and AI for FinTech

San Diego, CA5y exp
PNCUniversity of North Texas

Built and shipped an internal Financial Insights Assistant for banking analysts, owning the experience from workflow design through React frontend, FastAPI backend, and AI search integration. Particularly strong in making AI products usable and trustworthy by surfacing sources, experts, suggested prompts, and search history to improve confidence and speed.

View profile
Mahima Mannava - Junior Software Engineer specializing in AI, LLM systems, and full-stack development in San Jose, CA

Junior Software Engineer specializing in AI, LLM systems, and full-stack development

San Jose, CA3y exp
StackbirdsSan Jose State University

AI/full-stack engineer who built a computer-usage agent end to end, including a split local/cloud architecture that used a vision LLM to drive real Chrome workflows while avoiding bot detection. Stands out for combining product-minded systems design, rigorous evals, and prompt iteration to achieve sub-2-second latency and a 20% reduction in automation failures in an early-stage environment.

View profile
KB

Mid Software Engineer specializing in backend, cloud, and AI platforms

San Jose, CA3y exp
PNCSan Jose State University

Built and shipped an AI-powered support assistant at PNC Bank, owning both the React/TypeScript frontend and Python/FastAPI + LangChain/OpenAI backend. Stands out for applying RAG in a regulated banking environment with measurable impact: 34% better knowledge retrieval accuracy and 27% faster support lookup time.

View profile
Jaya Rathi - Senior Software Engineer specializing in distributed systems and cloud platforms in Cupertino, CA

Jaya Rathi

Screened

Senior Software Engineer specializing in distributed systems and cloud platforms

Cupertino, CA15y exp
AI LearningDr. A. P. J. Abdul Kalam Technical University

Full-stack developer who built a unit billing subscription portal in React, Node.js, TypeScript, and PostgreSQL to track employee billable hours and generate paperless invoices, including BambooHR integration and async workflows with RabbitMQ. Also independently exploring agentic AI through a support desk assistant using RAG, ChromaDB, and LangChain, with a strong focus on grounding, type safety, and measurable retrieval quality.

View profile
SS

Mid-level AI Engineer specializing in LLMs, RAG, and content automation

Los Angeles, CA3y exp
Cloud9USC

AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.

View profile
MD

Junior Software Engineer specializing in AI, backend systems, and AWS cloud

Sunnyvale, CA2y exp
LinkedInNortheastern University

Built and shipped a production multi-agent conversational AI platform (Monitor agent + RAG + 4 additional agents) with enterprise REST APIs, using ChromaDB-grounded WCAG knowledge to keep responses accurate while varying tone via personality modes and conversation memory. Has experience at LinkedIn delivering technical demos and pre-sales guidance to both engineering teams and C-level stakeholders, acting as a translator between sales and technical teams to drive adoption.

View profile
VS

Principal Software Architect specializing in AI/ML and cloud-native full-stack platforms

Austin, TX17y exp
Happiest MindsAnna University

AI/LLM engineer who built a production content-generation system for nursing education, combining multimodal RAG over proprietary PDFs (including images) with structured Cosmos DB data and external sources. Strong focus on production reliability—prompt-chaining with LangChain, validation/guardrails, and Azure-based monitoring/observability—plus experience designing Azure AI agents with tool integrations like Bing Search.

View profile
SG

Sai Garipally

Screened

Mid-level AI/ML Engineer specializing in GenAI, LLMs, and computer vision

USA5y exp
UiPathSacred Heart University

Built and productionized a multi-agent, LLM-powered document understanding system to replace manual review of long documents, using LangGraph orchestration plus RAG to reduce hallucinations. Implemented layered reliability controls (structured templates, checker agent, and human-in-the-loop feedback) and reported ~40% speed improvement after orchestration; also has hands-on Airflow experience for scheduled data pipelines.

View profile
AM

Mid-level AI Engineer specializing in multi-agent LLM systems and multimodal tutoring

Boston, United States3y exp
PearsonUniversity of Illinois Urbana-Champaign

LLM/agentic systems builder who has deployed multi-agent educational chatbots using LangChain + LangGraph, with LangFuse-based tracing and FastAPI hosting. Focused on production reliability and performance (latency reduction via agent decomposition and caching) and on evaluation/testing (routing test scenarios, LLM-as-judge). Partnered with product to add image understanding by parsing and storing images in S3, expanding chatbot coverage to 30+ books with images.

View profile
TM

Tejal Mane

Screened

Mid-level Machine Learning Engineer specializing in GenAI, LLMs, and real-time ML systems

Moundsville, WV4y exp
CitiusTechUniversity of Michigan

Built and deployed a production long-form article summarization system using BART/T5/PEGASUS, tackling real-world constraints like token limits, latency/quality tradeoffs, and factual drift via chunking/merge logic and constrained decoding. Uses pragmatic Python-based pipeline orchestration (scheduled jobs, modular scripts, logging/retries) and iterates with stakeholder feedback to make outputs genuinely useful for content workflows.

View profile
LK

Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps

New York, NY4y exp
AIGUniversity of Texas at Arlington

LLM/ML platform engineer with hands-on experience taking an LLM document summarization prototype into a production-grade service on AWS EKS, emphasizing low-latency inference, drift monitoring, and safe CI/CD rollouts (canary + rollback). Strong in real-time debugging of agentic/RAG systems (tracing, retrieval/index drift fixes) and in developer enablement through practical workshops (Docker/Kubernetes/FastAPI) plus pre-sales support via demos and benchmarks to close pilots.

View profile
CB

Mid-level Full-Stack Software Engineer specializing in cloud and AI-enabled applications

San Francisco, CA4y exp
One CommunityPurdue University

Product-focused full-stack engineer (70/30 app vs infra) with Accenture experience and recent AI workflow work, shipping end-to-end systems from React/TypeScript UIs through FastAPI backends to Postgres. Built an AI-driven data extraction platform with async job APIs, strict schema validation, and strong observability, and has operated AWS ECS-based deployments with real incident mitigation (DB connection exhaustion/latency under traffic spikes).

View profile
KK

Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps

Remote, United States6y exp
AccentureEastern Illinois University

LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).

View profile
VL

Vasu Lakhani

Screened

Mid-Level Software Engineer specializing in AI-enabled backend and full-stack web systems

Los Angeles, California4y exp
AIRKITCHENZCalifornia State University, Fullerton

Backend/AI workflow engineer with experience at AirKitchenz, Uber, and Vivma Software, building production systems on AWS (Lambda, DynamoDB, Step Functions). Has a track record of major performance wins (DynamoDB latency 2s to <150ms; Postgres query 2s to ~180ms) and shipping LLM-powered onboarding and ticket-routing workflows with strong guardrails (schema validation, confidence thresholds, human-in-the-loop escalation).

View profile
PB

Junior Data Scientist / ML Engineer specializing in LLMs and Computer Vision

Tempe, Arizona2y exp
Arizona State UniversityArizona State University

Currently working in CoRAL Lab, built and deployed IntegrityShield—a document-layer PDF watermarking system that keeps assessments visually identical while disrupting LLM-based solving; validated in a real classroom where it helped catch 12 AI-cheating cases. Also built MALDOC, a modular red-teaming platform for document-processing AI agents using LangGraph to run reproducible, deterministic adversarial trials across OCR/text/vision routes.

View profile
RA

Mid-level Full-Stack Java Developer specializing in cloud-native microservices and FinTech

Austin, TX5y exp
Dell TechnologiesClemson University

Full-stack Java engineer (4+ years) who led end-to-end modernization of high-latency order management systems into cloud-native reactive microservices (Spring WebFlux) and built real-time React/Redux dashboards, reporting 99.98% uptime and 22% infra cost savings. Also headed a production RAG-based Order Support Bot at Dell Technologies with embeddings + MongoDB semantic search, automated validation and human fallback, plus CI/CD-driven LLM eval loops to reduce hallucinations.

View profile
AK

Ansh Krishna

Screened

Intern Data Scientist specializing in ML systems and LLM-powered analytics

Noida, India1y exp
Data Security Council of IndiaUSC

Built an autonomous decision analytics LLM agent for end-to-end tabular binary classification, using RAG (FAISS) to retain context across multi-step queries. Deployed as a FastAPI service with production-style reliability features (schema-aware validation, fallbacks, retries, structured outputs) plus offline/online evaluation and monitoring to reduce analysis time and improve consistency versus stateless approaches.

View profile
Anthony Ngumah - Junior Embedded Software Engineer specializing in robotics, firmware, and AI-enabled systems in Boston, MA

Junior Embedded Software Engineer specializing in robotics, firmware, and AI-enabled systems

Boston, MA4y exp
OpentronsNortheastern University

Robotics-focused engineer with co-op experience building and debugging embedded C++/Python drivers for time-of-flight sensing on a Flex Stacker product, plus automation of large-scale test data collection via Google Drive/Sheets APIs to enable parallel robot testing. Also has ROS2 sensor-driver experience (GPS/RTK/IMU with custom messages/ROSbags) and is building a side project integrating Whisper-based live transcription with chunked abstractive summarization in a latency-aware pipeline.

View profile
Rushir Bhavsar - Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp
Cadence Design SystemsArizona State University

Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.

View profile
Tuukka Luolamo - Executive Technology Leader (CTO) specializing in AI, cloud, and distributed platforms in Remote

Executive Technology Leader (CTO) specializing in AI, cloud, and distributed platforms

Remote14y exp
StagePilotLoyola Marymount University

Engineering leader who stays hands-on in high-leverage technical areas (architecture, scalability, reliability) while operating at an executive level. Led StagePilot’s shift from a tightly coupled legacy system to a cloud-native, event-driven real-time platform proven at 1M+ concurrent users, and previously scaled multiple SRE teams at McGraw-Hill with SLOs, on-call, and blameless ops practices.

View profile

Need someone specific?

AI Search