Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

MK

Mid-Level Java Developer specializing in FinTech microservices

Remote, USA5y exp
StripeUniversity of Central Florida

Backend/platform engineer with deep payments experience who built and operated a real-time transaction routing service end-to-end on AWS (Spring Boot, PostgreSQL/RDS, Redis, Kubernetes), delivering ~40% latency reduction and 99.99% uptime via strong resiliency and observability practices. Also productionized an internal LLM-powered RAG knowledge assistant with guardrails and a user-feedback-driven evaluation loop, and has led incremental monolith-to-microservices modernization using Strangler Fig and shadow traffic.

View profile
RM

Rohith M

Screened

Mid-level Full-Stack Developer specializing in AWS serverless and Java/Spring

Austin, Texas6y exp
AppleUniversity of Bridgeport

Built and shipped a production generative-AI recipe feature on AWS serverless (Lambda + Bedrock), evolving it post-launch from fully AI-generated outputs to user-guided structured generation based on real usage patterns and system metrics. Emphasizes reliability via prompt constraints plus deterministic validation, with automated/human eval loops and CloudWatch-based observability to manage latency, cost, and output consistency.

View profile
Benjamin Kozel - Mid-level Machine Learning & Software Engineer specializing in RAG systems and ML infrastructure in Atlanta, GA

Mid-level Machine Learning & Software Engineer specializing in RAG systems and ML infrastructure

Atlanta, GA4y exp
Montage TechnologyGeorgia Tech

Built and deployed an in-house RAG LLM system ("MONTY") using LLaMA 3B + FAISS to help teams quickly understand long internal/external specifications. Delivered usable production performance despite severe compute limits (single RTX 3080) by tuning retrieval/reranking and model choice, and is planning a LightRAG/knowledge-graph rewrite to improve accuracy and latency.

View profile
Ruturaj Ghatage - Junior Software Engineer specializing in full-stack, cloud infrastructure, and applied AI in Herndon, Virginia

Junior Software Engineer specializing in full-stack, cloud infrastructure, and applied AI

Herndon, Virginia2y exp
Amazon Web ServicesUC San Diego

Master’s student at UC San Diego who built an LLM-powered healthcare chatbot for patient history-taking and sepsis-related output, using a Node.js backend integrated with FastAPI for RAG/LLM interactions and a Flutter client. Also has healthcare AI startup experience deploying on AWS (ECS/Terraform/Docker) and implementing Kubernetes autoscaling to improve efficiency and reduce costs, with strong iterative evaluation in collaboration with a physician.

View profile
SN

Senior AI/ML Engineer specializing in LLMs, NLP, and enterprise conversational AI

Sunnyvale, CA10y exp
WalmartUniversity of Illinois Urbana-Champaign

ML/GenAI engineer with strong end-to-end production ownership across predictive ML, RAG systems, and LLM routing. They pair solid platform engineering skills with measurable business impact, including 15% churn reduction, 35% support ticket deflection, 45% GenAI cost savings, and a shared inference library that cut deployment time from weeks to days.

View profile
JR

Joseph Rivas

Screened

Senior AI/ML Engineer specializing in GenAI, MLOps, and computer vision

Boston, MA9y exp
Jaxon.AIGeorgia Tech

ML/AI engineer with hands-on ownership of production document intelligence and GenAI systems, spanning model experimentation, AWS deployment, monitoring, and iterative optimization. Stands out for turning document-heavy workflows into reliable, near real-time products with measurable gains in accuracy, latency, and manual-effort reduction, while also shipping citation-grounded RAG features that drove user trust and adoption.

View profile
Michael Matonte - Senior Backend Engineer specializing in distributed systems and AI-enabled platforms in Jersey City, NJ

Senior Backend Engineer specializing in distributed systems and AI-enabled platforms

Jersey City, NJ7y exp
CitibankUniversity of Texas at Austin

Backend engineer with end-to-end ownership experience in high-stakes environments spanning Citibank and industrial operations. They built an internal banking platform that automated complex entitlement workflows across thousands of business units with an 80% reduction in redundant processing, and they are now applying AI through OpenAI-powered agent workflows with RAG, vector databases, and security controls.

View profile
DV

Dipesh Valia

Screened

Director of Engineering specializing in cloud platforms and enterprise SaaS

San Jose, CA24y exp
IvantiCarnegie Mellon University

Engineering leader focused on large-scale enterprise SaaS and MDM platforms, with experience modernizing monoliths into microservices, improving reliability, and scaling systems to support 15M devices across AWS and Azure. Stands out for combining deep platform architecture work with strong org-building: managed teams up to 45 globally and built a 0-to-1 platform services team to 22 people in under a year.

View profile
Harsh Sanas - Junior Full-Stack Engineer specializing in AI systems and cloud applications in San Francisco, CA

Harsh Sanas

Screened

Junior Full-Stack Engineer specializing in AI systems and cloud applications

San Francisco, CA2y exp
Scale AIUSC

Full-stack engineer with a strong applied AI bent who has built both a real-time EV charging platform and a production text-to-SQL system. Particularly compelling for teams needing someone who can bridge frontend, backend, infrastructure, and LLM evaluation/safety work, with experience shipping under early-stage ambiguity and integrating software with real-world hardware.

View profile
WL

winston lo

Screened

Junior Software Engineer specializing in AI agents, RAG, and full-stack development

Remote2y exp
Tresle AIUC Berkeley

Backend engineer who built and iterated a secure, multi-tenant RAG system over a large document corpus, emphasizing strict RBAC/ACL isolation, hybrid retrieval (vector+keyword), reranking, and strong observability to balance relevance, latency, and cost. Also led production refactors/migrations using strangler + feature flags/dual writes and has experience catching subtle real-world failure modes (including in a sensor calibration optimization pipeline).

View profile
Darsh Sharma - Mid-level Software Engineer specializing in ML systems and microservices in Madison, WI

Darsh Sharma

Screened

Mid-level Software Engineer specializing in ML systems and microservices

Madison, WI2y exp
TeradataUniversity of Wisconsin–Madison

Teradata Text Security intern who built a production LLM-powered planner agent that decomposes complex tasks into dependency-aware subtasks (DAG/topological graph) and executes them via a custom orchestrator with parallelism, status tracking, and error handling. Also contributed to an HR-facing internal document chatbot concept to streamline onboarding, showing cross-functional collaboration.

View profile
VG

Machine learning engineer and software developer with experience across fintech, e-commerce, and gaming.

Dallas, Texas, USA6y exp
Fidelity InvestmentsUniversity of the Cumberlands

ML/AI engineer with hands-on ownership of production systems spanning classical ML fraud detection and GenAI agent workflows. At Fidelity, they built an end-to-end fraud platform that improved review queue Precision@K by 15-20% while reducing false positives 10-15%, and they also shipped RAG-based agent systems that cut manual workflow effort by 30-40%.

View profile
Hansraj Pabbati - Senior Software Engineer specializing in AI/LLM systems and cloud backend platforms in Remote, CA

Senior Software Engineer specializing in AI/LLM systems and cloud backend platforms

Remote, CA8y exp
OracleSan Jose State University

Built and owned an end-to-end AI-powered natural-language-to-SQL deployment within Oracle OCI/Autonomous Database, including enrichment pipelines, RAG-based retrieval, SQL generation APIs, and post-launch monitoring. Stands out for combining LLM production engineering with strong guardrails, stakeholder management, and operational rigor around accuracy, latency, hallucination mitigation, and reliability.

View profile
Nilesh Dixit - Executive AI engineering leader specializing in agentic AI and enterprise platforms in San Francisco, CA

Nilesh Dixit

Screened

Executive AI engineering leader specializing in agentic AI and enterprise platforms

San Francisco, CA24y exp
Zeehub AICentre for Development of Advanced Computing

Bay Area engineering leader and startup co-founder with a rare mix of deep hands-on architecture experience, large-scale people leadership, and cross-functional product ownership. He helped launch GE Digital's industrial IoT efforts, holds multiple patents in the space, has scaled teams to 60-70 people, and has led both enterprise platform modernization and AI startup product development.

View profile
SS

Shubham Singh

Screened

Mid-level Software Engineer specializing in LLM systems and intelligent search

CO, USA6y exp
PalantirSan José State University

Backend engineer from Palantir who built and productionized an enterprise LLM-based document intelligence/search platform, evolving it into a hybrid lexical+vector retrieval system. Emphasizes reliability and cost control via strict LLM gating, robust fallback paths, and evaluation frameworks (e.g., MMLU/BLEU), plus disciplined migration practices (feature flags, dual-writes, shadow reads) to ship changes safely at scale.

View profile
NJ

Mid-level Applied AI Engineer specializing in LLM agents, RAG, and model alignment

Chicago, IL3y exp
Medhastra AINorthwestern University

Applied Scientist with legal-tech experience who builds production LLM systems. Created and deployed Quibo AI, a LangGraph-based multi-agent pipeline that turns large markdown/Jupyter inputs into polished blogs and social posts, overcoming context limits via ChromaDB + HyDE RAG. Also built a large-scale iterative code-evolution workflow using multi-model orchestration (GPT/Claude/Gemini) with testing, debugging loops, and evaluation/observability practices.

View profile
SS

Shubham Singh

Screened

Mid-level AI/ML Engineer specializing in speech, computer vision, and agentic GenAI

Pittsburgh, PA6y exp
Musing AICarnegie Mellon University

Built and shipped a production multi-agent, voice-based conversational assistant for older adults’ daily health management using Vertex AI, FastAPI, Firebase/Firestore, and Cloud Run, with a custom cross-session memory design to keep responses context-aware at low latency. Also partnered with caregivers/elderly users and health officials, translating needs into workflows and explaining HIV risk predictions with SHAP and dashboards.

View profile
JY

Jiacheng Yin

Screened

Intern Software Engineer specializing in data engineering and AI agent systems

Beijing, China1y exp
JD.comCornell University

AI engineer at Anote.ai who built and shipped a production multi-agent LangGraph/LangChain/Ray RAG platform for enterprise search and workflow automation, supporting 3 commercial products and 100+ developers. Drove measurable gains (30% accuracy improvement, lower latency) and improved reliability with Redis-based state checkpointing, message-queue synchronization, and Milvus retrieval optimizations, while partnering with PMs/clients to add transparency features like confidence scores and real-time logs.

View profile
AR

Amy Russ

Screened

Director-level Applied Science & AI/ML leader specializing in LLMs, RAG, and MLOps

Atlanta, GA9y exp
HertzUniversity of Tennessee, Knoxville

Active in the venture ecosystem as a Rogue Women's Fund fellow and angel investor, with memberships in Gaingels and Angel Squad (HustleFund). Interested in founding a company to leverage extensive experience, and evaluates ideas through market need and economic viability of the target population.

View profile
Akhilaa Sonduri - Junior Full-Stack Software Engineer specializing in SaaS and AI-powered web apps in Cambridge, MA

Junior Full-Stack Software Engineer specializing in SaaS and AI-powered web apps

Cambridge, MA2y exp
HubSpotUSC

Full-stack engineer with experience at HubSpot, Accolite, and an early-stage USC alumni startup (Workup). Built and shipped end-to-end workflow automation features (dynamic input configuration with strict schema validation) driving ~25% faster configuration, and delivered an AI interview customization feature in a high-ambiguity startup setting that increased adoption by ~40%. Comfortable operating production systems on AWS with CloudWatch observability and CI/CD, and has built real-time web apps with caching/indexing for performance.

View profile
Feras Alsaiari - Senior Software Engineer specializing in AWS data platforms and event-driven systems

Senior Software Engineer specializing in AWS data platforms and event-driven systems

4y exp
Capital OneGeorgia Tech

Capital One engineer leading the architecture and delivery of a large-scale AWS Glue/Spark/Delta Lake batch messaging pipeline that decoupled batch from real-time flows, added multi-region failover and automated retries, and delivered ~40% AWS cost savings with ~3x performance gains. Currently building an LLM-powered Slack bot using RAG to automate message investigations by querying CloudWatch, Snowflake, and internal documentation with privacy-aware masking of NPI/PII.

View profile
Pranav Puranik - Senior AI Engineer specializing in LLMs, RAG, and multimodal NLP in Austin, TX

Senior AI Engineer specializing in LLMs, RAG, and multimodal NLP

Austin, TX5y exp
Health Care Service CorporationUniversity of Florida

Built a production LLM/RAG assistant for insurance/health claims agents that ingests 100–200 page patient PDFs via OCR (migrated from local Tesseract to Azure Document Intelligence) and delivers grounded claim detail retrieval plus summaries with PII/PHI guardrails. Experienced orchestrating large workflows with Celery worker pipelines and AWS Step Functions (S3-triggered, Fargate-based batch inference/accuracy aggregation), and collaborates closely with non-technical SMEs (claims agents/nurses) through shadowing, iterative demos, and SME-defined evaluation.

View profile
TT

Junior Full-Stack Engineer specializing in web applications and backend systems

San Jose, CA3y exp
ParadigmUC Irvine

Software engineer at Paradigm who is deeply hands-on with agentic development workflows and AI-assisted coding. He built an in-house OAuth layer to replace a third-party service and reduce projected integration costs, and also created an "ambient agent" that proactively responds to user behavior on the website. He stands out for combining full-stack architecture thinking with a disciplined, model-aware workflow for designing, implementing, and reviewing AI-generated code.

View profile
NI

Nergal Issaie

Screened

Director of Engineering specializing in AI/ML platforms and cloud systems

San Jose, CA20y exp
IBMSan Jose State University

Senior engineering leader from IBM who has built and scaled enterprise AI/GenAI platforms across hybrid and multi-cloud environments, combining executive-level org leadership with hands-on debugging of production distributed systems. Particularly compelling for Director/VP roles needing someone who can unify architecture, platform strategy, and engineering execution across multiple teams.

View profile

Need someone specific?

AI Search