Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation (RAG)Python Docker AWS SQL CI/CD

Meghana kanakam

Screened

Mid-Level Java Developer specializing in FinTech microservices

Remote, USA5y exp

StripeUniversity of Central Florida

“Backend/platform engineer with deep payments experience who built and operated a real-time transaction routing service end-to-end on AWS (Spring Boot, PostgreSQL/RDS, Redis, Kubernetes), delivering ~40% latency reduction and 99.99% uptime via strong resiliency and observability practices. Also productionized an internal LLM-powered RAG knowledge assistant with guardrails and a user-feedback-driven evaluation loop, and has led incremental monolith-to-microservices modernization using Strangler Fig and shadow traffic.”

A/B Testing API Gateway Auto-scaling AWS AWS CodePipeline Azure DevOps+99

View profile

Rohith M

Screened

Mid-level Full-Stack Developer specializing in AWS serverless and Java/Spring

Austin, Texas6y exp

AppleUniversity of Bridgeport

“Built and shipped a production generative-AI recipe feature on AWS serverless (Lambda + Bedrock), evolving it post-launch from fully AI-generated outputs to user-guided structured generation based on real usage patterns and system metrics. Emphasizes reliability via prompt constraints plus deterministic validation, with automated/human eval loops and CloudWatch-based observability to manage latency, cost, and output consistency.”

Java Python JavaScript React Angular Bootstrap+98

View profile

Benjamin Kozel

Screened

Mid-level Machine Learning & Software Engineer specializing in RAG systems and ML infrastructure

Atlanta, GA4y exp

Montage TechnologyGeorgia Tech

“Built and deployed an in-house RAG LLM system ("MONTY") using LLaMA 3B + FAISS to help teams quickly understand long internal/external specifications. Delivered usable production performance despite severe compute limits (single RTX 3080) by tuning retrieval/reranking and model choice, and is planning a LightRAG/knowledge-graph rewrite to improve accuracy and latency.”

Anomaly Detection Automation Bash C C++CI/CD+66

View profile

Ruturaj Ghatage

Screened

Junior Software Engineer specializing in full-stack, cloud infrastructure, and applied AI

Herndon, Virginia2y exp

Amazon Web ServicesUC San Diego

“Master’s student at UC San Diego who built an LLM-powered healthcare chatbot for patient history-taking and sepsis-related output, using a Node.js backend integrated with FastAPI for RAG/LLM interactions and a Flutter client. Also has healthcare AI startup experience deploying on AWS (ECS/Terraform/Docker) and implementing Kubernetes autoscaling to improve efficiency and reduce costs, with strong iterative evaluation in collaboration with a physician.”

Python C++HTML CSS JavaScript SQL+61

View profile

Sairaghav Nissankula

Screened

Senior AI/ML Engineer specializing in LLMs, NLP, and enterprise conversational AI

Sunnyvale, CA10y exp

WalmartUniversity of Illinois Urbana-Champaign

“ML/GenAI engineer with strong end-to-end production ownership across predictive ML, RAG systems, and LLM routing. They pair solid platform engineering skills with measurable business impact, including 15% churn reduction, 35% support ticket deflection, 45% GenAI cost savings, and a shared inference library that cut deployment time from weeks to days.”

Generative AI Python PyTorch FAISS Elasticsearch LangChain+149

View profile

Joseph Rivas

Screened

Senior AI/ML Engineer specializing in GenAI, MLOps, and computer vision

Boston, MA9y exp

Jaxon.AIGeorgia Tech

“ML/AI engineer with hands-on ownership of production document intelligence and GenAI systems, spanning model experimentation, AWS deployment, monitoring, and iterative optimization. Stands out for turning document-heavy workflows into reliable, near real-time products with measurable gains in accuracy, latency, and manual-effort reduction, while also shipping citation-grounded RAG features that drove user trust and adoption.”

Agentic AI Generative AI Multi-Agent Systems ReAct Computer Vision Natural Language Processing+282

View profile

Michael Matonte

Screened

Senior Backend Engineer specializing in distributed systems and AI-enabled platforms

Jersey City, NJ7y exp

CitibankUniversity of Texas at Austin

“Backend engineer with end-to-end ownership experience in high-stakes environments spanning Citibank and industrial operations. They built an internal banking platform that automated complex entitlement workflows across thousands of business units with an 80% reduction in redundant processing, and they are now applying AI through OpenAI-powered agent workflows with RAG, vector databases, and security controls.”

Java Python JavaScript C#Data Pipelines Data Modeling+83

View profile

Dipesh Valia

Screened

Director of Engineering specializing in cloud platforms and enterprise SaaS

San Jose, CA24y exp

IvantiCarnegie Mellon University

“Engineering leader focused on large-scale enterprise SaaS and MDM platforms, with experience modernizing monoliths into microservices, improving reliability, and scaling systems to support 15M devices across AWS and Azure. Stands out for combining deep platform architecture work with strong org-building: managed teams up to 45 globally and built a 0-to-1 platform services team to 22 people in under a year.”

Product development Performance optimization Microservices CI/CD Jenkins Azure DevOps+151

View profile

Harsh Sanas

Screened

Junior Full-Stack Engineer specializing in AI systems and cloud applications

San Francisco, CA2y exp

Scale AIUSC

“Full-stack engineer with a strong applied AI bent who has built both a real-time EV charging platform and a production text-to-SQL system. Particularly compelling for teams needing someone who can bridge frontend, backend, infrastructure, and LLM evaluation/safety work, with experience shipping under early-stage ambiguity and integrating software with real-world hardware.”

Python PostgreSQL Redis REST APIs AWS Amazon S3+312

View profile

winston lo

Screened

Junior Software Engineer specializing in AI agents, RAG, and full-stack development

Remote2y exp

Tresle AIUC Berkeley

“Backend engineer who built and iterated a secure, multi-tenant RAG system over a large document corpus, emphasizing strict RBAC/ACL isolation, hybrid retrieval (vector+keyword), reranking, and strong observability to balance relevance, latency, and cost. Also led production refactors/migrations using strangler + feature flags/dual writes and has experience catching subtle real-world failure modes (including in a sensor calibration optimization pipeline).”

Python Java JavaScript TypeScript SQL HTML+114

View profile

Darsh Sharma

Screened

Mid-level Software Engineer specializing in ML systems and microservices

Madison, WI2y exp

TeradataUniversity of Wisconsin–Madison

“Teradata Text Security intern who built a production LLM-powered planner agent that decomposes complex tasks into dependency-aware subtasks (DAG/topological graph) and executes them via a custom orchestrator with parallelism, status tracking, and error handling. Also contributed to an HR-facing internal document chatbot concept to streamline onboarding, showing cross-functional collaboration.”

C C++Python Java SQL PyTorch+101

View profile

Vikranth Gurram

Screened

Machine learning engineer and software developer with experience across fintech, e-commerce, and gaming.

Dallas, Texas, USA6y exp

Fidelity InvestmentsUniversity of the Cumberlands

“ML/AI engineer with hands-on ownership of production systems spanning classical ML fraud detection and GenAI agent workflows. At Fidelity, they built an end-to-end fraud platform that improved review queue Precision@K by 15-20% while reducing false positives 10-15%, and they also shipped RAG-based agent systems that cut manual workflow effort by 30-40%.”

Python SQL C++TypeScript Machine Learning Scikit-learn+66

View profile

Hansraj Pabbati

Screened

Senior Software Engineer specializing in AI/LLM systems and cloud backend platforms

Remote, CA8y exp

OracleSan Jose State University

“Built and owned an end-to-end AI-powered natural-language-to-SQL deployment within Oracle OCI/Autonomous Database, including enrichment pipelines, RAG-based retrieval, SQL generation APIs, and post-launch monitoring. Stands out for combining LLM production engineering with strong guardrails, stakeholder management, and operational rigor around accuracy, latency, hallucination mitigation, and reliability.”

Prompt Engineering Vector Search Embeddings Machine Learning Feature Engineering Model Evaluation+107

View profile

Nilesh Dixit

Screened

Executive AI engineering leader specializing in agentic AI and enterprise platforms

San Francisco, CA24y exp

Zeehub AICentre for Development of Advanced Computing

“Bay Area engineering leader and startup co-founder with a rare mix of deep hands-on architecture experience, large-scale people leadership, and cross-functional product ownership. He helped launch GE Digital's industrial IoT efforts, holds multiple patents in the space, has scaled teams to 60-70 people, and has led both enterprise platform modernization and AI startup product development.”

Agentic AI Multi-Agent Systems Go-to-Market Strategy AI Agents LangChain LangGraph+87

View profile

Shubham Singh

Screened

Mid-level Software Engineer specializing in LLM systems and intelligent search

CO, USA6y exp

PalantirSan José State University

“Backend engineer from Palantir who built and productionized an enterprise LLM-based document intelligence/search platform, evolving it into a hybrid lexical+vector retrieval system. Emphasizes reliability and cost control via strict LLM gating, robust fallback paths, and evaluation frameworks (e.g., MMLU/BLEU), plus disciplined migration practices (feature flags, dual-writes, shadow reads) to ship changes safely at scale.”

AJAX API Design API Gateway Apache Kafka Agile Angular+181

View profile

Narasimha Jwalapuram

Screened

Mid-level Applied AI Engineer specializing in LLM agents, RAG, and model alignment

Chicago, IL3y exp

Medhastra AINorthwestern University

“Applied Scientist with legal-tech experience who builds production LLM systems. Created and deployed Quibo AI, a LangGraph-based multi-agent pipeline that turns large markdown/Jupyter inputs into polished blogs and social posts, overcoming context limits via ChromaDB + HyDE RAG. Also built a large-scale iterative code-evolution workflow using multi-model orchestration (GPT/Claude/Gemini) with testing, debugging loops, and evaluation/observability practices.”

BigQuery CI/CD Databricks Docker FastAPI GPT-4+82

View profile

Shubham Singh

Screened

Mid-level AI/ML Engineer specializing in speech, computer vision, and agentic GenAI

Pittsburgh, PA6y exp

Musing AICarnegie Mellon University

“Built and shipped a production multi-agent, voice-based conversational assistant for older adults’ daily health management using Vertex AI, FastAPI, Firebase/Firestore, and Cloud Run, with a custom cross-session memory design to keep responses context-aware at low latency. Also partnered with caregivers/elderly users and health officials, translating needs into workflows and explaining HIV risk predictions with SHAP and dashboards.”

BigQuery CI/CD C C++Computer Vision Containerization+108

View profile

Jiacheng Yin

Screened

Intern Software Engineer specializing in data engineering and AI agent systems

Beijing, China1y exp

JD.comCornell University

“AI engineer at Anote.ai who built and shipped a production multi-agent LangGraph/LangChain/Ray RAG platform for enterprise search and workflow automation, supporting 3 commercial products and 100+ developers. Drove measurable gains (30% accuracy improvement, lower latency) and improved reliability with Redis-based state checkpointing, message-queue synchronization, and Milvus retrieval optimizations, while partnering with PMs/clients to add transparency features like confidence scores and real-time logs.”

.NET Agile AI agents Amazon CloudFront Angular Anomaly detection+158

View profile

Amy Russ

Screened

Director-level Applied Science & AI/ML leader specializing in LLMs, RAG, and MLOps

Atlanta, GA9y exp

HertzUniversity of Tennessee, Knoxville

“Active in the venture ecosystem as a Rogue Women's Fund fellow and angel investor, with memberships in Gaingels and Angel Squad (HustleFund). Interested in founding a company to leverage extensive experience, and evaluates ideas through market need and economic viability of the target population.”

A/B Testing Agile Agentic AI API Design AWS CI/CD+83

View profile

Akhilaa Sonduri

Screened

Junior Full-Stack Software Engineer specializing in SaaS and AI-powered web apps

Cambridge, MA2y exp

HubSpotUSC

“Full-stack engineer with experience at HubSpot, Accolite, and an early-stage USC alumni startup (Workup). Built and shipped end-to-end workflow automation features (dynamic input configuration with strict schema validation) driving ~25% faster configuration, and delivered an AI interview customization feature in a high-ambiguity startup setting that increased adoption by ~40%. Comfortable operating production systems on AWS with CloudWatch observability and CI/CD, and has built real-time web apps with caching/indexing for performance.”

Agile Angular API Integration Bootstrap C C+++83

View profile

Feras Alsaiari

Screened

Senior Software Engineer specializing in AWS data platforms and event-driven systems

4y exp

Capital OneGeorgia Tech

“Capital One engineer leading the architecture and delivery of a large-scale AWS Glue/Spark/Delta Lake batch messaging pipeline that decoupled batch from real-time flows, added multi-region failover and automated retries, and delivered ~40% AWS cost savings with ~3x performance gains. Currently building an LLM-powered Slack bot using RAG to automate message investigations by querying CloudWatch, Snowflake, and internal documentation with privacy-aware masking of NPI/PII.”

Python Java JavaScript SQL C C+++91

View profile

Pranav Puranik

Screened

Senior AI Engineer specializing in LLMs, RAG, and multimodal NLP

Austin, TX5y exp

Health Care Service CorporationUniversity of Florida

“Built a production LLM/RAG assistant for insurance/health claims agents that ingests 100–200 page patient PDFs via OCR (migrated from local Tesseract to Azure Document Intelligence) and delivers grounded claim detail retrieval plus summaries with PII/PHI guardrails. Experienced orchestrating large workflows with Celery worker pipelines and AWS Step Functions (S3-triggered, Fargate-based batch inference/accuracy aggregation), and collaborates closely with non-technical SMEs (claims agents/nurses) through shadowing, iterative demos, and SME-defined evaluation.”

Python SQL Java TypeScript Bash Unix+120

View profile

Ting Tse Chang

Screened

Junior Full-Stack Engineer specializing in web applications and backend systems

San Jose, CA3y exp

ParadigmUC Irvine

“Software engineer at Paradigm who is deeply hands-on with agentic development workflows and AI-assisted coding. He built an in-house OAuth layer to replace a third-party service and reduce projected integration costs, and also created an "ambient agent" that proactively responds to user behavior on the website. He stands out for combining full-stack architecture thinking with a disciplined, model-aware workflow for designing, implementing, and reviewing AI-generated code.”

TypeScript Python Java JavaScript Next.js React+89

View profile

Nergal Issaie

Screened

Director of Engineering specializing in AI/ML platforms and cloud systems

San Jose, CA20y exp

IBMSan Jose State University

“Senior engineering leader from IBM who has built and scaled enterprise AI/GenAI platforms across hybrid and multi-cloud environments, combining executive-level org leadership with hands-on debugging of production distributed systems. Particularly compelling for Director/VP roles needing someone who can unify architecture, platform strategy, and engineering execution across multiple teams.”

Machine Learning Generative AI AWS Google Cloud Platform Microservices Distributed Systems+131

View profile

Software Engineers Machine Learning Engineers Data Scientists Software Developers Research Assistants Full Stack Developers Engineering AI & Machine Learning Data & Analytics Executive & Leadership

Need someone specific?

AI Search

Related

Need someone specific?