Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation (RAG)Python Docker CI/CD AWS SQL

Ofek Shaltiel

Screened

Mid-Level Full-Stack Software Engineer specializing in AI-enabled web platforms

Dallas, TX4y exp

HyperWater AIUniversity of Texas at Dallas

“Backend/AI engineer in construction tech (HyperWater AI) who delivered major production performance wins (analytics API from ~1 hour to 0.5s) and shipped LLM features for parsing subcontractor manifests into CSI divisions with human-in-the-loop review. Also built a freelance agentic document-verification system using OCR + RAG over pgvector with robust retry/escalation logic and user feedback loops.”

Agile Amazon DynamoDB AWS AWS Lambda CI/CD C+89

View profile

Viswanath Jagaluri

Screened

Mid-level Full-Stack & AI Engineer specializing in LLM applications

6y exp

Our National ConversationFitchburg State University

“Full-stack engineer who has shipped and operated generative-AI chat/QA features end-to-end, including a RAG-based pipeline with guardrails and cost/latency monitoring in production. Experienced with React/TypeScript + Node/Postgres architectures, Dockerized deployments to AWS (EC2) via GitHub Actions CI/CD, and building reliable ingestion/ETL systems with idempotency, backfills, and reconciliation.”

Python Java JavaScript TypeScript SQL C#+222

View profile

Ram Abhinav Vedant Madabushi

Screened

Junior Full-Stack/AI Engineer specializing in web platforms and LLM applications

Palo Alto, CA2y exp

FoodSupply.aiUniversity of Central Florida

“Backend engineer from FoodSupply.ai who built and evolved a scalable restaurant/supplier product and order management platform using Node.js and REST APIs. Implemented a hybrid MySQL+MongoDB data architecture, optimized performance with Redis/Prisma, and led a phased migration with feature flags and a temporary sync layer to maintain data consistency. Strong focus on production security (OAuth2, RBAC, row-level security, AWS IAM) and reliability practices (testing with Pytest, Docker/AWS pipelines).”

API Gateway AWS AWS IAM AWS Lambda Bash Bootstrap+116

View profile

Karthik Patralapati

Screened

Mid-level AI/ML Software Engineer specializing in GPU-optimized LLM inference and cloud microservices

Seattle, WA5y exp

DVR SoftekSan José State University

“Built and deployed a production RAG-based multilingual analytics assistant for healthcare operations, enabling non-technical teams to query claims/EHR and risk metrics with grounded explanations. Demonstrates strong end-to-end LLM system engineering (retrieval tuning, re-ranking, hallucination controls, verification layers) plus workflow orchestration (Airflow/Composer/Step Functions) and stakeholder-driven iteration via prototypes and dashboards.”

Python Pandas NumPy PySpark C C+++197

View profile

Mayank VYAS

Screened

Mid-level AI/ML Engineer specializing in LLM agents, RAG retrieval, and IoT ML systems

Tempe, AZ4y exp

Coral LabsArizona State University

“Built production LLM-driven products including a job-hunt AI (job ranking + resume optimization) and an InterviewAI agentic pipeline using LangChain. Focused on practical deployment concerns like securing OpenAI usage via rate limiting and tiered quotas, and demonstrates an applied approach to choosing models, retrieval methods (RAG), and prompting strategies.”

Algorithms Anomaly Detection AWS Bash BigQuery C+81

View profile

Pranav Mishra

Screened

Junior Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

Charlotte, NC2y exp

WheelPriceUniversity of Illinois Chicago

“AI/ML engineer who has shipped production systems across computer vision and conversational agents: built a YOLOv8-based wheel fitment pipeline at a Techstars-backed automotive startup, focusing on sub-second latency, monitoring, and robust fallback mechanisms that drove 2–3x page view growth and +5–6k users. Also built a voice-based interview platform orchestrating Deepgram + GPT-4 Mini + OpenAI TTS with FSM-driven reliability, and has hands-on RAG experience (LangChain, hybrid retrieval, cross-encoder reranking, custom pseudo-query generation).”

Python Java C++JavaScript C#TensorFlow+117

View profile

Taruni Reddy Ampojwala

Screened

Mid-level GenAI Engineer specializing in LLM agents and RAG systems

Brooklyn, NY4y exp

PamTenLong Island University

“Built and deployed a production RAG-based LLM assistant that answers day-to-day operational questions from internal PDFs/SOPs, with strong emphasis on data consistency (metadata versioning, confidence thresholds, conflict handling) and low-latency retrieval at scale. Experienced designing and orchestrating multi-agent LLM workflows (retrieval/validation/generation) and pipeline orchestration for ingestion/embedding/vector-store updates, plus iterative delivery with non-technical operations/business stakeholders.”

Alerting Analytics AWS BigQuery CI/CD Claude+107

View profile

MANOGNA VADLAMUDI

Screened

Junior AI Engineer specializing in LLM evaluation, prompt engineering, and AI orchestration

Chicago, IL1y exp

IDSIllinois Institute of Technology

“LLM workflow builder who has deployed a personalized GPT experience (including Delphi AI-based knowledge ingestion) and built a LangChain/LangGraph job-aggregation pipeline that ingests, normalizes/dedupes, filters, then uses an LLM to rank and summarize matches. Emphasizes production reliability with structured outputs, retries/fallbacks, metric-driven evaluation, logging/prompt versioning, and A/B testing, and collaborates with non-technical stakeholders through demo-driven iteration.”

Python JavaScript C SQL Java Object-Oriented Programming (OOP)+106

View profile

Yashwant Gandham

Screened

Junior Machine Learning & Backend Engineer specializing in LLM systems and ML infrastructure

Boulder, CO1y exp

NovaChat AIUniversity of Colorado Boulder

“Built and deployed production RAG-based document search/Q&A systems (DocChat and an internship marketing RAG), using a React + FastAPI stack on GCP with docs stored in GCP buckets and retrieval via embeddings/vector DB. Emphasizes cost/performance tradeoffs (reported ~40% cost reduction) and ships via Docker (Railway), with load/API testing using JMeter and Swagger; regularly collaborates with a CEO stakeholder to iterate and push changes to production.”

Python NumPy Pandas PyTorch scikit-learn SQL+78

View profile

Rethvick Sriram Yugendra Babu

Screened

Junior AI/ML Software Engineer specializing in Generative AI and scalable data pipelines

Tucson, AZ2y exp

University of ArizonaUniversity of Arizona

“Built and operated large-scale biodiversity/ecological research platforms, integrating 50+ heterogeneous global datasets into a unified BIEN 3 schema on PostgreSQL/PostGIS and improving data consistency by 35%. Strong production engineering background (Linux monitoring, CI/CD performance gates, Docker on AWS/Azure) plus applied AI work building a Python RAG system (0.90 precision) and halving latency with Elasticsearch.”

AWS CI/CD C#C++Computer Vision D3.js+109

View profile

Binaya Sharma

Screened

Senior Software Engineer specializing in full-stack systems, big data, and applied AI

Baton Rouge, LA6y exp

365LabsLouisiana State University

“Built and deployed ForensicLLM, a local domain-specific LLaMA-3.1-8B model for digital forensic investigators using RAFT + RAG over 1000+ curated research papers, with citation-aware responses and rigorous evaluation (BERTScore/G-Eval). Deployed via vLLM and Docker and validated through a chatbot survey with 80+ participants; published at DFRWS EU 2025.”

Agile Ansible Angular Apache Hadoop Apache Kafka Apache Spark+107

View profile

BHAVANA KRISHNAN

Screened

Junior AI/ML Engineer specializing in Generative AI, NLP, and MLOps

Lewisville, TX1y exp

ThinkBig Software SolutionsTexas Tech University

“LLM engineer who has deployed a production RAG system (LangChain/FAISS/FastAPI) for enterprise semantic search, tackling real-world latency by LoRA/PEFT fine-tuning and grounding outputs with retrieval. Brings strong MLOps (Docker, AWS EKS, CI/CD, MLflow) plus stakeholder-facing explainability experience using SHAP to align ML-driven financial guidance with non-technical domain experts.”

Apache Spark AWS AWS Lambda Azure Machine Learning CI/CD Clustering+87

View profile

Darshan Shah

Screened

Mid-Level Software Engineer specializing in cloud-native microservices and full-stack development

Holliston, MA6y exp

Liberating TechnologiesNortheastern University

“Full-stack engineer with deep startup experience building products from scratch under ambiguous requirements. Delivered a scalable, admin-configurable notification platform (Spring Boot/Java/Kafka) supporting 50+ notification types across 3 channels for 10k+ users, cutting new notification setup to ~5 minutes. Also built a Tinder-meets-LinkedIn job-swiping app (React/TS + Node/Prisma) and has hands-on AWS production ops (ECS/EKS, RDS, CloudWatch) plus multiple third-party integrations (Stripe, QuickBooks, Twilio).”

Java Python TypeScript JavaScript Swift React+128

View profile

Alejandro Alemany

Screened

Senior Full-Stack AI/ML Engineer specializing in MLOps and GenAI

Belmont, Michigan10y exp

AvaSureCapitol Technology University

“Senior backend/data engineer who has built and maintained HIPAA-compliant, real-time clinical FastAPI services on AWS, orchestrating ML/LLM and vector DB calls with strong reliability patterns (auth, timeouts/retries, graceful degradation, idempotency). Also delivered AWS IaC/CI-CD (Terraform/Helm/GitHub Actions) across EKS/Lambda/SageMaker and built Glue/Spark ETL with schema evolution and data quality controls, plus demonstrated large SQL performance wins (15 min to <9 sec) and hands-on incident ownership.”

Angular API Design Authentication Authorization AWS Azure Blob Storage+197

View profile

Vishnu Priyan Sellam Shanmugavel

Screened

Mid-Level Applied AI Engineer specializing in LLM services, RAG, and OCR/NLP extraction

Arlington, VA4y exp

HealthLab InnovationsIllinois Institute of Technology

“Backend/platform engineer who built and evolved a large-scale healthcare document processing system (OCR + LLM orchestration) in Python/FastAPI on Google Cloud (Cloud Run, GCS, Firestore), processing ~1.5M files per batch and tens of millions overall. Emphasizes reliability and operational safety via deterministic IDs, idempotent state machines, strong observability, and self-healing reconciliation, plus disciplined migrations using dual-run validation and incremental rollouts.”

Agile Android Angular AWS BigQuery C+169

View profile

Tamir ShemTov

Screened

Entry-Level Computer Vision Research Assistant specializing in medical imaging AI

Los Angeles, CA1y exp

Cedars-SinaiCalifornia State University, East Bay

“New grad who shipped an LLM-powered writing app (“Write-it”) to production on Azure with CI/CD (GitHub Actions + JFrog) and implemented an unconventional RAG pipeline to prevent repetitive prompts using embeddings and cosine similarity. Also participated in a Luma AI image/video generation hackathon, iterating with artist feedback and improving usability by rewriting non-technical prompts via an LLM.”

Python C++PyTorch TensorFlow Scikit-learn Pandas+59

View profile

Navyatej Tummala

Screened

Junior Backend Engineer specializing in cloud APIs and AI-enabled systems

Raleigh, NC2y exp

NC State UniversityNorth Carolina State University

“Built and shipped "OnCall Copilot," a production Slack-based RAG assistant that answers on-call questions from runbooks and postmortems with citations using a FAISS vector index. Emphasizes reliability and measurable performance via strict guardrails ("no evidence, no answer"), evaluation metrics, drift monitoring, and operational hardening with Docker, logging, health checks, and offline fallback.”

API Gateway AWS AWS Lambda Authentication Authorization BERT+87

View profile

Smit Panchal

Screened

Mid-level Full-Stack & XR Developer specializing in GenAI and immersive AR/VR systems

3y exp

Community Dreams FoundationIllinois Institute of Technology

“Built and deployed a "personal second brain" product (CloneMind) with an end-to-end RAG pipeline for retrieving information across PDFs, URLs, images, and audio using Next.js/Node.js/Postgres/Supabase/Redis. Demonstrates strong practical depth in retrieval quality tuning, latency reduction via caching, and stateful orchestration with LangChain/LangGraph, plus experience persuading a non-technical professor stakeholder by shipping a working prototype.”

Agile Algorithms API Integration AWS C C#+142

View profile

CharanKumar Pathakamuri

Screened

Entry-Level GenAI/LLM Engineer specializing in agentic systems and RAG

Baltimore, MD1y exp

Kanehl ConsultingUniversity of Maryland, Baltimore County

“LLM/AI agent engineer with consulting/contract experience (Kanhaiya Consulting LLC) who deployed a production AI agent to automate BIM list workflows end-to-end—from database understanding and data cleaning to automated visualizations/dashboards. Worked around restricted real-time data access by generating synthetic data and improving outputs via supervised fine-tuning, and uses AWS-based LLMOps observability (Opic/OPEC) plus hybrid retrieval (vector+BM25 with reranking) to optimize relevance, latency, and cost.”

Algorithms AWS AWS Lambda ChromaDB CI/CD Data Pipelines+77

View profile

Kevin Sheu

Screened

Junior Full-Stack Software Engineer specializing in AI/ML platforms and microservices

2y exp

NCKUNational Cheng Kung University

“Graduate-school lab engineer who built and owned the final architecture of a Microservices Hub that integrates REST APIs, issues API keys, monitors 10+ Linux servers, and visualizes service dependencies via a topology graph. Strong in bridging legacy and modern stacks (Dockerized and non-Dockerized services like Apache/screen) using deep Linux/networking knowledge, plus practical real-time audio streaming for STT/TTS and experience mentoring others.”

Python C C++C#Java JavaScript+95

View profile

Shahzad Shairf

Screened

Senior Full-Stack & AI Developer specializing in Python/React, AWS, and LLM/RAG systems

Lahore, Pakistan9y exp

Devtor 360COMSATS University Islamabad

“Backend Python engineer who owned the full backend build of an AI-driven platform for UK golf clubs, including FastAPI microservices, vector search, and a tuned LangChain+Pinecone RAG pipeline focused on cost and hallucination reduction. Experienced deploying Django/FastAPI/Flask stacks on AWS-backed Kubernetes with GitOps/ArgoCD-style delivery, plus executing legacy-to-AWS migrations and building Kafka-based real-time analytics pipelines.”

Python Django Flask FastAPI React Next.js+101

View profile

Ruthu Rajendra

Screened

Junior Solutions Engineer specializing in full-stack automation and LLM prompt engineering

San Francisco, CA2y exp

SCU - Frugal Innovation HubSanta Clara University

“Built and productionized an LLM-powered customer support system using a RAG architecture with structured document ingestion, embedding retrieval, and prompt templates for product-specific grounding. Experienced diagnosing live agent/workflow failures (e.g., retrieval regressions after new docs) by refactoring ingestion/chunking and adding grounding constraints plus evaluation benchmarks. Also supports go-to-market by joining discovery calls, shaping MVP workflows into demos/prototypes, and creating post-launch documentation to drive adoption.”

Python Java JavaScript TypeScript C++HTML+86

View profile

Vignesh Chowdary Pamulapati

Screened

Junior Full-Stack Software Engineer specializing in Node.js, React, and REST APIs

Memphis, TN2y exp

Northern Arizona UniversityNorthern Arizona University

“Full-stack engineer who shipped and owned a production Document Chat feature built with Next.js App Router/TypeScript and a Node/Express RAG backend, including JWT-secured route handlers and streaming responses. Demonstrated strong post-launch ownership by improving latency (~30%) via MongoDB indexing/query optimization and reducing AI costs through caching, backed by profiling with React Profiler and Chrome DevTools.”

Python Data Structures Algorithms Java C++C+86

View profile

Keerti Chaudhary

Screened

Intern Full-Stack Engineer specializing in AI-powered products

San Jose, CA0y exp

EvovanceSanta Clara University

“Software engineer (internship experience) who built and owned an AWS serverless multi-user “challenge” feature end-to-end (UI + REST APIs + DynamoDB + deployment), delivering measurable gains in latency (-30%), debugging time (-50%), and join drop-offs (~-30%). Also productionized a multilingual RAG-based QA system with vector retrieval and guardrails, improving accuracy to ~85% and driving ~20% DAU growth.”

API Design API Gateway Artificial Intelligence AWS AWS Lambda Backend Development+83

View profile

Software Engineers Machine Learning Engineers Data Scientists Software Developers AI Engineers Research Assistants Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?