Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation (RAG)Python Docker AWS SQL CI/CD

Sri vardhini

Screened

Junior Software Engineer specializing in AI/LLM full-stack systems

Houston, TX2y exp

University of HoustonUniversity of Houston

“AI/full-stack engineer who has built zero-to-one internal products around LLMs, RAG, and NLP pipelines, including a conversational data interface and a production AI agent system. Stands out for combining frontend UX for non-technical users with backend/cloud architecture and measurable impact, including a reported 60% reduction in data retrieval time.”

Python JavaScript TypeScript SQL Java C+++124

View profile

Abhinava Sai Tirunagari

Screened

Junior Full-Stack Engineer specializing in AI, healthcare, and FinTech systems

Gainesville, FL2y exp

University of FloridaUniversity of Florida

“Frontend-leaning software engineer who built significant parts of an AI platform at Cognura Health, translating complex document-processing and extraction workflows into usable browser interfaces for business and operations teams. Stands out for combining React/TypeScript UI ownership with backend API collaboration, performance tuning, and thoughtful UX for asynchronous AI workflows.”

TypeScript JavaScript Next.js GraphQL Node.js Python+83

View profile

nathaniel briggs

Screened

Executive CTO / Software Architect specializing in GenAI, FinTech, and PropTech

Los Angeles, California17y exp

American ExpressUniversity of Advancing Technology

“Entrepreneur/fintech product builder who raised a $100K pre-seed from ex-Google/Microsoft execs and built a real-time, direct-to-vendor bill pay micropayments platform. Previously helped scale Norton LifeLock to 1M users (2003) and also created Karma LA, a fraud-resistant, verified donation system (including VA veteran verification) aimed at improving trust and conversion in giving.”

API Integration AWS AWS CloudFormation AWS Lambda CI/CD Computer Vision+136

View profile

Harsha KeladiGanapathi

Screened

Intern Data Scientist specializing in robotics localization and SLAM

Lexington, KY1y exp

InfineonUniversity of New Haven

“Robotics/embodied-AI practitioner who built a TurtleBot3 LiDAR-fingerprint localization pipeline end-to-end (autonomous data collection + multi-head NN) achieving ~30 cm error in a 10x10 m space. Also has industry experience at Infineon building large-scale production data/AI pipelines and rapidly fixing a deployed recommendation system by correcting upstream data normalization, improving accuracy by 20%+.”

Bash C C++Deep Learning Git Linux+143

View profile

Meghavardhan Ketireddi

Screened

Mid-level AI & Machine Learning Engineer specializing in Generative AI and MLOps

USA6y exp

Northern TrustUniversity of North Texas

“Built a production GPT-4/LangChain/Pinecone RAG “AI Copilot” at Northern Trust to automate financial report generation and analyst Q&A over internal structured (SQL warehouse) and unstructured policy data. Focused on real-world production challenges—grounding and latency—achieving major speed gains (seconds to milliseconds) via MiniLM embedding optimization and Redis caching, and implemented rigorous testing/evaluation with MLflow-backed metrics while aligning compliance and finance stakeholders for deployment.”

Python SQL Bash Java TypeScript PyTorch+127

View profile

Youssef Briki

Screened

Intern AI Researcher specializing in NLP, LLMs, and knowledge graphs

Montreal, QC1y exp

Acceleration ConsortiumUniversity of Montreal

“Built and shipped “LabMate,” a production AI assistant specialized in laboratory hardware, using a weighted multi-source RAG pipeline with reranking and reasoning-focused query decomposition to handle complex user questions. Deployed on a local GPU cluster with vLLM and NVIDIA MPS (plus OCR/VLM components), and established evaluation using synthetic + public reasoning datasets while collaborating weekly with non-technical admins to align requirements and resource constraints.”

API Development Authentication BERT C C++Data Analysis+94

View profile

Sai Charan C

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS

CT, USA3y exp

HCLTechUniversity of New Haven

“Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.”

A/B Testing Agile Amazon CloudWatch Amazon DynamoDB Amazon S3 Apache Airflow+139

View profile

Sharanya Guduri

Screened

Mid-level Full-Stack Python Developer specializing in Healthcare IT

NJ, USA5y exp

Johnson & JohnsonUniversity of Dayton

“Backend/AI engineer with Johnson & Johnson experience building data-heavy payer/claims analytics services (Python/FastAPI, PostgreSQL, AWS) and optimizing them under peak ingestion load via indexing/query tuning and caching. Also shipped an end-to-end RAG feature for clinicians to extract insights from unstructured clinical notes, using constrained prompts and retrieval-confidence guardrails to prevent hallucinations.”

Python JavaScript TypeScript SQL Django FastAPI+110

View profile

Akash Shanmuganathan

Screened

Mid-level GenAI & Data Engineer specializing in agentic AI systems and AWS Bedrock

Fort Mill, SC4y exp

OneData Software SolutionsNortheastern University

“At onedata, built and deployed an LLM-powered, multi-agent analytics platform on AWS Bedrock that lets users create Amazon QuickSight dashboards through natural-language conversation, cutting dashboard build time from ~30 minutes to ~5 minutes. Strong in production concerns (observability, token/cost tracking, model tradeoffs) and in bridging business + technical work, owning pre-sales pitching through delivery with an engineering management background focused on AI product management.”

Agentic AI Amazon Bedrock Amazon Redshift Amazon RDS Amazon S3 Amazon SNS+95

View profile

Ninad Walanj

Screened

Intern Software Engineer specializing in full-stack and LLM/RAG systems

Seattle, USA1y exp

Capria VenturesSyracuse University

“Full-stack engineer who built "Workstream AI," an AI-powered engineering visibility product that converts GitHub activity into real-time insights using an event-driven microservices stack (RabbitMQ/Postgres/Express) and GPT-4 with a React frontend. Previously a Founding SWE at a health & wellness startup, building data-driven user management tooling, and also delivered a real-time shuttle tracking/ride request system using Java Spring Boot/Hibernate + React; comfortable owning production deployment details (AWS EC2, DNS, SSL).”

Agile Angular AWS CI/CD Caching C+76

View profile

HemaSri Perumalla

Screened

Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics

Reston, VA4y exp

TruistUniversity of Central Missouri

“ML/AI engineer with production experience in high-scale banking fraud detection at Truist, building an end-to-end pipeline (Airflow/AWS Glue/Snowflake, PyTorch/sklearn) with automated retraining and Kubernetes-based deployment; delivered measurable gains (22% fewer false positives, 15% higher recall) and reduced manual ops ~40%. Also partnered with clinicians at Kellton to deploy an LLM system for summarizing/classifying clinical notes, improving review time and decision speed.”

A/B Testing Agile Apache Kafka Apache Spark AWS Glue AWS Lambda+108

View profile

Shruti Rawat

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services

Jersey City, NJ4y exp

State StreetPace University

“Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.”

A/B Testing API Integration Apache Airflow AWS AWS Glue AWS Lambda+112

View profile

Sandesh Shridhar

Screened

Senior Full-Stack AI Engineer specializing in LLM and RAG applications

Chicago, IL7y exp

FreelanceIllinois Institute of Technology

“Consulting-style LLM practitioner who builds enterprise knowledge assistants using RAG and takes them from prototype to production with guardrails, evaluation, and full-stack observability. Experienced partnering with IT and customer-facing teams to demo solutions, build tailored prototypes, and drive adoption through API-based integration.”

Python TypeScript Java SQL Retrieval Augmented Generation (RAG)Vector Databases+71

View profile

pradyumna ravuri

Screened

Senior Full-Stack Software Engineer specializing in IIoT, Edge AI, and real-time analytics

Los Angeles, CA9y exp

Career Soft SolutionsCal State East Bay

“Full-stack engineer who built an end-to-end low-code/no-code IDE for creating AI/ML workflows for industrial IoT sensors using Next.js/TypeScript and NestJS microservices. Focused on scaling high-volume sensor dashboards—improved UX and performance via WebSockets, debouncing, pagination, and API payload reduction—validated with profiling tools and user feedback in a startup environment.”

AI Agents Agile Angular AngularJS Anomaly Detection Apache Kafka+158

View profile

Uchechukwu Okechukwu

Screened

Mid-Level Software Engineer specializing in backend, distributed systems, and AI/LLM platforms

Prairie View, TX4y exp

Prairie View A&M UniversityPrairie View A&M University

“Built and shipped AI-powered workflow automation at Oracle, including an MCP-based agentic workflow with tool-calling and guardrails, plus Grafana monitoring and Confluence documentation. Also led a Django monolith-to-microservices migration at Chamsmobile using blue-green deployment and load balancer traffic splitting to avoid regressions while modernizing production systems.”

AI Agents Algorithms Apache Kafka Artificial Intelligence AWS AWS Lambda+105

View profile

Deepanjay Nandal

Screened

Software Engineering Intern specializing in real-time analytics and distributed systems

California, USA2y exp

Discover Excellence LLCArizona State University

“Built a production AI legal search platform that uses a retrieval-first, source-grounded LLM pipeline with confidence-based fallbacks and structured, traceable outputs to reduce hallucinations and improve trust. Also has experience at Discover Excellence building real-time analytics and identity stitching systems, emphasizing conservative data validation, idempotent processing, and fault-tolerant queue-based workflows.”

Android AWS AWS Lambda Claude C++Data Modeling+106

View profile

Cameron Shapoorian

Screened

Mid-level Test Automation & AI Integration Engineer

3y exp

Bland AIUniversity of Colorado Boulder

“Forward-deployed/solutions-oriented engineer with experience shipping enterprise LLM voice-agent workflows from prototype to production, including variable extraction and API integrations. Demonstrated strong real-time troubleshooting via logs/RCA (e.g., fixing multilingual language-switching by tuning temperature and improving context), and has led technical workshops while partnering with sales/solutions teams to drive customer adoption.”

Agile API integration C Cross-functional collaboration HTML Jira+68

View profile

Ramya Konda

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and generative AI

Remote, USA5y exp

HumanaUniversity of New Haven

“AI/LLM engineer at Humana who built and deployed a HIPAA-aware RAG system for clinical record retrieval, cutting search time dramatically and improving retrieval efficiency by 30%. Experienced with Spark-scale data preprocessing, QLoRA fine-tuning, LangChain orchestration, and MLflow+SageMaker integration, with a strong testing/evaluation discipline (A/B tests, human eval) to hit 95%+ accuracy and production latency targets.”

Python R SQL PostgreSQL BigQuery Snowflake+108

View profile

Bhavya Sri Gunnapaneni

Screened

Mid-level AI/ML Engineer specializing in fraud detection and NLP

United States4y exp

AIGLewis University

“Built production AI/RAG-style systems for message Q&A and insurance claims workflows, combining data ingestion, indexing/retrieval, and LLM integration with fallback modes. Has hands-on orchestration experience (Airflow, Prefect, LangChain) and cites large operational gains (claims processing reduced to ~45 seconds; manual review -50%; false alerts -30%) through automated, monitored pipelines and close collaboration with non-technical stakeholders.”

Python SQL R Java TensorFlow PyTorch+125

View profile

Swati Swati

Screened

Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps

Florida, United States5y exp

Voltihost LLCStony Brook University

“AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.”

Python R C++Java Shell Scripting Bash+172

View profile

Satwika Boppudi

Screened

Mid-level Site Reliability Engineer specializing in AWS cloud and AI-driven backend systems

Houston, TX7y exp

CignaUniversity of North Texas

“Backend/AI engineer in healthcare/insurance (mentions Cigna) who has shipped production systems spanning high-reliability APIs, async job architectures (Celery), and LLM/RAG features. Built an LLM document assistant with Terraform-managed AWS infra, semantic search retrieval, and strict permissioning/audit logs, and designed an automated prior-authorization workflow with human-in-the-loop escalation and compliance-driven thresholds.”

Python C++Java SQL Linux Unix+64

View profile

Raj Patel

Screened

Junior Machine Learning Engineer specializing in LLMs and RAG systems

Remote, USA1y exp

EmotionallNYU Tandon School of Engineering

“Production-focused applied ML/LLM engineer who has deployed an LLM-powered RAG assistant and improved reliability through rigorous retrieval evaluation (recall/MRR), reranking, and guardrails that prevent confident wrong answers. Experienced running containerized ML/LLM services on Kubernetes (including AWS-managed layers) with CI/CD and observability, and has delivered a real-time predictive maintenance system using streaming sensor data and time-series anomaly detection in close partnership with maintenance teams.”

Python Java TensorFlow PyTorch Scikit-Learn Large Language Models (LLMs)+86

View profile

Sai Krishna Mallikanti

Screened

Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP

TN4y exp

CignaUniversity of Memphis

“Built a production LLM/RAG solution for healthcare operations teams to query large policy and care-guideline repositories in natural language. Improved domain alignment using vector retrieval plus parameter-efficient fine-tuning and prompt optimization, validated through internal user testing and metrics, cutting manual lookup time by ~40%. Also has hands-on experience orchestrating automated ML pipelines with Apache Airflow.”

A/B Testing Anomaly Detection Data Validation Deep Learning Feature Engineering Generative AI+77

View profile

Nikhil Chagi

Screened

Intern Data Analyst specializing in data pipelines and LLM/RAG applications

San Francisco, CA1y exp

CignaUniversity of North Texas

“Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.”

Amazon EC2 Amazon RDS AWS AWS Lambda Analytics Anomaly Detection+55

View profile

Software Engineers Machine Learning Engineers Data Scientists Software Developers Research Assistants Full Stack Developers Engineering AI & Machine Learning Data & Analytics Executive & Leadership

Need someone specific?

AI Search

Related

Need someone specific?