Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

SV

Sri vardhini

Screened

Junior Software Engineer specializing in AI/LLM full-stack systems

Houston, TX2y exp
University of HoustonUniversity of Houston

AI/full-stack engineer who has built zero-to-one internal products around LLMs, RAG, and NLP pipelines, including a conversational data interface and a production AI agent system. Stands out for combining frontend UX for non-technical users with backend/cloud architecture and measurable impact, including a reported 60% reduction in data retrieval time.

View profile
Abhinava Sai Tirunagari - Junior Full-Stack Engineer specializing in AI, healthcare, and FinTech systems in Gainesville, FL

Junior Full-Stack Engineer specializing in AI, healthcare, and FinTech systems

Gainesville, FL2y exp
University of FloridaUniversity of Florida

Frontend-leaning software engineer who built significant parts of an AI platform at Cognura Health, translating complex document-processing and extraction workflows into usable browser interfaces for business and operations teams. Stands out for combining React/TypeScript UI ownership with backend API collaboration, performance tuning, and thoughtful UX for asynchronous AI workflows.

View profile
NB

Executive CTO / Software Architect specializing in GenAI, FinTech, and PropTech

Los Angeles, California17y exp
American ExpressUniversity of Advancing Technology

Entrepreneur/fintech product builder who raised a $100K pre-seed from ex-Google/Microsoft execs and built a real-time, direct-to-vendor bill pay micropayments platform. Previously helped scale Norton LifeLock to 1M users (2003) and also created Karma LA, a fraud-resistant, verified donation system (including VA veteran verification) aimed at improving trust and conversion in giving.

View profile
HK

Intern Data Scientist specializing in robotics localization and SLAM

Lexington, KY1y exp
InfineonUniversity of New Haven

Robotics/embodied-AI practitioner who built a TurtleBot3 LiDAR-fingerprint localization pipeline end-to-end (autonomous data collection + multi-head NN) achieving ~30 cm error in a 10x10 m space. Also has industry experience at Infineon building large-scale production data/AI pipelines and rapidly fixing a deployed recommendation system by correcting upstream data normalization, improving accuracy by 20%+.

View profile
MK

Mid-level AI & Machine Learning Engineer specializing in Generative AI and MLOps

USA6y exp
Northern TrustUniversity of North Texas

Built a production GPT-4/LangChain/Pinecone RAG “AI Copilot” at Northern Trust to automate financial report generation and analyst Q&A over internal structured (SQL warehouse) and unstructured policy data. Focused on real-world production challenges—grounding and latency—achieving major speed gains (seconds to milliseconds) via MiniLM embedding optimization and Redis caching, and implemented rigorous testing/evaluation with MLflow-backed metrics while aligning compliance and finance stakeholders for deployment.

View profile
YB

Youssef Briki

Screened

Intern AI Researcher specializing in NLP, LLMs, and knowledge graphs

Montreal, QC1y exp
Acceleration ConsortiumUniversity of Montreal

Built and shipped “LabMate,” a production AI assistant specialized in laboratory hardware, using a weighted multi-source RAG pipeline with reranking and reasoning-focused query decomposition to handle complex user questions. Deployed on a local GPU cluster with vLLM and NVIDIA MPS (plus OCR/VLM components), and established evaluation using synthetic + public reasoning datasets while collaborating weekly with non-technical admins to align requirements and resource constraints.

View profile
SC

Sai Charan C

Screened

Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS

CT, USA3y exp
HCLTechUniversity of New Haven

Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.

View profile
SG

Mid-level Full-Stack Python Developer specializing in Healthcare IT

NJ, USA5y exp
Johnson & JohnsonUniversity of Dayton

Backend/AI engineer with Johnson & Johnson experience building data-heavy payer/claims analytics services (Python/FastAPI, PostgreSQL, AWS) and optimizing them under peak ingestion load via indexing/query tuning and caching. Also shipped an end-to-end RAG feature for clinicians to extract insights from unstructured clinical notes, using constrained prompts and retrieval-confidence guardrails to prevent hallucinations.

View profile
AS

Mid-level GenAI & Data Engineer specializing in agentic AI systems and AWS Bedrock

Fort Mill, SC4y exp
OneData Software SolutionsNortheastern University

At onedata, built and deployed an LLM-powered, multi-agent analytics platform on AWS Bedrock that lets users create Amazon QuickSight dashboards through natural-language conversation, cutting dashboard build time from ~30 minutes to ~5 minutes. Strong in production concerns (observability, token/cost tracking, model tradeoffs) and in bridging business + technical work, owning pre-sales pitching through delivery with an engineering management background focused on AI product management.

View profile
NW

Ninad Walanj

Screened

Intern Software Engineer specializing in full-stack and LLM/RAG systems

Seattle, USA1y exp
Capria VenturesSyracuse University

Full-stack engineer who built "Workstream AI," an AI-powered engineering visibility product that converts GitHub activity into real-time insights using an event-driven microservices stack (RabbitMQ/Postgres/Express) and GPT-4 with a React frontend. Previously a Founding SWE at a health & wellness startup, building data-driven user management tooling, and also delivered a real-time shuttle tracking/ride request system using Java Spring Boot/Hibernate + React; comfortable owning production deployment details (AWS EC2, DNS, SSL).

View profile
HP

Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics

Reston, VA4y exp
TruistUniversity of Central Missouri

ML/AI engineer with production experience in high-scale banking fraud detection at Truist, building an end-to-end pipeline (Airflow/AWS Glue/Snowflake, PyTorch/sklearn) with automated retraining and Kubernetes-based deployment; delivered measurable gains (22% fewer false positives, 15% higher recall) and reduced manual ops ~40%. Also partnered with clinicians at Kellton to deploy an LLM system for summarizing/classifying clinical notes, improving review time and decision speed.

View profile
SR

Shruti Rawat

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services

Jersey City, NJ4y exp
State StreetPace University

Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.

View profile
SS

Senior Full-Stack AI Engineer specializing in LLM and RAG applications

Chicago, IL7y exp
FreelanceIllinois Institute of Technology

Consulting-style LLM practitioner who builds enterprise knowledge assistants using RAG and takes them from prototype to production with guardrails, evaluation, and full-stack observability. Experienced partnering with IT and customer-facing teams to demo solutions, build tailored prototypes, and drive adoption through API-based integration.

View profile
PR

Senior Full-Stack Software Engineer specializing in IIoT, Edge AI, and real-time analytics

Los Angeles, CA9y exp
Career Soft SolutionsCal State East Bay

Full-stack engineer who built an end-to-end low-code/no-code IDE for creating AI/ML workflows for industrial IoT sensors using Next.js/TypeScript and NestJS microservices. Focused on scaling high-volume sensor dashboards—improved UX and performance via WebSockets, debouncing, pagination, and API payload reduction—validated with profiling tools and user feedback in a startup environment.

View profile
UO

Mid-Level Software Engineer specializing in backend, distributed systems, and AI/LLM platforms

Prairie View, TX4y exp
Prairie View A&M UniversityPrairie View A&M University

Built and shipped AI-powered workflow automation at Oracle, including an MCP-based agentic workflow with tool-calling and guardrails, plus Grafana monitoring and Confluence documentation. Also led a Django monolith-to-microservices migration at Chamsmobile using blue-green deployment and load balancer traffic splitting to avoid regressions while modernizing production systems.

View profile
DN

Software Engineering Intern specializing in real-time analytics and distributed systems

California, USA2y exp
Discover Excellence LLCArizona State University

Built a production AI legal search platform that uses a retrieval-first, source-grounded LLM pipeline with confidence-based fallbacks and structured, traceable outputs to reduce hallucinations and improve trust. Also has experience at Discover Excellence building real-time analytics and identity stitching systems, emphasizing conservative data validation, idempotent processing, and fault-tolerant queue-based workflows.

View profile
Cameron Shapoorian - Mid-level Test Automation & AI Integration Engineer

Mid-level Test Automation & AI Integration Engineer

3y exp
Bland AIUniversity of Colorado Boulder

Forward-deployed/solutions-oriented engineer with experience shipping enterprise LLM voice-agent workflows from prototype to production, including variable extraction and API integrations. Demonstrated strong real-time troubleshooting via logs/RCA (e.g., fixing multilingual language-switching by tuning temperature and improving context), and has led technical workshops while partnering with sales/solutions teams to drive customer adoption.

View profile
Ramya Konda - Mid-level AI/ML Engineer specializing in healthcare ML and generative AI in Remote, USA

Ramya Konda

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and generative AI

Remote, USA5y exp
HumanaUniversity of New Haven

AI/LLM engineer at Humana who built and deployed a HIPAA-aware RAG system for clinical record retrieval, cutting search time dramatically and improving retrieval efficiency by 30%. Experienced with Spark-scale data preprocessing, QLoRA fine-tuning, LangChain orchestration, and MLflow+SageMaker integration, with a strong testing/evaluation discipline (A/B tests, human eval) to hit 95%+ accuracy and production latency targets.

View profile
Bhavya Sri Gunnapaneni - Mid-level AI/ML Engineer specializing in fraud detection and NLP in United States

Mid-level AI/ML Engineer specializing in fraud detection and NLP

United States4y exp
AIGLewis University

Built production AI/RAG-style systems for message Q&A and insurance claims workflows, combining data ingestion, indexing/retrieval, and LLM integration with fallback modes. Has hands-on orchestration experience (Airflow, Prefect, LangChain) and cites large operational gains (claims processing reduced to ~45 seconds; manual review -50%; false alerts -30%) through automated, monitored pipelines and close collaboration with non-technical stakeholders.

View profile
Swati Swati - Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps in Florida, United States

Swati Swati

Screened

Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps

Florida, United States5y exp
Voltihost LLCStony Brook University

AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.

View profile
Satwika Boppudi - Mid-level Site Reliability Engineer specializing in AWS cloud and AI-driven backend systems in Houston, TX

Mid-level Site Reliability Engineer specializing in AWS cloud and AI-driven backend systems

Houston, TX7y exp
CignaUniversity of North Texas

Backend/AI engineer in healthcare/insurance (mentions Cigna) who has shipped production systems spanning high-reliability APIs, async job architectures (Celery), and LLM/RAG features. Built an LLM document assistant with Terraform-managed AWS infra, semantic search retrieval, and strict permissioning/audit logs, and designed an automated prior-authorization workflow with human-in-the-loop escalation and compliance-driven thresholds.

View profile
Raj Patel - Junior Machine Learning Engineer specializing in LLMs and RAG systems in Remote, USA

Raj Patel

Screened

Junior Machine Learning Engineer specializing in LLMs and RAG systems

Remote, USA1y exp
EmotionallNYU Tandon School of Engineering

Production-focused applied ML/LLM engineer who has deployed an LLM-powered RAG assistant and improved reliability through rigorous retrieval evaluation (recall/MRR), reranking, and guardrails that prevent confident wrong answers. Experienced running containerized ML/LLM services on Kubernetes (including AWS-managed layers) with CI/CD and observability, and has delivered a real-time predictive maintenance system using streaming sensor data and time-series anomaly detection in close partnership with maintenance teams.

View profile
Sai Krishna Mallikanti - Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP in TN

Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP

TN4y exp
CignaUniversity of Memphis

Built a production LLM/RAG solution for healthcare operations teams to query large policy and care-guideline repositories in natural language. Improved domain alignment using vector retrieval plus parameter-efficient fine-tuning and prompt optimization, validated through internal user testing and metrics, cutting manual lookup time by ~40%. Also has hands-on experience orchestrating automated ML pipelines with Apache Airflow.

View profile
Nikhil Chagi - Intern Data Analyst specializing in data pipelines and LLM/RAG applications in San Francisco, CA

Nikhil Chagi

Screened

Intern Data Analyst specializing in data pipelines and LLM/RAG applications

San Francisco, CA1y exp
CignaUniversity of North Texas

Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.

View profile

Need someone specific?

AI Search