Reval Logo

Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

MK

Mid-level Full-Stack Engineer specializing in cloud-native microservices and AI automation

USA5y exp
Fuel AICalifornia State University

Software engineer/product owner who has led end-to-end delivery of AI and content-management platforms, including building RAG-based reliability improvements and migrating fragile systems to containerized AWS ECS/Kubernetes with Terraform-managed CI/CD. Experienced designing event-driven microservices (SQS/SNS/RabbitMQ), scaling queue consumers with autoscaling, and creating internal Python tooling to standardize data connectors (e.g., BigQuery/Airtable/internal APIs) to speed iteration.

View profile
SP

SASI PAILA

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

PA, USA4y exp
BNY MellonFranklin University

Built and deployed a production SecureAIChatBot (RAG-based) for secure internal information retrieval, using embeddings/vector search, GPT models, monitoring, and safety filters. Focused on real-world production challenges like latency and output consistency, applying caching, retrieval scoping, smaller models, and controlled prompting, and used LangChain to orchestrate the end-to-end workflow.

View profile
OT

Mid-Level Full-Stack Software Engineer specializing in web apps, data pipelines, and ML

San Francisco, CA4y exp
University of FloridaUniversity of Florida

Software engineer who owned an Order Management System end-to-end at Reliance Jio, improving large-table performance via UI virtualization shipped behind feature flags and refined through direct ops-user observation. Also built an OCR automation tool at Piramal Realty using Python/Tesseract with validation and manual correction fallbacks, driving adoption by operations teams. Experienced integrating with Kafka-based microservices and improving observability using structured logging and correlation IDs.

View profile
SS

Somil Shah

Screened

Mid-level AI/ML Engineer specializing in generative AI, RAG platforms, and LLM agents

San Francisco, CA4y exp
INTERACT Animal LabNortheastern University

AI/LLM engineer who has shipped 10+ production applications, including InvestIQ on GCP—a production-grade RAG due-diligence engine that ethically scrapes web/PDF sources, builds a ChromaDB knowledge base, and delivers analyst-style dashboards plus a citation-backed chat copilot. Deep focus on reliability (evidence-only answers, hard citations, refusal gating), retrieval tuning, and orchestration (Airflow/Cloud Composer), plus multi-agent systems (CrewAI with 7 specialized finance agents).

View profile
SA

Saniya Athani

Screened

Junior Software Engineer specializing in cloud-native DevOps and GenAI

San Francisco Bay Area, CA2y exp
IpserLabNortheastern University

Cloud-focused engineer with hands-on experience deploying production cloud-native REST APIs on AWS using Pulumi IaC, containerization, and CI/CD, with strong emphasis on secure credential management and operational monitoring via CloudWatch. Also has IoT troubleshooting experience across edge hardware constraints and networking (TLS handshake failures), plus Python-based configurable data-processing tools and customer-facing requirements translation.

View profile
TT

Mid-level AI/ML Engineer specializing in MLOps and LLM applications

New York, NY4y exp
BNY MellonUniversity at Albany

BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.

View profile
OR

Owen Reilly

Screened

Intern Full-Stack Software Engineer specializing in web apps, distributed systems, and AI tooling

Boston, MA3y exp
CiveraBoston University

Software engineer with experience spanning high-scale backend systems and distributed consensus: led a 6-person team delivering a production data querying/visualization platform with major latency improvements via cursor-based pagination and streamed results. Built a RAFT-based distributed logging tool resilient to partitions and storage constraints, and at Nasuni developed FastAPI services processing multi-terabyte workloads for 500+ enterprise customers with secure API key management.

View profile
KG

Senior AI Engineer specializing in Agentic AI and distributed systems

Charlotte, NC4y exp
UnitedHealth GroupUniversity of North Carolina at Charlotte

LLM/agentic workflow engineer with healthcare domain experience who built a HIPAA-compliant multi-agent RAG system for clinical review automation at UnitedHealth Group, achieving 92% precision and cutting latency 40% through async orchestration and Redis semantic caching. Also has strong data engineering orchestration background (Airflow on AWS EMR with Great Expectations) and a proven clinician-in-the-loop feedback process that improved model faithfulness by 18%.

View profile
SV

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS3y exp
HCA HealthcareVNR Vignana Jyothi Institute of Engineering and Technology

Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.

View profile
AV

AI & Full-Stack Software Engineer specializing in LLM-powered applications

Atlanta, GA4y exp
PRGXArizona State University

Full-stack engineer focused on productionizing LLM applications, including an Android privacy-policy risk summarization app (Kotlin/React Native + FastAPI + Ollama) that cut response times from ~10s to ~5–6s via batching, caching, async, and event-driven architecture. Currently at PRGX building an LLM-based legal contract clause extraction system, partnering closely with legal/procurement SMEs to create schemas, labeled datasets, and evaluation pipelines that improved accuracy from 70% to 85%. Also has experience architecting real-time voice/LLM systems with streaming microservices (Kafka, Kubernetes, gRPC/WebSockets) and an avatar chatbot pipeline (TalkingHead, Google TTS, AnythingLLM).

View profile
NK

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG systems, and real-time risk modeling

PA, USA4y exp
Capital OneRobert Morris University

AI/ML Engineer with 4+ years of experience (Capital One, Odin Technologies) and a master’s in Data Analytics (4.0 GPA) who has deployed LLM/RAG systems to production for compliance/risk and document review. Strong in orchestration and MLOps (Airflow, Kubernetes, MLflow, GitHub Actions) and in tackling real-world LLM constraints like latency, context limits, and data privacy, with measurable impact (20%+ manual review reduction; 33% faster release cycles).

View profile
AG

Archit Gangal

Screened

Senior Full-Stack Developer specializing in cloud-native microservices and AI/ML analytics

7y exp
AllstateColorado State University

Full-stack/backend engineer with deep insurance claims domain experience who built and operated a microservices + ETL platform (Java/Spring Boot + Python + Kafka/Databricks) processing 1M+ daily transactions. Combines production-grade reliability (99.7% uptime, zero-downtime blue/green releases, strong observability) with customer-facing UI delivery (AngularJS/React+TS dashboards and a hackathon-winning research chatbot).

View profile
BM

Brian Mar

Screened

Senior Data Engineer specializing in data infrastructure and marketing/CRM analytics

San Mateo, CA8y exp
Full Circle InsightsUC Davis

Salesforce-focused implementation/solutions engineer from Full Circle Insights who owned end-to-end campaign attribution and reporting deployments for multiple customers at once (3–5 concurrently), including sandbox testing, KPI monitoring, and rollback-safe migrations from legacy reporting. Also builds personal multi-agent workflows and uses Claude Code to rapidly scaffold data/analytics scripts like an advertising optimization parser over CSV/XLSX inputs.

View profile
CT

Mid-level AI/ML Engineer specializing in LLM systems, RAG, and MLOps

5y exp
HCA HealthcareUniversity of South Florida

Built a production, real-time clinical documentation system at HCA that converts doctor–patient conversations into structured clinical summaries using speech-to-text, LLM summarization, and RAG. Demonstrated measurable gains from medical-domain fine-tuning (clinical concept recall +18%, ROUGE-L 0.62 to 0.74) while meeting HIPAA constraints via PHI anonymization and encryption, and deployed via Docker/FastAPI with CI/CD and monitoring.

View profile
RA

Rayyan Alam

Screened

Junior Robotics & Machine Learning Engineer specializing in autonomy and RAG systems

Arlington, VA1y exp
Manitou Research Inc.University of Virginia

New-grad robotics software engineer with hands-on ROS 2 autonomy experience (Nav2, SLAM Toolbox, AMCL) and a strong track record debugging real-world instability (QoS, lifecycle timing, sensor dropouts). Built an HRI speech system on a Stretch 3 robot with deterministic, context-aware templates to manipulate trust/competence/emotion conditions, and integrated an LLM high-level planner that outputs PDDL for classical task planning and replanning.

View profile
MC

Senior Creative Technologist & Full-Stack UX Engineer specializing in Generative AI and XR

Los Altos, CA12y exp
Astrocade AISan José State University

Design engineer/product designer who built an end-to-end creator + review/moderation system for a UGC platform, spanning automated checks, human QA, final review, and creator feedback. Comfortable working directly with HTML/CSS/TypeScript and component systems, using prototyping and field observation to reduce reviewer hesitation, improve consistency, and prevent creator errors upstream.

View profile
TA

Junior Machine Learning Engineer specializing in Generative AI and analytics automation

Bengaluru, India2y exp
AccentureUniversity of Alabama at Birmingham

AI/LLM engineer who built a production intelligent support system using RAG over a vectorized documentation library, addressing real-world issues like lost-in-the-middle context failures and doc freshness via automated GitHub-driven re-embedding pipelines. Emphasizes rigorous agent evaluation (component/E2E/ops) and prefers lightweight, decoupled workflow automation using message brokers (Redis/RabbitMQ) over heavyweight orchestration frameworks.

View profile
AT

Intern Data Scientist specializing in ML engineering and LLM agentic workflows

San Francisco, CA6y exp
ContentstackSan José State University

Built an agentic, multi-step LLM system that generates full-stack code for API integrations using LangChain orchestration, Pinecone/SentenceBERT RAG, and a human-in-the-loop feedback loop for iterative code refinement. Also collaborated with non-technical content writers and PMs during a Contentstack internship to deliver a Slack-based AI workflow that generates and brand-checks articles with one-click approvals.

View profile
MY

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

6y exp
Elevance HealthMLR Institute of Technology

Built a production multi-agent orchestration platform to automate healthcare claims and HR workflows, combining LangChain/CrewAI/AutoGPT with RAG (FAISS/Pinecone) and fine-tuned open-source LLMs (LLaMA/Mistral/Falcon) in private Azure ML environments to meet HIPAA requirements. Emphasizes rigorous agent evaluation/observability (trajectory eval, adversarial testing, LLM-as-judge, drift monitoring) and reports measurable outcomes including 35% faster claims processing and 40% fewer chatbot errors.

View profile
MB

Mid-level AI Researcher specializing in multimodal LLMs and human-centered AI

Pittsburgh, PA7y exp
University of PittsburghUniversity of Pittsburgh

Has production deployment experience delivering computer-vision systems on AWS (Docker + S3) including a GDPR-focused face/license-plate obfuscation pipeline and a semantic-segmentation project aimed at reducing annotation time. Worked closely with DevOps and frontend teams and partnered with CEO/CMO to present an AI-driven annotation workflow to non-technical VC stakeholders.

View profile
VS

Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps

Tampa, FL9y exp
VerizonJawaharlal Nehru Technological University

Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.

View profile
IU

Mid-Level Full-Stack Software Engineer specializing in AI/ML and cloud-native systems

Los Angeles, CA3y exp
DevolvedAIUSC

At BondiTech, built and deployed customer-facing backend improvements for enterprise dashboards handling 1M+ records, redesigning a .NET/Entity Framework API with server-side pagination/filtering and feature-flagged rollout to cut latency from ~15s to ~2s. Experienced integrating customer systems into existing APIs, including stabilizing a legacy CRM sync by normalizing inconsistent IDs, handling strict rate limits with batching, and adding DLQs plus reconciliation reporting.

View profile
MP

Meghana P

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and NLP

Illinois, USA5y exp
State FarmSaint Louis University

AI/ML engineer with forensic analytics and healthcare claims experience (Optum), building production LLM/RAG systems to surface context-driven fraud patterns from unstructured claim notes and explain risk to investigators. Strong in large-scale retrieval performance tuning, legacy API integration with reliability patterns (SQS, circuit breakers), and MLOps orchestration on Airflow/Kubernetes with rigorous testing, monitoring, and stakeholder-friendly interpretability.

View profile
HS

Harsha Sikha

Screened

Mid-level AI/ML Engineer specializing in Generative AI and data engineering

Armonk, New York4y exp
IBMSaint Peter's University

IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.

View profile

Need someone specific?

AI Search