Reval Logo
Home Browse Talent Skilled in Retrieval-Augmented Generation (RAG)

Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation (RAG)PythonDockerCI/CDAWSSQL
SK

Santhosh Kumar

Screened

Mid-level GenAI/ML Engineer specializing in LLM agents and RAG for Financial Services & Healthcare

5y exp
Bank of AmericaVirginia Commonwealth University

“Built and deployed a production GenAI internal support agent at Bank of America (“Ask GPS/AskGPT”) using RAG on Azure, focused on reducing escalations and improving response quality for repetitive knowledge-based queries. Demonstrates strong production LLM engineering: custom LangChain orchestration, retrieval tuning to reduce hallucinations, rigorous offline/online evaluation, and model benchmarking with dynamic routing (e.g., GPT-4 vs Claude).”

AWSAWS LambdaCI/CDClaudeDatabricksDecision Trees+97
View profile
RK

Ramu Kumar

Screened

Intern Machine Learning Engineer specializing in NLP, RAG, and deepfake detection

Guwahati, India1y exp
IIT GuwahatiIIT Guwahati

“Early-career (fresher) candidate who built and deployed a production AI medical document chatbot using a RAG architecture (LangChain + Hugging Face LLM + Pinecone) with a Flask backend on AWS EC2 via Docker. Has experience troubleshooting real deployment constraints (model dependencies, disk space, container stability) and setting up continuous-style evaluation with fixed query test sets tracking relevance, latency, and error rate.”

Data PreprocessingData Structures and AlgorithmsDeep LearningDockerEmbeddingsFirebase+73
View profile
SK

Sharath Kumar

Screened

Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps

Remote, USA5y exp
HPWilmington University

“AI/ML engineer with HP experience building and productionizing an LLM-powered document intelligence platform (LangChain + Pinecone) to deliver semantic search and contextual Q&A across millions of enterprise support documents. Demonstrates strong MLOps and scaling expertise (Airflow, Kubernetes autoscaling, Triton GPU inference, monitoring with Prometheus/W&B) plus a structured approach to evaluation (A/B tests, shadow deployments, failover) and effective collaboration with non-technical stakeholders.”

PythonSQLPostgreSQLBigQuerySnowflakeBash+142
View profile
DD

Dhyey Desai

Screened

Intern AI/ML Engineer specializing in RAG, multimodal AI, and LLM systems

Los Angeles, California0y exp
NalaUSC

“Built and shipped 'PetPulse,' a production AI pet-health note system that records voice notes, transcribes them, converts transcripts into structured symptom/event data, and supports grounded Q&A over a user’s notes and vet PDFs. Demonstrates full-stack LLM product execution (FastAPI + GPT-4 + Firebase), with concrete reliability/performance work (async endpoints, caching, RAG/embeddings, function calling) and user-centered iteration with a non-technical product stakeholder.”

Apache HadoopBERTCCachingData VisualizationDatabricks+87
View profile
DA

Divyam Agrawal

Screened

Mid-level Machine Learning Engineer specializing in LLMs and NLP classification systems

Seattle, WA4y exp
Affinity SolutionsUniversity of Washington

“Internship experience building a production RAG+LLM pipeline to map messy card transaction descriptions to merchant brands, including a custom modified-ROUGE evaluation approach for weak/variant ground truth. Improved scalability and cost by moving from a managed LLM endpoint (e.g., Bedrock) to self-hosted vLLM, and orchestrated massive embedding backfills (5,000+ files, 10B+ rows) using an Airflow-triggered SQS + ECS worker architecture with robust retry/DLQ handling.”

A/B TestingAPI DesignAWSAWS CloudFormationAWS LambdaAuto-scaling+110
View profile
BM

Bernie Miao

Screened

Junior Full-Stack Software Engineer specializing in EdTech and AI-powered learning tools

Berkeley, CA2y exp
CollegeNETUC Berkeley

“Edtech/education-focused engineer who took an accessibility-critical LLM/vision feature from concept to production: built an OpenCV-gated whiteboard capture pipeline feeding Gemini Vision for handwriting-to-LaTeX, improving math transcription 80% while cutting inference costs 60%. Also built RAG observability and retrieval fixes that stabilized inconsistent answers, and partnered directly with sales to reshape demos and open a new K-12 revenue pipeline aligned to California Digital Divide grant requirements.”

Apache TomcatAWSCCSSD3.jsDatabase schema design+82
View profile
SS

Siva Sai Kumar Mogalluru

Screened

Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare

Remote, USA4y exp
EYUniversity of South Florida

“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”

A/B TestingAgileAnomaly DetectionApache AirflowApache SparkAzure DevOps+138
View profile
PC

Prasanna Chelliboyina

Screened

Mid-level Machine Learning Engineer specializing in forecasting, NLP, and GenAI

United States6y exp
WalgreensSyracuse University

“GenAI/ML engineer with production experience building multilingual LLM systems (English/Spanish) and RAG-based clinical documentation summarization at Walgreens, combining prompt engineering, structured output validation, and rigorous evaluation (ROUGE + pharmacist review). Also orchestrated end-to-end ML pipelines for demand forecasting using Apache Airflow, PySpark, and MLflow with scheduled retraining and production monitoring.”

A/B TestingAgileAnomaly DetectionApache SparkAWSAzure Machine Learning+114
View profile
HC

Hari Chandana Kasula

Screened

Entry Machine Learning Engineer specializing in NLP, computer vision, and recommender systems

New York, NY0y exp
Columbia UniversityColumbia University

“Built and shipped an end-to-end podcast recommendation system exposed via a Flask API and React UI, explicitly balancing relevance, diversity (MMR), and safety constraints while meeting ~200ms latency targets. Also implemented a production-style RAG/information-extraction pipeline using web retrieval, spaCy NER, and fine-tuned SpanBERT with guardrails and evaluation loops (precision/recall/F1) to tune confidence thresholds and improve reliability.”

JavaPythonJavaScriptSQLPyTorchTensorFlow+80
View profile
KA

Kirk Anderson

Screened

Executive Product & Engineering Leader specializing in AI, SaaS data platforms, and sensor systems

Boston, MA19y exp
GRAXMichigan Technological University

“Early-stage founder building an engineering alpha product and planning a structured path to pilot and general availability. Active mentor in TechStars and MassChallenge with a strong VC network, emphasizing PMF, MVP-in-market feedback, and early sales while maintaining a sustainable approach to entrepreneurship.”

A/B testingAgileAmazon S3AnalyticsAnomaly detectionAsana+106
View profile
RC

Rui Cheng

Screened

Mid-level Software Engineer specializing in autonomous driving simulation and 3D mapping

5y exp
SimForge AIHuazhong University of Science and Technology

“Founding software engineer who built an autonomous-vehicle 3D digital twin using Unreal Engine 5 and CARLA, owning core simulator logic (traffic/scenarios/weather) and a ROS 2-based pipeline to record synchronized multi-sensor data (RGB/depth/segmentation/LiDAR/IMU/GPS). Also implemented distributed synchronization patterns (server + client prediction) using FastAPI and WebSockets; seeking roles with H1B transfer and targeting ~$110k.”

BlenderComputer VisionC#Data EngineeringDeep LearningFAISS+100
View profile
DP

Daniel Panrong Dai

Screened

Junior Software Engineer specializing in AI and full-stack development

New York, NY1y exp
Saige ConsultingMcGill University

“Consulting-background AI practitioner who led a production LLM pipeline on Snowflake Cortex to map hundreds of thousands of messy OCR/form-based contract fields into standardized Salesforce fields, including confidence scoring and an LLM-driven feedback loop. Strong focus on real-world constraints—token limits, cost control, and evaluation without ground truth—paired with frequent stakeholder-facing progress reporting.”

AlgorithmsAngularJSAWSAzure DevOpsC#CI/CD+74
View profile
AG

Ashitha Gowda

Screened

Mid-level Software Engineer specializing in GenAI and backend systems

Baltimore, MD4y exp
cnotes.inJohns Hopkins University

“Built and productionized an LLM-based PDF extraction pipeline for Medicaid policy documents by fine-tuning Gemini Flash 2.0 and deploying via Vertex AI, adding validation/guardrails to improve trust and reliability. Also built and scaled a SaaS platform (cnotes) for cable operators and regularly partners with customers and sales teams through interactive demos, rapid iteration, and real-time workflow debugging.”

AWS CloudFormationAWS LambdaBackend DevelopmentChromaDBCloud ComputingComputer Vision+85
View profile
SD

Sai Dev

Screened

Mid-level AI/ML Engineer specializing in MLOps, computer vision, and NLP

Newark, CA4y exp
Lucid MotorsCleveland State University

“GenAI/ML engineer from Lucid Motors who built and productionized an LLM-powered RAG diagnostic assistant for manufacturing and maintenance teams, deployed on AWS with Docker/Kubernetes and MLflow. Demonstrates end-to-end ownership from retrieval/prompt design to scalability, monitoring, and workflow integration via APIs, plus production ML pipeline orchestration with Kubeflow (Spark/Kafka + TensorFlow) for predictive maintenance use cases.”

PythonC++RSQLScalaTensorFlow+121
View profile
AA

Akshitha Akula

Screened

Mid-Level Full-Stack Python Engineer specializing in cloud APIs and data/ML platforms

Bentonville, AR4y exp
WalmartUniversity of Central Missouri

“Backend engineer at Goldman Sachs who deployed internal LLM-powered utilities to summarize operational logs/tickets, with a strong emphasis on data sensitivity and reliability. Built deterministic workflows with template-based prompts, confidence checks, and rule-based fallbacks, and used monitoring plus failure-rate metrics to tune performance; also has hands-on Temporal orchestration experience for resilient async backend jobs.”

PythonJavaScriptTypeScriptSQLFastAPIFlask+116
View profile
NS

Nikhil Soni

Screened

Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation

New York, NY2y exp
Quant AI ResearchNYU

“Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.”

PythonSQLCC++JavaHTML+120
View profile
VM

Vasavi Mittapalli

Screened

Senior Data Scientist specializing in GenAI, LLMs and RAG

Dallas, TX5y exp
Texas InstrumentsTrine University

“Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.”

A/B TestingAgileAmazon DynamoDBAmazon EC2Amazon EMRAmazon Kinesis+195
View profile
JV

Jaswanth Vakkala

Screened

Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP

Iselin, NJ5y exp
Wells FargoSt. Francis College

“Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.”

A/B TestingAnomaly DetectionApache HadoopApache HiveApache SparkAWS+224
View profile
KA

Kedareswara Abhinav Batchu

Screened

Mid-level Full-Stack & GenAI Engineer specializing in RAG and LLM applications

Saint Louis, MO5y exp
WayfairSaint Louis University

“Software engineer working on an e-commerce platform, currently building a RAG-based recommendation system with a team new to the technology. Has delivered an end-to-end React/TypeScript website for a local car dealer and built an internal "encryption as a service" tool to secure sensitive data across repositories and through release/UAT, with experience debugging microservices integration issues.”

JavaPythonJavaScriptTypeScriptNode.jsFastAPI+101
View profile
YX

Yuan Xu

Screened

Junior Machine Learning Engineer specializing in multimodal AI and audio deepfakes detection

Berkeley, California3y exp
Scam AICarnegie Mellon University

“Internship experience building production-oriented AI systems, including a real-time voice scam/spoof detector (RawNet + AASIST) hardened for noisy audio via aggressive augmentation and Zoom-based noise simulation, evaluated with EER on clean and wild datasets. Also built an LLM-driven UI automation agent using Playwright for apps like Linear/Notion with modular tool design, unit tests, and replayable scripted scenarios, and has AWS Step Functions experience orchestrating Lambda/Cognito workflows.”

PythonCC++JavaLinuxSQL+78
View profile
UK

Uday Kumar gattu

Screened

Mid-level Generative AI Engineer specializing in LLM agents and RAG systems

4y exp
Capital OneLindsey Wilson College

“Built and deployed a production LLM/RAG knowledge assistant integrating internal docs, wikis, and ticket histories to reduce tribal-knowledge dependency and repetitive questions. Emphasizes reliability via grounding + a validation layer, and achieved major latency gains (>50%) through vector index optimization, caching, quantization, and selective re-validation. Comfortable orchestrating end-to-end LLM/data workflows with Airflow, Prefect, and Dagster, including monitoring and alerting.”

A/B TestingAmazon CloudWatchAmazon DynamoDBAmazon EKSAmazon RedshiftAmazon S3+129
View profile
SP

Siddharth Prothia

Screened

Junior Software Engineer specializing in full-stack and AI/LLM applications

Santa Cruz, CA1y exp
RoboGrade.ioUC Santa Cruz

“Founder/builder of an EdTech startup (robograde.io) who personally conducted on-site classroom discovery with teachers and rapidly iterated the product based on real-world feedback. Implemented a Canvas LMS integration and refined it through weeks of in-person testing, and handled a live production grading failure by quickly debugging and deploying a fix, then adding fault-tolerant/backup API design.”

AgileBatch ProcessingC++Data Structures and AlgorithmsDeep LearningFirebase+74
View profile
JC

Jen-Hung Chang

Screened

Mid-level Software Engineer specializing in cloud infrastructure and distributed systems

Hsinchu, Taiwan4y exp
TSMCDuke University

“Backend/platform engineer who built an AI RAG system on FastAPI/Postgres/AWS with 10+ microservices, vector search optimization (ANN + two-stage re-ranking), and GitOps-driven CI/CD that cut deploy time from hours to minutes. Also deployed Java identity services on Kubernetes at TSMC for 200K+ users using ArgoCD/Azure Pipelines, and built a reliable real-time IoT pipeline (MQTT/Node/MongoDB) with strong consistency controls.”

AWSAWS LambdaCI/CDCC++Data Structures+93
View profile
1...242526...97

Related

Software EngineersMachine Learning EngineersData ScientistsSoftware DevelopersAI EngineersResearch AssistantsEngineeringAI & Machine LearningData & AnalyticsEducation

Need someone specific?

AI Search