Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Monisha Nettem - Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps in USA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA5y exp
M&T BankKennesaw State University

AI/ML engineer with banking domain experience (M&T Bank) who built a production credit-risk prediction and reporting platform combining ML models (XGBoost/TensorFlow) with a RAG pipeline (LangChain + GPT-4) over compliance documents. Delivered measurable impact (≈20% better risk detection/precision, 50% less manual reporting) and productionized workflows on Vertex AI/Kubeflow with CI/CD and monitoring; also implemented embedding-based semantic search using FAISS/Pinecone.

View profile
Bhavya Sri Gunnapaneni - Mid-level AI/ML Engineer specializing in fraud detection and NLP in United States

Mid-level AI/ML Engineer specializing in fraud detection and NLP

United States4y exp
AIGLewis University

Built production AI/RAG-style systems for message Q&A and insurance claims workflows, combining data ingestion, indexing/retrieval, and LLM integration with fallback modes. Has hands-on orchestration experience (Airflow, Prefect, LangChain) and cites large operational gains (claims processing reduced to ~45 seconds; manual review -50%; false alerts -30%) through automated, monitored pipelines and close collaboration with non-technical stakeholders.

View profile
Swati Swati - Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps in Florida, United States

Swati Swati

Screened

Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps

Florida, United States5y exp
Voltihost LLCStony Brook University

AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.

View profile
Atharva Deshmukh - Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps in Rochester, New York

Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps

Rochester, New York4y exp
CrowdDoingRochester Institute of Technology

Applied LLMs to high-stakes domains (wildfire risk for emergency teams and loan approval via a fine-tuned IBM Granite model), with a strong focus on reliability—using RAG-based cross-validation to reduce hallucinations and continuous ingestion pipelines (MODIS satellite imagery via AWS Lambda) to keep data current. Experienced in production orchestration and MLOps-style workflows using Airflow, AWS Step Functions, and SageMaker Pipelines, and collaborates closely with analysts on KPI-driven evaluation.

View profile
Alicia Geng - Entry-level AI/ML Engineer specializing in AWS MLOps and computer vision in Worcester, MA

Alicia Geng

Screened

Entry-level AI/ML Engineer specializing in AWS MLOps and computer vision

Worcester, MA0y exp
Applied Industrial MeasurementsNortheastern University

Built and shipped a production RAG question-answering system using LangChain/OpenAI, Docker, and FastAPI, then reduced hallucinations through disciplined retrieval tuning and constrained prompting. Also implemented a custom evaluation framework (QA-pair dataset) to measure faithfulness/relevance and deployed containerized ML microservices on AWS ECS/Fargate with ALB and rolling, zero-downtime updates.

View profile
Ranxin Li - Mid-Level AI/Full-Stack Engineer specializing in agentic LLM systems and RAG in San Jose, USA

Ranxin Li

Screened

Mid-Level AI/Full-Stack Engineer specializing in agentic LLM systems and RAG

San Jose, USA2y exp
RevoAgent SolutionUC Davis

Built and deployed Clyra.AI, an AI-driven daily scheduling product that uses a LangGraph-based multi-agent LLM pipeline (task extraction, verification, reflection) grounded with strict RAG over emails/documents/calendars and real-world signals like health metrics. Designed a custom agent orchestrator with bounded loops/termination conditions and a self-auditing verification/reflection layer to reduce hallucinations while controlling latency and cost via caching and model distillation.

View profile
Raj Patel - Junior Machine Learning Engineer specializing in LLMs and RAG systems in Remote, USA

Raj Patel

Screened

Junior Machine Learning Engineer specializing in LLMs and RAG systems

Remote, USA1y exp
EmotionallNYU Tandon School of Engineering

Production-focused applied ML/LLM engineer who has deployed an LLM-powered RAG assistant and improved reliability through rigorous retrieval evaluation (recall/MRR), reranking, and guardrails that prevent confident wrong answers. Experienced running containerized ML/LLM services on Kubernetes (including AWS-managed layers) with CI/CD and observability, and has delivered a real-time predictive maintenance system using streaming sensor data and time-series anomaly detection in close partnership with maintenance teams.

View profile
Sai Krishna Mallikanti - Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP in TN

Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP

TN4y exp
CignaUniversity of Memphis

Built a production LLM/RAG solution for healthcare operations teams to query large policy and care-guideline repositories in natural language. Improved domain alignment using vector retrieval plus parameter-efficient fine-tuning and prompt optimization, validated through internal user testing and metrics, cutting manual lookup time by ~40%. Also has hands-on experience orchestrating automated ML pipelines with Apache Airflow.

View profile
Nikhil Chagi - Intern Data Analyst specializing in data pipelines and LLM/RAG applications in San Francisco, CA

Nikhil Chagi

Screened

Intern Data Analyst specializing in data pipelines and LLM/RAG applications

San Francisco, CA1y exp
CignaUniversity of North Texas

Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.

View profile
Alankrit Srivastava - Intern Data Engineer specializing in Snowflake pipelines and AI/ML analytics in Houston, TX

Intern Data Engineer specializing in Snowflake pipelines and AI/ML analytics

Houston, TX3y exp
Verity Advisor LLCUniversity of North Texas

Built and operated an end-to-end TypeScript/Node AI agent platform for high-volume financial data that generates explainable investment signals and automates execution via resilient Playwright browser automation. Uses Postgres + pgvector/Prisma for RAG retrieval, Redis for async orchestration, Zod-based boundary validation as a circuit breaker, and OpenTelemetry for tracing/latency monitoring; also designed a TypeScript SDK with semver, scoped bearer-token auth, CLI key rotation, and interactive Swagger docs.

View profile
Ramya Sree Kanijam - Mid-level Software Engineer specializing in backend systems, cloud, and AI pipelines in Remote, USA

Mid-level Software Engineer specializing in backend systems, cloud, and AI pipelines

Remote, USA3y exp
NetomiTexas A&M University-Corpus Christi

Built and owned an end-to-end AI-driven content enrichment pipeline for a news workflow, using n8n, LLM agents, and external APIs to automate ingestion, deduplication, categorization, and approval routing. Stands out for production-minded AI systems work: they improved reliability with schema validation, retries, idempotency, and monitoring, while automating 90% of processing and cutting duplication errors by 95%+.

View profile
SR

Mid-level AI/ML Engineer specializing in RAG systems and Python cloud backends

USA4y exp
CignaSoutheast Missouri State University

Frontend engineer with hands-on experience building AI-powered document search and analytics products, including RAG-based knowledge retrieval interfaces with citations, filters, and document previews. Stands out for combining React/TypeScript architecture with production performance tuning using profiling tools, memoization, lazy loading, and debounced data flows to keep complex, document-heavy UIs responsive.

View profile
Dinesh Guguloth - Mid-level Software Engineer specializing in full-stack cloud-native applications in New York, NY

Mid-level Software Engineer specializing in full-stack cloud-native applications

New York, NY4y exp
AccentureCleveland State University

Full-stack engineer with cloud and GenAI experience who has owned production features end-to-end, including a reporting dashboard optimized from 14s to 5s using query/API refactoring and monitored via AWS CloudWatch. Also productionized an OpenAI-powered chatbot using LangChain with prompt design, guardrails, and evaluation via production logs and user feedback, and has led incremental legacy-to-microservices modernization with parallel run to avoid regressions.

View profile
AA

Junior Software Engineer specializing in AI/ML, data pipelines, and cloud APIs

San Jose, CA3y exp
TCSCalifornia State University, Chico

Hands-on AI/LLM practitioner who built a RAG-based customer support chatbot and tackled production issues like data chunking complexity and response-time lag. Uses techniques such as overlapping chunks, semantic search, context engineering, and query routing, and has experience presenting technical demos/workshops to developer audiences.

View profile
VV

Vaidik Vyas

Screened

Mid-Level AI Backend Engineer specializing in Python, LLM/RAG, and healthcare/insurance platforms

Franklin, NJ5y exp
MetLifeNJIT

AI Backend Engineer in MetLife’s claims technology group who built and deployed a production LLM-based decision support system that helps claim adjusters quickly find relevant policy rules from long PDFs and historical notes. Designed it as multiple production-grade services with retrieval-first guardrails, continuous validation, and Airflow-orchestrated pipelines for ingestion, embeddings, and vector index updates to keep the system reliable as policies and data evolve.

View profile
AD

Junior AI Engineer specializing in ML, LLM systems, and RAG

Bangalore, India2y exp
NxtGen Cloud TechnologiesUniversity at Buffalo

Built and deployed an LLM/applied-ML system enabling efficient extraction of useful information from large unstructured multimodal datasets, owning the full pipeline from ingestion to inference and APIs with a strong emphasis on production reliability, latency, and monitoring. Also delivered a voice-based AI workflow for Hindi policy document access for the Election Commission of India by translating non-technical usability needs into iterative demos and a successful implementation.

View profile
SC

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

Atlanta, GA4y exp
Universal Health ServicesUniversity of New Haven

Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.

View profile
KG

Mid-level Generative AI Engineer specializing in LLM agents and RAG

Chesterfield, MO4y exp
Reinsurance Group of AmericaUniversity of Central Missouri

GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.

View profile
YS

Yash Sanap

Screened

Junior Data Scientist specializing in ML, geospatial analytics, and LLM applications

Virginia Beach, VA2y exp
City of Virginia BeachGeorge Mason University

Built and deployed a production AI “term explainer” agent that adapts explanations to beginner/intermediate/expert users by combining multi-step LLM reasoning with grounded Wikipedia retrieval. Owns end-to-end agent orchestration (smolagents/Python), reliability patterns (fallback across LLM providers, retries, guardrails), and observability/metrics-driven evaluation; also partnered with a non-technical researcher to deliver a plain-language research assistant agent.

View profile
AZ

Mid-Level Software Engineer specializing in Generative AI and LLM applications

Johnston, Iowa4y exp
CortevaNortheastern University

Built and deployed a production RAG-based AI assistant for sales reps to unify access to product info, pricing, and internal documents across multiple systems. Implemented ETL pipelines for normalization/chunking/embeddings, integrated the assistant into internal React/TypeScript UIs with user-specific context, and enforced security with private vector storage and permission-filtered retrieval.

View profile
VK

Vaishnavi K

Screened

Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection

USA5y exp
TCSUniversity of New Haven

LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.

View profile
DK

Deepak K

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps for FinTech

Overland Park, KS4y exp
IntuitUniversity of Central Missouri

ML/LLM engineer with production experience building a compliant RAG-based virtual assistant at Intuit, optimizing embeddings and FAISS retrieval (including PCA) for low-latency, privacy-controlled search and deploying via AWS SageMaker containers. Also built scalable Airflow+MLflow pipelines using Docker and KubernetesExecutor, cutting training cycles by 37%, and partnered with civil engineers/project managers at Aegis Infra to deliver predictive maintenance for construction equipment.

View profile
RP

Rupesh Pathak

Screened

Junior Data Scientist and Robotics Perception Engineer specializing in GenAI and autonomous systems

Boston, MA2y exp
VERIDIX AINortheastern University

Robotics software architect who built an automated pick-and-place palletizing prototype at BLACK-I-ROBOTICS, spanning perception (multi-RealSense fusion, segmentation, 6D pose, ICP), GPU-accelerated motion planning (MoveIt 2 + NVIDIA CuRobo), grasp generation, and safety (human detection + safe mode). Also brings cloud/CI/CD depth from VERIDIX AI (AWS Cognito/Lambda/ECS and CodePipeline stack) and demonstrated strong debugging chops by reducing outdoor rover EKF drift to ~5 cm via Allan variance-based IMU tuning.

View profile
YP

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp
Humanitarians.AINortheastern University

AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).

View profile

Need someone specific?

AI Search