Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation Python Docker SQL AWS CI/CD

Monisha Nettem

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA5y exp

M&T BankKennesaw State University

“AI/ML engineer with banking domain experience (M&T Bank) who built a production credit-risk prediction and reporting platform combining ML models (XGBoost/TensorFlow) with a RAG pipeline (LangChain + GPT-4) over compliance documents. Delivered measurable impact (≈20% better risk detection/precision, 50% less manual reporting) and productionized workflows on Vertex AI/Kubeflow with CI/CD and monitoring; also implemented embedding-based semantic search using FAISS/Pinecone.”

Python R SQL Jupyter Notebook Machine Learning Predictive Analytics+112

View profile

Bhavya Sri Gunnapaneni

Screened

Mid-level AI/ML Engineer specializing in fraud detection and NLP

United States4y exp

AIGLewis University

“Built production AI/RAG-style systems for message Q&A and insurance claims workflows, combining data ingestion, indexing/retrieval, and LLM integration with fallback modes. Has hands-on orchestration experience (Airflow, Prefect, LangChain) and cites large operational gains (claims processing reduced to ~45 seconds; manual review -50%; false alerts -30%) through automated, monitored pipelines and close collaboration with non-technical stakeholders.”

Python SQL R Java TensorFlow PyTorch+125

View profile

Swati Swati

Screened

Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps

Florida, United States5y exp

Voltihost LLCStony Brook University

“AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.”

Python R C++Java Shell Scripting Bash+172

View profile

Atharva Deshmukh

Screened

Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps

Rochester, New York4y exp

CrowdDoingRochester Institute of Technology

“Applied LLMs to high-stakes domains (wildfire risk for emergency teams and loan approval via a fine-tuned IBM Granite model), with a strong focus on reliability—using RAG-based cross-validation to reduce hallucinations and continuous ingestion pipelines (MODIS satellite imagery via AWS Lambda) to keep data current. Experienced in production orchestration and MLOps-style workflows using Airflow, AWS Step Functions, and SageMaker Pipelines, and collaborates closely with analysts on KPI-driven evaluation.”

Python R SQL Bash Java JavaScript+90

View profile

Alicia Geng

Screened

Entry-level AI/ML Engineer specializing in AWS MLOps and computer vision

Worcester, MA0y exp

Applied Industrial MeasurementsNortheastern University

“Built and shipped a production RAG question-answering system using LangChain/OpenAI, Docker, and FastAPI, then reduced hallucinations through disciplined retrieval tuning and constrained prompting. Also implemented a custom evaluation framework (QA-pair dataset) to measure faithfulness/relevance and deployed containerized ML microservices on AWS ECS/Fargate with ALB and rolling, zero-downtime updates.”

A/B Testing AWS CI/CD Computer Vision Data Science Docker+82

View profile

Ranxin Li

Screened

Mid-Level AI/Full-Stack Engineer specializing in agentic LLM systems and RAG

San Jose, USA2y exp

RevoAgent SolutionUC Davis

“Built and deployed Clyra.AI, an AI-driven daily scheduling product that uses a LangGraph-based multi-agent LLM pipeline (task extraction, verification, reflection) grounded with strict RAG over emails/documents/calendars and real-world signals like health metrics. Designed a custom agent orchestrator with bounded loops/termination conditions and a self-auditing verification/reflection layer to reduce hallucinations while controlling latency and cost via caching and model distillation.”

C C++Kotlin Java Python JavaScript+119

View profile

Raj Patel

Screened

Junior Machine Learning Engineer specializing in LLMs and RAG systems

Remote, USA1y exp

EmotionallNYU Tandon School of Engineering

“Production-focused applied ML/LLM engineer who has deployed an LLM-powered RAG assistant and improved reliability through rigorous retrieval evaluation (recall/MRR), reranking, and guardrails that prevent confident wrong answers. Experienced running containerized ML/LLM services on Kubernetes (including AWS-managed layers) with CI/CD and observability, and has delivered a real-time predictive maintenance system using streaming sensor data and time-series anomaly detection in close partnership with maintenance teams.”

Python Java TensorFlow PyTorch Scikit-Learn Large Language Models (LLMs)+86

View profile

Sai Krishna Mallikanti

Screened

Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP

TN4y exp

CignaUniversity of Memphis

“Built a production LLM/RAG solution for healthcare operations teams to query large policy and care-guideline repositories in natural language. Improved domain alignment using vector retrieval plus parameter-efficient fine-tuning and prompt optimization, validated through internal user testing and metrics, cutting manual lookup time by ~40%. Also has hands-on experience orchestrating automated ML pipelines with Apache Airflow.”

A/B Testing Anomaly Detection Data Validation Deep Learning Feature Engineering Generative AI+77

View profile

Nikhil Chagi

Screened

Intern Data Analyst specializing in data pipelines and LLM/RAG applications

San Francisco, CA1y exp

CignaUniversity of North Texas

“Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.”

Amazon EC2 Amazon RDS AWS AWS Lambda Analytics Anomaly Detection+55

View profile

Alankrit Srivastava

Screened

Intern Data Engineer specializing in Snowflake pipelines and AI/ML analytics

Houston, TX3y exp

Verity Advisor LLCUniversity of North Texas

“Built and operated an end-to-end TypeScript/Node AI agent platform for high-volume financial data that generates explainable investment signals and automates execution via resilient Playwright browser automation. Uses Postgres + pgvector/Prisma for RAG retrieval, Redis for async orchestration, Zod-based boundary validation as a circuit breaker, and OpenTelemetry for tracing/latency monitoring; also designed a TypeScript SDK with semver, scoped bearer-token auth, CLI key rotation, and interactive Swagger docs.”

Python Java SQL JavaScript Microsoft Excel AWS+90

View profile

Ramya Sree Kanijam

Screened

Mid-level Software Engineer specializing in backend systems, cloud, and AI pipelines

Remote, USA3y exp

NetomiTexas A&M University-Corpus Christi

“Built and owned an end-to-end AI-driven content enrichment pipeline for a news workflow, using n8n, LLM agents, and external APIs to automate ingestion, deduplication, categorization, and approval routing. Stands out for production-minded AI systems work: they improved reliability with schema validation, retries, idempotency, and monitoring, while automating 90% of processing and cutting duplication errors by 95%+.”

Python SQL Bash FastAPI Celery Apache Airflow+231

View profile

Sai Revanth Evani

Screened

Mid-level AI/ML Engineer specializing in RAG systems and Python cloud backends

USA4y exp

CignaSoutheast Missouri State University

“Frontend engineer with hands-on experience building AI-powered document search and analytics products, including RAG-based knowledge retrieval interfaces with citations, filters, and document previews. Stands out for combining React/TypeScript architecture with production performance tuning using profiling tools, memoization, lazy loading, and debounced data flows to keep complex, document-heavy UIs responsive.”

Python pytest Django Flask FastAPI Celery+158

View profile

Dinesh Guguloth

Screened

Mid-level Software Engineer specializing in full-stack cloud-native applications

New York, NY4y exp

AccentureCleveland State University

“Full-stack engineer with cloud and GenAI experience who has owned production features end-to-end, including a reporting dashboard optimized from 14s to 5s using query/API refactoring and monitored via AWS CloudWatch. Also productionized an OpenAI-powered chatbot using LangChain with prompt design, guardrails, and evaluation via production logs and user feedback, and has led incremental legacy-to-microservices modernization with parallel run to avoid regressions.”

JavaScript TypeScript Python Java SQL C+++224

View profile

Aditya Anil Raut

Screened

Junior Software Engineer specializing in AI/ML, data pipelines, and cloud APIs

San Jose, CA3y exp

TCSCalifornia State University, Chico

“Hands-on AI/LLM practitioner who built a RAG-based customer support chatbot and tackled production issues like data chunking complexity and response-time lag. Uses techniques such as overlapping chunks, semantic search, context engineering, and query routing, and has experience presenting technical demos/workshops to developer audiences.”

AWS AWS Lambda Bootstrap C C++ChromaDB+106

View profile

Vaidik Vyas

Screened

Mid-Level AI Backend Engineer specializing in Python, LLM/RAG, and healthcare/insurance platforms

Franklin, NJ5y exp

MetLifeNJIT

“AI Backend Engineer in MetLife’s claims technology group who built and deployed a production LLM-based decision support system that helps claim adjusters quickly find relevant policy rules from long PDFs and historical notes. Designed it as multiple production-grade services with retrieval-first guardrails, continuous validation, and Airflow-orchestrated pipelines for ingestion, embeddings, and vector index updates to keep the system reliable as policies and data evolve.”

Amazon DynamoDB Amazon ECS Amazon RDS Amazon S3 AWS Lambda Bash+106

View profile

Aryaa Deshpande

Screened

Junior AI Engineer specializing in ML, LLM systems, and RAG

Bangalore, India2y exp

NxtGen Cloud TechnologiesUniversity at Buffalo

“Built and deployed an LLM/applied-ML system enabling efficient extraction of useful information from large unstructured multimodal datasets, owning the full pipeline from ingestion to inference and APIs with a strong emphasis on production reliability, latency, and monitoring. Also delivered a voice-based AI workflow for Hindi policy document access for the Election Commission of India by translating non-technical usability needs into iterative demos and a successful implementation.”

Python SQL HTML CSS JavaScript C+83

View profile

Sudeepti Chalamalasetti

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

Atlanta, GA4y exp

Universal Health ServicesUniversity of New Haven

“Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.”

A/B Testing Anomaly Detection Audit Logging AWS AWS Glue AWS Lambda+123

View profile

Karthik Gantasala

Screened

Mid-level Generative AI Engineer specializing in LLM agents and RAG

Chesterfield, MO4y exp

Reinsurance Group of AmericaUniversity of Central Missouri

“GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.”

A/B Testing Agile Amazon Bedrock Ansible Apache Airflow AWS+168

View profile

Yash Sanap

Screened

Junior Data Scientist specializing in ML, geospatial analytics, and LLM applications

Virginia Beach, VA2y exp

City of Virginia BeachGeorge Mason University

“Built and deployed a production AI “term explainer” agent that adapts explanations to beginner/intermediate/expert users by combining multi-step LLM reasoning with grounded Wikipedia retrieval. Owns end-to-end agent orchestration (smolagents/Python), reliability patterns (fallback across LLM providers, retries, guardrails), and observability/metrics-driven evaluation; also partnered with a non-technical researcher to deliver a plain-language research assistant agent.”

Python SQL Java Go Bash JavaScript+95

View profile

Abdelrahman Zeidan

Screened

Mid-Level Software Engineer specializing in Generative AI and LLM applications

Johnston, Iowa4y exp

CortevaNortheastern University

“Built and deployed a production RAG-based AI assistant for sales reps to unify access to product info, pricing, and internal documents across multiple systems. Implemented ETL pipelines for normalization/chunking/embeddings, integrated the assistant into internal React/TypeScript UIs with user-specific context, and enforced security with private vector storage and permission-filtered retrieval.”

Agile Amazon EC2 Amazon S3 Amazon SageMaker Angular API Development+73

View profile

Vaishnavi K

Screened

Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection

USA5y exp

TCSUniversity of New Haven

“LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.”

A/B Testing Amazon EC2 Amazon S3 Amazon SageMaker Apache Airflow Apache Hadoop+135

View profile

Deepak K

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps for FinTech

Overland Park, KS4y exp

IntuitUniversity of Central Missouri

“ML/LLM engineer with production experience building a compliant RAG-based virtual assistant at Intuit, optimizing embeddings and FAISS retrieval (including PCA) for low-latency, privacy-controlled search and deploying via AWS SageMaker containers. Also built scalable Airflow+MLflow pipelines using Docker and KubernetesExecutor, cutting training cycles by 37%, and partnered with civil engineers/project managers at Aegis Infra to deliver predictive maintenance for construction equipment.”

A/B Testing Amazon EC2 Amazon S3 BERT CI/CD Classification+93

View profile

Rupesh Pathak

Screened

Junior Data Scientist and Robotics Perception Engineer specializing in GenAI and autonomous systems

Boston, MA2y exp

VERIDIX AINortheastern University

“Robotics software architect who built an automated pick-and-place palletizing prototype at BLACK-I-ROBOTICS, spanning perception (multi-RealSense fusion, segmentation, 6D pose, ICP), GPU-accelerated motion planning (MoveIt 2 + NVIDIA CuRobo), grasp generation, and safety (human detection + safe mode). Also brings cloud/CI/CD depth from VERIDIX AI (AWS Cognito/Lambda/ECS and CodePipeline stack) and demonstrated strong debugging chops by reducing outdoor rover EKF drift to ~5 cm via Allan variance-based IMU tuning.”

Python C C++Multithreading SQL MATLAB+164

View profile

Yash Pankhania

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp

Humanitarians.AINortheastern University

“AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).”

Agile Amazon Bedrock Amazon DynamoDB Amazon RDS Amazon Redshift Amazon S3+127

View profile

Software Engineers Machine Learning Engineers Data Scientists AI Engineers Research Assistants Software Developers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?