Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation, Python, Docker, SQL, AWS, CI/CD

Varshith Dupati

Screened

Mid-level Software Engineer specializing in AWS, full-stack development, and AI data systems

Seattle, Washington | 3y exp

Amazon | Arizona State University

“Backend engineer who built a Python-based data profiling/statistics platform processing up to 50M rows and ~300 metrics, using a DAG execution model, multithreading, and smart caching to cut processing time by up to 70%. Also improved PostgreSQL query performance from 12s to 2s via indexing/query rewrites, integrated an LLM (LangChain + OpenAI) for explainable “chat with the pipeline” functionality, and designed an AWS EC2+SQS architecture for scalable, isolated per-user processing.”

Java, JUnit, Spring Boot, Python, C, C++ (+84 more)

Vyas Cholayil

Screened

Mid-Level Software Engineer specializing in Python automation, DevOps, and microservices

Raleigh, NC | 5y exp

Ansys | North Carolina State University

“Backend-focused engineer who built an internal wiki LLM chatbot end-to-end using FastAPI, Kubernetes, and ChromaDB vector search, including frontend integration. Also has strong DevOps/migration experience—automating large work-item and repo migrations (Jira/FogBugz/ADO on-prem to cloud) via Python scripts, JSON mappings, REST APIs, and validation test suites.”

Python, C#, JavaScript, PowerShell, Bash, SQL (+96 more)

Leela Tikkisetty

Screened

Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems

San Francisco, CA | 5y exp

City and County of San Francisco | San Francisco State University

“Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).”

A/B Testing, Agile, Amazon Bedrock, Amazon EKS, Amazon Redshift, Authentication (+198 more)

Sai Krishna Yemineni

Screened

Mid-level AI/ML Engineer specializing in healthcare NLP, real-time risk systems, and ML platforms

Massachusetts, USA | 5y exp

Johnson & Johnson | Rivier University

“LLM-focused customer-facing engineer who repeatedly takes document Q&A and agentic prototypes into secure, monitored production systems. Experienced in reducing hallucinations via RAG + guardrails, diagnosing retrieval/embedding issues in real time, and partnering with sales to run metrics-driven PoCs that overcome accuracy/security objections and drive adoption.”

Python, R, C++, SQL, Bash, TensorFlow (+107 more)

Chetana Reddy Yellareddy

Screened

Mid-Level Software Engineer specializing in distributed systems and cloud-native platforms

Austin, TX | 5y exp

AMD | Northeastern University

“Backend/AI engineer who built and scaled an internal AMD semiconductor manufacturing microservice platform (SMR), reworking a synchronous lot-request workflow into an event-driven RabbitMQ/Celery/FastAPI pipeline. Diagnosed and fixed peak-load reliability issues using deep observability and Kubernetes autoscaling, cutting notification latency back to sub-second and reducing duplicates via idempotency/DLQs. Also shipped an LLM-powered natural-language search with schema-constrained JSON outputs and guardrails, plus a plan-execute-verify Jira bug-resolution agent that can propose fixes and raise PRs under restricted permissions.”

Algorithms, API Gateway, Asynchronous Processing, AWS, AWS IAM, AWS Lambda (+118 more)

Jisvitha Athaluri

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps

McKinney, TX | 6y exp

Globe Life | Texas A&M University

“Built a production LLM/RAG-based “model excellence scoring” system at Uber to automatically evaluate hundreds of ML models, standardizing quality assessment and cutting evaluation time from days to minutes on GCP. Also delivered an NLP document classification solution for insurance claims at Globe Life, partnering closely with compliance/operations and improving routing accuracy from ~85% manual to 93% with the model.”

A/B Testing, Apache Spark, BERT, ChromaDB, Data Engineering, Data Pipelines (+90 more)

Travoy Spelling

Screened

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX | 10y exp

Tredence | University of Texas at Austin

“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”

A/B Testing, API Development, AWS, AWS Lambda, AWS Step Functions, Azure Data Factory (+247 more)

Rohith Sadanala

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and MLOps

Missouri, USA | 3y exp

Airbnb | University of South Florida

“LLM/agent engineer who has shipped production RAG chatbots in sustainability-focused domains, including a packaging recommendation assistant that standardized messy user inputs and used Pinecone-backed retrieval over product/regulatory data. Experienced orchestrating end-to-end ML workflows with Airflow and AWS Step Functions/Lambda, emphasizing reliability (property-based testing, circuit breakers, OpenTelemetry) and measurable performance (latency/cost). Partnered closely with non-technical leadership to ship 3 weeks early, driving adoption by 150+ businesses and ~20% reported waste reduction.”

A/B Testing, Amazon Bedrock, Amazon EC2, Amazon EKS, Amazon RDS, Amazon S3 (+154 more)

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS | 10y exp

Turing | Mississippi State University

“Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.”

Python, R, SQL, Pandas, NumPy, Scikit-learn (+132 more)

Aaron Li

Screened

Junior AI/ML Engineer specializing in production LLM systems and RAG

Atlanta, GA | 2y exp

Georgia Institute of Technology | University of Chicago

“LLM/document AI engineer who owned a production-grade contract extraction pipeline at CORAMA.AI, ingesting PDFs and dynamic JavaScript sites from 1,000+ government sources. Built a hybrid deterministic+LLM system with two-phase prompting, Pydantic guardrails, confidence scoring, and human-in-the-loop review—cutting error rates from ~35% to <5% and processing 50k+ documents at ~95% accuracy. Also built clinician-in-the-loop orchestration in research, reducing manual labeling time from 3–4 hours to ~50 minutes.”

Machine Learning, LLM Integration, Large Language Models (LLMs), OpenAI API, Prompt Engineering, Web Scraping (+93 more)

Vismay Patel

Screened

Senior AI & Machine Learning Engineer specializing in NLP, GenAI, and MLOps

Berkeley, CA | 7y exp

Kaiser Permanente | San Francisco State University

“ML/GenAI practitioner with healthcare domain depth who built and deployed a production cervical-cancer EMR classification system using a hybrid rules + medical BERT approach, optimized for high recall under severe class imbalance and PHI constraints. Experienced running end-to-end production ML/LLM pipelines with Apache Airflow (validation, promotion/rollback, monitoring, retraining) and partnering closely with clinicians to calibrate thresholds and implement human-in-the-loop review.”

Python, SQL, Java, Go, JavaScript, REST APIs (+121 more)

Chinmay Sanjay Kawle

Screened

Junior Software Engineer specializing in cloud developer tools and backend APIs

Seattle, WA | 2y exp

Amazon Web Services | University of Illinois Chicago

“Summer intern on AWS Lambda tooling team who shipped Finch support in AWS SAM CLI, adding OS/runtime detection and robust fallback behavior to preserve Docker compatibility across developer environments. Also built an end-to-end RAG system for querying arXiv quantitative finance papers using Postgres/pgvector with two-stage retrieval, citation-grounded prompting, and rigorous evaluation loops driven by IR metrics and user feedback.”

Python, Java, C, C++, JavaScript, TypeScript (+83 more)

Ranganayak Meravath

Screened

Mid-level Generative AI Engineer specializing in RAG, agentic copilots, and regulated AI

5y exp

LPL Financial | University of North Texas

“Senior engineer who built and productionized an Azure-based Enterprise AI Copilot for financial/compliance teams, focused on grounded, auditable answers with citations to reduce hallucinations in regulated workflows. Experienced designing multi-step agent orchestration and improving reliability through targeted iterations (e.g., fixing chunking/parsing to materially improve citation accuracy), plus building defensive pipelines for messy ERP/operational finance data.”

Python, SQL, JavaScript, Node.js, Next.js, Bash (+190 more)

Devika Gade

Screened

Mid-level Full-Stack Developer specializing in FinTech and cloud-native applications

Remote, USA | 4y exp

Plaid | Christian Brothers University

“Full stack developer with strong implementation ownership across cloud deployments, integrations, and AI-powered support automation. They have put LLM/RAG workflows into production with measurable impact—cutting first response time by nearly 40%—and show unusual depth in debugging non-deterministic AI incidents, improving observability, and turning messy document inputs into reliable API-driven pipelines.”

Java, Spring Boot, Hibernate, TypeScript, React, Redux (+153 more)

Krishi Jain

Screened

Junior Implementation Manager / Solution Engineer specializing in AI, ERP integrations, and predictive maintenance

Chicago, IL | 2y exp

Continuum AI | Westcliff University

“LLM/agentic workflow practitioner (Continuum AI) who productionized an LLM system for manufacturing RMA intake and warranty claims by moving from a brittle prompt to a modular pipeline with RAG, function-calling extraction, deterministic validation, and strong observability. Also diagnosed and fixed an agentic ticket-triage misrouting issue by tracing failures to retrieval timeouts, adding guardrails/fallbacks, and implementing retries plus continuous evaluation—bringing misroutes near zero while creating a repeatable debugging playbook.”

Python, Java, Swift, C++, C, JavaScript (+84 more)

Chaitanya Sachdeva

Screened

Mid-level Applied AI Engineer specializing in LLM infrastructure and model optimization

San Jose, CA | 3y exp

AMD | USC

“LLM engineer who has deployed privacy-preserving, real-time workplace risk monitoring over massive enterprise chat/email streams, tackling latency, hallucinations, and extreme class imbalance with model benchmarking, RAG + fine-tuning, and a pre-filter alerting layer. Also built an agentic legal contract drafting system (Jurisagent) using LangGraph/LangChain with deterministic multi-agent control flow, structured outputs, and reliability-focused evaluation/telemetry.”

Python, C++, Bash, LangChain, LangGraph, NumPy (+104 more)

Yukta Kulkarni

Screened

Junior AI/ML Engineer specializing in applied LLMs, security, and reinforcement learning

New York, USA | 2y exp

New York University | NYU

“Built and shipped a production LLM-powered investor research feature for a fintech product, focused on grounded answers and minimizing hallucinations. Implemented retrieval-quality and evidence-coverage gating with clear refusal fallbacks, and evaluated systems with regression tests and metrics like correct-refusal rate, hallucination rate, and latency. Comfortable orchestrating workflows with LangChain or custom Python depending on production needs.”

Python, C, C++, SQL, TypeScript, JavaScript (+82 more)

Venkata Sai Pavan Dema

Screened

Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps

5y exp

Capital One | University of the Cumberlands

“AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.”

A/B Testing, Amazon EC2, Amazon Redshift, Amazon S3, Amazon SageMaker, Azure App Service (+163 more)

Lakshmi Kiranmayi Chelluboyina

Screened

Junior Full-Stack & Data Engineer specializing in cloud platforms and cybersecurity ML

New York, NY | 2y exp

Accenture | NYU

“Built a hackathon "Patient Summary Assistant" backend focused on healthcare workflows, combining RAG-based summarization with HIPAA-minded privacy controls (NER redaction + encryption). Demonstrated strong infra skills by deploying on Kubernetes with Helm/HPA and GitOps (ArgoCD), plus migrating from OpenAI to an on-prem Llama 3 stack (vLLM, quantization, shadow-mode testing) and adding real-time Kafka ingestion for patient vitals/anomaly alerts.”

Agile, Apache Spark, C, C#, C++, CI/CD (+93 more)

Prachi Jain

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

Remote, US | 6y exp

JPMorgan Chase | University of Massachusetts Amherst

“Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.”

Python, C, C++, CUDA, SQL, MATLAB (+129 more)

John Joji Melel

Screened

Intern Generative AI Engineer specializing in RAG and multi-agent systems

Chicago, IL | 2y exp

NeuraFlash | University of Chicago

“Built and deployed a production RAG-based multi-agent chatbot during an internship to help consultants answer client questions and guide users through new IT systems with step-by-step instructions. Demonstrates hands-on experience with LangGraph/LangChain/Google ADK, unstructured document parsing and chunking for RAG, and a reliability-first approach to agent workflows (metrics, fallbacks, human-in-the-loop, guardrails).”

Python, SQL, R, C++, Kubernetes, Docker (+87 more)

Yeshwanth Pulapa

Screened

Mid-level AI/ML Engineer specializing in Databricks, MLOps, and real-time fraud detection

The Colony, TX | 4y exp

Databricks | University of North Texas

“ML/LLM engineer building production, real-time fraud detection for financial transactions using a two-tier architecture (fast ML + GPT) to deliver both low-latency decisions and analyst-friendly risk explanations. Experienced orchestrating end-to-end retraining, drift monitoring, and automated model promotion with Databricks Jobs/Workflows and MLflow, and partnering closely with fraud analysts to tune alerts, thresholds, and dashboards.”

A/B Testing, Apache Airflow, Apache Kafka, Apache Spark, AWS, AWS Lambda (+93 more)

Nikita Vivek Kolhe

Screened

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States | 1y exp

WorkUp | USC

“ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).”

Python, SQL, C, C++, R, MATLAB (+90 more)

Zufeshan Imran

Screened

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA | 10y exp

SOTER AI | UC San Diego

“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”

Machine Learning, Deep Learning, Generative AI, Transformers, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) (+120 more)
