Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation Python Docker SQL AWS CI/CD

suparshwa patil

Screened ReferencesStrong rec.

Mid-level Software Engineer specializing in AI platforms and full-stack systems

Santa Clara, CA4y exp

One CommunityPurdue University

“Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.”

Python Java TypeScript Go SQL FastAPI+130

View profile

TejaSree Chiluveru

Screened ReferencesModerate rec.

Mid-level Software Engineer specializing in FinTech and cloud-native microservices

Austin, TX5y exp

JPMorgan ChaseWebster University

“Built and launched an internal AI troubleshooting assistant focused on safe, retrieval-first root cause analysis for enterprise systems, with strong attention to monitoring, fallback behavior, and post-launch iteration. Also owns full-stack product work across React and Java/Spring Boot, including high-volume financial operations workflows, and reports measurable LLM improvements such as ~30-40% latency reduction.”

Java Python AWS Microservices Distributed Systems LangChain+127

View profile

Manasa Pantra

Screened ReferencesStrong rec.

Junior Software Engineer specializing in AI, LLM systems, and full-stack development

Stony Brook, NY2y exp

Stony Brook UniversityStony Brook University

“Product-focused full-stack engineer at startup (Zippy) who shipped a production multi-agent AI system for restaurant operations plus payments workflows. Built end-to-end: RAG grounded on a Notion knowledge base, structured function-calling task routing, FastAPI/JWT multi-tenant backend, and a polished React+TypeScript owner dashboard. Has real production incident experience (duplicate Stripe webhooks) and reports ~94% task-routing accuracy under load.”

Python C C++JavaScript TypeScript Git+161

View profile

Abnik Ahilasamy

Screened ReferencesModerate rec.

Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference

Chennai, India0y exp

Larsen & ToubroArizona State University

“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”

Python PyTorch TensorFlow C++SQL Bash+153

View profile

Srinivas Tenneti

Screened

Mid-level AI/ML Engineer specializing in GenAI and predictive modeling

Fullerton, California5y exp

UnitedHealth GroupGeorge Washington University

“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”

A/B Testing Amazon ECS Apache Spark AWS AWS Glue BigQuery+110

View profile

Julian Lee

Screened

Intern Software Engineer specializing in AI/LLMs and full-stack development

New York, New York1y exp

Highlight.AIUSC

“AI/ML infrastructure-focused engineer who has built production RAG systems from scratch (Supabase/pgvector + OpenAI embeddings) and iterated using formal eval metrics to improve retrieval quality. Also debugged real-time audio issues in a LiveKit-based pipeline by correlating packet loss with VAD behavior, and has deep experience building brittle, customer-specific financial platform integrations in Python/Playwright (2FA, redirects, token refresh, rate limits).”

Algorithms API Integration AWS AWS Lambda CI/CD C#+152

View profile

Vamsi Koppala

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems

Barrington, IL4y exp

ComericaTexas Tech University

“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”

Agile Apache Spark Azure Blob Storage Bash BERT Bitbucket+178

View profile

Alex Nguyen

Screened

Junior Applied AI Engineer specializing in LLMs, RAG, and agentic systems

La Jolla, CA2y exp

Uniwise.aiUC San Diego

“Co-founded a healthcare AI startup building and deploying software directly with end users, emphasizing rapid shipping, deep user interviews, and workflow-first adoption. Has hands-on production deployment experience on AWS (including diagnosing a silent AWS App Runner failure caused by an ARM vs amd64 Docker build mismatch) and is motivated by customer-facing, travel-heavy roles to keep engineering tightly connected to real-world usage.”

Python PyTorch Pandas NumPy Scikit-learn Hugging Face+83

View profile

AnilKumar Kanakadandila

Screened

Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps

San Francisco Bay Area, CA5y exp

VerizonCalifornia State University

“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”

Python Pandas NumPy Scikit-learn SQL MySQL+105

View profile

shiva kumar kotha

Screened

Mid-level AI/ML Engineer specializing in Generative AI and healthcare data

NJ, USA6y exp

Johnson & JohnsonWichita State University

“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”

Analytics API Integration API Testing AWS Azure Data Factory BERT+158

View profile

Anurag Reddy

Screened

Mid-level Data Scientist specializing in ML, MLOps, and Generative AI

TX, USA5y exp

CaterpillarUniversity of Illinois Chicago

“ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.”

A/B Testing Agile Anomaly Detection Ansible Apache Airflow Apache Hadoop+138

View profile

David Wisdom

Screened

Mid-level Data & Machine Learning Engineer specializing in production ML and data platforms

San Francisco, CA5y exp

Spice DataWilliam & Mary

“Built and deployed a production LLM system that scraped Google Maps menu photos, extracted structured prices via OpenAI, and cross-validated them against website-scraped data to automate data-quality verification at scale (replacing costly manual contractor checks). Demonstrates strong reliability instincts—precision-first prompting, output gating with image-quality metadata, and fuzzy matching/RAG techniques—plus solid orchestration (Dagster/Airflow) and observability (Sentry, Prometheus/Grafana).”

Python SQL Ruby Rust Snowflake BigQuery+74

View profile

Sarthak Singh

Screened

Mid-level Full-Stack Engineer specializing in cloud-native systems and LLM applications

Remote, USA4y exp

InfluencedUniversity of Maryland, College Park

“Customer-support/engineering background spanning Informatica PowerCenter ETL and IBM demos/workshops, with hands-on experience hardening data workflows for production (error tables/reject links, validation, restart strategies, alerting, performance tuning). Also demonstrates a clear, systems-level approach to diagnosing LLM/agentic workflow issues (prompt/RAG/tooling/memory) using instrumentation and iterative fixes, and has partnered with sales on POCs by defining success metrics and mapping solutions to customer architectures.”

Next.js TypeScript Tailwind CSS Amazon DynamoDB Large Language Models (LLMs)Python+104

View profile

Asif Mulla

Screened

Mid-Level Software Engineer specializing in Java microservices and event-driven systems

Maryland, USA6y exp

Morgan StanleyUniversity of Alabama at Birmingham

“Backend engineer on Morgan Stanley’s trade risk and compliance platform, building Java/Spring Boot microservices that validate equity and fixed-income trades at multi-million-events/day scale. Shipped an LLM-assisted trade exception analysis feature using RAG over internal policy documents and trade history, with production-grade guardrails (confidence thresholds, audit logs, human-in-the-loop) and measurable performance wins (~30–35% faster reporting) through PostgreSQL tuning and Redis caching.”

Java Python TypeScript SQL Spring Framework Spring Boot+138

View profile

Ananya Bojja

Screened

Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps

USA4y exp

CignaUniversity of New Hampshire

“AI/ML engineer at Cigna Healthcare building a production, HIPAA-compliant LLM-powered clinical insights platform that summarizes unstructured medical notes using a fine-tuned transformer + RAG on AWS. Demonstrates strong end-to-end MLOps and cloud optimization (distillation, Spot/Lambda/Auto Scaling) with quantified outcomes (~28% accuracy lift, ~40% less manual review, ~25% lower ops cost) and strong clinician-facing explainability via SHAP and dashboards.”

A/B Testing Agile API Integration Apache Airflow Apache Kafka Apache Spark+148

View profile

Sai Ganesh nelluri

Screened

Mid-level Generative AI Engineer specializing in LLM systems and RAG

5y exp

Huntington BankCentral Michigan University

“Currently at Huntington Bank, built a production-grade RAG system that helps business/operations teams get grounded answers from large volumes of internal enterprise documents. Owns ingestion and FastAPI backend, tuned hybrid BM25+vector retrieval and chunking for relevance, and evaluates reliability with metrics and observability (LangSmith, CloudWatch, Prometheus/Grafana) while partnering closely with non-technical stakeholders.”

Python SQL Java Bash Shell Scripting R+169

View profile

Hruday Vuppala

Screened

Junior Software Engineer specializing in Full-Stack and ML for FinTech

Hyderabad, Telangana1y exp

Volksoft TechnologiesUSC

“Full-stack engineer with fintech trading-platform experience who shipped and operated a real-time portfolio P&L/performance feature end-to-end (React + Node/WebSockets + MongoDB) on AWS, including significant performance tuning under peak trading load. Also built a Spark-based trading analytics pipeline with idempotency and reconciliation for auditability, and has a personal React/TS + Node/Express project (Artsy) with JWT auth and schema-evolution practices.”

Python JavaScript TypeScript C C++SQL+92

View profile

Neimisha Konda

Screened

Senior Data Engineer specializing in Palantir Foundry and Snowflake for regulated industries

USA5y exp

American ExpressUniversity of Massachusetts Boston

“Data engineer focused on high-volume transaction pipelines (2M+ per day) using Snowflake/Snowpipe, Spark/PySpark, Kafka, and Airflow, with a strong emphasis on schema/data-quality enforcement and reliability improvements. Also built a greenfield compliance-focused RAG solution, using CloudWatch monitoring and adding ingestion validation to prevent malformed OCR documents from degrading search quality.”

Snowflake SQL PostgreSQL MySQL NoSQL Apache Spark+109

View profile

Jiyang Wu

Screened

Junior Software Engineer specializing in cloud microservices and database systems

Stony Brook, NY2y exp

Stony Brook UniversityStony Brook University

“Grad student who co-developed a safety-oriented mental health LLM consulting agent using RAG + Gemini and Hugging Face emotion detection to assess user crisis level and adapt responses. Implemented a key reliability improvement for CRISIS scenarios by bypassing generative output and returning direct, emotionless, knowledge-base guidance to seek immediate real-world help.”

Amazon CloudWatch Amazon EC2 Amazon EKS Amazon RDS Amazon SQS Angular+53

View profile

Cristian Vega

Screened

Senior AI/ML Engineer specializing in Generative AI and RAG

California, null9y exp

Morf HealthUniversity of Texas at Austin

“ML/NLP practitioner at Morf Health focused on unifying fragmented healthcare data by linking structured patient/encounter records with unstructured clinical notes. Has hands-on experience with transformer embeddings, vector databases, and domain fine-tuning, plus rigorous evaluation (precision/recall) and human-in-the-loop validation with clinical SMEs to make pipelines production-grade.”

Python R Java JavaScript SQL MySQL+154

View profile

Nikitha Kommidi

Screened

Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps

6y exp

CitibankUniversity of Texas at Arlington

“Built a production real-time fraud detection and customer-support automation platform at Citibank, tackling extreme class imbalance (reported ~1:5000) and strict latency constraints. Combines hands-on MLOps (Airflow, Kubernetes, MLflow; Snowflake/Spark/S3 integrations; CI/CD model promotion) with cross-functional delivery to Risk & Compliance focused on interpretability and reducing false positives.”

Python SQL Bash C JavaScript PHP+154

View profile

HIMANSHU SHARMA

Screened

Mid-level AI Solutions Engineer specializing in enterprise GenAI and automation

Orlando, FL6y exp

Kore.aiUniversity of South Florida

“Built and shipped multiple production LLM/agentic systems, including an agentic RAG NL-to-SQL analytics app that cut manual reporting from 9 hours/week to 15 minutes by grounding on schema-aware retrieval and robust fallback/monitoring. Also implemented a LangChain supervisor-orchestrated enterprise IT automation agent that routes requests for search, identity validation, and action execution, and created a RAG search tool spanning Jira/Confluence/SharePoint for operations stakeholders.”

Python PyTorch TensorFlow Scikit-learn Hugging Face Transformers SQL+121

View profile

Kasireddy Kumar reddy

Screened

Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems

Missouri, USA6y exp

CenteneUniversity of Central Missouri

“Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.”

A/B Testing Agile Apache Airflow Apache Kafka Azure Blob Storage BigQuery+137

View profile

Fnu Pallavi Sharma

Screened

Intern Data Scientist specializing in ML, NLP, and MLOps for healthcare and enterprise AI

Madison, WI1y exp

University of Wisconsin–MadisonUniversity of Wisconsin–Madison

“Built a production multi-cloud LLM-driven IT ticket automation system using LangGraph, Azure + Pinecone RAG, and an Ollama-hosted LLM on AWS, with Terraform-managed infra and PostgreSQL audit/state tracking for reliability. Also partnered with UW School of Medicine & Public Health students to deliver a glioma survival risk-ranking model, translating clinical feedback into practical pipeline improvements (imputation, site harmonization) and stakeholder-friendly visualizations.”

A/B Testing API Gateway AWS Computer Vision Data Visualization Deep Learning+118

View profile

Software Engineers Machine Learning Engineers Data Scientists AI Engineers Research Assistants Software Developers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?