Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation Python Docker SQL AWS CI/CD

Krishna Rajput

Screened

Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems

Tempe, AZ · 5y exp

HCLTech · Arizona State University

“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”

A/B Testing AI Agents Anomaly Detection AWS Glue AWS Lambda Azure Machine Learning +126

Nitesh Bommisetty

Screened

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL · 4y exp

Lumen · University of South Florida

“AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.”

Python R SQL TensorFlow PyTorch Keras +123

Gabriele Gobbi

Screened

Mid-level Data Scientist specializing in GenAI, LLM-to-SQL, and analytics platforms

Turin, Italy · 3y exp

Engineering Ingegneria Informatica · University of Ferrara

“LLM/agentic AI builder who led end-to-end integration of an LLM system into a business intelligence product, creating a scalable, metadata-driven RAG/agent pipeline with an orchestrator that routes queries to specialized agents (including DB-backed quantitative querying). Also built an LLM-to-SQL chatbot and partnered with non-technical stakeholders to capture domain context and improve SQL generation, using automated LLM-based testing to evaluate reliability.”

Python Machine Learning Scikit-Learn TensorFlow PyTorch Large Language Models (LLMs) +51

Saran Palle

Screened

Mid-level Applied AI Engineer specializing in agentic LLM workflows

North Carolina · 4y exp

Acentrik Technology Solutions · University at Buffalo

“AI engineer with production experience building a LangGraph-based, stateful multi-agent system at MetLife to automate complex insurance claims adjudication, integrating document discovery, Azure Document Intelligence OCR/extraction, and health data analysis. Strong in agent orchestration and production deployment (Docker + FastAPI REST APIs), with a structured approach to reliability, evaluation, and stakeholder-driven requirements.”

Python FastAPI Flask TypeScript REST APIs System Design +101

ManiKumar Chintha

Screened

Mid-level Full-Stack Java Developer specializing in microservices and cloud (AWS/Azure)

Texas, USA · 4y exp

PNC · Wichita State University

“Backend/full-stack Java engineer at PNC Bank specializing in real-time fraud detection systems. Built event-driven Spring Boot + Kafka microservices with PostgreSQL/Redis performance tuning, and shipped a production LLM-powered RAG feature for fraud analysts with strong guardrails (grounded internal data, structured prompts with references, human-in-the-loop) plus an evaluation loop using labeled historical fraud cases.”

Java C C++ TypeScript Python SQL +97

Vidit Naik

Screened

Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems

San Francisco, CA · 2y exp

Checksum AI · UC Riverside

“Forward-deployed engineer who built a production AI drone-control chatbot that lets users fly a drone via natural language while viewing a real-time feed. Implemented RAG over drone SDK documentation (vector DB + top-k retrieval) and LoRA fine-tuning, with a focus on latency, token efficiency, and cost reduction, and regularly works with non-technical clients to integrate and explain AI system architecture.”

Artificial Intelligence AWS AWS Glue AWS Lambda BERT CI/CD +101

Jitesh Kumar S

Screened

Junior Machine Learning Engineer specializing in NLP, computer vision, and MLOps

Lafayette, IN · 3y exp

Yaarcubes · University of Maryland, College Park

“ML/LLM engineer with Meta experience building production AI systems for near real-time user-report classification and summarization under strict latency (<250ms), safety, cost, and privacy constraints. Has hands-on MLOps/orchestration experience (Airflow, Spark, MLflow, Kubernetes, Docker, GitHub Actions) plus observability (Prometheus/Grafana) and applies rigorous evaluation, staged rollouts, and A/B testing to keep agent workflows reliable in production.”

Python SQL Bash Shell Scripting Java C++ +99

Surya Danturty

Screened

Intern AI/ML Engineer specializing in computer vision and time-series forecasting

Riverside, CA · 0y exp

University of California, Riverside · UC Riverside

“Undergrad who built a production RAG chatbot for a messy college website using OpenAI embeddings + FAISS, overcoming hard-to-crawl/non-selectable site content and strict API budget limits. Applies information-retrieval best practices (section-based chunking with overlap, precision/recall evaluation) and reliability techniques (edge-case testing, similarity thresholds, fallback responses), and has experience scaling similar indexing work to ~300,000 Wikipedia pages.”

C Python Java JavaScript SQL HTML +74

Akshay Katageri

Screened

Mid-level AI Engineer specializing in multi-agent systems and RAG

Jersey City, NJ · 4y exp

Elevance Health · Pace University

“Built and shipped a production LangGraph-based multi-agent LLM analytics/decision copilot that answers questions across SQL/BI systems and unstructured docs, emphasizing grounded, tool-verified outputs with citations and confidence gating. Deep hands-on experience with orchestration (LangGraph, CrewAI, OpenAI Assistants, MCP) plus real-world latency/cost optimization (vLLM batching/KV caching, speculative decoding, quantization) and rigorous eval/observability. Partnered closely with business/ops stakeholders to deliver explainable reporting automation, cutting manual reporting time by 50%+.”

AI Agents Cross-Functional Collaboration Data Pipelines Docker FAISS Feature Engineering +106

Bhoomi Parikh

Screened

Junior Product Manager specializing in AI-enabled analytics products

Mountain View, CA · 2y exp

Kantar · University of Texas at Dallas

“Product/full-stack engineer with analytics-dashboard experience at Kantar, owning features end-to-end from React/Next.js UI through Postgres data modeling and query optimization. Built a multidimensional filters/tags module that cut analyst discovery time by ~60% and also implemented durable backend workflows for bulk report generation with retries and idempotency, validated via EXPLAIN ANALYZE and production monitoring.”

Product Strategy A/B Testing Cross-Functional Leadership Customer Segmentation Prompt Engineering Retrieval-Augmented Generation (RAG) +87

Farida Poor

Screened

Junior Machine Learning Engineer specializing in NLP and multimodal transformers

Bay Area, CA · 3y exp

Altea Technology · University of Denver

“Built and deployed LLM-powered agentic chatbot and text-to-SQL systems using LangGraph/LangChain (and Bedrock), structuring workflows as DAGs with planning/replanning and validation to improve tool-calling reliability and reduce hallucinations. Operates production feedback loops with online/offline metrics, drift detection, and LangSmith-based evaluation pipelines, and regularly partners with business stakeholders and clinicians using slide decks and visual charts.”

Python C C++ MATLAB R SQL +107

Snehitha Penumaka

Screened

Mid-level AI/ML Engineer specializing in predictive modeling and cloud ML pipelines

Dallas, TX · 3y exp

Cambard LLC · University of Texas at Dallas

“LLM engineer/data engineer who has deployed production RAG systems for internal-document Q&A, building end-to-end ingestion, embedding, vector search, and FastAPI serving while actively reducing hallucinations and latency through rigorous retrieval tuning and caching. Also experienced in orchestrating cloud data pipelines (Airflow, AWS Glue, Azure Data Factory) and partnering with non-technical business teams to deliver AI solutions like automated document review.”

A/B Testing Agile Anomaly Detection Apache Spark AWS Lambda Classification +93

Pravalika Kuppireddy

Screened

Mid-level AI/ML Engineer specializing in Generative AI and intelligent automation

4y exp

University of Michigan-Dearborn · University of Michigan-Dearborn

“LLM engineer who built and productionized a system to classify GitHub commits (performance vs non-performance) using zero-/few-shot approaches over commit messages and diffs, working at ~5M-record scale on multi-node NVIDIA GPUs. Experienced orchestrating end-to-end LLM pipelines with Airflow and GitHub Actions, and emphasizes reliability via testing, guardrails, and observability while collaborating closely with non-technical product stakeholders.”

Python SQL Java C++ Scikit-learn PyTorch +133

Abhiroopsudansh Karengula

Screened

Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps

KS, USA · 4y exp

Black & Veatch · University of Central Missouri

“Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.”

Python R C++ Java SQL Bash +161

Fabio Pecora

Screened

Junior Software Engineer specializing in distributed systems and applied AI

New York, NY · 3y exp

NextStep.AI · College of Staten Island (CUNY)

“Early-career full-stack builder who created an AI interview-prep platform used by 200+ students, tested it with a 25-student study group, and earned recognition through the CUNY Startup accelerator, including prize money and local college adoption. Has also shipped compliance-sensitive AI products in healthcare marketing and operational tools like invoice approval systems, showing unusual breadth across AI, UX, and backend systems.”

Python Java TypeScript JavaScript SQL FastAPI +140

Pyneni Sai Charan

Screened

Mid-level Full-Stack Engineer specializing in FinTech deployments

4y exp

BMO · University of Central Missouri

“Backend-focused engineer with banking-domain deployment experience who has owned releases end-to-end, from discovery and API/database implementation through post-launch stabilization. Brings a reliability-first mindset across distributed systems, incident response, and messy real-world data handling, and has also applied that foundation to retrieval-based LLM workflows in production-oriented cloud environments.”

API Gateway JWT SOAP OpenAPI Swagger Python +72

Chaitanya Annabathana

Screened

Mid-level Software Engineer specializing in AI pipelines and enterprise integrations

USA · 5y exp

AFBA Life Insurance · California State University, East Bay

“Candidate has 4 years of experience and appears strongest in customer-facing implementation and AI-enabled workflow automation. They describe owning deployments end-to-end, putting an LLM support assistant with RAG and function calling into production, and improving support operations with a 30% reduction in resolution time and 25% gain in agent productivity.”

Python Java SQL JavaScript TypeScript Workflow Automation +70

Vasavi Nagavalli Gollapalli

Screened

Senior Full-Stack Software Engineer specializing in backend systems and cloud-native APIs

Detroit, MI · 7y exp

Cortile · San Jose State University

“Full-stack engineer with startup-style ownership across backend, frontend, and AI systems, spanning Java/Spring, React, Node/TypeScript, and LLM-powered retrieval. Shipped a workspace intelligence layer using LangChain, OpenAI, and Pinecone to paying customers, while also improving core product metrics like workspace creation success (+30%), latency (450ms to 280ms), and deployment cycle time (-40%).”

Java JavaScript TypeScript Go Python SQL +143

Santhosh Ravula

Screened

Mid-level Full-Stack Software Engineer specializing in cloud-deployed web apps and APIs

Dayton, OH · 3y exp

Wells Fargo · Wright State University

“Software engineer who has shipped both core web platform features (secure user authentication/profile management) and production LLM systems. Built an internal documentation knowledge assistant using a full RAG pipeline (OpenAI embeddings, vector DB, semantic search, reranking) with evaluation loops and a scalable document-ingestion pipeline for PDFs/FAQs, iterating based on metrics and user feedback.”

Python JavaScript TypeScript SQL React Angular +127

Sahithi A

Screened

Mid-level AI Engineer specializing in LLM agents and RAG for health-tech

Remote · 6y exp

Milton AI · Texas Tech University

“Backend engineer with health-tech AI platform experience who designed a modular FastAPI/PostgreSQL architecture supporting real-time user data and swap-in AI workflows. Has hands-on production experience with observability (CloudWatch, structured logging, LangSmith/LangGraph/LangChain tracing), secure auth (OAuth2/JWT, RBAC, RLS), and careful data-pipeline migrations using parallel runs and rollback planning.”

Agile AI Agents API Integration AWS Backend Development CI/CD +121

Sai Addala

Screened

Mid-level AI/ML Engineer specializing in financial risk, fraud analytics, and forecasting

USA · 4y exp

Northern Trust · Syracuse University

“Built and productionized an LLM-powered financial intelligence and forecasting platform at Northern Trust using a RAG architecture (LangChain + Hugging Face + FAISS) with end-to-end MLOps (Docker/Kubernetes, Airflow, MLflow). Emphasized regulatory-grade explainability (SHAP/Power BI) and hallucination control (retrieval-only grounding), achieving ~30% forecasting accuracy improvement and ~65% reduction in analyst research time, with sub-second inference and 95% uptime on EKS/AKS.”

Python NumPy Pandas JSON SQL PostgreSQL +116

Akshay Krishna Varma Buddharaju

Screened

Junior Machine Learning Engineer specializing in computer vision and generative AI

1y exp

INV Technologies · Kennesaw State University

“CoreAI intern at The Home Depot who improved the Magic Apron Assistant by building a production video ingestion + RAG retrieval system for long videos (uploads and YouTube), including a graph-based retrieval module to speed up and improve relevance. Experienced with Kubernetes orchestration (HPA) and production reliability practices like caching, monitoring, regression testing, and stakeholder-driven requirements.”

Automated Testing AWS BERT C C++ CI/CD +84

Billy Y

Screened

Junior Software Engineer specializing in Full-Stack and GenAI/LLM applications

San Jose, CA · 2y exp

Zymebalanz · Boston University

“LLM/RAG practitioner building clinician-facing AI search and Q&A inside EHR workflows, focused on trust, latency, and safety (grounded answers with citations, PHI controls, encryption/audit logs). Demonstrated real-time incident response for production LLM systems (e.g., fixing a metadata-filter deployment regression to prevent irrelevant results/cross-patient leakage) and strong demo/enablement skills for mixed technical and clinical stakeholders; also shipped a multi-model RAG tool at OrbeX Labs with upload/search/audit features for day-to-day adoption.”

Python C++ Java C HTML JavaScript +174

SaiGanesh Konagalla

Screened

Mid-level ML Engineer specializing in NLP and Generative AI

Houston, TX · 4y exp

Epic Systems · University of Central Missouri

“Healthcare AI/ML engineer with Epic experience who built and deployed a HIPAA-compliant GPT-4 RAG clinical assistant over large medical document sets, emphasizing privacy controls and low-latency performance. Also automated end-to-end retraining and deployment of patient risk models using orchestration/CI-CD (Jenkins, SageMaker, MLflow), cutting deployment time from hours to minutes while improving reliability.”

Python NumPy Pandas Scikit-learn Seaborn Matplotlib +186
