
Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation, Python, Docker, SQL, AWS, CI/CD

Raahul Krishna Durairaju

Screened

Junior Software Engineer specializing in distributed systems and ML platforms

Fullerton, CA · 1y exp

California State University, Fullerton · Cal State Fullerton

“Built and deployed real-world systems end-to-end across security and healthcare contexts: led a 3-person team delivering a university vehicle tracking system with 30% cost savings and 1-year post-launch monitoring. Also implemented a healthcare RAG chatbot with adaptive query routing that cut LLM costs by 40% while maintaining answer accuracy, and has experience debugging non-deterministic LLM behavior in DevOps pipeline automation.”

Python, Java, C, C++, JavaScript, TypeScript (+177 more)


DileepReddy Battu

Screened

Mid-level Software Engineer specializing in backend, full-stack, and healthcare IT

Lake Mary, FL · 5y exp

Vesta Teleradiology · Northern Arizona University

“Software engineer with a pragmatic, production-oriented approach to AI-driven development, using AI to accelerate coding while keeping human oversight on correctness, architecture, and final decisions. Has hands-on experience with agent-style AI workflows and has led the design and coordination of AI-agent systems with a strong emphasis on reliability, performance, and end-to-end execution.”

Python, JavaScript, TypeScript, Java, C++, C (+118 more)


Viswanath Jagaluri

Screened

Mid-level Full-Stack & AI Engineer specializing in LLM applications

6y exp

Our National Conversation · Fitchburg State University

“Full-stack engineer who has shipped and operated generative-AI chat/QA features end-to-end, including a RAG-based pipeline with guardrails and cost/latency monitoring in production. Experienced with React/TypeScript + Node/Postgres architectures, Dockerized deployments to AWS (EC2) via GitHub Actions CI/CD, and building reliable ingestion/ETL systems with idempotency, backfills, and reconciliation.”

Python, Java, JavaScript, TypeScript, SQL, C# (+222 more)


Karthik Patralapati

Screened

Mid-level AI/ML Software Engineer specializing in GPU-optimized LLM inference and cloud microservices

Seattle, WA · 5y exp

DVR Softek · San José State University

“Built and deployed a production RAG-based multilingual analytics assistant for healthcare operations, enabling non-technical teams to query claims/EHR and risk metrics with grounded explanations. Demonstrates strong end-to-end LLM system engineering (retrieval tuning, re-ranking, hallucination controls, verification layers) plus workflow orchestration (Airflow/Composer/Step Functions) and stakeholder-driven iteration via prototypes and dashboards.”

Python, Pandas, NumPy, PySpark, C, C++ (+197 more)


Shram Kadia

Screened

Junior Software Engineer specializing in ML, RAG systems, and safety-critical risk modeling

San Jose, CA · 2y exp

OpenPRA Org · North Carolina State University

“Backend/cloud engineer from Resilient Tech with hands-on experience deploying REST APIs and database migrations into a live ERP used by real customers while maintaining 99% uptime. Has debugged intermittent AWS container timeouts down to security group/load balancer misconfigurations, and has extended Python in an ERPNext system to meet GST/e-invoicing compliance requirements with strong customer collaboration.”

Agile, AWS, CI/CD, C#, Computer Vision, Data Visualization (+81 more)


Bhavana Polakala

Screened

Intern Data Scientist specializing in GenAI agents, RAG, and ML platforms

Chicago, IL · 3y exp

Immerso.ai · Illinois Institute of Technology

“LLM/agent systems builder who deployed a production hybrid router for immerso.ai that dynamically selects retrieval vs reasoning vs generative pathways, achieving an 82% factual-accuracy lift. Deep hands-on experience optimizing local Mistral 7B inference (4–5 bit GGUF quantization, KV-cache reuse) and building reliable RAG/agent workflows with LangChain/LangGraph/AutoGen across GCP Cloud Run and AWS (ECS/Lambda).”

AJAX, BigQuery, Bootstrap, C++, CI/CD, Claude (+153 more)


Doondi Ashlesh Tammineedi

Screened

Junior Full-Stack Software Engineer specializing in cloud-native web apps and AI tooling

California, US · 3y exp

EduQuencher · Missouri University of Science and Technology

“Software engineer with experience across edtech, live gaming, and an AI document intelligence platform, delivering end-to-end customer-facing features and production backends. Built secure, automated live-session scheduling integrating Zoom and TalentLMS (JWT/RBAC, idempotency, transactions) cutting setup time from ~3 minutes to under 1 minute, and optimized real-time gaming dashboards/APIs with query tuning, caching, and CDN improvements (~60% latency reduction under peak load) on AWS.”

Python, Java, JavaScript, TypeScript, C, C++ (+101 more)


Dinal Dholiya

Screened

Mid-level Full-Stack Engineer specializing in AI-powered and cloud-native systems

Remote · 4y exp

Zentrais · University at Buffalo

“Product-minded engineer who has owned features end-to-end, including a full onboarding redesign that lifted completion ~25% and a production LLM/RAG report-generation system with strong guardrails (schema-constrained JSON, confidence gating, logging) and an automated eval/regression loop built from real user queries. Also built a scalable research data pipeline ingesting messy PDFs/JSON/CSVs with normalization, idempotent reruns, observability, and cost/latency tradeoffs.”

TypeScript, JavaScript, Python, Go, SQL, C++ (+91 more)


BHAVANA KRISHNAN

Screened

Junior AI/ML Engineer specializing in Generative AI, NLP, and MLOps

Lewisville, TX · 1y exp

ThinkBig Software Solutions · Texas Tech University

“LLM engineer who has deployed a production RAG system (LangChain/FAISS/FastAPI) for enterprise semantic search, tackling real-world latency by LoRA/PEFT fine-tuning and grounding outputs with retrieval. Brings strong MLOps (Docker, AWS EKS, CI/CD, MLflow) plus stakeholder-facing explainability experience using SHAP to align ML-driven financial guidance with non-technical domain experts.”

Apache Spark, AWS, AWS Lambda, Azure Machine Learning, CI/CD, Clustering (+87 more)


Pranav Mishra

Screened

Junior Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

Charlotte, NC · 2y exp

WheelPrice · University of Illinois Chicago

“AI/ML engineer who has shipped production systems across computer vision and conversational agents: built a YOLOv8-based wheel fitment pipeline at a Techstars-backed automotive startup, focusing on sub-second latency, monitoring, and robust fallback mechanisms that drove 2–3x page view growth and +5–6k users. Also built a voice-based interview platform orchestrating Deepgram + GPT-4 Mini + OpenAI TTS with FSM-driven reliability, and has hands-on RAG experience (LangChain, hybrid retrieval, cross-encoder reranking, custom pseudo-query generation).”

Python, Java, C++, JavaScript, C#, TensorFlow (+117 more)


Rethvick Sriram Yugendra Babu

Screened

Junior AI/ML Software Engineer specializing in Generative AI and scalable data pipelines

Tucson, AZ · 2y exp

University of Arizona · University of Arizona

“Built and operated large-scale biodiversity/ecological research platforms, integrating 50+ heterogeneous global datasets into a unified BIEN 3 schema on PostgreSQL/PostGIS and improving data consistency by 35%. Strong production engineering background (Linux monitoring, CI/CD performance gates, Docker on AWS/Azure) plus applied AI work building a Python RAG system (0.90 precision) and halving latency with Elasticsearch.”

Agentic AI, AWS, CI/CD, C#, C++, Computer Vision (+109 more)


Taruni Reddy Ampojwala

Screened

Mid-level GenAI Engineer specializing in LLM agents and RAG systems

Brooklyn, NY · 4y exp

PamTen · Long Island University

“Built and deployed a production RAG-based LLM assistant that answers day-to-day operational questions from internal PDFs/SOPs, with strong emphasis on data consistency (metadata versioning, confidence thresholds, conflict handling) and low-latency retrieval at scale. Experienced designing and orchestrating multi-agent LLM workflows (retrieval/validation/generation) and pipeline orchestration for ingestion/embedding/vector-store updates, plus iterative delivery with non-technical operations/business stakeholders.”

AI Agents, Alerting, Analytics, AWS, BigQuery, CI/CD (+107 more)


Yashwant Gandham

Screened

Junior Machine Learning & Backend Engineer specializing in LLM systems and ML infrastructure

Boulder, CO · 1y exp

NovaChat AI · University of Colorado Boulder

“Built and deployed production RAG-based document search/Q&A systems (DocChat and an internship marketing RAG), using a React + FastAPI stack on GCP with docs stored in GCP buckets and retrieval via embeddings/vector DB. Emphasizes cost/performance tradeoffs (reported ~40% cost reduction) and ships via Docker (Railway), with load/API testing using JMeter and Swagger; regularly collaborates with a CEO stakeholder to iterate and push changes to production.”

Python, NumPy, Pandas, PyTorch, scikit-learn, SQL (+78 more)


Varun Mahankali

Screened

Junior Full-Stack Software Engineer specializing in React, Node.js, AWS, and Generative AI

3y exp

KalvenTech Technologies · University of North Texas

“Built and production-deployed a Streamlit-based PDF RAG chatbot using LangChain (FAISS, embeddings, prompt templates) and OpenAI, optimizing Streamlit’s stateless behavior by caching vector DB + chat history to cut latency and API cost. Demonstrates a rigorous evaluation mindset (gold datasets, unit tests, LLM-as-judge, groundedness KPIs) and has experience communicating privacy/accuracy safeguards (RBAC, data masking, citations) to a non-technical client at Kalven Technologies.”

TypeScript, JavaScript, Python, Java, C, C++ (+84 more)


Binaya Sharma

Screened

Senior Software Engineer specializing in full-stack systems, big data, and applied AI

Baton Rouge, LA · 6y exp

365Labs · Louisiana State University

“Built and deployed ForensicLLM, a local domain-specific LLaMA-3.1-8B model for digital forensic investigators using RAFT + RAG over 1000+ curated research papers, with citation-aware responses and rigorous evaluation (BERTScore/G-Eval). Deployed via vLLM and Docker and validated through a chatbot survey with 80+ participants; published at DFRWS EU 2025.”

Agile, Ansible, Angular, Apache Hadoop, Apache Kafka, Apache Spark (+107 more)


Karan Baid

Screened

Intern Machine Learning Engineer specializing in Generative AI and RAG systems

Jaipur, India

Netgraph Networking Pvt. Ltd. · Vellore Institute of Technology

“Early-career AI/LLM builder who created and deployed a multi-agent news analysis agent (Patrakarita) using CrewAI, coordinating researcher/analyst roles to turn noisy article URLs into structured, prioritized outputs (claims, tone, verification questions, opposing views). Strong focus on orchestration debugging and reliability evaluation, including measuring hallucination/redundancy and improving reasoning by refactoring pipeline sequencing.”

Python, C++, Flask, FastAPI, LangChain, LangGraph (+75 more)


Srikar Tharala

Screened

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

Remote, USA · 4y exp

Procial · Central Michigan University

“Currently at ProShare and reports building an AI/LLM-powered system deployed to production, aimed at helping with status-related difficulties and reducing misunderstandings across transactions. Also cites prior collaboration at Porsche with marketing teams, focusing on translating marketing goals into technical requirements and communicating solutions clearly to non-technical stakeholders.”

Machine Learning, Deep Learning, Generative AI, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Multi-agent Systems (+112 more)


Atharva Chavan

Screened

Junior Full-Stack Software Engineer specializing in mobile, cloud, and GenAI integration

Syracuse, NY · 2y exp

D&D Motor Systems Inc. · Syracuse University

“Software engineering intern with hands-on ownership of a Java/Spring Boot order management microservice, including production performance tuning via Redis caching and database indexing driven by API logs/metrics. Also contributed to a production mobile-backend LLM feature using RAG with embeddings over structured data and documents (DB + object storage), with guardrails to keep responses grounded.”

C++, Java, Kotlin, Swift, Python, JavaScript (+86 more)


Merub SHAIKH

Screened

Junior Software Engineer specializing in full-stack web development and test automation

Chicago, IL · 3y exp

Illinois Institute of Technology · Illinois Institute of Technology

“Full-stack engineer who built and owned a production workflow/kanban-style drag-and-drop system in Next.js (App Router) with Postgres/Prisma, including reusable component abstractions, Cypress E2E coverage, and post-launch performance/bug ownership. Notable for measurable impact (25% faster UI dev, ~30% query perf improvement) and for leading an incremental Express→NestJS migration that reduced technical debt (~40%) through better structure, docs, and team enablement.”

Python, TypeScript, Node.js, REST APIs, JavaScript, React (+88 more)


Sampath Achalla

Screened

Mid-level Python Full-Stack Engineer specializing in AI microservices and cloud data platforms

USA · 3y exp

DoJaGa · Illinois Institute of Technology

“Backend-leaning full-stack engineer in fintech/payments who shipped an end-to-end Stripe payments + webhook system for a financial microservices platform, emphasizing ledger accuracy via idempotency, transactional writes, retries, and DLQs. Also delivered a real-time React/TypeScript payment status dashboard informed by user interviews, and improved production performance by 35% p95 latency through PostgreSQL tuning and Redis caching on AWS.”

Python, SQL, Django, Flask, FastAPI, SQLAlchemy (+178 more)


Gomathy Selvamuthiah

Screened

Junior Data/AI Engineer specializing in MLOps, real-time pipelines, and LLM applications

Portland, US · 2y exp

SBD Technologies · Northeastern University

“Built an LLM-driven MLOps agent at SBD Technologies that automated an EV-charging prediction workflow end-to-end, integrating with real-time Kafka/FastAPI systems supporting 120K+ chargers at 99.99% event delivery. Addressed frequent schema drift by implementing SQLAlchemy/Flyway validation (60% reduction in drift issues) and deployed as Kubernetes microservices with GitHub Actions CI/CD; also has Airflow-based ingestion/crawling experience into Snowflake and stakeholder-facing delivery via a Fleetcharge PWA.”

Python, Java, C, C++, FastAPI, Node.js (+99 more)


Shuchi Shah

Screened

Senior Applied AI Engineer specializing in RAG and full-stack systems

San Jose, CA · 13y exp

OpGov.AI · San Diego State University

“Backend engineer with experience building an end-to-end civic tech AI platform that ingests city council meeting videos, transcribes them with Whisper, and enables natural-language Q&A via a LangChain/FAISS RAG pipeline. Demonstrated strong systems thinking by tuning retrieval for accuracy/latency/memory (cutting response time ~3s→1s and memory ~500MB→25MB) and by safely migrating an ERP from monolith toward services using dual writes, reconciliation, and idempotency to protect financial workflows.”

Retrieval-Augmented Generation, Prompt Engineering, OpenAI, LangChain, LangGraph, FAISS (+208 more)


Darshan Shah

Screened

Mid-Level Software Engineer specializing in cloud-native microservices and full-stack development

Holliston, MA · 6y exp

Liberating Technologies · Northeastern University

“Full-stack engineer with deep startup experience building products from scratch under ambiguous requirements. Delivered a scalable, admin-configurable notification platform (Spring Boot/Java/Kafka) supporting 50+ notification types across 3 channels for 10k+ users, cutting new notification setup to ~5 minutes. Also built a Tinder-meets-LinkedIn job-swiping app (React/TS + Node/Prisma) and has hands-on AWS production ops (ECS/EKS, RDS, CloudWatch) plus multiple third-party integrations (Stripe, QuickBooks, Twilio).”

Java, Python, TypeScript, JavaScript, Swift, React (+128 more)


Gagan Jagadish

Screened

Junior AI/ML Software Engineer specializing in automation and healthcare imaging

Charlotte, NC · 2y exp

Bridge Investment Group · University of North Carolina at Charlotte

“Backend-focused engineer who built a Python-based automation system leveraging Gemini AI and prompt-driven PDF field extraction to replace a previously manual third-party workflow. Drove stakeholder alignment around accuracy/acceptance thresholds and added production-minded safeguards like graceful failure handling and backup model contingencies.”

Python, Java, React, JavaScript, TypeScript, CSS (+67 more)


