Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

Retrieval-Augmented Generation Python Docker SQL AWS CI/CD

Aditya Mustyala

Screened

Mid-level Full-Stack Developer specializing in healthcare and scalable web platforms

USA6y exp

CitiusTechUniversity of Central Florida

“Software engineer experienced delivering customer-facing, real-time industrial monitoring dashboards (motors/shafts/turbines) by partnering directly with end users to refine charts, alerts, and performance. Strong in API/platform integrations and production troubleshooting—uses feature flags, logging, validation/mapping, containerization, and performance testing to keep systems stable while iterating quickly.”

Agile Algorithms Angular API Design AWS AWS Lambda+152

View profile

Bharat Potluri

Screened

Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems

Fort Worth, Texas8y exp

Ingram MicroUniversity of North Texas

“LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.”

Python R SQL Java C#HTML+134

View profile

Jhansi Priya

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and agentic workflows

Remote, null6y exp

fundae software IncUniversity of Dayton

“Applied AI/ML engineer with hands-on production experience building a RAG-based AI assistant for pharmaceutical maintenance troubleshooting using LangChain + FAISS/Pinecone, including a custom normalization layer to handle inconsistent terminology and duplicate document revisions. Also built Airflow-orchestrated pipelines for document ingestion/embeddings and predictive maintenance workflows (SCADA ETL, drift-based retraining), and partnered closely with production supervisors/quality engineers via Power BI dashboards and real-time alerts.”

Agentic AI Agile Apache Kafka Apache Spark AWS AWS Glue+129

View profile

Akshara Bhukya

Screened

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

Remote4y exp

KGS Technology GroupStevens Institute of Technology

“LLM/RAG engineer who has built and shipped production assistants, including a RAG-based teaching assistant (Marvel AI) using LangChain/LlamaIndex/ChromaDB with OpenAI embeddings and Redis vector search, achieving ~30% accuracy gains and ~35% latency reduction. Also deployed FastAPI services on Google Cloud Run with observability and prompt-level monitoring, and partnered with non-technical ops stakeholders to deliver an internal policy-document RAG assistant.”

Python R C++SQL Scikit-learn Pandas+112

View profile

Ravi Gupta

Screened

Mid-level Software Engineer specializing in Generative AI automation and secure platforms

Santa Clara, CA4y exp

ExafortUC Santa Cruz

“Backend/security-focused engineer from VeroTX who built an IdP service (Spring Boot + MongoDB on GCP) for an AI workflow platform and drove major latency improvements via caching and query/index optimization. Also shipped an AI loan-processing agent using LangChain/LangGraph, owning the document ingestion + vector database layer and designing a reliable multi-step workflow with retries, monitoring, and human-in-the-loop safeguards.”

Python JavaScript TypeScript SQL Java C+++65

View profile

Harsh Chauhan

Screened

Junior AI Engineer specializing in Generative AI, RAG, and NLP

Remote, US3y exp

TickerIndiana University Bloomington

“AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.”

A/B Testing API Integration AWS CI/CD C C+++120

View profile

Sonam Chhatani

Screened

Mid-level AI Engineer specializing in causal inference and LLM research

New York, USA8y exp

Binghamton UniversityBinghamton University

“LLM engineer who has deployed a production system combining LLMs with causal inference (DoWhy) to enable counterfactual “what-if” analysis for experimental research, including a robust variable-mapping/validation layer to reduce hallucinations. Also partnered with non-technical operations leadership at Irriion Technologies to deliver an AI-assisted onboarding workflow that cut onboarding time by 50% and reduced manual errors by ~40%.”

Python SQL C Java PyTorch Scikit-learn+87

View profile

Moore Macauley

Screened

Intern Backend Developer specializing in AI, multi-agent systems, and computer vision

0y exp

True Harmony AIUC Santa Cruz

“Backend-focused Python engineer who built core systems for an AI beauty-advice product: converting facial-recognition landmarks into usable facial measurements and dynamically shaping chatbot context for personalized guidance. Also worked on high-volume data ingestion at AINVESTgroup, improving agent context selection via a RAG database when upstream tags were unreliable, and has strong Git/GitOps + automated testing practices from rapid-deadline delivery environments.”

Agile API Development C C#C++CSS+68

View profile

Jai Vilatkar

Screened

Junior AI/ML Developer specializing in GenAI, LLM agents, and RAG systems

Pune, India2y exp

NexaByte TechnologiesVellore Institute of Technology

“Built and shipped an agentic RAG chatbot module for NexaCLM to answer questions across large volumes of contracts while minimizing hallucinations and incorrect legal interpretations. Implemented routing between vector retrieval and ReAct-style agent retrieval plus an automated grading/validation layer (cosine-similarity thresholds, retries) and deployed via GitHub Actions to Azure Container Apps, partnering closely with legal stakeholders to define risk/clause-focused objectives.”

Agile CI/CD Data Analysis Data Cleaning Data Science Deep Learning+101

View profile

Anudeep Reddy Veerla

Screened

Mid-Level Software Engineer specializing in cloud-native microservices

Boston, MA3y exp

Tech MahindraUniversity of the Potomac

“Built and shipped both a solo real-time multiplayer Spades game (TypeScript monorepo with shared client/server engine) and a production internal LLM-powered document Q&A tool for a SaaS company. Demonstrates strong RAG pipeline design (Pinecone + embeddings + reranking), rigorous eval/regression practices, and pragmatic data ingestion/observability work across Confluence, Notion, and messy PDFs/OCR—backed by clear metric improvements (P@1 61%→78%, escalations 40%→22%).”

.NET Agile Ansible Angular Bash Bootstrap+151

View profile

Vishesh Kumar

Screened

Intern Software & AI Engineer specializing in distributed systems and LLM applications

Palo Alto, CA1y exp

AmpUpStony Brook University

“Stony Brook Fall 2024 capstone contributor who built a ROS2-based warehouse mobile robot prototype, owning perception and SLAM integration end-to-end. Strong in real-time robotics optimization on Jetson Orin (TensorRT/CUDA, ROS2 tracing/Nsight) and in distributed ROS2 communications (DDS discovery/QoS, MAVLink-to-ROS2 bridging), with a full simulation/testing/deployment toolchain (Gazebo, CI tests, Docker/K3s).”

C++C#Python Java C JavaScript+134

View profile

Mohammed Mudassir Hussain Shaikh

Screened

Entry-Level Full-Stack Software Engineer specializing in serverless AWS and AI applications

Tempe, AZ1y exp

Indian ServersArizona State University

“Built and deployed serverless AWS applications (Lambda/S3/RDS Proxy) including a NASA L’Space React + Python data analysis tool, focusing on performance for large datasets. Demonstrates strong cloud troubleshooting across compute and networking (CloudWatch-driven diagnosis, EC2 scaling, security group fixes) and a user-driven iteration loop that improved product usability with dynamic filtering and interactive UI.”

Agile AWS AWS Lambda CI/CD Cloud Computing Data Ingestion+94

View profile

Hari Krishna Kona

Screened

Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP

Boston, MA3y exp

G-PLindsey Wilson College

“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”

Machine Learning Deep Learning Generative AI Large Language Models (LLMs)Computer Vision Semantic Search+111

View profile

Srinivasan Gomadam Ramesh

Screened

Mid-level AI/Data Engineer specializing in agentic AI and data platforms

Redmond, WA7y exp

Quadrant TechnologiesUniversity of Texas at Dallas

“AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.”

Agentic AI LangGraph LangChain LlamaIndex ReAct Regression Testing+216

View profile

Sakshi Mohan Tapkir

Screened

Junior Software Engineer specializing in AI platforms and backend systems

Boston, MA1y exp

Humanitarians.AINortheastern University

“Built and shipped AI products at Humanitarians AI, including a full-stack multi-agent platform that consolidated six faculty AI tools into one interface and achieved 100+ user adoption, 70% less workflow switching, and a 6x latency improvement. Also designed a grounded document parser using FAISS and structured LLM outputs that reduced hallucinations by 60%, showing strong depth in both product-minded engineering and production AI systems.”

Python LangChain Spring Boot AWS Data Structures Algorithms+121

View profile

Dhanush Kukatla

Screened

Mid Software Engineer specializing in backend distributed systems and AI/RAG platforms

2y exp

Compusoft Integrated SolutionsArizona State University

“Full-stack engineer with hands-on ownership of a production AI knowledge assistant used by 10,000+ daily users. Combines React/Next.js frontend work with FastAPI, AWS serverless, and RAG architecture using GPT-4, LangChain, and Pinecone, with measurable impact on relevance, latency, uptime, and support deflection.”

Java C#Python TypeScript JavaScript SQL+148

View profile

Yash Sandansing

Screened

Mid-Level Software Engineer specializing in backend, cloud, and scalable APIs

Remote, United States4y exp

FILMIC TECHNOLOGIESUniversity at Buffalo

“Backend Python engineer who has built an LLM agentic tutoring/assignment helper with a custom pipeline for parsing visually complex textbooks (integrating AlibabaResearch VGT and implementing missing preprocessing from the paper), improving RAG grounding with ~90% cleaner extracted text. Also led major platform scaling work by refactoring monolithic image processing into Celery-based async microservices on AWS (GPU/CUDA + S3), and implemented Kafka streaming for payment webhooks with strict ordering, idempotency, and multi-zone fault tolerance.”

Python Java C++TypeScript Node.js Django+101

View profile

Phani Tarun Munukuntla

Screened

Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps

New York, USA2y exp

University at BuffaloUniversity at Buffalo

“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”

Python PySpark Apache Airflow Java JavaScript SQL+121

View profile

Anar Nurizada

Screened

Mid-level Robotics Engineer specializing in simulation-to-real ML control

Brooklyn, NY5y exp

DL-RLStony Brook University

“Robotics/ML engineer who benchmarks and adapts open-source robot action models, building synthetic datasets in Isaac Sim and modifying vendor code to scale training across multiple GPUs. Also built a production-style computer vision pipeline at Zortag—training a tiny YOLO-based classifier for fake-vs-real label detection and deploying it in a real-time iOS app with additional display/spoof detection.”

Machine Learning Deep Learning Reinforcement Learning Transformers Computer Vision PyTorch+157

View profile

Gopichand Amaraneni

Screened

Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems

USA4y exp

CitiusTechNorthwest Missouri State University

“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”

Python NumPy Pandas JSON SQL PostgreSQL+151

View profile

Sharanya Rao

Screened

Mid-level Backend & Blockchain Engineer specializing in Cosmos SDK and EVM

Remote, USA4y exp

Fair OrganizationYeshiva University

“Built and productionized an LLM+RAG lending assistant on AWS to help loan officers quickly answer questions from credit policies and prior decisions, tackling hallucinations with retrieval-only responses and a no-context fallback. Also automated end-to-end ETL and model retraining/deployment using Apache Airflow, and has experience translating clinical stakeholder needs (doctors/care managers) into ML features, metrics, and dashboards.”

Go Python Java JavaScript TypeScript SQL+113

View profile

Arjun Shrestha

Screened

Intern AI/GenAI Engineer specializing in NLP, RAG, and Snowflake Cortex

Columbus, Ohio1y exp

VertivLamar University

“Built and deployed a production AI invention/patent review platform that compares invention submissions against patent rules to provide instant feedback, reportedly cutting legal team review time by ~80%. Learned Snowflake Cortex LLMs and production deployment (Docker + AWS) on the job, and validated system quality through human-in-the-loop testing with experienced legal stakeholders.”

Amazon ECS AWS BERT Data preprocessing Docker FastAPI+70

View profile

Satish Kumar Reddy

Screened

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

Remote, NJ5y exp

Tungsten AutomationPace University

“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”

Python R Java C++SQL PostgreSQL+142

View profile

Software Engineers Machine Learning Engineers Data Scientists AI Engineers Research Assistants Software Developers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?