Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

AM

Mid-level Full-Stack Developer specializing in healthcare and scalable web platforms

USA6y exp
CitiusTechUniversity of Central Florida

Software engineer experienced delivering customer-facing, real-time industrial monitoring dashboards (motors/shafts/turbines) by partnering directly with end users to refine charts, alerts, and performance. Strong in API/platform integrations and production troubleshooting—uses feature flags, logging, validation/mapping, containerization, and performance testing to keep systems stable while iterating quickly.

View profile
BP

Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems

Fort Worth, Texas8y exp
Ingram MicroUniversity of North Texas

LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.

View profile
JP

Jhansi Priya

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG pipelines, and agentic workflows

Remote, null6y exp
fundae software IncUniversity of Dayton

Applied AI/ML engineer with hands-on production experience building a RAG-based AI assistant for pharmaceutical maintenance troubleshooting using LangChain + FAISS/Pinecone, including a custom normalization layer to handle inconsistent terminology and duplicate document revisions. Also built Airflow-orchestrated pipelines for document ingestion/embeddings and predictive maintenance workflows (SCADA ETL, drift-based retraining), and partnered closely with production supervisors/quality engineers via Power BI dashboards and real-time alerts.

View profile
AB

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

Remote4y exp
KGS Technology GroupStevens Institute of Technology

LLM/RAG engineer who has built and shipped production assistants, including a RAG-based teaching assistant (Marvel AI) using LangChain/LlamaIndex/ChromaDB with OpenAI embeddings and Redis vector search, achieving ~30% accuracy gains and ~35% latency reduction. Also deployed FastAPI services on Google Cloud Run with observability and prompt-level monitoring, and partnered with non-technical ops stakeholders to deliver an internal policy-document RAG assistant.

View profile
RG

Ravi Gupta

Screened

Mid-level Software Engineer specializing in Generative AI automation and secure platforms

Santa Clara, CA4y exp
ExafortUC Santa Cruz

Backend/security-focused engineer from VeroTX who built an IdP service (Spring Boot + MongoDB on GCP) for an AI workflow platform and drove major latency improvements via caching and query/index optimization. Also shipped an AI loan-processing agent using LangChain/LangGraph, owning the document ingestion + vector database layer and designing a reliable multi-step workflow with retries, monitoring, and human-in-the-loop safeguards.

View profile
Harsh Chauhan - Junior AI Engineer specializing in Generative AI, RAG, and NLP in Remote, US

Harsh Chauhan

Screened

Junior AI Engineer specializing in Generative AI, RAG, and NLP

Remote, US3y exp
TickerIndiana University Bloomington

AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.

View profile
Sonam Chhatani - Mid-level AI Engineer specializing in causal inference and LLM research in New York, USA

Mid-level AI Engineer specializing in causal inference and LLM research

New York, USA8y exp
Binghamton UniversityBinghamton University

LLM engineer who has deployed a production system combining LLMs with causal inference (DoWhy) to enable counterfactual “what-if” analysis for experimental research, including a robust variable-mapping/validation layer to reduce hallucinations. Also partnered with non-technical operations leadership at Irriion Technologies to deliver an AI-assisted onboarding workflow that cut onboarding time by 50% and reduced manual errors by ~40%.

View profile
Moore Macauley - Intern Backend Developer specializing in AI, multi-agent systems, and computer vision

Intern Backend Developer specializing in AI, multi-agent systems, and computer vision

0y exp
True Harmony AIUC Santa Cruz

Backend-focused Python engineer who built core systems for an AI beauty-advice product: converting facial-recognition landmarks into usable facial measurements and dynamically shaping chatbot context for personalized guidance. Also worked on high-volume data ingestion at AINVESTgroup, improving agent context selection via a RAG database when upstream tags were unreliable, and has strong Git/GitOps + automated testing practices from rapid-deadline delivery environments.

View profile
Jai Vilatkar - Junior AI/ML Developer specializing in GenAI, LLM agents, and RAG systems in Pune, India

Jai Vilatkar

Screened

Junior AI/ML Developer specializing in GenAI, LLM agents, and RAG systems

Pune, India2y exp
NexaByte TechnologiesVellore Institute of Technology

Built and shipped an agentic RAG chatbot module for NexaCLM to answer questions across large volumes of contracts while minimizing hallucinations and incorrect legal interpretations. Implemented routing between vector retrieval and ReAct-style agent retrieval plus an automated grading/validation layer (cosine-similarity thresholds, retries) and deployed via GitHub Actions to Azure Container Apps, partnering closely with legal stakeholders to define risk/clause-focused objectives.

View profile
Anudeep Reddy Veerla - Mid-Level Software Engineer specializing in cloud-native microservices in Boston, MA

Mid-Level Software Engineer specializing in cloud-native microservices

Boston, MA3y exp
Tech MahindraUniversity of the Potomac

Built and shipped both a solo real-time multiplayer Spades game (TypeScript monorepo with shared client/server engine) and a production internal LLM-powered document Q&A tool for a SaaS company. Demonstrates strong RAG pipeline design (Pinecone + embeddings + reranking), rigorous eval/regression practices, and pragmatic data ingestion/observability work across Confluence, Notion, and messy PDFs/OCR—backed by clear metric improvements (P@1 61%→78%, escalations 40%→22%).

View profile
Vishesh Kumar - Intern Software & AI Engineer specializing in distributed systems and LLM applications in Palo Alto, CA

Vishesh Kumar

Screened

Intern Software & AI Engineer specializing in distributed systems and LLM applications

Palo Alto, CA1y exp
AmpUpStony Brook University

Stony Brook Fall 2024 capstone contributor who built a ROS2-based warehouse mobile robot prototype, owning perception and SLAM integration end-to-end. Strong in real-time robotics optimization on Jetson Orin (TensorRT/CUDA, ROS2 tracing/Nsight) and in distributed ROS2 communications (DDS discovery/QoS, MAVLink-to-ROS2 bridging), with a full simulation/testing/deployment toolchain (Gazebo, CI tests, Docker/K3s).

View profile
Mohammed Mudassir Hussain Shaikh - Entry-Level Full-Stack Software Engineer specializing in serverless AWS and AI applications in Tempe, AZ

Entry-Level Full-Stack Software Engineer specializing in serverless AWS and AI applications

Tempe, AZ1y exp
Indian ServersArizona State University

Built and deployed serverless AWS applications (Lambda/S3/RDS Proxy) including a NASA L’Space React + Python data analysis tool, focusing on performance for large datasets. Demonstrates strong cloud troubleshooting across compute and networking (CloudWatch-driven diagnosis, EC2 scaling, security group fixes) and a user-driven iteration loop that improved product usability with dynamic filtering and interactive UI.

View profile
Hari Krishna Kona - Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP in Boston, MA

Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP

Boston, MA3y exp
G-PLindsey Wilson College

LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.

View profile
Srinivasan Gomadam Ramesh - Mid-level AI/Data Engineer specializing in agentic AI and data platforms in Redmond, WA

Mid-level AI/Data Engineer specializing in agentic AI and data platforms

Redmond, WA7y exp
Quadrant TechnologiesUniversity of Texas at Dallas

AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.

View profile
SM

Junior Software Engineer specializing in AI platforms and backend systems

Boston, MA1y exp
Humanitarians.AINortheastern University

Built and shipped AI products at Humanitarians AI, including a full-stack multi-agent platform that consolidated six faculty AI tools into one interface and achieved 100+ user adoption, 70% less workflow switching, and a 6x latency improvement. Also designed a grounded document parser using FAISS and structured LLM outputs that reduced hallucinations by 60%, showing strong depth in both product-minded engineering and production AI systems.

View profile
DK

Mid Software Engineer specializing in backend distributed systems and AI/RAG platforms

2y exp
Compusoft Integrated SolutionsArizona State University

Full-stack engineer with hands-on ownership of a production AI knowledge assistant used by 10,000+ daily users. Combines React/Next.js frontend work with FastAPI, AWS serverless, and RAG architecture using GPT-4, LangChain, and Pinecone, with measurable impact on relevance, latency, uptime, and support deflection.

View profile
YS

Mid-Level Software Engineer specializing in backend, cloud, and scalable APIs

Remote, United States4y exp
FILMIC TECHNOLOGIESUniversity at Buffalo

Backend Python engineer who has built an LLM agentic tutoring/assignment helper with a custom pipeline for parsing visually complex textbooks (integrating AlibabaResearch VGT and implementing missing preprocessing from the paper), improving RAG grounding with ~90% cleaner extracted text. Also led major platform scaling work by refactoring monolithic image processing into Celery-based async microservices on AWS (GPU/CUDA + S3), and implemented Kafka streaming for payment webhooks with strict ordering, idempotency, and multi-zone fault tolerance.

View profile
PT

Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps

New York, USA2y exp
University at BuffaloUniversity at Buffalo

Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.

View profile
AN

Anar Nurizada

Screened

Mid-level Robotics Engineer specializing in simulation-to-real ML control

Brooklyn, NY5y exp
DL-RLStony Brook University

Robotics/ML engineer who benchmarks and adapts open-source robot action models, building synthetic datasets in Isaac Sim and modifying vendor code to scale training across multiple GPUs. Also built a production-style computer vision pipeline at Zortag—training a tiny YOLO-based classifier for fake-vs-real label detection and deploying it in a real-time iOS app with additional display/spoof detection.

View profile
GA

Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems

USA4y exp
CitiusTechNorthwest Missouri State University

Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.

View profile
Sharanya Rao - Mid-level Backend & Blockchain Engineer specializing in Cosmos SDK and EVM in Remote, USA

Sharanya Rao

Screened

Mid-level Backend & Blockchain Engineer specializing in Cosmos SDK and EVM

Remote, USA4y exp
Fair OrganizationYeshiva University

Built and productionized an LLM+RAG lending assistant on AWS to help loan officers quickly answer questions from credit policies and prior decisions, tackling hallucinations with retrieval-only responses and a no-context fallback. Also automated end-to-end ETL and model retraining/deployment using Apache Airflow, and has experience translating clinical stakeholder needs (doctors/care managers) into ML features, metrics, and dashboards.

View profile
Arjun Shrestha - Intern AI/GenAI Engineer specializing in NLP, RAG, and Snowflake Cortex in Columbus, Ohio

Intern AI/GenAI Engineer specializing in NLP, RAG, and Snowflake Cortex

Columbus, Ohio1y exp
VertivLamar University

Built and deployed a production AI invention/patent review platform that compares invention submissions against patent rules to provide instant feedback, reportedly cutting legal team review time by ~80%. Learned Snowflake Cortex LLMs and production deployment (Docker + AWS) on the job, and validated system quality through human-in-the-loop testing with experienced legal stakeholders.

View profile
Satish Kumar Reddy - Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps in Remote, NJ

Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps

Remote, NJ5y exp
Tungsten AutomationPace University

Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.

View profile

Need someone specific?

AI Search