Vetted Retrieval-Augmented Generation (RAG) Professionals

Pre-screened and vetted.

Sri Charan Reddy Mallu - Mid-Level Software Development Engineer specializing in GenAI and full-stack cloud systems in Redwood City, CA

Mid-Level Software Development Engineer specializing in GenAI and full-stack cloud systems

Redwood City, CA5y exp
C3 AISan José State University

Full-stack engineer with experience across Magna, C3.ai, and Amazon, building GenAI-enabled products and finance transaction systems. Has shipped Next.js (App Router) + TypeScript features backed by Go/Python RAG pipelines, and emphasizes production quality via load testing, Selenium regression coverage, LLM-aware integration testing, and Azure observability. Also built LangGraph-orchestrated multi-step content generation workflows with robust retry/idempotency strategies.

View profile
Jingyao Chen - Junior Backend/Platform Engineer specializing in AI microservices and cloud-native systems in Pittsburgh, PA

Jingyao Chen

Screened

Junior Backend/Platform Engineer specializing in AI microservices and cloud-native systems

Pittsburgh, PA2y exp
MeowyAICarnegie Mellon University

Cofounder at MeowyAI who shipped a production multimodal (vision/voice/text) AI task manager using Gemini, tackling real-world issues like hallucinations, tool-calling safety, and RAG-based preference memory. Also built a production multi-agent RAG system orchestrated with LangGraph (and contributes to LangChain), with strong emphasis on latency optimization, observability (OpenTelemetry), and rigorous testing/evaluation including A/B tests and adversarial prompting.

View profile
RS

Rahul Singh

Screened

Junior Software Engineer specializing in AI and healthcare automation

San Francisco, CA2y exp
Vali HealthUC Berkeley

Seed-stage startup engineer owning features end-to-end across full-stack development, DevOps, rollout, and post-launch maintenance. Built data ingestion and evaluation workflows for an LLM data-quality platform using Next.js, MongoDB, Postgres, and GCP Pub/Sub, with a strong focus on reliability, caching, and pragmatic performance improvements.

View profile
HR

Mid-level Data Analytics professional specializing in BI, data engineering, and applied AI

California, USA6y exp
AmazonSan Jose State University

Built GenMedX, a multi-module clinical AI system for emergency department decision support spanning triage prediction, diagnosis, medication Q&A, and visit summarization. Stands out for combining medical LLM fine-tuning, RAG, and rigorous evaluation/monitoring to drive a major triage recall improvement from 38.5% to 76.6%, with a strong focus on safety, edge-case detection, and production reliability.

View profile
KI

Staff Software Engineer specializing in FinTech and AI-powered customer support

San Francisco, CA16y exp
BlockStony Brook University

Technical lead who shipped a production GPT-4-powered customer support agent for Square, serving a large fintech customer base through a React chat interface with tool-using orchestration, guardrails, and live handoff paths. Brings strong real-world experience in agent reliability, evaluation, observability, and workflow orchestration using Temporal, Sidekiq, Pinecone, Datadog, and Snowflake.

View profile
MH

Madison Heck

Screened

Staff Product Manager specializing in AI products and SaaS platforms

Bend, OR9y exp
SagewaveCornell University

AI product leader with end-to-end experience building revenue-generating products and internal AI platforms across legal tech, edtech, and neurofeedback healthcare. At Clio, they helped launch the company's first AI offerings, drove $60K MRR in two months, scaled the AI team from 15 to 45, and enabled 12 additional AI applications across the business. They also bring a strong human-centered AI perspective from building tutor matching, lesson-planning, and clinician recommendation systems.

View profile
KD

Junior ML Engineer specializing in Generative AI and LLM applications

Thousand Oaks, California3y exp
NVIDIACalifornia Lutheran University

Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.

View profile
PV

Praveen V

Screened

Mid-Level Software Engineer specializing in Generative AI and RAG systems

Remote, USA5y exp
MetaUniversity of North Carolina at Charlotte

Built a production RAG-based natural-language-to-SQL system at Global Atlantic to replace slow, expensive manual analytics ticket workflows, focusing heavily on retrieval quality and measurable evaluation (200-question ground-truth set; recall@5 improved 0.65→0.78 via semantic chunking). Also built a custom MCP-style agent orchestrator for a personal project (arxiv-ai) to improve flexibility and Langfuse-aligned observability, and has hands-on experience with LangGraph, CrewAI, and n8n.

View profile
NP

Nikhel Parkar

Screened

Executive engineering leader specializing in FinTech, data platforms, and cloud modernization

Reston, VA23y exp
Fannie MaePowerMBA Business School

Aspiring founder building an AI governance and compliance startup for the finance industry, focused on agents that monitor data lakes/lakehouses to detect security vulnerabilities, PII exposure, and governance issues in real time. Has already formed an S-corp, has not raised capital yet, and approaches idea validation through Minimum Lovable Product testing with potential clients.

View profile
Ranjani Salla - Mid-level AI/ML Engineer specializing in LLMs, FinTech, and Healthcare IT in USA

Ranjani Salla

Screened

Mid-level AI/ML Engineer specializing in LLMs, FinTech, and Healthcare IT

USA5y exp
StripeClark University

Built production GenAI systems in both healthcare and financial services, including a Verily clinical platform and an Accenture financial Q&A product. Stands out for combining advanced RAG, fine-tuning, safety evaluation, and infrastructure engineering to deliver measurable gains in engagement, groundedness, hallucination reduction, and cost efficiency.

View profile
NS

Nitin Sunda

Screened

Mid-level Software Engineer specializing in FinTech and GenAI platforms

Seattle, WA4y exp
AmazonNortheastern University

Candidate describes a development approach centered on AI-assisted coding, testing, and agent-driven workflows, including production exposure to multi-agent systems and governance-oriented logging. They appear particularly focused on combining AI speed with structured validation through unit tests, boundary tests, and edge-case monitoring.

View profile
Yashwanth J - Mid-level Software Engineer specializing in AI/ML and full-stack systems in Seattle, WA

Yashwanth J

Screened

Mid-level Software Engineer specializing in AI/ML and full-stack systems

Seattle, WA4y exp
AppleUniversity of North Texas

Engineer with Apple experience building LLM-powered internal workflow orchestration systems using Python, LangGraph, FastAPI, Redis, vector search, and Kubernetes. Stands out for a highly pragmatic, production-focused approach to agentic systems: deterministic state management, strong guardrails, observability, and human review for high-risk actions.

View profile
SB

Mid-level AI/LLM Engineer specializing in generative AI and ML systems

Remote, USA4y exp
NetflixMissouri University of Science and Technology

AI/LLM-focused engineer with hands-on experience building RAG pipelines, prompt engineering workflows, and multi-agent systems using tools like LangChain. Stands out for combining AI-assisted development with production-grade validation and for leading the architecture/orchestration of agent-based recommendation systems that improved response time, accuracy, and scalability.

View profile
OG

Or Glazer

Screened

Executive engineering leader specializing in AI, cloud, and streaming platforms

Santa Monica, CA20y exp
Wonder ProjectThe Academic College of Tel Aviv-Yaffo

Engineering leader with hands-on architecture depth who has managed offshore teams up to ~30 engineers across mobile, web, backend, and Roku. Particularly strong in applying AI pragmatically—driving code-generation and review workflows, tailoring model usage by tech stack, and building internal LLM/RAG tools—while also improving operational KPIs through automation, cost optimization, and workflow redesign in content/media environments.

View profile
AT

Antoine Tan

Screened

Senior Full-Stack Software Engineer specializing in workflow automation and healthcare AI

Remote12y exp
Rad AIUniversity of Florida

Backend/data engineer who has owned production Python APIs and high-throughput async workflows on AWS (FastAPI, Docker, ECS/EKS/Lambda) with mature reliability practices like idempotency, bounded retries, circuit breakers, and strong observability. Also built AWS Glue ETL into an S3/Redshift lakehouse and modernized legacy batch systems via parallel-run parity testing and feature-flagged migrations, including a SQL tuning win cutting a multi-minute query to under 10 seconds.

View profile
CS

Chappidi Sasi

Screened

Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference

Bay Area, CA5y exp
NVIDIAWebster University

ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.

View profile
greg farhadian - Senior Software Engineer specializing in cloud data platforms and Java microservices in Remote

Senior Software Engineer specializing in cloud data platforms and Java microservices

Remote4y exp
IBMUC Irvine

Backend/data engineer with experience building Kafka-driven real-time pipelines that support ML code deployment and downstream integrations. Currently migrating high-throughput mainframe (COBOL/assembly) processing to Java, using Spark/Databricks to preserve performance and employing rigorous A/B testing across dev/pre-prod/prod with years of historical data.

View profile
Michael Kilgore - Mid-Level Software Development Engineer specializing in AWS data pipelines and forecasting systems

Mid-Level Software Development Engineer specializing in AWS data pipelines and forecasting systems

3y exp
AmazonUniversity of Washington Bothell

Built and deployed (via an Upwork contract) an LLM-powered agent for options trading that detects large options trade events, enriches them with market/filing data (price history, earnings transcripts, insider trading), and delivers recommendations via Telegram. Implemented schema-constrained outputs (Pydantic/Google GenAI), robust orchestration, logging, and error-notification handling, plus vector-DB-based reuse of prior outputs to improve consistency.

View profile
Yash Jajoo - Senior Software Engineer specializing in AI and FinTech platforms in New York City, NY

Yash Jajoo

Screened

Senior Software Engineer specializing in AI and FinTech platforms

New York City, NY8y exp
Walter AINew York University

Built a production LLM pipeline at Walter AI that scans massive user inboxes, identifies financial newsletters, and extracts trading strategies into structured JSON for downstream paper-trading workflows. Stands out for combining agent architecture with strong production discipline—cutting scan time from 20 to 5 minutes, reducing LLM costs by 90%, and achieving 3-second P99 latency while handling messy, inconsistent email data at scale.

View profile
Richard Ferguson - Executive technology leader specializing in government, cloud, and cybersecurity in Reno, NV

Executive technology leader specializing in government, cloud, and cybersecurity

Reno, NV31y exp
Coleridge InitiativeGeorgetown University

Founder with a bootstrapped startup who navigated early hiring and scaling by leveraging contractor talent before converting key contributors to full-time employees. Active in Northern Nevada's startup ecosystem through Generator and StartupNV, and notably declined angel/VC offers to preserve equity because of strong conviction in a differentiated product and its market fit.

View profile
NH

Mid-level Full-Stack Engineer specializing in cloud-native data and enterprise platforms

USA5y exp
AmazonUniversity of Cincinnati

Software engineer with practical, day-to-day experience embedding AI into development workflows across coding, testing, code review, and AWS data pipelines. Uses tools like Claude, Cline, JUnit, Mockito, and Amazon Bedrock, and stands out for having a realistic, mature view of agent limitations, hallucinations, and the need for strong prompting and human validation.

View profile
Akhilesh Patil - Junior AI/ML Engineer specializing in FinTech and generative AI in Remote, USA

Junior AI/ML Engineer specializing in FinTech and generative AI

Remote, USA2y exp
StripeSan Jose State University

Built an end-to-end AI bug triage dashboard that combined React/TypeScript, FastAPI, Postgres, and classical ML to reduce manual engineering triage work by about 40%. Stands out for pragmatic, product-minded AI engineering: choosing interpretable models when they were sufficient, designing human-in-the-loop UX for trust, and separately building an agentic RAG project with vector search, Neo4j knowledge graphs, and reranking.

View profile
BW

Boyun Wang

Screened

Junior AI Agent Engineer specializing in regulated healthcare software

Berkeley, CA3y exp
Echelon DiagnosticsUC Berkeley

Built and deployed PIKA, an internal multi-agent platform for FDA-regulated software development, owning it from concept through production. The candidate combines strong full-stack engineering with rigorous LLM orchestration, human-in-the-loop controls, and production eval systems, delivering measurable impact: 3x more design issues caught, ~90% fewer false positives, and ~40% efficiency gains on documentation-heavy workflows.

View profile

Need someone specific?

AI Search