Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

SA

Mid-level Software Engineer specializing in cloud-native microservices and AI-powered web applications

Remote, USA | 5y exp
BigCommerce | Arizona State University

Backend engineer who built and owned an AI-powered SMS survey platform for a nonprofit serving at-risk communities (internet-limited users), using Cloudflare Workers + Twilio and a state-machine survey engine. Scaled it to ~10k active users with near-zero downtime, added English/Spanish support, and iteratively improved LLM behavior (Claude 3.7 Sonnet) to handle nuanced, real-world SMS responses reliably.

SU

Intern Software Engineer specializing in AWS cloud architecture and GenAI systems

Seattle, WA | 2y exp
Amazon Web Services | San José State University

AWS Solutions Architect intern who advised customers on securing a multi-tenant LLM-based SaaS, including isolation strategy tradeoffs and production guardrails against prompt injection. Has experience investigating a prompt-injection incident using logs/traces and TTP-style documentation, and designing scalable SDK/agent integrations via asynchronous worker architecture with prompt versioning.

IS

Irfan Shaik

Screened

Mid-level AI Software Engineer specializing in risk and fraud detection

Los Angeles, California | 4y exp
Visa | George Mason University

AI/software engineer with experience at Visa building a real-time transaction fraud/risk scoring microservice in the card authorization path (Python, Kafka, Kubernetes on AWS) with strict 120–150ms latency constraints and reason-code outputs for downstream decisioning. Owns ML backend end-to-end (data/feature engineering, model training, deployment) and has demonstrated production reliability work including latency spike mitigation, SLO-based observability, drift monitoring, and safe fallbacks to rule-based decisions.

YW

Yufan Wei

Screened

Intern AI Engineer specializing in LLM agents, RAG, and applied biostatistics

Beijing, China | 0y exp
Siemens | Emory University

Siemens AI engineer who shipped production multi-agent LLM systems across cybersecurity and sustainability, including a vulnerability automation agent that cut manual work 70%. Deep in orchestration (LangGraph supervisor-worker state machines), reliability engineering (async fault tolerance, retries, spike handling), and rigorous evaluation (offline benchmarks, LLM-as-a-Judge improving label agreement 28.9%) with measurable production guardrails.

SC

Mid-Level Software Engineer specializing in LLM-powered developer tools

Fairfax, VA | 3y exp
Active LLM Documentation, DevX | George Mason University

Built and owned "Cortex," an AI agent that helps users understand large GitHub repositories by mapping architecture and relationships between files/folders in minutes. Implemented an agentic, multi-stage prompt decomposition approach and validated it across open-source repos, while also doing legacy service modernization work involving dependency upgrades and refactors.

RH

Rahul Hatkar

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps

San Francisco, CA | 6y exp
Scale AI | Webster University

AI/ML engineer who has shipped production AI systems end-to-end, including an automated multi-channel (Gmail/WhatsApp/voice) candidate interviewing workflow and an enterprise RAG knowledge search platform. Demonstrates strong production rigor (monitoring, A/B tests, guardrails, schema validation, shadow testing) with quantified impact: ~60–70% reduction in interview evaluation time and ~20–30% relevance gains in RAG retrieval.

DM

Mid-level Generative AI Engineer specializing in decision intelligence and RAG for regulated enterprises

5y exp
JPMorgan Chase | Saint Louis University

Healthcare GenAI engineer who built a HIPAA-compliant, auditable RAG-based claims decision support system at Molina Healthcare, processing 3M claims and delivering major impact (48% faster manual reviews, 43% higher decision accuracy). Deep hands-on experience with LangChain orchestration, vector search (ChromaDB/FAISS), embedding fine-tuning, and safety controls (confidence scoring, rule validation, human-in-the-loop escalation) for clinical workflows.

AP

Mid-level Machine Learning Engineer specializing in fraud detection and LLM applications

Charlotte, NC | 5y exp
Bank of America | University of North Carolina at Charlotte

Unreal Engine UI engineer focused on scalable, production-ready UI architecture (C++/Slate/UMG/CommonUI) with strong designer enablement via decoupled, interface-driven patterns and MVVM. Demonstrated measurable performance wins: replaced 200+ per-frame Blueprint bindings to cut UI prepass/paint from 4.2ms to 0.5ms and reduced VRAM by ~120MB using texture streaming proxies.

Monish Sri Sai Devineni

Mid-level Machine Learning Engineer specializing in financial AI, NLP, and MLOps

Boca Raton, FL | 5y exp
Morgan Stanley | Florida Atlantic University

AI/ML engineer with experience at Accenture and Morgan Stanley, building production LLM systems (GPT-3 summarization) and finance-focused ML models (credit risk and trading anomaly detection). Combines MLOps depth (Docker/Kubernetes, AWS SageMaker/Glue/Lambda, MLflow, A/B testing, drift monitoring) with practical domain adaptation techniques like few-shot prompting and RAG/knowledge-base integration.

Junhui Huang

Screened

Intern Machine Learning Engineer specializing in LLMs, MLOps, and NLP

Providence, RI | 1y exp
Harvard University | Brown University

Built and deployed a production LLM-driven Dungeons & Dragons game where the model acts as a dungeon master, adding a structured combat system and a macro-state tree to ensure campaigns converge to a clear ending. Fine-tuned Gemini 2.5 Flash on Vertex AI and deployed on GCP with Kubernetes, using RAG over DnD rules/spells plus multi-agent orchestration (intent-based routing between narrative and combat agents) to reduce hallucinations and improve reliability.

Saksham Khatwani

Mid-level Software Engineer specializing in NLP and search systems

Aurora, United States | 3y exp
University of Colorado Anschutz Medical Campus | University of Colorado Boulder

Built an AI journaling app at HackCU 2025 featuring a speaking AI avatar with long-term memory via RAG (ChromaDB) and low-latency microservices coordinated through Kafka, including deployment under AMD/non-CUDA constraints using a quantized Llama 8B model. Also has Goldman Sachs experience deploying a Trade UI on Kubernetes with CI/CD rollback automation, plus a healthcare AI internship at CU Anschutz collaborating closely with physicians on diagnostic reasoning and dataset annotation.

Harshavardhan Reddy

Mid-level AI/ML Data Scientist specializing in NLP, computer vision, and risk analytics

Albany, NY | 5y exp
Capital One | Pace University

ML/AI engineer with Capital One experience building production-grade customer segmentation and fraud detection systems combining NLP (transformers) and anomaly detection. Strong MLOps and orchestration background (PySpark ETL, MLflow, Airflow, Docker/Kubernetes, Azure ML) with real-time monitoring/alerting and performance optimizations like quantization and caching, plus proven ability to deliver business-facing insights through Power BI/Tableau for marketing stakeholders.

Akshit Modi

Screened

Mid-level AI/ML Engineer specializing in healthcare NLP and MLOps

Remote, USA | 5y exp
Tempus | Arizona State University

Healthcare/clinical ML practitioner who built and productionized ClinicalBERT-based pipelines to extract and standardize oncology EHR data, improving downstream model F1 from 0.81 to 0.92 while controlling training cost via LoRA/QLoRA. Experienced orchestrating real-time AWS ETL/ML workflows (Glue, Lambda, SageMaker) and partnering with clinicians using SHAP-based interpretability, contributing to an 18% reduction in readmissions and full adoption.

Srushti Milind Jamsandekar

Mid-Level Software Developer specializing in backend, cloud, and GenAI

CA, USA | 5y exp
Azul Arc | California State University

Full-stack engineer with fintech and AI feature experience who shipped an AI-powered project summary module in Next.js (App Router + TypeScript) with secure server-side fetching and route handlers to a FastAPI backend, then owned monitoring and performance fixes in production. Demonstrated measurable UX wins (30% faster dashboard loads) and strong backend fundamentals (Postgres indexing/EXPLAIN ANALYZE, SQS-orchestrated idempotent reconciliation workflows with DLQs and retries).

PM

Mid-level AI/ML Engineer specializing in LLM agents and workflow automation

4y exp
UnitedHealth Group | Kansas State University

AI/LLM engineer with strong healthcare domain depth who has shipped production-grade agents for care coordination and clinical workflow automation. Stands out for combining Knowledge Graph RAG, LangGraph orchestration, and rigorous eval/guardrail systems to improve reliability in high-stakes environments, with measurable gains in review time, hallucination reduction, latency, and clinician adoption.

MB

Mounya Bonuga

Screened

Mid-level AI/ML Engineer specializing in multimodal AI and recommendation systems

USA | 4y exp
Goldman Sachs | University of Central Oklahoma

ML/AI engineer with hands-on ownership of a production LLM/RAG system at Goldman Sachs, focused on workflow automation and large-scale document search for operational teams. They combine strong MLOps and backend engineering skills with practical GenAI evaluation and safety practices, and cite measurable impact including 22% better task guidance accuracy and sub-second search across millions of records.

PavanKumar Varkala

Mid-level Full-Stack Engineer specializing in AI-driven web applications

Jersey City, NJ | 4y exp
Adobe | Pace University

Built and shipped an AI-driven operational workflow platform at Adobe that handled 12k+ monthly requests using React, Node.js, TypeScript, OpenAI APIs, PostgreSQL, Redis, and RAG. Stands out for combining full-stack product ownership with production-grade LLM architecture, evals, and human-in-the-loop controls, delivering measurable gains including 38% higher accuracy and 40% less manual triage.

LS

Mid-level Software Engineer specializing in cloud platforms, SRE, and ML-powered engineering tools

Austin, TX | 5y exp
Intel | University of Illinois Chicago

Platform-focused engineer/technical program leader working in silicon/wafer validation environments, with hands-on experience securing access to sensitive test results and engineering tooling. Has implemented RBAC/least-privilege controls with Azure Entra ID, Key Vault, PAM and integrated Checkmarx into dev workflows, while also deploying ML services on AKS using Bicep/Helm/Docker and Azure DevOps CI/CD with strong monitoring and incident response practices.

ST

Mid-Level AI Engineer specializing in NLP, computer vision, and LLM applications

Austin, TX | 3y exp
BookedBy | University of Maryland, Baltimore County

LLM/RAG practitioner who productionized an LLM-driven customer communication and transaction understanding system at PayPal, emphasizing privacy/compliance guardrails and large-scale data normalization. Experienced in real-time debugging of hallucinations via retrieval pipeline tuning and in leading hands-on developer workshops and sales-aligned POCs to drive adoption.

YL

Yupeng Lu

Screened

Mid-level Backend & Full-Stack Engineer specializing in distributed systems

Beijing, China | 3y exp
Huawei | Boston University

Built a production internal RAG-based Q&A assistant at Huawei for ~4,000 engineers over a 12M-document Elasticsearch corpus, replacing link-only search with synthesized answers and achieving 87% user acceptance while keeping hallucinations under 0.4%. Pairs rigorous offline benchmarking (RAGAS, PR-gated F1 improvements) with human A/B testing and OpenTelemetry-based production monitoring, and also has strong Kubernetes/SRE experience orchestrating 50+ gRPC services with major MTTR and pager-fatigue reductions.

AB

Anuj Bubna

Screened

Senior DevOps/SRE Engineer specializing in cloud automation, reliability, and data pipelines

10y exp
Intuit | University of Texas at Dallas

Hands-on technical professional experienced in taking LLM/AI-adjacent integrations from prototype to production, using customer observation to refine UX and uncover edge cases. Diagnoses workflow issues in real time using logs and Sankey-style workflow analysis, and communicates fixes with clear short/long-term plans plus proactive alerting. Also partners cross-functionally to drive adoption and cost savings, including a POC around IBM Sterling Integrator that reduced licensing costs by $30K/year.

AD

Junior Full-Stack Software Engineer specializing in AI data systems

New York, NY | 1y exp
SEPAL AI | NYU

Full-stack engineer with strong DevOps/AWS production experience who builds and operates multi-agent AI systems end-to-end (Streamlit/Python through Docker/Kubernetes and ECS/Fargate). Has delivered measurable outcomes: sub-2s latency and ~92% routing accuracy for an AI wellness assistant, shipped an AI-for-BI prototype in under 6 weeks cutting analysis time ~40%, and improved pipeline iteration speed ~35% via modularization and CI/regression checks.

OL

Olivia Liau

Screened

Junior Data Scientist specializing in ML research, NLP, and healthcare analytics

Los Angeles, CA | 2y exp
Worcester Polytechnic Institute | USC

Completed an Amazon externship building a GPT-4 + RAG pipeline to summarize themes from hundreds of employee reviews for workforce analytics aimed at improving warehouse retention. Emphasizes production-readiness through labeled-data evaluation, source attribution for explainability, human-in-the-loop review, and rigorous data cleaning/observability to debug real-world LLM workflow issues.

MW

Senior Full-Stack AI Engineer specializing in Azure OpenAI and RAG/GraphRAG systems

Eagle Mountain, UT | 24y exp
GoEngineer | Brigham Young University

Built GoEngineer’s first production AI systems, including an end-to-end RAG pipeline for SolidWorks technical support using Azure Blob Storage, Azure AI Search, and Azure OpenAI, plus an AI summarization feature adopted by sales/customer success. Strong in productionizing LLM workflows with evaluation harnesses (golden sets, LLM-as-judge, red teaming, shadow deploys) and Azure infrastructure integrations (Redis, Service Bus, App Insights), and has also implemented a custom MCP server for agentic monitoring.

