Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

AM

Mid-Level Software Engineer specializing in backend microservices and distributed systems

Waltham, MA · 4y exp
Dassault Systèmes · Northeastern University

Built and productionized an internal LLM-powered search tool that helps engineers find the right SolidWorks macros with plain-English queries, built on OpenAI embeddings and ChromaDB with strong logging/fallback safeguards. Experienced in diagnosing RAG/agentic workflow issues in real time and in hands-on API support, including fixing customer macros after SolidWorks version updates and driving adoption through reusable solutions and best practices.

TT

Mid-level AI/ML Engineer specializing in MLOps and LLM applications

New York, NY · 4y exp
BNY Mellon · University at Albany

BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.

NA

Mid-level Full-Stack Software Engineer specializing in AI platforms and microservices

Mooresville, NC · 6y exp
Lowe's · University of North Carolina at Charlotte

Backend engineer currently building an AWS Lambda/FastAPI inventory recommendation system using a LangChain + GPT-4 RAG pipeline and MongoDB vector search; drove major cost optimization via Redis caching (60% reduction) while sustaining 10k+ daily requests under 2s latency. Previously deployed Node.js microservices on AWS OpenShift with Jenkins/Helm at UnitedHealth Group and led a zero-downtime monolith-to-microservices migration at Verizon, including RabbitMQ-based real-time messaging with DLQs and idempotency.

KG

Senior AI Engineer specializing in Agentic AI and distributed systems

Charlotte, NC · 4y exp
UnitedHealth Group · University of North Carolina at Charlotte

LLM/agentic workflow engineer with healthcare domain experience who built a HIPAA-compliant multi-agent RAG system for clinical review automation at UnitedHealth Group, achieving 92% precision and cutting latency 40% through async orchestration and Redis semantic caching. Also has strong data engineering orchestration background (Airflow on AWS EMR with Great Expectations) and a proven clinician-in-the-loop feedback process that improved model faithfulness by 18%.

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS · 3y exp
HCA Healthcare · VNR Vignana Jyothi Institute of Engineering and Technology

Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.

HE

Mid-level AI/ML Engineer specializing in cloud data engineering and GenAI

Florida, USA · 6y exp
LexisNexis · University of South Florida

AI/LLM engineer with production experience in legal tech: built a GPT-4 + LangChain RAG summarization system at Govpanel that reduced legal case-file review time by 50%+. Previously at LexisNexis, orchestrated end-to-end Airflow data/AI pipelines processing 5M+ legal documents daily, improving ETL runtime by 35% with robust validation, monitoring, and SLAs.

VA

Mid-level Data Scientist specializing in Generative AI and NLP for financial risk

Glassboro, NJ · 4y exp
S&P Global · Rowan University

Built and shipped production generative AI/RAG assistants in regulated financial contexts (S&P Global), automating compliance-oriented Q&A over earnings reports/filings with grounded answers and citations. Experienced across the full stack—AWS-based ingestion (PySpark/Glue), vector retrieval + LangChain agents, GPT-4/Claude model selection, and production reliability (monitoring, caching, retries) plus rigorous evaluation and regression testing.

MB

Intern Machine Learning & Full-Stack Engineer specializing in computer vision and healthcare AI

India · 0y exp
Amrita Vishwa Vidyapeetham · University of Illinois Urbana-Champaign

AI/ML-focused backend engineer who shipped two production systems: PersonaPal (agentic LLM chatbot with RAG, FAISS-based retrieval, and Redis semantic caching) and CervixScan (clinical diagnostics platform with PostgreSQL data modeling and human-in-the-loop safety for low-confidence predictions). Demonstrates strong performance/reliability work (indexed vector search, caching, query optimization to ~200ms) and end-to-end ownership from orchestration design through deployment.

Hiya Kothari

Screened

Intern Full-Stack Software Engineer specializing in AI/ML and cloud

San Francisco, CA · 3y exp
Sparx Labs · UC Irvine

Built a Python-based geospatial machine learning backend for PFAS contamination risk mapping, including reproducible feature pipelines, ensemble modeling, and a FastAPI layer for visualization/analysis. Emphasizes data integrity and robustness (CRS/coverage checks, fail-fast validation) and has led safe backend refactors using feature flags, idempotent backfills, and Postgres RLS for secure, queryable results delivery.

Somil Shah

Screened

Mid-level AI/ML Engineer specializing in generative AI, RAG platforms, and LLM agents

San Francisco, CA · 4y exp
INTERACT Animal Lab · Northeastern University

AI/LLM engineer who has shipped 10+ production applications, including InvestIQ on GCP—a production-grade RAG due-diligence engine that ethically scrapes web/PDF sources, builds a ChromaDB knowledge base, and delivers analyst-style dashboards plus a citation-backed chat copilot. Deep focus on reliability (evidence-only answers, hard citations, refusal gating), retrieval tuning, and orchestration (Airflow/Cloud Composer), plus multi-agent systems (CrewAI with 7 specialized finance agents).

Daniel Berhane Araya

Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance

Fairfax, VA · 9y exp
George Mason University · George Mason University

AI/LLM engineer with published work who built FinVet, a production financial misinformation detection system using multi-pipeline RAG, confidence-based voting, and evidence-backed outputs (F1 0.85, +37% vs baseline). Also built NexusForest-MCP, a Dockerized Model Context Protocol server exposing structured global deforestation/carbon data via SQL tools for reliable LLM tool use. Previously delivered borrower risk-rating (PD) models at BMO Financial Group that were validated and integrated into an enterprise credit system through close collaboration with credit officers and portfolio managers.

Harshitha K

Screened

Mid-level Full-Stack .NET Developer specializing in cloud-native microservices

Greensboro, NC · 5y exp
Lincoln Financial · University of Bridgeport

Full-stack .NET engineer with cloud and applied GenAI experience who shipped a real-time policy status tracking module at Lincoln Financial using ASP.NET Core/.NET 8, Kafka, Angular, SQL Server, Redis, and AKS autoscaling. Also delivered a production internal LLM+RAG support assistant at Honeywell with strong security/guardrails (PII masking, RBAC) and a rigorous eval/regression loop built on a 200-question gold set.

Maheen Adeeb

Screened

Senior Machine Learning Engineer specializing in LLMs, speech AI, and RAG systems

Chicago, IL · 3y exp
Vosyn · DePaul University

AI engineer with production experience building multilingual speech-to-speech translation pipelines (ASR + LLM) for enterprise/media, focused on reliability at scale. Has hands-on orchestration experience (including IBM Watson contexts) and emphasizes production evaluation/monitoring using a mix of traditional metrics and LLM-based evaluators to catch quality regressions while balancing latency and cost.

Archana Yaramala

Mid-level AI/ML Engineer specializing in deep learning, MLOps, and LLM applications

NY, USA · 4y exp
DataRobot · St. Francis College

Built and deployed production LLM assistants for internal Q&A and customer-feedback summarization, emphasizing reliability (RAG, prompt tuning, validation/whitelisting) and privacy safeguards. Improved adoption by adding explainable outputs and a user feedback mechanism, and has hands-on orchestration experience with Airflow and Azure Logic Apps.

SN

Mid-level AI/ML Engineer specializing in GenAI, NLP, and financial systems

Texas, USA · 5y exp
Citibank · Concordia University, St. Paul

GenAI/ML engineer with hands-on experience building production financial intelligence and document summarization systems at Citibank. Stands out for combining LLM fine-tuning, hybrid RAG, multi-agent workflows, and strong MLOps/observability practices to deliver measurable business impact, including 60% faster analyst retrieval, 31% higher precision, and 99%+ uptime.

Vimala Devi

Screened

Mid-level AI & Machine Learning Engineer specializing in FinTech

Texas, USA · 4y exp
The Hartford · University of Houston

ML/AI engineer with hands-on experience building production systems in financial services, including a real-time underwriting analytics platform at Hartford Financial Services. Stands out for combining classic ML, low-latency API deployment, monitoring, and emerging LLM/RAG design patterns, with measurable impact including 20% better decision accuracy, sub-200ms latency, and 5M+ records processed daily.

Alir Navid

Screened

Executive CTO specializing in FinTech, Healthcare IT, and AI platforms

Irvine, CA · 19y exp
Aphid · University of Phoenix

Engineering/product leader who builds business-aligned technology roadmaps and scales pod-based orgs with strong delivery discipline (OKRs, CI/CD, QA automation). Led a SaaS supply-chain application adopted by Fortune 100 customers, citing ~$4M MRR and ~87% gross profit, and has hands-on experience standardizing LLM + cloud/MLOps architectures with security/compliance guardrails. Also created the PISEK methodology and used it to run distributed innovation sprints (e.g., an AI ETA predictor moved from pilot to production).

VM

Mid-Level Full-Stack/Backend Engineer specializing in AWS, APIs, and GenAI systems

Los Angeles, CA · 5y exp
AIRKITCHENZ · California State University, Fullerton

Backend engineer who built the core backend for Air Kitchens’ discovery/booking platform on AWS (Node + Python, DynamoDB, SQS/Lambda), optimizing for fast user-facing APIs and scalable async workflows. Introduced an AI matching service with a deterministic pre-filter + LLM ranking approach to balance latency vs quality, and has hands-on experience with production security (JWT/RBAC/RLS), CI/CD, and blue-green, staged migrations from Django to modular services.

ST

Mid-level Software Engineer specializing in backend, cloud-native microservices, and LLM apps

Remote, US · 3y exp
Walmart · University of Bridgeport

LLM/agentic systems practitioner who repeatedly takes customer-facing LLM prototypes into production by operationalizing prompts, hardening RAG pipelines, and adding monitoring/guardrails. Has hands-on experience debugging intermittent production failures under high traffic (vector store timeouts/empty retrieval) and implementing fail-safe behavior plus alerting. Also partners closely with sales in pilots/POCs, customizing demos with customer data and running side-by-side comparisons to drive adoption.

VG

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ · 5y exp
HCLTech · Rowan University

Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.

MY

Mid-level AI/ML Engineer specializing in Generative AI and RAG systems

6y exp
Elevance Health · MLR Institute of Technology

Built a production multi-agent orchestration platform to automate healthcare claims and HR workflows, combining LangChain/CrewAI/AutoGPT with RAG (FAISS/Pinecone) and fine-tuned open-source LLMs (LLaMA/Mistral/Falcon) in private Azure ML environments to meet HIPAA requirements. Emphasizes rigorous agent evaluation/observability (trajectory eval, adversarial testing, LLM-as-judge, drift monitoring) and reports measurable outcomes including 35% faster claims processing and 40% fewer chatbot errors.

Meghana P

Screened

Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and NLP

Illinois, USA · 5y exp
State Farm · Saint Louis University

AI/ML engineer with forensic analytics and healthcare claims experience (Optum), building production LLM/RAG systems to surface context-driven fraud patterns from unstructured claim notes and explain risk to investigators. Strong in large-scale retrieval performance tuning, legacy API integration with reliability patterns (SQS, circuit breakers), and MLOps orchestration on Airflow/Kubernetes with rigorous testing, monitoring, and stakeholder-friendly interpretability.

Tejaswini P

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

Austin, TX · 3y exp
State Street · University of Central Missouri

Built and deployed an LLM-powered financial/regulatory document analysis platform at State Street, combining fine-tuned transformer models with a RAG pipeline over internal knowledge bases. Owned the productionization stack (FastAPI, Docker, SageMaker, Terraform, CI/CD) plus monitoring for drift/latency/hallucinations, delivering ~40% faster analyst review and improved reliability through chunking/embeddings and grounding.

