Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

HA

Hamad Alajeel

Screened

Intern Machine Learning & AI Automation Engineer specializing in ML workflows and AI hardware

Fort Lauderdale, FL0y exp
Revscale Technologies Inc.UC San Diego

ML practitioner with hands-on experience adapting diffusion models (DDPM + U-Net in PyTorch) to improve low-dose CT medical imaging quality via denoising and upsampling against high-dose ground truth. Also built a RAG workflow during a recent internship by cleaning client survey data, embedding with OpenAI text-embedding-3-large, and indexing in Pinecone with MD5 deduplication, alongside a strong emphasis on production-grade Python practices.

View profile
CB

Mid-level Full-Stack Software Engineer specializing in cloud and AI-enabled applications

San Francisco, CA4y exp
One CommunityPurdue University

Product-focused full-stack engineer (70/30 app vs infra) with Accenture experience and recent AI workflow work, shipping end-to-end systems from React/TypeScript UIs through FastAPI backends to Postgres. Built an AI-driven data extraction platform with async job APIs, strict schema validation, and strong observability, and has operated AWS ECS-based deployments with real incident mitigation (DB connection exhaustion/latency under traffic spikes).

View profile
RM

Rifat Mahfuz

Screened

Junior Backend Software Engineer specializing in microservices and API platforms

New York City, NY1y exp
ShareTripUniversity of Illinois Urbana-Champaign

Backend engineer with strong performance and security instincts: built a Flask API for readability metrics with clean, testable modular design; optimized SQLAlchemy/Postgres to eliminate N+1 issues (800ms to 120ms). Also implemented an LLM-powered natural-language travel search using Claude Sonnet + Amadeus with RAG and anti-exploitation safeguards, plus multi-tenant isolation via Postgres RLS and Redis caching that cut search latency from ~20s to ~4–5s while reducing storage costs.

View profile
KK

Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps

Remote, United States6y exp
AccentureEastern Illinois University

LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).

View profile
RJ

Ramesh Jasti

Screened

Mid-level AI/ML & MLOps Engineer specializing in cloud AI infrastructure and GenAI

San Jose, USA5y exp
HPEWestern Illinois University

At HPE, led and deployed an enterprise-grade LLM document intelligence platform for an insurance client, automating extraction from highly variable PDFs/scans/emails and raising field accuracy from 74% to 93%. Built a LangChain/Pinecone/OpenSearch RAG framework to cut hallucinations by 37% and operationalized LangSmith evals in CI, driving a 41% triage accuracy lift and >33% fewer incorrect resolutions while partnering closely with claims operations via HITL workflows.

View profile
VH

Mid-level ML/AI Engineer specializing in NLP, RAG pipelines, and financial risk & fraud systems

USA3y exp
FintaUniversity at Buffalo

Built and shipped LLM/RAG systems in finance and startup settings, including a Goldman Sachs document intelligence platform that indexed ~8TB of regulatory filings and delivered cited, conversational answers with <2s latency—cutting compliance research by ~4.5 hours per batch. Also developed LangChain-based agent workflows at Finta to automate CRM enrichment and investor lookup with strong testing, tracing (LangSmith), privacy guardrails, and auditability.

View profile
KE

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA4y exp
S&P GlobalUniversity of Central Missouri

Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.

View profile
TR

Tejaswi Rao

Screened

Mid-level Machine Learning Engineer specializing in MLOps and GenAI analytics

Jersey City, New Jersey7y exp
MediacomStevens Institute of Technology

ML/LLM practitioner who has deployed a production RAG-based trouble-call identifier using multiple datasets (device, network, past complaints). Experienced in end-to-end MLOps (FastAPI + Docker + Kubernetes with HPA) and in evaluating/monitoring LLM behavior to reduce hallucinations, with additional applied work in forecasting/anomaly detection and churn prediction for retention campaigns.

View profile
SS

Sushma Sri B

Screened

Mid-level Full-Stack Engineer specializing in cloud-native microservices (FinTech/Healthcare)

Charlotte, NC5y exp
ADPUniversity of North Carolina at Charlotte

Built and shipped production systems spanning real-time operational dashboards and an LLM-powered internal documentation assistant using RAG (embeddings + vector DB). Demonstrates strong focus on reliability and iteration: implemented guardrails and evaluation loops (human review, hallucination tracking, regression prevention) and improved performance/scalability through query optimization, caching, and retrieval tuning.

View profile
AK

Ansh Krishna

Screened

Intern Data Scientist specializing in ML systems and LLM-powered analytics

Noida, India1y exp
Data Security Council of IndiaUSC

Built an autonomous decision analytics LLM agent for end-to-end tabular binary classification, using RAG (FAISS) to retain context across multi-step queries. Deployed as a FastAPI service with production-style reliability features (schema-aware validation, fallbacks, retries, structured outputs) plus offline/online evaluation and monitoring to reduce analysis time and improve consistency versus stateless approaches.

View profile
SASIREKHA GULIPALLI - Mid-level Data Analyst specializing in procurement, supply chain analytics, and applied machine learning in Alpharetta, GA

Mid-level Data Analyst specializing in procurement, supply chain analytics, and applied machine learning

Alpharetta, GA4y exp
MotrexGeorgia State University

Strategic sourcing professional specializing in seasonal apparel supply chains, combining Coupa/JD Edwards analytics with Excel/Python modeling and Power BI dashboards to drive cost reduction and OTIF gains. Notable for rapid mitigation of a 10-day factory delay affecting 12 holiday SKUs (preserved 95% of revenue) and for automating PO workflows to cut cycle time by 4.2 days and improve OTIF by 15%.

View profile
Rushir Bhavsar - Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp
Cadence Design SystemsArizona State University

Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.

View profile
Joshua Hewitt - Senior Software Engineer specializing in Generative AI product development in San Francisco, USA

Joshua Hewitt

Screened

Senior Software Engineer specializing in Generative AI product development

San Francisco, USA9y exp
PadletUniversity of Sydney

AI product builder at Padlet who shipped multiple production LLM features for education workflows, including an AI document generator (AI Recipes) and a RAG-enabled in-product chat assistant. Built an AI microservice layer (LangChain) to swap model providers easily and created automated + human-in-the-loop evaluation systems (including ~100-test runs) to iterate on prompts and quality.

View profile
Jax Diagana - Senior AI Engineer specializing in forward-deployed voice agents and incident-response automation in San Francisco, CA

Jax Diagana

Screened

Senior AI Engineer specializing in forward-deployed voice agents and incident-response automation

San Francisco, CA7y exp
AnaplanUniversity of St. Thomas

FDE at Bland.ai and founder of Fi (incident-response agent) who routinely takes LLM/agentic concepts from prototype to production. Has hands-on experience reverse-engineering undocumented systems to deliver integrations, building LLM testbeds for voice-agent reliability, and rapidly shipping RAG/semantic search solutions (e.g., Confluence runbooks) after deep customer discovery with DevOps/SRE teams.

View profile
Aniket Janrao - Junior Data Scientist specializing in healthcare ML and clinical NLP/LLMs in Houma, LA

Aniket Janrao

Screened

Junior Data Scientist specializing in healthcare ML and clinical NLP/LLMs

Houma, LA2y exp
Objective Medical Systems LLCUniversity at Buffalo

Healthcare-focused LLM engineer who has built two production clinical applications: an automated structured clinical report generator from physician-patient conversations and a RAG-based chatbot for retrieving patient history (procedures, allergies, etc.). Demonstrates strong applied RAG expertise (overlapping chunking, entity dependency graphs, temporal filtering, graph RAG) to reduce hallucinations/omissions and partners closely with clinicians to automate hospital workflows.

View profile
Manasa Reddy Nagendla - Mid-level Full-Stack Java Engineer specializing in microservices, cloud, and event-driven systems in Cincinnati, OH

Mid-level Full-Stack Java Engineer specializing in microservices, cloud, and event-driven systems

Cincinnati, OH6y exp
Procter & GambleUniversity of Cincinnati

Software engineer at Procter & Gamble focused on warehouse/operations systems, building near-real-time order/inventory visibility using Java/Spring Boot, React, Kafka, PostgreSQL, and Redis with measurable latency and load-time gains. Also shipped internal LLM/RAG knowledge assistants grounded in company runbooks and workflows, implementing guardrails and an evaluation loop that drove concrete retrieval improvements (document chunking) and regression prevention.

View profile
Arya Mane - Junior Full-Stack & AI/ML Engineer specializing in LLMs and multimodal document processing in Dallas, Texas

Arya Mane

Screened

Junior Full-Stack & AI/ML Engineer specializing in LLMs and multimodal document processing

Dallas, Texas1y exp
Receptro.AIUniversity of Texas at Dallas

Built a production RAG-based NBA player scouting assistant that embeds player profiles into FAISS, orchestrates retrieval and LLM recommendations with LangChain, and surfaces results via embedded Tableau dashboards. Demonstrates strong focus on evaluation/monitoring (batch tests, LLM-as-judge, latency/failure/token metrics) and has experience translating non-technical founder goals into DAPT + fine-tuning plans on curated data.

View profile
Sana Khan - Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech in Oklahoma, USA

Sana Khan

Screened

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech

Oklahoma, USA4y exp
Capital OneOklahoma Christian University

ML/LLM engineer who has deployed a production LLM-powered assistant for intent classification and query routing (order recommendation/support deflection), combining BERT fine-tuning with an embedding-based retrieval layer and optimizing for low-latency inference. Experienced with end-to-end reliability practices—Airflow-orchestrated ETL, data validation/alerting, MLflow experiment tracking, and iterative improvements driven by user feedback and monitoring.

View profile
Saikrishna Vallala - Mid-level QA Automation Engineer / SDET specializing in Financial Services and Healthcare in USA

Mid-level QA Automation Engineer / SDET specializing in Financial Services and Healthcare

USA5y exp
Morgan StanleyDePaul University

Fintech-focused engineer who built an end-to-end KYC verification pipeline for advisor onboarding using Flask microservices, Celery/Redis, and AWS (Lambda/ECS/EC2) with CloudWatch-driven scaling and latency optimizations. Also shipped a production internal knowledge assistant using RAG + embeddings/vector search with guardrails (similarity-based fallback, prompt-injection protections) and an evaluation loop with compliance specialist review that drove measurable retrieval improvements.

View profile
Jake Lee - Junior Full-Stack/Systems Engineer specializing in AI, embedded systems, and healthcare apps in Boston, MA

Jake Lee

Screened

Junior Full-Stack/Systems Engineer specializing in AI, embedded systems, and healthcare apps

Boston, MA3y exp
SolstisBoston University

Led architecture for “Solstice/Solstis,” a safety-aware, hands-free AI medical assistant that guides users through minor emergencies with a structured, state-machine-driven LLM agent integrated with device hardware. Built RAG grounded in Red Cross procedures plus guardrails, fallbacks, and emergency escalation, and improved real-world usability by shifting from open-ended chat to a deterministic step-by-step workflow measured via completion rate, repeat prompts, and latency.

View profile
Ankita A Khartmol - Junior Backend Software Engineer specializing in conversational AI and cloud APIs in Bangalore, India

Junior Backend Software Engineer specializing in conversational AI and cloud APIs

Bangalore, India1y exp
HarmanUSC

Backend/ML-focused software engineer who built and evolved a Python/FastAPI backend for a large-scale conversational AI platform, decoupling API and inference services to improve stability and deployment velocity. Experienced in production hardening (timeouts/fallbacks/monitoring), secure multi-tenant systems (JWT/RBAC/RLS), and low-risk migrations using shadow deployments and incremental traffic ramp-ups.

View profile
Chandra Shekar Akkandra - Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services in Newark, CA

Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services

Newark, CA5y exp
JPMorgan ChaseUniversity of Missouri-Kansas City

Finance-domain ML/LLM engineer who has shipped production systems including a RAG-based financial insights assistant with a custom post-generation validation layer that verifies atomic claims against retrieved source text to prevent hallucinations in compliance-critical workflows. Also built large-scale MLOps automation on AWS using Kubeflow + MLflow + CI/CD for fraud detection and credit risk models processing 500M+ transactions/day with a 99.99% uptime goal, and partnered closely with JP Morgan risk/compliance stakeholders on NLP-driven compliance monitoring.

View profile
Abdul Mohammed - Mid-level Data Analyst specializing in healthcare and financial analytics in USA

Mid-level Data Analyst specializing in healthcare and financial analytics

USA3y exp
Cardinal HealthIndiana Tech

Built and productionized an LLM-powered clinical documentation and insights pipeline at Cardinal Health using LangChain + GPT-4 with RAG to summarize long clinical notes, extract medication/dosage entities, and generate structured SQL-ready outputs for downstream analytics. Emphasizes clinical reliability via labeled benchmarking (precision/recall/F1), shadow deployments, clinician human-in-the-loop review, and ongoing monitoring/orchestration with Airflow, Lambda, S3, Postgres, and Power BI.

View profile
NR

Mid-level AI Engineer specializing in LLMs, RAG, and MLOps

5y exp
Wells FargoSouthern Methodist University

Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.

View profile

Need someone specific?

AI Search