Vetted Retrieval-Augmented Generation Professionals

Pre-screened and vetted.

KM

Mid-Level Software Development Engineer specializing in GenAI automation and cloud systems

Long Beach, CA · 6y exp
simplehuman · George Mason University

Backend Python engineer who architected an event-driven order integration engine connecting EDI vendors to ERP/WMS/3PL systems, including a canonical order model and adapter framework to eliminate per-customer hardcoding. Has hands-on Kubernetes production experience (microservices, Celery workers, CronJobs, HPAs) and implemented GitOps/CI-CD using GitHub Actions, Docker, and ArgoCD, including moving deployments from on-prem to Azure.

Harsh Patel

Screened

Senior Data Scientist specializing in LLM applications, RAG systems, and production ML

New York, NY · 6y exp
Fulcrum Analytics · University of Maryland, Robert H. Smith School of Business

Senior Data Scientist in consulting who has built production RAG systems for insurance/annuity document search at large scale (100K+ PDF pages), emphasizing grounded answers, guardrails, and low-latency retrieval. Experienced in end-to-end MLOps for LLM apps—monitoring, evaluation sets, drift handling, and safe rollouts—and in orchestrating complex pipelines with Prefect/Airflow and deploying services on Kubernetes.

Althaf Shaik

Screened

Senior Software Engineer specializing in cloud-scale distributed systems and data platforms

Hyderabad, India · 4y exp
DHI ADT Solutions · NJIT

LLM/RAG-focused engineer who repeatedly takes agentic workflows from impressive demos to dependable production using rigorous evals, SLOs, and deep observability. Has led high-impact incident mitigation (22-minute MTTR during a major sale) and developer enablement workshops, and partnered with sales to close a $410k ARR enterprise deal with a tailored RAG pilot (FastAPI/pgvector/Okta/InfoSec-ready).

Satya VM

Screened

Mid-level GenAI/Data Engineer specializing in LLMs, RAG systems, and fraud detection

Ruston, LA · 7y exp
Origin Bank · Osmania University

ML/NLP engineer with banking domain experience who built a GenAI-powered fraud detection and risk intelligence system at Origin Bank, combining RAG (LangChain + FAISS), fine-tuned BERT NER, and GPT-4/Sentence-BERT embeddings. Delivered measurable impact (25% higher fraud detection accuracy, 40% less manual review) and emphasizes production-grade pipelines on AWS SageMaker/Airflow with strong data validation and scalable PySpark processing.

AS

Junior AI/Software Engineer specializing in LLM agents, RAG, and full-stack ML systems

Austin, TX · 2y exp
Gauntlet AI · Virginia Tech

Backend engineer who built an Emergency Alert System with Virginia Tech for the City of Alexandria, focusing on real-time ingestion, secure dashboards, and AI-assisted prioritization. Emphasizes high-stakes reliability with guardrails (hybrid rules+LLM, confidence-based fallbacks), scalable async processing, and defense-in-depth security (JWT/RBAC plus database row-level security).

Shreya Thakur

Screened

Mid-level Software Engineer specializing in Python backend and LLM/ML systems

New York, USA · 4y exp
Saayam for All · University at Buffalo

Backend/AI engineer who has shipped production LLM systems end-to-end, including an AI request-routing service (FastAPI + BART MNLI + OpenAI/Gemini) that improved accuracy ~25% after launch via eval-driven prompt/category iteration. Also built an enterprise document intelligence/RAG platform on Azure (Blob/SharePoint/Teams ingestion, OCR/NLP chunking, embeddings in Azure Cognitive Search) with PII guardrails (Presidio), confidence gating, and scalable event-driven pipelines handling millions of documents.

Akshay Bharadwaj Kunigal Harish

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and LLM systems

Boston, MA · 5y exp
Perceptive Technologies · Northeastern University

Built a production multi-agent cybersecurity defense simulator orchestrated with CrewAI, combining Red/Blue team LLM agents, a RAG runbook retriever, and an RL remediation agent trained via state-space simplification and reward shaping for rapid incident response. Also partnered with quant analysts and fund managers to deliver an automated trading and portfolio management system using statistical methods plus CNN/LSTM models, reporting up to 15% weekly ROI.

Vaishnavi M

Screened

Mid-level AI/ML Engineer specializing in MLOps and Generative AI

5y exp
Liberty Mutual · University of Maryland, Baltimore County

At Liberty Mutual, built a production underwriting decision assistant combining LLM reasoning with quantitative models and strong auditability. Implemented a claims-based response verification pipeline that cut hallucinations from 18% to 3% and materially improved user trust/validation scores. Experienced orchestrating ML/LLM workflows end-to-end with Airflow, Kubeflow Pipelines, and Jenkins, including SLA-focused pipeline hardening.

Aditi Deshpande

Mid-level Software/AI Engineer specializing in GenAI, AWS, and microservices

Remote, United States · 4y exp
LegalPro+ · Arizona State University

Built a production AI pipeline at EyCrowd to automatically grade shaky outdoor user-submitted brand videos using CV + CLIP/BLIP and a LangChain RAG layer per brand, with GPT-4 generating structured JSON explanations and grades. Optimized for latency and cost (batch PyTorch inference, caching), cutting review time from ~8 minutes to <2 minutes while reaching ~90% alignment with human graders and supporting thousands of videos/day.

Abhishek Ingle

Junior Full-Stack & AI Software Engineer specializing in React/Next.js and LLM systems

Bloomington, IN · 2y exp
Indiana University · Indiana University Bloomington

Backend engineer with hands-on experience building low-latency, high-concurrency real-time chat on AWS (Node.js/Socket.IO/MongoDB) and improving reliability under unstable networks, contributing to ~40% user adoption growth. Also built FastAPI-based AI assistant context retrieval (RAG) APIs with embeddings/vector search, and has strong production experience in rate-limit handling, async refactors with safe rollout, and Supabase Auth/RLS optimization.

Mahiyadav Sidda

Junior Machine Learning Engineer specializing in LLMs, RAG, and on-device AI

Bangalore, India · 2y exp
Hashmint · Arizona State University

Built an "Offline Study Assistant" that runs LLM inference locally on a 5-year-old Android device using llama.cpp and the Android NDK, achieving a 27x speedup and cutting time-to-first-token from 11 minutes to 30 seconds. Also has applied backend/API experience with FastAPI, Supabase (Auth + RLS), and production hardening of a RAG system at Hashmint using Celery and Redis to eliminate PDF-processing-related query failures.

Prashanth Jamalapurapu

Mid-level AI/ML Engineer specializing in data engineering, LLM/RAG pipelines, and recommender systems

5y exp
Friendzy · Saint Louis University

Research assistant at Saint Louis University who built and deployed a production document-intelligence RAG system (Python/TensorFlow, vector DB, FastAPI) on AWS, focusing on grounding to reduce hallucinations and latency optimization via caching/async/batching. Also developed a personalized recommendation system for the Friendzy social platform and partnered closely with product/UX to define metrics and iterate on hybrid recommenders and cold-start handling.

Kingsley Torlowei

Senior Software Engineer specializing in AI systems and data platforms

Remote · 8y exp
Sift Platforms · Fanshawe College

Built and productionized LLM agents that ingest multi-source workplace data (Slack, meetings, calendars, PM tools) to extract entities (tasks/decisions/risks/initiatives) and generate customer insights like risk alerts, deadline-miss prediction with evidence, and workload overload detection. Also architected a graph-DB-backed multi-step agent using LangChain + Pydantic with async queue/worker execution and LLM-as-judge evaluation plus human review loops.

CK

Mid-level Conversational AI Engineer specializing in enterprise chatbots and workflow automation

Miami, FL · 4y exp
Lid Vizion · University of South Dakota

Built a production LLM/RAG document extraction and game/quiz content workflow using LLaMA 2, LangChain/LangGraph, and FAISS, achieving ~94% accuracy and reducing turnaround from hours to minutes. Demonstrates strong applied MLOps/orchestration (CI/CD, MLflow, Databricks/PySpark), robust handling of noisy/variable document layouts (layout chunking + OCR fallbacks), and practical reliability practices (human-in-the-loop routing, drift monitoring, A/B testing).

AW

Junior Full-Stack & AI Engineer specializing in computer vision and cloud platforms

Buffalo, NY · 2y exp
FILMIC TECHNOLOGIES · University at Buffalo

Early-career backend engineer and solo builder of FrameFindr, an AI/OCR-based marathon photo tagging product used at live events. Demonstrated pragmatic scaling under tight infrastructure constraints (2GB VPS) and hands-on ownership of architecture, API design, auth (Google OAuth/JWT), and a MongoDB-to-MySQL migration with data-integrity safeguards.

NK

Intern Data Scientist specializing in Generative AI and NLP

United States · 2y exp
HCLTech · University of New Haven

Backend/AI engineer with internship experience building an AI-powered financial insights platform (FastAPI, Redis, BigQuery) and prior HCL experience leading a monolith-to-microservices refactor (Flask, Kafka) using blue-green deployments. Demonstrates strong performance/security focus (OAuth/JWT/RBAC, encryption) and measurable impact on latency, downtime, and ML model reliability; MVP was submitted to Google’s accelerator program.

Dhyan Patel

Screened

Mid-level AI Engineer specializing in NLP and production ML systems

Tempe, AZ · 3y exp
MindSpark · Arizona State University

AI/LLM engineer who has shipped production RAG chatbots using LangChain/OpenAI with FAISS and FastAPI, focusing on real-world constraints like context windows, concurrency, and latency (reported ~40% latency reduction and <2s average response). Experienced orchestrating AI pipelines with Celery and fault-tolerant long-running workflows with Temporal, and has applied NLP model tradeoff testing (Word2Vec vs BERT) to drive measurable accuracy gains.

Hanif Lashari

Screened

Mid-level Data & Machine Learning Engineer specializing in anomaly detection and forecasting

Ames, IA · 3y exp
Mary Greeley Medical Center · Iowa State University

Built and productionized an agentic RAG assistant using Ollama + LangChain + MCP + ChromaDB to speed up and standardize access to operational knowledge from tickets and runbooks. Focused on real-world reliability: mitigated timeouts/latency with retries and concurrency limits, improved retrieval via chunking/embedding iteration, and reduced hallucinations through citation-grounding and confidence-based abstention. Also partnered with non-technical ops staff to deliver anomaly detection/monitoring by translating operational needs into model signals, thresholds, and alerting logic.

BK

Mid-level AI Engineer specializing in ML, NLP, and Generative AI

Atlanta, GA · 4y exp
CGI · University of New Haven

AI/LLM engineer with production experience building an LLM-powered investment recommendation system using RAG and chatbots, deployed via Docker/CI/CD and scaled on Kubernetes. Demonstrated measurable performance wins (sub-200ms latency) through QLoRA fine-tuning and TensorRT INT8/INT4 quantization, plus strong MLOps/orchestration background (Airflow ETL + scoring, MLflow monitoring) and stakeholder-facing delivery using demos and Tableau dashboards.

Lavan Gajula

Screened

Mid-level GenAI Engineer specializing in LLM agents and production AI workflows

New York, NY · 5y exp
Lara Design · New England College

Designed and deployed end-to-end LLM-powered AI agent systems to automate knowledge-intensive workflows across marketing/GTM, recruiting, and support. Brings production reliability rigor (evaluation pipelines, monitoring, testing, A/B experiments) plus orchestration expertise (Airflow, Prefect, custom Python) and a track record of translating non-technical stakeholder goals into working AI solutions (e.g., personalized customer engagement agent at Lara Design).

VM

Entry-Level Data Scientist specializing in ML, Azure, and LLM applications

Gainesville, Florida · 1y exp
University of Florida · University of Florida

ML/computer-vision practitioner who shipped a CycleGAN-based bilingual handwriting translation demo (English↔Telugu) for low-resource scripts using unpaired datasets, focusing on preserving handwriting style and real-time deployment via Gradio. Also delivered a medical imaging pipeline by fine-tuning ResNet-50 and ViT-B/16 for pneumonia detection, emphasizing reproducibility, measurable evaluation, and stakeholder-friendly iteration.

VR

Junior Software Engineer specializing in backend, cloud, and LLM-powered search

Baltimore, MD · 3y exp
BetterWorld Technology · University of Maryland, Baltimore County

Python backend engineer (BetterWorld Technology) who owns microservice systems end-to-end on Azure, including Kubernetes deployments, CI/CD, and production monitoring/alerting. Has hands-on experience integrating SQL/NoSQL (including Cosmos DB with vector search/graph workflow) and has built a Kafka + Spark Streaming pipeline to Snowflake with a reported 40% latency reduction.

Mehul Parmar

Screened

Mid-level Data Scientist specializing in insurance, healthcare, and cloud analytics

Somerset, NJ · 4y exp
P&F Solutions · Long Island University

Built a production-style LLM document summarization/generation workflow that mitigates token limits and reduces hallucinations using semantic chunking, FAISS-based embedding retrieval (top-k via cosine similarity), and section-wise generation. Orchestrated the end-to-end pipeline with AWS Step Functions and aligned outputs with sales stakeholders through demos, visuals, and documentation.
