Vetted Vector Search Professionals

Pre-screened and vetted.

SK

Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications

Dallas, TX5y exp
Baylor Scott & WhiteUniversity of North Texas

Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).

View profile
BK

Bharath kumar

Screened

Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps

Draper, UT12y exp
ThorneBharathiar University

ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.

View profile
OB

Omkar Bhope

Screened

Staff Full-Stack Engineer specializing in AI platforms and infrastructure automation

San Jose, CA5y exp
Etched AIUC San Diego

Backend/full-stack engineer building complex internal platforms and customer-facing demos at the intersection of infrastructure and product. Shipped a no-code Product Lifecycle Manager for manufacturing (3 manufacturers, 1000+ evolving tests) using AWS S3/SQS ingestion and extensible Postgres (EAV+JSONB) with end-to-end traceability. Also built a FastAPI-based company data intelligence platform with Okta-secured RBAC and an LLM/MCP layer for ChatGPT-like analytics over enterprise data sources.

View profile
TK

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and Conversational AI

3y exp
AetnaIndiana Tech

Built a production RAG-based GenAI copilot backend at Aetna using Python/FastAPI, GPT-4, LangChain, and Azure AI Search, deployed on AKS with Prometheus/Grafana observability. Owned the system end-to-end (ingestion through deployment) and improved peak-time reliability by addressing vector search and embedding bottlenecks with Redis caching, index optimization, and async processing, plus added anti-hallucination guardrails via retrieval confidence thresholds.

View profile
RG

Mid-level GenAI Engineer specializing in production RAG and LLM fine-tuning

San Jose, California5y exp
eBayTexas Tech University

LLM engineer who built a production seller-support RAG system at eBay using hybrid retrieval (BM25 + Pinecone vectors) with Cohere reranking, LangGraph orchestration, and citation-grounded answers. Strong focus on reliability: semantic/structure-aware chunking, automated Ragas-based evaluation with nightly regressions, and production observability (LangSmith) plus drift monitoring (Arize). Also implemented a multi-agent fraud pipeline with AutoGen using JSON-schema contracts and explicit termination conditions.

View profile
YT

Yupeng Tang

Screened

Junior Machine Learning Engineer specializing in LLM systems and GPU inference

Atlanta, GA1y exp
GMI CloudGeorgia Tech

LLM/agent engineer who shipped a production RAG-based recommendation + explanation system that replaced a traditional recommender stack, delivering ~20% CTR lift (and +8% after a reliability iteration) with strong cold-start performance. Demonstrates strong production rigor: schema-constrained generation, typed tool calling, explicit state/orchestration, deep monitoring/feedback loops, and safe integration with messy ERP inventory/order data using normalization, idempotency, and conflict-resolution guardrails.

View profile
JS

Intern Software Engineer specializing in edge AI deployment and distributed systems

San Francisco, CA1y exp
Zetic AISan José State University

Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).

View profile
Prateeksha Ranjan - Mid-level Software Engineer specializing in embedded AI and full-stack systems in Irvine, California

Mid-level Software Engineer specializing in embedded AI and full-stack systems

Irvine, California4y exp
SynapticsUC Irvine

Robotics software engineer who built and owned core navigation components for a TurtleBot in ROS/ROS2 and Gazebo, including an RRT-based planner, waypoint-to-velocity motion planning, and PID trajectory tracking. Demonstrates strong real-time debugging skills (control-loop timing under CPU load), costmap/occupancy-grid tuning, and distributed ROS2 communication design using DDS/QoS, plus Docker and CI/CD automation experience from Keysight.

View profile
Prateek Patil - Engineering Leader specializing in Digital Health, AI, and Cloud Platforms in Santa Clara, CA

Prateek Patil

Screened

Engineering Leader specializing in Digital Health, AI, and Cloud Platforms

Santa Clara, CA16y exp
RocheIllinois Institute of Technology

Senior Engineering Manager at Roche leading two Scrum teams building internally shared (“inner-sourced”) tools and libraries for a healthcare enterprise. Has led security/compliance-first architecture decisions (e.g., Python AI modules running inside a Java container) and front-end modularization (Angular monorepo to module federation), with a strong focus on developer experience via automated Swagger/OpenAPI documentation and robust testing/versioning practices.

View profile
CY

Charlotte Yu

Screened

Junior Full-Stack AI Engineer specializing in LLM apps and RAG systems

Remote1y exp
StealthUCLA

Built and shipped a production LLM-powered “Vet agent” that automates pet symptom intake across multimodal inputs (images/files/text/speech) and provides analysis/home-care guidance, reaching thousands of daily active users within two months. Demonstrates strong agent engineering fundamentals: state-machine orchestration with structured JSON, tool/schema validation, high-availability routing/failover, and rigorous offline/online evaluation loops with trace-driven reliability improvements.

View profile
Aarushi Mahajan - Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps in New York, USA

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps

New York, USA4y exp
IntuitUniversity of Massachusetts Amherst

Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.

View profile
OB

Mid-level Software Developer specializing in backend microservices for healthcare and FinTech

USA4y exp
HumanaUniversity of Virginia

Built and deployed an AI-powered insurance claims fraud platform end-to-end using Java/Spring Boot, Kafka, OpenAI, pgvector, and AWS EKS. Stands out for combining LLM/RAG architecture with production-grade scalability and observability, delivering measurable impact including 62% less manual review, 40% better fraud precision, 37% higher throughput, and 99.95% uptime.

View profile
JE

Justin Emsoff

Screened

Director-level Solutions Architect specializing in AI, integrations, and enterprise SaaS

Altadena, CA12y exp
KnowdeUSC

Player-coach engineering leader currently running a Solution Architecture/FDE team responsible for both presales and postsales delivery. Stands out for combining enterprise systems thinking with hands-on AI product work: they built configurable tooling that sped delivery by ~30%, drove a Kafka-to-Pulsar architecture shift for scale, and spent the last two years building LLM-based document extraction and RAG inference pipelines shaped directly by user feedback.

View profile
NP

Navneet Parab

Screened

Mid-level AI/ML Engineer specializing in financial risk and LLM systems

New Jersey, USA4y exp
Ally FinancialNortheastern University

AI/ML engineer in financial services who has built both LLM-powered compliance tools and production fraud/credit risk systems at Ally Financial. Particularly strong in regulated, high-stakes environments: combines RAG/LLM architecture, rigorous evaluation, and human-in-the-loop governance, and also helped stand up a unified ML platform from scratch.

View profile
SG

Junior Software Engineer specializing in AI search and full-stack systems

Denver, CO3y exp
finish’d, Inc.University of Colorado Boulder

AI/full-stack engineer who has built both a real-time crypto sentiment platform from scratch and production enterprise RAG search systems at Kore.ai. Stands out for combining strong systems engineering with practical LLM evaluation, retrieval tuning, and careful human-in-the-loop design for high-risk network automation use cases with Cisco.

View profile
Aakash Khepar - Mid-level Full-Stack AI Engineer specializing in agentic AI systems in Tempe, AZ

Aakash Khepar

Screened

Mid-level Full-Stack AI Engineer specializing in agentic AI systems

Tempe, AZ4y exp
Arizona State UniversityArizona State University

AI/full-stack builder with hands-on experience shipping healthcare, career-tech, nonprofit, and fintech products, spanning speech AI, browser extensions, agentic RAG systems, and enterprise ML monitoring. Stands out for combining strong technical depth with measurable outcomes, including reducing clinical call WER from 26% to 3%, building safe tool-using agents with rollback/RBAC, and delivering zero-to-one multi-tenant platform features in ambiguous environments.

View profile
Suman Madipeddi - Junior AI/ML Engineer specializing in agentic AI, RAG, and voice systems in San Jose, CA

Junior AI/ML Engineer specializing in agentic AI, RAG, and voice systems

San Jose, CA2y exp
ZscalerArizona State University

Full-stack AI product engineer who has owned production-grade document intelligence and agent systems at meaningful scale, including a copilot used by 10,000+ users and 1M+ queries. Particularly strong in combining React/TypeScript product work with Python/FastAPI, RAG, knowledge graphs, observability, and performance tuning—cutting latency from ~7 seconds to 0.5 milliseconds while improving trust through citations and human review.

View profile
NP

Nikita Prasad

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and scalable data pipelines

Remote, USA5y exp
JPMorgan ChaseUniversity of Dayton

Built and shipped a production LLM-powered personalized client engagement assistant in the financial domain, balancing real-time recommendations with strict privacy/compliance requirements. Demonstrates strong MLOps/LLMOps depth (Airflow + MLflow, containerized microservices, drift monitoring) and a privacy-by-design approach validated in collaboration with risk and compliance teams.

View profile
SS

Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare

Remote, USA4y exp
EYUniversity of South Florida

Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.

View profile
UC

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and RAG systems

Atlanta, GA5y exp
Morgan StanleyKennesaw State University

Machine learning/NLP engineer who built a production-oriented retrieval-based AI system at Morgan Stanley for healthcare use cases, combining RAG over unstructured patient records with deep-learning medical image segmentation (U-Net/Mask R-CNN). Strong in end-to-end pipelines and MLOps (Spark/MongoDB, AWS SageMaker, CI/CD, monitoring, automated retraining) and in entity resolution/data quality validation for noisy clinical data.

View profile
Divyam Agrawal - Mid-level Machine Learning Engineer specializing in LLMs and NLP classification systems in Seattle, WA

Mid-level Machine Learning Engineer specializing in LLMs and NLP classification systems

Seattle, WA4y exp
Affinity SolutionsUniversity of Washington

Internship experience building a production RAG+LLM pipeline to map messy card transaction descriptions to merchant brands, including a custom modified-ROUGE evaluation approach for weak/variant ground truth. Improved scalability and cost by moving from a managed LLM endpoint (e.g., Bedrock) to self-hosted vLLM, and orchestrated massive embedding backfills (5,000+ files, 10B+ rows) using an Airflow-triggered SQS + ECS worker architecture with robust retry/DLQ handling.

View profile
Sai Charan Kolla - Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS in TX, USA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS

TX, USA5y exp
BlackRockTexas A&M University-Kingsville

LLM engineer who built a production document intelligence/RAG pipeline to extract structured data from thousands of unstructured PDFs, cutting manual review time by 60%. Experienced with LangChain and Airflow orchestration plus rigorous evaluation (labeled datasets, prompt testing, HITL review, monitoring) to improve accuracy and reduce hallucinations while partnering closely with non-technical operations stakeholders.

View profile
Amaan Elahi - Mid-level Software Engineer specializing in backend, AI, and full-stack systems in New York, NY

Amaan Elahi

Screened

Mid-level Software Engineer specializing in backend, AI, and full-stack systems

New York, NY5y exp
SAIL GTXNYU

Built and shipped production LLM agents including an internal RAG-based compliance classification system at SAIL (FastAPI/Redis/Docker) designed to handle real failure modes and scale to ~10k LLM calls/hour, achieving ~93% pipeline accuracy with reduced hallucination risk via multi-model orchestration and strict grounding. Also architected “Elara,” a state-machine-driven conversational appointment booking agent using structured JSON outputs and backend function execution for reliability, and has experience normalizing messy OTA/PMS data at RateGain.

View profile
BN

Mid-level Machine Learning Engineer specializing in AI/LLM systems

New York, NY5y exp
ServiceNowUniversity at Buffalo

ML/LLM systems engineer who has owned AI support automation products end-to-end, including ServiceNow-integrated incident routing, RAG-based resolution suggestion systems, and production stabilization. Stands out for combining hands-on platform work across PySpark, AWS Glue, FastAPI, Kubernetes, and Pinecone with measurable operational impact, including 30-35% MTTR reduction and 25-30% improvement in first-touch resolution.

View profile

Need someone specific?

AI Search