Vetted Vector Search Professionals

Pre-screened and vetted.

MC

Senior Applied AI Engineer specializing in LLMs, NLP, and production AI systems

Madison, WI11y exp
IndeedCarnegie Mellon University
View profile
SL

Senior Full-Stack Engineer specializing in AI/ML and cloud microservices

Summit, IL13y exp
OptumUniversity of Illinois Urbana-Champaign
View profile
ML

Senior Software Engineer specializing in AI agents and cloud platforms

Louisiana, USA7y exp
NotionSanta Clara University
View profile
MD

Mid-level Software Engineer specializing in backend, ML platforms, and FinTech

California, USA5y exp
MetaSaint Louis University
View profile
MR

Senior AI Engineer specializing in LLM platforms and RAG systems

Bronx, NY8y exp
PerplexityFordham University
View profile
KK

Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure

San Francisco, CA6y exp
PerplexityStevens Institute of Technology
View profile
RK

Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems

San Francisco, CA4y exp
PlaidSaint Louis University
View profile
SP

Senior AI/ML Engineer specializing in GenAI, agentic systems, and healthcare AI

Bonham, TX12y exp
AnthropicTexas Tech University
View profile
RT

Rana Taki

Screened

Junior Mechanical Engineering & Software Developer specializing in aviation autonomy and retrieval systems

Stanford, CA2y exp
Stanford UniversityStanford University

Robotics/embedded builder who trained an aviation-specific LLM and deployed it offline on an NVIDIA Jetson for an in-flight voice assistant, solving performance and cabling constraints with NVMe storage and Bluetooth. Also has hands-on Raspberry Pi/Arduino robot builds (including a cigarette-butt picking prototype with hydraulic actuation) plus Docker-based FEA work using FEniCS/Gmsh and strong CI/CD + automated testing practices.

View profile
Nishitha Thummala - Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference in San Francisco, CA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference

San Francisco, CA6y exp
PerplexityUniversity of Nebraska Omaha

Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.

View profile
Nikhil Reddy - Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms in San Francisco, CA

Nikhil Reddy

Screened

Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms

San Francisco, CA5y exp
NVIDIASaint Louis University

Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.

View profile
MJ

Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems

Mesquite, TX11y exp
AmazonUniversity of Texas at Dallas

ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.

View profile
Pavankumar Pendela - Mid-level AI/ML Engineer specializing in LLMs, RAG, and multi-agent systems in Centerton, AR

Mid-level AI/ML Engineer specializing in LLMs, RAG, and multi-agent systems

Centerton, AR6y exp
MetaUniversity of the Cumberlands
View profile
Zhengjie Qian - Intern Full-Stack Engineer specializing in AI-driven RAG applications in Menlo Park, CA

Intern Full-Stack Engineer specializing in AI-driven RAG applications

Menlo Park, CA1y exp
MetaUSC
View profile
RG

Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems

San Francisco, CA5y exp
NVIDIAArizona State University
View profile
SR

Mid-level AI/ML Engineer specializing in LLM fine-tuning and RAG systems

San Francisco, CA5y exp
Scale AIConcordia University
View profile
SL

Senior Software Engineer specializing in AI/LLM and distributed cloud systems

Renton, WA12y exp
MicrosoftUC San Diego
View profile
ML

Senior Software Engineer specializing in full-stack platforms, MLOps, and LLM search

Foreman, Arkansas10y exp
IQVIAUniversity of Florida
View profile
JS

Senior Data Scientist specializing in Generative AI and conversational AI

Chicago, IL12y exp
MozillaUniversity of Michigan
View profile
Venu Dave - Junior Software Development Engineer specializing in backend data platforms and LLM applications in New York, NY

Venu Dave

Screened ReferencesStrong rec.

Junior Software Development Engineer specializing in backend data platforms and LLM applications

New York, NY3y exp
AmazonNortheastern University

Amazon internship experience building and shipping an end-to-end NL-to-SQL system: ingested/normalized metadata across 60+ internal tables, added rigorous multi-layer validation for LLM-generated SQL, and served it via a FastAPI backend for engineers—driving 90%+ faster dataset discovery and ~70% lower effort to access data. Also built an early-stage RAG-based healthcare assistant, iterating on chunking, embeddings, and retrieval to improve answer quality post-launch.

View profile
AC

Director of AI/ML Engineering specializing in MLOps, data platforms, and 3D computer vision

Teaneck, NJ10y exp
AetrexColumbia University

Backend/data engineer focused on production ML/LLM systems: built a real-time FastAPI inference API on Kubernetes with strong reliability patterns (timeouts, idempotent retries, centralized error handling). Delivered AWS platforms using EKS + Lambda with GitHub Actions/Helm CI/CD and built Glue-based ETL from S3/Kafka into Snowflake with schema evolution and data-quality controls; also modernized legacy analytics/recommendation workflows into Python services with safe, feature-flagged cutovers.

View profile

Need someone specific?

AI Search