Vetted Vector Search Professionals

Pre-screened and vetted.

KP

Krisha Patel

Screened

Entry-Level Software Engineer specializing in AI/ML and Full-Stack Development

United States0y exp
TargetUniversity at Albany

Backend engineer who built an NL-to-SQL system at Target, using a multi-step LLM pipeline with vector-store schema retrieval and SQL validation to safely answer business questions. Strong in production FastAPI systems (async, Pydantic, Docker/Uvicorn, load balancing) and security (OAuth2/JWT, scopes, and database row-level security), with experience migrating Flask apps to FastAPI + PostgreSQL using strangler/feature-flagged canary rollouts.

View profile
SS

Mid-level AI Engineer specializing in LLMs, RAG, and content automation

Los Angeles, CA3y exp
Cloud9USC

AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.

View profile
VS

Principal Software Architect specializing in AI/ML and cloud-native full-stack platforms

Austin, TX17y exp
Happiest MindsAnna University

AI/LLM engineer who built a production content-generation system for nursing education, combining multimodal RAG over proprietary PDFs (including images) with structured Cosmos DB data and external sources. Strong focus on production reliability—prompt-chaining with LangChain, validation/guardrails, and Azure-based monitoring/observability—plus experience designing Azure AI agents with tool integrations like Bing Search.

View profile
AE

Ashwitha E

Screened

Junior Data Scientist specializing in fraud analytics and cloud data platforms

Dallas, TX3y exp
Bank of AmericaUniversity of North Texas

Built and deployed production LLM-powered document summarization/classification systems using embeddings, vector databases (RAG-style retrieval), and automated evaluation (BERTScore/ROUGE), with a focus on monitoring and scalable cloud pipelines. Also partnered with a fraud analytics team to deliver a transaction anomaly detection solution, translating model outputs into Power BI dashboards and actionable KPIs while iterating on thresholds and alerts based on stakeholder feedback.

View profile
VL

Vasu Lakhani

Screened

Mid-Level Software Engineer specializing in AI-enabled backend and full-stack web systems

Los Angeles, California4y exp
AIRKITCHENZCalifornia State University, Fullerton

Backend/AI workflow engineer with experience at AirKitchenz, Uber, and Vivma Software, building production systems on AWS (Lambda, DynamoDB, Step Functions). Has a track record of major performance wins (DynamoDB latency 2s to <150ms; Postgres query 2s to ~180ms) and shipping LLM-powered onboarding and ticket-routing workflows with strong guardrails (schema validation, confidence thresholds, human-in-the-loop escalation).

View profile
KE

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA4y exp
S&P GlobalUniversity of Central Missouri

Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.

View profile
VJ

Vedant Jagtap

Screened

Junior AI/NLP Engineer specializing in LLM systems and RAG

New York, NY1y exp
NYU’s Center for Social Media, AI, and PoliticsNYU

LLM/agent engineer who shipped a two-stage AI recruitment screening platform at Foursquare that automated resume ingestion through behavioral assessment, delivering an 85% reduction in screening time across 5,000+ applications with auditability and confidence-gated decisions. Also built a multi-agent benchmarking framework using MCP tool interfaces and a RAGAS + LangSmith evaluation/observability stack, including async re-architecture that cut production latency by 50%.

View profile
Joshua Hewitt - Senior Software Engineer specializing in Generative AI product development in San Francisco, USA

Joshua Hewitt

Screened

Senior Software Engineer specializing in Generative AI product development

San Francisco, USA9y exp
PadletUniversity of Sydney

AI product builder at Padlet who shipped multiple production LLM features for education workflows, including an AI document generator (AI Recipes) and a RAG-enabled in-product chat assistant. Built an AI microservice layer (LangChain) to swap model providers easily and created automated + human-in-the-loop evaluation systems (including ~100-test runs) to iterate on prompts and quality.

View profile
Kavya Aitagoni - Junior Full-Stack Engineer specializing in web platforms and live events in San Francisco, CA

Junior Full-Stack Engineer specializing in web platforms and live events

San Francisco, CA2y exp
EventbriteSyracuse University

Full-stack product engineer with experience shipping user-facing web products end-to-end, including an event analytics dashboard and checkout improvements at Eventbrite. Stands out for combining frontend polish, backend reliability, and production-minded practices like idempotent APIs, query optimization, CI/CD, logging, and monitoring to improve conversion and reduce engineering dependency.

View profile
NR

Junior Product Manager and AI/ML engineer specializing in enterprise SaaS and cloud AI

Bellevue, WA4y exp
CoreStackPenn State University

Growth-focused B2B SaaS operator with hands-on experience improving enterprise adoption for a cloud governance and FinOps platform. They combine customer discovery, ROI-driven messaging, automation, and funnel instrumentation to improve conversion and handoffs, citing an 18% lift in enterprise adoption and roughly $200K-$3M in influenced pipeline.

View profile
SV

Sai V

Screened

Mid-level Software Engineer specializing in backend systems and FinTech

Maryland, USA5y exp
Fidelity InvestmentsIllinois Institute of Technology

Built an internal RAG assistant for financial documents using FastAPI, OpenAI APIs, and vector search, improving document search speed and reducing manual effort for the business team. Stands out for a pragmatic approach to AI engineering: uses AI heavily for productivity, but keeps human judgment central and has designed retrieval, validation, and summarization workflows end-to-end.

View profile
SS

Sowmya Sree

Screened

Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps

Dallas, TX5y exp
Bank of AmericaUniversity of North Texas

Built production LLM systems including a real-time customer feedback analysis and workflow automation platform using RAG and multi-agent orchestration with confidence-based human escalation, addressing privacy and legacy integration challenges. Also automated ML operations with Airflow/Kubernetes (e.g., daily churn model retraining) cutting retraining time to under 30 minutes, and demonstrates a rigorous testing/monitoring approach plus strong non-technical stakeholder collaboration.

View profile
Qichen Zhao - Intern Software Engineer specializing in Applied AI and LLM systems in Los Angeles, CA

Qichen Zhao

Screened

Intern Software Engineer specializing in Applied AI and LLM systems

Los Angeles, CA0y exp
Search-AIUSC

Built and deployed a production RAG-based conversational "Yelp for AI tools" at Search-AI Inc., focused on personalized, explainable AI tool recommendations from thousands of options. Emphasizes production-grade reliability and performance (hybrid retrieval, async two-stage pipelines) and is also building a multi-agent orchestration layer (MAgIc) with typed memory and controlled coordination policies.

View profile
Sai Nekkanti - Mid-level Data Scientist / ML Engineer specializing in secure GenAI and financial compliance in Mount Laurel, NJ

Sai Nekkanti

Screened

Mid-level Data Scientist / ML Engineer specializing in secure GenAI and financial compliance

Mount Laurel, NJ4y exp
MetLifeRowan University

Built a production "sentinel insight engine" to tame information overload from millions of product reviews and support transcripts, combining Azure OpenAI (GPT-3.5) zero-shot classification with a fine-tuned T5 summarizer to generate weekly actionable product insights. Demonstrated strong MLOps/production engineering by adding drift monitoring with embedding-based detection, integrating REST with legacy SOAP/queue-based CRM via FastAPI middleware, and scaling reliably on Kubernetes with HPA.

View profile
Nishad Kane - Mid-level Data Scientist & AI Engineer specializing in RAG, agentic AI, and production ML

Nishad Kane

Screened

Mid-level Data Scientist & AI Engineer specializing in RAG, agentic AI, and production ML

5y exp
Xtrium AIArizona State University

AI/data engineer who built a production LLM-powered schema drift detection system (LangChain/LangGraph) to catch semantic data changes before they break downstream analytics/ML. Deployed on AWS with Docker/S3 and implemented an LLM-as-a-judge evaluation framework to improve trust, reduce hallucinations, and control false positives/alert fatigue. Collaborated with non-technical risk/business analytics stakeholders at EY by delivering human-readable drift explanations that improved confidence in financial analytics dashboards.

View profile
AR

ABHEESHT ROY

Screened

Junior Software Engineer specializing in LLMs, ML, and full-stack development

Atlanta, GA1y exp
Agent-Techs AIArizona State University

Built and shipped a production LLM-driven data harmonization/record-matching pipeline for pharmaceutical datasets, combining normalization, embeddings/vector search, and an LLM validation step. Emphasizes production reliability via guardrails, confidence thresholds, idempotent/retryable stages, and human-in-the-loop fallbacks, with monitoring focused on manual review and error rates to reduce false positives.

View profile
DP

Dhruv Pandoh

Screened

Junior Full-Stack Software Engineer specializing in AI, FinTech, and e-commerce

New York, USA2y exp
MIO PartnersNYU

Built both traditional internal tooling and LLM-powered systems during an internship, including a React/Python/AWS calculator onboarding platform and a production-style ROS2 RAG assistant over 10K+ documents. Stands out for combining full-stack delivery, stakeholder coordination, and practical AI reliability work like retrieval tuning, source-grounded answers, and low-confidence fallbacks.

View profile
Nithyashree Raghunathan - Mid-level Software Engineer in Test specializing in AI and healthcare platforms in Santa Clara, CA

Mid-level Software Engineer in Test specializing in AI and healthcare platforms

Santa Clara, CA5y exp
MetaPenn State Great Valley

QA/data pipeline engineer with hands-on AI product building experience, spanning enterprise AWS migration testing for Belgium postal services and personal multi-agent systems in fintech and recruiting. Stands out for combining rigorous validation and production stability work with modern LLM orchestration, guardrails, and messy-document normalization workflows.

View profile
KP

Senior AI Engineer specializing in Generative AI and RAG applications

8y exp
Keurig Dr PepperGeorge Mason University

AI engineer who has shipped production LLM systems across customer service and marketing use cases—building a RAG app on Azure OpenAI and speeding retrieval with Redis caching tied to Okta sessions. Also implemented a LangGraph multi-agent workflow that pulls image context from Figma to generate structured HTML marketing emails, adding a verification agent to improve image-selection accuracy while optimizing solution cost for business stakeholders.

View profile
SP

Mid-level Machine Learning Engineer specializing in LLM systems and healthcare data automation

California, USA2y exp
Prime HealthcareUSC

React performance-focused engineer who contributed performance patches back to an open-source context+reducer state helper after profiling and fixing excessive re-renders in an enterprise project management platform at Easley Dunn Productions. Also built an end-to-end LLM-driven pipeline at Prime Healthcare to normalize millions of supply-chain records, reducing defects by 80% and saving 160+ hours/month.

View profile
SM

Mid-level AI/ML Engineer specializing in Generative AI and NLP

Dallas, TX5y exp
Gilead SciencesUniversity of North Texas

AI/LLM engineer with production experience building secure, scalable compliance-focused generative AI systems (GPT-3/4, BERT) including RAG over internal regulatory document bases. Has delivered end-to-end pipelines on AWS with PySpark/Airflow/Kubernetes/FastAPI, emphasizing privacy controls, monitoring, and iterative evaluation (A/B testing). Also partnered closely with bank compliance officers using prototypes to refine NLP summarization/classification and reduce document review time.

View profile
NA

Niveditha A

Screened

Mid-level AI/ML Engineer specializing in healthcare ML and LLM/RAG systems

USA4y exp
UnitedHealth GroupBowling Green State University

AI/LLM engineer with recent production experience at UnitedHealth Group building an end-to-end RAG system over structured EMR data and unstructured clinical notes, including evidence retrieval, GPT/LLaMA-based reasoning, and a validation layer for reliability. Strong in orchestration (Kubeflow/Airflow/MLflow), prompt engineering for noisy healthcare text, and rigorous evaluation/monitoring with gold-standard benchmarking, plus close collaboration with clinical operations stakeholders.

View profile
SM

Mid-level AI/ML Engineer specializing in GenAI agents, RAG pipelines, and MLOps

USA6y exp
UnitedHealthcareKent State University

AI/ML engineer who built a production RAG-based internal document intelligence assistant (LangChain + Pinecone) to let employees query enterprise reports in natural language. Demonstrated hands-on pipeline orchestration with Apache Airflow and tackled real production issues like retrieval grounding and latency using tuning, caching, and token optimization, while partnering closely with non-technical business stakeholders through iterative demos.

View profile

Need someone specific?

AI Search