Vetted Vector Search Professionals

Pre-screened and vetted.

SP

Surya Pavan

Screened

Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications

Baltimore, MD5y exp
AcerCalifornia State University, Northridge

GenAI engineer who has deployed production LLM/RAG chatbots for internal document search, focusing on reliability (hallucination reduction via prompt guardrails + retrieval filtering) and performance (latency improvements via caching). Experienced with LangChain/LangGraph orchestration for multi-step agent workflows and iterates using monitoring/logs and benchmark-driven evaluation while partnering closely with product and business teams.

View profile
SG

Entry-Level AI/ML Engineer specializing in LLM apps, RAG pipelines, and production ML systems

1y exp
iFrog Marketing SolutionsUC San Diego

AI/LLM practitioner at iFrog Marketing Solutions who drove a RAG chatbot from prototype to production in a legacy, AI-resistant environment by validating customer needs and building a business case. Implemented production-grade LLM practices (CI/CD eval gating, rollbacks, prompt/context engineering) and led internal workshops to bring non-AI-native developers up to speed while partnering with sales on tailored demos to drive adoption.

View profile
VM

Mid-level Machine Learning & Full-Stack Engineer specializing in GenAI platforms

San Francisco, CA5y exp
WellDhanNortheastern University

LLM/agent builder who has shipped production AI systems in the wellness space, including an LLM-powered food tracking product used by 5000+ users and a voice/call-routing onboarding workflow using LangGraph/LangChain with LiveKit and Twilio. Strong focus on practical reliability work: latency reduction, retrieval/embedding tuning, and CI-driven evaluation with simulations and metrics.

View profile
HC

Senior Full-Stack Developer specializing in Python microservices and cloud-native AWS deployments

Dallas, Texas5y exp
ComcastUniversity of North Texas

Backend engineer with hands-on ownership of FastAPI/Django services using MongoDB and React integration, focused on production reliability and performance (Redis caching, Celery background jobs, automated testing). Has delivered AWS container deployments via GitHub Actions to ECR with scripted rollouts/health checks, and supported phased migrations with replication and rollback planning. Also built a real-time user-activity streaming pipeline addressing partition hot spots and consumer lag through partition-key strategy, idempotency, and monitoring.

View profile
SA

Sean Aguinaga

Screened

Senior Full-Stack Software Engineer specializing in mobile, healthcare, and UX

11y exp
Smart Code SolutionsSanta Monica College

Former co-founder at PreConception (acquired) who partnered closely with Operations, Legal, and Medical teams to deliver a HIPAA-compliant product meeting technical and regulatory requirements. Motivated by mission and team fit, and interested in a Venture Studio CTO path with a focus on 0-to-1 building and early validation via beta testing/PMF.

View profile
Ramcharan Reddy - Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices in Texas, USA

Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices

Texas, USA6y exp
Morgan StanleyUniversity of Central Missouri

Backend engineer focused on AI-enabled systems, having built a production-style RAG pipeline (vector search + LLM) exposed via Python/Flask endpoints with strong observability and hallucination-reduction techniques. Demonstrates deep performance work in PostgreSQL/SQLAlchemy (5x faster analytics queries) and high-throughput optimization using Celery + Redis (800ms to 120ms latency, 3x throughput), plus schema-per-tenant multi-tenancy with tenant-aware middleware and logging.

View profile
Hemanth Kumar Gajagiri - Mid-level Full-Stack AI Engineer specializing in agentic systems and scalable platforms in San Francisco, CA

Mid-level Full-Stack AI Engineer specializing in agentic systems and scalable platforms

San Francisco, CA6y exp
GE HealthCareWilliam Jessup University

AI-focused full-stack/DevOps engineer who goes beyond using copilots and has built production-oriented LLM systems such as natural-language-to-SQL and structured insight extraction pipelines. Stands out for treating AI as an accelerator rather than a replacement, with a strong emphasis on guardrails, validation, observability, and safe deployment practices in agent-based and distributed systems.

View profile
TM

Tarun Majhi

Screened

Mid-level AI Software Engineer specializing in FinTech and LLM systems

Massachusetts, USA4y exp
State StreetClark University

Engineer with hands-on experience designing and leading multi-agent AI development workflows, including a LangGraph-based system that automated parts of a RAG pipeline and significantly reduced development time. Stands out for treating AI agents like an engineering team, with clear architecture, handoff schemas, validation, and supervisor-driven conflict resolution.

View profile
SV

Mid-level Generative AI Engineer specializing in LLMs and RAG systems

5y exp
Summit Design and TechnologyNorthwest Missouri State University

Built and shipped a production RAG-based enterprise knowledge assistant to replace slow/inaccurate search across millions of documents, using LangChain orchestration with GPT-4/LLaMA and vector databases. Strong focus on production constraints—latency, hallucination control, and cost—using hybrid retrieval, guardrails, LLM-as-judge validation, and model routing, and has experience translating non-technical stakeholder pain points into measurable outcomes.

View profile
SP

SASI PAILA

Screened

Mid-level AI/ML Engineer specializing in Generative AI and production ML systems

PA, USA4y exp
BNY MellonFranklin University

Built and deployed a production SecureAIChatBot (RAG-based) for secure internal information retrieval, using embeddings/vector search, GPT models, monitoring, and safety filters. Focused on real-world production challenges like latency and output consistency, applying caching, retrieval scoping, smaller models, and controlled prompting, and used LangChain to orchestrate the end-to-end workflow.

View profile
NA

Mid-level Full-Stack Software Engineer specializing in AI platforms and microservices

Mooresville, NC6y exp
Lowe'sUniversity of North Carolina at Charlotte

Backend engineer currently building an AWS Lambda/FastAPI inventory recommendation system using a LangChain + GPT-4 RAG pipeline and MongoDB vector search; drove major cost optimization via Redis caching (60% reduction) while sustaining 10k+ daily requests under 2s latency. Previously deployed Node.js microservices on AWS OpenShift with Jenkins/Helm at UnitedHealth Group and led a zero-downtime monolith-to-microservices migration at Verizon, including RabbitMQ-based real-time messaging with DLQs and idempotency.

View profile
KG

Senior AI Engineer specializing in Agentic AI and distributed systems

Charlotte, NC4y exp
UnitedHealth GroupUniversity of North Carolina at Charlotte

LLM/agentic workflow engineer with healthcare domain experience who built a HIPAA-compliant multi-agent RAG system for clinical review automation at UnitedHealth Group, achieving 92% precision and cutting latency 40% through async orchestration and Redis semantic caching. Also has strong data engineering orchestration background (Airflow on AWS EMR with Great Expectations) and a proven clinician-in-the-loop feedback process that improved model faithfulness by 18%.

View profile
SV

Sathvik Vanja

Screened

Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines

Overland Park, KS3y exp
HCA HealthcareVNR Vignana Jyothi Institute of Engineering and Technology

Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.

View profile
VA

Mid-level Data Scientist specializing in Generative AI and NLP for financial risk

Glassboro, NJ4y exp
S&P GlobalRowan University

Built and shipped production generative AI/RAG assistants in regulated financial contexts (S&P Global), automating compliance-oriented Q&A over earnings reports/filings with grounded answers and citations. Experienced across the full stack—AWS-based ingestion (PySpark/Glue), vector retrieval + LangChain agents, GPT-4/Claude model selection, and production reliability (monitoring, caching, retries) plus rigorous evaluation and regression testing.

View profile
NK

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in NLP, RAG systems, and real-time risk modeling

PA, USA4y exp
Capital OneRobert Morris University

AI/ML Engineer with 4+ years of experience (Capital One, Odin Technologies) and a master’s in Data Analytics (4.0 GPA) who has deployed LLM/RAG systems to production for compliance/risk and document review. Strong in orchestration and MLOps (Airflow, Kubernetes, MLflow, GitHub Actions) and in tackling real-world LLM constraints like latency, context limits, and data privacy, with measurable impact (20%+ manual review reduction; 33% faster release cycles).

View profile
Daniel Berhane Araya - Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance in Fairfax, VA

Senior AI/ML Engineer specializing in production-grade LLM systems for regulated finance

Fairfax, VA9y exp
George Mason UniversityGeorge Mason University

AI/LLM engineer with published work who built FinVet, a production financial misinformation detection system using multi-pipeline RAG, confidence-based voting, and evidence-backed outputs (F1 0.85, +37% vs baseline). Also built NexusForest-MCP, a Dockerized Model Context Protocol server exposing structured global deforestation/carbon data via SQL tools for reliable LLM tool use. Previously delivered borrower risk-rating (PD) models at BMO Financial Group that were validated and integrated into an enterprise credit system through close collaboration with credit officers and portfolio managers.

View profile
Hritvik Gupta - Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI in San Francisco, CA

Hritvik Gupta

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI

San Francisco, CA3y exp
Penn MedicineUC Riverside

Built and scaled an AI-powered voice/chat patient engagement platform at Penn Medicine from early prototype into production clinical workflows, focusing on latency, edge cases, and user trust. Strong in LLM reliability engineering (structured prompts, validation/fallbacks), real-time troubleshooting with observability, and cross-functional enablement through pilots, demos, and sales/customer partnership.

View profile
CT

Mid-level AI Engineer specializing in LLMs, MLOps, and healthcare NLP

4y exp
HCA HealthcareUniversity of South Florida

Built a production, real-time clinical documentation system at HCA that converts doctor–patient conversations into structured clinical summaries using speech-to-text, LLM summarization, and RAG. Demonstrated measurable gains from medical-domain fine-tuning (clinical concept recall +18%, ROUGE-L 0.62 to 0.74) while meeting HIPAA constraints via PHI anonymization and encryption, and deployed via Docker/FastAPI with CI/CD and monitoring.

View profile
RK

Senior AI/ML Engineer specializing in LLMs, generative AI, and applied research

Boca Raton, FL10y exp
ModMedFlorida Atlantic University

Research-heavy ML/AI candidate with a PhD/publications background who translated LLM evaluation and clinical summarization techniques into production at ModMed. They owned an end-to-end healthcare GenAI pipeline that cut clinician documentation time from ~22 minutes to ~7-8 minutes, reduced token costs by ~30%, and built an internal evaluation framework later adopted by multiple teams.

View profile
SN

Mid-level AI/ML Engineer specializing in GenAI, NLP, and financial systems

Texas, USA5y exp
CitibankConcordia University, St. Paul

GenAI/ML engineer with hands-on experience building production financial intelligence and document summarization systems at Citibank. Stands out for combining LLM fine-tuning, hybrid RAG, multi-agent workflows, and strong MLOps/observability practices to deliver measurable business impact, including 60% faster analyst retrieval, 31% higher precision, and 99%+ uptime.

View profile
AN

Alir Navid

Screened

Executive CTO specializing in FinTech, Healthcare IT, and AI platforms

Irvine, CA19y exp
AphidUniversity of Phoenix

Engineering/product leader who builds business-aligned technology roadmaps and scales pod-based orgs with strong delivery discipline (OKRs, CI/CD, QA automation). Led a SaaS supply-chain application adopted by Fortune 100 customers, citing ~$4M MRR and ~87% gross profit, and has hands-on experience standardizing LLM + cloud/MLOps architectures with security/compliance guardrails. Also created the PISEK methodology and used it to run distributed innovation sprints (e.g., an AI ETA predictor moved from pilot to production).

View profile
VM

Mid-Level Full-Stack/Backend Engineer specializing in AWS, APIs, and GenAI systems

Los Angeles, CA5y exp
AIRKITCHENZCalifornia State University, Fullerton

Backend engineer who built the core backend for Air Kitchens’ discovery/booking platform on AWS (Node + Python, DynamoDB, SQS/Lambda), optimizing for fast user-facing APIs and scalable async workflows. Introduced an AI matching service with a deterministic pre-filter + LLM ranking approach to balance latency vs quality, and has hands-on experience with production security (JWT/RBAC/RLS), CI/CD, and blue-green, staged migrations from Django to modular services.

View profile
VG

Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps

Glassboro, NJ5y exp
HCLTechRowan University

Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.

View profile
VS

Senior AI/ML Engineer specializing in Generative AI, LLMs, and MLOps

Tampa, FL9y exp
VerizonJawaharlal Nehru Technological University

Telecom (Verizon) AI/ML practitioner who built a production multimodal system that ingests messy customer issue reports (calls, chats, emails, screenshots, videos) and turns them into confidence-scored incident summaries with reproducible steps and evidence links. Also built KPI/alarm-to-ticket correlation to rank likely root-cause domains (RAN/Core/Transport), cutting triage from hours to minutes and improving MTTR.

View profile

Need someone specific?

AI Search