Vetted FAISS Professionals

Pre-screened and vetted.

NK

Nandini Kosgi

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and fraud/risk analytics in Financial Services

PA, USA4y exp
Capital OneRobert Morris University

Built and shipped a production-grade GenAI Fraud & Compliance Investigation Copilot for a large US bank, integrating OCR docs, structured data, and prior case history to generate grounded, regulator-friendly summaries and red-flag highlights. Demonstrates strong end-to-end LLM systems engineering (LangGraph/LangChain, hybrid retrieval with FAISS+BM25, guardrails/citations, streaming/latency optimization) plus rigorous evaluation and close partnership with compliance stakeholders.

View profile
Travoy Spelling - Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP in Texarkana, TX

Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP

Texarkana, TX10y exp
TredenceUniversity of Texas at Austin

ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).

View profile
Rohith Sadanala - Mid-level Machine Learning Engineer specializing in Generative AI and MLOps in Missouri, USA

Mid-level Machine Learning Engineer specializing in Generative AI and MLOps

Missouri, USA3y exp
AirbnbUniversity of South Florida

LLM/agent engineer who has shipped production RAG chatbots in sustainability-focused domains, including a packaging recommendation assistant that standardized messy user inputs and used Pinecone-backed retrieval over product/regulatory data. Experienced orchestrating end-to-end ML workflows with Airflow and AWS Step Functions/Lambda, emphasizing reliability (property-based testing, circuit breakers, OpenTelemetry) and measurable performance (latency/cost). Partnered closely with non-technical leadership to ship 3 weeks early, driving adoption by 150+ businesses and ~20% reported waste reduction.

View profile
Byron Pineda - Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps in Pascagoula, MS

Byron Pineda

Screened

Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps

Pascagoula, MS10y exp
TuringMississippi State University

Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.

View profile
Ranganayak Meravath - Mid-level Generative AI Engineer specializing in RAG, agentic copilots, and regulated AI

Mid-level Generative AI Engineer specializing in RAG, agentic copilots, and regulated AI

5y exp
LPL FinancialUniversity of North Texas

Senior engineer who built and productionized an Azure-based Enterprise AI Copilot for financial/compliance teams, focused on grounded, auditable answers with citations to reduce hallucinations in regulated workflows. Experienced designing multi-step agent orchestration and improving reliability through targeted iterations (e.g., fixing chunking/parsing to materially improve citation accuracy), plus building defensive pipelines for messy ERP/operational finance data.

View profile
PP

Senior Backend Software Engineer specializing in cloud, microservices, and AI systems

Richardson, TX8y exp
The University of Texas at DallasUniversity of Texas at Dallas

Built an AI-powered job outreach application for his own job search and took it from idea to production use, owning architecture, FastAPI backend, retrieval/generation pipeline, frontend workflow, deployment, and iteration. Especially compelling for teams needing a pragmatic full-stack engineer who can turn LLM-based product ideas into usable, maintainable tools with measurable workflow impact.

View profile
TY

Timothy Yeav

Screened

Senior AI/ML Engineer specializing in Generative AI and FinTech

Bronx, NY8y exp
InsitroNew York City College of Technology (CUNY)

Built end-to-end LLM/RAG systems for biological data and scientific literature analysis in a drug discovery setting, helping researchers explore disease insights and treatment hypotheses faster. Combines applied GenAI product work with strong production engineering, including monitoring, retrieval optimization, reusable Python services, and scalable deployment on AWS/Kubeflow.

View profile
SA

Mid-level Full-Stack Engineer specializing in AI-driven data platforms

Santa Barbara, CA5y exp
UberUniversity of Alabama at Birmingham

Full-stack engineer with 5+ years of experience who built real-time data visualization and analytics systems at Uber, spanning React/TypeScript frontends, Node/GraphQL services, Kafka pipelines, and PostgreSQL. Particularly compelling for teams needing a hands-on builder who can turn ambiguous customer needs into scalable products, and who has also applied RAG with LangChain/OpenAI over 1.8M support files to surface actionable insights.

View profile
CS

Mid-level Applied AI Engineer specializing in LLM infrastructure and model optimization

San Jose, CA3y exp
AMDUSC

LLM engineer who has deployed privacy-preserving, real-time workplace risk monitoring over massive enterprise chat/email streams, tackling latency, hallucinations, and extreme class imbalance with model benchmarking, RAG + fine-tuning, and a pre-filter alerting layer. Also built an agentic legal contract drafting system (Jurisagent) using LangGraph/LangChain with deterministic multi-agent control flow, structured outputs, and reliability-focused evaluation/telemetry.

View profile
VS

Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps

5y exp
Capital OneUniversity of the Cumberlands

AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.

View profile
PJ

Prachi Jain

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

Remote, US6y exp
JPMorgan ChaseUniversity of Massachusetts Amherst

Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.

View profile
NV

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States1y exp
WorkUpUSC

ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).

View profile
ZI

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA10y exp
SOTER AIUC San Diego

Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.

View profile
Shanmukha Koganti - Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision in Bay Area, CA

Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision

Bay Area, CA6y exp
ShopifyUniversity of North Texas

ML/AI engineer with production experience at Shopify and Intel, building a deep learning product ranking system that lifted add-to-cart ~14% and serving real-time similarity search via FAISS+Redis under <20ms latency at massive scale. Also deployed computer vision models to 100+ retail edge locations using Docker/Ansible/k3s with zero-downtime rollouts, and applies strong MLOps practices (A/B testing, canary/shadow, observability) plus performance optimization (OpenVINO, INT8).

View profile
Amit Sharma - Principal Software Engineer specializing in AI/LLM platforms, payments, and healthcare systems in San Francisco, CA

Amit Sharma

Screened

Principal Software Engineer specializing in AI/LLM platforms, payments, and healthcare systems

San Francisco, CA25y exp
FambotUniversity of Delhi

Engineering player-coach who recently shipped an agent-based workflow to extract key info from unstructured web data (browser agents + CDP) and populate daily digests/calendars, owning architecture through testing. Also built a Flask-based LLM evaluation and regression testing system using G-Eval/Confident AI dashboards, and applies a rigorous, research-driven approach to selecting third-party tools with stakeholder buy-in; has healthcare ops/onboarding workflow experience at Vivio Health.

View profile
KC

Kevin Cruz

Screened

Senior Gen AI Engineer specializing in agentic LLM systems

Tempe, AZ15y exp
OpendoorUSC

Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.

View profile
Sanjay Santhanam - Mid-level AI Software Engineer specializing in LLMs and FinTech data systems in San Jose, CA

Mid-level AI Software Engineer specializing in LLMs and FinTech data systems

San Jose, CA4y exp
Scry AIWestcliff University

Backend/AI systems engineer focused on productionizing agentic document-processing workflows for large financial PDFs. They describe owning deployments end-to-end, combining Python, Redis, LLM function calling, RAG/ReAct-style orchestration, and strong reliability practices to deliver 80% faster processing, reduce parsing errors from 12% to ~1%, and sustain 99.9% uptime in high-concurrency environments.

View profile
JA

Jeevan aher

Screened

Junior AI Engineer specializing in fraud detection, credit risk, and LLMs in FinTech

Remote, USA3y exp
JPMorgan ChaseUniversity of Illinois Urbana-Champaign

AI engineer with production experience building a high-accuracy (98%) fraud detection system operating at real-time latency (1–2s) over millions of transactions, using a multi-model pipeline approach to meet performance constraints. Also implemented Airflow-orchestrated workflows (DAGs, retries, alerts) to replace brittle cron scripts and is currently pursuing a master’s project on real-time ASL-to-text conversion.

View profile
CS

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA5y exp
Aurora AIUniversity of Chicago

ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.

View profile
HC

Intern Software Engineer specializing in ML/NLP and LLM applications

Boulder, CO0y exp
SplunkUniversity of Colorado Boulder

Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.

View profile
Vamshikrishna Bandi - Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

6y exp
PayPalTrine University

Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.

View profile
Vedant Kharwal - Intern AI/ML Engineer specializing in Generative AI and applied machine learning in Mumbai, India

Intern AI/ML Engineer specializing in Generative AI and applied machine learning

Mumbai, India1y exp
LTIMindtreeBoston University

New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.

View profile
Praveen Nutulapati - Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems in New York, NY

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY6y exp
JPMorgan ChaseUniversity of Central Missouri

Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.

View profile
Sirisha Maddikunta - Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions in O Fallon, MO

Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions

O Fallon, MO6y exp
MastercardUniversity of Texas at Arlington

Built and owned an end-to-end LLM-powered fraud investigation assistant that automated case summaries and risk analysis, cutting analyst investigation/documentation time by 40%. Stands out for translating RAG concepts into a production-grade internal platform with strong evaluation, monitoring, and reusable Python service architecture that improved both analyst trust and engineering velocity.

View profile

Need someone specific?

AI Search