Vetted FAISS Professionals

Pre-screened and vetted.

VM

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and Clinical AI

Chicago, Illinois4y exp
OptumIllinois Institute of Technology

Built and productionized a HIPAA-compliant LLM+RAG Clinical AI assistant at Optum, fine-tuning GPT/LLaMA on de-identified patient notes and integrating FAISS/Pinecone for sub-second retrieval; reported to cut diagnosis time by ~20 minutes per case. Experienced in orchestrating ML pipelines (Airflow, AWS Step Functions, Azure Data Factory) and in reliability techniques for LLM systems (grounding, citations, confidence filters, monitoring) while partnering closely with clinicians and compliance teams.

View profile
KS

Mid-level AI/ML Engineer specializing in Generative AI and LLMOps

USA6y exp
UnitedHealth GroupKent State University

Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.

View profile
MR

Mid-level AI/ML Engineer specializing in enterprise ML, MLOps, and Generative AI

Springfield, Missouri5y exp
O'Reilly Auto PartsSaint Louis University

ML/LLM engineer who has shipped production RAG systems (LangChain + HF Transformers + FAISS) with hybrid retrieval and cross-encoder re-ranking, deployed via FastAPI/Docker/Kubernetes and monitored with MLflow. Also partnered with wealth advisors at Edward Jones to deliver a client retention model with SHAP-driven explanations and a dashboard that improved trust, adoption, and reduced high-value client churn.

View profile
NV

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and real-time fraud detection

4y exp
U.S. BankUniversity of Massachusetts Dartmouth

GenAI/ML engineer who has shipped production agentic systems in highly regulated and high-throughput environments, including an AWS Bedrock-based fraud/compliance workflow at U.S. Bank with PII redaction and hallucination detection that cut investigation time by 50%+. Also built and evaluated RAG and recommendation systems at Target, using RAGAS-driven testing, hybrid retrieval with re-ranking, and SHAP explainability dashboards to align model behavior with merchandising business KPIs.

View profile
MR

Mid-level GenAI Engineer specializing in production AI agents and evaluation pipelines

Overland Park, Kansas5y exp
MinutentagWilmington University

Built and shipped a production LLM-powered internal operations automation platform using LangChain RAG (Pinecone) and FastAPI microservices, deployed on AWS EKS, serving 10k+ daily interactions. Implemented a rigorous evaluation/observability stack (golden datasets, prompt regression tests, MLflow, retrieval metrics, hallucination monitoring) that drove hallucinations below 2% and improved reliability, and partnered closely with non-technical ops leaders to cut manual lookup work by 60%+.

View profile
RK

Ram Kottala

Screened

Mid-level Data & GenAI Engineer specializing in lakehouse, streaming, and RAG platforms

Michigan, USA5y exp
FordWebster University

Built a production internal LLM-powered knowledge assistant using a RAG architecture (Python, LLM APIs, cloud services) that answers employee questions with sourced, grounded responses from internal documents. Demonstrates strong practical depth in retrieval tuning (chunking/metadata filters), orchestration with LangChain, and production reliability practices (latency optimization, automated embedding refresh, evaluation metrics, logging/monitoring) while partnering closely with non-technical operations teams.

View profile
Jaideep bommidi - Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps in Denton, TX

Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps

Denton, TX8y exp
Webster BankUniversity of North Texas

Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.

View profile
Bryan West - Senior Software Engineer specializing in AI, cloud infrastructure, and full-stack development in Chantilly, VA

Bryan West

Screened

Senior Software Engineer specializing in AI, cloud infrastructure, and full-stack development

Chantilly, VA17y exp
West Consulting LLCHoward University

ML/NLP engineer who built a production system that converts large-scale unstructured text into a connected, searchable knowledge base using spaCy + Sentence Transformers/FAISS and a Neo4j knowledge graph, with BERTopic and XGBoost for organization/labeling. Strong focus on production-grade Python workflows (FastAPI/Celery, Pydantic validation, Docker, AWS ECS/Lambda) and robust entity resolution with measurable precision/recall and human review for low-confidence matches.

View profile
ESHWANTH D. G - Mid-level Robotics Software Engineer specializing in autonomous perception and sensor fusion in CA, USA

ESHWANTH D. G

Screened

Mid-level Robotics Software Engineer specializing in autonomous perception and sensor fusion

CA, USA4y exp
HoneywellUniversity at Buffalo

Robotics engineer with Honeywell and Tata Motors experience deploying ROS/ROS2 autonomous mobile robot fleets into live factory environments, integrating sensors, safety PLCs, and on-prem services. Known for solving end-to-end latency and stability issues (including network spikes under load) using gRPC, Docker, and improved diagnostics—cutting diagnosis time from hours to minutes and achieving sub-150 ms control response.

View profile
HG

Harsh Gupta

Screened

Mid Software Engineer specializing in backend systems and FinTech

Mumbai, India3y exp
Oracle Financial ServicesUniversity of Florida

Backend engineer with experience spanning regulated financial systems at Oracle Financial Services and early-stage product building at Apli.ai. Has owned production onboarding infrastructure end-to-end, improved reliability through strong observability and incident response, and also built AI-backed backend workflows using AWS, Bedrock, and RAG.

View profile
Akhila Kannegari - Mid-level AI/ML Engineer specializing in FinTech and retail ML systems in Alabama, USA

Mid-level AI/ML Engineer specializing in FinTech and retail ML systems

Alabama, USA4y exp
Wells FargoAuburn University at Montgomery

ML-focused candidate with strong Wells Fargo experience building production fraud systems and internal GenAI tools for fraud analysts. Stands out for measurable impact in fraud detection—raising recall from 71% to 88%—while also demonstrating hands-on depth across streaming infrastructure, MLOps, LLM/RAG implementation, and Python service architecture.

View profile
Naveena Musku - Mid-level AI/ML Engineer specializing in agentic AI and LLM systems

Naveena Musku

Screened

Mid-level AI/ML Engineer specializing in agentic AI and LLM systems

5y exp
Western UnionJawaharlal Nehru Technological University

Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.

View profile
SS

Mid-level AI Engineer specializing in LLMs, RAG, and content automation

Los Angeles, CA3y exp
Cloud9USC

AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.

View profile
TM

Tejal Mane

Screened

Mid-level Machine Learning Engineer specializing in GenAI, LLMs, and real-time ML systems

Moundsville, WV4y exp
CitiusTechUniversity of Michigan

Built and deployed a production long-form article summarization system using BART/T5/PEGASUS, tackling real-world constraints like token limits, latency/quality tradeoffs, and factual drift via chunking/merge logic and constrained decoding. Uses pragmatic Python-based pipeline orchestration (scheduled jobs, modular scripts, logging/retries) and iterates with stakeholder feedback to make outputs genuinely useful for content workflows.

View profile
KK

Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps

Remote, United States6y exp
AccentureEastern Illinois University

LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).

View profile
VH

Mid-level ML/AI Engineer specializing in NLP, RAG pipelines, and financial risk & fraud systems

USA3y exp
FintaUniversity at Buffalo

Built and shipped LLM/RAG systems in finance and startup settings, including a Goldman Sachs document intelligence platform that indexed ~8TB of regulatory filings and delivered cited, conversational answers with <2s latency—cutting compliance research by ~4.5 hours per batch. Also developed LangChain-based agent workflows at Finta to automate CRM enrichment and investor lookup with strong testing, tracing (LangSmith), privacy guardrails, and auditability.

View profile
KE

Kamal Ede

Screened

Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines

MO, USA4y exp
S&P GlobalUniversity of Central Missouri

Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.

View profile
AK

Ansh Krishna

Screened

Intern Data Scientist specializing in ML systems and LLM-powered analytics

Noida, India1y exp
Data Security Council of IndiaUSC

Built an autonomous decision analytics LLM agent for end-to-end tabular binary classification, using RAG (FAISS) to retain context across multi-step queries. Deployed as a FastAPI service with production-style reliability features (schema-aware validation, fallbacks, retries, structured outputs) plus offline/online evaluation and monitoring to reduce analysis time and improve consistency versus stateless approaches.

View profile
VJ

Vedant Jagtap

Screened

Junior AI/NLP Engineer specializing in LLM systems and RAG

New York, NY1y exp
NYU’s Center for Social Media, AI, and PoliticsNYU

LLM/agent engineer who shipped a two-stage AI recruitment screening platform at Foursquare that automated resume ingestion through behavioral assessment, delivering an 85% reduction in screening time across 5,000+ applications with auditability and confidence-gated decisions. Also built a multi-agent benchmarking framework using MCP tool interfaces and a RAGAS + LangSmith evaluation/observability stack, including async re-architecture that cut production latency by 50%.

View profile
Rushir Bhavsar - Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training

1y exp
Cadence Design SystemsArizona State University

Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.

View profile
SASIREKHA GULIPALLI - Mid-level Data Analyst specializing in procurement, supply chain analytics, and applied machine learning in Alpharetta, GA

Mid-level Data Analyst specializing in procurement, supply chain analytics, and applied machine learning

Alpharetta, GA4y exp
MotrexGeorgia State University

Strategic sourcing professional specializing in seasonal apparel supply chains, combining Coupa/JD Edwards analytics with Excel/Python modeling and Power BI dashboards to drive cost reduction and OTIF gains. Notable for rapid mitigation of a 10-day factory delay affecting 12 holiday SKUs (preserved 95% of revenue) and for automating PO workflows to cut cycle time by 4.2 days and improve OTIF by 15%.

View profile
Arya Mane - Junior Full-Stack & AI/ML Engineer specializing in LLMs and multimodal document processing in Dallas, Texas

Arya Mane

Screened

Junior Full-Stack & AI/ML Engineer specializing in LLMs and multimodal document processing

Dallas, Texas1y exp
Receptro.AIUniversity of Texas at Dallas

Built a production RAG-based NBA player scouting assistant that embeds player profiles into FAISS, orchestrates retrieval and LLM recommendations with LangChain, and surfaces results via embedded Tableau dashboards. Demonstrates strong focus on evaluation/monitoring (batch tests, LLM-as-judge, latency/failure/token metrics) and has experience translating non-technical founder goals into DAPT + fine-tuning plans on curated data.

View profile
Sana Khan - Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech in Oklahoma, USA

Sana Khan

Screened

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech

Oklahoma, USA4y exp
Capital OneOklahoma Christian University

ML/LLM engineer who has deployed a production LLM-powered assistant for intent classification and query routing (order recommendation/support deflection), combining BERT fine-tuning with an embedding-based retrieval layer and optimizing for low-latency inference. Experienced with end-to-end reliability practices—Airflow-orchestrated ETL, data validation/alerting, MLflow experiment tracking, and iterative improvements driven by user feedback and monitoring.

View profile

Need someone specific?

AI Search