Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted FAISS Professionals

Pre-screened and vetted.

FAISS Python Docker SQL LangChain CI/CD

Prachi Jain

Screened

Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps

Remote, US6y exp

JPMorgan ChaseUniversity of Massachusetts Amherst

“Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.”

Python C C++CUDA SQL MATLAB+129

View profile

Nikita Vivek Kolhe

Screened

Junior Data & Machine Learning Engineer specializing in MLOps and NLP

Los Angeles, United States1y exp

WorkUpUSC

“ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).”

Python SQL C C++R MATLAB+90

View profile

Zufeshan Imran

Screened

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA10y exp

SOTER AIUC San Diego

“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”

Machine Learning Deep Learning Generative AI Transformers Large Language Models (LLMs)Retrieval-Augmented Generation (RAG)+120

View profile

Shanmukha Koganti

Screened

Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision

Bay Area, CA6y exp

ShopifyUniversity of North Texas

“ML/AI engineer with production experience at Shopify and Intel, building a deep learning product ranking system that lifted add-to-cart ~14% and serving real-time similarity search via FAISS+Redis under <20ms latency at massive scale. Also deployed computer vision models to 100+ retail edge locations using Docker/Ansible/k3s with zero-downtime rollouts, and applies strong MLOps practices (A/B testing, canary/shadow, observability) plus performance optimization (OpenVINO, INT8).”

A/B Testing Agile Ansible Apache Kafka Apache Spark AWS+170

View profile

Amit Sharma

Screened

Principal Software Engineer specializing in AI/LLM platforms, payments, and healthcare systems

San Francisco, CA25y exp

FambotUniversity of Delhi

“Engineering player-coach who recently shipped an agent-based workflow to extract key info from unstructured web data (browser agents + CDP) and populate daily digests/calendars, owning architecture through testing. Also built a Flask-based LLM evaluation and regression testing system using G-Eval/Confident AI dashboards, and applies a rigorous, research-driven approach to selecting third-party tools with stakeholder buy-in; has healthcare ops/onboarding workflow experience at Vivio Health.”

Python FastAPI Flask Django Pandas NumPy+146

View profile

Harsh Sanas

Screened

Intern-level Software Engineer specializing in GenAI, RAG, and backend systems

San Francisco, CA2y exp

Scale AIUSC

“AI/LLM engineer focused on shipping production-grade agents that automate support, sales intake, and ERP-connected workflows. Stands out for combining strong orchestration and guardrails with measurable business outcomes, including 45% faster support handling, ~$1.2M annual savings, 18% higher customer satisfaction, and 99.5%+ reliability in production.”

Python TypeScript Go PostgreSQL MongoDB MySQL+277

View profile

Kevin Cruz

Screened

Senior Gen AI Engineer specializing in agentic LLM systems

Tempe, AZ15y exp

OpendoorUSC

“Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.”

Generative AI Multi-Agent Systems Prompt Engineering Vector Databases LangChain LangGraph+396

View profile

Jeevan aher

Screened

Junior AI Engineer specializing in fraud detection, credit risk, and LLMs in FinTech

Remote, USA3y exp

JPMorgan ChaseUniversity of Illinois Urbana-Champaign

“AI engineer with production experience building a high-accuracy (98%) fraud detection system operating at real-time latency (1–2s) over millions of transactions, using a multi-model pipeline approach to meet performance constraints. Also implemented Airflow-orchestrated workflows (DAGs, retries, alerts) to replace brittle cron scripts and is currently pursuing a master’s project on real-time ASL-to-text conversion.”

Python R SQL JavaScript Bash C+107

View profile

Cassandra Sullivan

Screened

Intern Data Scientist specializing in generative AI and forecasting

San Francisco, CA5y exp

Aurora AIUniversity of Chicago

“ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.”

A/B Testing Automation Classification Dashboarding Data Cleaning Data Visualization+109

View profile

Harsh Chaudhari

Screened

Intern Software Engineer specializing in ML/NLP and LLM applications

Boulder, CO0y exp

SplunkUniversity of Colorado Boulder

“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”

C++Data Pipelines Data Preprocessing Docker Embeddings FAISS+70

View profile

Vamshikrishna Bandi

Screened

Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems

6y exp

PayPalTrine University

“Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.”

A/B Testing Agile AWS Azure Machine Learning BigQuery Caching+138

View profile

Praveen Nutulapati

Screened

Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems

New York, NY6y exp

JPMorgan ChaseUniversity of Central Missouri

“Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.”

A/B Testing Agile Amazon Bedrock Amazon EC2 Amazon EMR Amazon RDS+184

View profile

Vedant Kharwal

Screened

Intern AI/ML Engineer specializing in Generative AI and applied machine learning

Mumbai, India1y exp

LTIMindtreeBoston University

“New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.”

Agentic AI Algorithms Angular API Development Artificial Intelligence Authentication+93

View profile

Sirisha Maddikunta

Screened

Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions

O Fallon, MO6y exp

MastercardUniversity of Texas at Arlington

“Built and owned an end-to-end LLM-powered fraud investigation assistant that automated case summaries and risk analysis, cutting analyst investigation/documentation time by 40%. Stands out for translating RAG concepts into a production-grade internal platform with strong evaluation, monitoring, and reusable Python service architecture that improved both analyst trust and engineering velocity.”

Generative AI Natural Language Processing Computer Vision Prompt Engineering Retrieval-Augmented Generation LoRA+234

View profile

Victor Pirie

Screened

Senior AI/ML Engineer specializing in LLMs, NLP, and enterprise conversational AI

Des Moines, IA11y exp

AssistRxMonash University

“Built and owned a production conversational AI platform for a healthcare contact center, including RAG-based agent assist, hybrid retrieval, safety guardrails, and production monitoring. Stands out for combining LLM product delivery with strong operational rigor, driving a reported 25-30% improvement in handling time in a sensitive healthcare environment.”

Python PyTorch SQL Bash Go Scala+229

View profile

Lekha Karanam

Screened

Mid-level AI/Analytics Product & Data Professional specializing in LLM and dashboard automation

Dallas, TX3y exp

Goldman SachsUniversity of Texas at Dallas

“Built and shipped open-source LLM/RAG systems, including a generative AI assistant grounded on ~30,000 scraped university web pages, improving response accuracy ~30% by moving from TF-IDF-only retrieval to a hybrid sentence-transformer approach with fallback controls. Also partnered with non-technical leadership at Securi.ai to deliver real-time predictive analytics dashboards (Elasticsearch + Jira/ServiceNow) that reduced project overhead by 18%.”

Python SQL R Scikit-learn TensorFlow PyTorch+61

View profile

Harish Gaddam

Screened

Mid-level AI/ML Engineer specializing in LLM agents and RAG systems

Dallas, TX5y exp

VerizonUniversity of Texas at Arlington

“LLM/agentic systems builder at Verizon who deployed a LangGraph-orchestrated multi-agent ticket-automation platform with RAG (FAISS) to replace brittle rule-based bots. Improved routing correctness by ~30–40%, hit ~300ms latency targets via model routing, and reduced ops workload by ~60% through tight iteration with non-technical stakeholders and strong testing/observability practices.”

AWS AWS Lambda Automation Backend Development CI/CD Collaboration+103

View profile

Akshay Koneti

Screened

Mid-Level Full-Stack Software Engineer specializing in AWS cloud and microservices

Dallas, TX6y exp

AmazonUniversity of North Texas

“Backend/LLM engineer who built a production-critical Amazon Bedrock + RAG correction and compliance layer for employee communications, integrating tightly with existing Spring Boot/AWS microservices to reduce manual review while keeping outputs explainable and auditable. Also designed an event-driven system processing 10M+ events/day (SQS/Lambda/DynamoDB/Elasticsearch) and handled on-call incidents with strong observability and reliability patterns (idempotency, retries, hotspot mitigation).”

Java Python JavaScript TypeScript JSON XML+138

View profile

Raghav Konduri

Screened

Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems

NJ, USA4y exp

Scale AIRowan University

“Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.”

A/B Testing Agile Amazon CloudWatch Amazon EC2 Amazon EKS Amazon Kinesis+151

View profile

Akhil Chippalthurthy

Screened

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and predictive analytics

New Jersey, USA5y exp

JPMorgan ChaseStevens Institute of Technology

“GenAI/LLM engineer who architected and deployed a production RAG “research assistant” for JPMorgan Chase’s regulatory compliance team, focused on safety-critical behavior (mandatory citations, refusal when evidence is missing). Deep hands-on experience with LlamaIndex, Pinecone, Hugging Face embeddings, LangGraph agent workflows, and metric-driven evaluation (golden sets, TruLens), including a reported 28% relevancy lift via cross-encoder re-ranking.”

Python R SQL Jupyter Notebook LightGBM XGBoost+172

View profile

SHREY MATHUR

Screened

Mid-level Machine Learning Engineer specializing in LLMs and AI products

Sunnyvale, CA6y exp

TCSUCLA

“Applied ML/LLM engineer currently building AppleCare’s production chat recommender, owning the full lifecycle from transcript cleaning and fine-tuning through distributed deployment, monitoring, and iterative improvement. Their work delivered >10% copy-count improvement, 5% lower modification rate, 60% cost reduction, and $1.1M profitability in 2025, and they also created a reasoning-data generation approach that enabled a reasoning model and a judge model that cut eval time by over 99%.”

Data preprocessing Deep Learning LoRA LangChain Retrieval Augmented Generation Hugging Face+138

View profile

Apoorva Nanabolu

Screened

Senior Data Scientist / Generative AI Engineer specializing in fraud, risk, and MLOps

5y exp

PayPalUniversity of New Haven

“Built and deployed a production LLM/RAG fraud investigation system to replace manual investigator workflows, combining transaction data, historical cases, and policy documents with agent-style steps and LoRA fine-tuning. Demonstrates strong reliability engineering (grounding, citations, abstention paths), performance optimization (retrieval/indexing/caching), and end-to-end MLOps orchestration using Azure ML Pipelines/MLflow plus Kubernetes/Argo with canary and rollback deployments.”

Python R SQL NoSQL Snowflake BigQuery+178

View profile

Shreya Andela

Screened

Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise data platforms

5y exp

JPMorgan ChaseUniversity of North Texas

“Built and shipped a production LLM-powered RAG assistant for enterprise internal document search (PDFs, knowledge bases, structured data), addressing real-world issues like noisy documents, hallucinations, and latency with grounded prompting, retrieval-confidence fallbacks, and performance optimizations. Also partnered with compliance and business teams at JPMc to deliver a solution aligned with regulatory constraints, supported by monitoring, feedback loops, and systematic evaluation.”

Python R SQL FastAPI ETL Pipelines Unit Testing+156

View profile

Vishnu Varma

Screened

Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps

Milpitas, California8y exp

DatabricksCampbellsville University

“AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.”

Python SQL PySpark Bash TensorFlow PyTorch+106

View profile

Machine Learning Engineers Software Engineers Data Scientists AI Engineers Generative AI Engineers Data Engineers AI & Machine Learning Engineering Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?