Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted FAISS Professionals

Pre-screened and vetted.

FAISS Python Docker SQL LangChain CI/CD

Jacqueline Zhang

Screened

Mid-level Machine Learning Engineer specializing in LLMs, fairness, and healthcare ML

Illinois, USA4y exp

iSchool Statistical ML & AI LabUniversity of Illinois Urbana-Champaign

“ML/NLP practitioner with a master’s thesis focused on domain-adaptive knowledge distillation for LLMs (LLaMA2/sheared LLaMA), showing improved perplexity and ROUGE-L on biomedical data. Also built real-world data linking and search systems: integrated ClinicalTrials.gov with FAERS using fuzzy matching + embeddings, and delivered an LLM-powered FAQ recommender at Hyperledger using sentence-transformers, FAISS, and fine-tuning to mitigate embedding drift.”

A/B Testing API Development CI/CD Computer Vision C Data Engineering+93

View profile

Chanakya rudru

Screened

Senior Machine Learning Engineer specializing in conversational AI and Generative AI

San Francisco, CA6y exp

Scale AIDallas Baptist University

“ML/AI engineer with experience at Uber and Scale AI, focused on customer service automation across both classical NLP and generative AI systems. Has owned systems from experimentation through production on AWS, including LLM fine-tuning, RAG optimization, safety evaluation, and internal Python platform tooling that improved consistency and engineering velocity.”

Python Java C++R JavaScript TypeScript+111

View profile

Shweta Chavan

Screened

Junior Computer Vision & ML Engineer specializing in autonomous perception systems

Pittsburgh, PA2y exp

Magna InternationalCarnegie Mellon University

“LLM/RAG engineer who built a production-style multi-agent orchestrator for resume-to-recommendation workflows (PDF ingestion through screening and recommendations), emphasizing prompt tuning and strict JSON output contracts. Currently building a RAG application for an NGO using Airflow (DAGs + embeddings) and tackling messy, missing/imbalanced data; has hands-on retrieval stack experience (FAISS/HNSW, bge embeddings) and uses rigorous evaluation metrics for groundedness and hallucination control.”

Python C++OpenCV MATLAB PyTorch TensorFlow+126

View profile

Yue Yang

Screened

Intern Data Scientist specializing in GenAI (LLMs, RAG) and ML model optimization

Sunnyvale, CA1y exp

SynopsysColumbia University

“Built and deployed a production LLM-powered risk assistant for KPMG and Freddie Mac that lets analysts query a confidential Neo4j risk graph in natural language (no Cypher), turning multi-day analysis into minutes with traceable, cited answers. Implemented rigorous guardrails, deterministic verification, RBAC/security controls, and a full eval/observability stack, cutting query error rate by ~50% and iterating through weekly UAT with non-technical risk analysts.”

Generative AI Large Language Models (LLMs)Retrieval-Augmented Generation (RAG)Machine Learning Deep Learning Data Science+113

View profile

Muhan Zhang

Screened

Junior AI Software Engineer specializing in LLM pipelines, OCR, and RAG

Palo Alto, USA2y exp

Platflow.AICornell University

“Built and shipped a production LLM pipeline for nursing home Medicare reimbursement (PDF OCR + fact extraction + keyword RAG + QA) that reportedly increased payouts by ~$1K/month per patient. Strong in LLM ops/benchmarking (ground truth, LLM-as-judge, cost/I-O tracking) and pragmatic optimization—swapped retrieval approaches, fine-tuned a small model to cut OCR cost 90%, and migrated workloads to Azure/Temporal to scale nightly processing 10x.”

Python JavaScript React R C++Java+89

View profile

Asrith Velireddy

Screened

Mid-level AI/ML Engineer specializing in MLOps, LLMs, and scalable ML systems

Harrison, NJ4y exp

AdobeNJIT

“ML/LLM engineer at Adobe who deployed a transformer-based personalization and campaign-targeting recommender system end-to-end, including PySpark/Airflow pipelines processing 12M+ events/day and containerized inference on AWS SageMaker (Docker/Kubernetes). Also has hands-on LLM workflow experience (RAG, semantic search, prompt optimization, hallucination mitigation) with a metrics-driven approach to reliability, drift monitoring, and reproducible retraining via MLflow.”

A/B Testing Apache Airflow Auto Scaling AWS AWS IAM AWS Lambda+123

View profile

Hamsalakshmi Ramachandran

Screened

Mid-level Data Analytics professional specializing in BI, data engineering, and applied AI

California, USA6y exp

AmazonSan Jose State University

“Built GenMedX, a multi-module clinical AI system for emergency department decision support spanning triage prediction, diagnosis, medication Q&A, and visit summarization. Stands out for combining medical LLM fine-tuning, RAG, and rigorous evaluation/monitoring to drive a major triage recall improvement from 38.5% to 76.6%, with a strong focus on safety, edge-case detection, and production reliability.”

SQL PostgreSQL MySQL Snowflake Python Pandas+167

View profile

Kevin Patel

Screened

Senior Full-Stack Engineer specializing in AI, FinTech, and Healthcare IT

Tyler, TX10y exp

BrightOpsJones College

“AI/full-stack engineer with hands-on production experience across React/TypeScript, Go, and Python, spanning an early-stage education startup and a compliance-sensitive internal healthcare data platform. Stands out for shipping LLM and retrieval-based products with measurable impact, including a 27% recommendation improvement, support for 1M+ daily events, and a 19% lift in task completion in a secure, auditable environment.”

React TypeScript JavaScript Redux Vue.js Angular+141

View profile

Kella Dhanush Venkata Sai

Screened

Junior ML Engineer specializing in Generative AI and LLM applications

Thousand Oaks, California3y exp

NVIDIACalifornia Lutheran University

“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.”

Python NumPy Pandas Scikit-Learn Matplotlib Seaborn+95

View profile

Praveen V

Screened

Mid-Level Software Engineer specializing in Generative AI and RAG systems

Remote, USA5y exp

MetaUniversity of North Carolina at Charlotte

“Built a production RAG-based natural-language-to-SQL system at Global Atlantic to replace slow, expensive manual analytics ticket workflows, focusing heavily on retrieval quality and measurable evaluation (200-question ground-truth set; recall@5 improved 0.65→0.78 via semantic chunking). Also built a custom MCP-style agent orchestrator for a personal project (arxiv-ai) to improve flexibility and Langfuse-aligned observability, and has hands-on experience with LangGraph, CrewAI, and n8n.”

Python Java C#JavaScript TypeScript PostgreSQL+105

View profile

Arwen Yang

Screened

Staff Applied Scientist specializing in multimodal LLM safety, robustness, and retrieval

Los Altos, CA8y exp

LibrAIUniversity of Melbourne

“Built a production LLM-driven archival assistant that turns large, low-quality scanned handwritten files (120+ pages) into structured datasets, overcoming context-window and hierarchy challenges with a two-phase LLM + rules pipeline and reaching 98.1% accuracy (Gemini-2.5 Flash). Also orchestrated a large human-in-the-loop effort with 78 archivists, producing 2,400 high-quality annotations in 4 days via detailed rubrics and support.”

Machine Learning Large Language Models (LLMs)Retrieval-Augmented Generation (RAG)JSON Team Leadership PyTorch+78

View profile

Ranjani Salla

Screened

Mid-level AI/ML Engineer specializing in LLMs, FinTech, and Healthcare IT

USA5y exp

StripeClark University

“Built production GenAI systems in both healthcare and financial services, including a Verily clinical platform and an Accenture financial Q&A product. Stands out for combining advanced RAG, fine-tuning, safety evaluation, and infrastructure engineering to deliver measurable gains in engagement, groundedness, hallucination reduction, and cost efficiency.”

Python SQL JavaScript TypeScript R PyTorch+122

View profile

Yashwanth J

Screened

Mid-level Software Engineer specializing in AI/ML and full-stack systems

Seattle, WA4y exp

AppleUniversity of North Texas

“Engineer with Apple experience building LLM-powered internal workflow orchestration systems using Python, LangGraph, FastAPI, Redis, vector search, and Kubernetes. Stands out for a highly pragmatic, production-focused approach to agentic systems: deterministic state management, strong guardrails, observability, and human review for high-risk actions.”

Python Java JavaScript TypeScript SQL Node.js+188

View profile

Sairam Bodapothula

Screened

Mid-level AI/LLM Engineer specializing in generative AI and ML systems

Remote, USA4y exp

NetflixMissouri University of Science and Technology

“AI/LLM-focused engineer with hands-on experience building RAG pipelines, prompt engineering workflows, and multi-agent systems using tools like LangChain. Stands out for combining AI-assisted development with production-grade validation and for leading the architecture/orchestration of agent-based recommendation systems that improved response time, accuracy, and scalability.”

PyTorch TensorFlow Hugging Face Transformers LangChain LlamaIndex OpenAI API+189

View profile

Chappidi Sasi

Screened

Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference

Bay Area, CA5y exp

NVIDIAWebster University

“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”

A/B Testing Apache Spark AWS AWS Lambda BigQuery Claude+141

View profile

Akhilesh Patil

Screened

Junior AI/ML Engineer specializing in FinTech and generative AI

Remote, USA2y exp

StripeSan Jose State University

“Built an end-to-end AI bug triage dashboard that combined React/TypeScript, FastAPI, Postgres, and classical ML to reduce manual engineering triage work by about 40%. Stands out for pragmatic, product-minded AI engineering: choosing interpretable models when they were sufficient, designing human-in-the-loop UX for trust, and separately building an agentic RAG project with vector search, Neo4j knowledge graphs, and reranking.”

Python SQL Pandas TensorFlow Scikit-learn XGBoost+183

View profile

Tanuja Ikkurthi

Screened

Mid AI/ML Engineer specializing in LLM systems and Generative AI

Texas, USA4y exp

StripeUniversity of North Texas

“Built and owned an LLM support copilot at Stripe focused on improving agent ticket resolution. Designed the backend and ML system end to end, using RAG, Redis caching, hybrid vector search, and LoRA fine-tuning to achieve 40% lower latency and 22% higher response accuracy, with continuous quality monitoring via Ragas and related evaluation frameworks.”

GPT-4 LLaMA Hugging Face Transformers LangChain LangGraph LlamaIndex+107

View profile

Lakshmi Narayana

Screened

Mid-level Data Science AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems

USA3y exp

Samsara

“Built a production RAG-based "knowledge copilot" for support/ops using LangChain/LangGraph, implementing the full pipeline (ingestion, chunking, embeddings, vector DB retrieval/rerank, guarded generation with citations) and operating it as monitored microservices with CI/CD. Also designed an event-driven, streaming backend for real-time inventory ordering predictions that reduced stockouts by 25%, and has hands-on incident response experience stabilizing LLM API latency/5xx spikes using Datadog/APM and resilience patterns.”

Agile API Development API Integration AWS AWS Lambda BERT+112

View profile