Vetted FAISS Professionals

Pre-screened and vetted.

Harsh Chauhan

Screened

Junior AI Engineer specializing in Generative AI, RAG, and NLP

Remote, US | 3y exp
Ticker | Indiana University Bloomington

AI/LLM engineer who has shipped a production RAG platform at Ticker Inc. on GCP (Qdrant + Postgres) delivering sub-second retrieval over 550k+ items, with measurable gains in latency and answer quality (HNSW optimization, MMR re-ranking). Also built an asynchronous LangChain/LangGraph multi-agent research system (10x faster cycles) and partnered with Indiana University doctors on synthetic patient records and ML error analysis using clinician-friendly F1/loss dashboards.

Jai Vilatkar

Screened

Junior AI/ML Developer specializing in GenAI, LLM agents, and RAG systems

Pune, India | 2y exp
NexaByte Technologies | Vellore Institute of Technology

Built and shipped an agentic RAG chatbot module for NexaCLM to answer questions across large volumes of contracts while minimizing hallucinations and incorrect legal interpretations. Implemented routing between vector retrieval and ReAct-style agent retrieval plus an automated grading/validation layer (cosine-similarity thresholds, retries) and deployed via GitHub Actions to Azure Container Apps, partnering closely with legal stakeholders to define risk/clause-focused objectives.

Jayasri Guthula - Mid-level Applied ML Engineer specializing in LLM evaluation and multimodal agent systems in Remote

Mid-level Applied ML Engineer specializing in LLM evaluation and multimodal agent systems

Remote | 5y exp
Handshake AI | University of Arkansas at Little Rock

Full-stack engineer working at the intersection of product and infrastructure, building developer-facing interfaces for AI voice agents in XR/immersive environments plus telemetry-heavy analytics dashboards. Experienced in Postgres telemetry data modeling and performance tuning, and in designing durable multi-step LLM pipelines with idempotency, retries, and strong observability; has operated in fast-moving startup-like teams (Biocom, Handshake AI).

Vishesh Kumar

Screened

Intern Software & AI Engineer specializing in distributed systems and LLM applications

Palo Alto, CA | 1y exp
AmpUp | Stony Brook University

Stony Brook Fall 2024 capstone contributor who built a ROS2-based warehouse mobile robot prototype, owning perception and SLAM integration end-to-end. Strong in real-time robotics optimization on Jetson Orin (TensorRT/CUDA, ROS2 tracing/Nsight) and in distributed ROS2 communications (DDS discovery/QoS, MAVLink-to-ROS2 bridging), with a full simulation/testing/deployment toolchain (Gazebo, CI tests, Docker/K3s).

Hari Krishna Kona - Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP in Boston, MA

Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP

Boston, MA | 3y exp
G-P | Lindsey Wilson College

LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.

SM

Junior Software Engineer specializing in AI platforms and backend systems

Boston, MA | 1y exp
Humanitarians.AI | Northeastern University

Built and shipped AI products at Humanitarians AI, including a full-stack multi-agent platform that consolidated six faculty AI tools into one interface and achieved 100+ user adoption, 70% less workflow switching, and a 6x latency improvement. Also designed a grounded document parser using FAISS and structured LLM outputs that reduced hallucinations by 60%, showing strong depth in both product-minded engineering and production AI systems.

NH

Senior Full-Stack Engineer specializing in AI, cloud, data, and healthcare tech

Van Nuys, CA | 9y exp
SmartiStack | University of South Florida

Backend/data engineer with hands-on production experience across Python/Flask microservices and AWS serverless/data platforms (Lambda, DynamoDB, S3, Glue/PySpark). Demonstrated strong reliability and operations mindset (JWT/RBAC, retries/timeouts/circuit breakers, CloudWatch/SNS alerting) and measurable performance wins (SQL report runtime cut from 10 minutes to 30 seconds). Seeking ~$150k base and cannot travel for onsite meetings for the next 5–6 months due to family medical constraints.

YS

Mid-Level Software Engineer specializing in backend, cloud, and scalable APIs

Remote, United States | 4y exp
FILMIC TECHNOLOGIES | University at Buffalo

Backend Python engineer who has built an LLM agentic tutoring/assignment helper with a custom pipeline for parsing visually complex textbooks (integrating AlibabaResearch VGT and implementing missing preprocessing from the paper), improving RAG grounding with ~90% cleaner extracted text. Also led major platform scaling work by refactoring monolithic image processing into Celery-based async microservices on AWS (GPU/CUDA + S3), and implemented Kafka streaming for payment webhooks with strict ordering, idempotency, and multi-zone fault tolerance.

Anar Nurizada

Screened

Mid-level Robotics Engineer specializing in simulation-to-real ML control

Brooklyn, NY | 5y exp
DL-RL | Stony Brook University

Robotics/ML engineer who benchmarks and adapts open-source robot action models, building synthetic datasets in Isaac Sim and modifying vendor code to scale training across multiple GPUs. Also built a production-style computer vision pipeline at Zortag—training a tiny YOLO-based classifier for fake-vs-real label detection and deploying it in a real-time iOS app with additional display/spoof detection.

GA

Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems

USA | 4y exp
CitiusTech | Northwest Missouri State University

Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.

Ayushi Patel

Screened

Mid-level Software Engineer specializing in cloud data platforms and serverless ETL

Redmond, WA | 6y exp
HCLTech | Illinois Institute of Technology

Data/ML engineer from HCLTech who modernized enterprise data by linking fragmented financial and supply-chain data across SAP/SQL Server/Snowflake using NLP entity linking and embeddings (FAISS). Delivered measurable impact including ~40% reduction in manual error-log triage and entity-linking accuracy improvements from ~86% to ~93%, with results surfaced in Power BI for real-time analytics.

Vikram Sandigaru - Mid-level AI Engineer specializing in AI agents, RAG pipelines, and LLM evaluation in Boston, US

Mid-level AI Engineer specializing in AI agents, RAG pipelines, and LLM evaluation

Boston, US | 3y exp
FounderWay | Northeastern University

Built and shipped production LLM systems at Founderbay, including a low-latency voice agent and a graph-based multi-agent research assistant. Strong focus on reliability in real workflows—hybrid SERP + full-site scraping RAG, grounding guardrails, validation checkpoints, and transcript-driven evaluation—plus performance tuning with async FastAPI, Redis caching, and containerization. Also partnered with a non-technical ops lead to automate post-call follow-ups via call summarization, field extraction, and tool-triggered actions.

Nagendra Reddy Palugulla - Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps in Florida, United States

Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps

Florida, United States | 4y exp
Community Dreams Foundation | University of Houston

Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.

Keeravani Chekuri - Mid-level AI/ML Engineer specializing in LLM systems and MLOps in Boston, MA

Mid-level AI/ML Engineer specializing in LLM systems and MLOps

Boston, MA | 3y exp
Nexoraschool.ai | University of Massachusetts

Built and deployed an AI tutoring assistant end-to-end at Nexora School, spanning discovery with school districts, multi-agent LangGraph/RAG architecture, AWS Bedrock migration, and post-launch stabilization. Stands out for combining hands-on LLM systems engineering with strong educator-facing trust building, FERPA-driven architecture decisions, and disciplined production practices around evals, logging, and messy document ingestion.

HG

Mid-level AI Prompt Engineer specializing in agentic AI and automation

Chicago, IL | 4y exp
The Aspen Group | Illinois Institute of Technology

Built GRETA, a full-stack multi-agent AI platform for SEO content analysis and blog-writing support, combining React/TypeScript, serverless GCP Cloud Run workflows, and LLM/tool orchestration at scale. The system reportedly reduced manual analysis by 60%, and the candidate shows strong hands-on experience shipping AI products in ambiguous environments and refining them through internal user feedback.

SM

Mid-level Full-Stack AI Engineer specializing in agentic systems

San Jose, CA | 4y exp
ReferU.AI | San Jose State University

At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.

AN

Junior Software Engineer specializing in backend systems and full-stack development

San Jose, CA | 3y exp
San José State University | San Jose State University

Full-stack software engineer with hands-on experience shipping AI-driven product experiences, including a conversational travel planner and a RAG-based PDF question-answering system. Has also built enterprise automation APIs at Accenture for network diagnostics, combining backend engineering, testing automation, and user-focused product simplification for non-technical operations teams.

KV

Mid-level Software & ML Engineer specializing in agentic LLM systems and ML infrastructure

Remote | 4y exp
Cloud Systems LLC | Virginia Tech

Built and deployed an LLM-to-SQL automation system in a closed/internal environment, using a retriever–reranker–validator architecture on Kubernetes with strong security controls (semantic + rule-based validation and RBAC), achieving 99% uptime and cutting manual query time ~40%. Also worked on genomic sequence classification and semantic search workflows, orchestrating data prep with Airflow, tracking/deploying with MLflow, and optimizing distributed multi-GPU training on a university Kubernetes cluster.

LC

Mid-level Data Scientist specializing in NLP, recommender systems, and ML deployment

Fairfax, VA | 4y exp
ProvenBase | NJIT

At ProvenBase, built and shipped a production LLM-powered semantic search and candidate matching platform (RAG with GPT-4/Gemini, multi-agent orchestration, Elasticsearch vector search) to scale sourcing across 10M+ candidate records and 1000+ data sources. Drove sub-second performance, cut LLM spend 30% with routing/caching, and improved recruiting outcomes (+45% sourcing accuracy; +38% visibility of underrepresented talent) through bias-aware ranking and tight collaboration with recruiting stakeholders.

Krishna K

Screened

Junior Machine Learning Engineer specializing in multimodal systems and LLMs

Jersey City, NJ | 2y exp
JerseySTEM | University at Buffalo

Built and productionized a domain-specific LLM-powered RAG knowledge assistant at JerseySTEM for answering questions over large internal document corpora, owning the full stack from FAISS retrieval and LoRA/QLoRA fine-tuning to AWS autoscaling GPU deployment. Drove measurable gains (28% accuracy lift, 25% latency reduction) and improved reliability through hybrid retrieval, grounded decoding, preference-model reranking, and Airflow-orchestrated pipelines (35% faster runtime), while partnering closely with non-technical stakeholders to define success metrics and ensure adoption.

Aneri Patel

Screened

Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval

Washington, D.C. | 2y exp
Enquire AI, Inc. | George Washington University

Backend engineer with legal-tech and AI workflow experience: built JurisAI, an end-to-end legal research system using OCR + embeddings + Pinecone vector search to deliver citation-grounded LLM answers with safe failure modes (~90% recall@K). Also led a GW Law metadata migration into Caspio with batch validation and parallel rollout, and has strong FastAPI/GCP production reliability and observability practices.

Hemanth Suddala - Mid-level GenAI Engineer specializing in LLM automation, RAG, and document intelligence in Boca Raton, FL

Mid-level GenAI Engineer specializing in LLM automation, RAG, and document intelligence

Boca Raton, FL | 3y exp
Florida Atlantic University | Florida Atlantic University

Built and deployed a production GenAI resume screening and matching system for Florida Atlantic University, focused on improving recruiter efficiency and search relevance. Demonstrates strong RAG engineering (embeddings, query rewriting, metadata filtering, threshold tuning) plus practical reliability work (grounding constraints, fallbacks, and evaluation using real user queries) using Python REST APIs and orchestration frameworks like LangChain and LlamaIndex.

Tamanna Nandlal Choithani - Entry-level Full-Stack Engineer specializing in AI and distributed systems in California, USA

Entry-level Full-Stack Engineer specializing in AI and distributed systems

California, USA | 1y exp
Bottlely | Arizona State University

Full-stack engineer who built an AI-based inventory/procurement query system at Bottlely using Flask and Google Sheets as a live knowledge base, overcoming Sheets latency with caching and structured in-memory models. Demonstrated strong LLM product engineering (40% accuracy improvement via preprocessing/prompting) and customer-driven iteration with bar/restaurant owners, evolving the tool into a more comprehensive inventory management and forecasting solution.

HG

Mid-level Software Engineer specializing in AI and machine learning

Santa Clara, CA | 5y exp
Frugal Innovation Hub | Santa Clara University

Graduate-level candidate who uses AI as a disciplined engineering assistant rather than an autonomous replacement, with hands-on experience coordinating manual multi-agent coding workflows across planning, implementation, and testing. They emphasize scoped execution, clear constraints, and human ownership of final merges, suggesting a thoughtful and practical approach to AI-augmented software development.
