Vetted Semantic Search Professionals

Pre-screened and vetted.

Monisha Nettem - Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps in USA

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

USA5y exp
M&T BankKennesaw State University

AI/ML engineer with banking domain experience (M&T Bank) who built a production credit-risk prediction and reporting platform combining ML models (XGBoost/TensorFlow) with a RAG pipeline (LangChain + GPT-4) over compliance documents. Delivered measurable impact (≈20% better risk detection/precision, 50% less manual reporting) and productionized workflows on Vertex AI/Kubeflow with CI/CD and monitoring; also implemented embedding-based semantic search using FAISS/Pinecone.

View profile
Bhavya Sri Gunnapaneni - Mid-level AI/ML Engineer specializing in fraud detection and NLP in United States

Mid-level AI/ML Engineer specializing in fraud detection and NLP

United States4y exp
AIGLewis University

Built production AI/RAG-style systems for message Q&A and insurance claims workflows, combining data ingestion, indexing/retrieval, and LLM integration with fallback modes. Has hands-on orchestration experience (Airflow, Prefect, LangChain) and cites large operational gains (claims processing reduced to ~45 seconds; manual review -50%; false alerts -30%) through automated, monitored pipelines and close collaboration with non-technical stakeholders.

View profile
Swati Swati - Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps in Florida, United States

Swati Swati

Screened

Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps

Florida, United States5y exp
Voltihost LLCStony Brook University

AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.

View profile
Ranxin Li - Mid-Level AI/Full-Stack Engineer specializing in agentic LLM systems and RAG in San Jose, USA

Ranxin Li

Screened

Mid-Level AI/Full-Stack Engineer specializing in agentic LLM systems and RAG

San Jose, USA2y exp
RevoAgent SolutionUC Davis

Built and deployed Clyra.AI, an AI-driven daily scheduling product that uses a LangGraph-based multi-agent LLM pipeline (task extraction, verification, reflection) grounded with strict RAG over emails/documents/calendars and real-world signals like health metrics. Designed a custom agent orchestrator with bounded loops/termination conditions and a self-auditing verification/reflection layer to reduce hallucinations while controlling latency and cost via caching and model distillation.

View profile
Raj Patel - Junior Machine Learning Engineer specializing in LLMs and RAG systems in Remote, USA

Raj Patel

Screened

Junior Machine Learning Engineer specializing in LLMs and RAG systems

Remote, USA1y exp
EmotionallNYU Tandon School of Engineering

Production-focused applied ML/LLM engineer who has deployed an LLM-powered RAG assistant and improved reliability through rigorous retrieval evaluation (recall/MRR), reranking, and guardrails that prevent confident wrong answers. Experienced running containerized ML/LLM services on Kubernetes (including AWS-managed layers) with CI/CD and observability, and has delivered a real-time predictive maintenance system using streaming sensor data and time-series anomaly detection in close partnership with maintenance teams.

View profile
Sai Krishna Mallikanti - Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP in TN

Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP

TN4y exp
CignaUniversity of Memphis

Built a production LLM/RAG solution for healthcare operations teams to query large policy and care-guideline repositories in natural language. Improved domain alignment using vector retrieval plus parameter-efficient fine-tuning and prompt optimization, validated through internal user testing and metrics, cutting manual lookup time by ~40%. Also has hands-on experience orchestrating automated ML pipelines with Apache Airflow.

View profile
Sumanth Gottipati - Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and FinTech in New York, NY

Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and FinTech

New York, NY4y exp
Delta Air LinesVirginia University of Science and Technology

At Delta Airlines, built and shipped a production LLM-powered semantic search/troubleshooting assistant over maintenance logs and operational documentation using OpenAI embeddings and a vector database. Implemented hybrid ranking, query enrichment, and structured filters to improve relevance ~35% while optimizing latency via caching and vector tuning. Also designed a scalable Kafka + AWS (Lambda/SQS) ingestion pipeline with strong reliability/observability and an eval loop using real engineer queries and human review.

View profile
Nidhip Patel - Mid-level Software Engineer specializing in AI/ML and full-stack systems in United States

Nidhip Patel

Screened

Mid-level Software Engineer specializing in AI/ML and full-stack systems

United States3y exp
UnumWebster University

Backend Java engineer with strong platform/DevOps experience: modernized an insurance claims legacy monolith into DDD-aligned microservices, deployed containerized services on Kubernetes with Jenkins CI/CD and static analysis gates, and implemented GitOps using ArgoCD. Also led major AWS migration planning with dependency mapping and network monitoring to uncover hidden dependencies, and built Kafka-based real-time event streaming with schema-registry-driven evolution.

View profile
AA

Junior Software Engineer specializing in AI/ML, data pipelines, and cloud APIs

San Jose, CA3y exp
TCSCalifornia State University, Chico

Hands-on AI/LLM practitioner who built a RAG-based customer support chatbot and tackled production issues like data chunking complexity and response-time lag. Uses techniques such as overlapping chunks, semantic search, context engineering, and query routing, and has experience presenting technical demos/workshops to developer audiences.

View profile
SC

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps

Atlanta, GA4y exp
Universal Health ServicesUniversity of New Haven

Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.

View profile
VK

Vaishnavi K

Screened

Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection

USA5y exp
TCSUniversity of New Haven

LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.

View profile
YP

Mid-level AI Engineer specializing in LLMs, RAG, and data engineering

Boston, MA5y exp
Humanitarians.AINortheastern University

AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).

View profile
NB

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp
LumenUniversity of South Florida

AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.

View profile
AM

Mid-Level Full-Stack Software Engineer specializing in cloud-native microservices

Sanford, FL4y exp
HCLTechUniversity of Massachusetts Lowell

Backend engineer with cloud-native Python/Flask experience building high-throughput financial platforms (loan origination intelligent document processing and real-time fraud detection). Has scaled microservices on AKS with event-driven Azure messaging, delivered measurable performance gains (e.g., 700ms→180ms query latency; ~40% API improvements), and implemented strong security controls (OAuth2/JWT, Azure AD RBAC, audit logging, AES-256/TLS) for sensitive regulated data.

View profile
Vidit Naik - Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems in San Francisco, CA

Vidit Naik

Screened

Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems

San Francisco, CA2y exp
Checksum AIUC Riverside

Forward-deployed engineer who built a production AI drone-control chatbot that lets users fly a drone via natural language while viewing a real-time feed. Implemented RAG over drone SDK documentation (vector DB + top-k retrieval) and LoRA fine-tuning, with a focus on latency, token efficiency, and cost reduction, and regularly works with non-technical clients to integrate and explain AI system architecture.

View profile
Surya Danturty - Intern AI/ML Engineer specializing in computer vision and time-series forecasting in Riverside, CA

Intern AI/ML Engineer specializing in computer vision and time-series forecasting

Riverside, CA0y exp
University of California, RiversideUC Riverside

Undergrad who built a production RAG chatbot for a messy college website using OpenAI embeddings + FAISS, overcoming hard-to-crawl/non-selectable site content and strict API budget limits. Applies information-retrieval best practices (section-based chunking with overlap, precision/recall evaluation) and reliability techniques (edge-case testing, similarity thresholds, fallback responses), and has experience scaling similar indexing work to ~300,000 Wikipedia pages.

View profile
Pravalika Kuppireddy - Mid-level AI/ML Engineer specializing in Generative AI and intelligent automation

Mid-level AI/ML Engineer specializing in Generative AI and intelligent automation

4y exp
University of Michigan-DearbornUniversity of Michigan-Dearborn

LLM engineer who built and productionized a system to classify GitHub commits (performance vs non-performance) using zero-/few-shot approaches over commit messages and diffs, working at ~5M-record scale on multi-node NVIDIA GPUs. Experienced orchestrating end-to-end LLM pipelines with Airflow and GitHub Actions, and emphasizes reliability via testing, guardrails, and observability while collaborating closely with non-technical product stakeholders.

View profile
JA

Mid-level Full-Stack AI Engineer specializing in agentic systems and security-hardened pipelines

USA3y exp
Adsgency AIUniversity of Colorado Boulder

Founding/early engineer experience across Asante and a Series A startup (Adgency), shifting from data science/ML into owning production full-stack systems end-to-end. Built core product flows (registration, business profiles, map service), AWS-deployed gRPC microservices with CI/CD, and operated low-latency agent/video ad generation workflows with retries/fallbacks and PostHog-based observability.

View profile
FP

Fabio Pecora

Screened

Junior Software Engineer specializing in distributed systems and applied AI

New York, NY3y exp
NextStep.AICollege of Staten Island (CUNY)

Early-career full-stack builder who created an AI interview-prep platform used by 200+ students, tested it with a 25-student study group, and earned recognition through the CUNY Startup accelerator, including prize money and local college adoption. Has also shipped compliance-sensitive AI products in healthcare marketing and operational tools like invoice approval systems, showing unusual breadth across AI, UX, and backend systems.

View profile
KB

Keerthi Basam

Screened

Mid-level Software Engineer specializing in AI/ML for FinTech and Healthcare

United States4y exp
IBMWright State University

Built and deployed an end-to-end fintech product, FinSight, for bank statement analysis and financial Q&A using a production-style RAG architecture. Stands out for combining FastAPI, OpenAI embeddings, FAISS, hybrid SQL/vector retrieval, and practical reliability work like chunking optimization, validation, and low-latency performance tuning.

View profile
AA

Anil Ande

Screened

Mid-level Software Engineer specializing in full-stack and AI-powered FinTech systems

Long Beach, CA4y exp
PNCCalifornia State University, Dominguez Hills

Backend-focused engineer with hands-on experience deploying AI-driven document processing and RAG-based workflows using Python, LangChain, FAISS, and REST APIs. Has owned projects from requirements through post-launch monitoring, including debugging production retrieval issues and building reliable pipelines for messy PDFs/scans and compliance-oriented document analysis.

View profile
SR

Mid-level Full-Stack Software Engineer specializing in cloud-deployed web apps and APIs

Dayton, OH3y exp
Wells FargoWright State University

Software engineer who has shipped both core web platform features (secure user authentication/profile management) and production LLM systems. Built an internal documentation knowledge assistant using a full RAG pipeline (OpenAI embeddings, vector DB, semantic search, reranking) with evaluation loops and a scalable document-ingestion pipeline for PDFs/FAQs, iterating based on metrics and user feedback.

View profile
BY

Billy Y

Screened

Junior Software Engineer specializing in Full-Stack and GenAI/LLM applications

San Jose, CA2y exp
ZymebalanzBoston University

LLM/RAG practitioner building clinician-facing AI search and Q&A inside EHR workflows, focused on trust, latency, and safety (grounded answers with citations, PHI controls, encryption/audit logs). Demonstrated real-time incident response for production LLM systems (e.g., fixing a metadata-filter deployment regression to prevent irrelevant results/cross-patient leakage) and strong demo/enablement skills for mixed technical and clinical stakeholders; also shipped a multi-model RAG tool at OrbeX Labs with upload/search/audit features for day-to-day adoption.

View profile
AK

Junior Machine Learning Engineer specializing in computer vision and generative AI

1y exp
INV TechnologiesKennesaw State University

CoreAI intern at The Home Depot who improved the Magic Apron Assistant by building a production video ingestion + RAG retrieval system for long videos (uploads and YouTube), including a graph-based retrieval module to speed up and improve relevance. Experienced with Kubernetes orchestration (HPA) and production reliability practices like caching, monitoring, regression testing, and stakeholder-driven requirements.

View profile

Need someone specific?

AI Search