Browse Talent Find Talent Open Jobs Pricing FAQsGet Started

Vetted Vector Search Professionals

Pre-screened and vetted.

Vector Search Python Docker SQL CI/CD AWS

Sasi Katamneni

Screened

Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications

Dallas, TX5y exp

Baylor Scott & WhiteUniversity of North Texas

“Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).”

A/B Testing Agile Ajax Amazon API Gateway Amazon Bedrock Amazon CloudWatch+267

View profile

Bharath kumar

Screened

Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps

Draper, UT12y exp

ThorneBharathiar University

“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”

A/B Testing API Development API Testing Apache Hadoop Apache Hive Apache Kafka+251

View profile

Omkar Bhope

Screened

Staff Full-Stack Engineer specializing in AI platforms and infrastructure automation

San Jose, CA5y exp

Etched AIUC San Diego

“Backend/full-stack engineer building complex internal platforms and customer-facing demos at the intersection of infrastructure and product. Shipped a no-code Product Lifecycle Manager for manufacturing (3 manufacturers, 1000+ evolving tests) using AWS S3/SQS ingestion and extensible Postgres (EAV+JSONB) with end-to-end traceability. Also built a FastAPI-based company data intelligence platform with Okta-secured RBAC and an LLM/MCP layer for ChatGPT-like analytics over enterprise data sources.”

Python C C++TypeScript JavaScript SQL+159

View profile

Tejaswi Kothapalli

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and Conversational AI

3y exp

AetnaIndiana Tech

“Built a production RAG-based GenAI copilot backend at Aetna using Python/FastAPI, GPT-4, LangChain, and Azure AI Search, deployed on AKS with Prometheus/Grafana observability. Owned the system end-to-end (ingestion through deployment) and improved peak-time reliability by addressing vector search and embedding bottlenecks with Redis caching, index optimization, and async processing, plus added anti-hallucination guardrails via retrieval confidence thresholds.”

Agile Amazon SageMaker Apache Spark AWS AWS Lambda Azure DevOps+165

View profile

Raja Gurugubelli

Screened

Mid-level GenAI Engineer specializing in production RAG and LLM fine-tuning

San Jose, California5y exp

eBayTexas Tech University

“LLM engineer who built a production seller-support RAG system at eBay using hybrid retrieval (BM25 + Pinecone vectors) with Cohere reranking, LangGraph orchestration, and citation-grounded answers. Strong focus on reliability: semantic/structure-aware chunking, automated Ragas-based evaluation with nightly regressions, and production observability (LangSmith) plus drift monitoring (Arize). Also implemented a multi-agent fraud pipeline with AutoGen using JSON-schema contracts and explicit termination conditions.”

Python SQL Bash GPT-4 LoRA LangChain+130

View profile

Yupeng Tang

Screened

Junior Machine Learning Engineer specializing in LLM systems and GPU inference

Atlanta, GA1y exp

GMI CloudGeorgia Tech

“LLM/agent engineer who shipped a production RAG-based recommendation + explanation system that replaced a traditional recommender stack, delivering ~20% CTR lift (and +8% after a reliability iteration) with strong cold-start performance. Demonstrates strong production rigor: schema-constrained generation, typed tool calling, explicit state/orchestration, deep monitoring/feedback loops, and safe integration with messy ERP inventory/order data using normalization, idempotency, and conflict-resolution guardrails.”

Python SQL C C++Go Java+124

View profile

Jathin Shettigar

Screened

Intern Software Engineer specializing in edge AI deployment and distributed systems

San Francisco, CA1y exp

Zetic AISan José State University

“Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).”

Python Go JavaScript TypeScript Bash C+205

View profile

Prateeksha Ranjan

Screened

Mid-level Software Engineer specializing in embedded AI and full-stack systems

Irvine, California4y exp

SynapticsUC Irvine

“Robotics software engineer who built and owned core navigation components for a TurtleBot in ROS/ROS2 and Gazebo, including an RRT-based planner, waypoint-to-velocity motion planning, and PID trajectory tracking. Demonstrates strong real-time debugging skills (control-loop timing under CPU load), costmap/occupancy-grid tuning, and distributed ROS2 communication design using DDS/QoS, plus Docker and CI/CD automation experience from Keysight.”

Python C C++Go JavaScript TypeScript+204

View profile

Prateek Patil

Screened

Engineering Leader specializing in Digital Health, AI, and Cloud Platforms

Santa Clara, CA16y exp

RocheIllinois Institute of Technology

“Senior Engineering Manager at Roche leading two Scrum teams building internally shared (“inner-sourced”) tools and libraries for a healthcare enterprise. Has led security/compliance-first architecture decisions (e.g., Python AI modules running inside a Java container) and front-end modularization (Angular monorepo to module federation), with a strong focus on developer experience via automated Swagger/OpenAPI documentation and robust testing/versioning practices.”

Java Python Object-oriented programming (OOP)Design patterns Algorithms Distributed systems+112

View profile

Charlotte Yu

Screened

Junior Full-Stack AI Engineer specializing in LLM apps and RAG systems

Remote1y exp

StealthUCLA

“Built and shipped a production LLM-powered “Vet agent” that automates pet symptom intake across multimodal inputs (images/files/text/speech) and provides analysis/home-care guidance, reaching thousands of daily active users within two months. Demonstrates strong agent engineering fundamentals: state-machine orchestration with structured JSON, tool/schema validation, high-availability routing/failover, and rigorous offline/online evaluation loops with trace-driven reliability improvements.”

Python TypeScript Java C++SQL JavaScript+95

View profile

Aarushi Mahajan

Screened

Mid-level AI/ML Engineer specializing in NLP, Generative AI, and MLOps

New York, USA4y exp

IntuitUniversity of Massachusetts Amherst

“Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.”

Python SQL Data Structures Algorithms TensorFlow PyTorch+185

View profile

Omkar Bhambure

Screened

Mid-level Software Developer specializing in backend microservices for healthcare and FinTech

USA4y exp

HumanaUniversity of Virginia

“Built and deployed an AI-powered insurance claims fraud platform end-to-end using Java/Spring Boot, Kafka, OpenAI, pgvector, and AWS EKS. Stands out for combining LLM/RAG architecture with production-grade scalability and observability, delivering measurable impact including 62% less manual review, 40% better fraud precision, 37% higher throughput, and 99.95% uptime.”

Java Python SQL Spring Boot Spring MVC Spring Security+96

View profile

Justin Emsoff

Screened

Director-level Solutions Architect specializing in AI, integrations, and enterprise SaaS

Altadena, CA12y exp

KnowdeUSC

“Player-coach engineering leader currently running a Solution Architecture/FDE team responsible for both presales and postsales delivery. Stands out for combining enterprise systems thinking with hands-on AI product work: they built configurable tooling that sped delivery by ~30%, drove a Kafka-to-Pulsar architecture shift for scale, and spent the last two years building LLM-based document extraction and RAG inference pipelines shaped directly by user feedback.”

Machine Learning Prompt Engineering Computer Vision LlamaIndex OpenAI API Hugging Face Transformers+167

View profile

Navneet Parab

Screened

Mid-level AI/ML Engineer specializing in financial risk and LLM systems

New Jersey, USA4y exp

Ally FinancialNortheastern University

“AI/ML engineer in financial services who has built both LLM-powered compliance tools and production fraud/credit risk systems at Ally Financial. Particularly strong in regulated, high-stakes environments: combines RAG/LLM architecture, rigorous evaluation, and human-in-the-loop governance, and also helped stand up a unified ML platform from scratch.”

Machine Learning Artificial Intelligence BERT XGBoost LightGBM LSTM+144

View profile

Sai Gautham Ghanta

Screened

Junior Software Engineer specializing in AI search and full-stack systems

Denver, CO3y exp

finish’d, Inc.University of Colorado Boulder

“AI/full-stack engineer who has built both a real-time crypto sentiment platform from scratch and production enterprise RAG search systems at Kore.ai. Stands out for combining strong systems engineering with practical LLM evaluation, retrieval tuning, and careful human-in-the-loop design for high-risk network automation use cases with Cisco.”

Python Java JavaScript TypeScript C++Go+130

View profile

Aakash Khepar

Screened

Mid-level Full-Stack AI Engineer specializing in agentic AI systems

Tempe, AZ4y exp

Arizona State UniversityArizona State University

“AI/full-stack builder with hands-on experience shipping healthcare, career-tech, nonprofit, and fintech products, spanning speech AI, browser extensions, agentic RAG systems, and enterprise ML monitoring. Stands out for combining strong technical depth with measurable outcomes, including reducing clinical call WER from 26% to 3%, building safe tool-using agents with rollback/RBAC, and delivering zero-to-one multi-tenant platform features in ambiguous environments.”

Python TypeScript JavaScript Java SQL NoSQL+259

View profile

Suman Madipeddi

Screened

Junior AI/ML Engineer specializing in agentic AI, RAG, and voice systems

San Jose, CA2y exp

ZscalerArizona State University

“Full-stack AI product engineer who has owned production-grade document intelligence and agent systems at meaningful scale, including a copilot used by 10,000+ users and 1M+ queries. Particularly strong in combining React/TypeScript product work with Python/FastAPI, RAG, knowledge graphs, observability, and performance tuning—cutting latency from ~7 seconds to 0.5 milliseconds while improving trust through citations and human review.”

Agentic AI LoRA LangGraph Pinecone AWS OCR+216

View profile

Nikita Prasad

Screened

Mid-level AI/ML Engineer specializing in NLP, MLOps, and scalable data pipelines

Remote, USA5y exp

JPMorgan ChaseUniversity of Dayton

“Built and shipped a production LLM-powered personalized client engagement assistant in the financial domain, balancing real-time recommendations with strict privacy/compliance requirements. Demonstrates strong MLOps/LLMOps depth (Airflow + MLflow, containerized microservices, drift monitoring) and a privacy-by-design approach validated in collaboration with risk and compliance teams.”

Python Pandas spaCy R SQL PySpark+199

View profile

Siva Sai Kumar Mogalluru

Screened

Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare

Remote, USA4y exp

EYUniversity of South Florida

“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”

A/B Testing Agile Anomaly Detection Apache Airflow Apache Spark Azure DevOps+138

View profile

Uday Chilakala

Screened

Mid-level Machine Learning Engineer specializing in NLP, computer vision, and RAG systems

Atlanta, GA5y exp

Morgan StanleyKennesaw State University

“Machine learning/NLP engineer who built a production-oriented retrieval-based AI system at Morgan Stanley for healthcare use cases, combining RAG over unstructured patient records with deep-learning medical image segmentation (U-Net/Mask R-CNN). Strong in end-to-end pipelines and MLOps (Spark/MongoDB, AWS SageMaker, CI/CD, monitoring, automated retraining) and in entity resolution/data quality validation for noisy clinical data.”

Python SQL Flask Apache Spark gRPC TensorFlow+125

View profile

Divyam Agrawal

Screened

Mid-level Machine Learning Engineer specializing in LLMs and NLP classification systems

Seattle, WA4y exp

Affinity SolutionsUniversity of Washington

“Internship experience building a production RAG+LLM pipeline to map messy card transaction descriptions to merchant brands, including a custom modified-ROUGE evaluation approach for weak/variant ground truth. Improved scalability and cost by moving from a managed LLM endpoint (e.g., Bedrock) to self-hosted vLLM, and orchestrated massive embedding backfills (5,000+ files, 10B+ rows) using an Airflow-triggered SQS + ECS worker architecture with robust retry/DLQ handling.”

A/B Testing API Design AWS AWS CloudFormation AWS Lambda Auto-scaling+110

View profile

Sai Charan Kolla

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS

TX, USA5y exp

BlackRockTexas A&M University-Kingsville

“LLM engineer who built a production document intelligence/RAG pipeline to extract structured data from thousands of unstructured PDFs, cutting manual review time by 60%. Experienced with LangChain and Airflow orchestration plus rigorous evaluation (labeled datasets, prompt testing, HITL review, monitoring) to improve accuracy and reduce hallucinations while partnering closely with non-technical operations stakeholders.”

Python SQL R Java C++Machine Learning+99

View profile

Amaan Elahi

Screened

Mid-level Software Engineer specializing in backend, AI, and full-stack systems

New York, NY5y exp

SAIL GTXNYU

“Built and shipped production LLM agents including an internal RAG-based compliance classification system at SAIL (FastAPI/Redis/Docker) designed to handle real failure modes and scale to ~10k LLM calls/hour, achieving ~93% pipeline accuracy with reduced hallucination risk via multi-model orchestration and strict grounding. Also architected “Elara,” a state-machine-driven conversational appointment booking agent using structured JSON outputs and backend function execution for reliability, and has experience normalizing messy OTA/PMS data at RateGain.”

C++C#Python JavaScript TypeScript SQL+116

View profile

Balaji Nissenkarao

Screened

Mid-level Machine Learning Engineer specializing in AI/LLM systems

New York, NY5y exp

ServiceNowUniversity at Buffalo

“ML/LLM systems engineer who has owned AI support automation products end-to-end, including ServiceNow-integrated incident routing, RAG-based resolution suggestion systems, and production stabilization. Stands out for combining hands-on platform work across PySpark, AWS Glue, FastAPI, Kubernetes, and Pinecone with measurable operational impact, including 30-35% MTTR reduction and 25-30% improvement in first-touch resolution.”

Python SQL Java Bash XGBoost Model Evaluation+92

View profile

Software Engineers Machine Learning Engineers Data Scientists AI Engineers Full Stack Developers Software Developers Engineering AI & Machine Learning Data & Analytics Education

Need someone specific?

AI Search

Related

Need someone specific?