“Built a production multi-agent recommendation/RAG system for internal data analysts to speed up weekly report creation by improving document discovery and automating report/SQL generation. Implemented LangGraph-based orchestration with deterministic agent routing, robust error handling (interrupt/resume), and metadata-driven semantic chunking for diverse PDF/document formats, plus monitoring for latency, throughput, and token/cost efficiency.”

LangGraph LangChain Prompt Engineering Hugging Face Transformers OpenAI API Model Context Protocol (MCP) +118

Nikhil Soni

Screened

Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation

New York, NY | 2y exp

Quant AI Research | NYU

“Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.”

Python SQL C C++ Java HTML +120

Syeda Bushra Fatima

Mid-level Backend & Full-Stack Developer specializing in AI and FinTech systems

Summit, NJ | 4y exp

Wells Fargo | Saint Louis University

Python Django Flask FastAPI REST APIs Microservices +56

Nan Xiao

Junior AI Engineer specializing in LLMs, RAG, and agent evaluation

New York, NY | 1y exp

Summoner | Columbia University

A/B Testing API Development AWS CI/CD Clustering Data Cleaning +94

Jaya Krishna

Mid-level AI/ML Engineer specializing in conversational AI, NLP, and LLM-powered RAG systems

Jersey City, NJ | 5y exp

JPMorgan Chase | Saint Peter's University

Python TypeScript JavaScript SQL PyTorch TensorFlow +108

Shykh Hafeez

Senior AI Architect specializing in Generative AI and LLM systems

New York City, NY | 8y exp

Rezolve AI

Generative AI GPT-4 Claude LLaMA Transformers Prompt Engineering +165

Ojasmitha Pedirappagari

Screened

Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms

Jersey City, NJ | 5y exp

Nurture Holdings | UC Santa Cruz

“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”

Python SQL C# TypeScript JavaScript AWS +83

Soumith Ganji

Mid-level AI Engineer specializing in GenAI, RAG, and multi-agent systems

New York, NY | 4y exp

Fiserv | Stevens Institute of Technology

Python Java Kotlin C C++ SQL +65

Mahikshit Mahikshit

Entry-level AI Engineer specializing in LLM-powered backend systems

Edison, NJ | 1y exp

TCS | Penn State University

Artificial Intelligence Azure Functions Bash C C++ CI/CD +61

Manideep Thotakura

Mid-level Full-Stack Engineer specializing in AI-driven cloud applications

New York, USA | 5y exp

Marras Pharmacy | St. Francis College

Java Spring Boot Hibernate TypeScript JavaScript React +143

Jayanth Vodnala

Mid-level AI Engineer specializing in machine learning and generative AI

New York, NY | 5y exp

USAA | Yeshiva University

Python R SQL Jupyter Notebook LightGBM XGBoost +109

Shubham Patil

Screened

Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics

New York, NY | 4y exp

Syracuse University | Syracuse University

“Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).”

A/B Testing Anomaly Detection API Development AWS Azure Machine Learning CI/CD +91

Shruti Rawat

Screened

Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services

Jersey City, NJ | 4y exp

State Street | Pace University

“Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.”

A/B Testing API Integration Apache Airflow AWS AWS Glue AWS Lambda +112
