Pre-screened and vetted in the NYC Metro.
Junior AI/ML Engineer specializing in LLMs, RAG, and document intelligence
Mid-level AI/ML Engineer specializing in NLP, Computer Vision, and Generative AI
Mid-level AI Engineer specializing in LLM agents and RAG systems
Mid-level AI/ML Engineer specializing in LLM agents, RAG, and enterprise ML systems
“Built a production multi-agent recommendation/RAG system for internal data analysts to speed up weekly report creation by improving document discovery and automating report/SQL generation. Implemented LangGraph-based orchestration with deterministic agent routing, robust error handling (interrupt/resume), and metadata-driven semantic chunking for diverse PDF/document formats, plus monitoring for latency, throughput, and token/cost efficiency.”
Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation
“Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.”
Mid-level Backend & Full-Stack Developer specializing in AI and FinTech systems
Junior AI Engineer specializing in LLMs, RAG, and agent evaluation
Mid-level AI/ML Engineer specializing in conversational AI, NLP, and LLM-powered RAG systems
Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms
“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”
Mid-level AI Engineer specializing in GenAI, RAG, and multi-agent systems
Entry-level AI Engineer specializing in LLM-powered backend systems
Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics
“Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services
“Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.”
Mid-level AI Engineer specializing in multi-agent systems and RAG
“Built and shipped a production LangGraph-based multi-agent LLM analytics/decision copilot that answers questions across SQL/BI systems and unstructured docs, emphasizing grounded, tool-verified outputs with citations and confidence gating. Deep hands-on experience with orchestration (LangGraph, CrewAI, OpenAI Assistants, MCP) plus real-world latency/cost optimization (vLLM batching/KV caching, speculative decoding, quantization) and rigorous eval/observability. Partnered closely with business/ops stakeholders to deliver explainable reporting automation, cutting manual reporting time by 50%+.”
Mid-level AI/Data Engineer specializing in LLM agents, RAG, and cloud data pipelines
Mid-level Applied AI/ML Engineer specializing in scalable generative model infrastructure
Junior AI Engineer specializing in agentic AI, RAG, and voice/telephony systems
“LLM/agent engineer who has built production multi-agent systems (LangChain/LangGraph) for enterprise workflows like email and calendar automation, with a strong focus on latency, tool-calling accuracy, and evaluation via LangSmith. Also worked on AI long-term memory using knowledge graphs at VEAI and communicated the approach and tradeoffs to CEO/CTO stakeholders.”
Mid-level AI/ML Engineer specializing in LLMs, NLP, and AWS MLOps
“Recent master’s graduate in robotics with applied experience across reinforcement learning and ROS 2 autonomy stacks. Built an RL-based drone vertiport traffic controller (PPO) focused on reward design and simulation integration, and has hands-on navigation work in ROS 2 including LiDAR preprocessing, SLAM/path planning, and stabilizing TurtleBot3 wall-following. Also brings deployment experience containerizing robotics nodes and scaling them with Kubernetes on AWS.”
Mid-level AI Engineer specializing in causal inference and LLM research
“LLM engineer who has deployed a production system combining LLMs with causal inference (DoWhy) to enable counterfactual “what-if” analysis for experimental research, including a robust variable-mapping/validation layer to reduce hallucinations. Also partnered with non-technical operations leadership at Irriion Technologies to deliver an AI-assisted onboarding workflow that cut onboarding time by 50% and reduced manual errors by ~40%.”
Mid-level AI Engineer specializing in Generative AI and multimodal RAG systems
“GenAI/LLM engineer who built and productionized a 0-1 application (EMULaiTOR at Lumanity) combining qualitative + quantitative data using Postgres/pgvector RAG and prompt engineering, deployed with Azure backend and AWS-hosted frontend. Demonstrates strong production instincts (latency reduction via region alignment, autoscaling/health checks) and hands-on agent/tool-call debugging, plus experience enabling sales and winning a large pharma client.”
Junior Full-Stack & LLM Application Developer specializing in agentic RAG systems
Mid-level AI Engineer specializing in LLM automation and RAG systems