Pre-screened and vetted in the NYC Metro.
Junior AI/ML Engineer specializing in LLMs, RAG, and document intelligence
Senior AI Infrastructure Engineer specializing in LLM systems and real-time ML platforms
Mid-level AI/ML Engineer specializing in NLP, Computer Vision, and Generative AI
Senior AI/ML Engineer specializing in GenAI, LLMs, NLP, and MLOps
Mid-level AI/ML Engineer specializing in LLM agents, RAG, and enterprise ML systems
“Built a production multi-agent recommendation/RAG system for internal data analysts to speed up weekly report creation by improving document discovery and automating report/SQL generation. Implemented LangGraph-based orchestration with deterministic agent routing, robust error handling (interrupt/resume), and metadata-driven semantic chunking for diverse PDF/document formats, plus monitoring for latency, throughput, and token/cost efficiency.”
Junior AI/ML Engineer specializing in LLM systems and retrieval-augmented generation
“Built and deployed a production LLM-powered market intelligence and decision-support platform for noisy, real-time financial data, using a high-throughput embedding + vector DB RAG architecture to reduce hallucinations while keeping latency and cost low. Operated it at scale with GPU-backed inference (continuous batching/quantization), FastAPI on Kubernetes, and Airflow-orchestrated ingestion/embedding/retraining workflows, with strong schema-based reliability and monitoring.”
Mid-level Backend & Full-Stack Developer specializing in AI and FinTech systems
Mid-level AI/ML Engineer specializing in conversational AI, NLP, and LLM-powered RAG systems
Senior AI Architect specializing in Generative AI and LLM systems
Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms
“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”
Mid-level AI Engineer specializing in GenAI, RAG, and multi-agent systems
Entry-level AI Engineer specializing in LLM-powered backend systems
Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics
“Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services
“Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.”
Mid-level AI Engineer specializing in multi-agent systems and RAG
“Built and shipped a production LangGraph-based multi-agent LLM analytics/decision copilot that answers questions across SQL/BI systems and unstructured docs, emphasizing grounded, tool-verified outputs with citations and confidence gating. Deep hands-on experience with orchestration (LangGraph, CrewAI, OpenAI Assistants, MCP) plus real-world latency/cost optimization (vLLM batching/KV caching, speculative decoding, quantization) and rigorous eval/observability. Partnered closely with business/ops stakeholders to deliver explainable reporting automation, cutting manual reporting time by 50%+.”
Junior Software Engineer specializing in distributed systems and applied AI
“Early-career full-stack builder who created an AI interview-prep platform used by 200+ students, tested it with a 25-student study group, and earned recognition through the CUNY Startup accelerator, including prize money and local college adoption. Has also shipped compliance-sensitive AI products in healthcare marketing and operational tools like invoice approval systems, showing unusual breadth across AI, UX, and backend systems.”
Mid-level AI/Data Engineer specializing in LLM agents, RAG, and cloud data pipelines
Mid-level Applied AI/ML Engineer specializing in scalable generative model infrastructure
Junior AI Engineer specializing in agentic AI, RAG, and voice/telephony systems
“LLM/agent engineer who has built production multi-agent systems (LangChain/LangGraph) for enterprise workflows like email and calendar automation, with a strong focus on latency, tool-calling accuracy, and evaluation via LangSmith. Also worked on AI long-term memory using knowledge graphs at VEAI and communicated the approach and tradeoffs to CEO/CTO stakeholders.”
Mid-level AI/ML Engineer specializing in LLMs, NLP, and AWS MLOps
“Recent master’s graduate in robotics with applied experience across reinforcement learning and ROS 2 autonomy stacks. Built an RL-based drone vertiport traffic controller (PPO) focused on reward design and simulation integration, and has hands-on navigation work in ROS 2 including LiDAR preprocessing, SLAM/path planning, and stabilizing TurtleBot3 wall-following. Also brings deployment experience containerizing robotics nodes and scaling them with Kubernetes on AWS.”