Pre-screened and vetted in the NYC Metro.
Mid-level Generative AI Engineer specializing in RAG, multi-agent LLM systems, and LLMOps
Senior AI/ML Engineer specializing in GenAI, MLOps, and Databricks/AWS
Mid-level Data Scientist specializing in LLMs, NLP, and predictive modeling in healthcare and finance
Mid-level AI/ML Engineer specializing in agentic AI for financial services
Mid-level Machine Learning Engineer specializing in Generative AI and foundation models
Mid-level AI Engineer specializing in machine learning and generative AI
Mid-level Data Scientist / ML Engineer specializing in healthcare predictive analytics and NLP
“Built and deployed a real-time hospital readmission risk prediction system at NYU Langone Health, combining structured EHR data with BERT-based NLP on clinical notes and serving predictions to clinicians via Azure ML and FHIR APIs. Emphasizes production reliability and clinical trust through SHAP-based explainability and robust healthcare data preprocessing, and reports a 22% reduction in 30-day readmissions.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services
“Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.”
Mid-level AI/ML & Backend Engineer specializing in AI platforms and computer vision
“Backend engineer with hands-on experience building real-time, low-latency systems: owned the Python backend for a real-time crowd-monitoring product (top 5% at HackHarvard 2025) using OpenCV, GPU YOLO inference (PyTorch), WebRTC, and OAuth. Also has production experience with Kubernetes/GitOps (Helm/Kustomize, GitHub Actions, Argo CD) and Kafka-based event pipelines, and executed a minimal-downtime migration of on-prem PostgreSQL to AWS EC2.”
Mid-level Machine Learning Engineer specializing in LLM platforms and robotic perception
“Built and shipped a production multi-agent personal financial assistant at AlphevaAI on AWS ECS, combining FastAPI microservices, Redis/SQS orchestration, and Pinecone-based hybrid RAG (semantic + BM25) to ground financial guidance. Improved routing accuracy with an embedding-based SetFit + logistic regression intent classifier feeding an LLM router, and optimized UX with live streaming plus cost controls via model tiering and caching.”
Mid-level AI/ML Engineer specializing in healthcare imaging and GenAI/LLM systems
“Built and deployed a production LLM/RAG clinical document understanding and summarization system for healthcare, focused on reducing manual review time while meeting strict accuracy, latency, and compliance needs. Demonstrates strong MLOps/orchestration depth (Airflow, Kubernetes, Azure ML Pipelines), a rigorous approach to hallucination mitigation through layered, source-grounded safeguards, and stakeholder-driven requirements gathering with physicians and compliance teams.”
Senior Machine Learning Engineer specializing in LLMs, NLP, and computer vision
“Built and owned production GenAI systems for both infrastructure automation and customer support. Most notably, created a self-healing multi-cloud incident response system that automated 65% of tier-1 alerts and reduced application crashes by 75%, and shipped a hybrid RAG support triage agent that automated 60% of tier-1 inquiries with human escalation guardrails.”
Mid-level Software Engineer specializing in Python, cloud, and ML applications
Mid-level Data Scientist/ML Engineer specializing in Generative AI, NLP, and RAG systems
Mid-level AI/ML Engineer specializing in banking risk, fraud detection, and NLP
Mid-level AI/ML Engineer specializing in GenAI, NLP, and AWS MLOps
Mid-level AI/ML Engineer specializing in scalable ML systems and LLM applications
Mid-level Full-Stack AI Engineer specializing in agentic SaaS and LLM systems
Mid-level AI/ML Engineer specializing in LLMs, NLP, and AWS MLOps
“Recent master’s graduate in robotics with applied experience across reinforcement learning and ROS 2 autonomy stacks. Built an RL-based drone vertiport traffic controller (PPO) focused on reward design and simulation integration, and has hands-on navigation work in ROS 2 including LiDAR preprocessing, SLAM/path planning, and stabilizing TurtleBot3 wall-following. Also brings deployment experience containerizing robotics nodes and scaling them with Kubernetes on AWS.”
Mid-level Machine Learning Engineer specializing in healthcare and financial AI
Senior Data Scientist specializing in LLM applications, RAG systems, and production ML
“Senior Data Scientist in consulting who has built production RAG systems for insurance/annuity document search at large scale (100K+ PDF pages), emphasizing grounded answers, guardrails, and low-latency retrieval. Experienced in end-to-end MLOps for LLM apps—monitoring, evaluation sets, drift handling, and safe rollouts—and in orchestrating complex pipelines with Prefect/Airflow and deploying services on Kubernetes.”
Mid-level GenAI Engineer specializing in LLM agents and production AI workflows
“Designed and deployed end-to-end LLM-powered AI agent systems to automate knowledge-intensive workflows across marketing/GTM, recruiting, and support. Brings production reliability rigor (evaluation pipelines, monitoring, testing, A/B experiments) plus orchestration expertise (Airflow, Prefect, custom Python) and a track record of translating non-technical stakeholder goals into working AI solutions (e.g., a personalized customer engagement agent at Lara Design).”
Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps
“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”