Pre-screened and vetted in the NYC Metro.
Mid-level Machine Learning Engineer specializing in Generative AI and foundation models
Mid-level Data Scientist / ML Engineer specializing in healthcare predictive analytics and NLP
“Built and deployed a real-time hospital readmission risk prediction system at NYU Langone Health, combining structured EHR data with BERT-based NLP on clinical notes and serving predictions to clinicians via Azure ML and FHIR APIs. Emphasizes production reliability and clinical trust through SHAP-based explainability and robust healthcare data preprocessing, and reports a 22% reduction in 30-day readmissions.”
Mid-level AI/ML & Backend Engineer specializing in AI platforms and computer vision
“Backend engineer with hands-on experience building real-time, low-latency systems: owned the Python backend for a real-time crowd-monitoring product (top 5% at HackHarvard 2025) using OpenCV, GPU YOLO inference (PyTorch), WebRTC, and OAuth. Also has production Kubernetes/GitOps experience (Helm/Kustomize, GitHub Actions, Argo CD), Kafka-based event pipelines, and executed a minimal-downtime on-prem PostgreSQL migration to AWS EC2.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services
“Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.”
Mid-level Machine Learning Engineer specializing in LLM platforms and robotic perception
“Built and shipped a production multi-agent personal financial assistant at AlphevaAI on AWS ECS, combining FastAPI microservices, Redis/SQS orchestration, and Pinecone-based hybrid RAG (semantic + BM25) to ground financial guidance. Improved routing accuracy with an embedding-based SetFit + logistic regression intent classifier feeding an LLM router, and optimized UX with live streaming plus cost controls via model tiering and caching.”
Mid-level AI/ML Engineer specializing in healthcare imaging and GenAI/LLM systems
“Built and deployed a production LLM/RAG clinical document understanding and summarization system for healthcare, focused on reducing manual review time while meeting strict accuracy, latency, and compliance needs. Demonstrates strong MLOps/orchestration depth (Airflow, Kubernetes, Azure ML Pipelines) and a rigorous approach to hallucination mitigation through layered, source-grounded safeguards and stakeholder-driven requirements with physicians/compliance teams.”
Mid-level Software Engineer specializing in Python, cloud, and ML applications
Mid-level Data Scientist/ML Engineer specializing in Generative AI, NLP, and RAG systems
Mid-level AI/ML Engineer specializing in banking risk, fraud detection, and NLP
Mid-level AI/ML Engineer specializing in GenAI, NLP, and AWS MLOps
Mid-level AI/ML Engineer specializing in LLMs, NLP, and AWS MLOps
“Recent master’s graduate in robotics with applied experience across reinforcement learning and ROS 2 autonomy stacks. Built an RL-based drone vertiport traffic controller (PPO) focused on reward design and simulation integration, and has hands-on navigation work in ROS 2 including LiDAR preprocessing, SLAM/path planning, and stabilizing TurtleBot3 wall-following. Also brings deployment experience containerizing robotics nodes and scaling them with Kubernetes on AWS.”
Mid-level Machine Learning Engineer specializing in healthcare and financial AI
Senior Data Scientist specializing in LLM applications, RAG systems, and production ML
“Senior Data Scientist in consulting who has built production RAG systems for insurance/annuity document search at large scale (100K+ PDF pages), emphasizing grounded answers, guardrails, and low-latency retrieval. Experienced in end-to-end MLOps for LLM apps—monitoring, evaluation sets, drift handling, and safe rollouts—and in orchestrating complex pipelines with Prefect/Airflow and deploying services on Kubernetes.”
Mid-level GenAI Engineer specializing in LLM agents and production AI workflows
“Designed and deployed end-to-end LLM-powered AI agent systems to automate knowledge-intensive workflows across marketing/GTM, recruiting, and support. Brings production reliability rigor (evaluation pipelines, monitoring, testing, A/B experiments) plus orchestration expertise (Airflow, Prefect, custom Python) and a track record of translating non-technical stakeholder goals into working AI solutions (e.g., personalized customer engagement agent at Lara Design).”
Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps
“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”
Junior Machine Learning Engineer specializing in multimodal systems and LLMs
“Built and productionized a domain-specific LLM-powered RAG knowledge assistant at JerseyStem for answering questions over large internal document corpora, owning the full stack from FAISS retrieval and LoRA/QLoRA fine-tuning to AWS autoscaling GPU deployment. Drove measurable gains (28% accuracy lift, 25% latency reduction) and improved reliability through hybrid retrieval, grounded decoding, preference-model reranking, and Airflow-orchestrated pipelines (35% faster runtime), while partnering closely with non-technical stakeholders to define success metrics and ensure adoption.”
Junior AI & Machine Learning Engineer specializing in LLM automation and RAG systems
Mid-level AI/ML Engineer specializing in LLM, RAG, and semantic search systems
Mid-level AI/ML Engineer specializing in LLMs, RAG, and agentic AI systems
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
“LLM engineer/data analyst who built a production RAG QA assistant over the Jurafsky & Martin NLP textbook to reduce hallucinations and provide explainable, source-grounded answers. Experienced with LangChain/LangGraph orchestration, retrieval optimization (embeddings, vector DBs, caching), and rigorous evaluation/monitoring (Retrieval@K, A/B tests, telemetry/drift). Previously communicated analytics insights to non-technical stakeholders at GS Analytics using Power BI and simplified reporting.”
Mid-level GenAI Engineer specializing in LLM agents and RAG systems
“Built and deployed a production RAG-based LLM assistant that answers day-to-day operational questions from internal PDFs/SOPs, with strong emphasis on data consistency (metadata versioning, confidence thresholds, conflict handling) and low-latency retrieval at scale. Experienced designing and orchestrating multi-agent LLM workflows (retrieval/validation/generation) and pipeline orchestration for ingestion/embedding/vector-store updates, plus iterative delivery with non-technical operations/business stakeholders.”
Mid-level Generative AI Engineer specializing in LLMs and RAG for enterprise and FinTech
Mid-level Machine Learning Engineer specializing in GenAI, RAG, and medical imaging