Pre-screened and vetted in the NYC Metro.
Mid-level Machine Learning & GenAI Engineer specializing in LLMs, RAG, and NLP
“Built and deployed an LLM-powered customer support assistant (“Notable Assistant”) focused on automating common post-customer queries while maintaining multi-turn context and meeting scalability/latency needs. Experienced with production orchestration and operations using Kubernetes and Apache Airflow (DAG-based ETL, scheduling, monitoring/alerts), and has partnered closely with customer service stakeholders to align chatbot behavior with brand voice through iterative testing.”
Mid-level Full-Stack AI Engineer specializing in distributed systems and GenAI
Mid-level Software Engineer specializing in AI code evaluation and full-stack development
Mid-level AI/ML Engineer specializing in healthcare and pharmaceutical AI
Senior AI/ML Engineer specializing in Generative AI, RAG, and LLM fine-tuning
Senior AI/ML Engineer specializing in Python, LLMs, and agentic AI on cloud platforms
Mid-level AI/ML Engineer specializing in MLOps, NLP, and computer vision
Junior Software Engineer specializing in AI/ML and full-stack development
Senior Machine Learning Engineer specializing in Generative AI and LLM systems
Senior AI Architect specializing in Generative AI and LLM systems
Mid-level Machine Learning Engineer specializing in safety-critical and uncertainty-aware ML systems
“Built and productionized an LLM-powered assistant for company documents and support questions, focused on reducing time spent searching PDFs/policies/tickets while preventing hallucinations by grounding answers in approved sources. Demonstrates strong production engineering (Kubernetes/orchestration, caching, monitoring, fallbacks) plus security-minded permissioning and close collaboration with operations/support stakeholders.”
Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms
“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”
Mid-level AI/ML Engineer specializing in MLOps and LLM applications
“BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.”
Mid-level AI/ML Engineer specializing in Generative AI and data engineering
“IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.”
Senior Full-Stack Engineer specializing in React/Next.js and AI engineering
Intern Machine Learning Engineer specializing in RAG, semantic search, and applied NLP
Senior Machine Learning Engineer specializing in AI, NLP, and computer vision
Mid-level strategy and legal professional specializing in FinTech, AI, and market entry
Mid-level Machine Learning Engineer specializing in NLP, computer vision, and LLMs
Mid-level Generative AI Engineer specializing in RAG, multi-agent LLM systems, and LLMOps