Pre-screened and vetted in the NYC Metro.
Mid-level AI/ML Engineer specializing in credit risk, fraud detection, and NLP in financial services
Junior AI Engineer specializing in LLMs, RAG, and agent evaluation
Mid-level ML Engineer specializing in production NLP, forecasting, and anomaly detection
Mid-level AI/ML Engineer specializing in conversational AI, NLP, and LLM-powered RAG systems
Mid-level Machine Learning Engineer specializing in healthcare NLP and MLOps
“ML/AI practitioner in healthcare (Syneos Health) who has deployed production clinical NLP and risk models. Built a BERT-based physician-note information extraction system on Docker + AWS SageMaker (reported ~42% retrieval improvement) and automated retraining/deployment with Airflow and drift detection, while partnering closely with clinicians to drive adoption (reported ~18% readmission reduction).”
Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps
“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”
Intern AI/ML Engineer specializing in agentic systems and full-stack development
“Built and scaled a multi-agent LLM automation pipeline during a fintech internship, growing from a rapid 1-week proof-of-concept to a 15+ agent hierarchical system that cut market brief report generation time from ~5 hours to under 30 minutes. Hands-on with agent frameworks (Haystack, CrewAI, LangChain) and experienced in debugging agent communication issues via sandboxed modular testing and context/token management; also regularly gives architecture-first technical demos at hackathons and university events.”
Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps
“LLM/ML platform engineer with hands-on experience taking an LLM document summarization prototype into a production-grade service on AWS EKS, emphasizing low-latency inference, drift monitoring, and safe CI/CD rollouts (canary + rollback). Strong in real-time debugging of agentic/RAG systems (tracing, retrieval/index drift fixes) and in developer enablement through practical workshops (Docker/Kubernetes/FastAPI) plus pre-sales support via demos and benchmarks to close pilots.”
Mid-level Machine Learning Engineer specializing in MLOps and GenAI analytics
“ML/LLM practitioner who has deployed a production RAG-based trouble-call identifier using multiple datasets (device, network, past complaints). Experienced in end-to-end MLOps (FastAPI + Docker + Kubernetes with HPA) and in evaluating/monitoring LLM behavior to reduce hallucinations, with additional applied work in forecasting/anomaly detection and churn prediction for retention campaigns.”
Mid-level Machine Learning & GenAI Engineer specializing in LLMs, RAG, and NLP
“Built and deployed an LLM-powered customer support assistant (“Notable Assistant”) focused on automating common post-customer queries while maintaining multi-turn context and meeting scalability/latency needs. Experienced with production orchestration and operations using Kubernetes and Apache Airflow (DAG-based ETL, scheduling, monitoring/alerts), and has partnered closely with customer service stakeholders to align chatbot behavior with brand voice through iterative testing.”
Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps
Mid-level Software Engineer specializing in AI code evaluation and full-stack development
Mid-level AI/ML Engineer specializing in healthcare and pharmaceutical AI
Senior AI/ML Engineer specializing in Generative AI, RAG, and LLM fine-tuning
Senior AI/ML Engineer specializing in Python, LLMs, and agentic AI on cloud platforms
Mid-level AI/ML Engineer specializing in MLOps, NLP, and computer vision
Junior Software Engineer specializing in AI/ML and full-stack development
Junior AI/ML Engineer specializing in healthcare NLP and MLOps
Mid-level Machine Learning Engineer specializing in safety-critical and uncertainty-aware ML systems
“Built and productionized an LLM-powered assistant for company documents and support questions, focused on reducing time spent searching PDFs/policies/tickets while preventing hallucinations by grounding answers in approved sources. Demonstrates strong production engineering (Kubernetes/orchestration, caching, monitoring, fallbacks) plus security-minded permissioning and close collaboration with operations/support stakeholders.”
Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms
“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”
Mid-level AI/ML Engineer specializing in MLOps and LLM applications
“BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1,000 requests per minute with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.”
Mid-level AI/ML Engineer specializing in Generative AI and data engineering
“IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.”