Pre-screened and vetted in New York.
Mid-level ML Engineer specializing in generative AI, RAG, and production ML systems
Mid-level Data Scientist specializing in fraud detection and ML pipelines
Senior Data Scientist specializing in applied ML, NLP, and computer vision
Mid-level GenAI/ML Engineer specializing in LLM agents and RAG for fraud detection
Mid-level AI/ML Engineer specializing in credit risk, fraud detection, and NLP in financial services
Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps
“Built and deployed a production LLM-powered lesson adaptation platform for K–12 educators that personalizes content for multilingual and neurodiverse students using RAG and content transformation. Owned the full stack from FastAPI backend and OpenAI integration through reliability/safety controls, latency/cost optimization, and weekly shippable modular APIs, iterating directly with curriculum stakeholders to reduce hallucinations and improve educator trust.”
Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps
“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”
Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps
“LLM/ML platform engineer with hands-on experience taking an LLM document summarization prototype into a production-grade service on AWS EKS, emphasizing low-latency inference, drift monitoring, and safe CI/CD rollouts (canary + rollback). Strong in real-time debugging of agentic/RAG systems (tracing, retrieval/index drift fixes) and in developer enablement through practical workshops (Docker/Kubernetes/FastAPI) plus pre-sales support via demos and benchmarks to close pilots.”
Mid-level Machine Learning & GenAI Engineer specializing in LLMs, RAG, and NLP
“Built and deployed an LLM-powered customer support assistant (‘Notable Assistant’) focused on automating common post-customer queries while maintaining multi-turn context and meeting scalability/latency needs. Experienced with production orchestration and operations using Kubernetes and Apache Airflow (DAG-based ETL, scheduling, monitoring/alerts), and has partnered closely with customer service stakeholders to align chatbot behavior with brand voice through iterative testing.”
Mid-level Software Engineer specializing in cloud-native backend and AI/ML systems
Mid-level AI/ML Engineer specializing in healthcare and pharmaceutical AI
Mid-level Software Engineer specializing in backend systems and LLM-powered workflows
Senior AI/ML Engineer specializing in Python, LLMs, and agentic AI on cloud platforms
Mid-level AI/ML Engineer specializing in MLOps, NLP, and computer vision
Mid-level AI/ML Engineer specializing in GenAI, fraud detection, and healthcare AI
Mid-level AI/ML Engineer specializing in MLOps and LLM applications
“BNY Mellon engineer who has built and operated production AI systems end-to-end: a LangChain/Pinecone RAG platform scaled via FastAPI + Kubernetes to 1000 RPM with 99.9% uptime, supported by monitoring and data-drift detection. Also deep in data/infra orchestration (Airflow, Dagster, Terraform on AWS/EMR/EC2), processing 500GB+ daily and delivering measurable reliability and performance gains, plus strong compliance-facing model explainability using SHAP and Tableau.”
Mid-level AI/ML Engineer specializing in deep learning, MLOps, and LLM applications
“Built and deployed production LLM assistants for internal Q&A and customer-feedback summarization, emphasizing reliability (RAG, prompt tuning, validation/whitelisting) and privacy safeguards. Improved adoption by adding explainable outputs and a user feedback mechanism, and has hands-on orchestration experience with Airflow and Azure Logic Apps.”
Mid-level AI/ML Engineer specializing in Generative AI and data engineering
“IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.”
Junior Robotics & Machine Learning Engineer specializing in perception, SLAM, and control
Mid-level Machine Learning Engineer specializing in MLOps and LLM/RAG systems
Mid-level Data Scientist / ML Engineer specializing in healthcare predictive analytics and NLP
“Built and deployed a real-time hospital readmission risk prediction system at NYU Langone Health, combining structured EHR data with BERT-based NLP on clinical notes and serving predictions to clinicians via Azure ML and FHIR APIs. Emphasizes production reliability and clinical trust through SHAP-based explainability and robust healthcare data preprocessing, and reports a 22% reduction in 30-day readmissions.”
Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps
“Applied LLMs to high-stakes domains (wildfire risk for emergency teams and loan approval via a fine-tuned IBM Granite model), with a strong focus on reliability—using RAG-based cross-validation to reduce hallucinations and continuous ingestion pipelines (MODIS satellite imagery via AWS Lambda) to keep data current. Experienced in production orchestration and MLOps-style workflows using Airflow, AWS Step Functions, and SageMaker Pipelines, and collaborates closely with analysts on KPI-driven evaluation.”
Mid-level AI/ML & Backend Engineer specializing in AI platforms and computer vision
“Backend engineer with hands-on experience building real-time, low-latency systems: owned the Python backend for a real-time crowd-monitoring product (top 5% at HackHarvard 2025) using OpenCV, GPU YOLO inference (PyTorch), WebRTC, and OAuth. Also has production Kubernetes/GitOps experience (Helm/Kustomize, GitHub Actions, Argo CD), Kafka-based event pipelines, and executed a minimal-downtime on-prem PostgreSQL migration to AWS EC2.”