Pre-screened and vetted.
Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision
“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”
Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision
“ML/AI engineer with production experience at Shopify and Intel, building a deep learning product ranking system that lifted add-to-cart ~14% and serving real-time similarity search via FAISS+Redis under <20ms latency at massive scale. Also deployed computer vision models to 100+ retail edge locations using Docker/Ansible/k3s with zero-downtime rollouts, and applies strong MLOps practices (A/B testing, canary/shadow, observability) plus performance optimization (OpenVINO, INT8).”
Senior AI/ML Engineer specializing in GenAI agents and LLM workflows
“LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.”
“Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.”
Mid-level Software Engineer specializing in machine learning and full-stack AI systems
“Built production-grade Python systems in a medical/imaging context, including an image feature extraction and survival prediction microservice with strong testing, validation, and observability practices. Also developed a Playwright-based autonomous job application agent that handled dynamic UIs and anti-bot challenges with stealth tooling, proxies, and human-in-the-loop escalation.”
Mid-level AI/ML Engineer specializing in generative AI and intelligent automation
“Backend-focused AI engineer with enterprise experience building startup-style internal products at JPMorgan Chase. He helped create an AI-powered financial research platform for analysts, leading retrieval and multi-agent orchestration work that cut research prep from hours to under 20 minutes while scaling across large volumes of SEC filings and earnings transcripts.”
Junior Software Engineer specializing in AI systems and distributed backend platforms
“Built end-to-end AI features across both fitness and insurance domains, including a full-stack personalized workout recommendation system and a production RAG-based insurance QA assistant at Relevance Labs. Stands out for combining backend/distributed systems skills with practical LLM architecture, evaluation, and risk-aware human-in-the-loop design; notably reduced unnecessary LLM calls by 40% while improving latency and answer reliability.”
“ML engineer with production experience at Goldman Sachs and Medtronic, focused on real-time AI systems in fraud detection and healthcare. Brings a rare mix of backend ML infrastructure, MLOps, and product-minded UX thinking, including dashboard and API design that made complex model outputs usable for analysts and clinical users.”
Mid-level AI/ML Engineer specializing in NLP, LLMs, and MLOps for healthcare and finance
“Built a production LLM-powered RAG agent for healthcare/insurance operations that retrieves and summarizes patient medical documents with grounded citations, scaling to ~4.5M records. Addressed medical shorthand and terminology by fine-tuning ~120 lightweight DistilBERT models by specialty and validating entities against SNOMED/RxNorm, while using SHAP/LIME and human-in-the-loop review to make decisions explainable to stakeholders.”
Intern Software Engineer specializing in ML/NLP and LLM applications
“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”
Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems
“Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.”
Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions
“Built and owned an end-to-end LLM-powered fraud investigation assistant that automated case summaries and risk analysis, cutting analyst investigation/documentation time by 40%. Stands out for translating RAG concepts into a production-grade internal platform with strong evaluation, monitoring, and reusable Python service architecture that improved both analyst trust and engineering velocity.”
Senior AI/ML Engineer specializing in LLMs, NLP, and enterprise conversational AI
“Built and owned a production conversational AI platform for a healthcare contact center, including RAG-based agent assist, hybrid retrieval, safety guardrails, and production monitoring. Stands out for combining LLM product delivery with strong operational rigor, driving a reported 25-30% improvement in handling time in a sensitive healthcare environment.”
Mid-level AI/ML Engineer specializing in fraud detection and recommendation systems
“ML engineer with production experience at PayPal and Flipkart, owning high-scale systems across fraud detection, recommendations, and LLM tooling. Stands out for combining strong modeling judgment with practical platform engineering, delivering measurable impact like 22% fewer fraud false positives, 18% CTR lift, 40% less LLM manual review, and 30% faster redeployments.”
Mid-level AI/ML Engineer specializing in LLM agents and RAG systems
“LLM/agentic systems builder at Verizon who deployed a LangGraph-orchestrated multi-agent ticket-automation platform with RAG (FAISS) to replace brittle rule-based bots. Improved routing correctness by ~30–40%, hit ~300ms latency targets via model routing, and reduced ops workload by ~60% through tight iteration with non-technical stakeholders and strong testing/observability practices.”
Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems
“Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.”
Mid-level Machine Learning Engineer specializing in computer vision and LLM pipelines
“ML/LLM engineer who built production systems to speed up artist content-creation workflows, including a fine-tuned image captioning model paired with a RAG layer over image embeddings/captions to improve consistency across changing domains. Experienced orchestrating multi-tool agents with LangChain/LangGraph (planning + critic/reflection) and setting up practical monitoring (caption rejection rate) plus evaluation sets for tool-calling accuracy, output quality, and latency.”
Mid-level GenAI Engineer specializing in RAG, LLMs, and enterprise AI
“Built and shipped production LLM agents that automate document processing and decision workflows, with a strong focus on reliability, guardrails, and measurable business impact. Stands out for combining RAG, tool calling, evals/monitoring, and ERP integration to deliver 30-35% manual effort reduction and higher throughput without additional headcount.”
Mid-level AI/ML Engineer specializing in NLP, Generative AI, and predictive analytics
“GenAI/LLM engineer who architected and deployed a production RAG “research assistant” for JPMorgan Chase’s regulatory compliance team, focused on safety-critical behavior (mandatory citations, refusal when evidence is missing). Deep hands-on experience with LlamaIndex, Pinecone, Hugging Face embeddings, LangGraph agent workflows, and metric-driven evaluation (golden sets, TruLens), including a reported 28% relevancy lift via cross-encoder re-ranking.”
Mid-level Machine Learning Engineer specializing in LLMs and AI products
“Applied ML/LLM engineer currently building AppleCare’s production chat recommender, owning the full lifecycle from transcript cleaning and fine-tuning through distributed deployment, monitoring, and iterative improvement. Their work delivered >10% copy-count improvement, 5% lower modification rate, 60% cost reduction, and $1.1M profitability in 2025, and they also created a reasoning-data generation approach that enabled a reasoning model and a judge model that cut eval time by over 99%.”
Senior Data Scientist / Generative AI Engineer specializing in fraud, risk, and MLOps
“Built and deployed a production LLM/RAG fraud investigation system to replace manual investigator workflows, combining transaction data, historical cases, and policy documents with agent-style steps and LoRA fine-tuning. Demonstrates strong reliability engineering (grounding, citations, abstention paths), performance optimization (retrieval/indexing/caching), and end-to-end MLOps orchestration using Azure ML Pipelines/MLflow plus Kubernetes/Argo with canary and rollback deployments.”
Senior AI & Machine Learning Engineer specializing in GenAI, Agentic AI, and RAG
“Built a production agentic AI system to automate data science work using a layered architecture (executive-summary handling, tool-based execution, and on-the-fly code generation). Demonstrates strong end-to-end agent development practices including RAG with vector databases, prompt engineering, and multi-method evaluation (LLM-as-judge/human/code-based), plus Airflow-based orchestration for ML data pipelines and close collaboration with business end users.”
Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise data platforms
“Built and shipped a production LLM-powered RAG assistant for enterprise internal document search (PDFs, knowledge bases, structured data), addressing real-world issues like noisy documents, hallucinations, and latency with grounded prompting, retrieval-confidence fallbacks, and performance optimizations. Also partnered with compliance and business teams at JPMc to deliver a solution aligned with regulatory constraints, supported by monitoring, feedback loops, and systematic evaluation.”