Pre-screened and vetted.
Senior Machine Learning Engineer specializing in LLMs, speech AI, and RAG systems
“AI engineer with production experience building multilingual speech-to-speech translation pipelines (ASR + LLM) for enterprise/media, focused on reliability at scale. Has hands-on orchestration experience (including IBM Watson contexts) and emphasizes production evaluation/monitoring using a mix of traditional metrics and LLM-based evaluators to catch quality regressions while balancing latency and cost.”
Mid-level AI Engineer specializing in LLMs, MLOps, and healthcare NLP
“Built a production, real-time clinical documentation system at HCA that converts doctor–patient conversations into structured clinical summaries using speech-to-text, LLM summarization, and RAG. Demonstrated measurable gains from medical-domain fine-tuning (clinical concept recall +18%, ROUGE-L 0.62 to 0.74) while meeting HIPAA constraints via PHI anonymization and encryption, and deployed via Docker/FastAPI with CI/CD and monitoring.”
Mid-level AI/ML Engineer specializing in GenAI, NLP, and financial systems
“GenAI/ML engineer with hands-on experience building production financial intelligence and document summarization systems at Citibank. Stands out for combining LLM fine-tuning, hybrid RAG, multi-agent workflows, and strong MLOps/observability practices to deliver measurable business impact, including 60% faster analyst retrieval, 31% higher precision, and 99%+ uptime.”
Mid-level Full-Stack Engineer specializing in AI and cloud platforms
“Built end-to-end product features spanning full-stack web development and LLM-powered systems in an early-stage startup environment. Notably shipped an AI financial assistant chatbot with agent routing, validation, fallback handling, and production monitoring, and also owned a scheduling system integrating Next.js, backend APIs, database design, and Google Calendar OAuth.”
Junior AI Engineer specializing in LLM systems and applied machine learning
“Yogesh is an AI/full-stack engineer from LangChain who says he was the sole developer and core maintainer of OpenSWE/OpenSpeed, an asynchronous coding agent in LangSmith Cloud that turns requests from Slack, Linear, and GitHub into reviewable PRs. He emphasizes production-grade agent infrastructure: event-driven workflow design, typed run states, observability, retries, and latency improvements via pre-warmed sandboxes.”
Junior Full-Stack Engineer specializing in AI systems and healthcare RAG
“AI/full-stack engineer with hands-on experience shipping both computer vision and LLM products in production across marketplace and healthcare settings. Built an automated device grading system at Northladder and improved a Deloitte healthcare chatbot using RAG, with a strong emphasis on grounding, validation, uncertainty handling, and human review for high-impact decisions.”
Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps
“Healthcare-focused LLM engineer who deployed a production triage and clinical knowledge retrieval assistant using RAG and LangGraph-orchestrated multi-agent workflows. Emphasizes clinical safety and compliance with robust hallucination controls, HIPAA/PHI protections (tokenization, encryption, audit logging, zero-retention), and human-in-the-loop escalation; reports a 75% latency reduction in a healthcare agent system.”
Mid-level AI/ML Engineer specializing in Generative AI and data engineering
“IBM engineer who built and deployed a production RAG-based LLM assistant using LangChain/FAISS with a fine-tuned LLaMA model, served via FastAPI microservices on Kubernetes, achieving 99%+ uptime. Demonstrates strong practical expertise in reducing hallucinations (semantic chunking + metadata-driven retrieval) and managing latency, plus mature MLOps practices (Airflow/dbt pipelines, MLflow tracking, monitoring, A/B and shadow deployments) and effective collaboration with non-technical stakeholders.”
Intern Data Scientist specializing in AI, analytics, and cloud data engineering
“Built a production multimodal LLM-based vendor risk assessment platform that ingests SOC reports and other documents, uses a strict RAG pipeline with grounded evidence (page/paragraph citations), and dramatically reduces analyst review time. Experienced with LangGraph/LangChain/AutoGen for stateful, fault-tolerant agent workflows, and emphasizes reliability (schema validation, guardrails) plus low-latency delivery (~1–2s) through hybrid retrieval, reranking, caching, and model tiering.”
Mid-level AI/ML Engineer specializing in Generative AI and production ML systems
“At CVS Health, the candidate productionized a RAG-based LLM solution in a regulated healthcare setting, emphasizing reliable data pipelines, LoRA fine-tuning, monitoring, safety guardrails, and A/B testing. They have hands-on experience troubleshooting real-time RAG failures (e.g., chunking/embedding issues) and regularly lead developer-focused demos/workshops while translating technical architecture into business value for stakeholders.”
Mid-level AI Engineer specializing in LLM orchestration, RAG, and multi-agent systems
“Research Assistant at the University of Houston who built and live-deployed a production RAG system for 1000+ research documents, using hybrid retrieval (dense+BM25+RRF) with cross-encoder reranking and RAGAS-based evaluation; reported 66% MRR, 0.85+ faithfulness, and 68% lower LLM inference costs. Also built a deployed LangGraph multi-agent research system (Researcher/Critic/Writer) with tool integrations (Tavily, arXiv) and dual memory (ChromaDB + Neo4j), plus freelance automation work delivering a WhatsApp chatbot and n8n workflows for a wholesale clothing business.”
Mid-level AI/ML & Full-Stack Engineer specializing in LLM agents and medical RAG systems
“Full-stack engineer at an early-stage startup building an agentic AI application for enterprise systems, combining customer-facing Next.js/React UI work (30% faster load times) with backend/workflow orchestration using FastAPI + n8n, Redis, and RabbitMQ. Previously at Deloitte USI, built BDD Selenium/Java automation and managed 200+ defects end-to-end using JIRA/JAMA to support on-time production releases.”
Staff RPA & Automation Engineer specializing in Financial Services
“Blue Prism RPA developer in a small FinTech-aligned team who owned ~20 production bots and drove both delivery and reliability. Built a shared VDI/locking design that cut infrastructure cost ~20–30% and routinely handled ServiceNow-driven production incidents end-to-end, including hotfixes and longer-term SDLC fixes. Also acted as a player-coach, training junior hires and maintaining high bot success rates (up to 99% within SLA).”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and time-series forecasting
“ML/AI engineer with hands-on ownership of production recommendation and RAG systems at Northern Trust. They combine transformer modeling, latency optimization, cloud deployment, and monitoring with measurable business impact, including 14% accuracy gains, 12% engagement improvement, and 19% better query relevance.”
Mid-level Full-Stack Engineer specializing in cloud-native and AI-powered applications
“Candidate has a thoughtful, hands-on approach to AI-assisted software development, treating AI as a pair programmer while retaining ownership of architecture, tradeoffs, and final code quality. They have practical experience using multi-agent workflows to ship small features end-to-end, including planning, execution, and gap detection under human oversight.”
Mid-level Software Engineer specializing in AI, cloud, and full-stack systems
“Full-stack and AI product engineer with strong AWS/Snowflake experience who built an internal feature flag platform and helped migrate a cybersecurity insights product into a multi-agent AI chat interface. They report production scale of 1M+ embeddings and 50k+ monthly queries, with outcomes including an 80% reduction in analyst work and dashboard generation in 7 minutes; the work was also featured by Claude and AWS.”
Senior Full-Stack Software Engineer specializing in AI agents and data platforms
“Full-stack and AI-focused builder who has shipped both customer-facing personalization at AT&T and internal LLM-powered automation/agent systems in startup environments. Stands out for combining TypeScript-heavy engineering rigor with practical AI orchestration, evaluation, and measurable business impact—from reducing support escalation through personalization to saving 10-11 hours per week by automating fragmented operational workflows.”
Intern Full-Stack Software Engineer specializing in cloud, voice AI, and billing systems
“Product-minded full-stack engineer at a B2B startup who ships high-stakes customer-facing features fast: delivered a Spanish AI support agent in 2 weeks by benchmarking LLMs and using native Spanish system prompts, reaching 90% resolution. Built the company’s first monetization system (hybrid subscription + usage) with Stripe/Firebase, emphasizing secure JWT-based flows and idempotent webhooks, and led a microservices decoupling effort that cut developer onboarding time by 50%.”
Mid-level AI/ML Engineer specializing in NLP and Generative AI
“Built and deployed a production LLM-powered RAG assistant for healthcare teams (care managers/support) to answer questions from clinical and policy documentation, emphasizing trustworthiness via improved retrieval, reranking, and strict grounding prompts to reduce hallucinations. Also has hands-on orchestration experience with Apache Airflow for end-to-end ETL/ML workflows and applies rigorous testing/metrics (hallucination rate, tool-call accuracy, latency, cost) to ensure reliable AI agent behavior.”
Mid-level Data Scientist specializing in LLMs, RAG, and document intelligence
“LLM/ML engineer who has shipped production systems in legal/financial-risk domains at Wolters Kluwer, including a hybrid OCR+deterministic+LLM extraction pipeline that structured UCC filings at massive scale and drove $6M+ in revenue. Also built LangGraph-based multi-agent “Deep Research” workflows with model routing, tool calls (MCP), persistence, and human-in-the-loop review, and partnered closely with policy writers to deliver LLM summarization that cut writing time by ~60%.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“LLM/agentic systems engineer who built a production "Agentic AI Diagnostic Assistant" for network engineers, using a multi-agent Llama 2 + LangChain architecture with RAG over telemetry/incident data in DynamoDB and confidence-based deferrals to reduce hallucinations. Also has strong MLOps/orchestration experience (Airflow, EventBridge, Spark, Docker, SageMaker/ECS) at multi-terabyte/day scale and delivered multilingual NLP analytics (fine-tuned BERT/spaCy) for support operations through hands-on stakeholder workshops.”
Senior AI Engineer specializing in Generative AI, NLP, and applied deep learning
“Built a production multi-agent LLM system at Live Nation on Databricks (LangGraph/LangChain) that let venue/event teams ask questions in Slack, auto-generated optimized route schedules, and produced inventory/stocking recommendations from historical SQL data and venue trends. Improved reliability by tightening prompts with strict JSON schemas, providing sample questions/SQL, and adding guardrails plus synthetic/edge-case testing, while iterating with event managers and senior VPs via prototypes and feedback loops.”
Executive Product & Strategy Consultant specializing in analytics, GTM, and GenAI prototyping
“Former mechanical engineering professional who pivoted into business development and drove major international growth through strategic partnerships (helping scale headcount from a few to 300+ in 2–3 years). Now finishing an MBA (graduating in May) and building a recruiting/hiring-manager-focused product—prototype complete, validating demand, and aiming to launch an MVP within ~2 months before fundraising.”
Mid-level Data Scientist / ML Engineer specializing in FinTech and Healthcare ML systems
“AI/LLM engineer who has shipped production RAG systems (including a 250K-document compliance knowledge tool on AWS) and focuses on reliability via citations, guardrails, and rigorous evaluation (Ragas/Opik/DeepEval). Also built a LangGraph-orchestrated webcrawler agent that cut research paper extraction from hours to minutes, and collaborated with clinical teams to deliver patient volume forecasting with an optimization layer for staffing.”