Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in Generative AI and intelligent automation
“LLM engineer who built and productionized a system to classify GitHub commits (performance vs. non-performance) using zero-/few-shot approaches over commit messages and diffs, working at ~5M-record scale on multi-node NVIDIA GPUs. Experienced in orchestrating end-to-end LLM pipelines with Airflow and GitHub Actions; emphasizes reliability via testing, guardrails, and observability while collaborating closely with non-technical product stakeholders.”
Mid-level AI/ML Engineer specializing in production ML, RAG systems, and MLOps
“Built and shipped a widely adopted, production-grade RAG internal search assistant that unified scattered engineering knowledge, deployed as a FastAPI service on Kubernetes with FAISS + LangChain. Demonstrates deep practical expertise in retrieval tuning (chunking, hybrid search, re-ranking) and in making LLM workflows reliable in production via guardrails, monitoring, and evaluation, plus strong cross-functional delivery with non-technical operations teams.”
Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants
“GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.”
Mid-level ML & Data Engineer specializing in GenAI, graph modeling, and fraud/risk analytics
“Built a production AI fraud/risk scoring platform at BlueArc that ingests web business/product/site data, generates text+image embeddings, and connects entities in a graph to detect reuse patterns and links to known bad actors. Optimized for scale with incremental graph re-scoring and delivered investigator-friendly explainability by surfacing the exact signals/relationships behind each score; orchestrated workflows with Airflow and GCP event-driven components (Pub/Sub, Dataflow, Cloud Run) and has recent LLM workflow orchestration experience (retrieval, prompting, scoring).”
Mid-level Robotics Software Engineer specializing in perception, sensor fusion, and motion planning
“Robotics/Perception Software Engineer at Berkshire Grey who built and hardened a production ROS-based perception + supervision stack for autonomous trailer-unloading robots (RGB-D + LiDAR), including grasp/geometry estimation and segmentation. Diagnosed real-time behavior issues by instrumenting ROS pipelines, then implemented runtime RANSAC-based compensation for LiDAR yaw bias and TF-window validation; also supports containerized deployment on Kubernetes and is actively porting the system from ROS1 to ROS2.”
Mid-level AI/Machine Learning Engineer specializing in Generative AI, NLP, and MLOps
“Built a production LLM/RAG document analysis system for large financial documents (credit reports/PDFs) to help business analysts extract insights faster. Implemented end-to-end pipeline orchestration with LangChain, vector search (e.g., FAISS), and hallucination controls (context grounding, similarity thresholds, and no-answer fallback), delivered as a Dockerized Python API.”
Mid-level Machine Learning Engineer specializing in cloud-native generative AI for healthcare
“AI engineer at Cleveland Clinic building production LLM/NLP systems for radiology documentation, focused on HIPAA-aware, real-time performance across ~298 campuses. Re-architected infrastructure with AWS event-driven services to handle scaling and improved SLA compliance ~40%, and complements this with a personal multi-agent debate system (CrewAI) using local Llama/Mistral plus rigorous evaluation (A/B tests, red teaming, observability).”
Mid-level Machine Learning Engineer specializing in data security and GenAI systems
“Built Hexagon’s production Text-to-CAD Copilot that converts text and rough sketches into editable CAD code, combining GraphRAG (Neo4j/LangChain) with a Gemini-powered vision module and multi-agent geometric validation, cutting manual modeling from a day to ~45 seconds and driving retrieval latency below 50 ms. Also has large-scale GCP data/ML orchestration experience (Airflow/Cloud Composer, Dataflow, Pub/Sub, Snowflake) processing 50M+ daily records with drift monitoring and automated reliability controls.”
Senior Full-Stack AI Engineer specializing in LLM/RAG agentic systems
“Built and deployed JobMatcher AI, an LLM-driven workflow automation product for job seekers that extracts requirements from job descriptions, matches them to user skills, and generates tailored outreach. Demonstrated strong production engineering by cutting per-run cost ~70%, improving reliability with retries/backoff/fallbacks, and reducing hallucinations via schema validation and templating; also orchestrated the system with LangGraph plus Docker Compose across API, vector DB, and workers.”
Junior Machine Learning Engineer specializing in GenAI and LLM fine-tuning
“Robotics software engineer focused on hard real-time autonomy for legged robots, building a quadruped navigation stack that combines vision SLAM with MPC and maintains a deterministic 500Hz control loop. Deep performance optimization experience across CUDA (sub-2ms perception latency), ROS 2/DDS real-time tuning, and motion planning (cut 500ms spikes to sub-5ms). Also designed distributed ROS 2 + Zenoh communications between quadrupeds and aerial drones and validated robustness under lossy wireless conditions.”
Mid-level AI Engineer specializing in agentic LLM systems and RAG platforms
“Built and shipped Serrano AI, a multi-tenant SaaS conversational AI platform that automates Odoo ERP workflows and lets ops/finance/supply-chain teams query ERP data in natural language. Implemented a multi-agent architecture (LangChain/LangGraph/CrewAI) with hybrid RAG over ERP schemas, deployed on Heroku/Vercel with production observability, cutting reporting time by ~80% while addressing hallucinations, latency, and schema complexity.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and healthcare AI
“Healthcare ML/AI engineer with production experience at UnitedHealth Group, including an end-to-end readmission prediction system built on 50M+ patient records that improved accuracy by 18% and reduced preventable readmissions by 12%. Also shipped a clinically grounded LLM/RAG referral generator with human-in-the-loop safety controls, showing strong depth in regulated, high-stakes AI systems.”
Entry-level AI Engineer specializing in LLMs and applied NLP systems
“Built Lumo, a real-time voice AI companion, owning the product end-to-end across React/TypeScript, FastAPI WebSockets, and PostgreSQL. Stands out for combining deep full-stack systems thinking with voice UX polish, reliability instrumentation, and configurable parent-control guardrails in a multi-tenant setup.”
Mid-level Data Scientist / ML Engineer specializing in Generative AI, RAG, and MLOps
“Built and productionized a RAG-based LLM research assistant for biomedical and regulatory document search using Mixtral 7B on SageMaker, LangChain, and Milvus, cutting research time by ~40%. Has hands-on multi-cloud MLOps experience across AWS/Azure/GCP with Kubeflow/Airflow/Composer plus Terraform + ArgoCD, and applies rigorous evaluation/monitoring (latency, accuracy, hallucinations). Also partnered with a non-technical PM to deliver an insurance policy Q&A chatbot that reduced customer response time by 30%+.”
Intern Full-Stack & AI Engineer specializing in LLM applications and computer vision
Mid-level Generative AI & Machine Learning Engineer specializing in LLMs, RAG, and multimodal AI
Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps
Mid-level AI & Machine Learning Engineer specializing in Generative AI, NLP & MLOps
Mid-level Data Scientist specializing in LLMs, RAG, and ML systems
Junior Machine Learning Engineer specializing in LLMs and information retrieval
Mid-level Full-Stack Engineer specializing in AI, GTM systems, and backend platforms
Mid-level Software Engineer specializing in backend, ML, and cloud systems