Pre-screened and vetted in the Bay Area.
Mid-level AI/ML Engineer specializing in LLM RAG pipelines and cloud MLOps
Mid-level AI/ML Engineer specializing in LLM, RAG, and multimodal systems
Mid-level Machine Learning Engineer specializing in fraud detection and recommendations
Staff/Lead Software Architect specializing in Contact Center platforms and GenAI automation
“Built and deployed production LLM systems in healthcare and at LinkedIn: automated pen-and-paper clinical trial evaluations with a 40x efficiency gain and created an evidence-based Evaluation Agent focused on accuracy and speed. Also used Temporal to orchestrate resilient data-ingestion workflows for customer support staffing prediction, improving prediction outcomes by 40% while handling missing data, retries, and backfills.”
Senior Machine Learning Engineer specializing in conversational AI and Generative AI
“ML/AI engineer with experience at Uber and Scale AI, focused on customer service automation across both classical NLP and generative AI systems. Has owned systems from experimentation through production on AWS, including LLM fine-tuning, RAG optimization, safety evaluation, and internal Python platform tooling that improved consistency and engineering velocity.”
Intern Data Scientist specializing in GenAI (LLMs, RAG) and ML model optimization
“Built and deployed a production LLM-powered risk assistant for KPMG and Freddie Mac that lets analysts query a confidential Neo4j risk graph in natural language (no Cypher), turning multi-day analysis into minutes with traceable, cited answers. Implemented rigorous guardrails, deterministic verification, RBAC/security controls, and a full eval/observability stack, cutting query error rate by ~50% and iterating through weekly UAT with non-technical risk analysts.”
Intern Applied AI/Software Engineer specializing in computer vision and full-stack platforms
“Built production LLM systems focused on reliability and safety, including a plain-English deployment tool that generates validated plans and provisions to Kubernetes while preventing unsafe actions via schema enforcement and plan/execute separation. Also created multi-LLM workflows (LangGraph) and stakeholder-friendly demos at Bosch, including a PyQt/FastAPI/CUDA app comparing SAM2 vs SAMWISE for on-device object detection with intuitive UX for business users.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference
“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”
Intern Machine Learning/Robotics Engineer specializing in computer vision and 3D simulation
Intern Perception/Robotics Engineer specializing in computer vision and embodied AI
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Engineering Manager specializing in AI/ML and full-stack product delivery
Mid-level Machine Learning Engineer specializing in fraud prevention and LLM systems
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Intern Machine Learning Engineer specializing in RL post-training for LLMs and VLMs
Mid-level Data Engineer specializing in ML, LLM pipelines, and analytics platforms
Mid-level AI/ML Engineer specializing in multimodal and generative AI at scale
Junior AI/ML Engineer specializing in agentic AI and cloud optimization
Mid-level AI/ML Engineer specializing in recommendation, retrieval, and MLOps
Mid AI/ML Engineer specializing in LLM systems and inference optimization
Senior AI/ML Engineer specializing in LLMs, NLP, and enterprise conversational AI
“ML/GenAI engineer with strong end-to-end production ownership across predictive ML, RAG systems, and LLM routing. They pair solid platform engineering skills with measurable business impact, including 15% churn reduction, 35% support ticket deflection, 45% GenAI cost savings, and a shared inference library that cut deployment time from weeks to days.”
Intern AI Engineer specializing in agentic systems and full-stack products