Pre-screened and vetted in California.
Junior Machine Learning Engineer specializing in LLMs, data pipelines, and MLOps
Mid-level AI/ML Engineer specializing in GPU-accelerated LLMs, RAG, and production MLOps
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and multi-agent systems
Senior AI/ML Engineer specializing in LLMs, RAG, and MLOps
Junior AI/ML Engineer specializing in LLM agents, RAG, and multimodal data pipelines
Senior Machine Learning & GenAI Engineer specializing in LLM systems and data pipelines
Senior Applied ML Scientist specializing in LLMs, ads ranking, and RAG systems
Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps
Senior AI/ML Engineer specializing in computer vision, NLP, and real-time forecasting
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multi-agent systems
Senior AI/ML Data Scientist specializing in recommender systems, LLMs, and MLOps
“ML/NLP leader with 12+ years of impact across LinkedIn, TikTok, and Levi's, building and productionizing multimodal recommendation and embedding-based search systems. Deep experience in entity resolution, vector retrieval, and rigorous evaluation, with cloud-native deployment/monitoring (MLflow, Airflow, SageMaker/Lambda, Azure ML, Kubernetes) and demonstrated double-digit relevance gains at millions-of-users scale.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and production NLP
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and scalable GPU inference
Intern Machine Learning Engineer specializing in LLMs, RAG, and model quantization
Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps
Senior Python Developer specializing in AI/ML and cloud-native microservices
Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure
Junior Robotics & Reinforcement Learning Engineer specializing in dexterous manipulation
“Robotics software engineer (master’s student) who placed 3rd in the CMU VLA challenge and presented at IROS, building an LLM-powered language system (Gemini 2.5) for mobile-robot scene Q&A and language-based navigation. Hands-on ROS1/ROS2 experience including ros2_control + PILZ planning for a KUKA arm, plus simulation (Gazebo) and containerized submissions with Docker.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference
“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”
Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety
“AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.”
Junior Software/ML Engineer specializing in AI systems, cloud infrastructure, and applied research
“Backend/infra-focused engineer with experience spanning Go-based MCP servers for an AI-assisted Kubernetes on-call diagnosis chatbot and a Python/Flask PagerDuty automation integration. Previously at Tesla, optimized high-volume battery test data in PostgreSQL using JSONB, partitioning, and a timestamp normalization pipeline; also built PyTorch PINN training workflows and achieved a 20x speedup via batch vectorization.”