Pre-screened and vetted in California.
Director of AI/ML specializing in edge AI, computer vision, and foundation models
Mid-level Machine Learning Engineer specializing in reinforcement learning and multimodal AI
Junior AI Prompt Engineer specializing in LLMs, RAG, and conversational AI
Senior AI/ML Engineer specializing in Generative AI and LLM applications
Mid-level AI/ML Engineer specializing in LLM RAG pipelines and cloud MLOps
Mid-level Machine Learning Engineer specializing in MLOps and Generative AI
Mid-level AI/ML Engineer specializing in LLM, RAG, and multimodal systems
Mid-level Machine Learning Engineer specializing in fraud detection and recommendations
Staff/Lead Software Architect specializing in Contact Center platforms and GenAI automation
“Built and deployed production LLM systems in healthcare and at LinkedIn: automated pen-and-paper clinical trial evaluations with a 40x efficiency gain and created an evidence-based Evaluation Agent focused on accuracy and speed. Also used Temporal to orchestrate resilient data-ingestion workflows for customer support staffing prediction, improving prediction outcomes by 40% while handling missing data, retries, and backfills.”
Intern Data Scientist specializing in GenAI (LLMs, RAG) and ML model optimization
“Built and deployed a production LLM-powered risk assistant for KPMG and Freddie Mac that lets analysts query a confidential Neo4j risk graph in natural language (no Cypher), turning multi-day analysis into minutes with traceable, cited answers. Implemented rigorous guardrails, deterministic verification, RBAC/security controls, and a full eval/observability stack, cutting query error rate by ~50% and iterating through weekly UAT with non-technical risk analysts.”
Junior ML Engineer specializing in Generative AI and LLM applications
“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.”
Intern Applied AI/Software Engineer specializing in computer vision and full-stack platforms
“Built production LLM systems focused on reliability and safety, including a plain-English deployment tool that generates validated plans and provisions to Kubernetes while preventing unsafe actions via schema enforcement and plan/execute separation. Also created multi-LLM workflows (LangGraph) and stakeholder-friendly demos at Bosch, including a PyQt/FastAPI/CUDA app comparing SAM2 vs SAMWISE for on-device object detection with intuitive UX for business users.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference
“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”
Intern Machine Learning/Robotics Engineer specializing in computer vision and 3D simulation
Mid-level Machine Learning Engineer specializing in deep learning, MLOps, and real-time inference
Intern Perception/Robotics Engineer specializing in computer vision and embodied AI
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Engineering Manager specializing in AI/ML and full-stack product delivery
Mid-level Machine Learning Engineer specializing in NLP, federated learning, and fraud detection
Mid-level Machine Learning Engineer specializing in recommender systems and LLM/RAG pipelines