Pre-screened and vetted in California.
Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps
Engineering Manager and ML/Data Architect specializing in scalable data platforms and personalization
“Hands-on engineering manager at a marketing company leading a highly senior, distributed team (10 direct reports) while personally coding ~60–70% and owning end-to-end architecture across three interconnected products. Built agentic CRM automation and a reinforcement-learning-driven distribution layer for channel spend/bidding, with a strong focus on scalable design and observability (Prometheus/APM/logging) enabling frequent releases and few production incidents.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multimodal deep learning
“ML/LLM engineer who has built and productionized a large multimodal LLM pipeline end-to-end—fine-tuning a 20B+ parameter model with distributed/FSDP training and deploying on Kubernetes via Triton for ~5x throughput. Strong focus on reliability and safety (monitoring with SHAP, guardrails, A/B testing) with reported ~22% relevance lift and reduced harmful/incorrect outputs, plus experience orchestrating ETL/retraining workflows with Airflow across S3/Snowflake/RDS.”
Senior Generative AI Implementation Consultant specializing in RAG and agentic AI on cloud
“LLM/RAG practitioner who built an AWS-based enterprise document search and summarization platform with RBAC and scaled it to 10K+ users, solving relevance issues via contextual chunking and hybrid retrieval. Also designed agentic workflows for a telecom forecast-validation use case using sub-agents, tool APIs, and strict context management, and has proven pre-sales influence (supported a $300K manufacturing deal with a roadmap-driven pitch).”
Junior Machine Learning Engineer specializing in LLM systems and inference reliability
“ML/LLM infrastructure-focused engineer who built a production stateful LLM inference service that cuts latency and GPU compute for repeated/overlapping prompts via caching with correctness guardrails. Strong in Kubernetes-based deployment and reliability engineering, using A/B testing and similarity-based evaluation to quantify performance gains without sacrificing output quality.”
Mid-level Machine Learning Engineer specializing in fraud detection and real-time personalization
“ML/LLM engineer with Stripe and Adobe experience who productionized a transformer-based Payments Foundation Model for real-time fraud detection at global scale (billions of transactions). Built petabyte-scale ETL/feature pipelines (Spark/EMR, Airflow, dbt, Kafka/Flink) and achieved <100ms multi-region inference (EKS, TorchServe, edge/Lambda, GPU/CPU routing) with strong PCI-DSS/GDPR compliance and explainability (SHAP/LIME), reporting a 64% fraud accuracy improvement.”
Mid-level Machine Learning Engineer specializing in NLP, federated learning, and fraud detection
“ML/robotics engineer with Apple experience who built a computer-vision-driven industrial defect detection system integrating a robotic arm with ROS-based real-time inference on an edge GPU. Drove major performance gains (cut inference time ~60% via quantization + TensorRT) and improved robustness to lighting/material variation, with strong emphasis on production reliability (health checks, watchdogs, observability, CI/CD) and interest in shaping early-stage startup engineering culture.”
Mid-Level Software Engineer specializing in ML and full-stack AI search
Senior Machine Learning Engineer specializing in NLP, LLMs, and scalable ML platforms
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
Mid-level Machine Learning Engineer specializing in real-time fraud detection and edge AI
Mid-level Machine Learning Engineer specializing in reinforcement learning and multimodal AI
Mid-level AI/ML Engineer specializing in LLM RAG pipelines and cloud MLOps
Mid-level Machine Learning Engineer specializing in MLOps and Generative AI
Mid-level AI/ML Engineer specializing in LLM, RAG, and multimodal systems
Mid-level Machine Learning Engineer specializing in fraud detection and recommendations
Junior ML Engineer specializing in Generative AI and LLM applications
“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference
“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”
Mid-level Machine Learning Engineer specializing in deep learning, MLOps, and real-time inference
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps