Pre-screened and vetted in the Bay Area.
Mid-level AI/ML Engineer specializing in LLM systems and cloud MLOps
“Built a production LLM-powered fraud detection platform at Wells Fargo, combining OpenAI/Hugging Face models with RAG-based explanations to make flagged transactions interpretable for risk and compliance teams. Delivered low-latency, real-time inference at high scale on AWS (SageMaker + EKS), with strong observability and security controls, reducing manual reviews and false positives in a regulated environment.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based "unified assistant" that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Senior AI Engineer specializing in production GenAI systems
“AI engineer who has shipped production LLM systems end-to-end, including a natural-language-to-SQL analytics copilot for career advisors that achieved ~95% query success through schema grounding, access controls, and automated regression testing with golden queries. Also builds LangGraph-orchestrated multi-step agents (resume analysis, recommendations) and RAG pipelines (PDF ingestion + FAISS), and partners closely with non-technical users to drive adoption and trust.”
Mid-level AI/ML Engineer specializing in Generative AI for Financial Services
“ML/AI engineer with strong financial-services domain experience who has built production systems spanning trade anomaly detection, investment-research RAG, and agentic LLM workflows. Particularly compelling for teams needing someone who can take ML/GenAI from prototype to monitored production while balancing compliance, latency, cost, and reliability.”
Senior AI Engineer specializing in forward-deployed voice agents and incident-response automation
“Forward-deployed engineer (FDE) at Bland.ai and founder of Fi (incident-response agent) who routinely takes LLM/agentic concepts from prototype to production. Has hands-on experience reverse-engineering undocumented systems to deliver integrations, building LLM testbeds for voice-agent reliability, and rapidly shipping RAG/semantic search solutions (e.g., Confluence runbooks) after deep customer discovery with DevOps/SRE teams.”
Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services
“Finance-domain ML/LLM engineer who has shipped production systems including a RAG-based financial insights assistant with a custom post-generation validation layer that verifies atomic claims against retrieved source text to prevent hallucinations in compliance-critical workflows. Also built large-scale MLOps automation on AWS using Kubeflow + MLflow + CI/CD for fraud detection and credit risk models processing 500M+ transactions/day with a 99.99% uptime goal, and partnered closely with JP Morgan risk/compliance stakeholders on NLP-driven compliance monitoring.”
Junior Full-Stack Machine Learning Engineer specializing in production ML systems
“Software engineer who owned end-to-end delivery of customer-facing agricultural forecast reporting (crop yield/health) and iterated quickly via rigorous edge-case testing and customer feedback. Also built an internal ML training platform (TypeScript/React + Flask/Python + MongoDB) used by every developer, with architecture designed to stay responsive under heavy compute load.”
Junior ML Engineer specializing in energy forecasting and battery optimization
“Backend/ML engineer working on a battery energy storage system operations dashboard: built a Flask backend integrated with OAuth and a separate FastAPI optimization/simulation service, deployed via Docker CI/CD to Azure Container Apps. Strong in productionizing ML (AzureML to batch endpoints) and in performance/scalability patterns (Postgres indexing/JSONB, per-unit data isolation, async throttling + caching for year-long CPU-intensive simulations across 40+ scenarios).”
Junior Solutions Engineer / Full-Stack Engineer specializing in AI-native SaaS and APIs
“Worked at easybee ai building a production-grade "voice of the customer" LLM intake agent, hardening a fragile sandbox prototype with JSON-schema constrained outputs, Python/FastAPI validation middleware, and automated retries. Strong in real-time debugging of agentic workflows (snapshot isolation, modular tracing) and in implementing safety/compliance guardrails such as content-moderation middleware to support enterprise adoption.”
Engineering leader specializing in FinTech ML/AI platforms
“Engineering Manager/player-coach leading Data Infrastructure, ML/DS, and AI Engineering pods who recently shipped multiple production agentic GenAI features. Built privacy-preserving LLM workflows (PII redaction via Microsoft Presidio) and drove an AI expense-approval agent from ambiguous ask to GA, cutting approval time from ~2.5 days to <4 hours with >85% accuracy. Also owned a major LLM cost overrun incident and implemented cost observability plus circuit breakers to prevent runaway agent loops.”
Mid-level AI/ML Engineer specializing in LLM agents, RAG, and ML systems
“At Inertia Systems, built a production LLM-powered ingestion pipeline that converts heterogeneous sources (PDF/JSON/IFC/SQL and financial tables) into standardized text and uses GraphRAG to construct a knowledge graph with verified dependency relationships. Also has hands-on HPC orchestration experience with SLURM, including creating a custom wrapper process manager to improve resource utilization under restrictive scheduling policies.”
Intern AI/ML Researcher specializing in computer vision and data engineering
“Built a production-oriented multimodal RAG "Fix Assistant" with FastAPI, Tavily search, BM25 + cross-encoder reranking, and a local Phi-3.5 model, emphasizing strict grounding and fallback/verification modes to prevent hallucinations. Also has hands-on federated learning experience using STADLE to orchestrate edge-node training and aggregation for EV telemetry data, plus experience communicating AI results to non-technical stakeholders (traffic RL/congestion outcomes).”
Mid-level AI/ML Engineer specializing in MLOps and LLM-powered applications
“AI/ML engineer with production experience building a RAG-based internal analytics assistant (Databricks + ADF ingestion, Pinecone vector store, LangChain orchestration) deployed via Docker on AWS SageMaker with CI/CD and MLflow. Strong focus on real-world constraints: latency/cost optimization (LoRA ~60% compute reduction), hallucination control with citation grounding, and enterprise security/governance. Previously at Intuit, delivered an interpretable churn prediction system (PySpark/Databricks, Airflow/Azure ML) that improved retention targeting by ~12%.”
Senior Computer Vision Engineer specializing in medical imaging and deep learning
Intern Machine Learning Engineer specializing in cloud-based content moderation
Junior ML Infrastructure Engineer specializing in low-latency LLM and inference serving
Senior Machine Learning Engineer specializing in MLOps and production AI systems
Junior Machine Learning Engineer specializing in NLP, search, and performance optimization
Junior AI/Full-Stack Engineer specializing in NLP and agentic systems
Mid-level Machine Learning Engineer specializing in robotics and autonomous driving
Mid-level Machine Learning Engineer specializing in GenAI, RAG, and computer vision
Intern AI/ML Engineer specializing in LLM agents and RAG systems
Mid-level AI Software Engineer specializing in ML systems and agentic automation