Pre-screened and vetted in the Bay Area.
Intern Data Analyst specializing in data pipelines and LLM/RAG applications
“Built and deployed LLM-powered analytics and reporting systems, including a RAG-based assistant over Snowflake that let business users ask questions in plain English instead of writing SQL. Experienced orchestrating LLM agents (LangChain) and serverless reporting pipelines (AWS Lambda/S3/RDS), with a strong focus on grounded outputs, monitoring/evaluation, and data quality—used daily by non-technical finance and operations teams at Cigna.”
Junior Data Scientist specializing in agentic AI and RAG pipelines
“LLM/agentic systems builder who shipped production workflows at Angel Flight West and Eureka AI, combining LangGraph + RAG (Postgres/pgvector) with strong observability (LangSmith/Langfuse). Delivered large operational gains (address lookup cut from 10 minutes to 60 seconds; accuracy to 92%) and has a track record of quickly stabilizing customer-critical pipelines (Pydantic-enforced JSON for ETL) while partnering with sales/ops to drive adoption.”
Senior Data Engineer specializing in cloud data platforms, ETL pipelines, and analytics
Senior Data Engineering Leader specializing in cloud data platforms, streaming, and AI-enabled pipelines
Junior Data Engineer specializing in cloud data pipelines and streaming
Mid-level Data Scientist specializing in Generative AI and MLOps
“GenAI/LLM engineer with production experience at Allstate building an end-to-end document intelligence workflow for insurance operations—automating document intake, classification, and risk signal extraction. Emphasizes high-reliability design for regulated/high-stakes outputs using schema enforcement, confidence thresholds, validation rules, and human-in-the-loop routing, with metric-driven offline evaluation and production monitoring.”
Junior Data Scientist specializing in generative AI and RAG systems
“Data scientist at Guardian Airwaves building a RAG-powered quiz generator using Grok AI, with hands-on experience solving hard document-ingestion problems (PDFs with images/tables) via unstructured.io and LlamaIndex. Has deployed production systems on AWS EC2 and brings a pragmatic approach to agent reliability (human-in-the-loop, LLM-based eval, latency/cost metrics) while effectively translating RAG concepts to non-technical stakeholders.”
Junior Data Engineer specializing in LLM agents and RAG pipelines
“Built and deployed “ApartmentFinder AI,” a multi-agent system using Google ADK, Gemini, and Google Maps MCP to automate apartment shortlisting and commute-time analysis, cutting a 45–70 minute user workflow down to ~30 seconds. Also has strong delivery/process chops from serving as an SDLC Release Coordinator, managing 52+ releases and reducing SDLC issues by 84%.”
Mid-level Data Collection Moderator and Indie Game Developer specializing in software and data
Director-level Applied AI & Data Analytics Engineer specializing in real-time decisioning systems
“Built and shipped a production AI/LLM agent-based, event-driven credit underwriting/decisioning workflow that automated document understanding, retrieval, risk scoring, and compliance checks—cutting turnaround from ~90 days to ~5 minutes while boosting throughput 200x+ and approvals ~50%. Experienced with Airflow/Prefect orchestration, Redis/RabbitMQ queues, rigorous eval/monitoring, and close collaboration with non-technical underwriting teams.”
Senior Software Engineer specializing in distributed systems and cloud data pipelines