Pre-screened and vetted.
Executive AI/ML technology leader specializing in healthcare, biotech, and legal AI
“Repeat founder and startup advisor with experience spanning academic, health tech, legal tech, sports, and gaming. Has participated in fundraising and due diligence and has built companies, engineering teams, and software platforms from scratch, with a strong product-design-first approach to product-market fit and market selection.”
Junior Robotics & Reinforcement Learning Engineer specializing in dexterous manipulation
“Robotics software engineer (master’s student) who placed 3rd in the CMU VLA challenge and presented at IROS, building an LLM-powered language system (Gemini 2.5) for mobile-robot scene Q&A and language-based navigation. Hands-on ROS1/ROS2 experience including ros2_control + PILZ planning for a KUKA arm, plus simulation (Gazebo) and containerized submissions with Docker.”
Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services
“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”
Mid-level AI/ML Engineer specializing in LLM alignment, safety, and scalable inference
“Built and productionized an AWS-hosted, Kubernetes-orchestrated RAG assistant that enables natural-language Q&A over internal document repositories with grounded answers and citations. Demonstrates strong applied LLM engineering: hallucination mitigation, hybrid retrieval + re-ranking, and rigorous evaluation via benchmarks and A/B testing, plus real-world scaling of compute-heavy inference with dynamic batching and monitoring.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference
“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”
Mid-Level Software Engineer specializing in Azure AI and full-stack development
“Hands-on AI/LLM engineer who built a RAG-based product feature end-to-end, including prompt engineering, safety guardrails, and an automated adversarial + load-testing harness. Diagnosed real production issues (null responses) via Azure logs/metrics and drove an architectural fix by separating model deployments to address token/quota limits. Also runs internal developer enablement through short theory-to-hands-on AI workshops after completing a Microsoft AI certification.”
Mid-Level Software Engineer specializing in full-stack development, cloud, and data infrastructure
“Software engineer at Fannie Mae (~3 years) working on high-volume loan data pipelines using AWS (SQS/S3), Java listeners, Postgres, and Python/SQL-based data quality validation. Also built a chess data collection system (leveraging experience as an International Master) with robust retry/monitoring, schema-change handling, and idempotent backfills to prevent bad data from reaching downstream systems.”
Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms
“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Senior Software Engineer specializing in game UI systems and gameplay features
“Unity/C# game engineer with hands-on ownership of live-service systems, including a real-time multi-channel chat feature for Squid Game: Unleashed that launched alongside Squid Game Season 2 on Netflix and increased player retention and engagement. Brings a strong blend of mobile performance optimization, server-authoritative architecture, multiplayer systems, and practical experimentation with LLM-assisted narrative tooling.”
Intern Machine Learning Engineer specializing in LLM reasoning, agents, and deployment
“AWS AI Lab engineer who deployed a production Chain-of-Thought analytical agent for tabular reasoning, emphasizing grounded tool-constrained workflows with schema-validated intermediate outputs. Built robust evaluation/logging with step-level observability to catch regressions across model versions, and has experience scaling distributed LLM training via Slurm + DeepSpeed/FSDP with checkpointing and failure recovery.”
Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search
“Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.”
Junior Full-Stack/Product Builder specializing in AI and digital health
“Co-founded academic-index (10,000+ users) and built a full-stack Next.js 14 document upload + client-side OCR + Gemini-powered analysis pipeline with strong production reliability (custom monitoring, retries, quality gates) and measurable gains (accuracy ~94%→98.5%, failures down ~60%). Also owns end-to-end biometric data visualization and a data-driven brand/UX overhaul at pre-seed health/performance startup Absolute Rest, with a background running a multi-client dev studio (Zen Digital).”
Mid-level Data Scientist specializing in anomaly detection and production ML
“Interned at Backblaze building production AI systems for incident response and security operations, including an internal LLM-powered incident triage assistant that used Snowflake + RAG over historical tickets/postmortems and delivered results via Slack and a web UI. Emphasizes reliability (PII filtering, grounding, schema validation, fallbacks) and rigorous evaluation/observability (offline replay, partial rollouts, time-to-first-action metrics, Prometheus/Grafana).”
Junior AI Engineer specializing in LLM systems, RAG, and full-stack automation
“Built and deployed an AI receptionist product for field-service businesses (HVAC/electrician), including real-time Jobber scheduling integrations and Twilio-based calling. Combines hands-on customer/operator shadowing with strong production engineering (queueing to handle API limits, rigorous testing/mocking, mirrored prod environment) and cross-layer troubleshooting, driving user adoption through review/override workflows.”
Senior AI/ML Engineer specializing in computer vision, NLP, and enterprise ML systems
“ML/AI engineer with hands-on ownership of production computer vision and GenAI systems, spanning real-time public safety video analytics and RAG-based knowledge assistants. Stands out for translating research-oriented approaches into scalable, monitored production systems with clear business impact, including 50% latency reductions, 25% faster response times, and 40% lower document search time.”
Mid-level Software Engineer specializing in distributed backend systems on AWS
“Built production systems in the AWS ecosystem, including an internal AI assistant for diagnosing account transfer and permissions issues and an end-to-end account transfer workflow used by enterprise customers. Stands out for combining LLM/RAG design with strong distributed systems reliability practices, emphasizing guardrails, fallbacks, and operational trust in high-stakes workflows.”
Intern Applied Scientist specializing in LLM agents for software engineering
“Applied Scientist intern at Amazon who built a production-adopted LLM-judge to evaluate an agentic chatbot’s intermediate reasoning and tool calls using a knowledge-graph grounding approach. Also published award-winning work (ACM SIGSOFT Distinguished Paper) using LangChain + GPT-4 tools to generate factually grounded commit messages, with rigorous human-centered evaluation metrics.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“ML/LLM engineer who built a production RAG system (GPT-4 + FAISS + FastAPI) to deliver fast, grounded answers from proprietary documents, optimizing for sub-200ms latency and high-concurrency scale. Strong MLOps/observability background: drift monitoring with Prometheus + Streamlit, automated retraining via Airflow, Kubernetes autoscaling, and MLflow-managed model lifecycle, plus inference cost reduction through quantization and structured pruning.”
Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems
“ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.”
Senior Data Engineer specializing in AI-driven GTM analytics and LLM evaluation
“Data/analytics engineer who stood up foundational pipelines and services at Meta for the Ray-Ban Meta launch—building a retailer sales ingestion system (S3/Hive) with rigorous DQ checks, 1-day SLAs, and dimensional rollups used by GTM to track sales trends. Also built a modular multi-retailer web-scraping system for out-of-stock alerts and shipped internal GraphQL APIs and an n8n-like workflow builder using serverless (AWS Lambda) with strong testing and observability practices.”
Executive Unity/XR Engineer specializing in real-time mocap and volumetric streaming
“Technical Director/Unity Lead who has shipped multiple Meta Quest immersive experiences and built custom Unity editor tooling to solve content-heavy pipeline constraints, enabling artists to push content directly into a Unity sandbox. Also prototyped a Unity restaurant simulation where an LLM drives NPC reasoning and state machines, using RAG and memory augmentation to reduce hallucinations and stuck behaviors.”