Pre-screened and vetted.
Senior Backend Engineer specializing in GenAI, LLMs, and scalable data pipelines
“Backend/ML platform engineer from Snapsheet who owned production Python services and data pipelines for insurance claims, including an AI document classification/summarization FastAPI service on ECS/Fargate processing 1M+ documents/year. Strong in AWS infrastructure (Terraform, CI/CD, secrets/IAM, autoscaling), Glue/PySpark ETL with schema evolution controls, and legacy SAS-to-microservices modernization with safe, feature-flagged rollouts and measurable performance wins.”
Senior AI/ML Data Scientist specializing in recommender systems, LLMs, and MLOps
“ML/NLP leader with 12+ years of impact across LinkedIn, TikTok, and Levi's, building and productionizing multimodal recommendation and embedding-based search systems. Deep experience in entity resolution, vector retrieval, and rigorous evaluation, with cloud-native deployment/monitoring (MLflow, Airflow, SageMaker/Lambda, Azure ML, Kubernetes) and demonstrated double-digit relevance gains at millions-of-users scale.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”
Intern Machine Learning Engineer specializing in LLMs, RAG, and model quantization
Staff Data Scientist / AI-ML Engineer specializing in fraud detection, NLP, and recommendations
Mid-level Software Engineer specializing in Python, distributed systems, and AI backend services
Mid-level Machine Learning Engineer specializing in generative AI, NLP, and MLOps
Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference
Senior Machine Learning Engineer specializing in GenAI, NLP, and recommendation systems
Junior Mechanical Engineer & Software Developer specializing in aviation autonomy and retrieval systems
“Robotics/embedded builder who trained an aviation-specific LLM and deployed it offline on an NVIDIA Jetson for an in-flight voice assistant, solving performance and cabling constraints with NVMe storage and Bluetooth. Also has hands-on Raspberry Pi/Arduino robot builds (including a cigarette-butt-picking prototype with hydraulic actuation), Docker-based FEA work using FEniCS/Gmsh, and strong CI/CD and automated testing practices.”
Executive AI Product Leader specializing in FinTech and agentic AI platforms
“Fintech/neobank CTO (5+ years across US and UK markets) now building Payzo Money, a fintech copilot for SMBs covering expenses, accounting, invoicing, and payroll. Pre-revenue and seeking a $5M seed round, with active Bay Area conversations and a clear focus on bank sponsorship plus compliance/operations readiness; leverages Claude-based AI agents to accelerate building with limited resources.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“ML/LLM engineer who built a production RAG system (GPT-4 + FAISS + FastAPI) to deliver fast, grounded answers from proprietary documents, optimizing for sub-200ms latency and high-concurrency scale. Strong MLOps/observability background: drift monitoring with Prometheus + Streamlit, automated retraining via Airflow, Kubernetes autoscaling, and MLflow-managed model lifecycle, plus inference cost reduction through quantization and structured pruning.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
Senior Backend Engineer specializing in scalable AWS serverless and data pipelines
Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps
Mid-level AI Engineer specializing in Generative AI and MLOps
“Built and deployed a production LLM-powered clinical support assistant at BJC HealthCare (RAG + transformer) to answer patient questions, summarize clinical notes, and support appointment workflows. Implemented PHI-safe data pipelines (Spark/Hadoop/Kafka) with automated scrubbing, dataset versioning, and audit logs. Runs the system on Docker/Kubernetes with Pinecone vector search while partnering closely with clinical operations staff.”
Executive Technology Leader specializing in Generative AI, platform architecture, and digital transformation
“Engineering/technology leader with experience at Expedia and startup OneRail, known for building business-aligned technology roadmaps and scaling orgs rapidly (11 to 120 engineers in a year). Has driven large productivity and efficiency gains by operationalizing AI agents (code reviews, upgrades, security fixes) and implementing ChatOps-based deployment architecture, using data-driven experimentation to manage platform changes and conversion impacts.”
Junior Machine Learning Engineer specializing in LLM systems and inference reliability
“ML/LLM infrastructure-focused engineer who built a production stateful LLM inference service that cuts latency and GPU compute for repeated/overlapping prompts via caching with correctness guardrails. Strong in Kubernetes-based deployment and reliability engineering, using A/B testing and similarity-based evaluation to quantify performance gains without sacrificing output quality.”
Mid-level Machine Learning & Generative AI Engineer specializing in NLP, CV, and RAG systems
“Built and deployed a production LLM-powered RAG document intelligence system used by non-technical enterprise stakeholders, cutting document search time by 40%+ while improving answer consistency. Demonstrates strong MLOps/data workflow orchestration (Airflow, AWS Step Functions, managed schedulers across GCP/Azure) and a metrics-driven approach to reliability, evaluation, and cost/latency optimization with guardrails and observability.”
“Data science/NLP practitioner with experience at NVIDIA and Microsoft building production-grade NLP and data-linking systems. Has delivered high-performing pipelines (e.g., F1 0.92) and large-scale entity resolution (F1 0.89), plus semantic search using embeddings and Pinecone with ~30–40% relevance gains, backed by rigorous validation (A/B tests, ROUGE, MRR) and strong MLOps/workflow tooling (Airflow, Databricks, FastAPI, MLflow, Prometheus/ELK).”
Senior Software Engineer specializing in distributed systems, AI/ML platforms, and cloud-native SaaS
Mid-level Machine Learning Engineer specializing in real-time fraud detection and edge AI