Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in NLP, computer vision, and recommender systems
Mid-level AI/ML Engineer specializing in RAG, NLP, and MLOps
Mid AI/ML Engineer specializing in LLMs, RAG, and multimodal systems
Senior AI/ML Data Scientist specializing in recommender systems, LLMs, and MLOps
“ML/NLP leader with 12+ years of impact across LinkedIn, TikTok, and Levi's, building and productionizing multimodal recommendation and embedding-based search systems. Deep experience in entity resolution, vector retrieval, and rigorous evaluation, with cloud-native deployment/monitoring (MLflow, Airflow, SageMaker/Lambda, Azure ML, Kubernetes) and demonstrated double-digit relevance gains at millions-of-users scale.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and scalable GPU inference
Principal Machine Learning Scientist specializing in GenAI, LLMs, and RAG
Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps
Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure
Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems
Mid-level Machine Learning Engineer specializing in generative AI, NLP, and MLOps
Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference
Senior Machine Learning Engineer specializing in GenAI, NLP, and recommendation systems
Mid-Level Software Engineer specializing in cloud infrastructure and full-stack web development
“Backend engineer at Electric Hydrogen who built a serverless device-log ingestion and processing platform in Python/Flask, scaling throughput (4x peak ingestion) while keeping sub-300ms API latency. Strong in Postgres/SQLAlchemy performance (partitioning, materialized views) and production ML integration (ONNX model served via FastAPI microservice with async batch inference, Redis feature caching, and drift monitoring via S3/Lambda). Experienced designing secure multi-tenant systems with schema-per-tenant isolation and KMS-backed encryption.”
Executive AI/ML technology leader specializing in healthcare, biotech, and legal AI
“Repeat founder and startup advisor with experience spanning academic, health tech, legal tech, sports, and gaming. Has participated in fundraising and due diligence and has built companies, engineering teams, and software platforms from scratch, with a strong product-design-first approach to product-market fit and market selection.”
Executive product and AI leader specializing in data platforms and analytics
“Engineering leader with deep experience at Visa building and modernizing large-scale analytics platforms, including refactoring legacy systems into globally available microservices on AWS. Combines hands-on technical judgment in architecture, search platform evaluation, and service reliability with management of distributed international engineering and data teams.”
Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services
“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”
Mid-level AI/ML Engineer specializing in LLM alignment, safety, and scalable inference
“Built and productionized an AWS-hosted, Kubernetes-orchestrated RAG assistant that enables natural-language Q&A over internal document repositories with grounded answers and citations. Demonstrates strong applied LLM engineering: hallucination mitigation, hybrid retrieval + re-ranking, and rigorous evaluation via benchmarks and A/B testing, plus real-world scaling of compute-heavy inference with dynamic batching and monitoring.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference
“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Intern Software Engineer specializing in AI, cloud-native systems, and MLOps
“Backend/full-stack engineer who has owned a production recruiting platform end-to-end (TypeScript/Node microservices for scraping/cleaning/serving job data, RabbitMQ for spike handling, MongoDB + Elasticsearch, AWS containers) with pragmatic CI, logging/alerts, and Docker Compose E2E tests. Also operated high-traffic event pipelines during a Binance internship using Kafka + Redis idempotency, with strong observability and failure-mode/rollback/degradation practices, and has experience designing developer-friendly REST APIs and resilient browser automation for E2E flows.”