Pre-screened and vetted.
Junior Data Scientist specializing in LLM agents, RAG, and reinforcement learning
“McKinsey practitioner who built and deployed production LLM systems for consultants/clients, including a Power BI-integrated multi-agent chatbot (RAG + text-to-SQL + formatting) with custom Python orchestration, verification loops, and a 100+ case eval set achieving ~95% consistency. Also delivered a taxonomy-mapper agent that standardized inconsistent labeling for C-suite stakeholders, cutting a process from >2 weeks to <30 minutes through demos and business-focused communication.”
Mid-level Machine Learning Engineer specializing in MLOps and cloud-native ML systems
Mid-level Data Scientist / GenAI & ML Engineer specializing in LLMs, RAG, and recommendations
Senior Applied ML Scientist specializing in LLMs, ads ranking, and RAG systems
Senior AI/ML Engineer & Data Scientist specializing in NLP, entity resolution, and knowledge graphs
Mid-level Machine Learning Engineer specializing in real-time recommender systems and MLOps
Senior Software Engineer specializing in cloud platforms, data pipelines, and ML
Senior Data Scientist specializing in large-scale ML systems and recommendations
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”
Senior Python Developer specializing in AI/ML and cloud-native microservices
Staff Data Scientist / AI-ML Engineer specializing in fraud detection, NLP, and recommendations
Mid-level Software Engineer specializing in ML-driven software testing and developer tools
Mid-level Machine Learning Engineer specializing in generative AI, NLP, and MLOps
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Mid-level Machine Learning Engineer specializing in NLP, recommender systems, and on-device ML
Principal Data Scientist / AI Engineer specializing in healthcare-native AI platforms
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multimodal deep learning
“ML/LLM engineer who has built and productionized a large multimodal LLM pipeline end-to-end—fine-tuning a 20B+ parameter model with distributed/FSDP training and deploying on Kubernetes via Triton for ~5x throughput. Strong focus on reliability and safety (monitoring with SHAP, guardrails, A/B testing) with reported ~22% relevance lift and reduced harmful/incorrect outputs, plus experience orchestrating ETL/retraining workflows with Airflow across S3/Snowflake/RDS.”
Mid-level Data Scientist specializing in recommender systems, NLP, and real-time ML pipelines
“AI/LLM engineer who built and productionized an internal RAG-based knowledge system that ingests diverse sources (PDFs, Markdown, Slack), scaled retrieval with distributed FAISS and parallel ingestion, and reduced hallucinations via re-ranking, grounding prompts, and post-generation validation. Also has hands-on orchestration experience with Airflow and Kubernetes for reliable ETL/model pipelines, monitoring, and staged rollouts; reports ~15% accuracy improvement and adoption as the primary internal knowledge tool.”
Mid-level Machine Learning Engineer specializing in fraud detection and real-time personalization
“ML/LLM engineer with Stripe and Adobe experience who productionized a transformer-based Payments Foundation Model for real-time fraud detection at global scale (billions of transactions). Built petabyte-scale ETL/feature pipelines (Spark/EMR, Airflow, dbt, Kafka/Flink) and achieved <100ms multi-region inference (EKS, TorchServe, edge/Lambda, GPU/CPU routing) with strong PCI-DSS/GDPR compliance and explainability (SHAP/LIME), reporting a 64% fraud accuracy improvement.”