Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and scalable inference
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable MLOps
Director-level AI/ML leader specializing in recommender systems and agentic AI
Staff Software Engineer specializing in applied AI agents and full-stack product development
Mid-level AI/ML Engineer specializing in generative AI, LLMs, and MLOps
Mid-level Applied AI Engineer specializing in LLMs, MLOps, and real-time AI systems
Mid-level AI/ML Engineer specializing in LLMs, multilingual NLP, and low-latency MLOps
Senior AI/ML Engineer specializing in LLM agents, RAG, and production ML systems
Staff AI Platform Engineer specializing in enterprise SaaS and cloud AI systems
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications
Junior Data Scientist specializing in LLM agents, RAG, and reinforcement learning
“McKinsey practitioner who built and deployed production LLM systems for consultants/clients, including a Power BI-integrated multi-agent chatbot (RAG + text-to-SQL + formatting) with custom Python orchestration, verification loops, and a 100+ case eval set achieving ~95% consistency. Also delivered a taxonomy-mapper agent that standardized inconsistent labeling for C-suite stakeholders, cutting a process from >2 weeks to <30 minutes through demos and business-focused communication.”
Mid-level AI/ML Engineer specializing in Generative AI, LLM alignment, and RAG
“Built and productionized a real-time enterprise RAG pipeline to improve factual accuracy and reduce LLM hallucinations by grounding responses in constantly changing internal knowledge bases (policies, manuals, FAQs). Experienced in orchestrating end-to-end ML workflows (Airflow/Kubernetes), handling messy multi-format data with schema enforcement (Pydantic/Hydra), and maintaining freshness via streaming incremental embeddings plus batch refresh. Also delivers applied ML solutions with non-technical teams (marketing/CRM) for segmentation and personalized engagement.”
Intern Software/AI Engineer specializing in LLM fine-tuning and agentic RAG systems
“Built and shipped an end-to-end LLM agent during an AT&T internship to automate network troubleshooting, with production-style reliability safeguards (timeouts/retries/fallbacks) and structured, state-machine orchestration; project won 3rd place in AT&T’s nationwide intern innovation challenge and was demoed to leadership. Also handled messy multi-partner data at Tencent by implementing schema validation/normalization, confidence-threshold fallbacks, and idempotent Python/ORM-based pipelines.”
Intern Machine Learning & AI Engineer specializing in computer vision and ML systems
“Robotics/ML engineer with internship experience at Valeo building a deep-learning prototype to replace parts of a legacy SLAM backend for autonomous parking, focused on making models run reliably in real time on embedded hardware (quantization/distillation + TensorRT). Also brings strong MLOps/deployment experience (Docker, Kubernetes on AWS EKS, CI via GitHub Actions) and has supported patent filing by explaining the technical approach to legal stakeholders.”
Intern Applied Scientist / ML Engineer specializing in NLP and conversational AI
“LLM/Conversational AI engineer who built a production multi-turn dialogue system using LoRA fine-tuning on LLaMA, cutting training compute/memory by 90%+ while maintaining low-latency inference via quantization and streaming generation. Experienced in orchestrating end-to-end ML workflows with Prefect/Airflow/Kubeflow (including hyperparameter sweeps and W&B tracking) and improving agent reliability through benchmark-driven testing, shadow-mode rollouts, and stakeholder-informed guardrails.”
Intern Machine Learning Engineer specializing in RAG systems and AWS cloud infrastructure
“Internship at BlueFoxLabs building and deploying an AI/ML RAG system for a biopharma client on top of LibreChat, including an AWS Textract ingestion pipeline and PGVector retrieval deployed to AWS EKS. Demonstrated production-minded scalability work by moving from a vertically scaled EC2 setup to a horizontally scaling Kubernetes/EKS deployment, using CI/CD to safely incorporate requirement changes like tabular document data.”
Senior Machine Learning Engineer specializing in production ML and predictive analytics
“ML/AI engineering leader who has owned end-to-end production systems from experimentation through deployment, monitoring, and iteration at meaningful scale. They describe running a 1M+ records/day prediction platform with 99.9% availability, shipping a RAG-based conversational AI feature for 50,000 active users, and consistently improving precision, latency, reliability, and cost with measurable business impact.”
Mid-level Machine Learning Engineer specializing in LLMs, generative AI, and MLOps
“Built and shipped a production LLM-powered medical scribe that generates structured clinical visit summaries using RAG, strict JSON schemas, and post-generation validation to reduce hallucinations. Experienced in making LLM workflows deterministic and observable (structured logging/metrics/tracing) and in evaluation-driven iteration with metrics like schema pass rate and edit rate; collaborated closely with clinicians and policy stakeholders at Scale AI to drive adoption.”
Mid-level Backend & ML Engineer specializing in LLM systems and scalable AI pipelines
“Built and shipped a real-time AI phone agent for small businesses that handles bookings/FAQs/messages using streaming ASR, an LLM with tool-calling, and TTS; deployed to production for multiple paying customers. Demonstrates strong applied LLM reliability practices (tool-first grounding, retrieval, hard-negative testing, and production monitoring) and experience orchestrating multi-step AI workflows with Airflow, Prefect, and AWS Step Functions.”
Junior Quantum/AI Research Engineer specializing in quantum simulation and LLM alignment