Pre-screened and vetted.
Senior Full-Stack AI/ML Engineer specializing in cloud data platforms and GenAI
Executive Engineering Leader specializing in data platforms, cloud modernization, and AI
Principal Machine Learning Scientist specializing in GenAI, LLMs, and RAG
Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps
Senior Software Engineer specializing in AI agents and cloud platforms
Mid-level Software Engineer specializing in Python, distributed systems, and AI backend services
Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure
Executive AI/ML Cloud Architect specializing in enterprise and humanitarian AI systems
Senior Machine Learning Engineer specializing in GenAI, NLP, and recommendation systems
Director-level Software Development Manager specializing in large-scale cloud platforms
Senior Full-Stack Engineer specializing in backend systems and AI applications
“Candidate is deeply focused on AI-native software development, using a deliberate planner/implementer agent workflow with tools like Cursor, Claude, and Kimi. They also built a personal project called Config Proctor, an AI-agent-driven Terraform/AWS self-healing system that identifies infrastructure configuration gaps and proposes fixes.”
Intern Software Engineer specializing in AI agents, RAG pipelines, and semiconductor systems
“Built a web-based interface that connects an internal bug system to an LLM for initial debugging and issue classification, aiming to boost QA and software-engineer efficiency while balancing latency and accuracy. Ran it as a one-person project and managed constraints like limited hardware and difficulty extracting team debugging context, relying on manager communication and rapid modeling to validate direction.”
Junior Mechanical Engineer & Software Developer specializing in aviation autonomy and retrieval systems
“Robotics/embedded builder who trained an aviation-specific LLM and deployed it offline on an NVIDIA Jetson for an in-flight voice assistant, solving performance and cabling constraints with NVMe storage and Bluetooth. Also has hands-on Raspberry Pi/Arduino robot builds (including a cigarette-butt-picking prototype with hydraulic actuation) plus Docker-based FEA work using FEniCS/Gmsh and strong CI/CD and automated-testing practices.”
Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services
“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”
Mid-level AI/ML Engineer specializing in LLM alignment, safety, and scalable inference
“Built and productionized an AWS-hosted, Kubernetes-orchestrated RAG assistant that enables natural-language Q&A over internal document repositories with grounded answers and citations. Demonstrates strong applied LLM engineering: hallucination mitigation, hybrid retrieval + re-ranking, and rigorous evaluation via benchmarks and A/B testing, plus real-world scaling of compute-heavy inference with dynamic batching and monitoring.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference
“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”
Mid-level AI/ML Engineer specializing in LLM fine-tuning, inference optimization, and AI safety
“AI/LLM engineer with production experience at NVIDIA, where they fine-tuned and deployed a financial-services chatbot and cut latency ~50% using TensorRT + NVIDIA Triton, scaling via Docker/Kubernetes. Also has consulting experience at Accenture delivering a predictive maintenance solution for a logistics network, bridging non-technical stakeholders with actionable dashboards.”
Mid-level Software Engineer specializing in Azure AI and full-stack development
“Hands-on AI/LLM engineer who built a RAG-based product feature end-to-end, including prompt engineering, safety guardrails, and an automated adversarial + load-testing harness. Diagnosed real production issues (null responses) via Azure logs/metrics and drove an architectural fix by separating model deployments to address token/quota limits. Also runs internal developer enablement through short theory-to-hands-on AI workshops after completing a Microsoft AI certification.”
Executive CTO and Founder specializing in AI platforms and hyper-scale SaaS
“CTO-minded builder seeking to join a startup; previously created an AI-driven platform that abstracted away DevOps and infrastructure for drug discovery researchers. Emphasizes high-leverage, zero-to-one execution with managed cloud/open-source tooling, and a strong reliability/reproducibility mindset validated against existing scientific pipelines.”
Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms
“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search
“Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.”