Pre-screened and vetted.
Mid-level Applied AI Engineer specializing in LLMs, MLOps, and real-time AI systems
Mid-level AI/ML Engineer specializing in LLMs, multilingual NLP, and low-latency MLOps
Senior AI/ML Engineer specializing in LLM agents, RAG, and production ML systems
Senior Software Engineer specializing in AI platforms and scalable backend systems
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications
Junior Data Scientist specializing in LLM agents, RAG, and reinforcement learning
“McKinsey practitioner who built and deployed production LLM systems for consultants/clients, including a Power BI-integrated multi-agent chatbot (RAG + text-to-SQL + formatting) with custom Python orchestration, verification loops, and a 100+ case eval set achieving ~95% consistency. Also delivered a taxonomy-mapper agent that standardized inconsistent labeling for C-suite stakeholders, cutting a process from >2 weeks to <30 minutes through demos and business-focused communication.”
Mid-level AI/ML Engineer specializing in Generative AI, LLM alignment, and RAG
“Built and productionized a real-time enterprise RAG pipeline to improve factual accuracy and reduce LLM hallucinations by grounding responses in constantly changing internal knowledge bases (policies, manuals, FAQs). Experienced in orchestrating end-to-end ML workflows (Airflow/Kubernetes), handling messy multi-format data with schema enforcement (Pydantic/Hydra), and maintaining freshness via streaming incremental embeddings plus batch refresh. Also delivers applied ML solutions with non-technical teams (marketing/CRM) for segmentation and personalized engagement.”
Intern Software/AI Engineer specializing in LLM fine-tuning and agentic RAG systems
“Built and shipped an end-to-end LLM agent during an AT&T internship to automate network troubleshooting, with production-style reliability safeguards (timeouts/retries/fallbacks) and structured, state-machine orchestration; project won 3rd place in AT&T’s nationwide intern innovation challenge and was demoed to leadership. Also handled messy multi-partner data at Tencent by implementing schema validation/normalization, confidence-threshold fallbacks, and idempotent Python/ORM-based pipelines.”
Senior AI Research Engineer specializing in LLM agents and large-scale ML
“AT&T Labs builder who deployed a production multi-agent LLM system that lets engineers ask natural-language questions and automatically generates deterministic, schema-grounded Snowflake SQL (200–400 lines) to detect anomalies in massive wireless/network event data (~11B events/day). Experienced with LangChain and Palantir Foundry orchestration, RAG-based result interpretation, and rigorous evaluation/monitoring loops to continuously improve reliability.”
Junior Data Scientist specializing in Generative AI and agentic LLM systems
“LLM/agentic-systems builder who has shipped production tools for investment research and procurement insights, including a company screener that processes thousands of conference-listed companies using FireCrawl + Google Search + Gemini. Demonstrates strong orchestration expertise (LangGraph multi-agent graphs), performance optimization (async/batching to sub-30s), and pragmatic reliability/evaluation practices with stakeholder-friendly UX (real-time cost tracking and model/parameter toggles).”
Intern Machine Learning Engineer specializing in RAG systems and AWS cloud infrastructure
“Internship at BlueFoxLabs building and deploying an AI/ML RAG system for a biopharma client on top of LibreChat, including an AWS Textract ingestion pipeline and PGVector retrieval deployed to AWS EKS. Demonstrated production-minded scalability work by moving from a vertically scaled EC2 setup to a horizontally scaling Kubernetes/EKS deployment, using CI/CD to safely incorporate requirement changes like tabular document data.”
Principal Data Scientist specializing in financial risk, forecasting, and applied ML
“ML/NLP practitioner and technical founder who built an AUP risk-scoring model at Bill.com using TF-IDF + SVD features with XGBoost, and previously created automated data-quality guardrails for a Global Equity Risk stacked ML model at Thomson Reuters. Recently built a RAG-based chatbot for PaymentJock’s Home Affordability Probability product using embeddings and a local vector database (FAISS/Chroma), improving answer quality through chunking rather than expensive fine-tuning.”
Executive AI/ML Engineering Leader specializing in cloud-native SaaS and GenAI platforms
“Engineering leader who modernized and unified a fragmented product suite at Milestone via a multi-year cloud-native roadmap, delivering an MVP in three quarters and boosting team velocity by 40% through cross-functional squads. At Prometheum, led a trust-building hybrid architecture (AWS control plane + customer-hosted data plane) using Kubernetes to ensure sensitive enterprise data never left customer networks while remaining cloud-agnostic across providers.”
Senior Machine Learning Engineer specializing in production ML and predictive analytics
“ML/AI engineering leader who has owned end-to-end production systems from experimentation through deployment, monitoring, and iteration at meaningful scale. They describe running a 1M+ records/day prediction platform with 99.9% availability, shipping a RAG-based conversational AI feature for 50,000 active users, and consistently improving precision, latency, reliability, and cost with measurable business impact.”
Staff Full-Stack Engineer specializing in Healthcare AI and FinTech payments
“Backend/data engineer from Oscar Health specializing in healthcare claims systems on AWS. Built HIPAA-compliant real-time services (FastAPI/Postgres/Kafka on EKS) and serverless ingestion pipelines, and led modernization of a legacy SAS claims pricing system to Python/Spark with rigorous parity validation. Demonstrated measurable impact with high uptime/low latency services and major Snowflake performance and cost reductions.”
Mid-level Machine Learning Engineer specializing in LLMs, generative AI, and MLOps
“Built and shipped a production LLM-powered medical scribe that generates structured clinical visit summaries using RAG, strict JSON schemas, and post-generation validation to reduce hallucinations. Experienced in making LLM workflows deterministic and observable (structured logging/metrics/tracing) and in evaluation-driven iteration with metrics like schema pass rate and edit rate; collaborated closely with clinicians and policy stakeholders at Scale AI to drive adoption.”
Mid-level Backend & ML Engineer specializing in LLM systems and scalable AI pipelines
“Built and shipped a real-time AI phone agent for small businesses that handles bookings/FAQs/messages using streaming ASR, an LLM with tool-calling, and TTS; deployed to production for multiple paying customers. Demonstrates strong applied LLM reliability practices (tool-first grounding, retrieval, hard-negative testing, and production monitoring) and experience orchestrating multi-step AI workflows with Airflow, Prefect, and AWS Step Functions.”
Mid-level Python Backend Developer specializing in cloud-native microservices and AI/ML platforms
“Backend/AI engineer who built a production GPU-backed real-time inference API at Nvidia and debugged burst-induced tail latency, cutting P95 by ~29% through dynamic batching and backpressure. Also shipped an end-to-end RAG + agentic operational diagnostics assistant with strict tool controls, evidence citation, confidence gating, and strong production guardrails, plus demonstrated hands-on Postgres optimization (900ms to 40–60ms).”
Senior Full-Stack & AI Engineer specializing in LLM applications and cloud platforms
Intern Software Engineer specializing in full-stack web development and AI/ML
Senior Full-Stack Software Engineer specializing in React and Node/Python in regulated systems
Senior Full-Stack Software Engineer specializing in Healthcare AI and MVP delivery