Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in LLMs, fairness, and healthcare ML
“ML/NLP practitioner with a master’s thesis focused on domain-adaptive knowledge distillation for LLMs (LLaMA2/Sheared-LLaMA), showing improved perplexity and ROUGE-L on biomedical data. Also built real-world data linking and search systems: integrated ClinicalTrials.gov with FAERS using fuzzy matching + embeddings, and delivered an LLM-powered FAQ recommender at Hyperledger using sentence-transformers, FAISS, and fine-tuning to mitigate embedding drift.”
Senior Machine Learning Engineer specializing in conversational AI and Generative AI
“ML/AI engineer with experience at Uber and Scale AI, focused on customer service automation across both classical NLP and generative AI systems. Has owned systems from experimentation through production on AWS, including LLM fine-tuning, RAG optimization, safety evaluation, and internal Python platform tooling that improved consistency and engineering velocity.”
Junior Computer Vision & ML Engineer specializing in autonomous perception systems
“LLM/RAG engineer who built a production-style multi-agent orchestrator for resume-to-recommendation workflows (PDF ingestion through screening and recommendations), emphasizing prompt tuning and strict JSON output contracts. Currently building a RAG application for an NGO using Airflow (DAGs + embeddings) and tackling messy, missing/imbalanced data; has hands-on retrieval stack experience (FAISS/HNSW, bge embeddings) and uses rigorous evaluation metrics for groundedness and hallucination control.”
Intern Data Scientist specializing in GenAI (LLMs, RAG) and ML model optimization
“Built and deployed a production LLM-powered risk assistant for KPMG and Freddie Mac that lets analysts query a confidential Neo4j risk graph in natural language (no Cypher), turning multi-day analysis into minutes with traceable, cited answers. Implemented rigorous guardrails, deterministic verification, RBAC/security controls, and a full eval/observability stack, cutting query error rate by ~50% and iterating through weekly UAT with non-technical risk analysts.”
Junior AI Software Engineer specializing in LLM pipelines, OCR, and RAG
“Built and shipped a production LLM pipeline for nursing home Medicare reimbursement (PDF OCR + fact extraction + keyword RAG + QA) that reportedly increased payouts by ~$1K/month per patient. Strong in LLM ops/benchmarking (ground truth, LLM-as-judge, cost/I-O tracking) and pragmatic optimization—swapped retrieval approaches, fine-tuned a small model to cut OCR cost 90%, and migrated workloads to Azure/Temporal to scale nightly processing 10x.”
Mid-level AI/ML Engineer specializing in MLOps, LLMs, and scalable ML systems
“ML/LLM engineer at Adobe who deployed a transformer-based personalization and campaign-targeting recommender system end-to-end, including PySpark/Airflow pipelines processing 12M+ events/day and containerized inference on AWS SageMaker (Docker/Kubernetes). Also has hands-on LLM workflow experience (RAG, semantic search, prompt optimization, hallucination mitigation) with a metrics-driven approach to reliability, drift monitoring, and reproducible retraining via MLflow.”
Mid-level Data Analytics professional specializing in BI, data engineering, and applied AI
“Built GenMedX, a multi-module clinical AI system for emergency department decision support spanning triage prediction, diagnosis, medication Q&A, and visit summarization. Stands out for combining medical LLM fine-tuning, RAG, and rigorous evaluation/monitoring to drive a major triage recall improvement from 38.5% to 76.6%, with a strong focus on safety, edge-case detection, and production reliability.”
Senior Full-Stack Engineer specializing in AI, FinTech, and Healthcare IT
“AI/full-stack engineer with hands-on production experience across React/TypeScript, Go, and Python, spanning an early-stage education startup and a compliance-sensitive internal healthcare data platform. Stands out for shipping LLM and retrieval-based products with measurable impact, including a 27% recommendation improvement, support for 1M+ daily events, and a 19% lift in task completion in a secure, auditable environment.”
Junior ML Engineer specializing in Generative AI and LLM applications
“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.”
Mid-level Software Engineer specializing in Generative AI and RAG systems
“Built a production RAG-based natural-language-to-SQL system at Global Atlantic to replace slow, expensive manual analytics ticket workflows, focusing heavily on retrieval quality and measurable evaluation (200-question ground-truth set; recall@5 improved 0.65→0.78 via semantic chunking). Also built a custom MCP-style agent orchestrator for a personal project (arxiv-ai) to improve flexibility and Langfuse-aligned observability, and has hands-on experience with LangGraph, CrewAI, and n8n.”
Staff Applied Scientist specializing in multimodal LLM safety, robustness, and retrieval
“Built a production LLM-driven archival assistant that turns large, low-quality scanned handwritten files (120+ pages) into structured datasets, overcoming context-window and hierarchy challenges with a two-phase LLM + rules pipeline and reaching 98.1% accuracy (Gemini 2.5 Flash). Also orchestrated a large human-in-the-loop effort with 78 archivists, producing 2,400 high-quality annotations in 4 days via detailed rubrics and support.”
Mid-level AI/ML Engineer specializing in LLMs, FinTech, and Healthcare IT
“Built production GenAI systems in both healthcare and financial services, including a Verily clinical platform and an Accenture financial Q&A product. Stands out for combining advanced RAG, fine-tuning, safety evaluation, and infrastructure engineering to deliver measurable gains in engagement, groundedness, hallucination reduction, and cost efficiency.”
Mid-level Software Engineer specializing in AI/ML and full-stack systems
“Engineer with Apple experience building LLM-powered internal workflow orchestration systems using Python, LangGraph, FastAPI, Redis, vector search, and Kubernetes. Stands out for a highly pragmatic, production-focused approach to agentic systems: deterministic state management, strong guardrails, observability, and human review for high-risk actions.”
Mid-level AI/LLM Engineer specializing in generative AI and ML systems
“AI/LLM-focused engineer with hands-on experience building RAG pipelines, prompt engineering workflows, and multi-agent systems using tools like LangChain. Stands out for combining AI-assisted development with production-grade validation and for leading the architecture/orchestration of agent-based recommendation systems that improved response time, accuracy, and scalability.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference
“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”
Junior AI/ML Engineer specializing in FinTech and generative AI
“Built an end-to-end AI bug triage dashboard that combined React/TypeScript, FastAPI, Postgres, and classical ML to reduce manual engineering triage work by about 40%. Stands out for pragmatic, product-minded AI engineering: choosing interpretable models when they were sufficient, designing human-in-the-loop UX for trust, and separately building an agentic RAG project with vector search, Neo4j knowledge graphs, and reranking.”
Mid-level AI/ML Engineer specializing in LLM systems and Generative AI
“Built and owned an LLM support copilot at Stripe focused on improving agent ticket resolution. Designed the backend and ML system end to end, using RAG, Redis caching, hybrid vector search, and LoRA fine-tuning to achieve 40% lower latency and 22% higher response accuracy, with continuous quality monitoring via Ragas and related evaluation frameworks.”
Mid-level Data Science AI/ML Engineer specializing in Generative AI, LLMs, and RAG systems
“Built a production RAG-based ‘knowledge copilot’ for support/ops using LangChain/LangGraph, implementing the full pipeline (ingestion, chunking, embeddings, vector DB retrieval/rerank, guarded generation with citations) and operating it as monitored microservices with CI/CD. Also designed an event-driven, streaming backend for real-time inventory ordering predictions that reduced stockouts by 25%, and has hands-on incident response experience stabilizing LLM API latency/5xx spikes using Datadog/APM and resilience patterns.”
Mid-level Machine Learning Engineer specializing in deep learning, MLOps, and real-time inference
Mid-level AI/ML Engineer specializing in LLMs, MLOps, and recommendation systems
Junior Software Engineer specializing in AI/ML systems and LLM-powered document automation
Mid-level AI Engineer specializing in computer vision and RAG systems
Senior Engineering Manager specializing in observability platforms and Generative AI