Pre-screened and vetted.
Junior Machine Learning Engineer specializing in MLOps and real-time systems
“Built and shipped a production GPT-4 + RAG customer support chatbot that materially improved support operations (response time cut from 4 hours to under 3 minutes; ~65% of tier-1 tickets automated). Demonstrates strong end-to-end LLM engineering across retrieval (Sentence Transformers/Pinecone), safety (multi-layer moderation), cost/latency optimization (caching/streaming, Celery/Redis), and rigorous evaluation/monitoring (shadow deploys, Datadog, 500+ test cases), plus proven stakeholder buy-in leading to 80% adoption.”
Mid-level Machine Learning & AI Engineer specializing in Generative AI, NLP, and MLOps
“Built and deployed production LLM systems for summarizing sensitive legal and financial documents, emphasizing GDPR-aligned privacy controls and scalable hybrid cloud architecture. Experienced with Kubernetes/Airflow orchestration and rigorous testing/monitoring practices, and has delivered measurable business impact (18% conversion lift) by translating AI outputs for non-technical marketing stakeholders.”
Junior AI/ML Engineer specializing in healthcare and financial risk modeling
“Built and productionized a clinical NLP + patient risk stratification platform at Dermanture, combining Spark/PySpark pipelines, BERT/BioBERT models for entity extraction and text classification, and downstream risk models in TensorFlow/scikit-learn. Experienced running regulated, auditable ML workflows with Airflow and AWS SageMaker, emphasizing data validation (Great Expectations), drift monitoring, and explainability (SHAP) to drive clinician trust and adoption.”
Mid-level Machine Learning Engineer specializing in NLP, computer vision, and LLM systems
“Built a production multi-agent cybersecurity defense simulator orchestrated with CrewAI, combining Red/Blue team LLM agents, a RAG runbook retriever, and an RL remediation agent trained via state-space simplification and reward shaping for rapid incident response. Also partnered with quant analysts and fund managers to deliver an automated trading and portfolio management system using statistical methods plus CNN/LSTM models, reporting up to 15% weekly ROI.”
“Built and deployed a production LLM-powered RAG knowledge system to unify operational/policy information across PDFs, wikis, and databases, emphasizing auditability and low-latency/cost performance. Improved answer relevance at scale by moving from pure vector search to hybrid retrieval with metadata filtering and reranking, and partnered closely with healthcare operations/compliance to define acceptance criteria and human-in-the-loop guardrails.”
“At Liberty Mutual, built a production underwriting decision assistant combining LLM reasoning with quantitative models and strong auditability. Implemented a claims-based response verification pipeline that cut hallucinations from 18% to 3% and materially improved user trust/validation scores. Experienced orchestrating ML/LLM workflows end-to-end with Airflow, Kubeflow Pipelines, and Jenkins, including SLA-focused pipeline hardening.”
Senior Data Scientist specializing in LLM applications, RAG systems, and production ML
“Senior Data Scientist in consulting who has built production RAG systems for insurance/annuity document search at large scale (100K+ PDF pages), emphasizing grounded answers, guardrails, and low-latency retrieval. Experienced in end-to-end MLOps for LLM apps—monitoring, evaluation sets, drift handling, and safe rollouts—and in orchestrating complex pipelines with Prefect/Airflow and deploying services on Kubernetes.”
Mid-level AI/ML Engineer specializing in data engineering, LLM/RAG pipelines, and recommender systems
“Research assistant at St. Louis University who built and deployed a production document-intelligence RAG system (Python/TensorFlow, vector DB, FastAPI) on AWS, focusing on grounding to reduce hallucinations and latency optimization via caching/async/batching. Also developed a personalized recommendation system for the Frenzy social platform and partnered closely with product/UX to define metrics and iterate on hybrid recommenders and cold-start handling.”
Mid-level Machine Learning Engineer specializing in computer vision and reinforcement learning
“Early-stage engineer with hands-on embedded prototyping experience (Arduino/Raspberry Pi) who helped build an award-winning smart glasses project enabling phone notifications via Bluetooth. Strong background in computer vision performance optimization, including accelerating inference to 120 FPS by moving from TensorFlow to PyTorch and deploying through ONNX + TensorRT quantization, plus Docker-based GPU deployment and CI/ML practices.”
Mid-level AI Engineer specializing in NLP and production ML systems
“AI/LLM engineer who has shipped production RAG chatbots using LangChain/OpenAI with FAISS and FastAPI, focusing on real-world constraints like context windows, concurrency, and latency (reported ~40% latency reduction and <2s average response). Experienced orchestrating AI pipelines with Celery and fault-tolerant long-running workflows with Temporal, and has applied NLP model tradeoff testing (Word2Vec vs BERT) to drive measurable accuracy gains.”
Junior AI/ML Developer specializing in GenAI, LLM agents, and RAG systems
“Built and shipped an agentic RAG chatbot module for NexaCLM to answer questions across large volumes of contracts while minimizing hallucinations and incorrect legal interpretations. Implemented routing between vector retrieval and ReAct-style agent retrieval plus an automated grading/validation layer (cosine-similarity thresholds, retries) and deployed via GitHub Actions to Azure Container Apps, partnering closely with legal stakeholders to define risk/clause-focused objectives.”
Mid-level GenAI Engineer specializing in LLM agents and production AI workflows
“Designed and deployed end-to-end LLM-powered AI agent systems to automate knowledge-intensive workflows across marketing/GTM, recruiting, and support. Brings production reliability rigor (evaluation pipelines, monitoring, testing, A/B experiments) plus orchestration expertise (Airflow, Prefect, custom Python) and a track record of translating non-technical stakeholder goals into working AI solutions (e.g., personalized customer engagement agent at Lara Design).”
Mid-level AI/ML Engineer specializing in Generative AI and LLM-powered NLP
“LLM/AI engineer who built a production automated document-understanding pipeline on Azure using a grounded RAG layer, designed to reduce manual review time for unstructured financial documents. Demonstrates strong real-world scaling and reliability practices (Service Bus queueing, Kubernetes autoscaling, observability, retries/circuit breakers) plus rigorous evaluation (shadow testing, replaying traffic, multilingual edge-case suites) and stakeholder-friendly, evidence-based explainability.”
Senior Machine Learning Engineer specializing in LLMs, RAG, and agentic AI systems
“LLM/RAG practitioner who has taken a support-ticket triage automation system from prototype to production, building the full pipeline (fine-tuned models, FastAPI inference services, vector storage, monitoring) and delivering measurable impact (~40% reduction in triage time). Demonstrates strong operational troubleshooting of LLM/agentic workflows (observability-driven debugging, fixing agent routing/looping) and supports adoption through tailored demos and sales-aligned technical communication.”
Mid-level AI/ML Engineer specializing in Generative AI and RAG systems
“LLM/RAG engineer who has built and shipped production assistants, including a RAG-based teaching assistant (Marvel AI) using LangChain/LlamaIndex/ChromaDB with OpenAI embeddings and Redis vector search, achieving ~30% accuracy gains and ~35% latency reduction. Also deployed FastAPI services on Google Cloud Run with observability and prompt-level monitoring, and partnered with non-technical ops stakeholders to deliver an internal policy-document RAG assistant.”
Mid-level Applied ML Engineer specializing in LLM evaluation and multimodal agent systems
“Full-stack engineer working at the intersection of product and infrastructure, building developer-facing interfaces for AI voice agents in XR/immersive environments plus telemetry-heavy analytics dashboards. Experienced in Postgres telemetry data modeling and performance tuning, and in designing durable multi-step LLM pipelines with idempotency, retries, and strong observability; has operated in fast-moving startup-like teams (Biocom, HandshakeAI).”
Mid-level AI Engineer specializing in AI agents, RAG pipelines, and LLM evaluation
“Built and shipped production LLM systems at Founderbay, including a low-latency voice agent and a graph-based multi-agent research assistant. Strong focus on reliability in real workflows—hybrid SERP + full-site scraping RAG, grounding guardrails, validation checkpoints, and transcript-driven evaluation—plus performance tuning with async FastAPI, Redis caching, and containerization. Also partnered with a non-technical ops lead to automate post-call follow-ups via call summarization, field extraction, and tool-triggered actions.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.”
Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps
“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”
Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems
“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”
Junior Machine Learning Engineer specializing in predictive modeling and GenAI RAG systems
“LLM engineer who built and deployed an emotionally intelligent AAC communication system using an emotion-aware RAG pipeline (Empathetic Dialogues + GoEmotions) and a PEFT-adapted model. Experienced with LangChain/LangGraph and custom Python orchestration, focusing on reliability (guards, schema validation, fallbacks), latency optimization, and rigorous evaluation (automatic metrics + human-in-the-loop), with a reported 18% user satisfaction improvement.”
Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval
“Backend engineer with legal-tech and AI workflow experience: built JurisAI, an end-to-end legal research system using OCR + embeddings + Pinecone vector search (~90% recall@K) to deliver citation-grounded LLM answers with safe failure modes. Also led a GW Law metadata migration into Caspio with batch validation and parallel rollout, and has strong FastAPI/GCP production reliability and observability practices.”
Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP
“Built and deployed a transformer-based clinical document classification system that processes unstructured clinical notes in a HIPAA-compliant healthcare setting, served via FastAPI on AWS and integrated into an Airflow/S3 pipeline. Demonstrates strong end-to-end MLOps skills (data quality remediation, low-latency inference optimization, monitoring with MLflow/CloudWatch) and effective collaboration with clinicians to drive adoption.”