Pre-screened and vetted.
“Built and deployed a production LLM-powered internal AI assistant using a RAG pipeline to help teams search internal PDFs/knowledge bases and generate grounded summaries/answers. Demonstrates strong end-to-end ownership (ingestion through APIs) plus production rigor (monitoring/logging/CI/CD, evaluation metrics) and practical optimizations for hallucination, latency, and answer quality (thresholding, fallbacks, caching, async, re-ranking, two-tier model routing).”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data, while partnering closely with compliance and product stakeholders.”
Mid-level Data Scientist specializing in ML, LLM pipelines, and MLOps
“Built and deployed a production LLM-driven document understanding pipeline using LangChain/LangGraph, focusing on reliability via step-by-step prompting, validation checks, and monitoring. Also partnered with non-technical marketing stakeholders at Heartland Community Network to deliver an XGBoost targeting model surfaced in Power BI, improving campaign conversion by 12%.”
Junior AI Engineer specializing in LLMs, RAG, and MLOps
“At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.”
Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps
“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
“Built and deployed a production LLM/RAG intelligent document understanding platform for healthcare clinical documents (notes, discharge summaries, diagnostic reports), integrating spaCy entity extraction, Pinecone vector search, and a Spring Boot API on AWS with monitoring and guardrails. Demonstrates strong MLOps/orchestration (LangChain, Airflow, Kubeflow/Kubernetes) and a metrics-driven evaluation approach, and partnered with a healthcare operations manager to cut manual review time by 80%.”
Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems
“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”
Junior Machine Learning Engineer specializing in predictive modeling and GenAI RAG systems
“LLM engineer who built and deployed an emotionally intelligent AAC communication system using an emotion-aware RAG pipeline (Empathetic Dialogues + GoEmotions) and a PEFT-adapted model. Experienced with LangChain/LangGraph and custom Python orchestration, focusing on reliability (guards, schema validation, fallbacks), latency optimization, and rigorous evaluation (automatic metrics + human-in-the-loop), with a reported 18% user satisfaction improvement.”
Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval
“Backend engineer with legal-tech and AI workflow experience: built JurisAI, an end-to-end legal research system using OCR + embeddings + Pinecone vector search to deliver citation-grounded LLM answers with safe failure modes (~90% recall@K). Also led a GW Law metadata migration into Caspio with batch validation and parallel rollout, and has strong FastAPI/GCP production reliability and observability practices.”
Mid-level AI/ML Engineer specializing in production ML, MLOps, and NLP
“Built and deployed a transformer-based clinical document classification system that processes unstructured clinical notes in a HIPAA-compliant healthcare setting, served via FastAPI on AWS and integrated into an Airflow/S3 pipeline. Demonstrates strong end-to-end MLOps skills (data quality remediation, low-latency inference optimization, monitoring with MLflow/CloudWatch) and effective collaboration with clinicians to drive adoption.”
Mid-level Software & ML Engineer specializing in agentic LLM systems and ML infrastructure
“Built and deployed an LLM-to-SQL automation system in a closed/internal environment, using a retriever–reranker–validator architecture on Kubernetes with strong security controls (semantic + rule-based validation and RBAC), achieving 99% uptime and cutting manual query time by ~40%. Also worked on genomic sequence classification and semantic search workflows, orchestrating data prep with Airflow, tracking/deploying with MLflow, and optimizing distributed multi-GPU training on a university Kubernetes cluster.”
Junior Machine Learning Engineer specializing in multimodal systems and LLMs
“Built and productionized a domain-specific LLM-powered RAG knowledge assistant at JerseyStem for answering questions over large internal document corpora, owning the full stack from FAISS retrieval and LoRA/QLoRA fine-tuning to AWS autoscaling GPU deployment. Drove measurable gains (28% accuracy lift, 25% latency reduction) and improved reliability through hybrid retrieval, grounded decoding, preference-model reranking, and Airflow-orchestrated pipelines (35% faster runtime), while partnering closely with non-technical stakeholders to define success metrics and ensure adoption.”
Mid-Level Software/ML Engineer specializing in NLP, OCR, and fraud detection in FinTech
Junior NLP/ML Engineer specializing in LLMs and retrieval-augmented generation
Mid-level AI Engineer specializing in ServiceNow ITSM automation and LLM/RAG systems
Mid-level Machine Learning Engineer specializing in production ML, MLOps, and LLM retrieval systems
Senior Software Engineer specializing in AI/ML and cloud-native microservices
Mid-level AI Software Engineer specializing in ML services and agentic workflows
Mid-level Machine Learning Engineer specializing in NLP, MLOps, and predictive risk modeling