Pre-screened and vetted.
Junior AI Engineer specializing in LLMs, RAG, and MLOps
“At ReferU.AI, designed and deployed an agentic RAG pipeline that automates multi-jurisdiction legal document drafting, emphasizing hallucination reduction through hybrid retrieval, validation agents, guardrails, and iterative regeneration. Experienced with orchestration frameworks (especially CrewAI) and rigorous testing/evaluation practices including human-in-the-loop review, adversarial testing, and production metrics/logging.”
Junior Machine Learning Engineer specializing in LLMs, NLP, and MLOps
“Developed and productionized VL-Mate, a vision-language, LLM-powered assistant aimed at helping visually impaired users understand their surroundings and query internal knowledge. Emphasizes reliability and safety via confidence thresholds, uncertainty-aware fallbacks, hallucination grounding checks, and rigorous offline + user-in-the-loop evaluation, with experience orchestrating multi-step LLM pipelines (LangChain-style and custom Python async) and deploying on containerized infrastructure.”
Mid-level Robotics Engineer specializing in simulation-to-real ML control
“Robotics/ML engineer who benchmarks and adapts open-source robot action models, building synthetic datasets in Isaac Sim and modifying vendor code to scale training across multiple GPUs. Also built a production-style computer vision pipeline at Zortag—training a tiny YOLO-based classifier for fake-vs-real label detection and deploying it in a real-time iOS app with additional display/spoof detection.”
Mid-level AI/ML Engineer specializing in healthcare ML, MLOps, and LLM/RAG systems
“Healthcare-focused ML/LLM engineer who built a production hybrid RAG workflow to automate prior authorization by retrieving from medical guidelines/historical cases (FAISS) and generating grounded rationales for clinicians. Strong in operationalizing ML with Airflow/Kubeflow/MLflow on SageMaker, optimizing latency (ONNX/quantization/async), and reducing hallucinations via evidence-only prompting; also partnered closely with clinical ops to deploy a readmission prediction tool used in daily rounds.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and shipped a production real-time content moderation platform for Zoom/WebEx-style meetings, combining Whisper speech-to-text with fast NLP classifiers and REST APIs to flag hate speech, bias, and HIPAA-related content under strict latency constraints. Demonstrates strong MLOps/infra depth (Airflow, Kubernetes, Terraform/Helm, observability) and a pragmatic approach to reducing false positives via threshold tuning, context validation, and hard-negative data—while partnering closely with compliance and product stakeholders.”
Mid-level AI/ML Engineer specializing in LLM systems and MLOps
“Built and deployed an AI tutoring assistant end-to-end at Nexora School, spanning discovery with school districts, multi-agent LangGraph/RAG architecture, AWS Bedrock migration, and post-launch stabilization. Stands out for combining hands-on LLM systems engineering with strong educator-facing trust building, FERPA-driven architecture decisions, and disciplined production practices around evals, logging, and messy document ingestion.”
Intern-level Data Scientist specializing in AI and full-stack applications
“Engineer with hands-on experience building production ML and Python backend systems, including a real-time social media monitoring pipeline handling 1000+ events per second and a prototype AI operations assistant for Seattle-Tacoma Airport. Stands out for combining reliability engineering, automation, and LLM/NLP-to-SQL work, with measurable impact such as improving uptime from 92% to 99.4%.”
Mid-level Software & ML Engineer specializing in agentic LLM systems and ML infrastructure
“Built and deployed an LLM-to-SQL automation system in a closed/internal environment, using a retriever–reranker–validator architecture on Kubernetes with strong security controls (semantic + rule-based validation and RBAC), achieving 99% uptime and cutting manual query time ~40%. Also worked on genomic sequence classification and semantic search workflows, orchestrating data prep with Airflow, tracking/deploying with MLflow, and optimizing distributed multi-GPU training on a university Kubernetes cluster.”
Junior Machine Learning Engineer specializing in multimodal systems and LLMs
“Built and productionized a domain-specific LLM-powered RAG knowledge assistant at JerseyStem for answering questions over large internal document corpora, owning the full stack from FAISS retrieval and LoRA/QLoRA fine-tuning to AWS autoscaling GPU deployment. Drove measurable gains (28% accuracy lift, 25% latency reduction) and improved reliability through hybrid retrieval, grounded decoding, preference-model reranking, and Airflow-orchestrated pipelines (35% faster runtime), while partnering closely with non-technical stakeholders to define success metrics and ensure adoption.”
Entry AI Engineer specializing in LLM agents, RAG, and computer vision
“Robotics/AV-focused candidate who contributed to an F1TENTH autonomous vehicle college project, building key autonomy components from raw sensor data to driving commands. Strong in perception and state estimation (visual odometry, particle-filter localization), plus mapping (occupancy grids) and planning/control (RRT, Gap Follow, PID), with hands-on ROS tooling and simulation validation in Gazebo/RViz and ROS environment containerization using Docker.”
Junior Machine Learning Engineer specializing in LLM fine-tuning and semantic retrieval
“Backend engineer with legal-tech and AI workflow experience: built JurisAI, an end-to-end legal research system using OCR + embeddings + Pinecone vector search to deliver citation-grounded LLM answers with safe failure modes (~90% recall@K). Also led a GW Law metadata migration into Caspio with batch validation and parallel rollout, and has strong FastAPI/GCP production reliability and observability practices.”
Entry-level Full-Stack Engineer specializing in AI and distributed systems
“Full-stack engineer who built an AI-based inventory/procurement query system at Botlily/Botlerly using Flask and Google Sheets as a live knowledge base, overcoming Sheets latency with caching and structured in-memory models. Demonstrated strong LLM product engineering (40% accuracy improvement via preprocessing/prompting) and customer-driven iteration with bar/restaurant owners, evolving the tool into a more comprehensive inventory management and forecasting solution.”
Mid-level Software Engineer specializing in AI/ML and Data Engineering
Mid-Level Software/ML Engineer specializing in NLP, OCR, and fraud detection in FinTech
Junior AI Engineer specializing in LLM agents and computer vision
Mid-level Machine Learning Engineer specializing in production ML, MLOps, and LLM retrieval systems
Mid-level Machine Learning Engineer specializing in LLMs and multilingual NLP
Entry-level AI Engineer specializing in NLP, RAG, and backend systems
Mid-level Software Engineer specializing in AI/ML and distributed systems
Mid-level Full-Stack Software Engineer specializing in cloud-native microservices
Junior Full-Stack & LLM Application Developer specializing in agentic RAG systems