Pre-screened and vetted.
Mid-level AI Software Engineer specializing in computer vision and multimodal systems
“Robotics/perception engineer focused on production-grade, real-time systems—optimized self-supervised segmentation on Jetson Nano from ~6–10 FPS to ~20–25 FPS and scaled experimentation/deployment by unifying 15+ edge models in a modular PyTorch Lightning framework. Experienced integrating distributed LiDAR-camera fusion via gRPC/protobuf into mission planning, migrating ROS1→ROS2 Foxy for multi-drone perception, and adding Prometheus-based observability for long-running deployments.”
Junior AI/ML Engineer specializing in NLP, LLMs, and MLOps deployment
“Built and deployed NeuroDoc, a production-grade RAG system for PDF Q&A that delivers citation-backed answers with strong anti-hallucination guardrails. Experienced in orchestrating and scaling ML/LLM pipelines with Kubernetes, Airflow/Prefect, and PyTorch Distributed, and in building rigorous evaluation and citation-verification tooling to ensure reliability in production.”
Mid-level Full-Stack & ML Engineer specializing in AI SaaS, MLOps, and cloud infrastructure
“Built and shipped an AI-powered driver ranking/assignment system at AffirmoAI using LLM intent classification + RAG over pgvector/Postgres, served via FastAPI with a React UI that explains scores. Drove measurable improvements through optimization and iteration (latency down to <800ms, adoption 60%→90%+) and implemented rigorous eval loops with dispatcher ground truth plus cold-start handling for new drivers.”
Mid-level Machine Learning Engineer specializing in fraud detection and LLM systems
“At FiVerity, built and deployed a production LLM/RAG-based Information Gathering Tool for credit union fraud analysts that generates auditable investigation summaries from verified evidence. Focused on high-stakes constraints—hallucination prevention, cross-entity leakage controls, compliance/PII-safe monitoring, and latency—while also shipping customer-facing agentic workflows using CrewAI and LangGraph in close partnership with fraud and compliance stakeholders.”
Intern AI Engineer specializing in LLM systems, RAG, and cloud data pipelines
“Built and deployed a production Dockerized multimodal (voice+text) LLM agent for knowledge management that retrieves from Notion and documents and falls back to Tavily-powered web search with citations when internal notes are missing. Emphasizes production reliability via model-switching fallbacks, caching, strict structured outputs (Pydantic/JSON schema), and MCP-based orchestration with state-aware gating and monitoring to reduce redundant tool calls and improve success rates.”
Intern AI/ML Software Engineer specializing in RAG and medical AI
“ML/LLM engineer with production experience building medical RAG systems to automate chart review, including retrieval + re-ranking and rigorous evaluation. Notably uncovered errors/bias in physician-curated ground truth by tracing answers back to source note chunks and presented evidence to an academic partner, accelerating deployment. Also built a RAG-based FAQ chatbot for a health insurance company and delivered it to non-technical stakeholders via demos.”
Mid-level Machine Learning Engineer specializing in LLMs, GenAI, and Computer Vision
“LLM/agent engineer who built a production multi-agent research automation system using LangGraph (planner, retriever with FAISS, supervisor, evaluator) with structured outputs and citation tracking for traceable reports. Emphasizes reliability and operations—LangSmith-based observability, multi-level testing, hallucination mitigation, and latency/cost controls—plus prior experience as a Computer Vision Software Engineer at Deepsight AI Labs working directly with non-technical customers.”
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and cloud MLOps
“Built and deployed a production LLM/RAG system at CVS to automate clinical documents, addressing PHI compliance, retrieval accuracy, and latency; achieved a 35–40% reduction in review effort through chunking and FP16/INT8 optimization. Also has experience translating AI outputs into actionable insights for non-technical stakeholders (sports analysts).”
Mid-level Applied AI/ML Engineer specializing in agentic systems and LLM automation
“Built a production LLM-powered workflow at Frontier to extract structured signals from messy, high-volume documents and route work to the right teams, replacing a multi-day, error-prone manual process. Emphasizes production reliability with schema/consistency validation, re-prompting and deterministic fallbacks, plus async pipeline optimizations for predictable latency. Experienced with multi-agent orchestration (LangGraph, AutoGen, CrewAI) and AWS workflow tooling (Step Functions, SQS, Lambda), and delivered ~70% safe automation via stakeholder-driven thresholds and human review.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps for financial services
“Built and deployed a production Llama 3-based RAG document Q&A system using FAISS, addressing context-window limits through chunking and keeping retrieval accurate by regularly refreshing embeddings. Has hands-on orchestration experience with LangChain and LlamaIndex for multi-step LLM workflows (including memory management) and collaborates with non-technical teams (e.g., marketing) to deliver AI solutions like recommendation systems.”
Mid-level AI/ML Engineer specializing in fraud detection and NLP
“Built production AI/RAG-style systems for message Q&A and insurance claims workflows, combining data ingestion, indexing/retrieval, and LLM integration with fallback modes. Has hands-on orchestration experience (Airflow, Prefect, LangChain) and cites large operational gains (claims processing reduced to ~45 seconds; manual review -50%; false alerts -30%) through automated, monitored pipelines and close collaboration with non-technical stakeholders.”
Mid-Level AI/Full-Stack Engineer specializing in agentic LLM systems and RAG
“Built and deployed Clyra.AI, an AI-driven daily scheduling product that uses a LangGraph-based multi-agent LLM pipeline (task extraction, verification, reflection) grounded with strict RAG over emails/documents/calendars and real-world signals like health metrics. Designed a custom agent orchestrator with bounded loops/termination conditions and a self-auditing verification/reflection layer to reduce hallucinations while controlling latency and cost via caching and model distillation.”
Mid-level AI/ML & Backend Engineer specializing in AI platforms and computer vision
“Backend engineer with hands-on experience building real-time, low-latency systems: owned the Python backend for a real-time crowd-monitoring product (top 5% at HackHarvard 2025) using OpenCV, GPU YOLO inference (PyTorch), WebRTC, and OAuth. Also has production Kubernetes/GitOps experience (Helm/Kustomize, GitHub Actions, Argo CD), Kafka-based event pipelines, and executed a minimal-downtime on-prem PostgreSQL migration to AWS EC2.”
Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps
“Applied LLMs to high-stakes domains (wildfire risk for emergency teams and loan approval via a fine-tuned IBM Granite model), with a strong focus on reliability—using RAG-based cross-validation to reduce hallucinations and continuous ingestion pipelines (MODIS satellite imagery via AWS Lambda) to keep data current. Experienced in production orchestration and MLOps-style workflows using Airflow, AWS Step Functions, and SageMaker Pipelines, and collaborates closely with analysts on KPI-driven evaluation.”
Mid-level Data Scientist specializing in machine learning, NLP, and healthcare AI
“Senior data scientist with hands-on ownership of production ML and GenAI systems across enterprise churn, clinical Q&A, and real-time fraud detection. Stands out for combining strong MLOps discipline with measurable business impact, including $2M+ retained revenue, 10K TPS low-latency fraud infrastructure, and a clinician-reviewed RAG system that improved retrieval accuracy by ~38%.”
Junior Software Engineer specializing in AI/ML, data pipelines, and cloud APIs
“Hands-on AI/LLM practitioner who built a RAG-based customer support chatbot and tackled production issues like data chunking complexity and response-time lag. Uses techniques such as overlapping chunks, semantic search, context engineering, and query routing, and has experience presenting technical demos/workshops to developer audiences.”
Junior AI Engineer specializing in ML, LLM systems, and RAG
“Built and deployed an LLM/applied-ML system enabling efficient extraction of useful information from large unstructured multimodal datasets, owning the full pipeline from ingestion to inference and APIs with a strong emphasis on production reliability, latency, and monitoring. Also delivered a voice-based AI workflow for Hindi policy document access for the Election Commission of India by translating non-technical usability needs into iterative demos and a successful implementation.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.”
Mid-level Generative AI Engineer specializing in LLM agents and RAG
“GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.”
Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection
“LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience—building Airflow-orchestrated EHR pipelines and delivering FDA-auditability-friendly predictive maintenance solutions using SHAP/LIME explainability surfaced in Power BI.”
Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems
“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”
Mid-level Applied AI Engineer specializing in agentic LLM workflows
“AI engineer with production experience building a LangGraph-based, stateful multi-agent system at MetLife to automate complex insurance claims adjudication, integrating document discovery, Azure Document Intelligence OCR/extraction, and health data analysis. Strong in agent orchestration and production deployment (Docker + FastAPI REST APIs), with a structured approach to reliability, evaluation, and stakeholder-driven requirements.”
Junior Full-Stack Software Developer specializing in AI-powered web and health applications
“Built and launched “Language AI,” a Next.js/TypeScript app that clones a user’s voice (via ElevenLabs) to deliver language lessons in their own voice, using Supabase for auth/Postgres/storage and hosting backend on Render. Post-launch, identified ElevenLabs voice-clone limits after initial users and reworked the pipeline to store audio assets and delete clones to support more concurrent users; also added Google auth to improve adoption. Previously worked in a high-growth startup environment (Study Park) taking concepts from ideation to production.”
Junior AI/ML & Full-Stack Engineer specializing in LLMs and RAG systems
“Forward-deployed engineer who built a production AI drone-control chatbot that lets users fly a drone via natural language while viewing a real-time feed. Implemented RAG over drone SDK documentation (vector DB + top-k retrieval) and LoRA fine-tuning, with a focus on latency, token efficiency, and cost reduction, and regularly works with non-technical clients to integrate and explain AI system architecture.”