Pre-screened and vetted.
Mid-level AI Engineer specializing in multi-agent systems and RAG
“Built and shipped a production LangGraph-based multi-agent LLM analytics/decision copilot that answers questions across SQL/BI systems and unstructured docs, emphasizing grounded, tool-verified outputs with citations and confidence gating. Deep hands-on experience with orchestration (LangGraph, CrewAI, OpenAI Assistants, MCP) plus real-world latency/cost optimization (vLLM batching/KV caching, speculative decoding, quantization) and rigorous eval/observability. Partnered closely with business/ops stakeholders to deliver explainable reporting automation, cutting manual reporting time by 50%+.”
Junior Machine Learning Engineer specializing in NLP and multimodal transformers
“Built and deployed LLM-powered agentic chatbot and text-to-SQL systems using LangGraph/LangChain (and Bedrock), structuring workflows as DAGs with planning/replanning and validation to improve tool-calling reliability and reduce hallucinations. Operates production feedback loops with online/offline metrics, drift detection, and LangSmith-based evaluation pipelines, and regularly partners with business stakeholders and clinicians using slide decks and visual charts.”
Mid-level AI/ML Engineer specializing in Generative AI and intelligent automation
“LLM engineer who built and productionized a system to classify GitHub commits (performance vs non-performance) using zero-/few-shot approaches over commit messages and diffs, working at ~5M-record scale on multi-node NVIDIA GPUs. Experienced orchestrating end-to-end LLM pipelines with Airflow and GitHub Actions, and emphasizes reliability via testing, guardrails, and observability while collaborating closely with non-technical product stakeholders.”
Mid-level Full-Stack AI Engineer specializing in agentic systems and security-hardened pipelines
“Founding/early engineer experience across Asante and a Series A startup (Adgency), shifting from data science/ML into owning production full-stack systems end-to-end. Built core product flows (registration, business profiles, map service), AWS-deployed gRPC microservices with CI/CD, and operated low-latency agent/video ad generation workflows with retries/fallbacks and PostHog-based observability.”
Mid-level Software Engineer specializing in AI/ML for FinTech and Healthcare
“Built and deployed an end-to-end fintech product, FinSight, for bank statement analysis and financial Q&A using a production-style RAG architecture. Stands out for combining FastAPI, OpenAI embeddings, FAISS, hybrid SQL/vector retrieval, and practical reliability work like chunking optimization, validation, and low-latency performance tuning.”
Mid-level GenAI/ML Engineer specializing in LLM systems and RAG chatbots
“Built and shipped a production agentic LLM analytics platform that lets non-SQL business users query relational databases in plain English via a RAG + LangChain/LangGraph workflow and FastAPI service. Emphasizes safety and reliability with guardrails (validation/access control), testing/evaluation frameworks, and performance optimization (caching, monitoring, Dockerized scalable deployment), reducing dependency on data teams and speeding analytics turnaround.”
Mid-level Machine Learning Engineer specializing in computer vision and MLOps on GCP
“ML/AI engineer who deployed a real-time, edge-based computer-vision pipeline for produce recognition in retail self-checkout to reduce shrink. Demonstrates strong end-to-end production chops: multi-camera data calibration/sync, ranking-based modeling for fine-grained classes, latency-focused optimization, and continuous A/B testing/monitoring with guardrails. Experienced with ML orchestration (Kubeflow Pipelines, Airflow) and CI/CD via GitHub Actions, and collaborates closely with store operations to make interventions usable in the checkout flow.”
Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants
“GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.”
Senior Data Scientist specializing in ML, NLP, and production AI systems
“Machine learning/NLP engineer with deep Azure stack experience (Data Factory, Databricks/Spark, Delta Lake, Azure OpenAI, Azure AI Search) who built end-to-end production systems for semantic clustering, entity resolution, and hybrid search. Demonstrated measurable gains from embedding fine-tuning (~15% retrieval precision, ~10–12% nDCG@10) and designed scalable, quality-checked pipelines with MLOps best practices.”
Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics
“ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.”
Mid-level AI/Machine Learning Engineer specializing in Generative AI, NLP, and MLOps
“Built a production LLM/RAG document analysis system for large financial documents (credit reports/PDFs) to help business analysts extract insights faster. Implemented end-to-end pipeline orchestration with LangChain, vector search (e.g., FAISS), and hallucination controls (context grounding, similarity thresholds, and no-answer fallback), delivered as a Dockerized Python API.”
Mid-level Machine Learning Engineer specializing in data security and GenAI systems
“Built Hexagon’s production Text-to-CAD Copilot that converts text and rough sketches into editable CAD code, combining GraphRAG (Neo4j/LangChain) with a Gemini-powered vision module and multi-agent geometric validation—cutting manual modeling from a day to ~45 seconds and driving retrieval latency below 50ms. Also has large-scale GCP data/ML orchestration experience (Airflow/Cloud Composer, Dataflow, Pub/Sub, Snowflake) processing 50M+ daily records with drift monitoring and automated reliability controls.”
Senior Machine Learning Engineer specializing in LLMs, NLP, and computer vision
“Built and owned production GenAI systems for both infrastructure automation and customer support. Most notably, they created a self-healing multi-cloud incident response system that automated 65% of tier-1 alerts and reduced application crashes by 75%, and also shipped a hybrid RAG support triage agent that automated 60% of tier-1 inquiries with human escalation guardrails.”
“Built and owned a production RAG-based conversational AI system at Entera for real estate analysis, taking it from experimentation through AWS deployment, monitoring, and iterative improvement. Demonstrates strong practical judgment in retrieval design, LLM safety, and scalable Python service architecture, with measurable impact including 30-40% reduction in manual analysis time and roughly 30% better response accuracy.”
Intern Full-Stack & AI Engineer specializing in LLM applications and computer vision
Mid-level Software Engineer specializing in Python, cloud microservices, and AI/RAG systems
Mid-level AI/ML Engineer specializing in MLOps, fraud detection, and LLM/NLP systems
Intern AI Engineer specializing in LLMs, RAG, and multimodal generative AI
Mid-Level Full-Stack Python Developer specializing in cloud microservices and data engineering
Senior AI/ML Engineer specializing in healthcare LLM and conversational AI systems
Senior AI/ML Engineer specializing in Generative AI, LLMs, and document intelligence
Mid-level AI Engineer specializing in Generative AI, RAG, and agentic workflows