Pre-screened and vetted.
“Designed and deployed a production LLM agent platform at the National Institutes of Health to reduce time spent searching fragmented internal documentation, combining RAG grounding with multi-step tool-calling workflows and integration into legacy services via inference APIs. Emphasizes production-grade reliability through automated evaluation on real queries, guardrails/safe-failure behaviors, and ongoing A/B testing and monitoring, and has experience translating non-technical stakeholder goals into measurable success metrics.”
Mid-level GenAI Engineer specializing in LLM fine-tuning, RAG, and MLOps
Mid-level Full-Stack Developer specializing in cloud backend systems and GenAI
Mid-level Generative AI/ML Engineer specializing in LLMs, RAG, and agentic AI
Mid-level Generative AI Engineer specializing in RAG, multi-agent LLM systems, and LLMOps
Mid-level Generative AI Engineer specializing in banking and healthcare AI
Mid-level AI/ML Engineer specializing in LLMs, RAG, and full-stack development
Junior Data Scientist specializing in generative AI and time series forecasting
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and cloud-native MLOps
Senior GenAI Engineer specializing in enterprise LLM systems and RAG platforms
Mid-level AI/ML Engineer specializing in Generative AI and RAG pipelines
“AI/LLM engineer with healthcare domain experience who built a production clinical support “chart bot” for Molina, including PHI-safe ingestion of 200k+ PDF policies, vector retrieval, and a fine-tuned LLaMA served via vLLM on ECS Fargate. Demonstrated measurable performance wins (HNSW + namespace partitioning; 30% inference latency reduction) and a rigorous evaluation/monitoring approach, while partnering closely with nurses and operations teams to shape workflows and guardrails.”
Mid-level AI & Machine Learning Engineer specializing in Generative AI and MLOps
“Built a production GPT-4/LangChain/Pinecone RAG “AI Copilot” at Northern Trust to automate financial report generation and analyst Q&A over internal structured (SQL warehouse) and unstructured policy data. Focused on real-world production challenges—grounding and latency—achieving major speed gains (seconds to milliseconds) via MiniLM embedding optimization and Redis caching, and implemented rigorous testing/evaluation with MLflow-backed metrics while aligning compliance and finance stakeholders for deployment.”
Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS
“Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.”
Senior Full-Stack AI Engineer specializing in Generative AI and FinTech
“Backend engineer who built and owned an AI-powered financial research product end-to-end, using a typed NestJS/GraphQL backend with LangGraph-style agent routing to produce sourced, structured financial analysis. Emphasizes finance-grade correctness (Zod validation, metric registries, unit/empty-result guardrails) while keeping latency low via batching, caching, and fast token streaming, and has led incremental migrations using strangler/feature-flag/shadow traffic patterns.”
Mid-level Generative AI Engineer specializing in LLM agents and RAG
“GenAI/LLM engineer who built and deployed a production RAG system for enterprise document search and decision support, cutting manual lookup time by 40%+. Experienced with LangChain/LangGraph agent orchestration plus Airflow/Prefect for ingestion and incremental reindexing, with a strong focus on reliability (testing, observability) and stakeholder-driven metrics.”
Mid-level AI/ML & Full-Stack Engineer specializing in LLM agents and generative AI
“LLM/agent builder who shipped a live consumer AI-agent app (kalpa.chat) that visualizes complex reasoning as interactive graphs and abstracts multi-provider model usage via a unified wallet. Professionally has applied LangChain/LangGraph to IVR parsing and to scaling a football video-generation pipeline at DAZN, including shipping a VAR-specific retrieval/order fix via SQL after iterating with a non-technical PM.”