Pre-screened and vetted.
Mid-level AI Engineer specializing in Generative AI and multimodal RAG
“Full-stack engineer who helped build and launch an internal genAI platform called GAIL, supporting multiple LLMs, confidential document upload for RAG pipelines, and collaborative chat. Worked across FastAPI, React/TypeScript, AWS/DynamoDB, and Azure, with notable ownership of backend RAG logic, MCP integration architecture, and frontend fixes that improved chat usability.”
Mid-level GenAI Engineer specializing in RAG systems and AI agents
“LLM/agentic systems builder who has deployed production solutions for a resource management firm, using an MCP-driven architecture with Neo4j + Elasticsearch and a ChatGPT frontend to generate candidate/company “SmartPacks” and answer entity Q&A. Also built a LangGraph/LangSmith-orchestrated multi-agent workflow that automates data-infra change requests end-to-end (impact analysis, SQL + tests, and PR creation), and delivered a ~60% latency reduction through TTL-based context caching while improving accuracy via a business data dictionary.”
Junior AI/ML Engineer specializing in LLM agents and RAG systems
“Built and deployed a production, multi-tenant modular agentic AI platform at Easybee AI, using LangChain/LangGraph with Redis-backed durable state to make agents reusable, traceable, and auditable. Emphasizes reliability via strict tool schemas, deterministic controllers, tenant-level policy enforcement, and regression testing derived from real production failures; also delivered AI automation for legal/finance workflows (attorney draw and expense automation) with explainable, deterministic payouts.”
Mid-level Software Engineer specializing in cloud and FinTech systems
“Backend/AI engineer who has built and operated production Node.js/Express services on AWS (Postgres/Redis) and has hands-on experience shipping an AI-powered support agent using RAG (Pinecone + LLM) with grounding checks and evaluation for hallucination rate. Demonstrates strong production reliability/performance debugging, including reducing peak latency from ~2s back to sub-300ms through query and caching optimizations, plus designing agent workflows with retries and human-in-the-loop escalation.”
Mid-level Data Scientist specializing in Generative AI, RAG systems, and MLOps
Mid-level AI/ML Data Engineer specializing in secure ML pipelines and AI governance
Mid-level Software Engineer specializing in AI and cloud-native data platforms
Director-level Product & Data Platform Leader specializing in AI, cloud data, and enterprise governance
Mid-level GenAI/ML Engineer specializing in RAG, semantic search, and LLM systems
Mid-level Machine Learning Engineer specializing in LLMs, Generative AI, and MLOps
Mid-level Full-Stack Software Engineer specializing in cloud, microservices, and AI integration
Senior Full-Stack Software Engineer specializing in AI-native enterprise products
Mid-level AI Engineer specializing in LLM agents and production ML systems
Executive AI Architect specializing in low-power edge/embedded AI systems
Senior Full-Stack Software Engineer specializing in Python, React, and LLM-powered applications
Mid-level Generative AI & ML Engineer specializing in production LLM and RAG systems
“AI/ML engineer who shipped a production blood-test report understanding and personalized supplement recommendation product, using a LangGraph multi-agent pipeline on AWS serverless with OCR via Bedrock and RAG over vetted clinical research. Also built end-to-end recommender system pipelines at ASANTe using Airflow (ingestion, embeddings/features, training, registry, batch scoring/monitoring) with KPI reporting to Tableau, with a strong focus on safety, evaluation, and measurable reliability.”
Mid-level Generative AI Engineer specializing in LLM agents and RAG applications
“GenAI builder and technical lead with ~2 years of hands-on production experience, including GENIE (a GenAI sandbox for ~44,000 Massachusetts public-sector employees) and A-IEP, a multilingual platform helping parents understand complex IEP documents (cut processing from ~15 minutes to ~2 and used by 1,000+ parents). Strong in RAG/agentic architectures, AWS serverless + Step Functions orchestration, and rigorous evaluation/guardrails for reliable real-world deployments.”
Junior AI Software Engineer specializing in LLM agents, RAG, and healthcare NLP
“Backend engineer who built an agentic LLM system for private equity/finance that answers questions over enterprise contracts and documents using a vector-db RAG pipeline. Differentiator is a trust-focused citation framework (with highlighted source text) to reduce hallucinations in high-stakes workflows, plus strong DevOps experience deploying microservices on Kubernetes with Helm/GitOps and building Kafka real-time pipelines.”
Mid-level AI/ML Engineer and Data Scientist specializing in LLMs and MLOps
“Data science/AI intern at University at Buffalo Business Services who built and deployed production systems spanning classic ML and LLM assistants. Delivered real-time competitor intelligence for a Cornell-partnered, $1B beverage launch by scraping/cleaning 5,000+ SKUs and deploying models via API, then built a domain-aware LLM assistant to modernize Excel-based workflows with strong grounding, privacy controls, and sub-5s latency.”
Senior Machine Learning Researcher/Engineer specializing in temporal modeling and production ML systems
“Backend engineer who built and evolved a startup data-processing backend (Express.js/MySQL) handling millions of user data points, with a microservices pipeline integrating multiple social media APIs. Emphasizes reliability and security through comprehensive testing, robust error/retry handling for sequential pagination constraints, and tight IAM/JWT/OAuth-based access controls.”
Intern Full-Stack Software Engineer specializing in AI and backend systems
“AI intern who built core pieces of Cyberdome, a full-stack agentic compliance automation product using Next.js, Python, RAG, Qdrant, and NIST control retrieval. Stands out for combining frontend product work with backend LLM infrastructure, on-prem/local model deployment, and practical iteration based on user trust concerns around proprietary data.”