Pre-screened and vetted.
Junior AI/ML Engineer specializing in LLMs, RAG, and multimodal agents
Mid-level Machine Learning Engineer specializing in MLOps and applied AI
Mid-level GenAI & Analytics Engineer specializing in LLM and cloud cost/finance analytics
Mid-level AI/ML Engineer specializing in NLP, MLOps, and compliance-focused ML systems
Senior AI/ML engineering leader specializing in healthcare and life sciences
Junior AI Research Engineer specializing in NLP, speech and generative AI
Intern AI/Data Science Engineer specializing in LLM agents, data engineering, and predictive analytics
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and document intelligence
Mid-level AI/ML Engineer specializing in NLP, speech AI, and RAG systems
Senior Machine Learning Engineer specializing in Generative AI and NLP
Mid-level AI/ML Engineer specializing in NLP, MLOps, and financial risk & fraud analytics
Mid-level AI Engineer specializing in Generative AI and LLM/RAG systems
Junior Full-Stack/Cloud Engineer specializing in AI and data-driven applications
Mid-level AI/ML Product & Solutions Specialist specializing in GenAI and MLOps
Mid-level Machine Learning Engineer specializing in LLMs, multimodal AI, and backend systems
Mid-level Full-Stack Engineer specializing in AI and FinTech platforms
“Full-stack engineer who built RegArt’s product from 0→1 for enterprise compliance users at clients like HSBC and EY, including the production React frontend, backend APIs, and an LLM-powered search experience. Particularly compelling for startups needing someone who can move across UI, API, and data layers, make pragmatic architecture tradeoffs, and ship fast without over-engineering.”
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and multimodal modeling
“Built and productionized a telecom-focused RAG assistant by LoRA fine-tuning LLaMA-2 and integrating LangChain+FAISS behind a FastAPI service, with dashboards and a human feedback UI for engineers. Demonstrated measurable impact (≈40% faster document lookup, +8–10% retrieval precision) and strong MLOps rigor via Airflow orchestration, CI/CD, and monitoring for drift and failures.”
Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps
“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”
Mid-level GenAI Engineer specializing in production RAG and LLM fine-tuning
“LLM engineer who built a production seller-support RAG system at eBay using hybrid retrieval (BM25 + Pinecone vectors) with Cohere reranking, LangGraph orchestration, and citation-grounded answers. Strong focus on reliability: semantic/structure-aware chunking, automated Ragas-based evaluation with nightly regressions, and production observability (LangSmith) plus drift monitoring (Arize). Also implemented a multi-agent fraud pipeline with AutoGen using JSON-schema contracts and explicit termination conditions.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS
“AI engineer who built a production RAG-based internal analyst tool at BlackRock, fine-tuning an LLM on proprietary financial data and adding four layers of guardrails (input/retrieval/generation/output) to improve grounding and reduce hallucinations. Implemented a LangChain-based multi-agent orchestration (7 major agents) deployed on AWS ECS, with reliability measured via internal human evaluation, LLM-as-judge, and RLHF/drift monitoring.”
Junior Machine Learning Engineer specializing in LLM systems and GPU inference
“LLM/agent engineer who shipped a production RAG-based recommendation + explanation system that replaced a traditional recommender stack, delivering ~20% CTR lift (and +8% after a reliability iteration) with strong cold-start performance. Demonstrates strong production rigor: schema-constrained generation, typed tool calling, explicit state/orchestration, deep monitoring/feedback loops, and safe integration with messy ERP inventory/order data using normalization, idempotency, and conflict-resolution guardrails.”
Intern Software Engineer specializing in edge AI deployment and distributed systems
“Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).”
Senior Machine Learning Engineer specializing in optimization, LLMs, and on-device AI
“Engineer with hands-on experience debugging and hardening a fixed-point implementation for an internal PoC, quickly diagnosing overflow/underflow issues that caused intermittent failures across thousands of runs and delivering a code fix. Comfortable presenting technical solutions with layered slide depth and doing follow-up deep dives for interested stakeholders, though has limited direct customer/sales partnership experience.”