Pre-screened and vetted.
“Backend engineer focused on productionizing LLM systems: built a FastAPI-based RAG and multi-agent automation platform deployed with Docker/Kubernetes, prioritizing safe execution and reduced hallucinations. Experienced in refactoring monolithic ML services with feature-flagged incremental rollouts, and implementing JWT/RBAC plus row-level security (e.g., Supabase) for secure, scalable APIs.”
Intern AI/ML Engineer specializing in agentic systems and full-stack development
“Built and scaled a multi-agent LLM automation pipeline during a fintech internship, growing from a rapid 1-week proof-of-concept to a 15+ agent hierarchical system that cut market brief report generation time from ~5 hours to under 30 minutes. Hands-on with agent frameworks (Haystack, CrewAI, LangChain) and experienced in debugging agent communication issues via sandboxed modular testing and context/token management; also regularly gives architecture-first technical demos at multiple hackathons and university events.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and real-time fraud detection
“GenAI/ML engineer who has shipped production agentic systems in highly regulated and high-throughput environments, including an AWS Bedrock-based fraud/compliance workflow at U.S. Bank with PII redaction and hallucination detection that cut investigation time by 50%+. Also built and evaluated RAG and recommendation systems at Target, using RAGAS-driven testing, hybrid retrieval with re-ranking, and SHAP explainability dashboards to align model behavior with merchandising business KPIs.”
Junior Full-Stack Software Engineer specializing in video and security applications
“Full-stack engineer who built and owned a generative-AI pipeline end-to-end inside the Vibecut video editor using Next.js App Router/TypeScript, Gemini-based prompt routing, and Zustand state management, including concurrency-safe requests. Also integrated Python services to access newly released AI tooling, optimized Postgres/S3 data flows for thumbnails, and built Modal-to-Amplitude workflows for Reddit-driven sentiment/metrics in a pre-seed environment while also handling marketing.”
Mid-level AI Engineer specializing in LLMs, RAG, and content automation
“AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.”
Junior Software Engineer specializing in AI, computer vision, and medical imaging
“Unity developer with deep GPU compute experience who shipped a web-deployed CAD-style app requiring real-time mesh manipulation, solving performance and browser memory-limit issues via compute shaders and mesh chunking. Built an independent Unity gravity simulation using Schwarzschild approximation and geodesic integration, and has also worked on game-engine threading/job-queue architecture using AI-assisted workflows.”
Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps
“LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).”
“Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.”
“Unity/gameplay engineer (Playtika) who built a state-machine/ECS-driven slot/bonus engine in a client-server setup, focusing on consistent outcomes under latency and highly engaging reward sequences. Also implemented server-authoritative real-time challenges/contests via an event-driven messaging system (SignalR-like) across iOS/Android/WebGL/UWP, and validates impact through retention/session/engagement analytics.”
Executive Technology & Product Leader specializing in AI, SaaS platforms, and digital transformation
“Engineering/technology leader who spearheaded an ultra low-latency AI-CDN SaaS platform on a multi-cloud stack (AWS/Azure/Alicloud), helping transform ARHT from a boutique provider into a global SaaS solution. Built distributed engineering and follow-the-sun support teams and helped secure major enterprise clients (TD Bank, Gucci, NATO, EY) while also leading board communications and raising $6M for a public-listed company.”
“ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).”
Junior Full-Stack & ML Engineer specializing in LLM applications
“Data Scientist (2–3 years) at ZS Associates who has built and productionized agentic LLM systems, including a LangGraph-based multi-LLM prompt-optimization pipeline for entity extraction deployed as a Spring Boot microservice via Jenkins. Also built an Insightmate.ai chatbot and improved its RAG accuracy by diagnosing vector retrieval issues and implementing HyDE query expansion, while partnering with sales and pharma stakeholders to drive adoption (e.g., Zimmer Biomet platform migration into a multi-year partnership).”
Mid-level Data Scientist specializing in NLP, LLMs, and RAG systems
“Built and deployed a production-style vision-language pipeline that generates structured medical reports from chest X-rays using BioViLT embeddings, an image-text alignment module, and BiGPT fine-tuned with LoRA, delivered via Streamlit and hosted on AWS EC2. Also collaborating experience presenting EDA findings, feature importance, and model performance to Ford managers while working with vehicle parts data at Bimcon.”
Executive Enterprise Architect & CTO specializing in cloud, digital transformation, and AI/ML
“Senior enterprise architecture and engineering leader (Sr. Director / Principal Architect) who has owned enterprise IT strategy and governance for a $100M budget and partnered directly with C-suite stakeholders. Led a cruise-industry employee/crew digital transformation, scaling to 10 agile teams (~70 people) using SAFe/TOGAF and making architecture decisions optimized for low-connectivity environments (local database to avoid internet authentication).”
Senior Full-Stack Java Developer specializing in cloud-native microservices
Executive Technology Leader (CTO/CAIO) specializing in AI-first cloud platforms
Mid-level Python Developer specializing in Healthcare and Insurance systems
Mid-level Generative AI Engineer specializing in LLM applications and RAG
Intern Software Engineer specializing in full-stack, Kubernetes, and ML
Intern AI/ML Engineer specializing in LLM agents and RAG systems
Junior Founding Engineer specializing in AI-powered study and research assistants
Mid-level Data Scientist / ML Engineer specializing in financial risk, NLP, and MLOps