Pre-screened and vetted.
Junior AI/ML Engineer specializing in LLM agents and full-stack AI systems
“Built a full-stack dependency impact analysis product ('Blast Radius') that mapped runtime service relationships and reportedly reduced deployment incidents by 40%. Also developed AI evaluation and security benchmarking systems, including WebSEC Arena and a lyric-generation tool fine-tuned on 300,000 song lyrics, with academic interest strong enough to spur a research paper effort.”
Mid AI/ML Engineer specializing in LLMs, RAG, and cloud AI systems
“Built an AI-powered job matching platform end to end using AWS, Gemini, FastAPI, TypeScript, embeddings, and vector search. The standout result was automating manual matching workflows and scaling resume processing to roughly 2,000 resumes per minute while monitoring quality with F1 score and latency metrics.”
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
“LLM engineer/data analyst who built a production RAG QA assistant over the Jurafsky & Martin NLP textbook to reduce hallucinations and provide explainable, source-grounded answers. Experienced with LangChain/LangGraph orchestration, retrieval optimization (embeddings, vector DBs, caching), and rigorous evaluation/monitoring (Retrieval@K, A/B tests, telemetry/drift). Previously communicated analytics insights to non-technical stakeholders at GS Analytics using Power BI and simplified reporting.”
Junior Machine Learning Engineer specializing in cloud-based ML and automation
“Built and shipped a production multi-agent LLM system at Solena that automated internal project intake, validation, reporting, and stakeholder communications using Python, SQL, and LangChain, with strong emphasis on reliability (structured validation, safe defaults, logging, and state tracking). Also used LangGraph to orchestrate a multi-step video summarization pipeline, and has experience partnering with non-technical stakeholders to define “completion” criteria and reporting needs.”
“Forward Deployed Engineer at EasyBee AI who productionized a self-storage customer’s multi-agent LLM system end-to-end—rebuilding it with LangGraph/CrewAI, integrating with real property management + CRM systems via an MCP server, and adding observability/guardrails for reliable daily use. Experienced in live troubleshooting of agentic workflows, developer demos/workshops (including an open-source project, MerryQuery), and partnering with sales to close deals through customer-specific technical demos and fast integration feedback loops.”
Junior Software Engineer specializing in distributed systems and ML platforms
“Built and deployed real-world systems end-to-end across security and healthcare contexts: led a 3-person team delivering a university vehicle tracking system with 30% cost savings and 1-year post-launch monitoring. Also implemented a healthcare RAG chatbot with adaptive query routing that cut LLM costs by 40% while maintaining answer accuracy, and has experience debugging non-deterministic LLM behavior in DevOps pipeline automation.”
Senior AI/ML Engineer specializing in LLMs, AI agents, and cloud-native backend systems
“Built and owned a production-grade RAG/LLM support automation system on AWS using GPT-4, Pinecone, FastAPI, and Redis, taking it from initial experimentation through deployment, monitoring, and iterative improvement. Their work reduced support workload and ticket volume by about 40%, improved CSAT and self-service resolution, and they also created shared Python/LLM infrastructure that accelerated other teams' delivery from weeks to days.”
Junior Full-Stack Engineer specializing in AI-powered web applications
“Full-stack engineer building production AI/RAG systems for benefits workflows, including a state-level deployment that introduced filestore-based evaluation and improved answer correctness by about 30%. Strong across Next.js, backend infrastructure, and AI evaluation tooling, with hands-on experience in LangChain/LangGraph, Langfuse, accessibility-minded UI work, and zero-to-one product leadership in fast-moving environments.”
Mid-level AI/ML Software Engineer specializing in GPU-optimized LLM inference and cloud microservices
“Built and deployed a production RAG-based multilingual analytics assistant for healthcare operations, enabling non-technical teams to query claims/EHR and risk metrics with grounded explanations. Demonstrates strong end-to-end LLM system engineering (retrieval tuning, re-ranking, hallucination controls, verification layers) plus workflow orchestration (Airflow/Composer/Step Functions) and stakeholder-driven iteration via prototypes and dashboards.”
Intern Data Scientist specializing in GenAI agents, RAG, and ML platforms
“LLM/agent systems builder who deployed a production hybrid router for immerso.ai that dynamically selects retrieval vs reasoning vs generative pathways, achieving an 82% factual-accuracy lift. Deep hands-on experience optimizing local Mistral 7B inference (4–5 bit GGUF quantization, KV-cache reuse) and building reliable RAG/agent workflows with LangChain/LangGraph/AutoGen across GCP Cloud Run and AWS (ECS/Lambda).”
Mid-level GenAI Engineer specializing in LLM agents and RAG systems
“Built and deployed a production RAG-based LLM assistant that answers day-to-day operational questions from internal PDFs/SOPs, with strong emphasis on data consistency (metadata versioning, confidence thresholds, conflict handling) and low-latency retrieval at scale. Experienced designing and orchestrating multi-agent LLM workflows (retrieval/validation/generation) and pipeline orchestration for ingestion/embedding/vector-store updates, plus iterative delivery with non-technical operations/business stakeholders.”
Junior AI Engineer specializing in LLM evaluation, prompt engineering, and AI orchestration
“LLM workflow builder who has deployed a personalized GPT experience (including Delphi AI-based knowledge ingestion) and built a LangChain/LangGraph job-aggregation pipeline that ingests, normalizes/dedupes, filters, then uses an LLM to rank and summarize matches. Emphasizes production reliability with structured outputs, retries/fallbacks, metric-driven evaluation, logging/prompt versioning, and A/B testing, and collaborates with non-technical stakeholders through demo-driven iteration.”
Junior Machine Learning & Backend Engineer specializing in LLM systems and ML infrastructure
“Built and deployed production RAG-based document search/Q&A systems (DocChat and an internship marketing RAG), using a React + FastAPI stack on GCP with docs stored in GCP buckets and retrieval via embeddings/vector DB. Emphasizes cost/performance tradeoffs (reported ~40% cost reduction) and ships via Docker (Railway), with load/API testing using JMeter and Swagger; regularly collaborates with a CEO stakeholder to iterate and push changes to production.”
Mid-level AI/ML Engineer specializing in LLM agents, RAG retrieval, and IoT ML systems
“Built production LLM-driven products including a job-hunt AI (job ranking + resume optimization) and an InterviewAI agentic pipeline using LangChain. Focused on practical deployment concerns like securing OpenAI usage via rate limiting and tiered quotas, and demonstrates an applied approach to choosing models, retrieval methods (RAG), and prompting strategies.”
Intern Machine Learning Engineer specializing in Generative AI and RAG systems
“Early-career AI/LLM builder who created and deployed a multi-agent news analysis agent (Patrakarita) using CrewAI, coordinating researcher/analyst roles to turn noisy article URLs into structured, prioritized outputs (claims, tone, verification questions, opposing views). Strong focus on orchestration debugging and reliability evaluation, including measuring hallucination/redundancy and improving reasoning by refactoring pipeline sequencing.”
Junior Data/AI Engineer specializing in MLOps, real-time pipelines, and LLM applications
“Built an LLM-driven MLOps agent at SBD Technologies that automated an EV-charging prediction workflow end-to-end, integrating with real-time Kafka/FastAPI systems supporting 120K+ chargers at 99.99% event delivery. Addressed frequent schema drift by implementing SQLAlchemy/Flyway validation (60% reduction in drift issues) and deployed as Kubernetes microservices with GitHub Actions CI/CD; also has Airflow-based ingestion/crawling experience into Snowflake and stakeholder-facing delivery via a Fleetcharge PWA.”
Mid-level Software Engineer specializing in AI, full-stack systems, and FinTech
“Product-minded full-stack engineer with experience in fintech identity verification and industrial analytics, focused on turning repeated operational pain points into reusable platforms. Built real-time KYC/KYB dashboards, secure cross-platform web components, and a multi-tenant workflow engine that cut onboarding from 2 weeks to 1 day while materially improving conversion, reliability, and developer speed.”
Mid-level AI Engineer specializing in LLM systems and data platforms
“AI/backend engineer who independently built and operated an agentic telecom analytics system end-to-end, using LangGraph and Claude to turn natural language into safe SQL in a regulated environment. He combines startup-speed execution with compliance-minded rigor, citing 95%+ NL-to-SQL accuracy, a 30-minute-to-2-minute workflow improvement, and zero-findings support across three regulatory audit cycles.”
Mid-level Systems Software Engineer specializing in distributed cloud infrastructure
“Backend-leaning full-stack engineer in fintech/payments who shipped an end-to-end Stripe payments + webhook system for a financial microservices platform, emphasizing ledger accuracy via idempotency, transactional writes, retries, and DLQs. Also delivered a real-time React/TypeScript payment status dashboard informed by user interviews, and improved production performance by 35% p95 latency through PostgreSQL tuning and Redis caching on AWS.”
Mid-level Full Stack AI Engineer specializing in LLM and RAG systems
“Founding engineer and full-stack AI builder who single-handedly created Aura Groups Sweden's Trust and Growth platform across frontend, backend, ETL, and LLM services. Has hands-on experience shipping RAG-based products with OpenAI APIs and using them in real workflows, plus early-stage startup experience at nesoi.ai where they helped get an AI learning platform adopted by teams at Bain and Amazon.”
Senior Software Engineer specializing in Applied AI and FinTech
“Backend engineer with experience building an end-to-end civic tech AI platform that ingests city council meeting videos, transcribes them with Whisper, and enables natural-language Q&A via a LangChain/FAISS RAG pipeline. Demonstrated strong systems thinking by tuning retrieval for accuracy/latency/memory (cutting response time ~3s→1s and memory ~500MB→25MB) and by safely migrating an ERP from monolith toward services using dual writes, reconciliation, and idempotency to protect financial workflows.”
Senior Software & AI Engineer specializing in full-stack development and FinTech AI
“Startup-focused full-stack engineer who has worked across fintech and digital health, including Pivotxy and Cybele Health. They combine backend/API development with AI integration, including GPT-powered financial reporting and a finance agent benchmark, and have helped turn manual report workflows that took weeks into outputs generated in minutes.”
Mid-level Full-Stack & XR Developer specializing in GenAI and immersive AR/VR systems
“Built and deployed a "personal second brain" product (CloneMind) with an end-to-end RAG pipeline for retrieving information across PDFs, URLs, images, and audio using Next.js/Node.js/Postgres/Supabase/Redis. Demonstrates strong practical depth in retrieval quality tuning, latency reduction via caching, and stateful orchestration with LangChain/LangGraph, plus experience persuading a non-technical professor stakeholder by shipping a working prototype.”
Entry-Level GenAI/LLM Engineer specializing in agentic systems and RAG
“LLM/AI agent engineer with consulting/contract experience (Kanhaiya Consulting LLC) who deployed a production AI agent to automate BIM list workflows end-to-end—from database understanding and data cleaning to automated visualizations/dashboards. Worked around restricted real-time data access by generating synthetic data and improving outputs via supervised fine-tuning, and uses AWS-based LLMOps observability (Opic/OPEC) plus hybrid retrieval (vector+BM25 with reranking) to optimize relevance, latency, and cost.”