Pre-screened and vetted.
Junior AI/ML Engineer specializing in LLMs, RAG, and information retrieval
“Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.”
Mid-level Backend Engineer specializing in distributed microservices and event-driven systems
“Software engineer (Yellow.ai) who built and productionized an AI-driven resume tailoring system using embeddings + Chroma RAG + QLoRA fine-tuning, deployed via Docker/Kubernetes with CI/CD on a CPU-only Oracle VM. Demonstrates strong reliability/evaluation rigor (custom hallucination/coverage/relevance metrics) and measurable business impact, including a 60% user satisfaction lift from improving chatbot intent accuracy with product and support teams.”
Junior Data Scientist / ML Engineer specializing in LLMs and RAG systems
“Built and deployed a production enterprise LLM-powered RAG assistant for the construction domain, enabling natural-language querying across PDFs/reports and structured sources (SQL/CSV). Implemented an agent-based routing and multi-agent orchestration approach (LangChain/LangGraph) to reduce hallucinations, improve latency, and deliver actionable, structured responses based on stakeholder feedback.”
Intern AI/Data Scientist specializing in LLMs, RAG, and MLOps
“Internship project at Builder Market: built an end-to-end production multimodal LLM application that estimates renovation/replacement costs from appliance photos (CLIP embeddings) or text descriptions, combining fine-tuning with agentic RAG. Focused heavily on real-world performance constraints (latency and cost), using parallel agent workflows, model routing to smaller/open-source models, re-ranking, and retrieval chunking, and collaborated closely with the CEO and co-founders to deliver the solution.”
Senior Machine Learning Engineer specializing in NLP, computer vision, and edge AI
“AI/LLM engineer who built a production RAG-based Text2SQL engine using Qdrant, including creating the underlying business/DB documentation, generating a test dataset, and designing detailed SQL-quality metrics for validation. Also partnered with non-technical stakeholders on a speech recognition project to prioritize medical terminology, improving accuracy through targeted corpora, lookup-table correction, and fine-tuning with a modified loss function.”
Mid-level Full-Stack Developer specializing in AI automation and RAG pipelines
“Frontend engineer who has led mobile-first and web React/TypeScript products end-to-end, including an expense tracking app handling sensitive financial data and a real-time messaging/activity dashboard with chat, presence, and contextual side panels. Emphasizes scalable architecture, rigorous component-boundary testing, and production-safe rollout practices (feature flags, analytics/logging, staged releases) to ship reliably in fast-paced environments.”
Mid-level AI Engineer specializing in generative AI, multimodal evaluation, and agentic RAG systems
“Built and productionized an agentic LLM automation system for an insurance client to determine medication eligibility, using prompt-chaining plus a RAG pipeline over policy rules and deploying on AWS (Lambda/Step Functions, Bedrock) with a serverless architecture. Addressed major data/schema mismatch issues via a semantic matching pipeline and validated performance through human agreement scoring, A/B testing, KPI monitoring, and confidence-based human-in-the-loop review.”
Mid-level AI/ML Engineer specializing in fraud detection and Generative AI (RAG)
“AI/ML engineer who has shipped production LLM and ML systems, including a RAG pipeline that ingested ~500k insurance/client documents to help adjusters answer questions faster and more consistently. Experienced in handling messy real-world document formats, tuning retrieval/chunking, and reducing latency via vector search optimization, precomputed embeddings, and caching. Also built orchestrated fraud-detection deployment workflows using AWS Step Functions and SageMaker, and partners closely with non-technical operations teams on NLP automation.”
Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics
“Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).”
Mid-level Machine Learning Engineer specializing in NLP, Generative AI, and RAG systems
“Built and deployed a production LLM-powered phone assistant for a healthcare clinic, combining streaming STT/TTS with RAG over approved clinic documents and strict safety guardrails to prevent unverified medical advice, plus seamless human handoff. Also has hands-on Apache Airflow experience building robust daily ML/data pipelines with data validation, retries/timeouts, monitoring, and metric-gated model deployment, and iterates closely with clinic staff using real call reviews.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.”
“ML/LLM engineer with production experience building a RAG-based LLM support assistant (FastAPI, Redis, Kafka) with multi-layer validation and human-in-the-loop feedback to improve accuracy over time. Has orchestration and MLOps depth using Airflow and Kubeflow on Kubernetes (autoscaling, alerting, monitoring) and delivered measurable ops impact (40% ticket efficiency improvement) by partnering closely with customer support teams.”
Mid-level Data Scientist / ML Engineer specializing in MLOps and Generative AI
“Built and deployed an AI agent to help patients navigate complex housing information by scraping and normalizing unstructured data across all 50 U.S. states, then layering a LangChain RAG system with MMR re-ranking to reduce hallucinations. Experienced in orchestrating multi-agent workflows (LangGraph/CrewAI) and production reliability practices (Pydantic-validated outputs, LLM-as-judge evals, tracing). Also delivered stakeholder-facing explainability via SHAP dashboards for a loan-approval predictive model at Welspot.”
Senior Backend Software Engineer specializing in distributed systems and cloud microservices
“Backend engineer with NTT Data experience building Java/Spring Boot services for product-data ingestion, including Kafka-based asynchronous pipelines and Redis read-through caching. Also built a personal RAG system deployed on Google Kubernetes Engine (GKE) using FastAPI, LangChain, and Pinecone with multi-tenant data isolation; holds a Master’s background in Machine Learning.”
Mid-level Data Scientist/MLOps Engineer specializing in NLP, GenAI, and cloud ML platforms
“AI/ML engineer who led production deployment of a multimodal (text/video/image) RAG system on GCP using Gemini 2.5 + Vertex AI Vector Search, scaling to 10M+ documents with sub-second latency and +40% retrieval accuracy. Strong MLOps/orchestration background (Kubernetes, CI/CD, Airflow, MLflow) with proven impact on reliability (75% fewer incidents) and deployment speed (92% faster), plus experience delivering explainable ML (XGBoost + SHAP + Tableau) to non-technical retail stakeholders.”
Mid-level AI Software Engineer specializing in computer vision and multimodal systems
“Robotics/perception engineer focused on production-grade, real-time systems: optimized self-supervised segmentation on Jetson Nano from ~6–10 FPS to ~20–25 FPS and scaled experimentation/deployment by unifying 15+ edge models in a modular PyTorch Lightning framework. Experienced in integrating distributed LiDAR-camera fusion via gRPC/protobuf into mission planning, migrating ROS1→ROS2 Foxy for multi-drone perception, and adding Prometheus-based observability for long-running deployments.”
Junior AI/ML Engineer specializing in NLP, LLMs, and MLOps deployment
“Built and deployed NeuroDoc, a production-grade RAG system for PDF Q&A that delivers citation-backed answers with strong anti-hallucination guardrails. Experienced in orchestrating and scaling ML/LLM pipelines with Kubernetes, Airflow/Prefect, and PyTorch Distributed, and in building rigorous evaluation and citation-verification tooling to ensure reliability in production.”
Mid-level Machine Learning Engineer specializing in LLM apps, RAG pipelines, and MLOps
“Software engineer with connected-car/automotive production experience who owned an end-to-end remote door lock/unlock feature and introduced unit testing (GTest) plus rig/simulator validation. Also built and productionized an AI-native AWS cloud cost assistant (Lex + GPT-based LLM + Lambda + RAG/vector DB) with guardrails and achieved 94% evaluation accuracy. Helped replace a third-party solution with an in-house build, saving the company ~€9M.”
Mid-level Full-Stack & ML Engineer specializing in AI SaaS, MLOps, and cloud infrastructure
“Built and shipped an AI-powered driver ranking/assignment system at AffirmoAI using LLM intent classification + RAG over pgvector/Postgres, served via FastAPI with a React UI that explains scores. Drove measurable improvements through optimization and iteration (latency down to <800ms, adoption 60%→90%+) and implemented rigorous eval loops with dispatcher ground truth plus cold-start handling for new drivers.”
Mid-level Full-Stack Software Engineer specializing in scalable web apps and automation
“UE5 UI engineer who has shipped production-ready HUD/menu frameworks using C++/Slate/UMG and CommonUI, emphasizing MVVM-style architecture for maintainability and designer-friendly iteration. Strong in UI profiling/optimization (Unreal Insights + Slate Profiler), including Slate list virtualization and event-driven updates that improved UI frame time by ~30% in heavy menu scenarios.”
Mid-Level Software Engineer specializing in Java/Spring microservices and cloud event-driven systems
“LLM/agentic-systems practitioner who has repeatedly taken LLM-driven pricing/decision services from prototype to production using pilots, guardrails, observability, and staged rollouts. Demonstrates strong real-time incident troubleshooting (dependency timeouts, cached fallbacks) and post-incident hardening (isolation/async/alerts), and also supports go-to-market via developer workshops, technical demos, and sales-aligned POCs.”
Mid-level Data Engineer specializing in cloud data platforms and AI/ML analytics
“Backend/data engineer in healthcare who built an AWS-based clinical analytics platform from scratch (DynamoDB/S3/Airflow/dbt) with sub-second clinician query goals, 99.9% uptime, and HIPAA-grade controls (KMS encryption, IAM RBAC, audit trails). Also modernized ML delivery by replacing a manual 4-hour deployment with a 30-minute Docker/GitHub Actions CI/CD pipeline using parallel runs, parity testing, and rollback, and caught critical EHR data edge cases (date formats/timezones) that could have impacted patient care.”
Junior Software Engineer specializing in full-stack web and cloud systems
“Co-op engineer at EnFi who built and maintained a multi-tenant prompt library and LLM workflow tooling used by internal teams and external enterprise clients. Led TypeScript/React package design and standardized a typed workflow abstraction across disparate implementations (React, Go, JSON), improving reliability and developer adoption. Delivered measurable performance gains (~25% latency reduction) and owned end-to-end execution including docs, demos, debugging, and deployment.”
Junior AI/ML Engineer specializing in LLM agents and RAG systems
“Backend/data engineer who built a production-ready multi-agent financial intelligence system (Mycroft) that orchestrates specialized AI agents to analyze real-time market data using FastAPI and Pinecone vector search. Brings strong security/reliability instincts (rate limiting, JWT/OAuth2, retries/backoff, health checks) and has caught high-impact data integrity issues in financial migrations (timezone normalization across global legacy systems).”