Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multimodal deep learning
“ML/LLM engineer who has built and productionized a large multimodal LLM pipeline end-to-end—fine-tuning a 20B+ parameter model with distributed/FSDP training and deploying on Kubernetes via Triton for ~5x throughput. Strong focus on reliability and safety (monitoring with SHAP, guardrails, A/B testing) with reported ~22% relevance lift and reduced harmful/incorrect outputs, plus experience orchestrating ETL/retraining workflows with Airflow across S3/Snowflake/RDS.”
Junior Software Engineer specializing in full-stack and ML/NLP systems
“Entry-level full-stack engineer with internship experience at Amazon (Appstore IAP flow + uninstall recommendation workflow) and a health-tech startup (OneVector) where they built a DSUR reporting workflow end-to-end, including document generation, S3-backed versioning/metadata, and secure preview/download. Demonstrates strong production debugging and reliability mindset (instrumentation, deterministic retrieval, idempotent writes) and focuses on UX/performance in high-stakes user flows.”
Senior Full-Stack Engineer specializing in AI platforms and scalable web systems
“Built and shipped production agentic/LLM systems that could safely perform real customer and subscription operations, not just answer questions. Demonstrates unusually strong depth in agent orchestration, tool safety, evals, tracing, and backend workflow design across Node.js/TypeScript, Go, Redis, Postgres, Kafka, and GPT-4.”
Mid-level Machine Learning & Generative AI Engineer specializing in NLP, CV, and RAG systems
“Built and deployed a production LLM-powered RAG document intelligence system used by non-technical enterprise stakeholders, cutting document search time by 40%+ while improving answer consistency. Demonstrates strong MLOps/data workflow orchestration (Airflow, AWS Step Functions, managed schedulers across GCP/Azure) and a metrics-driven approach to reliability, evaluation, and cost/latency optimization with guardrails and observability.”
Junior Machine Learning Engineer specializing in LLM systems and inference reliability
“ML/LLM infrastructure-focused engineer who built a production stateful LLM inference service that cuts latency and GPU compute for repeated/overlapping prompts via caching with correctness guardrails. Strong in Kubernetes-based deployment and reliability engineering, using A/B testing and similarity-based evaluation to quantify performance gains without sacrificing output quality.”
“Data science/NLP practitioner with experience at NVIDIA and Microsoft building production-grade NLP and data-linking systems. Has delivered high-performing pipelines (e.g., F1 0.92) and large-scale entity resolution (F1 0.89), plus semantic search using embeddings and Pinecone with ~30–40% relevance gains, backed by rigorous validation (A/B tests, ROUGE, MRR) and strong MLOps/workflow tooling (Airflow, Databricks, FastAPI, MLflow, Prometheus/ELK).”
Principal Backend/Platform Engineer specializing in GenAI agent orchestration and LLM pipelines
“LLM-focused engineer/sales-engineering profile with hands-on experience productionizing complex systems: scalable distributed architecture, multi-tenant monitoring, canary/shadow rollouts, and robust fallback strategies. Demonstrated real-time troubleshooting depth (p99 latency spikes traced to DB connection limits causing retry storms) and strong developer-facing communication via RAG workshops and live, customer-specific demos that helped close deals quickly.”
Mid-level Software Engineer specializing in distributed systems and automation
“Engineer with experience at Flexport and Microsoft, focused on full-stack and backend platform work in complex workflow and permissions-heavy systems. They’ve shipped customer-facing Next.js/GraphQL features, improved PostgreSQL-backed authorization performance, and driven an event-driven SQS architecture change that improved throughput by ~20% and sped up new workflow development by ~30%.”
Senior Software Engineer specializing in distributed systems, AI/ML platforms, and cloud-native SaaS
Mid-level Backend/Distributed Systems Engineer specializing in cloud observability and data ingestion
Senior Engineering Leader specializing in FinTech, AI, and data platforms
Executive AI/ML engineering leader specializing in voice AI and logistics automation
Senior Data Engineer specializing in cloud big data pipelines and real-time streaming
“Amazon data engineer who built a real-time fraud detection pipeline for AWS Lambda, tackling multi-region telemetry quality issues and scaling stream processing for billions of daily requests. Strong in production-grade data/ML workflows on AWS (EMR, Glue, Kinesis, SageMaker) with hands-on entity resolution and anomaly detection.”
Junior Machine Learning Engineer specializing in LLMs and data pipelines
“Research Extern at Google DeepMind and former AWS Software Development Engineer Intern with a strong focus on practical, trustworthy AI engineering. Built a multi-agent RAG system for personalized news headline generation using a fine-tuned Flan-T5 model, parallel critic agents, FAISS retrieval, and style embeddings, while also leading a 3-person team on the project.”
Entry-Level Backend/Cloud Engineer specializing in distributed systems and AI platforms
“Full-stack engineer with deep serverless AWS experience who built VidToNote, an AI video analysis platform, end-to-end using Next.js App Router/TypeScript and an event-driven pipeline (API Gateway, Lambda, DynamoDB, S3, Step Functions, SQS). Strong on production reliability and observability (CloudWatch, X-Ray, structured logging), plus data/analytics work in Postgres with measurable query optimizations and durable LLM evaluation workflows. Amazon background; integrated 22 AWS services and completed AWS Solutions Architect Professional certification within a month.”
Entry-Level Software Engineer specializing in ML/NLP and security
“Early-career engineer (internship background) who built a production-style notes product using Next.js App Router with Server Components/Server Actions and a Postgres-backed analytics model. Demonstrates strong performance and reliability instincts—measured DB latency improvements via indexing and cursor pagination, plus durable orchestration with Temporal using idempotency and deterministic workflows.”
Staff/Lead Software Architect specializing in Contact Center platforms and GenAI automation
“Built and deployed production LLM systems in healthcare and at LinkedIn: automated pen-and-paper clinical trial evaluations with a 40x efficiency gain and created an evidence-based Evaluation Agent focused on accuracy and speed. Also used Temporal to orchestrate resilient data-ingestion workflows for customer support staffing prediction, improving prediction outcomes by 40% while handling missing data, retries, and backfills.”
Senior Software Engineer specializing in distributed systems and AI workflow orchestration
“Backend owner at Apple for an AI workflow orchestration service, with hands-on experience stabilizing peak-traffic production systems using OpenTelemetry-style tracing, bounded async concurrency, and database performance tuning. Built and shipped a Python LLM-agent orchestration layer to automate multi-step operational workflows, emphasizing guardrails, auditability, and deterministic fallbacks to keep non-deterministic AI behavior production-safe.”
Junior AI Software Engineer specializing in LLM pipelines, OCR, and RAG
“Built and shipped a production LLM pipeline for nursing home Medicare reimbursement (PDF OCR + fact extraction + keyword RAG + QA) that reportedly increased payouts by ~$1K/month per patient. Strong in LLM ops/benchmarking (ground truth, LLM-as-judge, cost/I-O tracking) and pragmatic optimization—swapped retrieval approaches, fine-tuned a small model to cut OCR cost 90%, and migrated workloads to Azure/Temporal to scale nightly processing 10x.”
Mid-level AI/ML Engineer specializing in MLOps, LLMs, and scalable ML systems
“ML/LLM engineer at Adobe who deployed a transformer-based personalization and campaign-targeting recommender system end-to-end, including PySpark/Airflow pipelines processing 12M+ events/day and containerized inference on AWS SageMaker (Docker/Kubernetes). Also has hands-on LLM workflow experience (RAG, semantic search, prompt optimization, hallucination mitigation) with a metrics-driven approach to reliability, drift monitoring, and reproducible retraining via MLflow.”
Director-level Engineering Manager specializing in large-scale data and compute platforms
“Platform and distributed-systems leader (player-coach) who owned architecture and reliability for an Amazon analytics/data platform serving ~100K internal users at exabyte scale. Built an ML-driven “Lakeflow” optimization layer that cut pipeline completion times ~20–25% and reduced compute waste >15%, and led major incident response/redesign efforts (e.g., deletion storm) with strong rollout/observability/rollback practices.”