Pre-screened and vetted.
Mid-Level Software Development Engineer specializing in GenAI and full-stack cloud systems
“Full-stack engineer with experience across Magna, C3.ai, and Amazon, building GenAI-enabled products and finance transaction systems. Has shipped Next.js (App Router) + TypeScript features backed by Go/Python RAG pipelines, and emphasizes production quality via load testing, Selenium regression coverage, LLM-aware integration testing, and Azure observability. Also built LangGraph-orchestrated multi-step content generation workflows with robust retry/idempotency strategies.”
Mid-level Backend & Reliability Engineer specializing in AWS, Kubernetes, and automation
“Meta engineer focused on reliability/operations tooling who built a unified real-time health dashboard and scalable telemetry pipelines (AWS + Datadog) for thousands of devices. Also shipped an internal LLM-powered knowledge assistant using RAG over wikis/runbooks/logs with strong guardrails and a rigorous eval loop that drove measurable accuracy improvements via automated doc ingestion and embedding updates.”
Director-level Engineering Manager specializing in large-scale data and compute platforms
“Platform and distributed-systems leader (player-coach) who owned architecture and reliability for an Amazon analytics/data platform serving ~100K internal users at exabyte scale. Built an ML-driven “Lakeflow” optimization layer that cut pipeline completion times ~20–25% and reduced compute waste >15%, and led major incident response/redesign efforts (e.g., deletion storm) with strong rollout/observability/rollback practices.”
Mid-level AI/ML Engineer specializing in MLOps, LLMs, and scalable ML systems
“ML/LLM engineer at Adobe who deployed a transformer-based personalization and campaign-targeting recommender system end-to-end, including PySpark/Airflow pipelines processing 12M+ events/day and containerized inference on AWS SageMaker (Docker/Kubernetes). Also has hands-on LLM workflow experience (RAG, semantic search, prompt optimization, hallucination mitigation) with a metrics-driven approach to reliability, drift monitoring, and reproducible retraining via MLflow.”
Executive ML/AI Founder specializing in agentic analytics and data infrastructure
“Founder of Photosphere Labs (agentic AI for ecommerce data synthesis/analysis) who worked directly with customers to scope, build, demo, and iterate LLM-based solutions, including an AI chat product for brand owners. Previously at Block, built and explained a nuanced causal inference/propensity model tied to Square POS integrations, translating model specs and outputs into business impact for varied client contexts.”
Junior Software Engineer specializing in full-stack and machine learning
“CMU IoT coursework project builder who implemented an end-to-end TinyML gesture recognition system on a Particle Photon + ADXL345, streaming data via MQTT/Node-RED to a real-time Node.js frontend and deploying a quantized logistic regression model on-device. Also explored multi-drone coordination, implementing leader-follower offset control and a pivot/arc turning strategy to avoid collisions, and brings practical Docker/Kubernetes plus CI/CD workflow experience from internships.”
Mid-Level Software Development Engineer specializing in full-stack systems and ML
“AWS engineer who productionized an internal ML-driven data pipeline from a notebook prototype into a scalable, observable Python service (schema validation, deduplication, idempotency, safe retries, versioned transforms, CloudWatch alarms), reducing manual effort and improving data accuracy/trust. Experienced diagnosing workflow issues in real time (e.g., upstream schema changes) and partnering with account managers/support to unblock adoption of seller-facing Marketplace features by demonstrating reliability with concrete metrics.”
Senior Full-Stack Engineer specializing in Healthcare SaaS and supply chain systems
“Backend engineer with healthcare platform experience at Thirty Madison, combining Django for secure, data-heavy core services with FastAPI async microservices for real-time patient monitoring. Led Kubernetes migration with Istio service mesh, autoscaling (HPA), and stateful storage patterns, and implemented GitOps CI/CD using ArgoCD. Also built real-time Kafka streaming pipelines with reliability patterns like idempotent producers and offset management.”
Junior ML Engineer specializing in Generative AI and LLM applications
“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.”
Mid-Level Software Engineer specializing in Generative AI and RAG systems
“Built a production RAG-based natural-language-to-SQL system at Global Atlantic to replace slow, expensive manual analytics ticket workflows, focusing heavily on retrieval quality and measurable evaluation (200-question ground-truth set; recall@5 improved 0.65→0.78 via semantic chunking). Also built a custom MCP-style agent orchestrator for a personal project (arxiv-ai) to improve flexibility and Langfuse-aligned observability, and has hands-on experience with LangGraph, CrewAI, and n8n.”
Staff Applied Scientist specializing in multimodal LLM safety, robustness, and retrieval
“Built a production LLM-driven archival assistant that turns large, low-quality scanned handwritten files (120+ pages) into structured datasets, overcoming context-window and hierarchy challenges with a two-phase LLM + rules pipeline and reaching 98.1% accuracy (Gemini-2.5 Flash). Also orchestrated a large human-in-the-loop effort with 78 archivists, producing 2,400 high-quality annotations in 4 days via detailed rubrics and support.”
Mid-level AI Engineer specializing in agentic LLM systems
“Built and productionized a dual-agent LLM invoice-processing system for GFI Partners, adding guardrails and audit trails to earn stakeholder trust and drive adoption while cutting operational burden by 75%. Uses LangSmith observability to diagnose real-time workflow regressions and has experience teaching agentic AI concepts (e.g., at Carnegie Mellon) through hands-on, scaffolded demos.”
Staff Software Engineer specializing in cloud platforms for healthcare and financial workflows
“Backend/data engineer with Optum healthcare claims domain experience building high-reliability Python microservices (FastAPI/Kafka/Postgres) and AWS data platforms (EKS, Glue, Redshift). Demonstrated strong production ownership: fixed duplicate Kafka processing via transactional outbox/idempotency, scaled to millions of daily events, and delivered major SQL performance gains (40+ min to <5 min, ~60% CPU reduction). Seeking remote-only work; targets $130k base.”
Senior Software Engineer specializing in Python, cloud platforms, and distributed systems
“Backend/data engineer with production experience at Walmart and HealthSnap building Python services and data pipelines on AWS (EKS, Lambda, Glue, Airflow). Strong reliability and operations focus—implemented idempotency + circuit breakers for peak-traffic consistency issues, GitOps CI/CD, and observability. Demonstrated measurable performance wins (Postgres p95 45s to <5s, ~60% CPU reduction) and modernized SAS batch workflows to Python with parallel-run parity validation and feature-flagged rollout.”
Mid-level Full-Stack Developer specializing in interactive web apps and AWS
“Full-stack, design-minded developer who builds interactive, motion-forward experiences and translates complex creative coding (Three.js/p5.js/GLSL) into accessible UI for non-technical clients. Delivered an end-to-end manufacturing quality control image system for ChargePoint (React dashboard + AWS) and has hands-on field research experience from Hyundai EV user interviews; currently leading development of a virtual gallery for Creative Coding NYC.”
Senior Full-Stack Software Engineer specializing in workflow automation and healthcare AI
“Backend/data engineer who has owned production Python APIs and high-throughput async workflows on AWS (FastAPI, Docker, ECS/EKS/Lambda) with mature reliability practices like idempotency, bounded retries, circuit breakers, and strong observability. Also built AWS Glue ETL into an S3/Redshift lakehouse and modernized legacy batch systems via parallel-run parity testing and feature-flagged migrations, including a SQL tuning win cutting a multi-minute query to under 10 seconds.”
Junior AI Engineer specializing in healthcare analytics and compliance AI
“Built and shipped a production LLM-driven multi-agent platform (ciATHENA) at CustomerInsights.AI to automate analytics/ML/compliance workflows in healthcare and life sciences. Implemented LangGraph/LangChain orchestration with strong backend-style rigor (schemas, Pydantic validation, retries, auditability) and optimized latency/cost while keeping the system usable for non-technical users via guided natural-language interactions and structured/visual outputs.”
“Machine learning software engineer intern experience at Amazon, where they built a production testing framework to inject frames/videos onto devices to measure embedded CV model inference and ensure broad model compatibility via automatic NNA metadata handling. Also built side projects spanning LLM/RAG orchestration (LangChain/LangGraph with reranking and citations) and applied CV/healthcare work (nail disease detection, medical retrieval chatbot).”
Mid-Level Full-Stack Java Engineer specializing in cloud-native web applications
“Full-stack engineer (Snowflake) who shipped an AI/LLM-powered data exploration product end-to-end, spanning Spring Boot/Python services and a polished React UI with streaming responses and robust fallbacks. Experienced operating high-scale AWS deployments (Docker/Kubernetes, SNS/SQS, RDS Postgres, CloudWatch, Jenkins CI/CD) supporting thousands to tens of thousands of concurrent users, including handling real traffic-spike scaling incidents.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference
“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”
Mid-level Machine Learning Engineer specializing in deep learning, MLOps, and real-time inference
Senior Full-Stack Software Engineer specializing in cloud-native microservices