Pre-screened and vetted in the NYC Metro.
Director of AI/ML Engineering specializing in MLOps, data platforms, and 3D computer vision
“Backend/data engineer focused on production ML/LLM systems: built a real-time FastAPI inference API on Kubernetes with strong reliability patterns (timeouts, idempotent retries, centralized error handling). Delivered AWS platforms using EKS + Lambda with GitHub Actions/Helm CI/CD and built Glue-based ETL from S3/Kafka into Snowflake with schema evolution and data-quality controls; also modernized legacy analytics/recommendation workflows into Python services with safe, feature-flagged cutovers.”
Junior AI/ML Engineer specializing in LLM agents, RAG, and distributed systems
“Python backend engineer focused on high-throughput document/PDF processing systems, building end-to-end pipelines that extract structured content for downstream NLP use cases. Demonstrates strong practical MLOps-adjacent infrastructure skills: Kubernetes deployments, GitLab CI, GitOps workflows, and an incremental migration to AWS with deliberate EC2-versus-Lambda tradeoffs. Deep hands-on optimization experience (selective OCR, layout-aware extraction, parallelism, caching, idempotency, and backpressure/autoscaling).”
Mid-level Software Engineer specializing in distributed systems and FinTech infrastructure
“Early-career software engineer who owns revenue-critical invoice processing and internal ops tooling end-to-end. Has built TypeScript/React systems backed by MongoDB and Temporal, and designed scalable SQS-based onboarding workflows with FIFO/DLQ monitoring. Notably redesigned an Authzed SpiceDB authorization model, shrinking a 500+ line schema to ~20 lines while meeting sub-100ms p95 latency.”
Junior AI/ML Engineer specializing in MLOps and real-time model serving
“Software engineer with Amazon experience who has built LLM-powered and hybrid ML systems for ad auction/relevance at massive scale. Most notably, they described redesigning brand-query classification with a GPT-4-assisted offline cache plus fallback architecture that improved accuracy from 72% to 99%, reduced latency and costs, and was credited with an estimated $130M revenue lift.”
Director-level Data Engineering Leader specializing in AI/LLM platforms and real-time data systems
Executive Engineering Leader specializing in AI and Financial Services platforms
Senior Full-Stack Engineer specializing in cloud-native microservices and AI/LLM integrations
Mid-level Full-Stack Developer specializing in Java Spring Boot microservices
Senior Backend/Full-Stack Engineer specializing in distributed systems and event-driven APIs
Senior Full-Stack Engineer specializing in Python, React, and AI-powered cloud applications
Executive Product & Technology Leader specializing in SaaS, FinTech, and Media & Entertainment
Mid-level Data Scientist/ML Engineer specializing in LLMs, NLP, and recommender systems
Mid-level Full-Stack Software Engineer specializing in cloud microservices and AI integration
“Backend/distributed-systems engineer with Uber experience building real-time telemetry and safety signal pipelines. Strong in Kafka-based event-driven architectures, low-latency processing under peak load, and production reliability via monitoring, retries, and fallback logic; has Docker/Kubernetes and CI/CD deployment experience.”
Junior Software Engineer specializing in cloud infrastructure and billing systems
“Full-stack product engineer who built a semantic word game end-to-end across web and mobile, including a custom ML-based scoring pipeline that replaced an expensive third-party API. Also has experience shipping real-time social learning features at BU Spark, with strong instincts around product ownership, UX polish, and pragmatic infrastructure choices.”
Senior Software Engineer specializing in AI and FinTech platforms
“Built a production LLM pipeline at Walter AI that scans massive user inboxes, identifies financial newsletters, and extracts trading strategies into structured JSON for downstream paper-trading workflows. Stands out for combining agent architecture with strong production discipline: cut scan time from 20 to 5 minutes, reduced LLM costs by 90%, and achieved 3-second P99 latency while handling messy, inconsistent email data at scale.”
Mid-level Python Full-Stack Developer specializing in FinTech and real-time data/ML systems
Mid-level Technical Product Manager and R&D Software Engineer specializing in AI products
Staff-level Software Engineer specializing in AI, data platforms, and cloud infrastructure
Mid-level Software Engineer specializing in FinTech and distributed data platforms
Mid-level Full-Stack Software Engineer specializing in Java/Spring Boot and React/Angular
Senior Full-Stack Engineer specializing in React and micro-frontends
Mid-level Full-Stack & AI Backend Engineer specializing in LLM/RAG systems