Pre-screened and vetted.
Junior Software/ML Engineer specializing in AI systems, cloud infrastructure, and applied research
“Backend/infra-focused engineer with experience spanning Go-based MCP servers for an AI-assisted Kubernetes on-call diagnosis chatbot and a Python/Flask PagerDuty automation integration. Previously at Tesla, optimized high-volume battery test data in PostgreSQL using JSONB, partitioning, and a timestamp normalization pipeline; also built PyTorch PINN training workflows and achieved a 20x speedup via batch vectorization.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference
“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”
“Backend/full-stack engineer (Amazon experience) who built an AWS-based integration testing platform using Flask, ECS, Docker, and CloudWatch—cutting 1000+ test cases from ~5 hours to ~30 minutes while improving log visibility for non-engineering users. Also led a zero-downtime EU region migration with rigorous ORR testing, and built a Kinesis/Firehose/S3 + Glue/Spark replay mechanism for resilient data recovery. Side project: reproducible, cost-efficient LLM hosting platform on EKS using CDK and Karpenter for scale-to-zero.”
Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms
“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Principal Cloud & Digital Transformation Architect specializing in Financial Services and Data Platforms
“Technology-first venture builder with strong familiarity in the VC/accelerator landscape, specializing in greenfield innovation, M&A, and large-scale transformation/modernization. Described building a venture-funded retail banking greenfield startup to integrate lending-as-a-service for SME lending while meeting federal and local financial services compliance requirements.”
Mid-level Software Engineer specializing in AI/LLM and distributed systems
“Recent internship project at Google Workspace building an LLM-driven Python backend pipeline to extract/enrich NLP features from messy customer web domains and integrate them into a Domain Feature Store for personalization and promotions. Also has hands-on Kubernetes/Docker deployment experience for a Digital Signage SaaS backend with GitHub Actions CI, plus strong streaming-systems knowledge (Kafka exactly-once, schema evolution, Flink scaling) and built an information retrieval system handling 30,000+ cases.”
Mid-level Software Engineer specializing in distributed backend systems on AWS
“Built production systems in the AWS ecosystem, including an internal AI assistant for diagnosing account transfer and permissions issues and an end-to-end account transfer workflow used by enterprise customers. Stands out for combining LLM/RAG design with strong distributed systems reliability practices, emphasizing guardrails, fallbacks, and operational trust in high-stakes workflows.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“ML/LLM engineer who built a production RAG system (GPT-4 + FAISS + FastAPI) to deliver fast, grounded answers from proprietary documents, optimizing for sub-200ms latency and high-concurrency scale. Strong MLOps/observability background: drift monitoring with Prometheus + Streamlit, automated retraining via Airflow, Kubernetes autoscaling, and MLflow-managed model lifecycle, plus inference cost reduction through quantization and structured pruning.”
Mid-level Software Engineer specializing in backend distributed systems and cloud platforms
“Software engineer at Intel who owns a production Go/Kubernetes backend for supply-chain transparency and end-to-end hardware integrity verification in a hybrid cloud setup (AWS control plane + Azure data plane). Also built and shipped an AI agent workflow for real-estate due diligence that turns raw Excel spreadsheets into structured investment outputs and auto-generated PowerPoint insights using LangGraph, with strong emphasis on verification, observability, and reliability guardrails.”
Mid-level Software Developer specializing in cloud data engineering and MLOps
“Software engineer with strong AWS production experience, including an end-to-end historical backfill system exporting ~10PB of CloudWatch logs into a data lake using Step Functions/Kinesis/Lambda/Firehose/Glue. Emphasizes reliability and operability (DynamoDB checkpointing, monitoring dashboards, CI/CD with canary tests) and has also built customer-facing UI work for the Visa Developer Portal using Angular + Spring Boot, plus React/Redux frontend work.”
Executive Technology Leader specializing in Enterprise AI, Cloud Architecture, and Data Platforms
“Senior data/technology executive who stays hands-on: currently building a Go micro-kernel orchestration layer for medical AI agents to boost concurrency and enforce HIPAA/PHI controls, achieving 26x throughput on migrated workloads. Has led large-scale transformations across healthcare and financial services, including a 45-day data warehouse rebuild at Elara Caring and a data/ML roadmap at Acelity credited with $230M in annual revenue impact prior to 3M acquisition.”
Senior Full-Stack Python Developer specializing in cloud-native RAG and microservices
Mid-level Software Development Engineer specializing in cloud platforms, data engineering, and LLM apps
Intern Software Engineer specializing in data science and network visualization
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
Mid-level AI/ML Engineer specializing in recommender systems, fraud detection, and LLMs
Mid-level Full-Stack Java Engineer specializing in scalable microservices and real-time data systems
Entry Software Engineer specializing in AI infrastructure and ML inference systems
Mid-level AI/ML Engineer specializing in NLP/LLMs and production ML systems
Mid-level Full-Stack Developer specializing in cloud-native microservices and FinTech
Mid-level Machine Learning Engineer specializing in LLMs and RAG systems