Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in Generative AI and RAG pipelines
“AI/LLM engineer with healthcare domain experience who built a production clinical support “chart bot” for Molina, including PHI-safe ingestion of 200k+ PDF policies, vector retrieval, and a fine-tuned LLaMA served via vLLM on ECS Fargate. Demonstrated measurable performance wins (HNSW + namespace partitioning; 30% inference latency reduction) and a rigorous evaluation/monitoring approach, while partnering closely with nurses and operations teams to shape workflows and guardrails.”
Mid-level AI/ML Engineer specializing in LLMs, MLOps, and cloud-native ML
“LLM/agent engineer at USAA who built a production GPT-4o RAG conversational assistant for financial analysts, focused on regulatory interpretation and internal documentation search. Emphasizes compliance-grade reliability with strict grounding, safe fallbacks, and full auditability via MLflow/DVC plus human-in-the-loop review; reports ~45% reduction in ticket resolution time.”
Senior Software Engineer specializing in backend APIs and cloud-native services
Mid-level AI/ML Engineer specializing in generative AI and MLOps
Senior Data Engineer specializing in forecasting, analytics platforms, and BI
Mid-Level Full-Stack Software Engineer specializing in cloud-native FinTech and ERP systems
Mid-level AI/ML Engineer specializing in financial risk, fraud detection, and NLP
Mid-level AI/ML Engineer specializing in cloud MLOps and scalable model deployment
Mid-level Backend Software Engineer specializing in Python microservices and cloud-native APIs
Intern Full-Stack Software Engineer specializing in AI-powered web applications
Senior GenAI Engineer specializing in LLM agents and insurance automation
Mid-level Full-Stack Software Engineer specializing in cloud-native microservices
Mid-level Machine Learning Engineer specializing in MLOps and applied data science
Mid-level Full-Stack Software Engineer specializing in GenAI and SaaS platforms
Mid-level AI Engineer specializing in NLP, computer vision, and MLOps
Senior Software Engineer specializing in backend, DevOps, and LLM-powered systems
“Backend-focused Python engineer who has owned production FastAPI services deployed on Kubernetes, including CI/CD (GitLab CI to ECR) and GitOps delivery via ArgoCD/Helm. Has hands-on experience with complex reliability and infrastructure work—solving data inconsistency with validation/partial-data paths, fixing K8s liveness issues via lazy loading, and supporting a phased cloud-to-on-prem migration with dual-writes and monitoring. Also built Kafka-based real-time ingestion consumers handling bursty, high-throughput traffic with async processing and topic/retention tuning.”
Mid-level Machine Learning Engineer specializing in production ML, forecasting, NLP and computer vision
“Built and deployed a production LLM-powered support assistant for customer support agents using a RAG architecture over internal docs and past tickets, with human-in-the-loop review. Demonstrates strong applied LLM engineering focused on real-world constraints (hallucinations, latency, cost) using routing to smaller models, reranking, caching, and rigorous evaluation/monitoring (offline eval sets, A/B tests, KPI tracking).”
Intern AI/Data Scientist specializing in LLMs, RAG, and MLOps
“Internship project at Builder Market: built an end-to-end production multimodal LLM application that estimates renovation/replacement costs from appliance photos (CLIP embeddings) or text descriptions, combining fine-tuning with agentic RAG. Focused heavily on real-world performance constraints—latency and cost—using parallel agent workflows, model routing to smaller/open-source models, re-ranking, and retrieval chunking, and collaborated closely with CEO/co-founders to deliver the solution.”
Mid-level Data Scientist / ML Engineer specializing in healthcare predictive analytics and NLP
“Built and deployed a real-time hospital readmission risk prediction system at NYU Langone Health, combining structured EHR data with BERT-based NLP on clinical notes and serving predictions to clinicians via Azure ML and FHIR APIs. Emphasizes production reliability and clinical trust through SHAP-based explainability and robust healthcare data preprocessing, and reports a 22% reduction in 30-day readmissions.”