Pre-screened and vetted.
Mid-Level AI/ML Software Engineer specializing in agentic LLM systems
“Built and deployed a production LLM-powered multi-agent compliance copilot (life sciences/finance) using LangChain/LangGraph + RAG over vector databases, delivered via async FastAPI on Kubernetes. Emphasizes audit-ready, deterministic outputs with schema constraints and citations, plus rigorous evaluation/monitoring; reports 60%+ reduction in manual research time and successful production adoption.”
Junior Machine Learning Engineer specializing in MLOps and LLM/RAG systems
“LLM/agentic workflow builder focused on productionizing document-processing systems. Redesigned pipelines with LangGraph + RAG, schema-aware validation, and eval/monitoring loops; known for fast incident diagnosis (restored accuracy from ~70% to >95% same day). Partners closely with sales and stakeholders to deliver tailored demos and drive adoption (reported +40%).”
Mid-Level Full-Stack Software Engineer specializing in API-first microservices and cloud platforms
“Backend-focused engineer who built a resume processing and job application platform using Python/MongoDB/Streamlit, including OpenAI-powered skill/keyword extraction and recruiter-facing search/filtering. Has hands-on cloud deployment experience on AWS/Azure and executed an on-prem reservation portal migration to Azure using a phased trial-and-cutover approach; also automated CI/CD with Jenkins and GitHub Actions.”
Mid-Level Software Engineer specializing in Cloud, GenAI, and Federal systems
“Cloud-focused engineer experienced deploying and stabilizing complex production systems that span APIs, infrastructure, and automated workflows, with a strong observability and safe-release mindset (feature flags/canaries/rollbacks). Has hands-on, customer-facing incident leadership, including executing DR regional failover during an AWS us-east-1 outage to maintain service and reportedly save a client ~$10M.”
“LLM/agent workflow engineer with healthcare experience (CVS/CBS Health) who built and deployed a production call-insights platform using Azure OpenAI + LangChain/LangGraph, including sentiment and compliance checks. Demonstrates deep HIPAA/PHI handling (tenant-contained processing, redaction, RBAC/encryption/audit logging) and production rigor (testing, eval sets, validation/retries, autoscaling) to scale to thousands of transcripts.”
“Built and deployed a production Retrieval-Augmented Generation (RAG) platform in a healthcare setting to automate clinical documentation review and summarization, targeting near-real-time, explainable outputs. Emphasizes grounded generation to reduce hallucinations, latency optimizations (chunking/embedding reuse), and PHI-safe workflows with access controls, plus strong orchestration experience using Apache Airflow.”
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and multimodal modeling
“Built and productionized a telecom-focused RAG assistant by LoRA fine-tuning LLaMA-2 and integrating LangChain+FAISS behind a FastAPI service, with dashboards and a human feedback UI for engineers. Demonstrated measurable impact (≈40% faster document lookup, +8–10% retrieval precision) and strong MLOps rigor via Airflow orchestration, CI/CD, and monitoring for drift and failures.”
Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications
“Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).”
Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps
“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”
Principal Data Scientist specializing in NLP and Generative AI
“ML/NLP practitioner with experience building an embedding-based ad matching and search system at Vericast (BERT embeddings + similarity search) to replace a third-party taxonomy approach, evaluated via a human-curated gold standard. Also built a custom NER pipeline at Allstate for auto accident claims calls using a bidirectional LSTM and achieved 90%+ F1, with a strong emphasis on production-grade ML workflows (testing, CI/CD, orchestration, versioning, validation).”
“字节跳动实习期间将内部AI重量预测模型从“可用但难上线”的单点能力,改造成可商业化复用的通用API:统一多地区接口与评估口径,设计分层兜底与置信度分级,先灰度上线SEA/JP并推动US/EU落地,结合线上结果进行模型微调。具备LLM/RAG/Agent系统的实战排障方法论,以及面向开发者与售前场景的技术演示与跨团队推进能力。”
Executive CTO & AI Architect specializing in regulated SaaS (InsurTech/Healthcare/FinTech)
“Insurance-tech CTO and repeat founder with 10+ years in insurance startups; was employee #4/CTO at Polly (formerly DealerPolicy) and helped scale it from a PowerPoint to 250 employees while raising $180M+. Currently building and selling AgentCanvas.ai—an extensible AI accelerator platform for large insurance agencies—after coding the product end-to-end and now running demos/POCs with prospective buyers.”
Mid-level Backend Software Engineer specializing in AWS cloud and FinTech platforms
“JP Morgan engineer and Texas A&M student web developer who has owned production systems end-to-end, including a real-time ML training workflow that improved internal search relevance by 30%. Experienced with AWS cloud migrations and operating containerized services on ECS with CloudWatch+ELK observability, Terraform infra, and Spinnaker CI/CD; also built event-driven pipelines with RabbitMQ and Elasticsearch at 1M+ record scale.”
Intern Software Engineer specializing in edge AI deployment and distributed systems
“Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).”
Junior Software Engineer specializing in distributed systems and cloud-native backend services
“Founding engineer at a civic-tech startup (Barrow) who built and operated a Next.js/TypeScript product with map-based public reporting, including clustering and dynamic geospatial loading to improve UX and performance. Also implemented a location-aware RAG chatbot using Pinecone, web scraping/transcription, caching, and fallback web search, and owned post-launch observability plus scaling decisions (load balancing/horizontal scaling) based on API usage patterns.”
Principal Data Scientist specializing in healthcare analytics and medical imaging AI
“Developed an LLM-driven recommendation agent in Azure Databricks to triage oncology patients and trigger second-opinion case creation using medical claims and EHR data. Uses ICD-10/CPT/J-code features in prompts, embeddings + vector DB similarity, and a backtesting framework emphasizing recall to avoid missing clinically relevant cases while supporting business revenue.”
Mid-level Software Engineer specializing in embedded AI and full-stack systems
“Robotics software engineer who built and owned core navigation components for a TurtleBot in ROS/ROS2 and Gazebo, including an RRT-based planner, waypoint-to-velocity motion planning, and PID trajectory tracking. Demonstrates strong real-time debugging skills (control-loop timing under CPU load), costmap/occupancy-grid tuning, and distributed ROS2 communication design using DDS/QoS, plus Docker and CI/CD automation experience from Keysight.”
Engineering Leader specializing in Digital Health, AI, and Cloud Platforms
“Senior Engineering Manager at Roche leading two Scrum teams building internally shared (“inner-sourced”) tools and libraries for a healthcare enterprise. Has led security/compliance-first architecture decisions (e.g., Python AI modules running inside a Java container) and front-end modularization (Angular monorepo to module federation), with a strong focus on developer experience via automated Swagger/OpenAPI documentation and robust testing/versioning practices.”
Senior Machine Learning Engineer specializing in optimization, LLMs, and on-device AI
“Engineer with hands-on experience debugging and hardening a fixed-point implementation for an internal PoC, quickly diagnosing overflow/underflow issues that caused intermittent failures across thousands of runs and delivering a code fix. Comfortable presenting technical solutions with layered slide depth and doing follow-up deep dives for interested stakeholders, though has limited direct customer/sales partnership experience.”
Junior AI/ML Systems Engineer specializing in LLM infrastructure and distributed training
“Built and shipped a production NMT system translating medical documentation for a rare/low-resource language, tackling data scarcity with retrieval-driven pattern matching plus dictionary/grammar- and LLM-based augmentation and validating quality with a linguistic expert. Also develops agentic LLM workflows with LangChain/LangGraph (including a deep-research style system) and has experience aligning medical AI deployments with clinician-defined risk metrics and human-in-the-loop decision making.”
Junior Full-Stack Software Engineer specializing in cloud-native microservices
“Backend engineer with hands-on IoT and AI product work: built a decoupled Raspberry Pi + AWS IoT Core weather monitoring backend and a Dockerized FastAPI LLM service on AWS ECS using OpenAI/HuggingFace with an emerging RAG layer. Also delivered measurable performance gains at DAZN by redesigning event-driven/serverless ingestion (SNS, S3->Lambda->DynamoDB), cutting latency ~30% and boosting throughput ~25% while automating ~90% of manual sync work.”
Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision
“Built an LLM-powered learning assistant (EduQuizPro/EduCrest Pro) that uses RAG over URLs and PDFs to generate quizzes, notes, and explanations for students/professors. Emphasizes production robustness—implemented dependency fallbacks (FAISS/Sentence Transformers/Gradio), CLI-safe mode, and NumPy-based indexing—along with a custom orchestration layer to keep multi-step AI workflows reliable.”
Director-level Mobile Engineering Manager specializing in Generative AI and agentic mobile experiences
“iOS player-coach who led end-to-end development of real-time customer support chat and unified notification systems for T-Mobile’s iOS app using SwiftUI, Firebase, WebSockets, and Core Data (including offline handling). Drove measurable reliability/latency gains (~30%) through a major notification refactor and owned a high-severity push-notification incident from rollback through RCA and backward-compatible hotfix, while also scaling team process and people management.”
Mid-level AI/ML Engineer specializing in LLM agents, RAG, and enterprise ML systems
“Built a production multi-agent recommendation/RAG system for internal data analysts to speed up weekly report creation by improving document discovery and automating report/SQL generation. Implemented LangGraph-based orchestration with deterministic agent routing, robust error handling (interrupt/resume), and metadata-driven semantic chunking for diverse PDF/document formats, plus monitoring for latency, throughput, and token/cost efficiency.”