Pre-screened and vetted.
“LLM/agent workflow engineer with healthcare experience (CVS/CBS Health) who built and deployed a production call-insights platform using Azure OpenAI + LangChain/LangGraph, including sentiment and compliance checks. Demonstrates deep HIPAA/PHI handling (tenant-contained processing, redaction, RBAC/encryption/audit logging) and production rigor (testing, eval sets, validation/retries, autoscaling) to scale to thousands of transcripts.”
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and multimodal modeling
“Built and productionized a telecom-focused RAG assistant by LoRA fine-tuning LLaMA-2 and integrating LangChain+FAISS behind a FastAPI service, with dashboards and a human feedback UI for engineers. Demonstrated measurable impact (≈40% faster document lookup, +8–10% retrieval precision) and strong MLOps rigor via Airflow orchestration, CI/CD, and monitoring for drift and failures.”
Intern Software Engineer specializing in C++ systems and performance optimization
“Robotics software intern who worked on a customized ROS1-based middleware, building ROS node orchestration and a ROS topic monitoring system. Improved intra-machine ROS topic performance by using shared memory and circular buffers instead of socket-based IPC, and integrated nightly Jenkins CI with Groovy/Python to run tests and produce code coverage reports.”
Senior Software Engineer specializing in document workflows and API platforms
“Backend engineer with experience building queue-driven Python/Flask systems using Celery, Redis/RabbitMQ, and SQLAlchemy/Postgres, including async/non-blocking architectures for concurrency. Also built a patient-facing full-stack app integrating LLMs (OpenAI/Claude) with streaming responses for real-time UX, and previously delivered high-throughput, reliability-critical background workflows at Dropbox (document expiration with batching, retries, and cache/side-effect handling).”
Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications
“Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).”
Staff Full-Stack Engineer specializing in AI platforms and infrastructure automation
“Backend/full-stack engineer building complex internal platforms and customer-facing demos at the intersection of infrastructure and product. Shipped a no-code Product Lifecycle Manager for manufacturing (3 manufacturers, 1000+ evolving tests) using AWS S3/SQS ingestion and extensible Postgres (EAV+JSONB) with end-to-end traceability. Also built a FastAPI-based company data intelligence platform with Okta-secured RBAC and an LLM/MCP layer for ChatGPT-like analytics over enterprise data sources.”
Mid-level AI/ML Engineer specializing in deep learning, NLP/LLMs, and MLOps
“Built and shipped a real-time oncology risk prediction system used by doctors during patient visits, trained on clinical data in AWS SageMaker and deployed via FastAPI with sub-second responses. Emphasizes clinician-trust features (SHAP explainability, validation checks) and HIPAA-compliant controls (encryption, RBAC, audit logging), plus Kubernetes-based production operations with autoscaling, monitoring, and drift/retraining workflows; collaborated closely with oncologists at Flatiron Health.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and Conversational AI
“Built a production RAG-based GenAI copilot backend at Aetna using Python/FastAPI, GPT-4, LangChain, and Azure AI Search, deployed on AKS with Prometheus/Grafana observability. Owned the system end-to-end (ingestion through deployment) and improved peak-time reliability by addressing vector search and embedding bottlenecks with Redis caching, index optimization, and async processing, plus added anti-hallucination guardrails via retrieval confidence thresholds.”
“字节跳动实习期间将内部AI重量预测模型从“可用但难上线”的单点能力,改造成可商业化复用的通用API:统一多地区接口与评估口径,设计分层兜底与置信度分级,先灰度上线SEA/JP并推动US/EU落地,结合线上结果进行模型微调。具备LLM/RAG/Agent系统的实战排障方法论,以及面向开发者与售前场景的技术演示与跨团队推进能力。”
Junior AI/ML Engineer specializing in real-time computer vision and tracking systems
“Full-stack engineer who built and owned a production real-time computer-vision inference platform at Credence, spanning Next.js App Router/TypeScript frontend with SSE/WebSocket streaming, a Flask backend, and Postgres analytics. Demonstrated measurable performance wins (70% fewer re-renders; latency cut to ~40–50ms) and strong production rigor (durable orchestration, idempotency, observability, AWS EC2 + CI/CD) with tight post-launch UX iteration based on analyst feedback.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps on AWS
“AI engineer who built a production RAG-based internal analyst tool at BlackRock, fine-tuning an LLM on proprietary financial data and adding four layers of guardrails (input/retrieval/generation/output) to improve grounding and reduce hallucinations. Implemented a LangChain-based multi-agent orchestration (7 major agents) deployed on AWS ECS, with reliability measured via internal human evaluation, LLM-as-judge, and RLHF/drift monitoring.”
Intern Software Engineer specializing in edge AI deployment and distributed systems
“Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).”
Junior Software Engineer specializing in distributed systems and cloud-native backend services
“Founding engineer at a civic-tech startup (Barrow) who built and operated a Next.js/TypeScript product with map-based public reporting, including clustering and dynamic geospatial loading to improve UX and performance. Also implemented a location-aware RAG chatbot using Pinecone, web scraping/transcription, caching, and fallback web search, and owned post-launch observability plus scaling decisions (load balancing/horizontal scaling) based on API usage patterns.”
Mid-level Python Developer specializing in AWS microservices and cloud automation
“Backend engineer focused on Python/FastAPI microservices running on Kubernetes (AWS EKS) with strong GitOps/CI/CD ownership (GitHub Actions + ArgoCD). Demonstrated measurable performance wins (p95 latency cut from >1s to <200ms) and production reliability work across Kafka/Redis streaming and cloud-to-on-prem migrations (RDS/S3 to Postgres/MinIO) using parallel validation and checksum-based consistency checks.”
Senior Machine Learning Engineer specializing in optimization, LLMs, and on-device AI
“Engineer with hands-on experience debugging and hardening a fixed-point implementation for an internal PoC, quickly diagnosing overflow/underflow issues that caused intermittent failures across thousands of runs and delivering a code fix. Comfortable presenting technical solutions with layered slide depth and doing follow-up deep dives for interested stakeholders, though has limited direct customer/sales partnership experience.”
Junior Full-Stack Software Engineer specializing in cloud-native microservices
“Backend engineer with hands-on IoT and AI product work: built a decoupled Raspberry Pi + AWS IoT Core weather monitoring backend and a Dockerized FastAPI LLM service on AWS ECS using OpenAI/HuggingFace with an emerging RAG layer. Also delivered measurable performance gains at DAZN by redesigning event-driven/serverless ingestion (SNS, S3->Lambda->DynamoDB), cutting latency ~30% and boosting throughput ~25% while automating ~90% of manual sync work.”
Mid-Level Software Engineer specializing in full-stack web and cloud systems
“Full-stack engineer with strong data engineering and privacy-domain experience, having owned an automated Data Subject Rights (DSR) processing pipeline end-to-end across Azure SQL and GCP (GCS/BigQuery). Emphasizes production reliability via idempotency, validation checkpoints, structured logging/monitoring, and safe CI/CD-driven deployments, and has also built React+TypeScript + Node/Postgres web apps with scalable, maintainable architecture.”
Mid-level DevOps Engineer specializing in cloud automation and Kubernetes platforms
“Robotics/ML engineer who has built SO(3)-equivariant models for robotic manipulation, including custom equivariant layers and differentiable point-cloud rasterization/derasterization workflows. Also brings 2 years of DevOps experience in banking systems, automating CI/CD and infrastructure at scale (managed 180 OCI servers; reduced rebuild downtime by 80%).”
Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision
“Built an LLM-powered learning assistant (EduQuizPro/EduCrest Pro) that uses RAG over URLs and PDFs to generate quizzes, notes, and explanations for students/professors. Emphasizes production robustness—implemented dependency fallbacks (FAISS/Sentence Transformers/Gradio), CLI-safe mode, and NumPy-based indexing—along with a custom orchestration layer to keep multi-step AI workflows reliable.”
“Built and productionized an AI-native, agentic appeals decisioning system for health insurance operations, automating 500k+ scanned appeals/year. Delivered measurable impact by cutting review time from 12–15 minutes to ~3 minutes and auto-resolving ~85% of cases with strong auditability, evaluations, and human-in-the-loop guardrails, deployed as containerized microservices on Azure AKS.”
Senior Python Full-Stack Developer specializing in cloud-native microservices and data platforms
“Backend/data engineer from Oliver Wyman who built and ran production Python (FastAPI) services on AWS (ECS/Lambda/API Gateway) supporting risk modeling and regulatory reporting. Strong in reliability/observability, Glue-based ETL with data quality controls, and legacy SAS-to-Python modernization with rigorous parity validation; also demonstrated measurable SQL performance wins and cost-control improvements in serverless scaling. Based in Raleigh, NC and can travel onsite for important Bethesda-area meetings.”
Senior Python Full-Stack Developer specializing in cloud, data engineering, and ML/GenAI
“Backend/data engineer with hands-on production experience building FastAPI services on AWS and implementing strong reliability/observability (CloudWatch, ELK, correlation IDs, alarms). Has delivered serverless + container solutions with IaC (CloudFormation/Terraform) and Jenkins CI/CD, and built AWS Glue/PySpark pipelines into S3/Redshift with schema-evolution and data-quality safeguards; demonstrated large-scale SQL tuning (45 min to 3 min on a 500M-row workload).”
Intern Software Engineer specializing in LLM agents and full-stack development
“Embedded C++ engineer with Bosch automotive infotainment experience, owning real-time audio middleware modules with strict latency/memory constraints. Strong in profiling/optimizing deterministic behavior, debugging hardware-specific intermittent issues, and building automated test + CI pipelines; currently ramping up on ROS2 concepts (DDS, nodes/topics/services) to transition toward robotics.”
Principal Cloud & Infrastructure Engineer specializing in reliability and regulated data platforms
“Founder/CTO-type startup leader who has built cloud-native data and AI platforms from scratch while owning both technical vision and product direction. Brings rare end-to-end startup experience spanning zero-to-one building, growth-stage execution, and fundraising from early stage through exit, with a strong ability to translate technical complexity into clear investor narratives.”