Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”
Senior Full-Stack Engineer specializing in AI-powered SaaS and cloud-native analytics
Principal Machine Learning Scientist specializing in GenAI, LLMs, and RAG
Senior Python Developer specializing in AI/ML and cloud-native microservices
Staff Data Scientist / AI-ML Engineer specializing in fraud detection, NLP, and recommendations
Senior Full-Stack Engineer specializing in cloud, real-time data, and web platforms
Senior Software Engineer specializing in cloud platforms, healthcare imaging, and scalable APIs
Mid-level Machine Learning Engineer specializing in generative AI, NLP, and MLOps
Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference
Mid-level Full-Stack Software Engineer specializing in distributed systems and cloud-native microservices
“Backend engineer (4 years) who built an end-to-end Python backend for a patent-pending in-car massager/heater system, including GraphQL data modeling and Bluetooth integration with an ESP32 microcontroller (reverse-engineered a niche protocol). Also has strong platform experience: on-prem Kubernetes and CI/CD (Jenkins/GitLab, exploring ArgoCD GitOps), Terraform-based infra workflows, a RabbitMQ messaging library used across microservices, and an on-prem migration of ~30 critical applications with a rollback/parallel-run strategy.”
Intern Robotics Engineer specializing in robot learning, SLAM, and control
“Robotics architect intern/new-grad focused on warehouse AMRs, building ROS2 sensor-fusion and SLAM stacks (FastSLAM-style particle filter) and validating in Gazebo with ground-truth metrics. Also interned at ASML debugging real-time in-vacuum robot behavior via Python state-machine telemetry scripts, identifying a firmware driver issue impacting throughput.”
Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services
“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”
Junior Machine Learning Engineer specializing in computer vision, reinforcement learning, and PINNs
“ML/Simulation engineer who productionized a Multi-Agent Reinforcement Learning system for 30+ firms at Belt and Road Big Data Company, integrating research code into an enterprise backend via Dockerized deployment and scalable data pipelines on GCP/Vertex AI. Demonstrated strong production debugging by tracing apparent network timeouts to hardware memory exhaustion caused by software state-history garbage collection issues, and built custom reward functions to model complex market dynamics (entry/exit, pricing).”
Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms
“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Mid-level Data Scientist specializing in anomaly detection and production ML
“Interned at Backblaze building production AI systems for incident response and security operations, including an internal LLM-powered incident triage assistant that used Snowflake + RAG over historical tickets/postmortems and delivered results via Slack and a web UI. Emphasizes reliability (PII filtering, grounding, schema validation, fallbacks) and rigorous evaluation/observability (offline replay, partial rollouts, time-to-first-action metrics, Prometheus/Grafana).”
Mid-level Backend Engineer specializing in REST APIs and AWS
“Backend engineer who built a new REST eligibility service at Barclays that unified siloed account logic (card/loan/deposit) and integrated with web/mobile, ultimately serving millions of users daily. Also built an end-to-end LLM-based pharmaceutical care-plan generation tool in a rapid Columbia startup competition, emphasizing configurable design, strict validation, persistence, and robust error handling.”
Senior Software Engineer specializing in AWS-based distributed systems and FinTech platforms
“Backend engineer with Amazon experience building large-scale, automated financial/accounting and pricing systems on AWS. Designed a fault-tolerant Step Functions + DynamoDB workflow platform handling 100K+ messages/sec to compute fair values and generate journal entries in under 3 seconds, and led safe API refactors using shadow mismatch testing. Also uncovered a major legacy pricing bug (tax vs non-tax swap) that cut mismatch rates from 5–10% to ~0.5% and materially improved price acceptance/business outcomes.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“ML/LLM engineer who built a production RAG system (GPT-4 + FAISS + FastAPI) to deliver fast, grounded answers from proprietary documents, optimizing for sub-200ms latency and high-concurrency scale. Strong MLOps/observability background: drift monitoring with Prometheus + Streamlit, automated retraining via Airflow, Kubernetes autoscaling, and MLflow-managed model lifecycle, plus inference cost reduction through quantization and structured pruning.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
Mid-level AI/ML Engineer specializing in recommender systems, fraud detection, and LLMs
Mid-level AI/ML Engineer specializing in NLP/LLMs and production ML systems
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multi-agent systems