Pre-screened and vetted.
Staff Software Engineer specializing in cloud-native healthcare and payments platforms
Senior Full-Stack AI Engineer specializing in LLM/RAG and production ML platforms
Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and scalable inference
“ML/LLM engineer who built and shipped an LLM-powered internal knowledge assistant at Meta, focusing on production-grade RAG to reduce hallucinations and improve trust. Deep experience with scaling and serving (FSDP/DeepSpeed/LoRA, Triton, Kubernetes autoscaling) and reliability practices (Airflow retraining, MLflow versioning, monitoring with rollback), including sub-100ms latency and ~35% GPU memory reduction.”
Senior AI/ML Engineer specializing in LLMs, multimodal AI, and scalable MLOps
“ML/NLP engineer with experience at NVIDIA and Cruise building production-grade AI systems across genomics/biomedical research and autonomous vehicle data. Has delivered multimodal LLM pipelines, large-scale entity resolution, and hybrid semantic search (BERT embeddings + FAISS + Elasticsearch), with measurable impact (≈40% accuracy/retrieval gains; ≈30% data consistency improvement) and strong MLOps practices (Kubernetes, CI/CD, MLflow, Prometheus/Grafana).”
Senior Machine Learning Engineer specializing in AI/ML, NLP, and computer vision
“McKinsey & Company ML/NLP practitioner who builds production-grade AI systems across sectors (notably healthcare and finance), including RAG/LLM solutions, entity resolution pipelines, and embedding-powered search with vector databases. Demonstrated measurable impact (40% reduction in data duplication) and strong MLOps/data workflow practices (Airflow, MLflow, Spark, AWS/GCP, Prometheus, CI/CD).”
Engineering Manager / Tech Lead specializing in large-scale distributed systems
“Software engineer focused on personalization and data/ML infrastructure who built a GenAI/LLM-driven carousel ranking system end-to-end, delivering a reported 6–7% order-rate lift. Also designed large-scale personalization ETL (15PB for ~100M users) and created a custom Airflow operator to integrate with Databricks under enterprise version constraints, with hands-on on-call and data-quality reliability improvements.”
Mid-level Full-Stack Developer specializing in Java/Spring Boot and React
“NVIDIA engineer who built and shipped a production LLM-powered enterprise knowledge system (summarization, transcription, and Q&A) that cut document retrieval time ~30%. Deep hands-on experience with RAG (FAISS/Pinecone), GPU-accelerated microservices on AWS, and reliability/safety practices (Guardrails AI, prompt A/B testing, canary releases) plus strong MLOps orchestration across Airflow, Step Functions, and Kubernetes GitOps.”
Senior AI/ML Engineer specializing in LLMs, RAG, and multimodal systems
Senior Machine Learning Engineer specializing in LLMs and scalable MLOps
Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps
Senior Data Scientist / ML Engineer specializing in LLMs, generative AI, and MLOps
Mid-level Data Scientist / GenAI & ML Engineer specializing in LLM apps and MLOps
Senior Python Backend Engineer specializing in cloud-native SaaS and microservices
Senior Full-Stack Python Engineer specializing in cloud microservices and AI/LLM systems
Senior Full-Stack Engineer specializing in AI/ML product engineering
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and scalable inference
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable MLOps
Mid-level Full-Stack Software Engineer specializing in FinTech analytics and security