Pre-screened and vetted.
Staff-level Software Engineer specializing in LLM inference infrastructure and scalable model serving
Executive AI/ML Engineer specializing in LLMs, NLP, and production ML systems
Mid-level Machine Learning Engineer specializing in NLP, MLOps, and Generative AI
“Built and deployed a production LLM conversational AI system at OpenAI supporting chat, summarization, and semantic search at 1M+ requests/day, improving latency by 40% and accuracy by 25% through Pinecone optimization and tighter RAG with re-ranking. Also has Amazon experience improving recommendation systems by translating ML metrics into business terms to boost CTR and conversions, with strong MLOps/orchestration depth (Airflow, MLflow, SageMaker, Kubeflow).”
Staff AI Full-Stack Engineer specializing in LLMs, multi-agent systems, and Voice AI
Mid-level Full-Stack Developer specializing in Java/Spring Boot and React
“NVIDIA engineer who built and shipped a production LLM-powered enterprise knowledge system (summarization, transcription, and Q&A) that cut document retrieval time ~30%. Deep hands-on experience with RAG (FAISS/Pinecone), GPU-accelerated microservices on AWS, and reliability/safety practices (Guardrails AI, prompt A/B testing, canary releases) plus strong MLOps orchestration across Airflow, Step Functions, and Kubernetes GitOps.”
Executive AI/IoT Engineering Leader specializing in full-stack and edge AI systems
Senior AI/ML Engineer specializing in LLMs, RAG, and multimodal systems
Senior Machine Learning Engineer specializing in LLMs and scalable MLOps
Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and scalable inference
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable MLOps
Senior Software Engineer specializing in cloud architecture and machine learning
Mid-level AI/ML Engineer specializing in generative AI, LLMs, and MLOps
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
Mid-level Machine Learning Engineer specializing in LLMs, generative AI, and MLOps
“Built and shipped a production LLM-powered medical scribe that generates structured clinical visit summaries using RAG, strict JSON schemas, and post-generation validation to reduce hallucinations. Experienced in making LLM workflows deterministic and observable (structured logging/metrics/tracing) and in evaluation-driven iteration with metrics like schema pass rate and edit rate; collaborated closely with clinicians and policy stakeholders at Scale AI to drive adoption.”