Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLMs, search ranking, and multimodal ML
Staff Full-Stack Engineer specializing in cloud microservices and AI-enabled platforms
Senior Agentic AI & Backend Engineer specializing in LLM platforms and multi-agent systems
Senior AI/ML Software Engineer specializing in LLMs, NLP, and scalable ML platforms
Intern AI/ML Engineer specializing in LLM agents, RAG, and low-latency systems
Senior AI/ML Engineer specializing in LLMs, RAG, and multimodal recommendation systems
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”
Senior Full-Stack Software Engineer specializing in Telehealth and FinTech
Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure
Senior AI/ML Engineer specializing in GenAI, agentic systems, and healthcare AI
Mid-Level Software Development Engineer specializing in AWS serverless and ML/GenAI
Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms
“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”
Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search
“Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.”
Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems
“ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.”
Senior Full-Stack Python Developer specializing in cloud-native RAG and microservices
Mid-level Machine Learning Engineer specializing in LLMs and RAG systems
Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems
Staff Software Engineer specializing in FinTech, AI/ML, and cloud microservices
Senior Customer Success & Technical Account Leader specializing in AI/ML infrastructure
Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps
Director of AI/ML Engineering specializing in MLOps, data platforms, and 3D computer vision
“Backend/data engineer focused on production ML/LLM systems: built a real-time FastAPI inference API on Kubernetes with strong reliability patterns (timeouts, idempotent retries, centralized error handling). Delivered AWS platforms using EKS + Lambda with GitHub Actions/Helm CI/CD and built Glue-based ETL from S3/Kafka into Snowflake with schema evolution and data-quality controls; also modernized legacy analytics/recommendation workflows into Python services with safe, feature-flagged cutovers.”
Senior Software Engineer specializing in developer tools, cloud automation, and generative AI
“Built and deployed a production chatbot on osvaldocalles.com and iterated through real-world LLM engineering issues: model quota/cost tradeoffs (migrating to Nova Pro), RAG accuracy via semantic chunking, AWS IAM/guardrail/security pitfalls, and Lambda/API Gateway streaming constraints (prefers JS for streaming layer). Experienced with agent orchestration using Strands SDK (AWS-focused) and LangGraph (Vercel/container deployments), plus evaluation pipelines using LLM-as-evaluator, dashboards, and staged model rollouts.”