Pre-screened and vetted.
Senior AI Engineer specializing in LLM systems and scalable backend platforms
Senior Full-Stack Software Engineer specializing in Telehealth and FinTech
Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure
Senior AI/ML Engineer specializing in GenAI, agentic systems, and healthcare AI
Mid-Level Software Development Engineer specializing in AWS serverless and ML/GenAI
Senior Software Engineer specializing in AI backend platforms and FinTech systems
Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms
“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”
Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search
“Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.”
Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems
“ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.”
Senior Full-Stack Python Developer specializing in cloud-native RAG and microservices
Mid-level Machine Learning Engineer specializing in LLMs and RAG systems
Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems
Staff Software Engineer specializing in FinTech, AI/ML, and cloud microservices
Senior Customer Success & Technical Account Leader specializing in AI/ML infrastructure
Senior Full-Stack Engineer specializing in telehealth and commerce platforms
Director of Machine Learning specializing in GenAI platforms and enterprise AI/ML
Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps
Director of AI/ML Engineering specializing in MLOps, data platforms, and 3D computer vision
“Backend/data engineer focused on production ML/LLM systems: built a real-time FastAPI inference API on Kubernetes with strong reliability patterns (timeouts, idempotent retries, centralized error handling). Delivered AWS platforms using EKS + Lambda with GitHub Actions/Helm CI/CD and built Glue-based ETL from S3/Kafka into Snowflake with schema evolution and data-quality controls; also modernized legacy analytics/recommendation workflows into Python services with safe, feature-flagged cutovers.”
Senior Software Engineer specializing in developer tools, cloud automation, and generative AI
“Built and deployed a production chatbot on osvaldocalles.com and iterated through real-world LLM engineering issues: model quota/cost tradeoffs (migrating to Nova Pro), RAG accuracy via semantic chunking, AWS IAM/guardrail/security pitfalls, and Lambda/API Gateway streaming constraints (prefers JS for streaming layer). Experienced with agent orchestration using Strands SDK (AWS-focused) and LangGraph (Vercel/container deployments), plus evaluation pipelines using LLM-as-evaluator, dashboards, and staged model rollouts.”
Senior AI/ML Engineer specializing in LLMs, RAG, and cloud-native MLOps
“Built and owned a real-time clinical AI assistant at Andor Health, taking it from prototype through deployment, monitoring, and iterative improvement. Brings strong healthcare-focused GenAI experience across RAG, hybrid retrieval, LoRA fine-tuning, and production Python services, with measurable gains in accuracy, speed, and reliability.”
Mid-level Data Scientist specializing in recommender systems, NLP, and real-time ML pipelines
“AI/LLM engineer who built and productionized an internal RAG-based knowledge system that ingests diverse sources (PDFs, Markdown, Slack), scaled retrieval with distributed FAISS and parallel ingestion, and reduced hallucinations via re-ranking, grounding prompts, and post-generation validation. Also has hands-on orchestration experience with Airflow and Kubernetes for reliable ETL/model pipelines, monitoring, and staged rollouts; reports ~15% accuracy improvement and adoption as the primary internal knowledge tool.”