Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in GPU inference and LLM platforms
“Built and deployed an LLM-powered platform that turns models into scalable REST/gRPC APIs, focusing on keeping GPU-backed inference fast and stable during traffic spikes. Experienced with AWS orchestration (EKS/ECS/Step Functions), safe model rollouts, and production-grade monitoring/testing for reliable AI agents and workflows.”
Intern AI/ML Engineer specializing in NLP, LLMs, and semantic search
“Built and deployed a production RAG-based semantic search and summarization system for large legal/technical document sets, owning the full backend (embeddings, vector store, chunking, prompting) and driving a reported 40–60% reduction in manual review time. Experienced with LangChain/LlamaIndex plus Airflow/Temporal-style orchestration, and applies rigorous evaluation/monitoring (A/B tests, drift detection, staged rollouts) to keep agentic systems reliable. Also partnered with a supply-chain manager at TE Connectivity to deliver an AI inventory recommendation tool projected to drive millions in value.”
Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems
“ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.”
Senior Full-Stack Python Developer specializing in cloud-native RAG and microservices
Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems
Staff Software Engineer specializing in FinTech, AI/ML, and cloud microservices
Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps
Director of AI/ML Engineering specializing in MLOps, data platforms, and 3D computer vision
“Backend/data engineer focused on production ML/LLM systems: built a real-time FastAPI inference API on Kubernetes with strong reliability patterns (timeouts, idempotent retries, centralized error handling). Delivered AWS platforms using EKS + Lambda with GitHub Actions/Helm CI/CD and built Glue-based ETL from S3/Kafka into Snowflake with schema evolution and data-quality controls; also modernized legacy analytics/recommendation workflows into Python services with safe, feature-flagged cutovers.”
Senior Software Engineer specializing in developer tools, cloud automation, and generative AI
“Built and deployed a production chatbot on osvaldocalles.com and iterated through real-world LLM engineering issues: model quota/cost tradeoffs (migrating to Nova Pro), RAG accuracy via semantic chunking, AWS IAM/guardrail/security pitfalls, and Lambda/API Gateway streaming constraints (prefers JS for streaming layer). Experienced with agent orchestration using Strands SDK (AWS-focused) and LangGraph (Vercel/container deployments), plus evaluation pipelines using LLM-as-evaluator, dashboards, and staged model rollouts.”
Mid-level Data Scientist specializing in recommender systems, NLP, and real-time ML pipelines
“AI/LLM engineer who built and productionized an internal RAG-based knowledge system that ingests diverse sources (PDFs, Markdown, Slack), scaled retrieval with distributed FAISS and parallel ingestion, and reduced hallucinations via re-ranking, grounding prompts, and post-generation validation. Also has hands-on orchestration experience with Airflow and Kubernetes for reliable ETL/model pipelines, monitoring, and staged rollouts; reports ~15% accuracy improvement and adoption as the primary internal knowledge tool.”
Mid-level Machine Learning & Generative AI Engineer specializing in NLP, CV, and RAG systems
“Built and deployed a production LLM-powered RAG document intelligence system used by non-technical enterprise stakeholders, cutting document search time by 40%+ while improving answer consistency. Demonstrates strong MLOps/data workflow orchestration (Airflow, AWS Step Functions, managed schedulers across GCP/Azure) and a metrics-driven approach to reliability, evaluation, and cost/latency optimization with guardrails and observability.”
Senior AI & Data Engineer specializing in LLM agents, RAG, and data platforms
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
Executive Technology Leader (CTO/Principal Engineer) specializing in cloud-native platforms and AI
Executive AI Platform & Innovation Leader specializing in Banking, GenAI, and AI Governance
Senior Full-Stack Engineer specializing in cloud-native microservices and AI/LLM integrations
Staff Machine Learning Engineer specializing in LLMs, recommendations, and MLOps
Intern Software Engineer specializing in AI/ML and LLM applications
Mid-level AI/ML Engineer specializing in LLM RAG pipelines and cloud MLOps
Senior Staff Full-Stack Engineer specializing in AI copilots and cloud platforms
Mid-level Agentic AI & ML Engineer specializing in LLM agents and RAG systems