Pre-screened and vetted.
Mid-level AI Developer & Machine Learning Engineer specializing in LLM and MLOps systems
“Built and deployed an enterprise RAG application at Centene to help clinical teams retrieve insights from large internal policy document sets, cutting manual research by 30–40%. Implemented custom domain-adapted embeddings (SageMaker + BERT transfer learning) and hybrid retrieval (BM25 + Pinecone) to drive a 22% relevance lift, and ran the system in production on AWS EKS with CI/CD, MLflow, and Prometheus monitoring (99% uptime, ~40% latency reduction).”
Mid-level AI/ML & Data Engineer specializing in MLOps and cloud data pipelines
“AI/ML engineer (Merkle) with hands-on experience deploying RAG-based LLM applications and real-time recommendation engines into production. Strong in cloud/on-prem architectures, GPU autoscaling, caching, and network optimization—delivered measurable latency reductions (40–70%) and improved retrieval relevance by systematically benchmarking chunking/embedding configurations and validating pipelines via CI/CD.”
Mid-level Data Scientist specializing in healthcare ML and GenAI
“Healthcare data/NLP practitioner with experience at UnitedHealthcare building production ML systems that connect unstructured call center transcripts and medical notes to structured claims data. Has delivered measurable impact (25% classification accuracy lift; ~30% relevance improvement) using classical NLP, embeddings (Sentence-BERT + FAISS), and AWS SageMaker deployments with robust validation and drift monitoring.”
Mid-Level Forward Deployed AI Engineer specializing in RAG systems and backend microservices
“LLM solutions practitioner with SOC/alert-triage experience who takes LLM prototypes to production using RAG (Pinecone), FastAPI services, guardrails, CI/CD, monitoring, and robust fallback logic. Known for rapid real-time debugging of embedding/vector and agent workflow issues, and for driving adoption through code-first workshops and sales-aligned custom demos with measurable improvements (35% faster triage; 40% increase in correct tool usage).”
Mid-level Machine Learning Engineer specializing in data security and GenAI systems
“Built Hexagon’s production Text-to-CAD Copilot that converts text and rough sketches into editable CAD code, combining GraphRAG (Neo4j/LangChain) with a Gemini-powered vision module and multi-agent geometric validation—cutting manual modeling from a day to ~45 seconds and driving retrieval latency below 50ms. Also has large-scale GCP data/ML orchestration experience (Airflow/Cloud Composer, Dataflow, Pub/Sub, Snowflake) processing 50M+ daily records with drift monitoring and automated reliability controls.”
Junior Full-Stack Software Engineer specializing in cloud web apps and authentication
“Full-stack engineer with Deloitte and CrowdDoing experience shipping production web platforms on AWS (EC2/RDS/S3/Fargate) using React/TypeScript and Node/Express/PostgreSQL. Built customer-facing authentication/SSO flows (OAuth2 + JWT) and state-specific US privacy consent workflows, and also delivered a Python/Flask LLM-based finance document parser chatbot with vector DB integration and latency optimizations.”
Mid-level AI/ML Engineer specializing in LLMs, RAG pipelines, and MLOps
“Data professional with ~4 years of experience, most recently at AIG (insurance), building ML/NLP systems for fraud detection and policy automation using transformers, CNNs, and clustering/anomaly detection. Also developed a RAG-based knowledge retrieval system, iterating across embedding models and moving to production based on precision and latency SLAs, then containerizing and deploying with SageMaker and CI/CD.”
Mid-level Full-Stack Java Engineer specializing in cloud-native, event-driven systems
“Backend engineer with airline operations domain experience who modernized flight-ops systems from batch updates to real-time streaming on AWS (Kafka + Spring Boot microservices), improving latency and stability through metric-driven tuning and idempotency. Also shipped a production LLM decision-support component using RAG over operational logs and internal procedures, with strong guardrails and an evaluation/regression loop to reduce hallucinations and enforce grounding.”
Mid-level Data Engineer specializing in multi-cloud real-time data pipelines
“Data engineer with healthcare/clinical trial domain experience who owned a 100TB+/month AWS pipeline end-to-end (Glue/S3/Redshift/Airflow) and drove measurable outcomes (20% lower latency, 99.9% reliability, 40% less manual reporting). Also built production data services and API-based ingestion on GCP (Cloud Run/Functions/BigQuery) with strong validation, versioning, and safe migration practices, and launched an early-stage RAG solution (LangChain + GPT-4) for researchers.”
Senior Full-Stack AI Engineer specializing in LLM/RAG agentic systems
“Built and deployed JobMatcher AI, an LLM-driven workflow automation product for job seekers that extracts requirements from job descriptions, matches to user skills, and generates tailored outreach. Demonstrated strong production engineering by cutting per-run cost ~70%, improving reliability with retries/backoff/fallbacks, and reducing hallucinations via schema validation and templating; also orchestrated the system with LangGraph plus Docker Compose across API, vector DB, and workers.”
Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps
“Built and deployed a domain-specific LLM chatbot for research/support, cutting manual effort by ~50%. Demonstrates strong applied LLM engineering: RAG, prompt grounding with citations and fallbacks, embedding/top-k tuning, and production monitoring (confidence, latency, feedback loops). Experienced orchestrating agent workflows with LangChain-style pipelines and continuous evaluation to maintain reliability.”
Mid-level AI/Backend Engineer specializing in RAG and data platforms
“Built and shipped a production LLM-powered financial Q&A interface that extracts precise numeric data from PDFs using a hybrid AWS Textract + LLM normalization pipeline, with confidence gating and guardrails to prevent unreliable answers. Experienced with LangChain-based RAG orchestration (chunking, memory, structured outputs) and collaborated closely with PMs/analysts on IRS Form 990 extraction requirements.”
Senior AI/ML & Robotics Research Engineer specializing in SLAM and multi-modal perception
“Robotics engineer who built a smart campus tour robot on a Kobuki Turtlebot using ROS 1, implementing a full navigation stack (semantic world model, A* planner, tour executor, path follower) and integrating SLAM (gmapping) plus a hybrid reactive safety controller. Experienced taking systems from Gazebo simulation to real hardware, including extensive real-world debugging and Docker-based development to handle ROS/Ubuntu version constraints; planning a move to ROS 2 on Turtlebot 4.”
Mid-level AI Engineer specializing in agentic LLM systems and RAG platforms
“Built and shipped Serrano AI, a multi-tenant SaaS conversational AI platform that automates Odoo ERP workflows and lets ops/finance/supply-chain teams query ERP data in natural language. Implemented a multi-agent architecture (LangChain/LangGraph/CrewAI) with hybrid RAG over ERP schemas, deployed on Heroku/Vercel with production observability, cutting reporting time by ~80% while addressing hallucinations, latency, and schema complexity.”
Mid-Level Data/ML Engineer specializing in Generative AI and cloud data platforms
“Built and productionized an LLM-based financial document analysis system using a RAG pipeline, including robust ingestion/chunking/embedding workflows, vector DB retrieval, and an AWS-deployed FastAPI service containerized with Docker. Demonstrates strong applied expertise in improving retrieval quality and latency at scale, plus hands-on experience debugging agentic/LLM workflows with monitoring and trace-based analysis while supporting demos and customer-facing adoption.”
Mid-level Software Engineer specializing in full-stack web and cloud automation
“Full-stack TypeScript/Angular/Node engineer who owned a production healthcare application for a pharmaceutical client, supporting 100K+ monthly users across 10+ countries. Strong focus on maintainability and quality (reusable localized component library, ~90% unit test coverage, SonarQube in CI/CD) plus performance work (reported 15% client-side latency reduction and up to 50% backend latency reduction) while migrating legacy mobile code with strict backward compatibility.”
Junior AI and Backend Engineer specializing in LLM systems
“AI/LLM engineer who has shipped production RAG copilots and multi-agent workflows, including a real-time Llama3 (Ollama) copilot backend handling 12k+ concurrent queries at 99.9% uptime. Deep on orchestration (Langflow/Airflow/Kubernetes), reliability evaluation (hallucination detection, p95 latency, token cost), and monitoring (Prometheus/Grafana), with demonstrated stakeholder-facing analytics delivery via Tableau.”
Mid-level Software Engineer specializing in Python backend and AI/GenAI
“Backend/infrastructure-focused engineer building AI-agent products for small businesses, including a customer-service agent platform with intent routing, RAG over Pinecone, and external booking API integration. Has shipped Python/FastAPI services with JWT auth, versioned APIs, Docker deployments to AWS EC2 via GitHub Actions, and production monitoring with Prometheus/Grafana.”
Principal AI Engineer specializing in agentic systems and cloud-native platforms
“Built a production RAG-powered analytics copilot at Aya Healthcare for operations leaders and analysts on a large healthcare staffing platform processing over a billion telemetry records annually. Stands out for strong production-minded agent engineering: deterministic orchestration, grounding-first design, deep observability, and data-driven workflow changes such as confidence-based human review for a PR review agent.”
Mid Software Engineer specializing in backend, full-stack, and AI systems
“Full-stack engineer with 3+ years of backend and frontend experience who has built production AI products for enterprise document and policy workflows. Stands out for owning end-to-end systems that combine React, FastAPI, RAG, vector search, and AWS deployment, with measurable impact including 65% less manual review time and significantly faster knowledge-query resolution.”
Entry-level Full-Stack Developer specializing in AI-powered web applications
“Solo builder of multiple AI products including CareerCopilot and ResearchMind AI, with hands-on experience across full-stack engineering, streaming LLM UX, grounded research workflows, and conversational AI. Also brings research depth as first author of an IEEE FG2026 paper on out-of-distribution deepfake detection, plus early-stage R&D experience building real-time robotics control software during an internship at ISRO's Space Applications Centre.”
Mid-level Software Engineer specializing in frontend and full-stack FinTech systems
“Frontend engineer with experience building data-intensive, near real-time React/TypeScript applications in credit reporting and logistics tracking. Stands out for performance-focused architecture across dashboards and map-based products, including large dataset optimization, profiling, and Mapbox visualizations with live asset tracking.”
Entry-Level Machine Learning & Cloud Engineer specializing in AI data pipelines
“Early-career cloud/appsec-focused engineer with hands-on experience building secure, observable microservice systems on AWS (IAM least privilege, KMS encryption, Secrets Manager, CloudWatch, ALB) and troubleshooting autoscaling-related 500s down to connection pooling issues. Also deployed heavy ML workloads on Kubernetes by decomposing diffusion/transformer services, using workload identity to eliminate static credentials, and maintaining GitOps-style deployment audit trails.”
Mid-level Data Scientist / ML Engineer specializing in Generative AI, RAG, and MLOps
“Built and productionized a RAG-based LLM research assistant for biomedical and regulatory document search using Mixtral 7B on SageMaker, LangChain, and Milvus, cutting research time by ~40%. Has hands-on multi-cloud MLOps experience across AWS/Azure/GCP with Kubeflow/Airflow/Composer plus Terraform + ArgoCD, and applies rigorous evaluation/monitoring (latency, accuracy, hallucinations). Also partnered with a non-technical PM to deliver an insurance policy Q&A chatbot that reduced customer response time by 30%+.”