Pre-screened and vetted.
Junior AI Platform Engineer specializing in LLM platforms, RAG, and Kubernetes
Mid-level Full-Stack Software Engineer specializing in cloud-native apps and GenAI
Senior Backend Software Engineer specializing in Python, cloud microservices, and FinTech
Senior Full-Stack & AI Engineer specializing in cloud microservices and RAG systems
Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time AI systems
Mid-Level Full-Stack Developer specializing in cloud-native microservices and GenAI
Mid-level Data Scientist / AI/ML Engineer specializing in secure cloud ML and GenAI
Junior Full-Stack/DevOps Engineer specializing in AWS, Kubernetes, and AI model evaluation
Mid-level Data Scientist / ML Engineer specializing in NLP, GenAI, and cloud ML deployment
Senior AI Python Engineer specializing in Generative AI and MLOps
Junior Data Scientist specializing in ML, LLMs, and RAG applications
“University hackathon finalist (2nd place) who built CareerSpark, a production-style multi-agent career guidance app in 24 hours using a hierarchical debate architecture with a moderator/judge agent. Has startup internship experience at LiveSpheres AI using LangChain for multi-LLM orchestration, and demonstrates a structured approach to testing/evaluation (golden sets, integration sims, latency/accuracy KPIs) plus strong non-technical stakeholder communication.”
Mid-level Data Scientist specializing in Generative AI and multimodal systems
“Recent J&J intern who built a conversational RAG agent and led a shift from a monolithic model to a modular RAG workflow, cutting response time from several days to under a second by tackling data fragmentation, context retention, and embedding/latency optimization. Also worked on a large (7B-parameter) multimodal VQA pipeline for healthcare research and stays current via NeurIPS/ICLR and open-source contributions.”
Mid-level Data Scientist specializing in MLOps and Generative AI
“Robotics software/ML engineer who built perception and navigation-related ML systems for autonomous supermarket carts, including object detection, shelf recognition, and obstacle avoidance. Strong ROS/ROS2 practitioner who optimized real-time performance (reported 50% latency reduction) and deployed containerized ROS/ML pipelines at scale using Docker, Kubernetes, and CI/CD.”
Senior AI/ML & Full-Stack Engineer specializing in GenAI, RAG, and MLOps platforms
“Backend/data platform engineer who owned end-to-end production services for a fleet analytics/GenAI platform, spanning FastAPI microservices on Kubernetes and AWS (EKS + Lambda) event-driven workloads. Strong in reliability/observability (OpenTelemetry, circuit breakers, idempotency), data pipelines (Glue/Airflow/Snowflake), and measurable performance/cost wins (SQL 10s to <800ms P95; ~30% compute cost reduction).”
Mid-level AI/ML Engineer specializing in agentic AI and full-stack (MERN) applications
“Built and deployed a production real-time voice AI support agent that answers inbound calls, identifies callers, troubleshoots via a knowledge base, and automatically creates/updates tickets with escalation to humans when needed. Demonstrates strong reliability/latency engineering (streaming, schema validation, idempotency, DB constraints) and uses LangGraph state machines plus OpenAI Agents SDK for multi-agent routing, with KPI-driven testing and monitoring.”
Junior Software Engineer specializing in cloud-native microservices and ML/LLM pipelines
“Backend-leaning full-stack engineer who ships AI-enabled products end-to-end: built CodeChat, a production internal codebase Q&A tool using RAG with Pinecone and a model-agnostic wrapper across OpenAI/Anthropic/AWS Bedrock, cutting AWS costs ~50% and latency ~45%. Also built and operated RealityStream, a Flask-based real-time forecasting API with JWT/RBAC, MLflow model versioning, and Prometheus/Grafana observability, including handling a real production latency incident via rollback, preloading, and caching.”
Mid-level AI Engineer specializing in LLMs, RAG, and healthcare AI
“Built and scaled an AI-powered voice/chat patient engagement platform at Penn Medicine from early prototype into production clinical workflows, focusing on latency, edge cases, and user trust. Strong in LLM reliability engineering (structured prompts, validation/fallbacks), real-time troubleshooting with observability, and cross-functional enablement through pilots, demos, and sales/customer partnership.”
Mid-level Full-Stack Engineer specializing in cloud-native microservices and AI automation
“Software engineer/product owner who has led end-to-end delivery of AI and content-management platforms, including building RAG-based reliability improvements and migrating fragile systems to containerized AWS ECS/Kubernetes with Terraform-managed CI/CD. Experienced designing event-driven microservices (SQS/SNS/RabbitMQ), scaling queue consumers with autoscaling, and creating internal Python tooling to standardize data connectors (e.g., BigQuery/Airtable/internal APIs) to speed iteration.”
Mid-level AI Engineer specializing in GenAI, LLM integration, and RAG pipelines
“Built and led deployment of an autonomous, self-correcting multi-agent knowledge retrieval and validation system at HCA Healthcare to reduce heavy manual research/validation in clinical/compliance documentation. Deeply focused on production reliability and cost—used LangGraph StateGraph orchestration plus ONNX/CUDA/quantization to cut GPU costs by 25%, and partnered with the Compliance VP using real-time contradiction-rate dashboards to hit a 40% automation goal without compromising compliance.”
Mid-level Full-Stack Developer specializing in Java/Spring Boot, Angular, and AWS
“Full-stack engineer with recent Mutual of Omaha experience building a cloud-native microservices application in Java/Spring Boot with a React/Angular frontend, integrating multiple AWS services (Lambda, S3, DynamoDB, SQS). Has hands-on experience operationalizing AI features via OpenAI/AWS Bedrock and improving reliability/performance through caching, async processing, and CI/CD pipeline optimization.”