Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in deep learning and generative AI
“AI/ML engineer who has deployed transformer-based NLP systems to production via Python REST APIs and Kubernetes on AWS/Azure, with a strong focus on latency optimization (p95), reliability, and scalable orchestration. Demonstrates pragmatic model tradeoff decision-making and strong stakeholder collaboration—improving adoption by making outputs more actionable with summaries, extracted fields, and confidence indicators.”
Senior Full-Stack Software Engineer specializing in API architecture and AI agentic RAG systems
“Hands-on backend/AI engineer who solo-built two production Claude-based agent systems: an internal Slack RAG over Confluence/Jira/code/regulatory docs and a HIPAA/GDPR-compliant patient chatbot with embedding guardrails and expert-in-the-loop evals. Also architected a multi-region patient portal + microservices platform with Terraform/CI-CD and federated gateways, delivering major onboarding automation and strong reliability wins (PgBouncer, chaos/perf testing).”
Junior Software Engineer specializing in full-stack systems and LLM automation
“Full-stack engineer who shipped a production "Financial Insight" assistant dashboard in Next.js App Router/TypeScript, integrating a RAG pipeline (embeddings + ChromaDB + LLM) via route handlers and owning post-launch performance (latency, token cost, retrieval relevance). Also built/optimized Postgres-backed workflows for an outbound dialer and callback routing engine handling ~10,000 daily contacts, validating query performance with EXPLAIN (ANALYZE, BUFFERS).”
“ML/NLP engineer with recent Scotiabank experience building production-grade indexing automation over large-scale emails and customer databases, combining LLM fine-tuning (Mistral, XLM-R) with fuzzy matching to exceed 95% accuracy under strict banking constraints. Also built a RAG-based chat agent using Gecko embeddings, Vertex AI Search, Gemini, and cross-encoder reranking, and delivered a text-to-SQL chatbot at SOTI through iterative fine-tuning and benchmark-driven experimentation.”
Senior Data Scientist/ML Engineer specializing in scalable ML and LLM systems
“Built and deployed an end-to-end product that brings a research-paper approach into production for large-scale time-series clustering, with attention to partitioning, latency, and scalability. Also designed a Python-based backend validation service (comparing outputs to database ground truths) and handled production reliability issues by reproducing dataset-specific crashes and hardening corner-case behavior with client-friendly errors.”
Mid-level Full-Stack Software Engineer specializing in FinTech and cloud platforms
“Software engineer who built and launched an end-to-end Ad Scheduler that automated campaign creation across Google Ads and Meta using Azure Functions/Service Bus, PostgreSQL, and a React frontend—reducing manual marketing ops work. Also shipped a production internal RAG chatbot leveraging a data warehouse + Cube semantic layer, Gemini embeddings, vector search, and Claude, with Langfuse tracing and brand-based access controls; work was cut short due to layoffs.”
Junior Software Engineer specializing in backend systems and LLM/RAG applications
“Full-stack engineer who built a cloud storage app feature (file upload/management) with Next.js App Router + TypeScript and owned post-launch improvements. Also has internship experience building a geospatial AI chatbot: designed Postgres/PostGIS data models and optimized spatial queries, and implemented an LLM workflow orchestrated with LangChain/LangGraph plus a RAG pipeline grounded in OpenStreetMap data to reduce hallucinations.”
Mid-level Data Scientist specializing in MLOps and Generative AI
“Robotics software/ML engineer who built perception and navigation-related ML systems for autonomous supermarket carts, including object detection, shelf recognition, and obstacle avoidance. Strong ROS/ROS2 practitioner who optimized real-time performance (reported 50% latency reduction) and deployed containerized ROS/ML pipelines at scale using Docker, Kubernetes, and CI/CD.”
Junior AI/ML Engineer specializing in RAG systems and cloud-native MLOps
“Built and shipped a production LLM-powered RAG system at Upstart enabling natural-language search across 50k+ scattered internal technical docs. Delivered sub-300ms p95 latency for ~50 active users with strong hallucination safeguards (retrieval-first, thresholds, citations) plus robust testing/monitoring and cost controls (prompt caching cutting API spend ~20%).”
Senior AI/ML & Full-Stack Engineer specializing in GenAI, RAG, and MLOps platforms
“Backend/data platform engineer who owned end-to-end production services for a fleet analytics/GenAI platform, spanning FastAPI microservices on Kubernetes and AWS (EKS + Lambda) event-driven workloads. Strong in reliability/observability (OpenTelemetry, circuit breakers, idempotency), data pipelines (Glue/Airflow/Snowflake), and measurable performance/cost wins (SQL query latency cut from 10s to <800ms P95; ~30% compute cost reduction).”
Entry-Level AI/ML Engineer specializing in LLM apps, RAG pipelines, and production ML systems
“AI/LLM practitioner at iFrog Marketing Solutions who drove a RAG chatbot from prototype to production in a legacy, AI-resistant environment by validating customer needs and building a business case. Implemented production-grade LLM practices (CI/CD eval gating, rollbacks, prompt/context engineering) and led internal workshops to bring non-AI-native developers up to speed while partnering with sales on tailored demos to drive adoption.”
Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications
“GenAI engineer who has deployed production LLM/RAG chatbots for internal document search, focusing on reliability (hallucination reduction via prompt guardrails + retrieval filtering) and performance (latency improvements via caching). Experienced with LangChain/LangGraph orchestration for multi-step agent workflows; iterates using monitoring/logs and benchmark-driven evaluation while partnering closely with product and business teams.”
Mid-level Machine Learning & Full-Stack Engineer specializing in GenAI platforms
“LLM/agent builder who has shipped production AI systems in the wellness space, including an LLM-powered food tracking product used by 5,000+ users and a voice/call-routing onboarding workflow using LangGraph/LangChain with LiveKit and Twilio. Strong focus on practical reliability work: latency reduction, retrieval/embedding tuning, and CI-driven evaluation with simulations and metrics.”
Mid-level Full-Stack Developer specializing in FinTech platforms and cloud-native microservices
“Backend engineer focused on AI-enabled systems, having built a production-style RAG pipeline (vector search + LLM) exposed via Python/Flask endpoints with strong observability and hallucination-reduction techniques. Demonstrates deep performance work in PostgreSQL/SQLAlchemy (5x faster analytics queries) and high-throughput optimization using Celery + Redis (800ms to 120ms latency, 3x throughput), plus schema-per-tenant multi-tenancy with tenant-aware middleware and logging.”
Senior Full-Stack Java Developer specializing in microservices, cloud, and modern web UIs
“Robotics software engineer who built the software layer for an autonomous warehouse sorting system, spanning navigation/path planning, task scheduling, and backend services. Deep hands-on ROS 2 Foxy experience (Nav2/costmaps) and real-time multi-robot debugging, using simulation-driven analysis plus incremental/partial re-planning to handle dynamic obstacles in production-like warehouse environments.”
Mid-level AI/ML Engineer specializing in agentic AI and full-stack (MERN) applications
“Built and deployed a production real-time voice AI support agent that answers inbound calls, identifies callers, troubleshoots via a knowledge base, and automatically creates/updates tickets with escalation to humans when needed. Demonstrates strong reliability/latency engineering (streaming, schema validation, idempotency, DB constraints) and uses LangGraph state machines plus OpenAI Agents SDK for multi-agent routing, with KPI-driven testing and monitoring.”
“JavaScript/React performance-focused engineer who contributed upstream to an open-source virtualization/pagination library, fixing overlapping-fetch race conditions and introducing prefetch/deduping patterns that cut load times from ~3s to <900ms and reduced render thrash ~35%. Also built healthcare automation systems (clinical summary and claims triage), including a FastAPI + RAG pipeline that retrieved CPT/ICD evidence, improving decision accuracy from 67% to 86% and reducing turnaround time by 40%.”
Mid-level AI/ML Engineer specializing in FinTech risk, fraud detection, and GenAI/RAG systems
“Built and productionized Azure-based LLM/RAG systems for regulatory/compliance use cases, including automating analyst research and compliance report generation across large unstructured document sets. Demonstrates strong practical depth in hallucination mitigation, hybrid retrieval tuning (BM25 + embeddings), and production MLOps (Databricks, Cognitive Search, AKS, Airflow/MLflow), plus proven ability to deliver auditable, explainable solutions with non-technical compliance teams.”
Mid-level Machine Learning Engineer specializing in LLMs, agentic AI, and risk/fraud modeling
“Built and productionized an agentic LLM workflow during a summer internship to transform unstructured clinical reports into analytics-ready structured data, using a LangChain multi-agent design plus an LLM-as-a-judge layer to control quality in a regulated setting. Also has experience orchestrating ML pipelines at Piramal Capital using AWS Step Functions/EventBridge/CloudWatch, with strong emphasis on observability, evaluation rigor, and measurable impact (80–90% reduction in manual data entry).”
Mid-level AI/ML Engineer specializing in NLP and conversational AI
“ML/NLP engineer focused on real-time IT ops analytics, building a predictive maintenance/anomaly detection platform end-to-end (multi-source ETL, streaming, modeling, and production deployment on GCP/Vertex AI). Uses deep learning (LSTMs, autoencoders/VAEs) plus embeddings (SentenceBERT) and vector search to improve incident correlation and search, citing ~40% reduction in duplicate alert noise.”
Junior Software Engineer specializing in cloud-native microservices and ML/LLM pipelines
“Backend-leaning full-stack engineer who ships AI-enabled products end-to-end: built CodeChat, a production internal codebase Q&A tool using RAG with Pinecone and a model-agnostic wrapper across OpenAI/Anthropic/AWS Bedrock, cutting AWS costs ~50% and latency ~45%. Also built and operated RealityStream, a Flask-based real-time forecasting API with JWT/RBAC, MLflow model versioning, and Prometheus/Grafana observability, including handling a real production latency incident via rollback, preloading, and caching.”
Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG
“Built production LLM + hybrid RAG and multi-agent orchestration systems at Wells Fargo to automate complaint document/audio transcript understanding and categorization, addressing vocabulary drift via embedding + vector index updates instead of frequent retraining. Strong in LLM workflow reliability (testing/benchmarks/observability) and stakeholder-facing delivery with explainability (citations/SHAP-style justifications) and Tableau dashboards.”
Mid-Level Full-Stack Software Engineer specializing in distributed systems and cloud integrations
“Backend engineer with enterprise SaaS experience (Zoho) who owned an end-to-end cloud integration between Endpoint Central and ServiceDesk Plus, redesigning device onboarding across 64+ scenarios and building a fault-tolerant sync engine that recovered 100% of failed transactions. Also built and operated production systems across the stack: FastAPI services with strong testing/observability, React+TypeScript portals, PostgreSQL performance tuning, and AWS deployments with real incident response (RDS CPU saturation resolved with zero downtime).”
Mid-level AI Engineer specializing in LLMs, RAG, and agentic platforms
“Built and shipped a production RAG-based assistant that lets parents ask natural-language questions about their child’s learning progress, using pgvector retrieval (child-id filtered) and Redis caching to hit ~180ms latency. Implemented real-world guardrails and compliance (Llama Guard, COPPA, retrieval thresholds, fallbacks) with 99.5% uptime, and ran human-in-the-loop eval loops that improved satisfaction from 3.8 to 4.2 while serving 60k+ monthly users and reducing costs significantly.”