Pre-screened and vetted.
Mid-level Data Scientist / ML Engineer specializing in LLMs and predictive analytics
Mid-Level Software Engineer specializing in distributed systems and GenAI
“Capgemini engineer with 4+ years building and deploying high-availability, low-latency fraud detection APIs and multi-cluster distributed systems for a Fortune 20 bank, including zero-downtime production rollouts and multi-layer (SQL/network/hardware) performance debugging. Also built a Python + OpenAI/LangChain LLM-powered grading workflow for Austin School for Women, cutting feedback time from 90 minutes to 5 minutes per submission for 200+ learners.”
Mid-level AI Engineer specializing in GenAI agents and RAG for IT operations
“Built and operates a production LLM agent for enterprise IT operations that triages and drafts resolutions for high-volume ServiceNow tickets using LangChain + RAG (Pinecone/pgvector) and AWS Bedrock/OpenAI. Emphasizes reliability with schema-validated stages, offline eval datasets from real tickets, and CloudWatch-driven monitoring/guardrails; system scales to 40K+ tickets/month and cut resolution time ~28%.”
Mid-level AI/ML Engineer specializing in conversational AI, NLP, and LLM-powered RAG systems
Senior Data Engineer specializing in multi-cloud data platforms and generative AI
Mid-level AI/ML Software Engineer specializing in Generative AI and NLP
Junior Robotics & Machine Learning Engineer specializing in perception, SLAM, and edge AI
“Built and deployed an Azure-based, fine-tuned CLIP visual retrieval system at Staples for a ~300k-item product catalog, improving edge-case recall by 12% by engineering a custom delta-similarity/dynamic-margin loss. Also has robotics experience using ROS2 for sensor/compute orchestration, including GPS-time-synchronized sensor triggering for robot swarms and latency-bounded optical-flow benchmarking for edge deployment.”
Mid-level Software & AI Engineer specializing in Robotics, LLMs, and Reinforcement Learning
“Robotics/AI Master's thesis researcher building an LLM-driven workflow to generate and evaluate robot policies before running them in an environment. Also built a local LLM-based real-time target-tracking robot using a pan-tilt camera with LangChain + Ollama, and has hands-on ROS 2/Gazebo experience including URDF-based simulation and a TurtleBot multi-agent chase project.”
Mid-level AI/ML Engineer specializing in LLM systems and cloud MLOps
“Built a production LLM-powered fraud detection platform at Wells Fargo, combining OpenAI/Hugging Face models with RAG-based explanations to make flagged transactions interpretable for risk and compliance teams. Delivered low-latency, real-time inference at high scale on AWS (SageMaker + EKS), with strong observability and security controls, reducing manual reviews and false positives in a regulated environment.”
Mid-level Software Engineer specializing in Agentic AI and RAG systems
“Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.”
Junior Software Engineer specializing in AI, LLM systems, and full-stack development
“Product-focused full-stack engineer at startup (Zippy) who shipped a production multi-agent AI system for restaurant operations plus payments workflows. Built end-to-end: RAG grounded on a Notion knowledge base, structured function-calling task routing, FastAPI/JWT multi-tenant backend, and a polished React+TypeScript owner dashboard. Has real production incident experience (duplicate Stripe webhooks) and reports ~94% task-routing accuracy under load.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Mid-level Machine Learning Engineer specializing in computer vision and generative AI
“Built and deployed an LLM/RAG system that uses differential privacy and distributional similarity checks to transform private data into a non-sensitive knowledge base while preserving utility. Also has experience demonstrating adversarial ML concepts (FGSM) to non-technical audiences by focusing on observable model behavior rather than implementation details.”
Intern Software Engineer specializing in AI/LLMs and full-stack development
“AI/ML infrastructure-focused engineer who has built production RAG systems from scratch (Supabase/pgvector + OpenAI embeddings) and iterated using formal eval metrics to improve retrieval quality. Also debugged real-time audio issues in a LiveKit-based pipeline by correlating packet loss with VAD behavior, and has deep experience building brittle, customer-specific financial platform integrations in Python/Playwright (2FA, redirects, token refresh, rate limits).”
Intern Data Scientist specializing in ML, NLP, and MLOps for healthcare and enterprise AI
“Built a production multi-cloud LLM-driven IT ticket automation system using LangGraph, Azure + Pinecone RAG, and an Ollama-hosted LLM on AWS, with Terraform-managed infra and PostgreSQL audit/state tracking for reliability. Also partnered with UW School of Medicine & Public Health students to deliver a glioma survival risk-ranking model, translating clinical feedback into practical pipeline improvements (imputation, site harmonization) and stakeholder-friendly visualizations.”
Mid-level AI/ML Engineer specializing in GenAI and predictive modeling
“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”
Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems
“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”
Mid-level AI Solutions Engineer specializing in enterprise GenAI and automation
“Built and shipped multiple production LLM/agentic systems, including an agentic RAG NL-to-SQL analytics app that cut manual reporting from 9 hours/week to 15 minutes by grounding on schema-aware retrieval and robust fallback/monitoring. Also implemented a LangChain supervisor-orchestrated enterprise IT automation agent that routes requests for search, identity validation, and action execution, and created a RAG search tool spanning Jira/Confluence/SharePoint for operations stakeholders.”
Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems
“Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.”
Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps
“Built a production real-time fraud detection and customer-support automation platform at Citibank, tackling extreme class imbalance (reported ~1:5000) and strict latency constraints. Combines hands-on MLOps (Airflow, Kubernetes, MLflow; Snowflake/Spark/S3 integrations; CI/CD model promotion) with cross-functional delivery to Risk & Compliance focused on interpretability and reducing false positives.”
Mid-level AI/ML Engineer specializing in MLOps, NLP, and Computer Vision
“Built and deployed a production LLM-powered text extraction/classification system that converts messy unstructured reports into searchable insights, running on AWS SageMaker with automated retraining and monitoring. Strong in orchestration (Step Functions/Kubernetes/Airflow patterns) and reliability practices (gold datasets, prompt/tool unit tests, shadow/canary/A-B testing, guardrails/rollback), and has experience translating non-technical stakeholder needs into an NLP workflow plus dashboard.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Mid-level AI/ML Engineer specializing in Generative AI and healthcare data
“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”