Pre-screened and vetted.
Junior AI Engineer specializing in RAG pipelines and agentic AI systems
“Built and shipped production RAG/agentic systems in high-stakes domains (biomedical and legal), including an enterprise biomedical document retrieval platform over ~10k scientific docs and a multilingual African-law assistant at the World Bank. Deep hands-on experience with LangChain/LangGraph/LlamaIndex and evaluation tooling (LLM-as-a-judge, safety/hallucination detection), with measurable gains in retrieval quality and hallucination reduction.”
Mid-level Software Engineer specializing in ML, LLM apps, and cloud data systems
“Built a production SQL chatbot for access-log analytics that replaced manual custom report requests with natural-language querying, using LangGraph and a ChromaDB-backed RAG pipeline for grounded, consistent answers. Implemented a privacy-preserving design where the LLM never sees raw customer data (only query metadata) and has experience building multi-agent/tool-calling systems with LangGraph (DeepAgents), including solving sub-agent communication drift via self-reflection.”
Mid-level Data Scientist specializing in ML, NLP, and Generative AI
“Data engineering / ML practitioner with experience at MetLife building transformer-based sentiment analysis over large unstructured datasets and productionizing pipelines with Airflow/PySpark/Hadoop (reported 52% efficiency gain). Also implemented embedding-based semantic search using Pinecone/Weaviate to improve retrieval relevance and enable RAG for customer support and document matching use cases.”
Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps
“Built an LLM-powered academic research assistant for a professor (LangChain + OpenAI + arXiv) focused on synthesizing papers quickly, with emphasis on reliability (ReAct prompting, citation verification) and cost control (caching). Has production MLOps/orchestration experience at Cisco and HCL Tech using Kubernetes, plus MLflow and GitHub Actions for lifecycle management and CI/CD.”
Mid-level Data Scientist & Generative AI Engineer specializing in LLMs and RAG
“ML/NLP practitioner who built a retrieval-augmented generation (RAG) system for large financial and operational document sets using Sentence-Transformers (all-mpnet-base-v2) and a vector DB (e.g., Pinecone), with a strong focus on retrieval evaluation and chunking strategy optimization. Experienced in entity resolution (rules + embedding similarity with type-specific thresholds) and in productionizing scalable Python data workflows using Airflow/Dagster and Spark.”
Mid-level AI/ML Engineer specializing in NLP, LLMs, and RAG for finance and healthcare
“Built an AI lending assistant (RAG + DeBERTa) used by credit analysts to retrieve policies and past loan decisions, tackling real production issues like hallucinations, document quality, and sub-second latency. Deployed a modular, Dockerized AWS architecture (ECS/EMR + load balancer) with load testing, caching/precomputed embeddings, and CloudWatch monitoring, and used Airflow to automate scheduled data/embedding/vector DB refresh pipelines with retries and alerts.”
Mid-level AI/ML Engineer specializing in LLMs, GenAI, and NLP
“AI/ML Engineer who built a production RAG-based LLM system for insurance policy documents, turning thousands of messy PDFs into a searchable index using LangChain, Azure AI Search vectors, hybrid retrieval, and FastAPI. Strong focus on evaluation (MRR/precision@k/recall@k, REGAS) and performance optimization (vLLM), with prior clinical NLP experience using BERT-based NER validated on ground-truth datasets.”
Junior Data Scientist specializing in ML, LLMs, and RAG applications
“University hackathon finalist (2nd place) who built CareerSpark, a production-style multi-agent career guidance app in 24 hours using a hierarchical debate architecture with a moderator/judge agent. Has startup internship experience at LiveSpheres AI using LangChain for multi-LLM orchestration, and demonstrates a structured approach to testing/evaluation (golden sets, integration sims, latency/accuracy KPIs) plus strong non-technical stakeholder communication.”
Mid-level Data Scientist specializing in Generative AI and multimodal systems
“Recent J&J intern who built a conversational RAG agent and led a shift from a monolithic model to a modular RAG workflow, cutting response time from several days to under a second by tackling data fragmentation, context retention, and embedding/latency optimization. Also worked on a large (7B-parameter) multimodal VQA pipeline for healthcare research and stays current via NeurIPS/ICLR and open-source contributions.”
Senior Computer Vision & Robotics Engineer specializing in perception and warehouse automation
“Robotics engineer with hands-on experience scaling a multi-vendor heterogeneous warehouse robot fleet, building a distributed “traffic manager” for collision avoidance and real-time rerouting using CBS/MAPF and DCOP-style negotiation. Strong real-time/safety-critical systems background (RTOS, deterministic lock-free multithreading) plus modern perception and simulation tooling (CNN-LSTM/transformers, CARLA/Isaac Sim, VIO/GTSAM, camera-IMU calibration). Startup-oriented and comfortable moving quickly from prototype to production.”
Mid-level Conversational AI Developer specializing in enterprise chatbots and RAG
“ML/AI practitioner with hands-on experience deploying models to production and optimizing for low-latency inference using pruning/quantization, with deployments on AWS SageMaker and Azure ML. Has orchestrated end-to-end ML pipelines with Airflow and Kubeflow (ingestion through evaluation) and emphasizes reproducibility via containerization and version-controlled artifacts, while effectively partnering with non-technical stakeholders using dashboards and business-aligned metrics.”
Mid-level Machine Learning Engineer specializing in deep learning and generative AI
“AI/ML engineer who has deployed transformer-based NLP systems to production via Python REST APIs and Kubernetes on AWS/Azure, with a strong focus on latency optimization (p95), reliability, and scalable orchestration. Demonstrates pragmatic model tradeoff decision-making and strong stakeholder collaboration—improving adoption by making outputs more actionable with summaries, extracted fields, and confidence indicators.”
Junior Software Engineer specializing in full-stack systems and LLM automation
“Full-stack engineer who shipped a production "Financial Insight" assistant dashboard in Next.js App Router/TypeScript, integrating a RAG pipeline (embeddings + ChromaDB + LLM) via route handlers and owning post-launch performance (latency, token cost, retrieval relevance). Also built/optimized Postgres-backed workflows for an outbound dialer and callback routing engine handling ~10,000 daily contacts, validating query performance with EXPLAIN (ANALYZE, BUFFERS).”
Senior Data Scientist/ML Engineer specializing in scalable ML and LLM systems
“Built and deployed an end-to-end product that brings a research-paper approach into production for large-scale time-series clustering, with attention to partitioning, latency, and scalability. Also designed a Python-based backend validation service (comparing outputs to database ground truths) and handled production reliability issues by reproducing dataset-specific crashes and hardening corner-case behavior with client-friendly errors.”
Executive CTO / Software R&D Leader specializing in mobile, GPU computing, and quantitative finance
“Serial entrepreneur since leaving corporate in 2009, working largely for equity on multiple startups. Building (1) academically rigorous, anti-overfitting quant/backtesting tools for retail investors (with potential applicability to smaller hedge funds lacking quant staff) and (2) a partner-led “social-as-a-service” platform for verticals like real estate/PropTech (including FSBO use cases) focused on first-party data capture vs. big tech.”
Mid-Level Full-Stack Software Developer specializing in Java microservices and modern web apps
“Software engineer with experience building and iterating high-volume Spring Boot microservices on AWS (Docker/Kubernetes) and integrating with React front-ends. Also delivered an LLM-powered document summarization system using embeddings + retrieval (RAG) with grounding/guardrails and built evaluation loops that directly drove retrieval and chunking improvements. Has scaled Kafka-based pipelines processing millions of messy financial/infrastructure records with reliability and cost/latency tradeoff management.”
Senior Data Engineer specializing in cloud data platforms and ML pipelines
“Data engineer focused on AWS-based enterprise data platforms, owning end-to-end pipelines from multi-source batch/stream ingestion (Glue/Kinesis/StreamSets/Airflow) through PySpark transformations into curated datasets for Redshift/Snowflake. Emphasizes production reliability with strong monitoring/observability and data quality gates, and reports ~30% performance improvement plus improved SLAs and latency after optimization.”
Mid-level Data Scientist specializing in MLOps and Generative AI
“Robotics software/ML engineer who built perception and navigation-related ML systems for autonomous supermarket carts, including object detection, shelf recognition, and obstacle avoidance. Strong ROS/ROS2 practitioner who optimized real-time performance (reported 50% latency reduction) and deployed containerized ROS/ML pipelines at scale using Docker, Kubernetes, and CI/CD.”
Senior Software Engineer specializing in cloud-native microservices (AWS, Java, Kafka)
“Backend engineer with hands-on experience modernizing high-volume transactional systems by decomposing monoliths into Spring Boot microservices on AWS, using Kafka for async workflows and Redis/SQL tuning for latency. Has built Python/FastAPI services with strong API contracts and production-grade security (OAuth2/JWT, RBAC, row-level security), and proactively hardened payment flows against race conditions and double-charging via idempotency.”
Junior AI/ML Engineer specializing in RAG systems and cloud-native MLOps
“Built and shipped a production LLM-powered RAG system at Upstart enabling natural-language search across 50k+ scattered internal technical docs. Delivered sub-300ms p95 latency for ~50 active users with strong hallucination safeguards (retrieval-first, thresholds, citations) plus robust testing/monitoring and cost controls (prompt caching cutting API spend ~20%).”
Senior AI/ML & Full-Stack Engineer specializing in GenAI, RAG, and MLOps platforms
“Backend/data platform engineer who owned end-to-end production services for a fleet analytics/GenAI platform, spanning FastAPI microservices on Kubernetes and AWS (EKS + Lambda) event-driven workloads. Strong in reliability/observability (OpenTelemetry, circuit breakers, idempotency), data pipelines (Glue/Airflow/Snowflake), and measurable performance/cost wins (SQL 10s to <800ms P95; ~30% compute cost reduction).”
Mid-level Robotics Engineer specializing in autonomy, perception, and sensor fusion
“Robotics software engineer who contributed to an autonomous bartender robot (mobile base + ReactorX200 arm), owning manipulation/grasping, Gazebo simulation, and a YOLOv6 object-detection pipeline built from a manually collected/labeled dataset. Also handled system-level hardware bring-up integrating Raspberry Pi to ESP32 over micro-ROS on ROS2 Foxy, and has additional ROS package experience in EKF sensor fusion (IMU+GPS) and an autonomous disaster response boat.”
Mid-level Machine Learning Engineer specializing in Generative AI and LLM applications
“GenAI engineer who has deployed production LLM/RAG chatbots for internal document search, focusing on reliability (hallucination reduction via prompt guardrails + retrieval filtering) and performance (latency improvements via caching). Experienced with LangChain/LangGraph orchestration for multi-step agent workflows and iterates using monitoring/logs and benchmark-driven evaluation while partnering closely with product and business teams.”