Pre-screened and vetted.
Mid-level Data Analyst specializing in financial risk and healthcare analytics
“AI/ML engineer focused on real-time, production-grade LLM systems, with a robotics-adjacent mindset around latency/accuracy tradeoffs and modular pipelines. Built a scalable RAG-based assistant orchestrated as microservices on Kubernetes with Kafka async messaging, ONNX/quantization optimizations, and monitoring (Prometheus/Grafana), citing a ~35% hallucination reduction; has also experimented with ROS Noetic/Gazebo to understand ROS concepts.”
Mid-level AI/ML Engineer specializing in LLMs, NLP, and MLOps
“AI/ML engineer with healthcare domain depth who led a HIPAA-compliant, production LLM system at McKesson to automate clinical document understanding—extracting entities, summarizing provider notes, and supporting authorization decisions. Hands-on across Spark/Python ETL, Hugging Face + LoRA/QLoRA fine-tuning, RAG, and cloud-native MLOps (Airflow/Kubernetes/Step Functions, MLflow, blue-green on EKS/GKE), with explicit work on PHI handling and hallucination reduction.”
Mid-level AI/ML Engineer specializing in Generative AI and LLMOps
“Built and deployed a GPT-based RAG enterprise search system for healthcare clinicians, emphasizing low-latency performance and reduced hallucinations while maintaining end-to-end HIPAA compliance. Demonstrates deep applied experience with PHI-safe data governance (detection/redaction/de-identification), secure Azure ML deployment patterns, and orchestration of production LLM workflows using LangChain and Airflow.”
Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and AI integrations
“Backend engineer who has delivered large, measurable performance wins (10x throughput, 67% latency reduction) by combining Flask microservices, Redis caching, and AWS autoscaling/observability. Has hands-on depth in SQLAlchemy/Postgres optimization and production scaling pitfalls (cache consistency, connection exhaustion), plus experience deploying real-time ML inference (XGBoost) on AWS Lambda and building secure multi-tenant Kubernetes isolation.”
Mid-level Applied AI Engineer specializing in knowledge graphs, GraphRAG, and urban mobility
“ML/NLP practitioner focused on knowledge-graph-based retrieval for LLM question answering, including an urban/autonomous-vehicle decision-making use case. Built a hierarchical GraphRAG + vector database system and an entity-resolution pipeline that blends spatial and semantic similarity, validated using LLM-generated synthetic datasets; uses Python tooling like RDFLib, GraphDB, OpenAI APIs, and LangChain.”
Senior ML Engineer & Data Scientist specializing in LLM agents, retrieval/ranking, and MLOps
“Machine Learning Engineer currently at Webster Bank building an enterprise-scale LLM agent for Temenos Journey Manager/Maestro, using RAG-style multi-stage retrieval with FAISS/Pinecone, hybrid dense+sparse search, and LoRA fine-tuning optimized via NDCG/MAP and A/B testing. Previously handled messy incident/telemetry data at Deuta Werke GmbH with deterministic + fuzzy entity resolution, and has strong production data engineering experience across Spark/Hadoop and Python ETL systems.”
Executive CISO specializing in cybersecurity strategy, AI governance, and enterprise risk
“Founder of Nexus Security Advisors, an early-stage cybersecurity consulting firm focused on supporting small and mid-sized businesses with practical security needs. Brings cybersecurity leadership plus AI governance and compliance focus, and has experience designing and delivering hands-on cyber threat intelligence training by validating needs with leadership and partnering to build a practical curriculum.”
Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning
“Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.”
Mid-level AI/ML Engineer specializing in enterprise ML, MLOps, and Generative AI
“ML/LLM engineer who has shipped production RAG systems (LangChain + HF Transformers + FAISS) with hybrid retrieval and cross-encoder re-ranking, deployed via FastAPI/Docker/Kubernetes and monitored with MLflow. Also partnered with wealth advisors at Edward Jones to deliver a client retention model with SHAP-driven explanations and a dashboard that improved trust and adoption and reduced high-value client churn.”
Mid-level Software Development Engineer specializing in full-stack and LLM/AI systems
“AI engineer with hands-on production experience building an end-to-end RAG system that reduced document-answering time from hours to minutes, improving accuracy through chunk overlap and hybrid BM25+semantic retrieval. Also built a LangGraph-based agent that researches company financial news via web search (Google Serper), using Pydantic structured outputs and checkpointing for reliability; experienced collaborating with non-technical stakeholders at JPMC and communicating ROI.”
Mid-level GenAI Engineer specializing in production AI agents and evaluation pipelines
“Built and shipped a production LLM-powered internal operations automation platform using LangChain RAG (Pinecone) and FastAPI microservices, deployed on AWS EKS, serving 10k+ daily interactions. Implemented a rigorous evaluation/observability stack (golden datasets, prompt regression tests, MLflow, retrieval metrics, hallucination monitoring) that drove the hallucination rate below 2% and improved reliability, and partnered closely with non-technical ops leaders to cut manual lookup work by 60%+.”
Mid-level Data & GenAI Engineer specializing in lakehouse, streaming, and RAG platforms
“Built a production internal LLM-powered knowledge assistant using a RAG architecture (Python, LLM APIs, cloud services) that answers employee questions with sourced, grounded responses from internal documents. Demonstrates strong practical depth in retrieval tuning (chunking/metadata filters), orchestration with LangChain, and production reliability practices (latency optimization, automated embedding refresh, evaluation metrics, logging/monitoring) while partnering closely with non-technical operations teams.”
Mid-level Full-Stack Developer specializing in cloud-native microservices and distributed systems
“Software engineer with hands-on ownership of both fintech checkout improvements (saved payment methods/one-click checkout with tokenization and feature-flag rollouts) and production LLM/RAG systems for customer support. Demonstrates strong operational rigor via guardrails, evaluation loops integrated into CI/CD, and scalable data pipelines handling messy PDFs/CSVs/logs with reliability and observability.”
Senior AI Engineer specializing in forward-deployed voice agents and incident-response automation
“Forward-deployed engineer (FDE) at Bland.ai and founder of Fi (incident-response agent) who routinely takes LLM/agentic concepts from prototype to production. Has hands-on experience reverse-engineering undocumented systems to deliver integrations, building LLM testbeds for voice-agent reliability, and rapidly shipping RAG/semantic search solutions (e.g., Confluence runbooks) after deep customer discovery with DevOps/SRE teams.”
Junior Backend Software Engineer specializing in conversational AI and cloud APIs
“Backend/ML-focused software engineer who built and evolved a Python/FastAPI backend for a large-scale conversational AI platform, decoupling API and inference services to improve stability and deployment velocity. Experienced in production hardening (timeouts/fallbacks/monitoring), secure multi-tenant systems (JWT/RBAC/RLS), and low-risk migrations using shadow deployments and incremental traffic ramp-ups.”
Mid-level AI Engineer specializing in LLMs, RAG, and content automation
“AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.”
Junior Data Scientist specializing in healthcare ML and clinical NLP/LLMs
“Healthcare-focused LLM engineer who has built two production clinical applications: an automated structured clinical report generator from physician-patient conversations and a RAG-based chatbot for retrieving patient history (procedures, allergies, etc.). Demonstrates strong applied RAG expertise (overlapping chunking, entity dependency graphs, temporal filtering, graph RAG) to reduce hallucinations/omissions and partners closely with clinicians to automate hospital workflows.”
Junior Software Engineer specializing in AI, backend systems, and AWS cloud
“Built and shipped a production multi-agent conversational AI platform (Monitor agent + RAG + 4 additional agents) with enterprise REST APIs, using ChromaDB-grounded WCAG knowledge to keep responses accurate while varying tone via personality modes and conversation memory. Has experience at LinkedIn delivering technical demos and pre-sales guidance to both engineering teams and C-level stakeholders, acting as a translator between sales and technical teams to drive adoption.”
Principal Software Architect specializing in AI/ML and cloud-native full-stack platforms
“AI/LLM engineer who built a production content-generation system for nursing education, combining multimodal RAG over proprietary PDFs (including images) with structured Cosmos DB data and external sources. Strong focus on production reliability—prompt-chaining with LangChain, validation/guardrails, and Azure-based monitoring/observability—plus experience designing Azure AI agents with tool integrations like Bing Search.”
Mid-level AI/ML Engineer specializing in GenAI, LLMs, and computer vision
“Built and productionized a multi-agent, LLM-powered document understanding system to replace manual review of long documents, using LangGraph orchestration plus RAG to reduce hallucinations. Implemented layered reliability controls (structured templates, checker agent, and human-in-the-loop feedback) and reported ~40% speed improvement after orchestration; also has hands-on Airflow experience for scheduled data pipelines.”
Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps
“LLM/ML platform engineer with hands-on experience taking an LLM document summarization prototype into a production-grade service on AWS EKS, emphasizing low-latency inference, drift monitoring, and safe CI/CD rollouts (canary + rollback). Strong in real-time debugging of agentic/RAG systems (tracing, retrieval/index drift fixes) and in developer enablement through practical workshops (Docker/Kubernetes/FastAPI) plus pre-sales support via demos and benchmarks to close pilots.”
Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training
“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”
Junior Data Scientist specializing in fraud analytics and cloud data platforms
“Built and deployed production LLM-powered document summarization/classification systems using embeddings, vector databases (RAG-style retrieval), and automated evaluation (BERTScore/ROUGE), with a focus on monitoring and scalable cloud pipelines. Also partnered with a fraud analytics team to deliver a transaction anomaly detection solution, translating model outputs into Power BI dashboards and actionable KPIs while iterating on thresholds and alerts based on stakeholder feedback.”
Intern Machine Learning & AI Automation Engineer specializing in ML workflows and AI hardware
“ML practitioner with hands-on experience adapting diffusion models (DDPM + U-Net in PyTorch) to improve low-dose CT medical imaging quality via denoising and upsampling against high-dose ground truth. Also built a RAG workflow during a recent internship by cleaning client survey data, embedding with OpenAI text-embedding-3-large, and indexing in Pinecone with MD5 deduplication, alongside a strong emphasis on production-grade Python practices.”