Pre-screened and vetted.
“GenAI/ML engineer from Deloitte who built and shipped a production RAG-based internal search assistant for support teams, delivering quantified operational gains (20% effort reduction, 35% faster manual lookup). Experienced in enterprise-grade LLM reliability (grounding/hallucination control), compliance/security constraints, and rapid release cycles using CI/CD, MLflow, and orchestration tools (Airflow, Databricks Jobs, LangChain).”
Senior GenAI/ML Engineer specializing in LLMs, RAG, and multimodal generative AI
“LLM/RAG engineer with production deployments in highly regulated domains (Frost Bank and GE Healthcare). Built secure, explainable document-grounded Q&A systems using LoRA fine-tuning, strict RAG with confidence thresholds, and citation-based responses; also established evaluation/monitoring (golden QA sets, hallucination tracking, drift) and achieved ~40% latency reduction through retrieval/prompt tuning.”
Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps
“Built a production AI-driven contract/document extraction system combining OCR, normalization, and LLM schema-guided extraction, orchestrated with PySpark and Azure Data Factory and loaded into PostgreSQL for analytics. Emphasizes reliability at scale—using strict JSON schemas, confidence scoring, targeted retries, and multi-layer validation to control hallucinations while processing thousands of PDFs per hour—and partners closely with non-technical business teams to refine fields and deliver usable dashboards.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and Clinical AI
“Built and productionized a HIPAA-compliant LLM+RAG Clinical AI assistant at Optum, fine-tuning GPT/LLaMA on de-identified patient notes and integrating FAISS/Pinecone for sub-second retrieval; reported to cut diagnosis time by ~20 minutes per case. Experienced in orchestrating ML pipelines (Airflow, AWS Step Functions, Azure Data Factory) and in reliability techniques for LLM systems (grounding, citations, confidence filters, monitoring) while partnering closely with clinicians and compliance teams.”
Mid-level Data Analyst specializing in financial risk and healthcare analytics
“AI/ML engineer focused on real-time, production-grade LLM systems, with a robotics-adjacent mindset around latency/accuracy tradeoffs and modular pipelines. Built a scalable RAG-based assistant orchestrated as microservices on Kubernetes with Kafka async messaging, ONNX/quantization optimizations, and monitoring (Prometheus/Grafana), citing a ~35% hallucination reduction; has also experimented with ROS Noetic/Gazebo to understand ROS concepts.”
Junior Machine Learning Engineer specializing in generative AI and computer vision
“AI engineer who deployed a production LLM-powered safety system for an education platform, combining rule-based checks, multi-LLM verification, and selective context (prompt+image vs image-only) to prevent explicit prompts/images from getting through. Strong focus on reliability via benchmarking, trace-based failure analysis, and continuous improvement driven by stakeholder feedback and manual review.”
Senior Machine Learning Engineer specializing in MLOps and NLP/GenAI
“Built a production LLM-agent framework for a startup that performs daily financial/trading analysis by combining live market data with internal tools, including a centralized memory module to prevent context drift and reduce hallucinations. Also implemented an Airflow-orchestrated retail price forecasting pipeline deployed to AWS endpoints, scaling parallel workloads via Kubernetes Executor and validating systems with rigorous functional + LLM-specific metrics and cross-team collaboration.”
Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and AI integrations
“Backend engineer who has delivered large, measurable performance wins (10x throughput, 67% latency reduction) by combining Flask microservices, Redis caching, and AWS autoscaling/observability. Has hands-on depth in SQLAlchemy/Postgres optimization and production scaling pitfalls (cache consistency, connection exhaustion), plus experience deploying real-time ML inference (XGBoost) on AWS Lambda and building secure multi-tenant Kubernetes isolation.”
Mid-Level Software Engineer specializing in AI/ML and distributed systems
“Software engineer with production experience building a serverless monolith and multi-layer video pipeline at easyML, plus hands-on integration of multiple LLM providers (Grok/Claude/OpenAI) into a full-stack app. Interested in robotics via computer vision (OpenCV/OpenMMLab), with a strong real-time systems mindset around SLOs, latency, determinism, and reliability; also has low-level OS experience writing a keyboard device driver.”
Mid-level Data Scientist specializing in MLOps, LLM/RAG applications, and deep learning
“Built and deployed a production compliance automation RAG system (at Citi) that generates citation-backed, schema-validated risk summaries for regulatory document review. Emphasizes regulated-environment reliability with retrieval-only grounding, abstention, confidence thresholds, and immutable audit logging, plus orchestration using LangChain/LangGraph and Airflow. Reported ~60% reduction in compliance review effort while maintaining high precision and traceability.”
Mid-level AI/ML Engineer specializing in enterprise ML, MLOps, and Generative AI
“ML/LLM engineer who has shipped production RAG systems (LangChain + HF Transformers + FAISS) with hybrid retrieval and cross-encoder re-ranking, deployed via FastAPI/Docker/Kubernetes and monitored with MLflow. Also partnered with wealth advisors at Edward Jones to deliver a client retention model with SHAP-driven explanations and a dashboard that improved trust, adoption, and reduced high-value client churn.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and real-time fraud detection
“GenAI/ML engineer who has shipped production agentic systems in highly regulated and high-throughput environments, including an AWS Bedrock-based fraud/compliance workflow at U.S. Bank with PII redaction and hallucination detection that cut investigation time by 50%+. Also built and evaluated RAG and recommendation systems at Target, using RAGAS-driven testing, hybrid retrieval with re-ranking, and SHAP explainability dashboards to align model behavior with merchandising business KPIs.”
Executive Technology Leader specializing in SaaS scale-up, Azure cloud, and AI/ML products
“Former President/CEO who led MyGov through a successful acquisition, now on sabbatical building ncognait LLC—an AI-enabled app studio with one product launched (Taistful) and another in private beta (AIDONIS). Focused on startup CTO/founding roles and highly opinionated about using agentic coding to dramatically compress product development cycles and compete with larger incumbents.”
Junior Cloud & AI/ML Engineer specializing in AWS GovCloud and MLOps
“Robotics software engineer with hands-on ROS 2 autonomy experience on an obstacle-avoiding quadrotor (ROS 2 + Gazebo + PX4 + Nav2/SLAM), including custom work to extend Nav2 into a 3D aerial domain and output PX4 trajectory setpoints. Also built cost-saving ML infrastructure (PostgreSQL + AWS data-cleaning pipeline) and improved object detection accuracy by 40% using CUDA/PyTorch, with strong containerization and CI/CD practices (Docker + Kubernetes, aggressive version pinning) to prevent environment drift.”
Mid-level AI Engineer specializing in LLMs, RAG, and content automation
“AI/LLM engineer who built a production autonomous GenAI content ecosystem that generates short-form scripts, extracts viral highlights from long-form video, and dubs content into 33+ languages. Focused on making LLM outputs production-safe via schema enforcement, token-to-time alignment, critic-agent verification, and scalable async orchestration—cutting manual workflows by ~90% and saving $200k+ annually.”
Principal Software Architect specializing in AI/ML and cloud-native full-stack platforms
“AI/LLM engineer who built a production content-generation system for nursing education, combining multimodal RAG over proprietary PDFs (including images) with structured Cosmos DB data and external sources. Strong focus on production reliability—prompt-chaining with LangChain, validation/guardrails, and Azure-based monitoring/observability—plus experience designing Azure AI agents with tool integrations like Bing Search.”
Mid-level AI/ML Engineer specializing in GenAI, LLMs, and computer vision
“Built and productionized a multi-agent, LLM-powered document understanding system to replace manual review of long documents, using LangGraph orchestration plus RAG to reduce hallucinations. Implemented layered reliability controls (structured templates, checker agent, and human-in-the-loop feedback) and reported ~40% speed improvement after orchestration; also has hands-on Airflow experience for scheduled data pipelines.”
Mid-level Machine Learning Engineer specializing in GenAI, LLMs, and real-time ML systems
“Built and deployed a production long-form article summarization system using BART/T5/PEGASUS, tackling real-world constraints like token limits, latency/quality tradeoffs, and factual drift via chunking/merge logic and constrained decoding. Uses pragmatic Python-based pipeline orchestration (scheduled jobs, modular scripts, logging/retries) and iterates with stakeholder feedback to make outputs genuinely useful for content workflows.”
Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training
“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”
Intern Machine Learning & AI Automation Engineer specializing in ML workflows and AI hardware
“ML practitioner with hands-on experience adapting diffusion models (DDPM + U-Net in PyTorch) to improve low-dose CT medical imaging quality via denoising and upsampling against high-dose ground truth. Also built a RAG workflow during a recent internship by cleaning client survey data, embedding with OpenAI text-embedding-3-large, and indexing in Pinecone with MD5 deduplication, alongside a strong emphasis on production-grade Python practices.”
Mid-level Full-Stack Developer specializing in FinTech and Healthcare systems
“Open-source contributor who improved React Query’s caching/subscription behavior to reduce unnecessary re-renders via debouncing and batched updates, validated with benchmarking and extensive tests. Also maintained a Flask extension and resolved production background-task hangs by tracing Redis connection handling issues, adding cleanup/retry logic and troubleshooting docs. In a fast-paced startup, owned the design of a Celery+Redis multi-queue background processing system with Prometheus-based observability.”
Executive Technology Leader (CTO) specializing in AI, cloud, and distributed platforms
“Engineering leader who stays hands-on in high-leverage technical areas (architecture, scalability, reliability) while operating at an executive level. Led StagePilot’s shift from a tightly coupled legacy system to a cloud-native, event-driven real-time platform proven at 1M+ concurrent users, and previously scaled multiple SRE teams at McGraw-Hill with SLOs, on-call, and blameless ops practices.”
Senior Software Engineer specializing in Python automation and hybrid cloud integration
“Embodied AI / robotics-focused ML engineer with experience at JPMorgan and EY building language-to-robot control systems that connect transformer/LLM intent to safe real-world robotic actions. Designed production-grade, low-latency architectures (Kafka/Redis, monitoring, CI/CD) and applied sim-to-real and model distillation to make research ideas deployable on physical systems.”