Pre-screened and vetted.
Junior Product Engineer specializing in AI and SaaS
“Product intern at an AI startup (AdvisorGPT) who helped turn an LLM-based prototype into a production SEO blog-generation workflow that matched a firm’s tone/voice and targeted specific search phrases. Strong at bridging technical and non-technical teams, rapidly learning new AI tooling, and driving adoption through customer calls, UX improvements, and customer-facing demos/workshops.”
Senior Full-Stack Software Engineer specializing in Insurance, FinTech, and AI/ML applications
“AI/backend engineer who fine-tuned and deployed a production LLM chatbot using a LangChain + FAISS RAG pipeline, improving latency with PEFT/LoRA and driving strong business impact (40% customer adoption; 92% satisfaction). Also served as technical lead on a data aggregation system for underwriting/quoting, introducing GraphQL for more efficient, maintainable querying and applying CDC to keep cached ranking data fresh at scale.”
Mid-level AI/ML Engineer specializing in LLMs, MLOps, and cloud-native ML
“LLM/agent engineer at USAA who built a production GPT-4o RAG conversational assistant for financial analysts, focused on regulatory interpretation and internal documentation search. Emphasizes compliance-grade reliability with strict grounding, safe fallbacks, and full auditability via MLflow/DVC plus human-in-the-loop review; reports ~45% reduction in ticket resolution time.”
Mid-level Full-Stack AI Engineer specializing in agentic LLM platforms
Mid-level Data Science & AI Engineer specializing in LLMs and cloud ML platforms
“Built and deployed an LLM-powered mental health therapy assistant at AppHealth that segments users by stress level and delivers personalized, non-medical guidance. Implemented healthcare-focused safety guardrails (secondary LLM output filtering) and a multi-agent router workflow validated via statistical tests and therapist review, then scaled training/inference on AWS (EC2/Lambda/DynamoDB) with Kubernetes.”
Mid-level Machine Learning Engineer specializing in production ML, forecasting, NLP and computer vision
“Built and deployed a production LLM-powered support assistant for customer support agents using a RAG architecture over internal docs and past tickets, with human-in-the-loop review. Demonstrates strong applied LLM engineering focused on real-world constraints (hallucinations, latency, cost) using routing to smaller models, reranking, caching, and rigorous evaluation/monitoring (offline eval sets, A/B tests, KPI tracking).”
“ML/LLM engineer with production experience building a RAG-based LLM support assistant (FastAPI, Redis, Kafka) with multi-layer validation and human-in-the-loop feedback loops to improve accuracy over time. Has orchestration and MLOps depth using Airflow and Kubeflow on Kubernetes (autoscaling, alerting, monitoring) and delivered measurable ops impact (40% ticket efficiency improvement) by partnering closely with customer support teams.”
Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines
“Built a production, real-time insurance claims document-understanding and fraud-detection pipeline using TensorFlow + fine-tuned BERT, deployed on AWS (SageMaker/Lambda/API Gateway) with automated retraining via MLflow and Jenkins. Addressed noisy documents and latency using augmentation and model distillation (3x faster), cutting manual review in claims operations by ~50% and reducing fraudulent payouts.”
Intern AI Researcher specializing in NLP, LLMs, and knowledge graphs
“Built and shipped “LabMate,” a production AI assistant specialized in laboratory hardware, using a weighted multi-source RAG pipeline with reranking and reasoning-focused query decomposition to handle complex user questions. Deployed on a local GPU cluster with vLLM and NVIDIA MPS (plus OCR/VLM components), and established evaluation using synthetic + public reasoning datasets while collaborating weekly with non-technical admins to align requirements and resource constraints.”
Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS
“Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.”
Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps
“Applied LLMs to high-stakes domains (wildfire risk for emergency teams and loan approval via a fine-tuned IBM Granite model), with a strong focus on reliability—using RAG-based cross-validation to reduce hallucinations and continuous ingestion pipelines (MODIS satellite imagery via AWS Lambda) to keep data current. Experienced in production orchestration and MLOps-style workflows using Airflow, AWS Step Functions, and SageMaker Pipelines, and collaborates closely with analysts on KPI-driven evaluation.”
Mid-level AI/ML & Backend Engineer specializing in AI platforms and computer vision
“Backend engineer with hands-on experience building real-time, low-latency systems: owned the Python backend for a real-time crowd-monitoring product (top 5% at HackHarvard 2025) using OpenCV, GPU YOLO inference (PyTorch), WebRTC, and OAuth. Also has production Kubernetes/GitOps experience (Helm/Kustomize, GitHub Actions, Argo CD), Kafka-based event pipelines, and executed a minimal-downtime on-prem PostgreSQL migration to AWS EC2.”
Junior Robotics & AI Engineer specializing in autonomous systems and 3D perception
“Robotics software engineer who led system design for an Autonomous Trash Collecting ASV presented at the IEEE ICRA 2025 “Robots in the Wild” workshop, integrating YOLOv8-based perception with ROS autonomy logic to detour for trash while preserving a scientific survey mission. Also built ROS2 UAV capabilities combining ArUco detection, RTAB-Map SLAM, and PX4 integration, with strong simulation (Gazebo/VTD/MSC Adams) and CI/CD QA automation experience.”
Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics
“ML/AI engineer with production experience in high-scale banking fraud detection at Truist, building an end-to-end pipeline (Airflow/AWS Glue/Snowflake, PyTorch/sklearn) with automated retraining and Kubernetes-based deployment; delivered measurable gains (22% fewer false positives, 15% higher recall) and reduced manual operations by ~40%. Also partnered with clinicians at Kellton to deploy an LLM system for summarizing/classifying clinical notes, improving review time and decision speed.”
Mid-level Applied AI/ML Engineer specializing in agentic systems and LLM automation
“Built a production LLM-powered workflow at Frontier to extract structured signals from messy, high-volume documents and route work to the right teams, replacing a multi-day, error-prone manual process. Emphasizes production reliability with schema/consistency validation, re-prompting and deterministic fallbacks, plus async pipeline optimizations for predictable latency. Experienced with multi-agent orchestration (LangGraph, AutoGen, CrewAI) and AWS workflow tooling (Step Functions, SQS, Lambda), and delivered ~70% safe automation via stakeholder-driven thresholds and human review.”
Mid-level AI/ML Engineer specializing in healthcare ML and generative AI
“AI/LLM engineer at Humana who built and deployed a HIPAA-aware RAG system for clinical record retrieval, cutting search time dramatically and improving retrieval efficiency by 30%. Experienced with Spark-scale data preprocessing, QLoRA fine-tuning, LangChain orchestration, and MLflow+SageMaker integration, with a strong testing/evaluation discipline (A/B tests, human eval) to hit 95%+ accuracy and production latency targets.”
Mid-level AI & Data Scientist specializing in LLMs, RAG, and healthcare NLP
“Built a production LLM/RAG solution for healthcare operations teams to query large policy and care-guideline repositories in natural language. Improved domain alignment using vector retrieval plus parameter-efficient fine-tuning and prompt optimization, validated through internal user testing and metrics, cutting manual lookup time by ~40%. Also has hands-on experience orchestrating automated ML pipelines with Apache Airflow.”
Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and GenAI
“Full-stack engineer with cloud and GenAI experience who has owned production features end-to-end, including a reporting dashboard optimized from 14s to 5s using query/API refactoring and monitored via AWS CloudWatch. Also productionized an OpenAI-powered chatbot using LangChain with prompt design, guardrails, and evaluation via production logs and user feedback, and has led incremental legacy-to-microservices modernization with parallel runs to avoid regressions.”
Junior Software Engineer specializing in AI/ML, data pipelines, and cloud APIs
“Hands-on AI/LLM practitioner who built a RAG-based customer support chatbot and tackled production issues like data chunking complexity and response-time lag. Uses techniques such as overlapping chunks, semantic search, context engineering, and query routing, and has experience presenting technical demos/workshops to developer audiences.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.”
Mid-level Machine Learning Engineer specializing in LLM platforms and robotic perception
“Built and shipped a production multi-agent personal financial assistant at AlphevaAI on AWS ECS, combining FastAPI microservices, Redis/SQS orchestration, and Pinecone-based hybrid RAG (semantic + BM25) to ground financial guidance. Improved routing accuracy with an embedding-based SetFit + logistic regression intent classifier feeding an LLM router, and optimized UX with live streaming plus cost controls via model tiering and caching.”
Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection
“LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience, building Airflow-orchestrated EHR pipelines and delivering predictive maintenance solutions designed for FDA auditability, using SHAP/LIME explainability surfaced in Power BI.”
Mid-level AI Engineer specializing in LLMs, RAG, and data engineering
“AI Engineer Co-Op at Northeastern University who built a production Patient Persona Chat Bot to help nursing students practice clinical interactions, fine-tuning Llama 3 and integrating a LangChain + Pinecone RAG pipeline deployed on Amazon Bedrock. Emphasizes clinical accuracy and reliability with guardrails, retrieval filtering, and continuous evaluation, and also brings strong data engineering/orchestration experience (Airflow, EMR/PySpark, ADF, dbt, Databricks, Snowflake).”