Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“Built and deployed a production RAG pipeline at PNC Financial Services to let risk/compliance analysts query millions of internal financial documents in natural language, reducing manual search and speeding regulatory validation. Demonstrates deep practical experience with large-scale document ingestion/OCR cleanup, retrieval performance tuning (hierarchical indexing, caching), and LLM reliability controls (grounding, citations, abstention), plus cloud orchestration on Azure and AWS.”
Mid-level AI/ML Engineer specializing in Generative AI and RAG pipelines
“AI/LLM engineer with healthcare domain experience who built a production clinical support “chart bot” for Molina, including PHI-safe ingestion of 200k+ PDF policies, vector retrieval, and a fine-tuned LLaMA served via vLLM on ECS Fargate. Demonstrated measurable performance wins (HNSW + namespace partitioning; 30% inference latency reduction) and a rigorous evaluation/monitoring approach, while partnering closely with nurses and operations teams to shape workflows and guardrails.”
Mid-level AI/ML Engineer specializing in LLMs, MLOps, and cloud-native ML
“LLM/agent engineer at USAA who built a production GPT-4o RAG conversational assistant for financial analysts, focused on regulatory interpretation and internal documentation search. Emphasizes compliance-grade reliability with strict grounding, safe fallbacks, and full auditability via MLflow/DVC plus human-in-the-loop review; reports ~45% reduction in ticket resolution time.”
Mid-level Full-Stack Software Engineer specializing in GenAI and SaaS platforms
Mid-level AI Engineer specializing in NLP, computer vision, and MLOps
Mid-level Autonomous Robotics Engineer specializing in ROS2, SLAM, and perception
“Robotics software engineer with deep ROS2 experience who built a modular autonomous robotics stack (perception/sensor fusion, localization+mapping, and planning). Led development of a LiDAR+camera fusion and multi-object tracking pipeline (PCL + YOLO + Kalman filtering) and debugged real-time SLAM/localization issues via QoS/timestamp synchronization, EKF tuning, and SLAM Toolbox parameter optimization using Gazebo/RViz and rosbag replay.”
Junior Data Scientist / ML Engineer specializing in LLMs and RAG systems
“Built and deployed a production enterprise LLM-powered RAG assistant for the construction domain, enabling natural-language querying across PDFs/reports and structured sources (SQL/CSV). Implemented an agent-based routing and multi-agent orchestration approach (LangChain/LangGraph) to reduce hallucinations, improve latency, and deliver actionable, structured responses based on stakeholder feedback.”
Mid-level AI/ML Engineer specializing in NLP, MLOps, and predictive analytics
“AI/ML Engineer at Fifth Third Bank who has shipped production fraud detection and risk analysis systems combining ML models with LLM-powered insights/explanations, including real-time monitoring, drift detection, and automated retraining under regulatory explainability constraints. Also built a hybrid-retrieval internal knowledge-base QA system (+20% top-5 relevance) and delivered a customer support chatbot that reduced first response time by 30% through strong stakeholder collaboration.”
Senior Machine Learning Engineer specializing in NLP, computer vision, and edge AI
“AI/LLM engineer who built a production RAG-based Text2SQL engine using Qdrant, including creating the underlying business/DB documentation, generating a test dataset, and designing detailed SQL-quality metrics for validation. Also partnered with non-technical stakeholders on a speech recognition project to prioritize medical terminology, improving accuracy through targeted corpora, lookup-table correction, and fine-tuning with a modified loss function.”
Mid-level AI Engineer specializing in generative AI, multimodal evaluation, and agentic RAG systems
“Built and productionized an agentic LLM automation system for an insurance client to determine medication eligibility, using prompt-chaining plus a RAG pipeline over policy rules and deploying on AWS (Lambda/Step Functions, Bedrock) with a serverless architecture. Addressed major data/schema mismatch issues via a semantic matching pipeline and validated performance through human agreement scoring, A/B testing, KPI monitoring, and confidence-based human-in-the-loop review.”
Mid-level AI/ML Engineer specializing in fraud detection and Generative AI (RAG)
“AI/ML engineer who has shipped production LLM and ML systems, including a RAG pipeline that ingested ~500k insurance/client documents to help adjusters answer questions faster and more consistently. Experienced in handling messy real-world document formats, tuning retrieval/chunking, and reducing latency via vector search optimization, precomputed embeddings, and caching. Also built orchestrated fraud-detection deployment workflows using AWS Step Functions and SageMaker, and partners closely with non-technical operations teams on NLP automation.”
Mid-level AI/ML Engineer specializing in fraud detection, recommender systems, and forecasting
“ML engineer/data scientist who built and deployed a real-time fraud detection platform at Citi on AWS SageMaker, processing 3M+ daily transactions and improving fraud response by 28%. Combines unsupervised anomaly detection (autoencoders) with ensemble models (XGBoost/Random Forest) plus Airflow/Step Functions orchestration, drift monitoring, and explainability (SHAP) to keep models reliable and compliant in production.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.”
Mid-level AI/ML Engineer specializing in MLOps, NLP, and real-time ML pipelines
“Built a production, real-time insurance claims document-understanding and fraud-detection pipeline using TensorFlow + fine-tuned BERT, deployed on AWS (SageMaker/Lambda/API Gateway) with automated retraining via MLflow and Jenkins. Addressed noisy documents and latency using augmentation and model distillation (3x faster), cutting claims ops manual review by ~50% and reducing fraudulent payouts.”
Mid-level Data Scientist specializing in cloud ML, MLOps, and predictive analytics
“NLP/ML engineer with hands-on healthcare and support-ticket text experience, building clinical-note structuring and semantic linking systems using spaCy, BERT clinical embeddings, and FAISS. Emphasizes production-grade delivery (Airflow/Databricks, PySpark, Docker, AWS/FastAPI/Lambda) and rigorous validation via clinician-labeled datasets, retrieval metrics, and user feedback.”
Senior Software Engineer specializing in Python microservices, cloud platforms, and ML-powered APIs
“Backend/data engineer focused on AWS-native Python systems: built a FastAPI microservice on ECS/Fargate serving real-time analytics at millions of daily requests with strong reliability (OAuth2/JWT, retries/timeouts, correlation IDs) and autoscaling. Also delivered Glue/PySpark ETL pipelines to curated S3 Parquet/Athena with schema evolution + data quality controls, owned Airflow pipeline incidents, and has a track record of measurable performance and cost optimizations (e.g., ~80%+ query latency reduction; reduced logging/NAT/Fargate spend).”
Intern Data Scientist specializing in robotics localization and SLAM
“Robotics/embodied-AI practitioner who built a TurtleBot3 LiDAR-fingerprint localization pipeline end-to-end (autonomous data collection + multi-head NN) achieving ~30 cm error in a 10x10 m space. Also has industry experience at Infineon building large-scale production data/AI pipelines and rapidly fixing a deployed recommendation system by correcting upstream data normalization, improving accuracy by 20%+.”
Mid-Level AI/Full-Stack Engineer specializing in agentic LLM systems and RAG
“Built and deployed Clyra.AI, an AI-driven daily scheduling product that uses a LangGraph-based multi-agent LLM pipeline (task extraction, verification, reflection) grounded with strict RAG over emails/documents/calendars and real-world signals like health metrics. Designed a custom agent orchestrator with bounded loops/termination conditions and a self-auditing verification/reflection layer to reduce hallucinations while controlling latency and cost via caching and model distillation.”
Mid-level Machine Learning Engineer specializing in LLMs, GenAI, and Computer Vision
“LLM/agent engineer who built a production multi-agent research automation system using LangGraph (planner, retriever with FAISS, supervisor, evaluator) with structured outputs and citation tracking for traceable reports. Emphasizes reliability and operations—LangSmith-based observability, multi-level testing, hallucination mitigation, and latency/cost controls—plus prior experience as a Computer Vision Software Engineer at Deepsight AI Labs working directly with non-technical customers.”
Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS
“Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.”
Mid-level AI/ML Engineer specializing in fraud detection and healthcare predictive analytics
“ML/AI engineer with production experience in high-scale banking fraud detection at Truist, building an end-to-end pipeline (Airflow/AWS Glue/Snowflake, PyTorch/sklearn) with automated retraining and Kubernetes-based deployment; delivered measurable gains (22% fewer false positives, 15% higher recall) and reduced manual ops ~40%. Also partnered with clinicians at Kellton to deploy an LLM system for summarizing/classifying clinical notes, improving review time and decision speed.”