Pre-screened and vetted.
Mid-level GenAI Engineer specializing in AI agents, RAG, and LLM evaluation
“Asset Management Risk professional at Fidelity Investments who built and productionized an agentic RAG platform enabling compliance staff and analysts to query 10,000+ fund documents with cited answers in seconds. Implemented structure-aware semantic chunking (AWS Textract), hierarchical retrieval, and hybrid search to raise accuracy from 68% to 94%, and built an evaluation framework tracking accuracy/latency/cost/hallucinations, delivering 40+ hours/month saved and zero critical production failures.”
“Backend engineer with experience in both regulated healthcare and finance: built a multi-agent RAG system to generate FDA regulatory approval documents for biomedical devices, improving retrieval accuracy via hybrid search (semantic + BM25) and hierarchical chunking. Previously at JPMorgan Chase, led a Java microservice refactor and AWS migration using Elasticsearch-first patterns, caching, and safe rollout strategies (parallel runs, canary, blue-green) in asset/wealth management.”
Senior AI & Machine Learning Engineer specializing in NLP, GenAI, and MLOps
“ML/GenAI practitioner with healthcare domain depth who built and deployed a production cervical-cancer EMR classification system using a hybrid rules + medical BERT approach, optimized for high recall under severe class imbalance and PHI constraints. Experienced running end-to-end production ML/LLM pipelines with Apache Airflow (validation, promotion/rollback, monitoring, retraining) and partnering closely with clinicians to calibrate thresholds and implement human-in-the-loop review.”
Mid-level Machine Learning Engineer specializing in Generative AI and MLOps
“LLM/agent engineer who has shipped production RAG chatbots in sustainability-focused domains, including a packaging recommendation assistant that standardized messy user inputs and used Pinecone-backed retrieval over product/regulatory data. Experienced orchestrating end-to-end ML workflows with Airflow and AWS Step Functions/Lambda, emphasizing reliability (property-based testing, circuit breakers, OpenTelemetry) and measurable performance (latency/cost). Partnered closely with non-technical leadership to ship 3 weeks early, driving adoption by 150+ businesses and ~20% reported waste reduction.”
Mid-level AI/ML Engineer specializing in LLM applications and cloud-native systems
“LLM engineer who has shipped production AI systems, including an RFP requirements extraction platform (OpenAI o4-mini + Azure AI Search + FastAPI) achieving 90%+ accuracy and ~5x throughput through grounding, structured outputs, parallelization, and caching. Also partnered with legal/compliance stakeholders at Nexteer Automotive to deliver an AI document comparison tool with traceability and confidence indicators, adopted by non-technical users and saving ~2 FTEs of review time.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and fraud/risk analytics in Financial Services
“Built and shipped a production-grade GenAI Fraud & Compliance Investigation Copilot for a large US bank, integrating OCR-extracted documents, structured data, and prior case history to generate grounded, regulator-friendly summaries and red-flag highlights. Demonstrates strong end-to-end LLM systems engineering (LangGraph/LangChain, hybrid retrieval with FAISS+BM25, guardrails/citations, streaming/latency optimization) plus rigorous evaluation and close partnership with compliance stakeholders.”
Mid-level AI/ML Engineer specializing in NLP, MLOps, and Generative AI
“Built and deployed a production generative AI chatbot at NVIDIA using LangChain + GPT-3 integrated with internal data sources, cutting response time nearly in half and improving CSAT by ~12 points. Also delivered LLM-driven QA tools by fine-tuning Hugging Face transformer models and deploying via an AWS-based pipeline (Lambda/Glue/S3) with orchestration (Airflow/Step Functions), CI/CD, Kubernetes, and monitoring (MLflow/Splunk/Power BI).”
Mid-level AI/ML Engineer specializing in Generative AI and MLOps
“GenAI/LLM engineer and architect who built and deployed a production generative AI financial forecasting and scenario analysis platform at McKinsey, leveraging Claude (Anthropic), LangChain, Airflow, MLflow, and AWS SageMaker. Demonstrates strong LLMOps/MLOps rigor (monitoring, drift detection, automated retraining) and deep experience implementing global privacy controls (GDPR, differential privacy, audit trails) while partnering closely with finance executives and legal/IT stakeholders.”
Mid-level AI/ML Engineer specializing in healthcare NLP, real-time risk systems, and ML platforms
“LLM-focused customer-facing engineer who repeatedly takes document Q&A and agentic prototypes into secure, monitored production systems. Experienced in reducing hallucinations via RAG + guardrails, diagnosing retrieval/embedding issues in real time, and partnering with sales to run metrics-driven PoCs that overcome accuracy/security objections and drive adoption.”
Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps
“Built a production LLM/RAG-based ‘model excellence scoring’ system at Uber to automatically evaluate hundreds of ML models, standardizing quality assessment and cutting evaluation time from days to minutes on GCP. Also delivered an NLP document classification solution for insurance claims at Globe Life, partnering closely with compliance/operations and improving routing accuracy from ~85% (manual routing) to 93% (model).”
Intern Robotics & Computer Vision Engineer specializing in surgical robotics
“Robotics software engineer who built and owned an autonomous laparoscope tracking system on a UR3e with an eye-in-hand RealSense camera, integrating YOLO-based tool detection with velocity control under a strict RCM constraint and deploying successfully in a hospital setting. Deep ROS2/MoveIt2 experience (architecture, QoS, custom nodes) plus autonomy stack work across SLAM, planning, and real-time latency/control debugging.”
Mid-level Applied AI Engineer specializing in LLM infrastructure and model optimization
“LLM engineer who has deployed privacy-preserving, real-time workplace risk monitoring over massive enterprise chat/email streams, tackling latency, hallucinations, and extreme class imbalance with model benchmarking, RAG + fine-tuning, and a pre-filter alerting layer. Also built an agentic legal contract drafting system (Jurisagent) using LangGraph/LangChain with deterministic multi-agent control flow, structured outputs, and reliability-focused evaluation/telemetry.”
Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps
“AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.”
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps
“Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.”
Intern Generative AI Engineer specializing in RAG and multi-agent systems
“Built and deployed a production RAG-based multi-agent chatbot during an internship to help consultants answer client questions and guide users through new IT systems with step-by-step instructions. Demonstrates hands-on experience with LangGraph/LangChain/Google ADK, unstructured document parsing and chunking for RAG, and a reliability-first approach to agent workflows (metrics, fallbacks, human-in-the-loop, guardrails).”
Mid-level AI/ML Engineer specializing in Databricks, MLOps, and real-time fraud detection
“ML/LLM engineer building production, real-time fraud detection for financial transactions using a two-tier architecture (fast ML + GPT) to deliver both low-latency decisions and analyst-friendly risk explanations. Experienced orchestrating end-to-end retraining, drift monitoring, and automated model promotion with Databricks Jobs/Workflows and MLflow, and partnering closely with fraud analysts to tune alerts, thresholds, and dashboards.”
Junior Data & Machine Learning Engineer specializing in MLOps and NLP
“ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).”
Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision
“ML/AI engineer with production experience at Shopify and Intel, building a deep learning product ranking system that lifted add-to-cart ~14% and serving real-time similarity search via FAISS+Redis at <20 ms latency at massive scale. Also deployed computer vision models to 100+ retail edge locations using Docker/Ansible/k3s with zero-downtime rollouts, and applies strong MLOps practices (A/B testing, canary/shadow, observability) plus performance optimization (OpenVINO, INT8).”
Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision
“Built an ‘AskMyVideo’ system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”
Senior AI/ML Engineer specializing in GenAI agents and LLM workflows
“LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.”
Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems
“Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.”
Junior AI Engineer specializing in LLM pipelines, RAG, and computer vision
“Built and deployed an on-prem, HIPAA-compliant LLM pipeline for oncology-focused clinical note generation and decision support, emphasizing grounded differential diagnosis and explainable reasoning via RAG to reduce hallucinations. Also created a LangGraph-based multi-agent academic paper search system integrating Tavily, arXiv, and Semantic Scholar with an orchestrator that routes tasks to specialized sub-agents.”