Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in cloud-native GenAI and RAG systems
“Built and productionized an internal GenAI chatbot that makes company policy/SOP knowledge instantly searchable, using a secure RAG architecture on AWS (Bedrock/Titan embeddings/OpenSearch Serverless, Textract/Lambda/S3 ingestion, Claude 3 Sonnet). Demonstrates strong MLOps/orchestration experience (Airflow, Step Functions with Lambda/Glue/SageMaker) and a rigorous reliability approach (RAGAS metrics, A/B testing, citation validation, monitoring), including collaboration with compliance stakeholders via review dashboards.”
Mid-level AI/ML Engineer specializing in predictive modeling, data pipelines, and RAG systems
“Built and productionized an LLM-powered internal knowledge search system in a regulated environment, using embeddings/vector DB retrieval with strict grounding and confidence gating to reduce hallucinations. Reported ~45% accuracy improvement over keyword search and implemented end-to-end orchestration, monitoring, CI/CD, and incremental re-indexing to manage latency and data freshness while driving adoption with business stakeholders.”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“Built and deployed a production RAG pipeline at PNC Financial Services to let risk/compliance analysts query millions of internal financial documents in natural language, reducing manual search and speeding regulatory validation. Demonstrates deep practical experience with large-scale document ingestion/OCR cleanup, retrieval performance tuning (hierarchical indexing, caching), and LLM reliability controls (grounding, citations, abstention), plus cloud orchestration on Azure and AWS.”
Mid-level AI/ML Engineer specializing in financial risk, fraud detection, and NLP
Mid-level Data Scientist / Machine Learning Engineer specializing in NLP and computer vision
Mid-level Data Scientist / AI/ML Engineer specializing in Generative AI and healthcare analytics
Mid-level Data Science & AI Engineer specializing in LLMs and cloud ML platforms
“Built and deployed an LLM-powered mental health therapy assistant at AppHealth that segments users by stress level and delivers personalized, non-medical guidance. Implemented healthcare-focused safety guardrails (secondary LLM output filtering) and a multi-agent router workflow validated via statistical tests and therapist review, then scaled training/inference on AWS (EC2/Lambda/DynamoDB) with Kubernetes.”
Mid-level Data Scientist specializing in ML, LLMs, and Azure MLOps
“Cloud/ML engineer with production deployment experience on Azure (Dockerized models, managed APIs, data pipelines) who has repeatedly stabilized unreliable systems—e.g., taking an API-driven analytics pipeline from ~60% to 98% reliability and an Azure ML service from ~80% to 97% by addressing rate limits, container memory, and gateway timeouts. Also built an explainable contract-risk model for entertainment bookings (Transformers + SHAP) and integrated it into a legacy booking system via a Flask REST API, plus prior IoT work at Nissan processing CAN bus sensor streams for diagnostics/anomaly insights.”
Intern AI/Data Scientist specializing in LLMs, RAG, and MLOps
“Internship project at Builder Market: built an end-to-end production multimodal LLM application that estimates renovation/replacement costs from appliance photos (CLIP embeddings) or text descriptions, combining fine-tuning with agentic RAG. Focused heavily on real-world performance constraints—latency and cost—using parallel agent workflows, model routing to smaller/open-source models, re-ranking, and retrieval chunking, and collaborated closely with CEO/co-founders to deliver the solution.”
Mid-level Data Scientist / ML Engineer specializing in healthcare predictive analytics and NLP
“Built and deployed a real-time hospital readmission risk prediction system at NYU Langone Health, combining structured EHR data with BERT-based NLP on clinical notes and serving predictions to clinicians via Azure ML and FHIR APIs. Emphasizes production reliability and clinical trust through SHAP-based explainability and robust healthcare data preprocessing, and reports a 22% reduction in 30-day readmissions.”
Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics
“Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.”
Mid-level Data Scientist specializing in cloud ML, MLOps, and predictive analytics
“NLP/ML engineer with hands-on healthcare and support-ticket text experience, building clinical-note structuring and semantic linking systems using spaCy, BERT clinical embeddings, and FAISS. Emphasizes production-grade delivery (Airflow/Databricks, PySpark, Docker, AWS/FastAPI/Lambda) and rigorous validation via clinician-labeled datasets, retrieval metrics, and user feedback.”
Junior AI/ML Engineer specializing in NLP, LLMs, and MLOps deployment
“Built and deployed NeuroDoc, a production-grade RAG system for PDF Q&A that delivers citation-backed answers with strong anti-hallucination guardrails. Experienced in orchestrating and scaling ML/LLM pipelines with Kubernetes, Airflow/Prefect, and PyTorch Distributed, and in building rigorous evaluation and citation-verification tooling to ensure reliability in production.”
Entry-level AI/ML Engineer specializing in AWS MLOps and computer vision
“Built and shipped a production RAG question-answering system using LangChain/OpenAI, Docker, and FastAPI, then reduced hallucinations through disciplined retrieval tuning and constrained prompting. Also implemented a custom evaluation framework (QA-pair dataset) to measure faithfulness/relevance and deployed containerized ML microservices on AWS ECS/Fargate with ALB and rolling, zero-downtime updates.”
Mid-level AI/ML Engineer specializing in healthcare ML and generative AI
“AI/LLM engineer at Humana who built and deployed a HIPAA-aware RAG system for clinical record retrieval, cutting search time dramatically and improving retrieval efficiency by 30%. Experienced with Spark-scale data preprocessing, QLoRA fine-tuning, LangChain orchestration, and MLflow+SageMaker integration, with a strong testing/evaluation discipline (A/B tests, human eval) to hit 95%+ accuracy and production latency targets.”
Mid-level AI Backend Engineer specializing in Python, LLM/RAG, and healthcare/insurance platforms
“AI Backend Engineer in MetLife’s claims technology group who built and deployed a production LLM-based decision support system that helps claim adjusters quickly find relevant policy rules from long PDFs and historical notes. Designed it as multiple production-grade services with retrieval-first guardrails, continuous validation, and Airflow-orchestrated pipelines for ingestion, embeddings, and vector index updates to keep the system reliable as policies and data evolve.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“Built a production RAG-based healthcare chatbot to retrieve patient medical documents spread across multiple platforms, reducing manual and error-prone searching. Implemented semantic search with custom embeddings (Hugging Face) and Pinecone, deployed via FastAPI/Docker on AWS SageMaker with MLflow tracking, and optimized fine-tuning cost using LoRA while orchestrating retraining pipelines in Airflow.”
Mid-level Machine Learning Engineer specializing in LLM platforms and robotic perception
“Built and shipped a production multi-agent personal financial assistant at AlphevaAI on AWS ECS, combining FastAPI microservices, Redis/SQS orchestration, and Pinecone-based hybrid RAG (semantic + BM25) to ground financial guidance. Improved routing accuracy with an embedding-based SetFit + logistic regression intent classifier feeding an LLM router, and optimized UX with live streaming plus cost controls via model tiering and caching.”
Mid-level AI/ML Engineer specializing in GenAI, MLOps, and anomaly detection
“LLM/MLOps engineer who has shipped a production RAG-based technical documentation assistant (FastAPI) cutting manual review by 45%, with deep hands-on retrieval optimization in Pinecone/LangChain (HNSW, hybrid + multi-query search, caching). Also brings healthcare domain experience: building Airflow-orchestrated EHR pipelines and delivering predictive maintenance solutions built for FDA auditability, with SHAP/LIME explainability surfaced in Power BI.”
Mid-level AI Engineer & Data Scientist specializing in LLMs, RAG, and multimodal systems
“LLM/GenAI engineer who built a production AI-powered credit risk policy summarization and compliance alerting platform at HCL Tech, focused on factual accuracy and auditability for a financial client. Implemented a multi-retriever LangChain RAG architecture with citations-only prompting, fallback agents, and human-in-the-loop legal review—cutting manual review time by 35% and scaling to 12 teams.”