Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps
“Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.”
Junior Data & Machine Learning Engineer specializing in MLOps and NLP
“ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).”
Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision
“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”
Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision
“ML/AI engineer with production experience at Shopify and Intel, building a deep learning product ranking system that lifted add-to-cart ~14% and serving real-time similarity search via FAISS+Redis under <20ms latency at massive scale. Also deployed computer vision models to 100+ retail edge locations using Docker/Ansible/k3s with zero-downtime rollouts, and applies strong MLOps practices (A/B testing, canary/shadow, observability) plus performance optimization (OpenVINO, INT8).”
Principal Software Engineer specializing in AI/LLM platforms, payments, and healthcare systems
“Engineering player-coach who recently shipped an agent-based workflow to extract key info from unstructured web data (browser agents + CDP) and populate daily digests/calendars, owning architecture through testing. Also built a Flask-based LLM evaluation and regression testing system using G-Eval/Confident AI dashboards, and applies a rigorous, research-driven approach to selecting third-party tools with stakeholder buy-in; has healthcare ops/onboarding workflow experience at Vivio Health.”
Intern-level Software Engineer specializing in GenAI, RAG, and backend systems
“AI/LLM engineer focused on shipping production-grade agents that automate support, sales intake, and ERP-connected workflows. Stands out for combining strong orchestration and guardrails with measurable business outcomes, including 45% faster support handling, ~$1.2M annual savings, 18% higher customer satisfaction, and 99.5%+ reliability in production.”
“Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.”
Junior AI Engineer specializing in fraud detection, credit risk, and LLMs in FinTech
“AI engineer with production experience building a high-accuracy (98%) fraud detection system operating at real-time latency (1–2s) over millions of transactions, using a multi-model pipeline approach to meet performance constraints. Also implemented Airflow-orchestrated workflows (DAGs, retries, alerts) to replace brittle cron scripts and is currently pursuing a master’s project on real-time ASL-to-text conversion.”
Intern Data Scientist specializing in generative AI and forecasting
“ML/NLP practitioner working across healthcare and business/finance use cases: currently fine-tuning a domain-specific Llama 3.1 model for safe reasoning over EHRs/clinical notes using RAG + RL/DPO and RAGAS-based evaluation. Has built UMLS-driven entity normalization pipelines with quantified quality gains and developed embedding/vector-DB systems (FAISS) for semantic matching and forecasting/recommendation applications at Aurora AI and Banxico.”
Intern Software Engineer specializing in ML/NLP and LLM applications
“Full-stack AI/LLM engineer who has deployed a production LLM backend (Mistral 14B) on GKE to auto-transform datasets and generate runnable ML training pipelines, addressing hallucinations, schema mismatch, latency, and burst scaling with caching/prompt compression and HPA. Also has internship experience (Splunk, BlackOffer) delivering data automation and 10+ Power BI dashboards for non-technical stakeholders with measurable efficiency gains.”
Senior AI/ML Engineer specializing in Generative AI and agentic multi-agent systems
“Built and shipped a production LLM-powered multi-agent RAG system to automate complex internal support workflows, integrating tool execution (SQL/APIs) with validation guardrails to reduce hallucinations. Optimized for real-world latency and cost via model routing, caching, and async parallel tool calls, and enforced reliability with CI-gated golden test sets derived from anonymized production queries.”
Mid-level Generative AI Engineer specializing in LLM fine-tuning, RAG, and agentic systems
“Built and deployed a production multi-agent RAG system at JPMorgan Chase to automate regulated credit analysis and compliance clause discovery across large internal policy/document libraries. Implemented LangGraph-based supervisor orchestration with structured state management (Azure OpenAI) to support long-running, resumable workflows, plus hybrid retrieval + re-ranking and guardrails for reliability. Strong at evaluation/observability (trace logging, LLM-judge, HITL) and at communicating results to non-technical stakeholders via Power BI embeds and Streamlit prototypes.”
Intern AI/ML Engineer specializing in Generative AI and applied machine learning
“New graduate with hands-on LLM work building a RAG pipeline (HNSW, lexical reranking/boosting, ReAct) and optimizing it through ablation to dramatically reduce latency. Also building a modular personal assistant with a custom wake word model, router-driven agent selection, and integrations like Spotify with secrets managed via .env.”
Mid-level Generative AI Engineer specializing in enterprise LLM and healthcare AI solutions
“Built and owned an end-to-end LLM-powered fraud investigation assistant that automated case summaries and risk analysis, cutting analyst investigation/documentation time by 40%. Stands out for translating RAG concepts into a production-grade internal platform with strong evaluation, monitoring, and reusable Python service architecture that improved both analyst trust and engineering velocity.”
Senior AI/ML Engineer specializing in LLMs, NLP, and enterprise conversational AI
“Built and owned a production conversational AI platform for a healthcare contact center, including RAG-based agent assist, hybrid retrieval, safety guardrails, and production monitoring. Stands out for combining LLM product delivery with strong operational rigor, driving a reported 25-30% improvement in handling time in a sensitive healthcare environment.”
Mid-level AI/Analytics Product & Data Professional specializing in LLM and dashboard automation
“Built and shipped open-source LLM/RAG systems, including a generative AI assistant grounded on ~30,000 scraped university web pages, improving response accuracy ~30% by moving from TF-IDF-only retrieval to a hybrid sentence-transformer approach with fallback controls. Also partnered with non-technical leadership at Securi.ai to deliver real-time predictive analytics dashboards (Elasticsearch + Jira/ServiceNow) that reduced project overhead by 18%.”
Mid-level AI/ML Engineer specializing in LLM agents and RAG systems
“LLM/agentic systems builder at Verizon who deployed a LangGraph-orchestrated multi-agent ticket-automation platform with RAG (FAISS) to replace brittle rule-based bots. Improved routing correctness by ~30–40%, hit ~300ms latency targets via model routing, and reduced ops workload by ~60% through tight iteration with non-technical stakeholders and strong testing/observability practices.”
Mid-Level Full-Stack Software Engineer specializing in AWS cloud and microservices
“Backend/LLM engineer who built a production-critical Amazon Bedrock + RAG correction and compliance layer for employee communications, integrating tightly with existing Spring Boot/AWS microservices to reduce manual review while keeping outputs explainable and auditable. Also designed an event-driven system processing 10M+ events/day (SQS/Lambda/DynamoDB/Elasticsearch) and handled on-call incidents with strong observability and reliability patterns (idempotency, retries, hotspot mitigation).”
Mid-level AI/ML Engineer specializing in Generative AI, Conversational AI, and RAG systems
“Built and shipped a production enterprise RAG knowledge assistant that returns grounded, cited answers and uses confidence-based fallbacks (clarifying questions/abstention) with monitoring and compliance controls for sensitive data. Implemented end-to-end agent orchestration (function calling, structured JSON, state, retries/rate limits) plus eval/feedback loops, and achieved a reported 30–40% improvement in knowledge-task completion time while reducing hallucinations via retrieval improvements.”
Mid-level AI/ML Engineer specializing in NLP, Generative AI, and predictive analytics
“GenAI/LLM engineer who architected and deployed a production RAG “research assistant” for JPMorgan Chase’s regulatory compliance team, focused on safety-critical behavior (mandatory citations, refusal when evidence is missing). Deep hands-on experience with LlamaIndex, Pinecone, Hugging Face embeddings, LangGraph agent workflows, and metric-driven evaluation (golden sets, TruLens), including a reported 28% relevancy lift via cross-encoder re-ranking.”
Mid-level Machine Learning Engineer specializing in LLMs and AI products
“Applied ML/LLM engineer currently building AppleCare’s production chat recommender, owning the full lifecycle from transcript cleaning and fine-tuning through distributed deployment, monitoring, and iterative improvement. Their work delivered >10% copy-count improvement, 5% lower modification rate, 60% cost reduction, and $1.1M profitability in 2025, and they also created a reasoning-data generation approach that enabled a reasoning model and a judge model that cut eval time by over 99%.”
Senior Data Scientist / Generative AI Engineer specializing in fraud, risk, and MLOps
“Built and deployed a production LLM/RAG fraud investigation system to replace manual investigator workflows, combining transaction data, historical cases, and policy documents with agent-style steps and LoRA fine-tuning. Demonstrates strong reliability engineering (grounding, citations, abstention paths), performance optimization (retrieval/indexing/caching), and end-to-end MLOps orchestration using Azure ML Pipelines/MLflow plus Kubernetes/Argo with canary and rollback deployments.”
Mid-level AI/ML Engineer specializing in GenAI, RAG, and enterprise data platforms
“Built and shipped a production LLM-powered RAG assistant for enterprise internal document search (PDFs, knowledge bases, structured data), addressing real-world issues like noisy documents, hallucinations, and latency with grounded prompting, retrieval-confidence fallbacks, and performance optimizations. Also partnered with compliance and business teams at JPMc to deliver a solution aligned with regulatory constraints, supported by monitoring, feedback loops, and systematic evaluation.”
Senior AI/ML Engineer specializing in LLMs, GenAI, and MLOps
“AI/ML engineer (Cognizant) who built a production, real-time credit card fraud detection platform combining deep-learning anomaly detection with an LLM-based explanation layer. Strong focus on regulated deployment: addressed class imbalance and feature drift, and added guardrails (SHAP/structured inputs, fine-tuning on analyst reports, rule-based validation) to keep explanations accurate and compliant. Orchestrated the full pipeline with Airflow + Databricks/Spark and used MLflow/Prometheus plus A/B and shadow deployments for measurable reliability.”