Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps
“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and scalable GPU inference
Principal Machine Learning Scientist specializing in GenAI, LLMs, and RAG
Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps
Senior Machine Learning Engineer specializing in LLM inference and GPU infrastructure
Mid-level Machine Learning Engineer specializing in generative AI, NLP, and MLOps
Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference
Senior Machine Learning Engineer specializing in GenAI, NLP, and recommendation systems
Mid-Level Software Engineer specializing in cloud infrastructure and full-stack web development
“Backend engineer at Electric Hydrogen who built a serverless device-log ingestion and processing platform in Python/Flask, scaling throughput (4x peak ingestion) while keeping sub-300ms API latency. Strong in Postgres/SQLAlchemy performance (partitioning, materialized views) and production ML integration (ONNX model served via FastAPI microservice with async batch inference, Redis feature caching, and drift monitoring via S3/Lambda). Experienced designing secure multi-tenant systems with schema-per-tenant isolation and KMS-backed encryption.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and scalable inference
“Backend/retrieval-focused engineer with production experience at Perplexity building a large-scale real-time Q&A system using retrieval-augmented generation, emphasizing low-latency, high-quality answers through ranking, context optimization, and caching. Also has orchestration experience from both product-facing LLM pipelines and large-scale infrastructure workflows at Meta, and has partnered with non-technical stakeholders to align AI trade-offs with business goals.”
Staff-level Machine Learning Engineer specializing in LLMs and MLOps for Financial Services
“Machine learning/NLP practitioner at J.P. Morgan who led development of a production RAG system and an entity resolution pipeline for complex financial data. Deep hands-on experience with embeddings (Sentence-BERT), vector search (FAISS/pgvector), LLM fine-tuning (LoRA/PEFT), and rigorous evaluation (human-in-the-loop + A/B testing) backed by strong MLOps on AWS (Docker/Kubernetes, MLflow, Prometheus/Datadog).”
Mid-level AI/ML Engineer specializing in LLM alignment, safety, and scalable inference
“Built and productionized an AWS-hosted, Kubernetes-orchestrated RAG assistant that enables natural-language Q&A over internal document repositories with grounded answers and citations. Demonstrates strong applied LLM engineering: hallucination mitigation, hybrid retrieval + re-ranking, and rigorous evaluation via benchmarks and A/B testing, plus real-world scaling of compute-heavy inference with dynamic batching and monitoring.”
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems
“ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.”
Mid-level AI/ML Engineer specializing in recommender systems, fraud detection, and LLMs
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multi-agent systems
Senior AI/ML Engineer specializing in personalization, recommendations, and forecasting
Mid-level Generative AI & Machine Learning Engineer specializing in LLMs and RAG
Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multimodal deep learning
“ML/LLM engineer who has built and productionized a large multimodal LLM pipeline end-to-end—fine-tuning a 20B+ parameter model with distributed/FSDP training and deploying on Kubernetes via Triton for ~5x throughput. Strong focus on reliability and safety (monitoring with SHAP, guardrails, A/B testing) with reported ~22% relevance lift and reduced harmful/incorrect outputs, plus experience orchestrating ETL/retraining workflows with Airflow across S3/Snowflake/RDS.”
Mid-level AI Engineer specializing in Generative AI and MLOps
“Built and deployed a production LLM-powered clinical support assistant at BJC HealthCare (RAG + transformer) to answer patient questions, summarize clinical notes, and support appointment workflows. Implemented PHI-safe data pipelines (Spark/Hadoop/Kafka) with automated scrubbing, dataset versioning, and audit logs, and runs the system on Docker/Kubernetes with Pinecone vector search while partnering closely with clinical operations staff.”
Mid-level Data Scientist specializing in recommender systems, NLP, and real-time ML pipelines
“AI/LLM engineer who built and productionized an internal RAG-based knowledge system that ingests diverse sources (PDFs, Markdown, Slack), scaled retrieval with distributed FAISS and parallel ingestion, and reduced hallucinations via re-ranking, grounding prompts, and post-generation validation. Also has hands-on orchestration experience with Airflow and Kubernetes for reliable ETL/model pipelines, monitoring, and staged rollouts; reports ~15% accuracy improvement and adoption as the primary internal knowledge tool.”
Mid-level Machine Learning Engineer specializing in fraud detection and real-time personalization
“ML/LLM engineer with Stripe and Adobe experience who productionized a transformer-based Payments Foundation Model for real-time fraud detection at global scale (billions of transactions). Built petabyte-scale ETL/feature pipelines (Spark/EMR, Airflow, dbt, Kafka/Flink) and achieved <100ms multi-region inference (EKS, TorchServe, edge/Lambda, GPU/CPU routing) with strong PCI-DSS/GDPR compliance and explainability (SHAP/LIME), reporting a 64% fraud accuracy improvement.”