Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and scalable GPU inference
Mid-level AI/ML Engineer specializing in LLMs, RAG, and production MLOps
Executive growth and operations leader specializing in Enterprise SaaS and AI
Mid-level AI/ML Engineer specializing in FinTech risk and fraud systems
Mid-level AI/ML Engineer specializing in LLM training, RAG, and low-latency inference
Mid-level AI/ML Engineer specializing in fraud detection and clinical LLM assistants
“Built and deployed a production clinical support LLM assistant at Mayo Clinic using a LangChain-orchestrated RAG architecture (Llama 2/PaLM) over de-identified clinical records, integrating BigQuery with Pinecone for semantic retrieval. Focused on healthcare-critical reliability by reducing hallucinations through grounding, implementing HIPAA-aligned privacy controls (Cloud DLP, VPC Service Controls), and running structured evaluations with clinician feedback.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and MLOps
“ML/LLM engineer who built a production RAG system (GPT-4 + FAISS + FastAPI) to deliver fast, grounded answers from proprietary documents, optimizing for sub-200ms latency and high-concurrency scale. Strong MLOps/observability background: drift monitoring with Prometheus + Streamlit, automated retraining via Airflow, Kubernetes autoscaling, and MLflow-managed model lifecycle, plus inference cost reduction through quantization and structured pruning.”
Senior AI/ML Engineer specializing in Generative AI, NLP, and RAG systems
“ML/NLP engineer focused on production-grade data and search/recommendation systems: built an end-to-end pipeline that connects unstructured customer feedback with product data using TF-IDF/BERT, Spark, and AWS (SageMaker/S3), orchestrated with Airflow and monitored for drift. Also has hands-on experience with entity resolution at scale and improving search relevance via BERT embeddings, FAISS vector search, and domain fine-tuning validated with precision@k and A/B testing.”
Mid-level AI/ML Engineer specializing in NLP, computer vision, and MLOps
Mid-Level Full-Stack Software Engineer specializing in FinTech and cloud-native AI systems
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multi-agent systems
Mid-level AI/ML Engineer specializing in GPU-accelerated LLM and vision systems
Senior AI/ML Engineer specializing in personalization, recommendations, and forecasting
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and GPU-accelerated cloud systems
Mid-level AI/ML Engineer specializing in LLMs, ranking systems, and MLOps
Principal Data Scientist / AI Engineer specializing in healthcare-native AI platforms
Mid-level AI/ML Engineer specializing in LLMs, RAG, and multimodal deep learning
“ML/LLM engineer who has built and productionized a large multimodal LLM pipeline end-to-end—fine-tuning a 20B+ parameter model with distributed/FSDP training and deploying on Kubernetes via Triton for ~5x throughput. Strong focus on reliability and safety (monitoring with SHAP, guardrails, A/B testing) with reported ~22% relevance lift and reduced harmful/incorrect outputs, plus experience orchestrating ETL/retraining workflows with Airflow across S3/Snowflake/RDS.”
Mid-level AI Engineer specializing in Generative AI and MLOps
“Built and deployed a production LLM-powered clinical support assistant at BJC HealthCare (RAG + transformer) to answer patient questions, summarize clinical notes, and support appointment workflows. Implemented PHI-safe data pipelines (Spark/Hadoop/Kafka) with automated scrubbing, dataset versioning, and audit logs, and runs the system on Docker/Kubernetes with Pinecone vector search while partnering closely with clinical operations staff.”
Mid-level AI & ML Engineer specializing in NLP, LLMs, and scalable ML systems
“AI/ML engineer with experience spanning Accenture healthcare NLP systems, academic research, and Apple on-device LLM integration. Stands out for owning regulated production pipelines end-to-end—from HIPAA-compliant clinical NLP and EHR integrations to incident prevention, experiment tracking, and optimized on-device inference with LLaMA 3.”