Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLMs, RAG, and fraud/risk analytics in Financial Services
“Built and shipped a production-grade GenAI Fraud & Compliance Investigation Copilot for a large US bank, integrating OCR docs, structured data, and prior case history to generate grounded, regulator-friendly summaries and red-flag highlights. Demonstrates strong end-to-end LLM systems engineering (LangGraph/LangChain, hybrid retrieval with FAISS+BM25, guardrails/citations, streaming/latency optimization) plus rigorous evaluation and close partnership with compliance stakeholders.”
Mid-level Software Engineer specializing in AWS, full-stack development, and AI data systems
“Backend engineer who built a Python-based data profiling/statistics platform processing up to 50M rows and ~300 metrics, using a DAG execution model, multithreading, and smart caching to cut processing time by up to 70%. Also improved PostgreSQL query performance from 12s to 2s via indexing/query rewrites, integrated an LLM (LangChain + OpenAI) for explainable “chat with the pipeline” functionality, and designed an AWS EC2+SQS architecture for scalable, isolated per-user processing.”
Mid-level AI/ML Engineer specializing in NLP, MLOps, and Generative AI
“Built and deployed a production generative AI chatbot at NVIDIA using LangChain + GPT-3 integrated with internal data sources, cutting response time nearly in half and improving CSAT by ~12 points. Also delivered LLM-driven QA tools by fine-tuning Hugging Face transformer models and deploying via an AWS-based pipeline (Lambda/Glue/S3) with orchestration (Airflow/Step Functions), CI/CD, Kubernetes, and monitoring (MLflow/Splunk/Power BI).”
Mid-level Software Engineer specializing in ML platforms and cloud-native backend systems
“Software engineer with experience at Google and the City and County of San Francisco building production AI systems, including a RAG-based internal support chatbot and ML-driven ticket priority tagging. Has scaled data/ML platforms with Airflow on GCP (1M+ records/day, 99.9% SLA) and deployed multi-component systems with Docker and Kubernetes (GKE), using modern LLM tooling (LangChain/CrewAI, Claude/OpenAI, Pinecone/ChromaDB, Bedrock/Ollama).”
Staff/Lead Data Scientist specializing in Generative AI, NLP/LLMs, and MLOps
“Lead Data Scientist (10+ years) with recent work in healthcare data: built production pipelines that unify EHR, genomics, and clinical notes using NLP (spaCy/BERT/BioBERT) and scalable Spark-based processing. Also led development of domain-specific LLM/NLP systems for chatbots and semantic search, deploying models via FastAPI/Flask and improving retrieval with FAISS-backed, fine-tuned clinical embeddings and RAG-style workflows.”
Mid-level AI/ML Engineer specializing in Generative AI and MLOps
“GenAI/LLM engineer and architect who built and deployed a production generative AI financial forecasting and scenario analysis platform at McKinsey, leveraging Claude (Anthropic), LangChain, Airflow, MLflow, and AWS SageMaker. Demonstrates strong LLMOps/MLOps rigor (monitoring, drift detection, automated retraining) and deep experience implementing global privacy controls (GDPR, differential privacy, audit trails) while partnering closely with finance executives and legal/IT stakeholders.”
Junior AI/ML Engineer specializing in production LLM systems and RAG
“LLM/document AI engineer who owned a production-grade contract extraction pipeline at CORAMA.AI, ingesting PDFs and dynamic JavaScript sites from 1,000+ government sources. Built a hybrid deterministic+LLM system with two-phase prompting, Pydantic guardrails, confidence scoring, and human-in-the-loop review—cutting error rates from ~35% to <5% and processing 50k+ documents at ~95% accuracy. Also built clinician-in-the-loop orchestration in research, reducing manual labeling time from 3–4 hours to ~50 minutes.”
Senior Data Scientist / ML Engineer specializing in GenAI, LLMs, and NLP
“ML/NLP engineer focused on production GenAI and data linking systems: built a large-scale RAG pipeline over millions of support docs using LangChain/Pinecone and added a LangGraph-based validation layer to cut hallucinations ~40%. Also built scalable PySpark entity resolution (95%+ accuracy) and fine-tuned Sentence-BERT embeddings with contrastive learning for ~30% relevance lift, with strong CI/CD and observability practices (OpenTelemetry, Prometheus/Grafana).”
Mid-level AI/ML Engineer specializing in NLP, RAG, and MLOps
“Built a production LLM/RAG-based “model excellence scoring” system at Uber to automatically evaluate hundreds of ML models, standardizing quality assessment and cutting evaluation time from days to minutes on GCP. Also delivered an NLP document classification solution for insurance claims at Globe Life, partnering closely with compliance/operations and improving routing accuracy from ~85% manual to 93% with the model.”
Junior Implementation Manager / Solution Engineer specializing in AI, ERP integrations, and predictive maintenance
“LLM/agentic workflow practitioner (Continuum AI) who productionized an LLM system for manufacturing RMA intake and warranty claims by moving from a brittle prompt to a modular pipeline with RAG, function-calling extraction, deterministic validation, and strong observability. Also diagnosed and fixed an agentic ticket-triage misrouting issue by tracing failures to retrieval timeouts, adding guardrails/fallbacks, and implementing retries plus continuous evaluation—bringing misroutes near zero while creating a repeatable debugging playbook.”
Mid-level Software Engineer specializing in systems, storage, and machine learning
“Robotics-focused engineer who built a non-holonomic self-driving car on Raspberry Pi 5 using ROS 2, implementing sensor fusion (robot_localization EKF), 2D SLAM (slam_toolbox), custom Hybrid A*/RRT* planners, and MPC trajectory tracking. Demonstrated strong real-time debugging and performance tuning (timestamp sync, CPU contention mitigation) and is extending the platform toward CV-based plant identification and autonomous plant watering.”
Mid-Level Full-Stack Software Engineer specializing in cloud-native web platforms and AI tooling
“Built the backend for “codeGuard,” an AI-powered static code analysis platform, using FastAPI and Docker. Structured the system into API/service/execution layers and addressed heavy-workload container resource/cleanup issues via strict CPU/memory limits and a queued execution model.”
Mid-level Applied AI Engineer specializing in LLM infrastructure and model optimization
“LLM engineer who has deployed privacy-preserving, real-time workplace risk monitoring over massive enterprise chat/email streams, tackling latency, hallucinations, and extreme class imbalance with model benchmarking, RAG + fine-tuning, and a pre-filter alerting layer. Also built an agentic legal contract drafting system (Jurisagent) using LangGraph/LangChain with deterministic multi-agent control flow, structured outputs, and reliability-focused evaluation/telemetry.”
Intern Software Engineer specializing in distributed systems and security
“Built a production LLM-powered analyst assistant at Discern Security to speed up SOC investigations using a RAG pipeline over security vendor documentation (Python PDF ingestion, vector search). Demonstrates deep, security-critical LLM engineering: structure-aware chunking with custom table parsing, grounded/cited responses, prompt-injection defenses, and post-generation validation, validated via golden datasets and adversarial testing; tool is used daily by analysts.”
Senior Data Scientist specializing in GenAI agents and causal inference
“Built and deployed a production healthcare medical review agent that automates call-transcript summarization and medication reconciliation using a hybrid deterministic + LangGraph-orchestrated LLM workflow. Demonstrates strong reliability engineering (guardrails, schema validation, confidence thresholds, golden/adversarial eval, Langfuse monitoring) in a regulated environment, delivering 60% lower latency and 70%+ efficiency gains while partnering closely with care managers and operations.”
Junior AI/ML Engineer specializing in applied LLMs, security, and reinforcement learning
“Built and shipped a production LLM-powered investor research feature for a fintech product, focused on grounded answers and minimizing hallucinations. Implemented retrieval-quality and evidence-coverage gating with clear refusal fallbacks, and evaluates systems with regression tests and metrics like correct-refusal rate, hallucination rate, and latency. Comfortable orchestrating workflows with LangChain or custom Python depending on production needs.”
Mid-level Data Scientist/ML Engineer specializing in GenAI agents and MLOps
“AI/LLM engineer at Capital One who deployed a production RAG-powered fraud analysis and document intelligence platform using LangChain, OpenAI, Pinecone, Kafka, and AWS. Focused on reliability in real-time investigations via hybrid retrieval, schema-validated outputs, and LLM verification loops, reporting review-time reduction from hours to minutes and ~99% fraud detection precision.”
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and MLOps
“Built and productionized a RAG-based analytics Q&A assistant for a financial analytics team, enabling natural-language querying across 200+ datasets (SQL tables, PDFs, compliance docs, wikis) and cutting turnaround time by 60%. Deep experience delivering regulated, audit-ready LLM systems on Azure (Azure OpenAI + LangChain) with strict grounding/citations, hybrid retrieval, and AKS-based low-latency deployment, plus strong collaboration with compliance analysts and auditors via iterative Gradio demos.”
Intern Generative AI Engineer specializing in RAG and multi-agent systems
“Built and deployed a production RAG-based multi-agent chatbot during an internship to help consultants answer client questions and guide users through new IT systems with step-by-step instructions. Demonstrates hands-on experience with LangGraph/LangChain/Google ADK, unstructured document parsing and chunking for RAG, and a reliability-first approach to agent workflows (metrics, fallbacks, human-in-the-loop, guardrails).”
Junior Data & Machine Learning Engineer specializing in MLOps and NLP
“ML/LLM practitioner with production experience building a healthcare review sentiment pipeline (RateMDs) using Hugging Face Transformers plus a LangChain+FAISS RAG layer for interactive querying. Also led orchestration-driven optimization of Nike’s Fusion ETL pipeline, improving runtime efficiency by 20%, and has experience translating ML outputs into Tableau dashboards for non-technical healthcare stakeholders (e.g., readmission risk).”
Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision
“Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.”
Principal Software Engineer specializing in AI/LLM platforms, payments, and healthcare systems
“Engineering player-coach who recently shipped an agent-based workflow to extract key info from unstructured web data (browser agents + CDP) and populate daily digests/calendars, owning architecture through testing. Also built a Flask-based LLM evaluation and regression testing system using G-Eval/Confident AI dashboards, and applies a rigorous, research-driven approach to selecting third-party tools with stakeholder buy-in; has healthcare ops/onboarding workflow experience at Vivio Health.”