Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in predictive modeling, data pipelines, and RAG systems
“Built and productionized an LLM-powered internal knowledge search system in a regulated environment, using embeddings/vector DB retrieval with strict grounding and confidence gating to reduce hallucinations. Reported ~45% accuracy improvement over keyword search and implemented end-to-end orchestration, monitoring, CI/CD, and incremental re-indexing to manage latency and data freshness while driving adoption with business stakeholders.”
Junior Machine Learning Engineer specializing in semantic search and retrieval systems
“Built and shipped a production RAG system (“TROJAN KNOWLEDGE”) for answering questions over technical PDFs, using a 3-stage retrieval stack (BM25 + FAISS + cross-encoder) to lift F1 from 71% to 84%. Drove major performance gains with a 3-level cache (memory/Redis/disk) cutting latency from ~200ms to ~10ms, and added Prometheus/Grafana monitoring plus LangChain-based fallback logic to handle OpenAI rate limits under load.”
Senior Full-Stack Software Engineer specializing in Insurance, FinTech, and AI/ML applications
“AI/backend engineer who fine-tuned and deployed a production LLM chatbot using a LangChain + FAISS RAG pipeline, improving latency with PEFT/LoRA and driving strong business impact (40% customer adoption; 92% satisfaction). Also served as technical lead on a data aggregation system for underwriting/quoting, introducing GraphQL for more efficient, maintainable querying and applying CDC to keep cached ranking data fresh at scale.”
Junior Data Scientist / Software Engineer specializing in data pipelines and applied ML
“Built a production RAG chatbot for Worcester Polytechnic Institute that indexes 500+ webpages using FAISS + Llama 3, with strong grounding/hallucination controls (confidence thresholds and citations). Also has internship experience orchestrating multi-step ETL pipelines with AWS Step Functions and delivered a 30x faster fraud/claims triage workflow at Munich Re using association rules and stakeholder-friendly dashboards.”
Mid-level AI/ML Engineer specializing in Generative AI and RAG pipelines
“AI/LLM engineer with healthcare domain experience who built a production clinical support “chart bot” for Molina, including PHI-safe ingestion of 200k+ PDF policies, vector retrieval, and a fine-tuned LLaMA served via vLLM on ECS Fargate. Demonstrated measurable performance wins (HNSW + namespace partitioning; 30% inference latency reduction) and a rigorous evaluation/monitoring approach, while partnering closely with nurses and operations teams to shape workflows and guardrails.”
Mid-level AI/ML Engineer specializing in LLMs, MLOps, and cloud-native ML
“LLM/agent engineer at USAA who built a production GPT-4o RAG conversational assistant for financial analysts, focused on regulatory interpretation and internal documentation search. Emphasizes compliance-grade reliability with strict grounding, safe fallbacks, and full auditability via MLflow/DVC plus human-in-the-loop review; reports ~45% reduction in ticket resolution time.”
Mid-level Deployed Engineer specializing in LLM agents and enterprise cloud integrations
“LLM/agent production specialist with strong customer-facing and pre-sales chops: turns demo-grade prototypes into reliable, compliant deployments using RAG tuning, guardrails, evals in CI, and observability with staged rollouts/rollback. Known for engineering-first workshops (including live break-and-fix on retrieval misses, tool timeouts, and prompt injection) that win over skeptical senior developers and drive adoption.”
Mid-level AI/ML Engineer specializing in generative AI and MLOps
Mid-level AI/ML Engineer specializing in financial risk, fraud detection, and NLP
Mid-level Full-Stack Software Engineer specializing in GenAI and SaaS platforms
Senior Software Engineer specializing in backend, DevOps, and LLM-powered systems
“Backend-focused Python engineer who has owned production FastAPI services deployed on Kubernetes, including CI/CD (GitLab CI to ECR) and GitOps delivery via ArgoCD/Helm. Has hands-on experience with complex reliability and infrastructure work—solving data inconsistency with validation/partial-data paths, fixing K8s liveness issues via lazy loading, and supporting a phased cloud-to-on-prem migration with dual-writes and monitoring. Also built Kafka-based real-time ingestion consumers handling bursty, high-throughput traffic with async processing and topic/retention tuning.”
Junior AI/ML Engineer specializing in LLMs, RAG, and information retrieval
“Internship experience shipping production AI systems: built an end-to-end RAG platform (Python/FastAPI + LangChain/LangGraph + vector search) to answer support questions from unstructured internal docs, with a strong focus on hallucination prevention through confidence gating and rigorous offline/online evaluation. Also delivered an AI-driven personalization/analytics feature using an unsupervised clustering pipeline, iterating with PMs to align statistically strong clusters with actionable business segmentation.”
Junior Data Scientist / ML Engineer specializing in LLMs and RAG systems
“Built and deployed a production enterprise LLM-powered RAG assistant for the construction domain, enabling natural-language querying across PDFs/reports and structured sources (SQL/CSV). Implemented an agent-based routing and multi-agent orchestration approach (LangChain/LangGraph) to reduce hallucinations, improve latency, and deliver actionable, structured responses based on stakeholder feedback.”
Mid-level AI Engineer specializing in generative AI, multimodal evaluation, and agentic RAG systems
“Built and productionized an agentic LLM automation system for an insurance client to determine medication eligibility, using prompt-chaining plus a RAG pipeline over policy rules and deploying on AWS (Lambda/Step Functions, Bedrock) with a serverless architecture. Addressed major data/schema mismatch issues via a semantic matching pipeline and validated performance through human agreement scoring, A/B testing, KPI monitoring, and confidence-based human-in-the-loop review.”
Mid-level AI/ML Engineer specializing in fraud detection, recommender systems, and forecasting
“ML engineer/data scientist who built and deployed a real-time fraud detection platform at Citi on AWS SageMaker, processing 3M+ daily transactions and improving fraud response by 28%. Combines unsupervised anomaly detection (autoencoders) with ensemble models (XGBoost/Random Forest) plus Airflow/Step Functions orchestration, drift monitoring, and explainability (SHAP) to keep models reliable and compliant in production.”
Mid-level AI Engineer specializing in Generative AI, RAG systems, and fraud analytics
“Built and deployed a RAG-based student/faculty support chatbot at a university that answers from official syllabus/policy documents and now supports 4,000+ students while reducing repetitive support requests. Hands-on with LangChain, LangGraph, and CrewAI to orchestrate reliable agentic workflows, with a strong focus on testing/monitoring in production and cross-functional delivery (e.g., marketing analytics automation at Steve Madden).”
Mid-level Machine Learning Engineer specializing in NLP, Generative AI, and RAG systems
“Built and deployed a production LLM-powered phone assistant for a healthcare clinic, combining streaming STT/TTS with RAG over approved clinic documents and strict safety guardrails to prevent unverified medical advice, plus seamless human handoff. Also has hands-on Apache Airflow experience building robust daily ML/data pipelines with data validation, retries/timeouts, monitoring, and metric-gated model deployment, and iterates closely with clinic staff using real call reviews.”
“ML/LLM engineer with production experience building a RAG-based LLM support assistant (FastAPI, Redis, Kafka) with multi-layer validation and human-in-the-loop feedback loops to improve accuracy over time. Has orchestration and MLOps depth using Airflow and Kubeflow on Kubernetes (autoscaling, alerting, monitoring) and delivered measurable ops impact (40% ticket efficiency improvement) by partnering closely with customer support teams.”
Mid-level Data Scientist / ML Engineer specializing in MLOps and Generative AI
“Built and deployed an AI agent to help patients navigate complex housing information by scraping and normalizing unstructured data across all 50 U.S. states, then layering a LangChain RAG system with MMR re-ranking to reduce hallucinations. Experienced in orchestrating multi-agent workflows (LangGraph/CrewAI) and production reliability practices (Pydantic-validated outputs, LLM-as-judge evals, tracing). Also delivered stakeholder-facing explainability via SHAP dashboards for a loan-approval predictive model at Welspot.”
Mid-level Data Scientist specializing in cloud ML, MLOps, and predictive analytics
“NLP/ML engineer with hands-on healthcare and support-ticket text experience, building clinical-note structuring and semantic linking systems using spaCy, BERT clinical embeddings, and FAISS. Emphasizes production-grade delivery (Airflow/Databricks, PySpark, Docker, AWS/FastAPI/Lambda) and rigorous validation via clinician-labeled datasets, retrieval metrics, and user feedback.”
Mid-level Data Scientist/MLOps Engineer specializing in NLP, GenAI, and cloud ML platforms
“AI/ML engineer who led production deployment of a multimodal (text/video/image) RAG system on GCP using Gemini 2.5 + Vertex AI Vector Search, scaling to 10M+ documents with sub-second latency and +40% retrieval accuracy. Strong MLOps/orchestration background (Kubernetes, CI/CD, Airflow, MLflow) with proven impact on reliability (75% fewer incidents) and deployment speed (92% faster), plus experience delivering explainable ML (XGBoost + SHAP + Tableau) to non-technical retail stakeholders.”
Junior AI/ML Engineer specializing in NLP, LLMs, and MLOps deployment
“Built and deployed NeuroDoc, a production-grade RAG system for PDF Q&A that delivers citation-backed answers with strong anti-hallucination guardrails. Experienced in orchestrating and scaling ML/LLM pipelines with Kubernetes, Airflow/Prefect, and PyTorch Distributed, and in building rigorous evaluation and citation-verification tooling to ensure reliability in production.”
Mid-level Data Engineer specializing in healthcare data platforms and MLOps
“ML/NLP practitioner with healthcare payer experience at HCSC, focused on connecting messy unstructured clinical notes to structured claims/provider data to improve fraud-analytics workflows. Has hands-on experience fine-tuning transformers in AWS SageMaker, building large-scale embedding search with FAISS, and implementing robust entity resolution using golden datasets, precision/recall calibration, and production monitoring for drift.”