Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in NLP, fraud detection, and MLOps
“LLM/ML platform engineer with hands-on experience taking an LLM document summarization prototype into a production-grade service on AWS EKS, emphasizing low-latency inference, drift monitoring, and safe CI/CD rollouts (canary + rollback). Strong in real-time debugging of agentic/RAG systems (tracing, retrieval/index drift fixes) and in developer enablement through practical workshops (Docker/Kubernetes/FastAPI) plus pre-sales support via demos and benchmarks to close pilots.”
Mid-level Generative AI Engineer specializing in LLM apps, RAG, and MLOps
“LLM/GenAI engineer with US Bank experience building a production financial-document intelligence platform using LangChain/LangGraph, GPT-4, and Amazon OpenSearch. Delivered a RAG-based assistant for compliance/audit teams with grounded, cited answers, focusing on reducing hallucinations and latency, and deployed securely on AWS (SageMaker/EKS) with CI/CD and evaluation tooling (LangSmith, RAGAS).”
Mid-level AI/ML & MLOps Engineer specializing in cloud AI infrastructure and GenAI
“At HPE, led and deployed an enterprise-grade LLM document intelligence platform for an insurance client, automating extraction from highly variable PDFs/scans/emails and raising field accuracy from 74% to 93%. Built a LangChain/Pinecone/OpenSearch RAG framework to cut hallucinations by 37% and operationalized LangSmith evals in CI, driving a 41% triage accuracy lift and >33% fewer incorrect resolutions while partnering closely with claims operations via HITL workflows.”
Mid-level ML/AI Engineer specializing in NLP, RAG pipelines, and financial risk & fraud systems
“Built and shipped LLM/RAG systems in finance and startup settings, including a Goldman Sachs document intelligence platform that indexed ~8TB of regulatory filings and delivered cited, conversational answers with <2s latency—cutting compliance research by ~4.5 hours per batch. Also developed LangChain-based agent workflows at Finta to automate CRM enrichment and investor lookup with strong testing, tracing (LangSmith), privacy guardrails, and auditability.”
Mid-level Data Engineer specializing in cloud data platforms, Spark, and streaming pipelines
“Data/MLOps engineer (Cognizant background) who owned an AWS/Airflow/Snowflake healthcare transactions pipeline processing ~8–10M records/day and cut pipeline/data-quality incidents by ~33%. Also built and deployed a production FastAPI model-inference service on Kubernetes (Docker, HPA) with strong observability (Prometheus/Grafana), versioned endpoints, and resilient backfill/idempotent external data ingestion patterns.”
Mid-level Machine Learning Engineer specializing in MLOps and GenAI analytics
“ML/LLM practitioner who has deployed a production RAG-based trouble-call identifier using multiple datasets (device, network, past complaints). Experienced in end-to-end MLOps (FastAPI + Docker + Kubernetes with HPA) and in evaluating/monitoring LLM behavior to reduce hallucinations, with additional applied work in forecasting/anomaly detection and churn prediction for retention campaigns.”
Mid-level Data Engineer specializing in AWS/Azure pipelines and streaming analytics
“Data engineer with experience across healthcare and geospatial risk systems, owning end-to-end pipelines from ingestion through serving on AWS/Azure stacks. Built HIPAA-compliant data quality gates and CDC for millions of daily claims, and also delivered a real-time wildfire risk platform with 20-minute refresh cycles and a 60% data accuracy lift. Strong in streaming (Kafka), Spark performance tuning, and production-grade orchestration/CI/CD (Airflow, Docker, Jenkins, GitHub Actions, Terraform).”
Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training
“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”
Junior Data Scientist specializing in healthcare ML and clinical NLP/LLMs
“Healthcare-focused LLM engineer who has built two production clinical applications: an automated structured clinical report generator from physician-patient conversations and a RAG-based chatbot for retrieving patient history (procedures, allergies, etc.). Demonstrates strong applied RAG expertise (overlapping chunking, entity dependency graphs, temporal filtering, graph RAG) to reduce hallucinations/omissions and partners closely with clinicians to automate hospital workflows.”
Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech
“ML/LLM engineer who has deployed a production LLM-powered assistant for intent classification and query routing (order recommendation/support deflection), combining BERT fine-tuning with an embedding-based retrieval layer and optimizing for low-latency inference. Experienced with end-to-end reliability practices—Airflow-orchestrated ETL, data validation/alerting, MLflow experiment tracking, and iterative improvements driven by user feedback and monitoring.”
Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services
“Finance-domain ML/LLM engineer who has shipped production systems including a RAG-based financial insights assistant with a custom post-generation validation layer that verifies atomic claims against retrieved source text to prevent hallucinations in compliance-critical workflows. Also built large-scale MLOps automation on AWS using Kubeflow + MLflow + CI/CD for fraud detection and credit risk models processing 500M+ transactions/day with a 99.99% uptime goal, and partnered closely with JP Morgan risk/compliance stakeholders on NLP-driven compliance monitoring.”
Mid Software Engineer specializing in machine learning and real-time data systems
“Hands-on implementation-focused candidate with experience owning cloud deployments and putting LLM/RAG workflows into production. They stand out for combining customer-facing deployment ownership with practical AI systems work, including retrieval tuning, hallucination mitigation, production incident response, and document-processing pipelines for messy real-world inputs.”
Junior Data Analyst specializing in ML, NLP, and cloud data pipelines
“Built and deployed a GenAI-powered PhD career intelligence platform at NYU that maps academic backgrounds to career paths and converts long academic CVs into job-ready resumes. Stands out for treating LLM systems as structured production pipelines—combining NLP extraction, embeddings, orchestration, and AWS deployment—to improve recommendation quality and cut resume preparation time by 70%.”
“Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and enterprise AI
“Built an enterprise RAG-based document intelligence system at Freddie Mac for regulatory and financial documents, helping analysts cut search time from hours to minutes while improving retrieval accuracy by ~30%. Stands out for combining LLM product delivery with compliance-grade auditability, production monitoring, and scalable Python/FastAPI service design.”
Executive engineering leader specializing in AI, cloud, and SaaS platforms
“Senior engineering executive with 8+ years leading large-scale SaaS modernization across AI, compliance, ecommerce, streaming, IoT, and travel. Has led a 150+ global engineering org, modernized seven cloud-native platforms for a $400M business, and consolidated travel systems processing $1B+ annually while staying hands-on in architecture, incident response, and AI integration.”
Mid-level AI Engineer specializing in LLMs, RAG, and production ML systems
“Built and shipped an AI-powered RAG diagnostic assistant at Ford for EV technicians, integrating GPT-based models with LangChain, FAISS, and SageMaker into real technician workflows. Stands out for combining strong production LLM architecture with practical safety guardrails, monitoring, and measurable impact: 45% better diagnostic accuracy and roughly 30 minutes saved per case.”
“AI/full-stack engineer in gaming analytics who joined Omnic.ai at a 2-person stage, helped grow with the company, and built both backend and frontend for real-time gameplay analysis products. He combines computer vision production experience with LLM/RAG systems work, and has already led 4 employees while shipping 12 models in a fast-moving startup environment.”
Mid-level Backend/Full-Stack Engineer specializing in cloud, AI, and distributed systems
“Built and shipped internal AI support systems spanning Angular/TypeScript frontends, Java/Spring/AWS backends, and Claude-powered troubleshooting workflows. Stands out for combining full-stack product delivery with practical LLM engineering, including RAG, structured outputs, production evals, and careful human-in-the-loop safety decisions. Has shipped systems serving 150-800 daily sessions at 99.5% availability while reducing repetitive support burden.”
Mid Software Engineer specializing in FinTech and ML-powered backend systems
“Backend-leaning full-stack engineer who has shipped real-time, customer-facing dashboards and ticketing/payment features at Freshworks and Global Payments. Strong in Python API design (Django/Flask/FastAPI) and React/TypeScript UIs, with hands-on experience scaling PostgreSQL for high transaction volumes and operating services on AWS, including incident response and HIPAA-aligned security controls.”
Mid-level Software Engineer specializing in backend systems and applied AI
“Full-stack/product-minded engineer with strong React/TypeScript depth who has owned systems end-to-end, from UI architecture to backend services and data design. At Qualcomm, they built both a telemetry dashboard and an ML model drift monitoring platform for 20+ edge models, including post-launch tuning that cut false positives by 60%. They also demonstrate 0→1 startup execution by solo-building a production RAG document Q&A platform with JWT auth, Stripe gating, and sub-300ms retrieval.”
Mid-level Data Analytics & ML Engineer specializing in NLP, LLMs, and cloud data platforms
“At KPMG, built and productionized a secure RAG-based LLM assistant that lets business and risk stakeholders query data warehouses in natural language, reducing dependence on data engineers for ad-hoc analysis. Demonstrates strong production rigor (Airflow orchestration, CI/CD, containerization), retrieval/embedding tuning (rechunking, semantic abstraction for structured data), and reliability controls (confidence thresholds, refusal behavior, monitoring and canary evals).”
Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal generation
“Open-source JavaScript contributor focused on performance and maintainability in data visualization libraries—refactored legacy ES5 into modular ES6, added tests/docs, and delivered ~30% faster load times with positive community adoption. Also optimized a React dashboard (~40% load-time reduction) and took ownership in an ambiguous AI product initiative by setting milestones, standing up an initial ML pipeline, and shipping a prototype in ~6 weeks that became the basis for production.”