Pre-screened and vetted.
Mid-level Machine Learning Engineer specializing in LLMs, RAG, and MLOps
“Built and deployed a production RAG system for financial/compliance teams using GPT-4, Claude, and local models to retrieve and summarize thousands of internal documents with strong security controls (role-based retrieval, PII masking). Drove significant operational gains (30+ hours/week saved, ~35% productivity lift, ~45% faster responses) and orchestrated end-to-end ingestion/embedding/index refresh pipelines with Airflow, S3, and SageMaker while partnering closely with compliance stakeholders on auditability and traceability.”
Mid-level Data Scientist / ML Engineer specializing in MLOps and Generative AI
“Built and deployed an AI agent to help patients navigate complex housing information by scraping and normalizing unstructured data across all 50 U.S. states, then layering a LangChain RAG system with MMR re-ranking to reduce hallucinations. Experienced in orchestrating multi-agent workflows (LangGraph/CrewAI) and production reliability practices (Pydantic-validated outputs, LLM-as-judge evals, tracing). Also delivered stakeholder-facing explainability via SHAP dashboards for a loan-approval predictive model at Welspot.”
Senior Backend Software Engineer specializing in distributed systems and cloud microservices
“Backend engineer with NTT Data experience building Java/Spring Boot services for product-data ingestion, including Kafka-based asynchronous pipelines and Redis read-through caching. Also built a personal RAG system deployed on Google Kubernetes Service using FastAPI, LangChain, and Pinecone with multi-tenant data isolation; holds a Master’s background in Machine Learning.”
Mid-level Data Scientist/MLOps Engineer specializing in NLP, GenAI, and cloud ML platforms
“AI/ML engineer who led production deployment of a multimodal (text/video/image) RAG system on GCP using Gemini 2.5 + Vertex AI Vector Search, scaling to 10M+ documents with sub-second latency and +40% retrieval accuracy. Strong MLOps/orchestration background (Kubernetes, CI/CD, Airflow, MLflow) with proven impact on reliability (75% fewer incidents) and deployment speed (92% faster), plus experience delivering explainable ML (XGBoost + SHAP + Tableau) to non-technical retail stakeholders.”
Junior AI/ML Engineer specializing in NLP, LLMs, and MLOps deployment
“Built and deployed NeuroDoc, a production-grade RAG system for PDF Q&A that delivers citation-backed answers with strong anti-hallucination guardrails. Experienced in orchestrating and scaling ML/LLM pipelines with Kubernetes, Airflow/Prefect, and PyTorch Distributed, and in building rigorous evaluation and citation-verification tooling to ensure reliability in production.”
Mid-level Machine Learning Engineer specializing in LLM apps, RAG pipelines, and MLOps
“Software engineer with connected-car/automotive production experience who owned an end-to-end remote door lock/unlock feature and introduced unit testing (GTest) plus rig/simulator validation. Also built and productionized an AI-native AWS cloud cost assistant (Lex + GPT-based LLM + Lambda + RAG/vector DB) with guardrails and achieved 94% evaluation accuracy. Helped replace a third-party solution with an in-house build, saving the company ~€9M.”
Senior Full-Stack AI Engineer specializing in Generative AI and FinTech
“Backend engineer who built and owned an AI-powered financial research product end-to-end, using a typed NestJS/GraphQL backend with LangGraph-style agent routing to produce sourced, structured financial analysis. Emphasizes finance-grade correctness (Zod validation, metric registries, unit/empty-result guardrails) while keeping latency low via batching, caching, and fast token streaming, and has led incremental migrations using strangler/feature-flag/shadow traffic patterns.”
Mid-Level Software & Infrastructure Engineer specializing in cloud, distributed systems, and AI
“Backend/data engineer who helped evolve Bitnimbus LLC’s Kafka-as-a-service MVP from a monolith into an event-driven distributed system, using careful design, parallel rollouts, and idempotent event handling to maintain correctness. Also built production-grade API and database security (JWT scopes, rate limiting, explicit Postgres policies/RLS-style controls) and improved Prometheus monitoring by eliminating false outages via heartbeat metrics and windowed aggregation.”
Executive CTO / Software Architect specializing in GenAI, FinTech, and PropTech
“Entrepreneur/fintech product builder who raised a $100K pre-seed from ex-Google/Microsoft execs and built a real-time, direct-to-vendor bill pay micropayments platform. Previously helped scale Norton LifeLock to 1M users (2003) and also created Karma LA, a fraud-resistant, verified donation system (including VA veteran verification) aimed at improving trust and conversion in giving.”
Mid-Level Software Engineer specializing in Java/Spring microservices and cloud event-driven systems
“LLM/agentic-systems practitioner who has repeatedly taken LLM-driven pricing/decision services from prototype to production using pilots, guardrails, observability, and staged rollouts. Demonstrates strong real-time incident troubleshooting (dependency timeouts, cached fallbacks) and post-incident hardening (isolation/async/alerts), and also supports go-to-market via developer workshops, technical demos, and sales-aligned POCs.”
Junior AI/ML Engineer specializing in LLM agents and RAG systems
“Backend/data engineer who built a production-ready multi-agent financial intelligence system (Mycroft) that orchestrates specialized AI agents to analyze real-time market data using FastAPI and Pinecone vector search. Brings strong security/reliability instincts (rate limiting, JWT/OAuth2, retries/backoff, health checks) and has caught high-impact data integrity issues in financial migrations (timezone normalization across global legacy systems).”
Junior Machine Learning Engineer specializing in LLMs and RAG systems
“Production-focused applied ML/LLM engineer who has deployed an LLM-powered RAG assistant and improved reliability through rigorous retrieval evaluation (recall/MRR), reranking, and guardrails that prevent confident wrong answers. Experienced running containerized ML/LLM services on Kubernetes (including AWS-managed layers) with CI/CD and observability, and has delivered a real-time predictive maintenance system using streaming sensor data and time-series anomaly detection in close partnership with maintenance teams.”
Mid-level AI & Machine Learning Engineer specializing in Generative AI and MLOps
“Built a production GPT-4/LangChain/Pinecone RAG “AI Copilot” at Northern Trust to automate financial report generation and analyst Q&A over internal structured (SQL warehouse) and unstructured policy data. Focused on real-world production challenges—grounding and latency—achieving major speed gains (seconds to milliseconds) via MiniLM embedding optimization and Redis caching, and implemented rigorous testing/evaluation with MLflow-backed metrics while aligning compliance and finance stakeholders for deployment.”
Mid-Level AI/Full-Stack Engineer specializing in agentic LLM systems and RAG
“Built and deployed Clyra.AI, an AI-driven daily scheduling product that uses a LangGraph-based multi-agent LLM pipeline (task extraction, verification, reflection) grounded with strict RAG over emails/documents/calendars and real-world signals like health metrics. Designed a custom agent orchestrator with bounded loops/termination conditions and a self-auditing verification/reflection layer to reduce hallucinations while controlling latency and cost via caching and model distillation.”
Entry-level AI/ML Engineer specializing in AWS MLOps and computer vision
“Built and shipped a production RAG question-answering system using LangChain/OpenAI, Docker, and FastAPI, then reduced hallucinations through disciplined retrieval tuning and constrained prompting. Also implemented a custom evaluation framework (QA-pair dataset) to measure faithfulness/relevance and deployed containerized ML microservices on AWS ECS/Fargate with ALB and rolling, zero-downtime updates.”
Intern AI/ML Software Engineer specializing in RAG and medical AI
“ML/LLM engineer with production experience building medical RAG systems to automate chart review, including retrieval + re-ranking and rigorous evaluation. Notably uncovered errors/bias in physician-curated ground truth by tracing answers back to source note chunks and presented evidence to an academic partner, accelerating deployment. Also built a RAG-based FAQ chatbot for a health insurance company and delivered it to non-technical stakeholders via demos.”
Mid-level Machine Learning Engineer specializing in LLMs, GenAI, and Computer Vision
“LLM/agent engineer who built a production multi-agent research automation system using LangGraph (planner, retriever with FAISS, supervisor, evaluator) with structured outputs and citation tracking for traceable reports. Emphasizes reliability and operations—LangSmith-based observability, multi-level testing, hallucination mitigation, and latency/cost controls—plus prior experience as a Computer Vision Software Engineer at Deepsight AI Labs working directly with non-technical customers.”
Mid-level Full-Stack Developer specializing in Java/Spring Boot, React, and cloud microservices
“Backend/platform engineer with hands-on ownership of Kubernetes GitOps delivery (GitHub Actions + Argo CD) on AWS EKS, including progressive rollouts and reliable rollback across interdependent microservices. Built a Python/FastAPI ML-driven document-processing service (PostgreSQL + S3) to complement existing Spring Boot systems, and implemented Kafka streaming pipelines with Schema Registry plus Prometheus/Grafana observability. Also supported a hybrid cloud-to-on-prem migration for compliance/latency with phased rollout and incremental PostgreSQL migration.”
Intern AI Researcher specializing in NLP, LLMs, and knowledge graphs
“Built and shipped “LabMate,” a production AI assistant specialized in laboratory hardware, using a weighted multi-source RAG pipeline with reranking and reasoning-focused query decomposition to handle complex user questions. Deployed on a local GPU cluster with vLLM and NVIDIA MPS (plus OCR/VLM components), and established evaluation using synthetic + public reasoning datasets while collaborating weekly with non-technical admins to align requirements and resource constraints.”
Mid-level AI/ML Software Engineer specializing in data pipelines, BI dashboards, and computer vision
“Graduate Assistant Intern at Friends University who built and deployed a GenAI-driven requirement understanding system that automates extraction and semantic grouping of technical requirements from large unstructured documents. Demonstrates strong LLM engineering rigor (golden datasets, regression testing, post-processing validation) and production-minded delivery using LangChain/LlamaIndex orchestration, FastAPI microservices, Docker, and cloud deployment.”
Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal AI on AWS
“Built and deployed a production RAG-based enterprise document intelligence platform for financial/compliance/operational documents on AWS (Spark/Glue ingestion, embeddings + vector DB, LangChain orchestration, REST APIs on Docker/Kubernetes). Deep hands-on experience orchestrating multi-step and multi-agent LLM workflows (LangChain, LangGraph, CrewAI) with strong focus on grounding, evaluation, observability, and cost/latency optimization, and has partnered closely with non-technical finance/compliance teams to drive adoption.”
Senior Data Scientist/Software Engineer specializing in ML systems and cloud DevOps
“AI software engineer with experience spanning LLM/RAG production systems and regulated fintech infrastructure. Built an end-to-end natural-language-to-SQL analytics assistant (Weaviate + GPT-4 + Supabase) shipped as an API with 92% accuracy and major time savings for non-technical users, and also owned demand-forecasting and CI/CD/containerization improvements for a Bank of America core banking deployment at Infosys.”
Mid-level AI/ML Engineer specializing in GenAI and cloud MLOps
“Applied LLMs to high-stakes domains (wildfire risk for emergency teams and loan approval via a fine-tuned IBM Granite model), with a strong focus on reliability—using RAG-based cross-validation to reduce hallucinations and continuous ingestion pipelines (MODIS satellite imagery via AWS Lambda) to keep data current. Experienced in production orchestration and MLOps-style workflows using Airflow, AWS Step Functions, and SageMaker Pipelines, and collaborates closely with analysts on KPI-driven evaluation.”