Pre-screened and vetted.
Mid-level Software Engineer specializing in AI platforms and full-stack systems
“Built and shipped a production AI-powered Q&A/RAG onboarding assistant at One Community Global that unified knowledge across Notion, Google Docs, and Slack, cutting volunteer onboarding time by 45%. Demonstrates strong end-to-end ownership: LangChain agent orchestration integrated into a FastAPI backend, rigorous evaluation (200-query dataset, ~85% accuracy), and production feedback/monitoring with source-attributed answers to build user trust.”
Mid-level Software Engineer specializing in FinTech and cloud-native microservices
“Built and launched an internal AI troubleshooting assistant focused on safe, retrieval-first root cause analysis for enterprise systems, with strong attention to monitoring, fallback behavior, and post-launch iteration. Also owns full-stack product work across React and Java/Spring Boot, including high-volume financial operations workflows, and reports measurable LLM improvements such as ~30-40% latency reduction.”
Junior Software Engineer specializing in AI, LLM systems, and full-stack development
“Product-focused full-stack engineer at startup (Zippy) who shipped a production multi-agent AI system for restaurant operations plus payments workflows. Built end-to-end: RAG grounded on a Notion knowledge base, structured function-calling task routing, FastAPI/JWT multi-tenant backend, and a polished React+TypeScript owner dashboard. Has real production incident experience (duplicate Stripe webhooks) and reports ~94% task-routing accuracy under load.”
Intern LLM/GenAI Engineer specializing in RAG, agentic systems, and low-latency inference
“Interned at Larsen & Toubro where they built and deployed an agentic RAG document question-answering system to reduce time spent searching documents and improve trustworthiness. Implemented ReAct-style multi-step orchestration with LangChain/LlamaIndex plus evidence-bounded generation, grounding/citations, and rigorous evaluation—cutting latency ~40%, hallucinations ~35%, and unsafe outputs ~40% while collaborating closely with non-technical business/ops stakeholders.”
Mid-level AI/ML Engineer specializing in GenAI and predictive modeling
“Built and deployed a GPT-4-powered medical assistant for clinical staff to reduce time spent searching guidelines and EHR information, with a strong emphasis on safety and compliance. Uses strict RAG, confidence thresholds, and fallback behaviors to prevent hallucinations, and runs production-grade workflows orchestrated with LangChain/LangGraph plus Docker/Kubernetes/MLflow and monitoring for reliability and cost.”
Intern Software Engineer specializing in AI/LLMs and full-stack development
“AI/ML infrastructure-focused engineer who has built production RAG systems from scratch (Supabase/pgvector + OpenAI embeddings) and iterated using formal eval metrics to improve retrieval quality. Also debugged real-time audio issues in a LiveKit-based pipeline by correlating packet loss with VAD behavior, and has deep experience building brittle, customer-specific financial platform integrations in Python/Playwright (2FA, redirects, token refresh, rate limits).”
Mid-level Machine Learning Engineer specializing in Generative AI and RAG systems
“LLM/ML engineer who has shipped an enterprise RAG-based Q&A system (LangChain/LlamaIndex, FAISS + Azure Cognitive Search, GPT-3.5/4 via OpenAI/Azure OpenAI) to production on Docker + Kubernetes/OpenShift, tackling hallucinations, retrieval quality, latency/cost, and RBAC/IAM security. Also partnered with operations leaders to turn manual reporting into an LLM-powered summarization and forecasting dashboard driven by real KPIs and iterative stakeholder feedback.”
Junior Applied AI Engineer specializing in LLMs, RAG, and agentic systems
“Co-founded a healthcare AI startup building and deploying software directly with end users, emphasizing rapid shipping, deep user interviews, and workflow-first adoption. Has hands-on production deployment experience on AWS (including diagnosing a silent AWS App Runner failure caused by an ARM vs amd64 Docker build mismatch) and is motivated by customer-facing, travel-heavy roles to keep engineering tightly connected to real-world usage.”
Mid-level Data & AI Engineer specializing in data engineering, analytics, and LLM/RAG apps
“Built a production RAG-based “unified assistant” that consolidates siloed company documents into a single chatbot while enforcing fine-grained access control via RBAC/metadata filtering with OAuth2/JWT. Experienced orchestrating LLM workflows with LangChain/LangGraph + FastAPI (async + caching) and measuring performance via retrieval accuracy and response-time SLAs. Also delivered a churn analytics solution with dashboards and automated retention campaigns using n8n.”
Mid-level AI/ML Engineer specializing in Generative AI and healthcare data
“Built and deployed a production RAG-based document Q&A system on Azure OpenAI to help business teams search thousands of PDFs/Word files, using Qdrant vector search, MongoDB, and a Flask API. Demonstrates strong production engineering (streaming large-file ingestion, parallel preprocessing, monitoring/retries) plus systematic prompt/embedding/chunking experimentation to improve accuracy and reduce hallucinations, and has hands-on orchestration experience with ADF/Airflow/Databricks/Synapse.”
Mid-level Data Scientist specializing in ML, MLOps, and Generative AI
“ML/NLP engineer who built a RAG-based technical assistant for Caterpillar field engineers, transforming PDF keyword search into intent-based semantic retrieval across manuals, logs, sensor reports, and technician notes. Strong in productionizing data/ML systems (Airflow, PySpark) with rigorous preprocessing, entity resolution, and evaluation—delivering measurable gains in accuracy, relevance, and duplicate reduction.”
Mid-level Data & Machine Learning Engineer specializing in production ML and data platforms
“Built and deployed a production LLM system that scraped Google Maps menu photos, extracted structured prices via OpenAI, and cross-validated them against website-scraped data to automate data-quality verification at scale (replacing costly manual contractor checks). Demonstrates strong reliability instincts—precision-first prompting, output gating with image-quality metadata, and fuzzy matching/RAG techniques—plus solid orchestration (Dagster/Airflow) and observability (Sentry, Prometheus/Grafana).”
Mid-level Full-Stack Engineer specializing in cloud-native systems and LLM applications
“Customer-support/engineering background spanning Informatica PowerCenter ETL and IBM demos/workshops, with hands-on experience hardening data workflows for production (error tables/reject links, validation, restart strategies, alerting, performance tuning). Also demonstrates a clear, systems-level approach to diagnosing LLM/agentic workflow issues (prompt/RAG/tooling/memory) using instrumentation and iterative fixes, and has partnered with sales on POCs by defining success metrics and mapping solutions to customer architectures.”
Mid-Level Software Engineer specializing in Java microservices and event-driven systems
“Backend engineer on Morgan Stanley’s trade risk and compliance platform, building Java/Spring Boot microservices that validate equity and fixed-income trades at multi-million-events/day scale. Shipped an LLM-assisted trade exception analysis feature using RAG over internal policy documents and trade history, with production-grade guardrails (confidence thresholds, audit logs, human-in-the-loop) and measurable performance wins (~30–35% faster reporting) through PostgreSQL tuning and Redis caching.”
Mid-level AI/ML Engineer specializing in healthcare analytics and MLOps
“AI/ML engineer at Cigna Healthcare building a production, HIPAA-compliant LLM-powered clinical insights platform that summarizes unstructured medical notes using a fine-tuned transformer + RAG on AWS. Demonstrates strong end-to-end MLOps and cloud optimization (distillation, Spot/Lambda/Auto Scaling) with quantified outcomes (~28% accuracy lift, ~40% less manual review, ~25% lower ops cost) and strong clinician-facing explainability via SHAP and dashboards.”
Mid-level Generative AI Engineer specializing in LLM systems and RAG
“Currently at Huntington Bank, built a production-grade RAG system that helps business/operations teams get grounded answers from large volumes of internal enterprise documents. Owns ingestion and FastAPI backend, tuned hybrid BM25+vector retrieval and chunking for relevance, and evaluates reliability with metrics and observability (LangSmith, CloudWatch, Prometheus/Grafana) while partnering closely with non-technical stakeholders.”
Junior Software Engineer specializing in Full-Stack and ML for FinTech
“Full-stack engineer with fintech trading-platform experience who shipped and operated a real-time portfolio P&L/performance feature end-to-end (React + Node/WebSockets + MongoDB) on AWS, including significant performance tuning under peak trading load. Also built a Spark-based trading analytics pipeline with idempotency and reconciliation for auditability, and has a personal React/TS + Node/Express project (Artsy) with JWT auth and schema-evolution practices.”
Senior Data Engineer specializing in Palantir Foundry and Snowflake for regulated industries
“Data engineer focused on high-volume transaction pipelines (2M+ per day) using Snowflake/Snowpipe, Spark/PySpark, Kafka, and Airflow, with a strong emphasis on schema/data-quality enforcement and reliability improvements. Also built a greenfield compliance-focused RAG solution, using CloudWatch monitoring and adding ingestion validation to prevent malformed OCR documents from degrading search quality.”
Junior Software Engineer specializing in cloud microservices and database systems
“Grad student who co-developed a safety-oriented mental health LLM consulting agent using RAG + Gemini and Hugging Face emotion detection to assess user crisis level and adapt responses. Implemented a key reliability improvement for CRISIS scenarios by bypassing generative output and returning direct, emotionless, knowledge-base guidance to seek immediate real-world help.”
Senior AI/ML Engineer specializing in Generative AI and RAG
“ML/NLP practitioner at Morf Health focused on unifying fragmented healthcare data by linking structured patient/encounter records with unstructured clinical notes. Has hands-on experience with transformer embeddings, vector databases, and domain fine-tuning, plus rigorous evaluation (precision/recall) and human-in-the-loop validation with clinical SMEs to make pipelines production-grade.”
Mid-level AI/ML Engineer specializing in fraud detection, NLP, and MLOps
“Built a production real-time fraud detection and customer-support automation platform at Citibank, tackling extreme class imbalance (reported ~1:5000) and strict latency constraints. Combines hands-on MLOps (Airflow, Kubernetes, MLflow; Snowflake/Spark/S3 integrations; CI/CD model promotion) with cross-functional delivery to Risk & Compliance focused on interpretability and reducing false positives.”
Mid-level AI Solutions Engineer specializing in enterprise GenAI and automation
“Built and shipped multiple production LLM/agentic systems, including an agentic RAG NL-to-SQL analytics app that cut manual reporting from 9 hours/week to 15 minutes by grounding on schema-aware retrieval and robust fallback/monitoring. Also implemented a LangChain supervisor-orchestrated enterprise IT automation agent that routes requests for search, identity validation, and action execution, and created a RAG search tool spanning Jira/Confluence/SharePoint for operations stakeholders.”
Mid-level AI/ML Engineer specializing in healthcare, fraud detection, and recommender systems
“Healthcare-focused applied ML/LLM engineer who has deployed production systems including an LLM medical documentation assistant that summarizes unstructured EHR notes into physician-ready structured outputs. Experienced building secure, compliant pipelines (PHI minimization, RBAC, encryption) and scaling via Docker/Kubernetes/Azure ML, plus orchestrating ETL/ML workflows with Airflow and Kubeflow; also built an LLM-driven clinical coding assistant at Centene with measurable performance metrics.”
Intern Data Scientist specializing in ML, NLP, and MLOps for healthcare and enterprise AI
“Built a production multi-cloud LLM-driven IT ticket automation system using LangGraph, Azure + Pinecone RAG, and an Ollama-hosted LLM on AWS, with Terraform-managed infra and PostgreSQL audit/state tracking for reliability. Also partnered with UW School of Medicine & Public Health students to deliver a glioma survival risk-ranking model, translating clinical feedback into practical pipeline improvements (imputation, site harmonization) and stakeholder-friendly visualizations.”