Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in Generative AI, LLMs, and RAG for financial services
Senior Software Engineer specializing in distributed systems and cloud infrastructure
Senior AI/ML Engineer specializing in GenAI, MLOps, and healthcare analytics
Senior Data Engineer specializing in Cloud Data Platforms and Generative AI
Mid-level Data Engineer specializing in GCP, Spark, and healthcare analytics
Mid-level Data Scientist / AI-ML Engineer specializing in Generative AI and LLM applications
“Built a production GenAI-powered analytics assistant to reduce reliance on data analysts by enabling natural-language Q&A over Databricks/Power BI dashboards, backed by vector search (Pinecone/Milvus) and a Neo4j knowledge graph, including multimodal support via OpenAI Vision. Demonstrates strong real-world LLM reliability engineering with strict RAG, LangGraph multi-step verification, and Guardrails/custom validators, plus broad orchestration and production monitoring experience (Airflow, ADF, Step Functions, Kubernetes, Prometheus/CloudWatch).”
Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps
“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints, especially low-latency retrieval, using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”
Senior Python Full-Stack Developer specializing in cloud, data engineering, and ML/GenAI
“Backend/data engineer with hands-on production experience building FastAPI services on AWS and implementing strong reliability/observability (CloudWatch, ELK, correlation IDs, alarms). Has delivered serverless + container solutions with IaC (CloudFormation/Terraform) and Jenkins CI/CD, and built AWS Glue/PySpark pipelines into S3/Redshift with schema-evolution and data-quality safeguards; demonstrated large-scale SQL tuning (45 min to 3 min on a 500M-row workload).”
Principal Cloud & Infrastructure Engineer specializing in reliability and regulated data platforms
“Founder/CTO-type startup leader who has built cloud-native data and AI platforms from scratch while owning both technical vision and product direction. Brings rare end-to-end startup experience spanning zero-to-one building, growth-stage execution, and fundraising from early stage through exit, with a strong ability to translate technical complexity into clear investor narratives.”
Mid-level GenAI/ML Engineer specializing in LLM agents and RAG for Financial Services & Healthcare
“Built and deployed a production GenAI internal support agent at Bank of America (‘Ask GPS/AskGPT’) using RAG on Azure, focused on reducing escalations and improving response quality for repetitive knowledge-based queries. Demonstrates strong production LLM engineering: custom LangChain orchestration, retrieval tuning to reduce hallucinations, rigorous offline/online evaluation, and model benchmarking with dynamic routing (e.g., GPT-4 vs Claude).”
Engineering Manager specializing in secure cloud platforms and key management
“Engineering leader (Credit Karma/fintech) who built internal developer-platform frameworks at near open-source scale, including a unified key management system migrating ~450 services from on-prem HSM to a centralized KMS. Known for driving large cross-team migrations with strong safety mechanisms (canaries, shadow reads, rollback toggles) and measurable DX/ops improvements (60% fewer provisioning requests; 40% faster time-to-first-secret).”
Mid-level AI Engineer specializing in Generative AI, MLOps, and NLP for finance and healthcare
“Built and deployed a secure, production LLM-based document summarization and risk-highlighting tool for financial auditors, running inside a private Azure environment to protect confidential data. Focused on reliability (hallucination mitigation via retrieval-based prompts and source citations) and validated performance through comparisons to auditor summaries plus a user pilot, cutting review time by about half.”
Senior Full-Stack Software Engineer specializing in Python and AWS
“Backend/data engineer who has built production Python microservices (FastAPI) and AWS-native platforms for event ingestion and analytics, combining ECS/Fargate + Lambda with CloudFormation-driven environments and strong secrets/IAM practices. Experienced modernizing legacy logic with parallel-run parity validation and safe phased cutovers, and has demonstrated measurable SQL tuning wins (20–30s down to 1–2s) plus incident ownership in Glue/Step Functions ETL pipelines.”
Junior Software Engineer specializing in data platforms and full-stack development
“Software engineer with Warner Music Group experience owning and shipping analyst-facing data products (marketing/streaming data dashboards) end-to-end with high adoption through continuous stakeholder feedback. Also builds side projects with TypeScript/React and domain-driven API design, emphasizing flexibility (including swapping databases mid-development) and pragmatic microservices reliability patterns (logging, timeouts, retry backoff).”
Intern AI/ML Engineer specializing in RAG, multimodal AI, and LLM systems
“Built and shipped ‘PetPulse,’ a production AI pet-health note system that records voice notes, transcribes them, converts transcripts into structured symptom/event data, and supports grounded Q&A over a user’s notes and vet PDFs. Demonstrates full-stack LLM product execution (FastAPI + GPT-4 + Firebase), with concrete reliability/performance work (async endpoints, caching, RAG/embeddings, function calling) and user-centered iteration with a non-technical product stakeholder.”
Senior Machine Learning Engineer specializing in NLP and generative AI
“ML/AI engineer focused on production NLP and voice AI systems in the restaurant tech space, with hands-on work spanning ASR, intent classification, LLM fine-tuning, and deployment monitoring at Presto AI. They highlight a 15% improvement in full-AI ordering rate and also built a restaurant sentiment analysis product at Wisely that they say became a standout feature in the context of a $10M acquisition.”
Mid-level AI/ML Engineer specializing in MLOps, computer vision, and NLP
“GenAI/ML engineer from Lucid Motors who built and productionized an LLM-powered RAG diagnostic assistant for manufacturing and maintenance teams, deployed on AWS with Docker/Kubernetes and MLflow. Demonstrates end-to-end ownership from retrieval/prompt design to scalability, monitoring, and workflow integration via APIs, plus production ML pipeline orchestration with Kubeflow (Spark/Kafka + TensorFlow) for predictive maintenance use cases.”
“Built and deployed a production LLM-powered RAG assistant for semiconductor manufacturing failure analysis, reducing engineer triage effort by grounding outputs in retrieved evidence and gating responses with SPC + ML signals (LSTM anomaly scores, XGBoost probabilities). Experienced with LangChain/LangGraph to ship reliable, observable multi-step agents with branching/fallback logic, and evaluates impact using both technical metrics and business KPIs like mean time to triage and downtime reduction.”
Mid-level Generative AI Engineer specializing in enterprise RAG and multimodal NLP
“Built and deployed a production LLM/RAG chatbot at Wells Fargo for securely querying regulated financial and compliance documents, emphasizing low hallucination rates, explainability, and strict governance. Experienced with LangChain multi-agent orchestration plus Airflow/Prefect pipelines for ingestion, embeddings, evaluation, and retraining, and partnered closely with compliance/operations to drive adoption through demos and feedback-driven retrieval rules.”