Pre-screened and vetted.
Mid-level Data Analytics & ML Engineer specializing in NLP, LLMs, and cloud data platforms
“At KPMG, built and productionized a secure RAG-based LLM assistant that lets business and risk stakeholders query data warehouses in natural language, reducing dependence on data engineers for ad-hoc analysis. Demonstrates strong production rigor (Airflow orchestration, CI/CD, containerization), retrieval/embedding tuning (rechunking, semantic abstraction for structured data), and reliability controls (confidence thresholds, refusal behavior, monitoring and canary evals).”
Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal generation
“Open-source JavaScript contributor focused on performance and maintainability in data visualization libraries: refactored legacy ES5 into modular ES6, added tests/docs, and delivered ~30% faster load times with positive community adoption. Also optimized a React dashboard (~40% load-time reduction) and took ownership of an ambiguous AI product initiative by setting milestones, standing up an initial ML pipeline, and shipping a prototype in ~6 weeks that became the basis for production.”
Junior Full-Stack Machine Learning Engineer specializing in production ML systems
“Software engineer who owned end-to-end delivery of customer-facing agricultural forecast reporting (crop yield/health) and iterated quickly via rigorous edge-case testing and customer feedback. Also built an internal ML training platform (TypeScript/React + Flask/Python + MongoDB) used by every developer, with architecture designed to stay responsive under heavy compute load.”
Mid-level Data Scientist / ML Engineer specializing in streaming ML systems for healthcare and IoT
“ML/GenAI engineer with production experience building an LLM-powered governance layer that summarizes verified drift/performance signals into validation reports and release notes, designed for regulated environments with de-identification and non-blocking fallbacks. Strong Airflow-based orchestration background across healthcare and finance, integrating Databricks/Spark and MLflow for scalable retraining/monitoring. Demonstrated ability to partner with non-technical healthcare operations teams to deliver actionable risk-scoring outputs via dashboards and automated reporting.”
Intern Full-Stack & ML Engineer specializing in AI products and data-driven optimization
“Worked in a startup building an automated carbon accounting/climate reporting product, partnering with client IT and internal cross-functional teams to ship features and train end users. Also has software engineering internship experience debugging complex multi-workflow systems, including uncovering a significant (~20%) data annotation error by instrumenting and testing each workflow step.”
Mid-level GTM & Product Marketing Strategist specializing in B2B SaaS and GenAI
“Growth creative marketer who led end-to-end experimentation for Kahana’s Oasis agentic browser launch, repositioning it as a task-specific “productivity multiplier” and validating the message via structured A/B tests across Meta, LinkedIn, and landing pages. Reported performance lift included CPA reductions (23% Meta, 17% LinkedIn) and a 28% ROAS increase, with a repeatable modular framework for rapid creative iteration and hands-on direction of UGC creators and editors.”
Mid-level Machine Learning Engineer specializing in LLM agents, RAG, and MLOps
“Built production LLM systems, including a real-time customer feedback analysis and workflow automation platform using RAG and multi-agent orchestration with confidence-based human escalation, addressing privacy and legacy integration challenges. Also automated ML operations with Airflow/Kubernetes (e.g., daily churn model retraining), cutting retraining time to under 30 minutes, and demonstrates a rigorous testing/monitoring approach plus strong collaboration with non-technical stakeholders.”
“ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).”
Mid-level Data Engineer specializing in cloud ETL/ELT and healthcare analytics
“Healthcare-focused data engineer/ML practitioner with experience at Lightbeam Health Solutions and Humana building production entity-resolution and semantic similarity pipelines across EMR, lab, and claims data. Uses NLP/ML (spaCy, scikit-learn, BioBERT/LightGBM) plus Snowflake/Airflow and vector search (Pinecone) to improve linkage accuracy (reported 90%) and semantic match quality (reported +12–15%), while reducing manual cleanup by 40%+.”
Senior AI/ML Engineer specializing in Generative AI, RAG, and agentic systems
“GenAI/LLM ML engineer (currently at Webprobo) building an enterprise GenAI platform with document intelligence and automation on AWS and blockchain. Has hands-on experience with RAG, LLM evaluation tooling, and orchestrating production LLM workflows with Apache Airflow, plus deep exposure to reliability challenges in globally distributed/edge deployments. Also partnered with business/marketing stakeholders at a banking client to deliver an AI-driven customer retention insights solution.”
Mid-level Generative AI & Machine Learning Engineer specializing in agentic LLM systems
“Built and deployed a production agentic LLM knowledge assistant that answers complex questions over internal documents, APIs, and databases using a RAG architecture (FAISS/Pinecone) and LangChain/LangGraph orchestration. Emphasizes production-grade reliability and hallucination control through grounding, confidence thresholds, validation, retries/fallbacks, and full observability (logging/metrics/traces) with continuous evaluation and feedback loops.”
Mid-level AI/ML Engineer specializing in LLM agents, RAG, and ML systems
“At Inertia Systems, built a production LLM-powered ingestion pipeline that converts heterogeneous sources (PDF/JSON/IFC/SQL and financial tables) into standardized text and uses GraphRAG to construct a knowledge graph with verified dependency relationships. Also has hands-on HPC orchestration experience with SLURM, including creating a custom wrapper process manager to improve resource utilization under restrictive scheduling policies.”
Mid-level Machine Learning Engineer specializing in NLP, LLMs, and applied research
“New grad SDE (AI/ML) who built and deployed an LLM-based chatbot framework used across technology, military, and banking contexts, focusing on model selection tradeoffs (latency vs accuracy) through prototyping and benchmarking. Also built a multi-agent "eaterybot" using PyAutoGen/AutoGen with a manager agent orchestrating specialized agents, and emphasizes rigorous testing with adversarial/edge-case datasets and hallucination checks.”
Mid-level Data Scientist specializing in Generative AI, MLOps, and cloud data platforms
“GenAI/ML engineer (CitiusTech) who has deployed production RAG systems for compliance/operations document Q&A, using Pinecone + FastAPI microservices on Kubernetes with strong monitoring and guardrails. Also built a GenAI-powered incident triage/routing solution in collaboration with non-technical stakeholders, achieving 35% faster response times and 40% fewer misclassified tickets, and has hands-on orchestration experience with Airflow and AutoSys.”
Intern Software Engineer specializing in cloud, big data, and test automation
“Internship experience at Qualitest building and deploying an LLM-powered test automation system that reduced manual test creation and improved efficiency (~40%). Demonstrates strong production engineering for LLM systems (timeouts/retries/monitoring/caching, prompt optimization, batching) and has scaled workflows to 100+ concurrent jobs; also has orchestration experience with AWS Step Functions and Kubernetes.”
Mid-Level Software Development Engineer specializing in full-stack and cloud-native systems
“Backend engineer who has shipped production LLM-powered features, including an AI-assisted developer tool on AWS (Spring Boot) and a blog platform capability using embeddings + Elasticsearch for semantic retrieval and LLM-generated summaries/recommendations. Demonstrates practical tradeoff management (quality/latency/cost), guardrails to reduce hallucinations, and evaluation-driven iteration using real user queries and observability via ELK.”
Intern Software Engineer specializing in Applied AI and LLM systems
“Built and deployed a production RAG-based conversational "Yelp for AI tools" at Search-AI Inc., focused on personalized, explainable AI tool recommendations from thousands of options. Emphasizes production-grade reliability and performance (hybrid retrieval, async two-stage pipelines) and is also building a multi-agent orchestration layer (MAgIc) with typed memory and controlled coordination policies.”
Junior Machine Learning Engineer specializing in NLP and biomedical entity extraction
“Built and deployed a production LLM-powered biomedical knowledge extraction pipeline that processed millions of papers to identify tools/techniques and produce a unified knowledge graph via active learning NER (Prodigy + spaCy transformers) and entity linking (Bio-tools/Wikidata). Addressed hard NLP engineering challenges like WordPiece span-offset alignment and scaled inference over ~1.5M documents using batching/caching, containerized services, async workers, and orchestration with Prefect/Airflow.”
Mid-level Data Scientist / ML Engineer specializing in secure GenAI and financial compliance
“Built a production "sentinel insight engine" to tame information overload from millions of product reviews and support transcripts, combining Azure OpenAI (GPT-3.5) zero-shot classification with a fine-tuned T5 summarizer to generate weekly actionable product insights. Demonstrated strong MLOps/production engineering by adding drift monitoring with embedding-based detection, integrating REST with legacy SOAP/queue-based CRM via FastAPI middleware, and scaling reliably on Kubernetes with HPA.”
Mid-level Data Scientist & AI Engineer specializing in RAG, agentic AI, and production ML
“AI/data engineer who built a production LLM-powered schema drift detection system (LangChain/LangGraph) to catch semantic data changes before they break downstream analytics/ML. Deployed on AWS with Docker/S3 and implemented an LLM-as-a-judge evaluation framework to improve trust, reduce hallucinations, and control false positives/alert fatigue. Collaborated with non-technical risk/business analytics stakeholders at EY by delivering human-readable drift explanations that improved confidence in financial analytics dashboards.”
Senior Data & Backend Engineer specializing in cloud data pipelines and LLM/RAG systems
“Data engineer with end-to-end ownership of large-scale retail and clinical data ingestion/processing on AWS, including real-time streaming and batch pipelines. Delivered measurable outcomes: 20M daily transactions processed, latency cut from 4 hours to 5 minutes, ~70% fewer failures, and 120+ pipelines running at 99.8% reliability with full audit compliance.”
Director-level AI/ML & Computer Vision Engineer specializing in robotics and multimodal AI
“Candidate is not currently pursuing entrepreneurship (no business plan and no capital raised) and is not familiar with the VC/accelerator landscape. They show pragmatic, problem-first thinking about evaluating startup ideas, prioritizing real customer pain points and the quality of the founding team, and are open to working for others rather than founding "at all costs."”
Junior Software Engineer specializing in AI agents and backend systems
“Backend/AI workflow engineer who built a production event-personalization service (FastAPI + AWS Lambda) and solved real-world reliability/latency issues with deterministic routing, caching, and query/index optimization. Also built an end-to-end Gmail-based job application tracking agent using a lightweight RAG pipeline with Gemini, strong guardrails (Pydantic schemas, confidence thresholds), and offline regression tests to prevent drift and hallucination-driven data corruption.”