Pre-screened and vetted.
Executive Business Operations & Pricing Leader specializing in SaaS and software-enabled services
Mid-level AI Engineer specializing in LLM applications and personalization
Executive AI Platform & Innovation Leader specializing in Banking, GenAI, and AI Governance
Director-level Data Engineering Leader specializing in AI/LLM platforms and real-time data systems
Mid-level Machine Learning Engineer specializing in MLOps, RAG, and real-time personalization
Senior Full-Stack Engineer specializing in cloud-native SaaS and AI/ML integration
Junior Data Scientist & Data Engineer specializing in ML and scalable data pipelines
Executive Engineering Leader (CTO/VP) specializing in platform scaling and video streaming
Director of Software Engineering specializing in enterprise Data, ML & AI platforms
“Former Walmart Director of Software Engineering who left in March 2025 to build products for clients. Recently delivered an LLM/RAG-based UNSPSC classification solution for an MRO client using a multi-stage retrieval + web search + prompt-engineering workflow, and has led large-scale retail forecasting initiatives and high-severity cloud-migration incidents end-to-end.”
Mid-level Full-Stack Software Engineer specializing in cloud microservices and AI integration
“Backend/distributed-systems engineer with Uber experience building real-time telemetry and safety signal pipelines. Strong in Kafka-based event-driven architectures, low-latency processing under peak load, and production reliability via monitoring, retries, and fallback logic; has Docker/Kubernetes and CI/CD deployment experience.”
Mid-level Machine Learning Engineer specializing in LLMs, fairness, and healthcare ML
“ML/NLP practitioner with a master’s thesis focused on domain-adaptive knowledge distillation for LLMs (Llama 2/Sheared-LLaMA), showing improved perplexity and ROUGE-L on biomedical data. Also built real-world data linking and search systems: integrated ClinicalTrials.gov with FAERS using fuzzy matching + embeddings, and delivered an LLM-powered FAQ recommender at Hyperledger using sentence-transformers, FAISS, and fine-tuning to mitigate embedding drift.”
Intern Software Engineer specializing in data engineering and LLM/RAG systems
“Built and productionized enterprise LLM/RAG systems, including a Boeing internal solution that gave 400+ program managers conversational access to 1M+ rows of schedule data, with strong emphasis on governance, reliability, and reducing hallucinations in tabular domains. Also has experience running developer-focused workshops (UC Berkeley computer architecture) and partnering with customer-facing stakeholders to drive adoption of a compliance-sensitive NLP product (SEC-aligned) at Penserra.”
Intern Data Scientist specializing in marketing analytics and data engineering
“AI/LLM practitioner with internships at Dell Technologies and Roche who built and deployed a healthcare-focused "Doctor LLM" by fine-tuning Meta Llama 3.2 on healthcaremagic.json, emphasizing safety guardrails to prevent harmful medical advice. Experienced in productionizing AI workflows with monitoring, testing, and orchestration (Airflow, Kubernetes), and in delivering AI-agent-driven competitive landscape insights to non-technical business stakeholders.”
Junior Computer Vision & ML Engineer specializing in autonomous perception systems
“LLM/RAG engineer who built a production-style multi-agent orchestrator for resume-to-recommendation workflows (PDF ingestion through screening and recommendations), emphasizing prompt tuning and strict JSON output contracts. Currently building a RAG application for an NGO using Airflow (DAGs + embeddings) and tackling messy, missing/imbalanced data; has hands-on retrieval stack experience (FAISS/HNSW, bge embeddings) and uses rigorous evaluation metrics for groundedness and hallucination control.”
Senior Data Engineer & Render Tools Developer specializing in VFX and render farm pipelines
“Real-time simulation/physics engineer who optimized character effects and cloth for the "Infinity" game by implementing and profiling multiple ODE integrators, including pioneering the largely undocumented Parker-Sochacki method (optimized 5 of 7 simulations; >30% speedup on a particle system). Also built SPH fluid solvers in Unity (C#) and created Grafana/Python Dash dashboards to analyze latency/throughput, with strong interest in applying math/physics and tooling to soccer/football gameplay.”
Junior ML Engineer specializing in Generative AI and LLM applications
“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.”
Junior AI Engineer specializing in healthcare analytics and compliance AI
“Built and shipped a production LLM-driven multi-agent platform (ciATHENA) at CustomerInsights.AI to automate analytics/ML/compliance workflows in healthcare and life sciences. Implemented LangGraph/LangChain orchestration with strong backend-style rigor (schemas, Pydantic validation, retries, auditability) and optimized latency/cost while keeping the system usable for non-technical users via guided natural-language interactions and structured/visual outputs.”
Principal Platform Engineer specializing in AI-driven document automation
“Backend engineer who built an event-driven, multi-service resume review system integrating AI/ML workflows. Demonstrated strong performance engineering (e.g., composite indexing dropping latency from ~600ms to ~35ms and major P95 gains) and high-throughput pipeline optimization via caching, batching, and worker concurrency tuning, with multi-tenant isolation implemented across DB and Redis.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference
“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”
Mid-level Machine Learning Engineer specializing in deep learning, MLOps, and real-time inference