Pre-screened and vetted.
Mid-level AI Engineer specializing in LLM applications and personalization
Executive AI Platform & Innovation Leader specializing in Banking, GenAI, and AI Governance
Director-level Data Engineering Leader specializing in AI/LLM platforms and real-time data systems
Mid-level Machine Learning Engineer specializing in MLOps, RAG, and real-time personalization
Senior Full-Stack Engineer specializing in cloud-native SaaS and AI/ML integration
Junior Data Scientist & Data Engineer specializing in ML and scalable data pipelines
Mid AI/ML Engineer specializing in LLM alignment and scalable AI systems
Executive Engineering Leader (CTO/VP) specializing in platform scaling and video streaming
Director of Software Engineering specializing in enterprise Data, ML & AI platforms
“Former Walmart Director of Software Engineering who left in March 2025 to build products for clients. Recently delivered an LLM/RAG-based UNSPSC classification solution for an MRO client using a multi-stage retrieval + web search + prompt-engineering workflow, and has led large-scale retail forecasting initiatives and high-severity cloud-migration incidents end-to-end.”
Senior AI/ML Engineer specializing in conversational and generative AI
“Built and productionized an LLM-based support assistant end-to-end, including RAG, APIs, monitoring, guardrails, and agent feedback loops. Stands out for translating GenAI prototypes into reliable production systems with structured evaluation, safety controls, and reusable Python infrastructure that improved both support quality and engineering velocity.”
Mid-level Full-Stack Software Engineer specializing in cloud microservices and AI integration
“Backend/distributed-systems engineer with Uber experience building real-time telemetry and safety signal pipelines. Strong in Kafka-based event-driven architectures, low-latency processing under peak load, and production reliability via monitoring, retries, and fallback logic; has Docker/Kubernetes and CI/CD deployment experience.”
Intern Software Engineer specializing in data engineering and LLM/RAG systems
“Built and productionized enterprise LLM/RAG systems, including a Boeing internal solution that gave 400+ program managers conversational access to 1M+ rows of schedule data, with strong emphasis on governance, reliability, and reducing hallucinations in tabular domains. Also has experience running developer-focused workshops (UC Berkeley computer architecture) and partnering with customer-facing stakeholders to drive adoption of a compliance-sensitive NLP product (SEC-aligned) at Penserra.”
Mid-level Machine Learning Engineer specializing in LLMs, fairness, and healthcare ML
“ML/NLP practitioner with a master’s thesis focused on domain-adaptive knowledge distillation for LLMs (LLaMA2/sheared LLaMA), showing improved perplexity and ROUGE-L on biomedical data. Also built real-world data linking and search systems: integrated ClinicalTrials.gov with FAERS using fuzzy matching + embeddings, and delivered an LLM-powered FAQ recommender at Hyperledger using sentence-transformers, FAISS, and fine-tuning to mitigate embedding drift.”
Junior Computer Vision & ML Engineer specializing in autonomous perception systems
“LLM/RAG engineer who built a production-style multi-agent orchestrator for resume-to-recommendation workflows (PDF ingestion through screening and recommendations), emphasizing prompt tuning and strict JSON output contracts. Currently building a RAG application for an NGO using Airflow (DAGs + embeddings) and tackling messy, missing/imbalanced data; has hands-on retrieval stack experience (FAISS/HNSW, bge embeddings) and uses rigorous evaluation metrics for groundedness and hallucination control.”
Intern Data Scientist specializing in marketing analytics and data engineering
“AI/LLM practitioner with internships at Dell Technologies and Roche who built and deployed a healthcare-focused "Doctor LLM" by fine-tuning Meta Llama 3.2 on healthcaremagic.json, emphasizing safety guardrails to prevent harmful medical advice. Experienced in productionizing AI workflows with monitoring, testing, and orchestration (Airflow, Kubernetes), and in delivering AI-agent-driven competitive landscape insights to non-technical business stakeholders.”
Senior Data Engineer & Render Tools Developer specializing in VFX and render farm pipelines
“Real-time simulation/physics engineer who optimized character effects and cloth for the "Infinity" game by implementing and profiling multiple ODE integrators, including pioneering the largely undocumented Parker-Sochacki method (optimized 5/7 sims; >30% speedup on a particle system). Also built SPH fluid solvers in Unity (C#) and created Grafana/Python Dash dashboards to analyze latency/throughput, with strong interest in applying math/physics and tooling to soccer/football gameplay.”
Junior ML Engineer specializing in Generative AI and LLM applications
“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, using transformer embeddings stored in FAISS. Focused on real-world production challenges—format normalization, retrieval quality, hallucination reduction (context-only + citations), and latency—using hybrid retrieval, quantization, and containerized deployment, and communicated the workflow to non-technical stakeholders using simple analogies.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference
“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”
Principal Platform Engineer specializing in AI-driven document automation
“Backend engineer who built an event-driven, multi-service resume review system integrating AI/ML workflows. Demonstrated strong performance engineering (e.g., composite indexing dropping latency from ~600ms to ~35ms and major P95 gains) and high-throughput pipeline optimization via caching, batching, and worker concurrency tuning, with multi-tenant isolation implemented across DB and Redis.”