Pre-screened and vetted.
Mid-level AI/ML Engineer specializing in LLM, RAG, and multimodal systems
Mid-level Software Engineer specializing in cloud-native backend and distributed systems
Mid-level Data Scientist specializing in GenAI, LLMs, and MLOps
Senior Data Scientist specializing in predictive modeling and recommendation systems
Mid-level AI/ML Engineer specializing in LLM alignment and scalable AI systems
Junior Machine Learning Engineer specializing in LLMs and data pipelines
“Research Extern at Google DeepMind and former AWS Software Development Engineer Intern with a strong focus on practical, trustworthy AI engineering. Built a multi-agent RAG system for personalized news headline generation using a fine-tuned Flan-T5 model, parallel critic agents, FAISS retrieval, and style embeddings, while also leading a 3-person team on the project.”
Senior Machine Learning Engineer specializing in conversational AI and Generative AI
“ML/AI engineer with experience at Uber and Scale AI, focused on customer service automation across both classical NLP and generative AI systems. Has owned systems from experimentation through production on AWS, including LLM fine-tuning, RAG optimization, safety evaluation, and internal Python platform tooling that improved consistency and engineering velocity.”
Mid-level Software Development Engineer specializing in full-stack systems and ML
“AWS engineer who productionized an internal ML-driven data pipeline from a notebook prototype into a scalable, observable Python service (schema validation, deduplication, idempotency, safe retries, versioned transforms, CloudWatch alarms), reducing manual effort and improving data accuracy/trust. Experienced diagnosing workflow issues in real time (e.g., upstream schema changes) and partnering with account managers/support to unblock adoption of seller-facing Marketplace features by demonstrating reliability with concrete metrics.”
Junior ML Engineer specializing in Generative AI and LLM applications
“Built a production internal knowledge assistant using a RAG pipeline over large spreadsheets, PDFs, and support documents, with transformer embeddings stored in FAISS. Focused on real-world production challenges (format normalization, retrieval quality, hallucination reduction through context-only answers with citations, and latency), addressed with hybrid retrieval, quantization, and containerized deployment. Communicated the workflow to non-technical stakeholders using simple analogies.”
Intern Applied AI/Software Engineer specializing in computer vision and full-stack platforms
“Built production LLM systems focused on reliability and safety, including a plain-English deployment tool that generates validated plans and provisions to Kubernetes while preventing unsafe actions via schema enforcement and plan/execute separation. Also created multi-LLM workflows (LangGraph) and stakeholder-friendly demos at Bosch, including a PyQt/FastAPI/CUDA app comparing SAM2 vs SAMWISE for on-device object detection with intuitive UX for business users.”
Mid-level Machine Learning Engineer specializing in GPU-accelerated LLM training and inference
“ML/LLM engineer with production experience building a multi-GPU LLM inference platform using TensorRT and vLLM, achieving ~40% p95 latency reduction through batching/KV caching, quantization, and CUDA/runtime tuning. Also has end-to-end orchestration experience (Kubernetes, Airflow) and has delivered real-time fraud detection systems at Accenture in close collaboration with non-technical risk and product stakeholders.”
Senior Software Engineer specializing in AI and FinTech platforms
“Built a production LLM pipeline at Walter AI that scans massive user inboxes, identifies financial newsletters, and extracts trading strategies into structured JSON for downstream paper-trading workflows. Stands out for combining agent architecture with strong production discipline—cutting scan time from 20 to 5 minutes, reducing LLM costs by 90%, and achieving 3-second P99 latency while handling messy, inconsistent email data at scale.”
Mid-level AI Engineer specializing in machine learning and healthcare research
“Backend engineer with end-to-end ownership of scientific and AI-powered systems, including neuron imaging pipelines at Monell Chemical Senses Center and an LLM-based structured information extraction platform for Wharton and PSG. Stands out for turning messy, compute-heavy workflows into reliable production backends with measurable impact, including saving researchers over 50 hours per week.”
Junior Machine Learning Engineer specializing in generative AI and computer vision
“Built production AI features for image editing and object removal, including an agent that guides users to the right pipeline, validates inputs, refines prompts, and routes requests to GPU-backed generation services. Brings hands-on experience across multimodal control, generative model optimization, and post-launch iteration driven by failure analysis and user feedback.”
Mid-level AI/ML Engineer specializing in LLMs, MLOps, and recommendation systems
Mid-level AI/ML Engineer specializing in fraud detection and customer lifetime value modeling
Mid-level Data Analytics Engineer specializing in cloud data platforms and FinTech
Intern Software Engineer specializing in backend, cloud, and security systems
Senior AI/ML Engineer specializing in Generative AI, NLP, and LLM systems