Pre-screened and vetted.
Junior Machine Learning Engineer specializing in computer vision and LLM applications
“Built and led an autonomous driving software effort for Formula Student, owning the full autonomy stack (perception, planning, control) orchestrated in ROS. Implemented stereo depth + YOLO object detection, RRT/RRT* planning, and a robust SLAM pipeline (Kalman filter, submapping) while leveraging Gazebo simulation and modern deployment tooling (Docker/Kubernetes, AWS, GitHub Actions CI/CD).”
Intern AI/ML Engineer specializing in LLMs, MLOps, and distributed training
“Founding AI engineer (June 2024) at Talon Labs who built and productionized an LLM-powered chatbot for interacting with proprietary supply-chain documents, deployed at large scale (25–100,000 users). Experienced with RAG/LLM orchestration (LangChain, LlamaIndex, Groq AI) and production ops tooling (Kubernetes, Docker, Kubeflow, Airflow), with a metrics-driven approach to evaluation, observability, and stakeholder alignment.”
Mid-level AI/ML Engineer specializing in fraud detection and risk analytics in Financial Services
“Finance-domain ML/LLM engineer who has shipped production systems including a RAG-based financial insights assistant with a custom post-generation validation layer that verifies atomic claims against retrieved source text to prevent hallucinations in compliance-critical workflows. Also built large-scale MLOps automation on AWS using Kubeflow + MLflow + CI/CD for fraud detection and credit risk models processing 500M+ transactions/day with a 99.99% uptime goal, and partnered closely with JP Morgan risk/compliance stakeholders on NLP-driven compliance monitoring.”
Mid-level AI/ML Engineer specializing in MLOps, LLMs, and real-time inference in FinTech
“ML/LLM engineer who has deployed a production LLM-powered assistant for intent classification and query routing (order recommendation/support deflection), combining BERT fine-tuning with an embedding-based retrieval layer and optimizing for low-latency inference. Experienced with end-to-end reliability practices—Airflow-orchestrated ETL, data validation/alerting, MLflow experiment tracking, and iterative improvements driven by user feedback and monitoring.”
Mid Software Engineer specializing in machine learning and real-time data systems
“Hands-on implementation-focused candidate with experience owning cloud deployments and putting LLM/RAG workflows into production. They stand out for combining customer-facing deployment ownership with practical AI systems work, including retrieval tuning, hallucination mitigation, production incident response, and document-processing pipelines for messy real-world inputs.”
“Built and deployed a production RAG-based internal knowledge assistant that let analysts query company documents in natural language, using LangChain/LangGraph with Pinecone and a FastAPI service for integration. Emphasizes reliability in production through hallucination mitigation (retrieval tuning + prompt guardrails) and measurable evaluation/monitoring (accuracy, latency, task completion, hallucination rate), iterating based on user feedback.”
Mid-level Software Engineer specializing in FinTech backend systems
“Built and deployed an AI-driven expense categorization workflow integrating OpenAI API and PGVector to automate general ledger coding. Stands out for combining LLM/embedding architecture with finance operations context, stakeholder-facing deployment ownership, and measurable impact of roughly 30%+ reduction in manual coding effort.”
Mid-level Software Engineer specializing in backend systems and applied AI
“Full-stack/product-minded engineer with strong React/TypeScript depth who has owned systems end-to-end, from UI architecture to backend services and data design. At Qualcomm, they built both a telemetry dashboard and an ML model drift monitoring platform for 20+ edge models, including post-launch tuning that cut false positives by 60%. They also demonstrate 0→1 startup execution by solo-building a production RAG document Q&A platform with JWT auth, Stripe gating, and sub-300ms retrieval.”
Mid-level AI Engineer specializing in LLMs, RAG, and production ML systems
“Built and shipped an AI-powered RAG diagnostic assistant at Ford for EV technicians, integrating GPT-based models with LangChain, FAISS, and SageMaker into real technician workflows. Stands out for combining strong production LLM architecture with practical safety guardrails, monitoring, and measurable impact: 45% better diagnostic accuracy and roughly 30 minutes saved per case.”
Mid-level Generative AI Engineer specializing in LLMs, RAG, and multimodal generation
“Open-source JavaScript contributor focused on performance and maintainability in data visualization libraries—refactored legacy ES5 into modular ES6, added tests/docs, and delivered ~30% faster load times with positive community adoption. Also optimized a React dashboard (~40% load-time reduction) and took ownership in an ambiguous AI product initiative by setting milestones, standing up an initial ML pipeline, and shipping a prototype in ~6 weeks that became the basis for production.”
“ML/GenAI engineer with recent CVS Health experience building a production RAG system over unstructured financial/research documents using LangChain, FAISS, and Pinecone, plus LoRA/PEFT fine-tuning of GPT/LLaMA for domain-aware summarization. Demonstrates strong applied MLOps and data engineering skills (Airflow/Prefect, Docker/Kubernetes, CI/CD, MLflow) and measurable impact (sub-second retrieval, ~40% better context retrieval, ~25% entity matching improvement).”
Senior AI/ML Engineer specializing in Generative AI, RAG, and agentic systems
“GenAI/LLM ML engineer (currently at Webprobo) building an enterprise GenAI platform with document intelligence and automation on AWS and blockchain. Has hands-on experience with RAG, LLM evaluation tooling, and orchestrating production LLM workflows with Apache Airflow, plus deep exposure to reliability challenges in globally distributed/edge deployments. Also partnered with business/marketing stakeholders at a banking client to deliver an AI-driven customer retention insights solution.”
Junior Software Engineer specializing in AI, RAG systems, and backend development
“Built an NYU software engineering capstone called “Smart Cash AI,” a multi-agent LLM-powered web app that curates offline-ready podcasts/articles/videos/news based on user preferences and commute schedules. Architected agent orchestration (discovery/downloader/summarizer), real-time progress via WebSockets, and an ETL normalization layer across RSS/YouTube and other sources with GUID-based deduplication, retries, and failure isolation to keep the system predictable.”
Mid-level Data Scientist specializing in Generative AI, MLOps, and cloud data platforms
“GenAI/ML engineer (CitiusTech) who has deployed production RAG systems for compliance/operations document Q&A, using Pinecone + FastAPI microservices on Kubernetes with strong monitoring and guardrails. Also built a GenAI-powered incident triage/routing solution in collaboration with non-technical stakeholders, achieving 35% faster response times and 40% fewer misclassified tickets, and has hands-on orchestration experience with Airflow and AutoSys.”
Intern-level Software Engineer specializing in AI/ML systems
“Built production LLM/RAG systems during a UPS internship, including a shipment knowledge agent used across 15+ hubs worldwide and a multi-agent PDF RAG workflow. Stands out for combining hands-on enterprise integration with rigorous evaluation, hallucination reduction, and efficient fine-tuning techniques like LoRA.”
Mid-level Software Engineer specializing in distributed backend and AI analytics platforms
“Full-stack engineer at BigCommerce who combines customer-facing deployment ownership with hands-on AI/LLM systems work. Built and launched merchant analytics and predictive inventory workflows using React, TypeScript, FastAPI, Kafka, AWS, and RAG-style architectures, and has real production experience debugging non-deterministic AI issues caused by data pipeline freshness and event-ordering problems.”
Mid-level Applied AI Engineer specializing in agentic LLM workflows
“Master’s-in-Data-Science candidate (UHV) with 4+ years in AI engineering building production LLM and multimodal systems. Designed an LLM-powered workflow automation platform using RAG over vector stores with guardrails (schema/output validation, fallbacks) and a rigorous evaluation/monitoring framework including drift tracking and shadow deployments. Experienced orchestrating large-scale vision-language pipelines with Airflow and Kubernetes (OCR, distributed training) and partnering with non-technical ops stakeholders to cut cycle time and reduce errors.”
Mid-level Machine Learning Engineer specializing in LLM systems and healthcare data automation
“React performance-focused engineer who contributed performance patches back to an open-source context+reducer state helper after profiling and fixing excessive re-renders in an enterprise project management platform at Easley Dunn Productions. Also built an end-to-end LLM-driven pipeline at Prime Healthcare to normalize millions of supply-chain records, reducing defects by 80% and saving 160+ hours/month.”
Mid-level Data Scientist specializing in Generative AI, RAG systems, and ML engineering
“AI/LLM engineer who built a production QA RAG for a University of Massachusetts faculty success initiative, cutting service tickets by 70%. Strong end-to-end RAG implementation skills (LangChain, Qdrant, hybrid/HyDE retrieval, FastAPI) with rigorous evaluation (RAGAS, LLM-as-judge) and practical handling of constraints like API rate limits and cost. Prior cross-functional delivery experience collaborating with SMEs and business owners at TCS and IBM.”
Mid-level Data Scientist specializing in NLP, LLMs, and RAG systems
“Built and deployed a production-style vision-language pipeline that generates structured medical reports from chest X-rays using BioViLT embeddings, an image-text alignment module, and BiGPT fine-tuned with LoRA, delivered via Streamlit and hosted on AWS EC2. Also collaborating experience presenting EDA findings, feature importance, and model performance to Ford managers while working with vehicle parts data at Bimcon.”
Senior Full-Stack & GenAI Engineer specializing in healthcare and financial services
“Built and deployed a production LLM-powered customer support assistant using a RAG backend in Python, focused on deflecting repetitive Tier-1 tickets and reducing resolution time. Demonstrates strong production engineering instincts around reliability (confidence scoring + human fallback), scalability/cost optimization (multi-stage pipelines), and workflow orchestration/observability (LangChain, custom DAGs, structured logging, step metrics).”
Mid-level Data Analyst and Product professional specializing in FinTech and AI applications
“Payments/product-focused operator with hands-on experience owning complex bank connectivity deployments at Paystand, including a migration that raised connection success from under 50% to 79%. Also built a production-grade multi-agent document intelligence system on AWS Bedrock for structured enterprise document extraction, combining real-world fintech domain pain points with modern LLM architecture.”
Senior AI/ML Engineer specializing in Generative AI and agentic systems
“Built and deployed an agentic RAG assistant in production to automate enterprise knowledge search and multi-step workflows with tool calling, tackling real-world issues like hallucinations, retrieval accuracy, and latency. Demonstrates strong LLMOps and orchestration depth (MLflow, Airflow, LangGraph/LangChain/LlamaIndex) plus a metrics-driven approach to agent testing/evaluation and cross-functional delivery with business stakeholders.”
Junior Backend and ML Engineer specializing in distributed systems and LLM infrastructure
“Backend engineer with strong ownership across authentication, API infrastructure, and AI-powered document workflows. They built and operated a production auth microservice supporting 10,000+ users with measurable latency and security improvements, and also shipped hackathon and applied-AI systems including legal document and medical document retrieval/Q&A products.”