Pre-screened and vetted.
Junior Machine Learning Engineer specializing in LLMs, RAG, and on-device AI
“Built an "Offline Study Assistant" that runs LLM inference locally on a 5-year-old Android device using Llama.cpp and the Android NDK, achieving a 27x speedup and cutting time-to-first-token from 11 minutes to 30 seconds. Also has applied backend/API experience with FastAPI, Supabase (Auth + RLS), and production hardening of a RAG system at Hashmint using Celery and Redis to eliminate PDF-processing-related query failures.”
Junior AI/Software Engineer specializing in LLM agents, RAG, and full-stack ML systems
“Backend engineer who built an Emergency Alert System with Virginia Tech for the City of Alexandria, focusing on real-time ingestion, secure dashboards, and AI-assisted prioritization. Emphasizes high-stakes reliability with guardrails (hybrid rules+LLM, confidence-based fallbacks), scalable async processing, and defense-in-depth security (JWT/RBAC plus database row-level security).”
Mid-Level Full-Stack Software Engineer specializing in Java/Spring and React
“Full-stack engineer who has shipped a real-time social engagement feature (live messaging + personalized feeds) for a career networking platform, owning everything from WebSockets/SSE and JWT+Redis auth through Docker/Kubernetes production deployment. Also built a production Flask backend for an AI-driven movie recommendation system on AWS, with strong API design (versioning/error standards) and hands-on performance tuning (Typesense +47% query improvement, Postgres indexing, Redis caching, CloudWatch-driven incident response).”
Mid-level Conversational AI Engineer specializing in enterprise chatbots and workflow automation
“Built a production LLM/RAG document extraction and game/quiz content workflow using LLaMA 2, LangChain/LangGraph, and FAISS, achieving ~94% accuracy and reducing turnaround from hours to minutes. Demonstrates strong applied MLOps/orchestration (CI/CD, MLflow, Databricks/PySpark), robust handling of noisy/variable document layouts (layout chunking + OCR fallbacks), and practical reliability practices (human-in-the-loop routing, drift monitoring, A/B testing).”
Junior Full-Stack Software Engineer specializing in cloud-native systems and ML tooling
“New-grad backend engineer who built a real-time genome analysis pipeline, replacing a slow batch system with an event-driven distributed architecture in Python/Redis and a React progress dashboard. Reports ~6x improvement and cutting analysis time from days to hours with zero data loss under peak load, emphasizing reliability patterns like retries and idempotency plus API security (JWT/RBAC/HTTPS).”
Intern Software & AI Engineer specializing in distributed systems and LLM applications
“Stony Brook Fall 2024 capstone contributor who built a ROS2-based warehouse mobile robot prototype, owning perception and SLAM integration end-to-end. Strong in real-time robotics optimization on Jetson Orin (TensorRT/CUDA, ROS2 tracing/Nsight) and in distributed ROS2 communications (DDS discovery/QoS, MAVLink-to-ROS2 bridging), with a full simulation/testing/deployment toolchain (Gazebo, CI tests, Docker/K3s).”
Intern Data Scientist specializing in Generative AI and NLP
“Backend/AI engineer with internship experience building an AI-powered financial insights platform (FastAPI, Redis, BigQuery) and prior HCL experience leading a monolith-to-microservices refactor (Flask, Kafka) using blue-green deployments. Demonstrates strong performance/security focus (OAuth/JWT/RBAC, encryption) and measurable impact on latency, downtime, and ML model reliability; MVP was submitted to Google’s accelerator program.”
Mid-level Full-Stack Software Engineer specializing in cloud-native SaaS and microservices
“At Unar Labs, built and operationalized LLM capabilities inside a cloud-native SaaS product, emphasizing production reliability (fallbacks, observability, cost/latency/quality monitoring) and iterative improvement from user feedback. Also acts as a customer-facing technical lead—running developer demos/workshops and supporting sales through discovery, pilots/POCs, and technical walkthroughs to drive production adoption.”
Mid-level Data & Machine Learning Engineer specializing in anomaly detection and forecasting
“Built and productionized an agentic RAG assistant using Ollama + LangChain + MCP + ChromaDB to speed up and standardize access to operational knowledge from tickets and runbooks. Focused on real-world reliability: mitigated timeouts/latency with retries and concurrency limits, improved retrieval via chunking/embedding iteration, and reduced hallucinations through citation-grounding and confidence-based abstention. Also partnered with non-technical ops staff to deliver anomaly detection/monitoring by translating operational needs into model signals, thresholds, and alerting logic.”
Mid-level AI/ML Engineer specializing in LLMs, RAG, and production inference
“AI/LLM engineer who built a production resume-parsing and candidate-matching platform at Quadrant Technologies, combining agentic LangChain workflows, VLM-based document template extraction (~85% accuracy), and a hybrid RAG backend for resume-to-JD search. Notably integrated automated LLM evals and metric-based CI/CD quality gates to catch silent prompt/model regressions, and led a 3-person team across frontend/backend/testing.”
Entry-Level Backend Engineer specializing in analytics automation and cloud data pipelines
“Forward Deployment Engineer focused on application security and production integrations, with hands-on experience hardening API-driven ticketing systems (JWT/RBAC/rate limiting/log redaction) and implementing CI/CD security controls (Bandit SAST, SCA, container hardening). Strong in diagnosing peak-load production issues using logs/metrics/infra signals and driving durable fixes like adaptive throttling and backoff, while aligning engineering, business, and leadership stakeholders on risk and SLA impact.”
Mid-Level Software/AI Engineer specializing in backend systems, data pipelines, and RAG automation
“Backend engineer with experience modernizing high-traffic subscription and payment systems (TCS) by moving to event-driven Spring Boot microservices with Kafka, adding idempotency/state management to eliminate duplicate processing. Built and scaled FastAPI services for AI automation workflows (360DMMC) with versioned contracts, JWT security, and strong observability, and has led live refactors using feature flags, parallel runs, and data reconciliation.”
Junior Full-Stack Engineer specializing in web apps, cloud services, and data migrations
“Built SparkyAI, a gamified college-essay writing assistant (hackathon project at ASU in 2025) using React/styled-components, Firebase (OAuth/DB), and OpenAI APIs, with concrete scalability and performance measures like rate limiting, indexed queries, code splitting, and conversation caching. Also designed a global low-latency voice-to-LLM architecture leveraging WebRTC, regional containerized services, global load balancing, streaming STT/TTS, and end-to-end encryption with minimal logging.”
Mid-level Customer Success Engineer specializing in SaaS platform support and API integrations
“Security-focused engineer/customer-facing technical lead with SaaS platform experience at Ipsilon Lab, advising customers on API security and secure SDLC improvements. Has implemented production AppSec tooling (SAST/SCA), designed AWS least-privilege agent/scanning deployments, and led Kubernetes CI/CD security-agent integrations with Secrets Manager and PR gating. Strong track record troubleshooting complex customer integrations end-to-end (logs/metrics/traces through DB execution plans) and driving measurable stability/security posture improvements.”
Senior Site Reliability Engineer specializing in cloud observability and incident response
“Backend engineer experienced in evolving high-scale legacy on-prem systems into cloud-native, event-driven microservices on AWS/Kubernetes (noted peak traffic ~1.5M QPS). Strong focus on reliability engineering and operational excellence—SLO-driven observability, GitOps/canary rollouts, chaos testing, and preventing cascading failures (e.g., retry-storm mitigation).”
Mid-level AI Engineer & Researcher specializing in healthcare AI and multimodal LLM systems
“Backend/ML engineer focused on clinical AI transparency who built ShifaMind, an explainability-enforced clinical ML system using UMLS/MIMIC-IV/PubMed data with RAG, GraphSAGE, and cross-attention. Demonstrated strong production engineering via FastAPI API design and safe migrations (feature flags/shadow inference), plus HIPAA-aligned auth/RLS patterns; also delivered a real-time comet detection system reaching 97.7% accuracy.”
Senior Full-Stack Software Engineer specializing in SaaS platforms on AWS
“Full-stack engineer with strong DevOps/AWS experience who ships end-to-end React/TypeScript + Node/Python systems and operates them in production. Built an LLM-assisted recommendations workflow for a SaaS product with robust reliability controls (schema-validated JSON outputs, fallbacks, caching, monitoring) and measured impact via adoption, time saved, and override rates; also experienced delivering MVPs fast in early-stage startup ambiguity.”
Intern Software Engineer specializing in IAM, iOS, and AI security
“Early-career engineer who built a self-directed production-grade security scanning/analysis pipeline that normalizes multi-scanner results, correlates CVEs, and uses an LLM to generate exploit hypotheses—then hardened it for real-world reliability (timeouts, confidence scoring, feature flags, graceful degradation). Also integrated a real-time audio ML model into Discord/Zoom and debugged intermittent latency/dropouts across Python inference, virtual audio drivers, and network jitter; experienced with IAM integrations (Entra ID/Salesforce) and cloud tooling (AWS/Docker/Kubernetes).”
Mid-Level Software Engineer specializing in distributed systems and cloud microservices
“Built and productionized a RAG-based semantic search system for video-derived data, focusing on measurable success metrics (p95 latency, reliability, cost/request) and strong observability (prompt versions, retrieved docs, tool calls, token usage). Experienced in diagnosing real-time issues in LLM/agentic workflows and in supporting go-to-market efforts through tailored technical demos, rapid POCs, and post-close onboarding.”
Senior Technical Product Manager specializing in B2B/B2C SaaS and mobile products
“Built and launched his own free-to-play mobile trivia games (Unity/Xcode/Android Studio), monetizing through IAPs and multiple ad networks (AppLovin/Chartboost/AdMob). Combines hands-on implementation with growth and analytics rigor—Facebook Ads + ASO, and daily tracking of retention, crashes, conversion, and LTV:CAC—plus a clear approach to A/B testing and live events tooling.”
Senior Full-Stack Engineer specializing in React Native/Next.js and crypto analytics
“AI/LLM engineer with crypto/social-data domain experience who built an end-to-end AI trading signals and analysis backend using Next.js, Inngest, Gemini, and Supabase, including real-time progress UX and robust rate-limit/reliability handling. Also published an MCP SDK on NPM for authenticated agent tool access and built autonomous Twitter-based AI agents; strengthened reliability at LunarCrush by adding CI-gated E2E testing across web and mobile (Cypress/Maestro).”
Senior Software Engineer specializing in cloud systems and Web3 platforms
Intern Software Engineer specializing in distributed systems and cloud services