Pre-screened and vetted.
Director-level AI & Data Science leader specializing in GenAI, LLMs, and MLOps
“ML/NLP engineer currently working in NYC on a system that connects complex unstructured data sources to deliver personalized insights, using embeddings + vector DB retrieval and a RAG architecture (LangChain, Pinecone/OpenSearch). Strong focus on production constraints—especially low-latency retrieval—using FAISS/ANN, PCA, index partitioning, and Redis caching, plus PEFT fine-tuning (LoRA/QLoRA) and KPI/SLA-driven promotion to production.”
Staff Full-Stack Engineer specializing in AI platforms and infrastructure automation
“Backend/full-stack engineer building complex internal platforms and customer-facing demos at the intersection of infrastructure and product. Shipped a no-code Product Lifecycle Manager for manufacturing (3 manufacturers, 1000+ evolving tests) using AWS S3/SQS ingestion and extensible Postgres (EAV+JSONB) with end-to-end traceability. Also built a FastAPI-based company data intelligence platform with Okta-secured RBAC and an LLM/MCP layer for ChatGPT-like analytics over enterprise data sources.”
Mid-level AI/ML Engineer specializing in deep learning, NLP/LLMs, and MLOps
“Built and shipped a real-time oncology risk prediction system used by doctors during patient visits, trained on clinical data in AWS SageMaker and deployed via FastAPI with sub-second responses. Emphasizes clinician-trust features (SHAP explainability, validation checks) and HIPAA-compliant controls (encryption, RBAC, audit logging), plus Kubernetes-based production operations with autoscaling, monitoring, and drift/retraining workflows; collaborated closely with oncologists at Flatiron Health.”
Mid-level Full-Stack Java Developer specializing in cloud-native microservices
“Software engineer with strong compliance-domain experience who built a customer-facing compliance and reporting dashboard using React/TypeScript with Spring Boot microservices. Demonstrates mature production engineering practices—contract-first APIs, event-driven architecture (Kafka/RabbitMQ), caching (Redis), and robust CI/CD + observability (Prometheus/Grafana/ELK)—and also created a Python-based audit automation tool adopted into the standard release process.”
Principal Data Scientist specializing in NLP and Generative AI
“ML/NLP practitioner with experience building an embedding-based ad matching and search system at Vericast (BERT embeddings + similarity search) to replace a third-party taxonomy approach, evaluated via a human-curated gold standard. Also built a custom NER pipeline at Allstate for auto accident claims calls using a bidirectional LSTM and achieved 90%+ F1, with a strong emphasis on production-grade ML workflows (testing, CI/CD, orchestration, versioning, validation).”
Mid-level AI/ML Engineer specializing in Generative AI, RAG, and Conversational AI
“Built a production RAG-based GenAI copilot backend at Aetna using Python/FastAPI, GPT-4, LangChain, and Azure AI Search, deployed on AKS with Prometheus/Grafana observability. Owned the system end-to-end (ingestion through deployment) and improved peak-time reliability by addressing vector search and embedding bottlenecks with Redis caching, index optimization, and async processing, plus added anti-hallucination guardrails via retrieval confidence thresholds.”
Senior Software Engineer specializing in identity, cloud-native microservices, and reactive web apps
“Product-focused full-stack engineer with Walmart and Dell experience who built and shipped a real-time engagement dashboard end-to-end (Kafka Streams, Spring Boot, React/TypeScript/D3) used daily by business teams, moving them from next-day reports to real-time decisioning. Strong in performance/reliability (Redis caching cut latency ~40%, 90%+ test coverage, Prometheus/CloudWatch monitoring) and production operations on AWS/EKS including handling a cascading failure from a memory leak with zero-downtime rollback and redeploy.”
Junior AI/ML Engineer specializing in real-time computer vision and tracking systems
“Full-stack engineer who built and owned a production real-time computer-vision inference platform at Credence, spanning Next.js App Router/TypeScript frontend with SSE/WebSocket streaming, a Flask backend, and Postgres analytics. Demonstrated measurable performance wins (70% fewer re-renders; latency cut to ~40–50ms) and strong production rigor (durable orchestration, idempotency, observability, AWS EC2 + CI/CD) with tight post-launch UX iteration based on analyst feedback.”
Junior Machine Learning Engineer specializing in LLM systems and GPU inference
“LLM/agent engineer who shipped a production RAG-based recommendation + explanation system that replaced a traditional recommender stack, delivering ~20% CTR lift (and +8% after a reliability iteration) with strong cold-start performance. Demonstrates strong production rigor: schema-constrained generation, typed tool calling, explicit state/orchestration, deep monitoring/feedback loops, and safe integration with messy ERP inventory/order data using normalization, idempotency, and conflict-resolution guardrails.”
Intern Software Engineer specializing in edge AI deployment and distributed systems
“Full-stack engineer who built an enterprise search platform (Codlens) delivering natural-language Q&A over Jira/Slack using embeddings, vector DB search, re-ranking (RRF), and LLM responses with source grounding. Also designed and benchmarked a distributed IAM system with Postgres transaction-log replication and Raft-based quorum consistency, reporting ~253 TPS at ~60ms latency in a multi-node setup. Experience spans early-stage startups (Zetic AI, Sagwara Capital) and large-scale orgs (Akamai, Atlassian).”
Mid-level AI Engineer specializing in GenAI, NLP, and MLOps
“LLM/agentic-systems engineer with PayPal experience hardening an LLM-powered fraud support assistant from prototype to production, focusing on low-latency distributed architecture, rigorous evaluation/testing, and security/compliance. Comfortable in customer-facing and GTM contexts—runs technical demos/workshops, builds tailored pilots, and aligns sales/CS with engineering to close deals and drive adoption.”
Junior AI Engineer specializing in agentic workflows and ML platforms
“Building a production LLM/agent system for a leading US dental provider that extracts rules from payer handbooks/portals and EDI 271 responses to validate and improve patient cost estimates. Combines GCP stack (BigQuery, GKE, Cloud Run, Pub/Sub, Vertex AI) with strong agent reliability practices (observability, validator agents, grounding, PII/hallucination guardrails, confidence scoring) and has led non-technical customer stakeholders on enterprise ServiceNow↔Aha sync and AI-powered enterprise search/summarization.”
Intern Software Developer and ML Researcher specializing in medical imaging and computer vision
“AI/ML practitioner with experience spanning audio/LLM applications (built "Iota" using Whisper, tiktoken, and a local Ollama-served LLM) and healthcare ML (Facemed.ai; UChicago Radiology). Demonstrates a production-oriented mindset—focus on data/model fit, deterministic field testing, and operational safeguards—and has improved research evaluation workflows via a hash-table-based concurrent model tracking approach.”
Mid-level Research Engineer specializing in machine learning and computational neuroscience
“Master’s-level ML researcher with hands-on embodied/edge deployment experience: built a Google Glass motion-tracking system at Sandia using MobileNetV1 + LSTM trained in TensorFlow and deployed via TensorFlow Lite. Has reimplemented transformer-based research for a thesis and demonstrated strong judgment adapting quickly when upstream assumptions changed, and stays current through active reading groups and a JEPA collaboration.”
Mid-level Software Engineer specializing in embedded AI and full-stack systems
“Robotics software engineer who built and owned core navigation components for a TurtleBot in ROS/ROS2 and Gazebo, including an RRT-based planner, waypoint-to-velocity motion planning, and PID trajectory tracking. Demonstrates strong real-time debugging skills (control-loop timing under CPU load), costmap/occupancy-grid tuning, and distributed ROS2 communication design using DDS/QoS, plus Docker and CI/CD automation experience from Keysight.”
Junior AI/ML Systems Engineer specializing in LLM infrastructure and distributed training
“Built and shipped a production NMT system translating medical documentation for a rare/low-resource language, tackling data scarcity with retrieval-driven pattern matching plus dictionary/grammar- and LLM-based augmentation and validating quality with a linguistic expert. Also develops agentic LLM workflows with LangChain/LangGraph (including a deep-research style system) and has experience aligning medical AI deployments with clinician-defined risk metrics and human-in-the-loop decision making.”
Mid-level DevOps Engineer specializing in cloud automation and Kubernetes platforms
“Robotics/ML engineer who has built SO(3)-equivariant models for robotic manipulation, including custom equivariant layers and differentiable point-cloud rasterization/derasterization workflows. Also brings 2 years of DevOps experience in banking systems, automating CI/CD and infrastructure at scale (managed 180 OCI servers; reduced rebuild downtime by 80%).”
Mid-level AI/ML Engineer specializing in Generative AI, NLP, and Computer Vision
“Built an LLM-powered learning assistant (EduQuizPro/EduCrest Pro) that uses RAG over URLs and PDFs to generate quizzes, notes, and explanations for students/professors. Emphasizes production robustness—implemented dependency fallbacks (FAISS/Sentence Transformers/Gradio), CLI-safe mode, and NumPy-based indexing—along with a custom orchestration layer to keep multi-step AI workflows reliable.”
Senior Python Full-Stack Developer specializing in cloud-native microservices and data platforms
“Backend/data engineer from Oliver Wyman who built and ran production Python (FastAPI) services on AWS (ECS/Lambda/API Gateway) supporting risk modeling and regulatory reporting. Strong in reliability/observability, Glue-based ETL with data quality controls, and legacy SAS-to-Python modernization with rigorous parity validation; also demonstrated measurable SQL performance wins and cost-control improvements in serverless scaling. Based in Raleigh, NC and can travel onsite for important Bethesda-area meetings.”
Mid-level AI/ML Engineer specializing in LLM agents, RAG, and enterprise ML systems
“Built a production multi-agent recommendation/RAG system for internal data analysts to speed up weekly report creation by improving document discovery and automating report/SQL generation. Implemented LangGraph-based orchestration with deterministic agent routing, robust error handling (interrupt/resume), and metadata-driven semantic chunking for diverse PDF/document formats, plus monitoring for latency, throughput, and token/cost efficiency.”
Intern Robotics/ADAS Engineer specializing in perception, sensor fusion, and state estimation
“Robotics software engineer who built a multi-agent dense warehouse mapping system in ROS 2, including LiDAR-camera fusion SLAM, timestamp-based synchronization, and DDS-based inter-robot pose/keyframe exchange under bandwidth constraints. Also applied Gaussian Splatting for selective photorealistic dense reconstruction and optimized real-time performance with node composition, bounded queues, and QoS tuning; experienced with Gazebo/CARLA/Unity simulation and Dockerized ROS 2 deployments.”
Intern Software Engineer specializing in LLM agents and full-stack development
“Embedded C++ engineer with Bosch automotive infotainment experience, owning real-time audio middleware modules with strict latency/memory constraints. Strong in profiling/optimizing deterministic behavior, debugging hardware-specific intermittent issues, and building automated test + CI pipelines; currently ramping up on ROS2 concepts (DDS, nodes/topics/services) to transition toward robotics.”
Junior Software Engineer specializing in backend, cloud, and machine learning systems
“Built Digipulse, a university project that ingested and clustered Bluesky tweet data at scale and used Gemini to generate near-real-time topic summaries, processing 1M+ tweets per day. Also brings Intel experience with Prometheus and Kubernetes, including production monitoring and incident troubleshooting.”
Junior Machine Learning Engineer specializing in AI, computer vision, and data systems
“Built and owned an end-to-end AV operations automation and dashboarding platform for USC event operations, used daily to coordinate hundreds of live events. Delivered a React/TypeScript full-stack system integrating Smartsheet APIs with strong reliability practices (typed contracts, validation/fallbacks, safe rollouts) and experience with queue-based microservice patterns (idempotency, retries, DLQs, monitoring).”