Vetted Observability Professionals

Pre-screened and vetted.

PP

Intern Software Engineer specializing in distributed systems and security

San Jose, CA6y exp
AnyLogUniversity of Pennsylvania

Built a production LLM-powered analyst assistant at Discern Security to speed up SOC investigations using a RAG pipeline over security vendor documentation (Python PDF ingestion, vector search). Demonstrates deep, security-critical LLM engineering: structure-aware chunking with custom table parsing, grounded/cited responses, prompt-injection defenses, and post-generation validation, validated via golden datasets and adversarial testing; tool is used daily by analysts.

View profile
RV

Rucha Visal

Screened

Mid-Level Software Development Engineer specializing in distributed systems and full-stack web apps

Seattle, USA4y exp
AmazonUniversity of North Carolina at Charlotte

Software engineer who owned customer-facing, high-traffic TypeScript/React + TypeScript backend systems end-to-end, emphasizing safe velocity through feature flags, staged rollouts, observability, and rollback-ready incremental delivery. Reports shipping more frequently with fewer production incidents and faster recovery due to these guardrails.

View profile
SS

Mid-level Business Data Analyst specializing in Financial Services and Healthcare analytics

USA4y exp
VisaGeorge Mason University

Full-stack engineer (~4 years) who has owned and shipped customer-facing SaaS onboarding and a role-based real-time analytics dashboard using TypeScript/React with a modular backend. Experienced in microservices with RabbitMQ and strong observability practices (correlation IDs, structured logging, queue metrics), and built an internal deployment tracker integrated with CI/CD that replaced manual spreadsheet/Slack processes.

View profile
CD

Mid-Level Software Developer specializing in Java microservices and cloud-native systems

St. Louis, MO5y exp
EpsilonSaint Louis University

Backend engineer focused on cloud/distributed systems, deploying Java 17/Spring Boot microservices on AWS EKS with RDS and Kafka. Demonstrated strong production readiness work (DB lock mitigation, Kafka idempotency, gradual rollouts) and delivered a major latency improvement (~400ms to ~100ms). Also has proven cross-layer troubleshooting skills, isolating intermittent API timeouts to a specific Kubernetes node’s network interface issue, and partners closely with ops teams to build dashboards and workflow automation (including Python scripts).

View profile
LL

Lisa Li

Screened

Director-level Engineering Leader specializing in SaaS, Cloud, and AI/ML delivery

Katy, Texas19y exp
Sainsbury'sThe Open University

Engineering leader who has led 100+ engineers at Sainsbury’s Tech and previously scaled an org from 6 to 60+ at AND Digital. Drove a high-impact modernization of a pricing/decisioning platform serving 1,700 stores—moving from batch monolith to real-time Kafka-based event-driven microservices with MLOps, IaC (Terraform), and zero-trust—delivering £18m+ annual profit uplift and 10+ deploys/day.

View profile
NM

Nathan Moore

Screened

Principal Architect specializing in SRE, DevOps, and large-scale cloud/CDN platforms

Dallas, Texas14y exp
Inertia LabsUCLA

Engineering leader who drove the conception, PRD, architecture, and delivery of MaxCDN’s next-generation CDN platform ("E2"), including control plane work, hardware deployment planning, and observability/billing data processing. Also built Krypton Labs’ engineering team from the first hires, using a flat Agile structure and emphasizing constructive conflict, strong documentation, and remote-team accountability.

View profile
ZI

Senior Machine Learning Engineer specializing in LLMs, RAG, and computer vision

San Diego, CA10y exp
SOTER AIUC San Diego

Built an "AskMyVideo" system that turns YouTube videos into queryable knowledge graphs by transcribing audio (Whisper), chunking and embedding content, and enabling traceable answers back to exact timestamps. Strong in entity resolution (rules + fuzzy matching + TF-IDF/cosine with PR-curve thresholding) and modern retrieval stacks (FAISS, hybrid dense/sparse, domain fine-tuning with ~12% precision gain), with a production mindset using Airflow/Prefect, Docker/FastAPI, and LangSmith/Prometheus/Grafana observability.

View profile
SA

Mid-level Full-Stack Software Engineer specializing in FinTech and payments platforms

Texas, USA4y exp
PayPalNortheastern University

Worked on payments and wallet transactions, with an emphasis on observability and root-cause analysis. Delivered end-to-end A/B testing optimization and implemented Jenkins-based CI/CD automation that reduced manual implementation to 35% and cut deployments to ~2 minutes, with attention to operational considerations like on-call/call rotations.

View profile
AT

Aarya Tallada

Screened

Entry-Level Software Engineer specializing in backend platforms for Financial Services

Tampa, FL1y exp
CitigroupUCLA

At Citi, helped lead the productionization of an internal LLM-driven automation workflow into a production-ready developer platform, focusing on determinism/reproducibility, security, and cost controls. Implemented prompt versioning/registry, JSON schema validation, sanitization, and deep telemetry (including manual edit-distance) plus human-in-the-loop review and phased rollout—driving major SDLC efficiency gains (e.g., test script creation cut from ~1 week to ~1 day).

View profile
Shanmukha Koganti - Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision in Bay Area, CA

Mid-level AI/ML Engineer specializing in recommender systems and edge computer vision

Bay Area, CA6y exp
ShopifyUniversity of North Texas

ML/AI engineer with production experience at Shopify and Intel, building a deep learning product ranking system that lifted add-to-cart ~14% and serving real-time similarity search via FAISS+Redis under <20ms latency at massive scale. Also deployed computer vision models to 100+ retail edge locations using Docker/Ansible/k3s with zero-downtime rollouts, and applies strong MLOps practices (A/B testing, canary/shadow, observability) plus performance optimization (OpenVINO, INT8).

View profile
Alex Vo - Staff Backend Software Engineer specializing in telemetry pipelines and observability in San Jose, CA

Alex Vo

Screened

Staff Backend Software Engineer specializing in telemetry pipelines and observability

San Jose, CA3y exp
VMwareUC Irvine

Backend engineer from VMware focused on proprietary enterprise systems (monitoring tools, data pipelines, and APIs). Drove a ClickHouse migration POC (local to remote host) using a dual-write/cutover approach and source-level debugging across Node/driver differences during a Node 12→20 upgrade, and delivered measurable performance gains (~20% CPU/memory improvement) through batching and streaming ingestion.

View profile
Umesh Toprani - Executive Cloud & Product Operations Leader specializing in SaaS transformation

Umesh Toprani

Screened

Executive Cloud & Product Operations Leader specializing in SaaS transformation

26y exp
Savara GroupUniversity of Wisconsin

Transformation leader brought into Barracuda Networks to orchestrate a founder-level shift from legacy software to SaaS across 14 product lines. Drove a metrics- and operating-margin-focused, multi-cloud (AWS/Azure/private cloud for DR) execution plan while aligning engineering, GTM, finance (subscription accounting), and support—contributing to nearly doubling revenue over ~4 years.

View profile
Atulya Bist - Junior Data Scientist / Software Engineer specializing in LLM analytics and robotics in Los Angeles, CA

Atulya Bist

Screened

Junior Data Scientist / Software Engineer specializing in LLM analytics and robotics

Los Angeles, CA3y exp
Applied MaterialsUSC

Robotics/ML engineer who implemented TD3 and PPO in PyTorch to solve the challenging OpenAI Gymnasium humanoid-v5 MuJoCo task, including custom networks, rollout logic, and training scripts. Also has hands-on robotics coursework experience with ROS-based RRT motion planning on a real robotic arm, plus practical CI/CD and containerization experience (Docker, Jenkins, GitHub Actions). Currently exploring world models (VAE + sequence generator) using Euro Truck Simulator data.

View profile
Nagarjuna Vaddineni - Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and data pipelines in Seattle, WA

Mid-level Full-Stack Software Engineer specializing in cloud-native microservices and data pipelines

Seattle, WA6y exp
AmazonTexas A&M University-Kingsville

Amazon backend engineer who built and operated high-scale Java Spring Boot microservices on AWS (EKS/EC2) handling millions of daily transactions, with deep experience debugging p95 latency and database/ORM bottlenecks. Shipped an AI-driven real-time personalization feature by integrating SageMaker model inference end-to-end with low-latency caching and graceful fallbacks, and designed robust order/payment orchestration with retries, compensations, and DLQ-based escalation.

View profile
Shruti Krishnagiri - Executive Engineering Leader & Technical Founder specializing in AI automation platforms in San Francisco Bay Area, California

Executive Engineering Leader & Technical Founder specializing in AI automation platforms

San Francisco Bay Area, California20y exp
BundledStanford University

Founder/CTO who built and shipped a consumer subscription-bundling platform end-to-end (architecture, implementation, testing) and scaled it to thousands of customers and major partners. Previously led a major reliability overhaul at Chan Zuckerberg Initiative for a Google-Docs-like ed-tech product—boosted observability, introduced incident management, and migrated to a Docker-based scalable architecture. Heavy user of AI tools (Cursor/Claude) for development, testing, and code review, with a strong bias toward lightweight, fast-moving execution.

View profile
Sai Dinesh Pusapati - Senior AI/ML Engineer specializing in GenAI agents and LLM workflows in San Francisco, CA

Senior AI/ML Engineer specializing in GenAI agents and LLM workflows

San Francisco, CA6y exp
Scale AIBelhaven University

LLM/AI engineer with production experience building a retrieval-based document intelligence system that extracts information from PDFs/emails, backed by Python + Spark pipelines. Focused on reliability and cost/latency optimization (caching, batch processing) and has hands-on orchestration experience with Airflow (sensors, retries, alerts). Also partnered with business stakeholders to deliver customer feedback classification/summarization for faster sentiment insights.

View profile
KC

Kevin Cruz

Screened

Senior Gen AI Engineer specializing in agentic LLM systems

Tempe, AZ15y exp
OpendoorUSC

Built and owned end-to-end production systems for a healthcare platform, including a predictive task recommendation feature (React + FastAPI + ML on AWS ECS) that cut backlog 20% and saved coordinators ~10 hours/week. Also productionized an AI-native RAG system (vector DB + LLM) delivering 40% faster query resolution, and led phased modernization of a monolithic FastAPI service into async microservices using feature flags and canary releases.

View profile
Akhil Kunala - Mid-level Software Engineer specializing in backend systems and cloud-native FinTech in Seattle, WA

Akhil Kunala

Screened

Mid-level Software Engineer specializing in backend systems and cloud-native FinTech

Seattle, WA5y exp
AmazonUniversity of North Texas

Amazon engineer with 5+ years of experience who built an AI-assisted log investigation and triage workflow that cut debugging time by about 30% during on-call incidents. Combines observability tooling like CloudWatch and Splunk with Python, prompt engineering, and RAG-based diagnostics, and has practical experience orchestrating agentic AI workflows with a strong human-in-the-loop reliability focus.

View profile
Prakash Bhanu - Director of Software Engineering specializing in cloud, platform, and FinTech systems in Sunnyvale, CA

Prakash Bhanu

Screened

Director of Software Engineering specializing in cloud, platform, and FinTech systems

Sunnyvale, CA22y exp
Cast & CrewSofia University

Senior software engineering leader with broad 0-to-1 product experience spanning web apps, microservices, monoliths, messaging platforms, ML/AI products, and large-scale distributed systems. Notable examples include building a payroll/finance product for cast and crew, a distributed messaging platform, and a Walmart application deployed across multiple CDNs and clouds handling hundreds of TPS, with personal ownership across architecture, design, coding, and support.

View profile
SL

S Latha Naidu

Screened

Mid-level Software Engineer specializing in AI-powered full-stack systems

Seattle, WA4y exp
AmazonUniversity of Colorado Denver

Backend-focused engineer with experience at AWS building a global alarm processing platform (Python, Lambda/SQS/DynamoDB) handling traffic spikes and reliability issues; resolved duplicate alerts and latency under load by fixing hot partitions and enforcing idempotency. Previously at Cognizant, built Java/PostgreSQL backend workflows for healthcare dashboards using pre-aggregated summary tables, strong SQL optimization, and state-driven job orchestration with ELK-based observability and production guardrails.

View profile
SB

Executive Product Leader specializing in AI-powered B2B SaaS

Fairfax, VA21y exp
3Pillar GlobalUniversity of Texas at Austin

Senior product leader with a track record of transforming legacy and labor-intensive products into AI-native, high-growth platforms across automotive SaaS, legal tech, enterprise software, and UCaaS. Most notably, they rebuilt CallRevu’s call analysis engine to replace human review with AI, cutting processing costs by 93% and reducing turnaround from 30 minutes to under 3 seconds, while also launching a new adjacent product line.

View profile
SR

Sriraksha Rao

Screened

Junior Software Engineer specializing in AI systems and distributed backend platforms

San Diego, CA3y exp
Relevance LabsUC San Diego

Built end-to-end AI features across both fitness and insurance domains, including a full-stack personalized workout recommendation system and a production RAG-based insurance QA assistant at Relevance Labs. Stands out for combining backend/distributed systems skills with practical LLM architecture, evaluation, and risk-aware human-in-the-loop design; notably reduced unnecessary LLM calls by 40% while improving latency and answer reliability.

View profile
AN

Abhay Naik

Screened

Mid-level Data Engineer specializing in cloud-native analytics and enterprise integrations

Remote3y exp
The GrooveUC Berkeley

Built and productionized an LLM-powered clinical assistant at a healthcare startup, re-architecting a prototype into a robust RAG system on AWS with guardrails, citations, monitoring, and automated tests for clinical reliability. Works closely with clinicians to convert workflow feedback into evaluation criteria and iterative system improvements, and has hands-on experience debugging agentic systems in real time (including during live client demos).

View profile
AS

Staff-level Software Engineer specializing in identity, access management, and platform security

Grapevine, TX4y exp
PaycomRice University

Backend engineer focused on scalable, security-first platform architecture—recently built an end-to-end centralized access-control system that launched successfully with ~50k early adopters and was designed to support ~10x traffic growth. Experienced in production authn/authz (token verification, handoff/session migration), and in de-risking migrations via feature flags, phased rollouts, A/B testing, and Splunk-based monitoring.

View profile

Need someone specific?

AI Search