Vetted Site Reliability Engineering Professionals

Pre-screened and vetted.

Aaron Li

Screened

Junior AI/ML Engineer specializing in production LLM systems and RAG

Atlanta, GA · 2y exp
Georgia Institute of Technology · University of Chicago

LLM/document AI engineer who owned a production-grade contract extraction pipeline at CORAMA.AI, ingesting PDFs and dynamic JavaScript sites from 1,000+ government sources. Built a hybrid deterministic+LLM system with two-phase prompting, Pydantic guardrails, confidence scoring, and human-in-the-loop review—cutting error rates from ~35% to <5% and processing 50k+ documents at ~95% accuracy. Also built clinician-in-the-loop orchestration in research, reducing manual labeling time from 3–4 hours to ~50 minutes.

AD

Mid Backend Software Engineer specializing in FinTech platforms

Jersey City, NJ · 3y exp
JPMorgan Chase · NYU

Frontend-leaning full-stack engineer with hands-on experience building financial operations and transaction monitoring products from 0→1 through production scale. They stand out for owning React UI architecture, backend/API integration, and data-layer performance decisions while making pragmatic startup tradeoffs and improving features post-launch based on latency, error, and user feedback.

Melbourne Brown

Senior Software Engineer specializing in AI-driven cloud-native platforms

Atlanta, GA · 12y exp
McKinsey & Company · Kennesaw State University

Engineer with unusual breadth: from a tiny startup building racehorse medical-record systems on credit-card chips for live racetrack demos to modern AI-powered contract intelligence platforms in production. Brings hands-on full-stack and backend depth across React, Python, .NET, PostgreSQL, Kubernetes, and Azure, with a track record of making complex, reliability-sensitive systems work in real-world conditions.

Neil Gariepy

Screened

Executive technology leader specializing in AI, cloud operations, and information security

Portland, OR · 26y exp
Airship Group · University of San Francisco

Veteran software engineering executive with ~20 years of experience who grew from hands-on IC and release engineer into VP leadership across AI, security, infrastructure, and platform engineering. Particularly compelling for senior platform or CTO-track roles: he combines board-level strategy with real technical depth, including architecting Airship's multi-agent generative AI platform, leading GDPR/security programs, and scaling NetSuite infrastructure to 99.99% uptime across six global data centers.

Krishi Jain

Screened

Junior Implementation Manager / Solution Engineer specializing in AI, ERP integrations, and predictive maintenance

Chicago, IL · 2y exp
Continuum AI · Westcliff University

LLM/agentic workflow practitioner (Continuum AI) who productionized an LLM system for manufacturing RMA intake and warranty claims by moving from a brittle prompt to a modular pipeline with RAG, function-calling extraction, deterministic validation, and strong observability. Also diagnosed and fixed an agentic ticket-triage misrouting issue by tracing failures to retrieval timeouts, adding guardrails/fallbacks, and implementing retries plus continuous evaluation—bringing misroutes near zero while creating a repeatable debugging playbook.

Manoj Bagul

Screened

Executive Engineering & AI Platform Leader in Enterprise SaaS

New York, NY · 25y exp
Qlaws.ai · Savitribai Phule Pune University

Healthcare data platform builder with experience at Aetion delivering a rule-based EMR/EHR ingestion and validation framework that cut onboarding from 8–10 weeks to hours and unlocked $30M+ in revenue over ~3 years. Motivated to found an AI/agent-driven healthcare solution, with a specific interest in using PET scans, doctor notes, and treatment data with LLMs to help predict cancer progression and guide next-step treatments.

Nathan Moore

Screened

Principal Architect specializing in SRE, DevOps, and large-scale cloud/CDN platforms

Dallas, Texas · 14y exp
Inertia Labs · UCLA

Engineering leader who drove the conception, PRD, architecture, and delivery of MaxCDN’s next-generation CDN platform ("E2"), including control plane work, hardware deployment planning, and observability/billing data processing. Also built Krypton Labs’ engineering team from the first hires, using a flat Agile structure and emphasizing constructive conflict, strong documentation, and remote-team accountability.

Alex Vo

Screened

Staff Backend Software Engineer specializing in telemetry pipelines and observability

San Jose, CA · 3y exp
VMware · UC Irvine

Backend engineer from VMware focused on proprietary enterprise systems (monitoring tools, data pipelines, and APIs). Drove a ClickHouse migration POC (local to remote host) using a dual-write/cutover approach and source-level debugging across Node/driver differences during a Node 12→20 upgrade, and delivered measurable performance gains (~20% CPU/memory improvement) through batching and streaming ingestion.

Shruti Krishnagiri

Executive Engineering Leader & Technical Founder specializing in AI automation platforms

San Francisco Bay Area, California · 20y exp
Bundled · Stanford University

Founder/CTO who built and shipped a consumer subscription-bundling platform end-to-end (architecture, implementation, testing) and scaled it to thousands of customers and major partners. Previously led a major reliability overhaul at Chan Zuckerberg Initiative for a Google-Docs-like ed-tech product—boosted observability, introduced incident management, and migrated to a Docker-based scalable architecture. Heavy user of AI tools (Cursor/Claude) for development, testing, and code review, with a strong bias toward lightweight, fast-moving execution.

Ruby Medeiros

Screened

Staff SRE and Software Engineer specializing in distributed systems and cloud reliability

11y exp
Arena · NOVA University Lisbon

Built a production B2C behavioral interview system for job seekers using LangGraph/LangChain on AWS Bedrock with Nova models, plus a FastAPI backend and Vercel AI SDK frontend. Stands out for practical agent reliability work: local stress testing, OpenTelemetry-to-Datadog observability, token/cost monitoring, and guardrails to keep conversations on track and resistant to instruction override.

Palak Siroya

Screened

Senior Site Reliability Engineer specializing in Azure cloud reliability and data analytics

Renton, WA · 10y exp
Microsoft · Central Washington University

AppSec-focused customer advisor with hands-on experience integrating SAST/DAST/SCA into production CI/CD (Azure DevOps) and designing secure agent/scanning deployments in AWS (least-privilege IAM, private subnets, VPC endpoints). Demonstrates strong incident troubleshooting using logs/metrics/traces to diagnose load-related failures (timeouts/retry storms) and drive durable fixes, while tailoring risk/tradeoff communication across engineering, security, and leadership stakeholders.

Lamar Petty

Screened

Mid-level Full-Stack Product Engineer specializing in data-driven web apps and healthcare systems

San Francisco, CA · 13y exp
Wikimedia Foundation · Georgia Tech

Full-stack engineer with production experience shipping a healthcare-focused web app (Pregnancy-Pal) using Next.js/TypeScript on GCP, integrating a Python/Flask middleware and FHIR server for patient/practitioner dashboards and messaging. Former Wikimedia Foundation Android engineer who led the end-to-end 'Year in Review' feature and built robust automated testing/CI practices (Espresso, GitHub Actions matrix). Strong emphasis on reliability via rigorous validation, comprehensive Postman testing, and detailed API documentation.

PS

Senior Software Engineer specializing in backend infrastructure, cloud automation, and reliability

Mountain View, CA · 8y exp
Oracle · Stony Brook University

End-to-end deployment owner for Oracle document delivery/print services in a hospital-like production environment, focused on reliability/performance at scale (thousands of systems). Also describes implementing event-driven RAG/agentic LLM workflows with attention to embeddings/index consistency, latency, and measurable improvements in response relevance and operational efficiency.

VS

Director-level Software Engineering Leader specializing in FinTech and platform modernization

Sammamish, WA · 13y exp
Capital One · Anna University

Director-level Senior Manager of Software Engineering at Discover with roughly a decade in web application engineering leadership, focused on modernizing legacy banking platforms into cloud-native SPA architectures. Stands out for combining large-team people leadership with hands-on technical depth in architecture, debugging, and prototyping, including GenAI experimentation and high-scale customer-facing migrations.

Mos Fard

Screened

Senior Distributed Systems Architect specializing in backend platforms and FinTech

Scottsdale, AZ · 12y exp
PayPal · Arizona State University

Full-stack engineer who built an AI-powered visual product discovery feature end to end across web, mobile, backend, and ML integration. Particularly strong in TypeScript-first monorepo architecture, serverless AWS microservices, and productionizing computer vision/LLM pipelines with monitoring, prompt refinement, and human-in-the-loop quality controls.

Nibir Bora

Screened

Senior engineering leader specializing in cloud infrastructure and platform engineering

Los Angeles, CA · 11y exp
Ditto · USC

Engineering leader with deep platform and Kubernetes expertise who scaled a compute team at CloudKitchens/Adams from 5 to 12 engineers while driving major infrastructure outcomes. Notable achievements include completing a GCP-to-Azure migration in one year, cutting cloud costs by ~40%, and leading governance, reliability, and AI-based anomaly detection initiatives across a large microservices platform.

SC

Director-level technology architect specializing in AI, cloud platforms, and AdTech

Glendale, CA · 13y exp
Disney · D.Y. Patil College of Engineering

Architecture leader from Disney who managed system, AI, and data architects while staying hands-on in solution design. Has experience building LLM-based video advertising products, designing Kafka-based real-time data architectures, and using MVP/POC approaches to align product and executive stakeholders.

VS

Mid-Level Software Engineer specializing in LLM agents and real-time data streaming

8y exp
Amazon · Rutgers University–New Brunswick

Software engineer with experience at Striim and Amazon who ships end-to-end production systems across UI, backend, ML, and operations. Built a real-time PII detection capability for a streaming data platform by integrating Python ML inference into a Java monolith via gRPC sidecars, achieving ~3M events/hour throughput and ~93% accuracy, and helped drive enterprise adoption (Fiserv, CVS). Also modernized internal Amazon tooling for multi-region scale with modularization and fully automated deployments.

SP

Mid-level Backend Software Engineer specializing in Python APIs and payment systems

USA · 6y exp
Stripe · Southern Illinois University Carbondale

Backend/ML systems engineer with Stripe payments experience who built an asynchronous processing upgrade handling millions of API requests, cutting peak latency ~20–25% while preserving strict financial consistency via idempotency-safe retries and robust validation/fallbacks. Also built scalable ETL pipelines for messy CSV/Excel/API data with strong observability (structured logging/monitoring) and reliability mechanisms.

Deepika Gotla

Screened

Senior Technical Support Engineer specializing in Azure Cloud & Generative AI

Bellevue, WA · 7y exp
Microsoft · SUNY New Paltz

Microsoft cloud/infra engineer with 5+ years supporting enterprise Azure environments, specializing in security-focused networking (private endpoints, DNS) and production troubleshooting across Azure Front Door/App Gateway WAF/AKS. Has implemented posture improvements via Defender for Cloud, Azure Policy, and RBAC tightening, and also designs secure AWS agent/scanner integrations and observability-enabled SDK rollouts using EKS, GitHub Actions, and Secrets Manager.

Shriya Bannikop

Mid-level Software Engineer specializing in cloud platforms, data engineering, and distributed systems

Seattle, WA · 5y exp
Amazon Web Services · KLE Technological University

Full-stack engineer who built and owned an AI-assisted job-matching dashboard in Next.js App Router/TypeScript, keeping LLM logic server-side and improving performance via deduplication, caching/revalidation, and streaming (35% fewer duplicate LLM calls; 40% faster first render). Also has strong data/backend chops: designed Postgres models and optimized queries at million-record scale (1.8s to 120ms) and built durable AWS multi-region telemetry workflows with idempotency, retries, and monitoring.

Shawn Souto

Screened

Director-level Engineering Leader specializing in AI and enterprise SaaS

Newport Beach, CA · 17y exp
SAP · DeVry University

Engineering leader who has operated effectively in both a VC-backed startup and SAP, combining director-level org leadership with day-to-day technical depth. Notable for re-architecting integrations that produced a 3x revenue gain, leading a 90-engineer matrixed organization, and staying hands-on in GenAI, infrastructure, and full-stack problem solving.

Krishna Guda

Screened

Principal Software Engineer specializing in AI-native FinTech systems

Mountain View, CA · 23y exp
Credit Sesame · BITS Pilani

Fintech product engineer working on a large-scale credit monitoring platform (tens of millions of users) with deep experience in regulated banking integrations, PII security, and step-up/MFA flows. Has shipped customer-facing React/TypeScript experiences driven by Optimizely experimentation and built reliable partner-facing microservices/SDKs on AWS, including resolving production traffic loss caused by edge security (DataDome/CAPTCHA) conflicts with payment providers.

Antonio Richmond

Executive Operations Leader and Electrical Engineer specializing in industrial scale-up and automation

Los Gatos, CA · 16y exp
Blue Planet Systems · Savannah State University

Operations leader with experience in heavy industry and climate-focused industrial innovation: led ops for a novel cement-disruption product involving site selection near CO2 emitters and feedstock/offtake sourcing to reach a proof-of-concept facility. Previously recommissioned an oil re-refinery within a year, owning cross-functional hiring plus compliance, budgeting, and community relations, and delivering on time and on budget.
