Reval Logo
Home Browse Talent Skilled in Observability

Vetted Observability Professionals

Pre-screened and vetted.

ObservabilityPythonDockerCI/CDAWSKubernetes
KG

Kiriti Golla

Mid-level AI/ML Engineer specializing in LLM fine-tuning, RAG, and MLOps

Bay Area, CA5y exp
MicrosoftSUNY Polytechnic Institute
A/B TestingAgileAmazon S3Anomaly DetectionApache KafkaApache Spark+151
View profile
BH

Brett Higgins

Senior Full-Stack Engineer specializing in AI/GenAI and cloud-native platforms

Dallas, TX13y exp
Kamet Consulting GroupUniversity of Texas at Austin
ReactReact NativeNext.jsAngularReduxTypeScript+124
View profile
AS

Amareswara Surapuraju

Mid-level Software Engineer specializing in real-time backend systems and FinTech payments

San Francisco, CA6y exp
StripeWebster University
PythonFastAPIDjangoFlaskSQLAlchemyJava+151
View profile
JL

Jack Livingston

Senior Software Engineer specializing in Python, cloud microservices, and conversational AI

Chapel Hill, NC11y exp
GoogleUniversity of North Carolina at Chapel Hill
PythonDjangoFastAPIFlaskGoNode.js+96
View profile
AB

Anirudh Belwadi

Mid-level Backend/Distributed Systems Engineer specializing in AWS serverless architectures

Bellevue, WA2y exp
AmazonCarnegie Mellon University
JavaPythonTypeScriptReactFlaskSpring Boot+60
View profile
ML

Maria Li

Junior Firmware Engineer specializing in embedded-cloud integrations

New York, NY2y exp
SamsaraUC Berkeley
PythonGoJavaJavaScriptTypeScriptSQL+28
View profile
TM

Thomas Maxwell

Senior Software Engineer specializing in cloud platforms for healthcare and e-commerce

San Jose, CA11y exp
AmazonGeorgia Tech
AgileAngularApache AirflowApache HadoopApache KafkaApache Spark+268
View profile
FZ

Flora Zeng

Junior Software Engineer specializing in observability and cloud infrastructure

2y exp
FastlyUC Berkeley
GoJavaTypeScriptPythonJavaScriptC+++32
View profile
SY

Srinadh Y

Mid-level AI/ML Engineer specializing in LLMs, multimodal systems, and MLOps

5y exp
MetaEast Texas A&M University
A/B TestingAnomaly DetectionApache CassandraApache KafkaApache SparkAWS+179
View profile
CC

Christopher Chu

Screened

Senior Backend Engineer specializing in distributed systems and cloud microservices

Beaverton, Oregon11y exp
NikeUC San Diego

“Backend/data engineer with experience at Nike building high-volume order orchestration and validation APIs using FastAPI microservices on AWS EKS with Kafka, Redis, and Postgres. Strong in production reliability (timeouts/retries/idempotency), GitOps (Argo CD) + Terraform deployments, and data pipelines (AWS Glue/S3), with hands-on incident ownership and legacy modernization into API-driven services.”

AgileAPI DesignArgo CDAsynchronous ProcessingBatch ProcessingCI/CD+136
View profile
PP

Pranshu Patel

Screened

Director-level Software Development Manager specializing in cloud DDoS protection

Santa Clara, CA12y exp
Amazon Web ServicesUniversity of Maryland, College Park

“AWS Software Development Manager leading globally deployed, production-critical DDoS protection (L3/L4) across AWS. Known for scaling teams and driving cross-org tiger-team initiatives from concept through worldwide rollout, including performance-focused Python architecture changes and a major JDK 8→21 migration while maintaining strict backward compatibility. Also led an internal SDK-like integration framework improving APIs, documentation, and onboarding for major AWS service teams.”

Project managementCommunicationLeadershipDistributed systemsAmazon CloudFrontAmazon EC2+45
View profile
SM

Shant Mardigian

Screened

Executive Engineering Leader specializing in scalable streaming, media supply chain, and AI operations

19y exp
DisneyUCLA

“Tech executive with Disney experience who has repeatedly scaled and restructured engineering organizations (from 4 to 30 and up to 100+), using OKRs/KPIs to drive business-aligned roadmaps. Hands-on with architecture and platform strategy, including adopting MongoDB Atlas to centralize transactional data and building shared core services (security/permissions, auditing, compliance) to increase product velocity across distributed teams.”

Microservices architectureCloud-native architectureAWSInfrastructure as CodeDevOpsAutomation+93
View profile
JF

Jeffery Faneuff

Screened

Executive Engineering Leader specializing in AI-driven SaaS and IoT platforms

Los Angeles, CA22y exp
VantivaBabson College

“Engineering leader who built and delivered an IoT smart-spaces platform for the self-storage and smart-living domains, translating customer requirements into architecture, capability maps, and a multi-milestone roadmap. Personally stood up missing AI/ML capabilities (including churn prediction) using Databricks (Delta Lake/MLflow), enabling follow-on features like energy optimization and security/anomaly detection. Scaled an org from 20 to 80+ with disciplined Agile planning (Jira Advanced Roadmaps/Confluence) and strong executive/customer-facing leadership during high-stakes customer commitments.”

AgileAndroidAngularJSApache TomcatAWSAWS CloudFormation+163
View profile
RB

Riccardo Bernardi

Screened

Senior Infrastructure Engineer specializing in cloud, Kubernetes, and MLOps

San Francisco, USA6y exp
ATLANTIA SpaUniversity of Bologna

“LLMOps-focused technical leader who took an LLM use case from prototype to production for a non-technical customer by combining trust-building and structured enablement with a robust AWS/Kubernetes-based MLOps stack. Built observability and rollback mechanisms (Grafana + MLflow) to troubleshoot in real time, and scaled delivery by hiring a 5-person team while partnering with sales to manage expectations and drive adoption across departments.”

AlgorithmsAmazon API GatewayAmazon EMRAmazon S3Apache AirflowArgo CD+85
View profile
SS

Sujit Singh

Screened

Engineering Director specializing in backend & data platforms for enterprise SaaS and cybersecurity

San Jose, CA21y exp
SplunkHarvard Extension School

“Backend/data engineering player-coach on a UEBA cloud security analytics platform who standardized MLOps and detection development for 180+ detections, cutting ship time from 6–7 weeks to ~3 weeks while reducing false positives. Proven at operating large-scale streaming + Spark systems (200K+ events/sec, 100+ TB/day), driving major reliability/cost improvements, and leading incident response and team execution through GA.”

OnboardingPerformance managementData engineeringDistributed systemsBatch processingObservability+59
View profile
SA

Shiva Arcot

Screened

Director of Security & Data Platform Engineering specializing in AI-driven cloud security

Sunnyvale, CA24y exp
ProofpointSanta Clara University

“Player-coach engineering leader focused on scalable data security scanning and risk detection in hybrid cloud, owning architecture and core implementation of an incremental/parallel DSPM scanning engine. Shipped production improvements including 60% lower scan latency and 30% fewer false positives, with strong emphasis on correctness under concurrency, multi-tenant observability (SLOs/burn-rate alerts), and disciplined rollout practices (feature flags, shadow scans, canaries).”

Anomaly DetectionApache AirflowApache CassandraAWSAWS GlueBusiness Intelligence+126
View profile
KD

Kusumita Dasgupta

Screened

Director-level Engineering Leader specializing in data platforms, cloud systems, and LLM products

United States22y exp
Intuition IntelligenceUSC

“Engineering leader/player-coach with recent hands-on work delivering an agentic AI MVP on Amazon Bedrock (conversational UI + supervisor agent routing between internal knowledge and external sources). Previously drove large-scale data platform cost optimization at Twitter, saving ~$3M–$5M annually, and has owned production incidents end-to-end with a focus on analytics/monitoring improvements and team coaching.”

JavaPythonScalaSpring BootNode.jsTypeScript+101
View profile
SP

Sanket Patel

Screened

Director-level Front-End Engineering Leader specializing in scalable web and mobile apps

Palo Alto, CA17y exp
AmazonSan Diego State University

“Amazon engineer/leader who drove a major modernization of the AWS Database Migration Service Console, migrating a monolithic UI to a micro-frontend architecture while improving performance, reliability, and engineering standards. Operates as a player-coach (80/20 hands-on/management), with demonstrated incident ownership and process improvements across Amazon and Walmart Labs.”

AgileAngularAngularJSAWSC#Confluence+85
View profile
HK

Harish Kasu

Screened

Mid-level AI/ML Engineer specializing in Generative AI, RAG, and MLOps

San Francisco, CA5y exp
NVIDIATexas A&M University-Kingsville

“AI/LLM engineer with production experience at NVIDIA and Microsoft, including building a RAG-based enterprise knowledge assistant that improved accuracy by 42% and scaled to thousands of queries. Deep in inference optimization (TensorRT-LLM, Triton, quantization, speculative decoding) and MLOps/observability (Prometheus/Grafana, MLflow, LangSmith), plus orchestration with Kubeflow/Airflow across multi-cloud.”

PythonFastAPIFlaskRSQLJava+204
View profile
YW

Yishi Wang

Screened

Junior Machine Learning & Data Science professional specializing in LLMs and analytics

Chicago, IL3y exp
MintelNorthwestern University

“Amazon internship experience building production GenAI analytics for the returns organization: a multi-agent LLM+RAG system that let analysts query multiple heterogeneous data sources in natural language without hand-written SQL. Also built and operationalized four Apache Airflow DAGs for large-scale ETL, emphasizing observability and freshness-aware metadata to keep outputs accurate and up to date.”

A/B TestingAWSAWS LambdaBERTBusiness IntelligenceC+++125
View profile
GU

Gabriel Undrerwood

Screened

Engineering executive specializing in production ML systems and enterprise SaaS

San Francisco, CA26y exp
FLYRCarnegie Mellon University

“Engineering/data platform leader from FLYR (airline ML forecasting and automated pricing) who built scalable ingestion/ETL and a canonical data model to onboard airlines with highly heterogeneous source systems. Created a golden-metrics layer for airline KPIs and implemented monitoring/backfill capabilities, cutting onboarding time by 50%+ while improving SLA performance and controlling cloud/ML training costs through stronger data quality gates.”

LeadershipMLOpsObservabilityProgram ManagementComplianceData Ingestion+109
View profile
JO

Jun Ouyang

Screened

Principal Software Engineer / Tech Lead specializing in distributed systems, payments, and reliability

San Francisco, CA20y exp
DoorDashZhejiang University

“Backend engineer with DoorDash experience building production-critical systems spanning LLM-based real-time safety moderation (SendBird callbacks + ChatGPT risk scoring with automated actions) and large-scale payments data pipelines (Kafka to CockroachDB with aggregation APIs). Also led cross-team reliability work to standardize SLOs and drove an incident redesign from batch pull to real-time push callbacks to eliminate critical-event latency.”

JavaKotlinPythonAWSDockerKubernetes+69
View profile
1...678...119

Related

Software EngineersSoftware DevelopersFull Stack DevelopersMachine Learning EngineersSoftware Development EngineersDevOps EngineersEngineeringAI & Machine LearningExecutive & LeadershipData & Analytics

Need someone specific?

AI Search