Unified incident command

Telemetry becomes decisions before humans become a dependency.

PulseOps correlates alerts, traces, deploys, ownership, and runbooks into one incident graph. The operator sees impact, probable cause, next action, and the smallest responder set before the first page fans out.

Signals merged

137

Into 1 incident

Suppressed pages

3

Policy + maintenance window

On-call now

Shift, escalations, and scheduled handoffs

Fri, Jul 4

2 active escalations
Auth Platform primary until 18:00 UTC
Checkout SRE secondary until 18:00 UTC
Database platform maintenance window 16:00-17:30 UTC
09:15 rollback approval requested
09:24 executive status update drafted
09:40 follow-the-sun handoff at risk if sev1 reopens

Open Incidents

7

2 sev1, 3 sev2

Auto-Resolved

61%

+8% last 30d

Median MTTR

14m

-6m from baseline

Alert Reduction

92%

After correlation

Pages Sent

11

3 suppressed

Pending Approvals

4

Runbook gates

Services Protected

184

Across 12 teams

Error Budget Burn

1.7x

checkout-api

AI Summaries

38

Today

Escalation Fan-out

2.1

Avg responders

Incident Volume

Correlated incident count through the response window

Alert Stream vs Correlated Output

Raw alerts fall while one incident graph stays stable

Latency Trend

P95 latency through mitigation and rollback

Error Budget Burn

Service burn rate during the incident window

Auto-Remediation Trend

Executed runbooks during detection and recovery

Prevented Incidents

AI and policy prevented human wake-ups

Root Cause Mix

Most likely categories across recent incidents

Pages Sent

Human pages after automation and suppression filters

Incident Intelligence

Filters, bulk actions, inline updates, and exportable response state

IncidentSeverityStatusLikely CauseOwnerNext Action
Sev1
Deployment 24.7.4Auth Platform
Sev2
Redis saturationCheckout SRE
Sev2
Regional DNS propagationIdentity Team
Sev3
Deploy concurrency mismatchPlatform Ops

Inline edits simulate optimistic incident operations.

Activity Feed

Timeline fragments from AI, runbooks, and responders

Auto rollback executed

auth-edge reverted release 24.7.4 after latency spike detection

4m ago

Executive update drafted

PulseOps generated sev1 summary for checkout-auth cascade

11m ago

Escalation suppressed

redis-platform page skipped due to maintenance window match

18m ago

Runbook approval requested

Scale worker pool from 8 to 14 replicas for checkout-api

24m ago

Quick Actions

Policy-aware actions with lightweight form capture