Key Metrics
Overview
OpsWorker tracks several categories of operational metrics to help you understand your alert landscape and measure the impact of automated investigation.
Alert Metrics
| Metric | Description |
|---|---|
| Total alerts | Number of signals received from all monitoring sources |
| Alerts by severity | Breakdown: critical, warning, info |
| Alerts by source | Breakdown: Prometheus, Grafana, Datadog, CloudWatch |
| Alerts by cluster | Volume per connected cluster |
| Alerts by namespace | Most active namespaces |
| Alert trend | Week-over-week and month-over-month comparisons |
Investigation Metrics
| Metric | Description |
|---|---|
| Total investigations | Number of completed investigations |
| Investigation completion rate | Percentage that completed successfully vs. failed |
| Average investigation time | Mean time from alert to completed investigation |
| Investigations by cluster | Volume per cluster |
| Investigation outcomes | Distribution of root cause types identified |
Time Saved Metrics
| Metric | Description |
|---|---|
| Estimated hours saved | Investigations × average manual investigation time |
| Time saved per investigation | Based on typical manual investigation duration (30–80 min baseline) |
| Cumulative savings | Running total over selected time period |
Feedback Metrics
| Metric | Description |
|---|---|
| Accuracy rate | Percentage of investigations rated "Accurate" |
| Partial accuracy rate | Percentage rated "Partially Accurate" |
| Feedback response rate | Percentage of investigations that received feedback |
Using Metrics
- Report to leadership: Use time saved and investigation count for ROI reporting
- Track improvement: Watch alert volume trends — decreasing volume indicates root causes are being addressed
- Identify hot spots: Use namespace and cluster breakdowns to focus engineering effort
- Evaluate accuracy: Use feedback metrics to assess investigation quality over time
Next Steps
- Investigation Analytics — Deeper investigation analysis
- Team Impact — Broader team impact measurement