Operational Insights
Overview
Operational Insights is a dashboard in the OpsWorker portal that shows alert volumes, investigation performance, and the operational impact of automated investigation on your team.
Use Operational Insights to answer questions like:
- How many alerts are we handling across all clusters?
- How much investigation time is OpsWorker saving the team?
- Which namespaces or services generate the most alerts?
- Are our alert volumes trending up or down?
Dashboard Overview
The Insights dashboard groups its data into four areas:
Alert Statistics
- Total alerts received across all connected sources
- Breakdown by severity (critical, warning, info)
- Breakdown by cluster and namespace
- Alert volume trends over time
Investigation Metrics
- Total investigations completed
- Average investigation completion time
- Investigation outcomes and accuracy ratings from team feedback
- Most commonly investigated alert types
Time Saved
- Estimated engineering hours saved based on investigation automation
- Calculated as: number of completed investigations × average time a manual investigation takes
- Filterable by time range, cluster, and workspace
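As a rough illustration of the calculation above, the sketch below multiplies an investigation count by an average manual investigation time and converts the result to hours. The function name, the use of minutes as the input unit, and the sample numbers are assumptions made for this example; they are not taken from the OpsWorker portal or its API.

```python
def estimated_hours_saved(investigation_count: int, avg_manual_minutes: float) -> float:
    """Return estimated engineering hours saved.

    Hypothetical helper: investigations x average manual investigation time,
    with the manual time given in minutes (an assumed unit).
    """
    if investigation_count < 0 or avg_manual_minutes < 0:
        raise ValueError("inputs must be non-negative")
    return investigation_count * avg_manual_minutes / 60.0


# Made-up sample values: 120 investigations, 45 minutes each if done by hand.
hours = estimated_hours_saved(120, 45)
print(f"Estimated hours saved: {hours:.1f}")
```

The estimate is only as good as the average manual investigation time you plug in, so it helps to agree on that baseline with your team before reporting the figure.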
Cluster Health
- Per-cluster alert volumes and investigation frequency
- Clusters with the highest alert activity
- Connectivity status overview
Key Metrics
| Metric | What It Measures |
|---|---|
| Alert volume | Total alerts received, by severity and source |
| Investigation count | Number of completed investigations |
| Investigation accuracy | Team feedback ratings on investigation quality |
| Time saved | Estimated engineering hours saved |
| Top namespaces | Most active namespaces by alert volume |
| Trend direction | Week-over-week alert volume comparison |
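The trend direction metric is described as a week-over-week comparison of alert volume. One way such a comparison could be computed is sketched below. The function names, the 5% tolerance used to label a change as flat, and the handling of an empty previous week are all assumptions for illustration; the dashboard's actual rules are not documented here.

```python
from typing import Optional


def week_over_week_change(current_week: int, previous_week: int) -> Optional[float]:
    """Percentage change in alert volume, or None if there is no baseline."""
    if previous_week == 0:
        return None  # percentage change is undefined without a previous value
    return (current_week - previous_week) * 100.0 / previous_week


def trend_direction(current_week: int, previous_week: int,
                    tolerance_pct: float = 5.0) -> str:
    """Label the trend as 'up', 'down', or 'flat' (tolerance is an assumed value)."""
    change = week_over_week_change(current_week, previous_week)
    if change is None:
        return "up" if current_week > 0 else "flat"
    if change > tolerance_pct:
        return "up"
    if change < -tolerance_pct:
        return "down"
    return "flat"


# Made-up sample values: 80 alerts this week against 100 last week.
print(week_over_week_change(80, 100), trend_direction(80, 100))
```

A tolerance band keeps small week-to-week fluctuations from being reported as a real change in direction.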
Using Insights
- Track operational improvement — Monitor whether alert volumes decrease as your team addresses root causes identified by OpsWorker
- Identify noisy areas — Spot namespaces or services that generate disproportionate alerts
- Measure ROI — Quantify the time saved for leadership reporting
- Guide alert tuning — Use patterns to refine monitoring thresholds in your alerting tools
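To make "disproportionate" concrete when identifying noisy areas, one option is to look at each namespace's share of total alert volume. The sketch below assumes you have a plain list of namespace names, one per alert, for example from a manual export. The function name, the 25% threshold, and the sample namespaces are hypothetical and do not reflect an OpsWorker export format.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def noisy_namespaces(alert_namespaces: Iterable[str],
                     share_threshold: float = 0.25) -> List[Tuple[str, float]]:
    """Return (namespace, share) pairs whose share of alerts meets the threshold.

    Results are ordered from the most to the least active namespace.
    """
    counts = Counter(alert_namespaces)
    total = sum(counts.values())
    if total == 0:
        return []
    return [
        (namespace, count / total)
        for namespace, count in counts.most_common()
        if count / total >= share_threshold
    ]


# Made-up sample data: one entry per alert, labelled with its namespace.
alerts = ["payments"] * 6 + ["checkout"] * 3 + ["auth"] * 1
for namespace, share in noisy_namespaces(alerts):
    print(f"{namespace}: {share:.0%} of alerts")
```

Namespaces that stay above the threshold week after week are reasonable first candidates for threshold tuning or root-cause work.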
Next Steps
- Key Metrics — Detailed breakdown of available metrics
- Investigation Analytics — Deep dive into investigation performance
- Team Impact — Measure the impact on your team