Skip to main content

Operational Insights

Overview

Operational Insights is a dashboard in the OpsWorker portal that provides visibility into your alert landscape, investigation performance, and the operational impact of automated investigation on your team.

Use Operational Insights to answer questions like:

  • How many alerts are we handling across all clusters?
  • How much investigation time is OpsWorker saving the team?
  • Which namespaces or services generate the most alerts?
  • Are our alert volumes trending up or down?

Dashboard Overview

The Insights dashboard displays data across several dimensions:

Alert Statistics

  • Total alerts received across all connected sources
  • Breakdown by severity (critical, warning, info)
  • Breakdown by cluster and namespace
  • Alert volume trends over time

Investigation Metrics

  • Total investigations completed
  • Average investigation completion time
  • Investigation outcomes and accuracy ratings from team feedback
  • Most commonly investigated alert types

Time Saved

  • Estimated engineering hours saved based on investigation automation
  • Calculated from: number of investigations × average manual investigation time
  • Filterable by time range, cluster, and workspace

Cluster Health

  • Per-cluster alert volumes and investigation frequency
  • Clusters with the highest alert activity
  • Connectivity status overview

Key Metrics

MetricWhat It Measures
Alert volumeTotal signals received, by severity and source
Investigation countNumber of completed investigations
Investigation accuracyTeam feedback ratings on investigation quality
Time savedEstimated engineering hours saved
Top namespacesMost active namespaces by alert volume
Trend directionWeek-over-week alert volume comparison

Using Insights

  • Track operational improvement — Monitor whether alert volumes decrease as your team addresses root causes identified by OpsWorker
  • Identify noisy areas — Spot namespaces or services that generate disproportionate alerts
  • Measure ROI — Quantify the time saved for leadership reporting
  • Guide alert tuning — Use patterns to refine monitoring thresholds in your alerting tools

Next Steps