Automatic Investigations
Overview
Automatic investigations run without any manual intervention. When an alert matches a configured alert rule with auto-investigation enabled, OpsWorker starts an investigation immediately — 24/7, regardless of who's on call or what time it is.
How It Works
sequenceDiagram
participant Mon as Monitoring System
participant OW as OpsWorker
participant Rules as Alert Rules
participant AI as Investigation Engine
participant Slack as Slack
Mon->>OW: Alert webhook
OW->>Rules: Evaluate alert against rules
Rules-->>OW: Match found (auto-investigate: on)
OW->>AI: Start investigation
AI->>AI: Discover, collect, analyze
AI->>Slack: Post results
Configuration
Auto-investigation is controlled by alert rules:
- Create an alert rule that matches the alerts you want investigated
- Enable the Auto-investigate toggle on that rule
- Any alert matching the rule will automatically trigger an investigation
Enabling Auto-Investigation
In the OpsWorker portal:
- Go to Configurations → Alert Rules
- Create a new rule or edit an existing one
- Set your filter criteria (namespace, severity, labels)
- Toggle Auto-investigate to Enabled
- Save the rule
Rate Limiting
OpsWorker prevents duplicate investigations for the same alert:
- If the same alert fires multiple times within a short window, only one investigation is created
- This prevents unnecessary work during alert storms
- Each unique alert combination (name + resource + namespace) is tracked
Coverage
| Feature | Benefit |
|---|---|
| 24/7 operation | Investigations run day and night, weekdays and weekends |
| No human trigger | The alert itself starts the investigation — no one needs to be awake |
| Consistent quality | Every alert gets the same thorough investigation process |
| Immediate start | Investigation begins the moment the alert arrives |
Best Practices
- Start with critical alerts — Enable auto-investigation for critical severity alerts first
- Expand gradually — Add warning severity and more namespaces as you gain confidence in investigation quality
- Use feedback — Rate investigation accuracy via Slack buttons to help improve results
- Review the daily digest — Check the daily digest to see what's being investigated and what's being missed
Next Steps
- Configure Alert Rules — Set up auto-investigation rules
- Investigation Lifecycle — What happens during an investigation