Creating monitors
- Go to OTel Collector and click Create Monitors on the cluster card.
- Select the observability platform (Datadog or Grafana).
- Select the namespace where the collector is installed (default:
obsy-system). - Optionally select a Slack channel — monitor alerts will be posted there.
- Click Create Monitors.
@webhook-obsy notification handle so alerts route back to Obsy for RCA and incident creation.
The 11 monitors
Gateway health
| Monitor | What it detects |
|---|---|
| Gateway pod health | Gateway pod is not running or restarting frequently |
| High span export failure rate | More than 5% of spans are failing to export |
| Export queue near capacity | Queue is more than 80% full (backpressure building) |
| High memory usage | Gateway process memory exceeds 1.5 GB |
| Memory limiter refusing spans | Memory limiter is dropping incoming spans |
Node collector health
| Monitor | What it detects |
|---|---|
| Node collector pod health | One or more node collector DaemonSet pods are not running |
| High metric export failure rate | Node collector failing to export metrics |
| High log export failure rate | Node collector failing to export log records |
Ingestion health
| Monitor | What it detects |
|---|---|
| No spans received | Gateway has received zero spans for 30+ minutes (possible instrumentation break) |
| No metrics received | Node collector has received zero metrics for 30+ minutes |
| No logs received | Node collector has received zero log records for 30+ minutes |
Notification routing
All 11 monitors include@webhook-obsy as a notification handle. When any monitor triggers:
- Datadog/Grafana sends the alert to your Obsy webhook URL
- Obsy creates an Alert entity and runs RCA
- If auto-create rules match, an incident is opened
- If Slack is configured, a message is posted to your incident channel
@slack-{channel-name} is also added to each monitor’s notification list — alerts go to both Obsy and Slack simultaneously.