- Weekly engineering health reviews
- Investigating whether the collector is dropping data
- Sharing collector status with stakeholders who don’t have Obsy access
Generating a report
- Click Generate report.
- Select the cluster and the observability platform to read metrics from.
- Click Generate.
Report sections
Collector health summary
An overall status (Healthy / Degraded / Critical) based on error rates and queue depth.Ingestion metrics
| Metric | Description |
|---|---|
| Spans accepted | Total spans received by the gateway |
| Spans rejected | Spans dropped by the memory limiter |
| Spans exported | Spans successfully sent to the platform |
| Metrics accepted / exported | Same for metric data points |
| Logs accepted / exported | Same for log records |
Export health
Shows the ratio of sent vs failed exports per signal type. A high failure ratio indicates a problem with the exporter config or the upstream platform’s ingestion endpoint.Resource usage
| Metric | Description |
|---|---|
| Memory RSS | Gateway process memory usage |
| CPU seconds | Gateway CPU time |
| Queue size | Current number of items waiting in the export queue |
Golden signal coverage
For each golden signal (latency, traffic, errors, saturation), shows whether the collector is receiving and forwarding data for it. A missing signal means your services aren’t instrumented for it.Viewing past reports
Reports are stored and accessible from the Reliability Reports list. Click any report to view or share it. Each report has a permalink suitable for sharing with stakeholders.Datadog metric naming
Obsy queries Datadog using the native OTel metric names (without the_total suffix):
_total suffix when converting OTLP monotonic sums to Datadog counters. If you’re building your own Datadog queries for OTel metrics, use the names without _total.