Incident History

Tempo Write Path Failing

This incident has been resolved.

1732288715 - 1732298605 Resolved

Support Ticket Portal Unavailable

This incident has been resolved.

1732208680 - 1732214021 Resolved

Severe degradation of certain tests

This incident has been resolved.

1732088510 - 1732100481 Resolved

k6 test data processing severely degraded

This incident has been resolved.

1732052746 - 1732054779 Resolved

Write path outage in us-central1 region

Due to this bug reported in https://github.com/kubernetes/kubernetes/issues/127370, we were affected by an issue causing K8S service endpoints not getting updated when pods are stopped/started if there are more than 1k pods matching the service. This caused a temporary outage in Mimir gossiping services, which further resulted in failures to ingest and query metrics for a short time. This issue has been resolved.

1732036353 - 1732036353 Resolved

Issues with new stack creation

This incident has been resolved.

1731973594 - 1731982110 Resolved

Adaptive Metrics Degraded Performance

We continue to observe a continued period of recovery. At this time, we are considering this issue resolved. No further updates.

1731945686 - 1731970691 Resolved

Grafana Cloud Portal Accessibility Issues

This incident has been resolved.

1731926863 - 1731928352 Resolved

Degraded dashboard performance due to the erroneous security policy

Rollback has been completed as of 17:20 UTC. At this time, we are considering this issue resolved. No further updates.

1731673295 - 1731691382 Resolved

Tempo Ingestion Disruption

This incident has been resolved.

1731593509 - 1731595453 Resolved
⮜ Previous Next ⮞