Complete outage in prod-me-central-1

Incident ongoing

Investigating

We are continuing to investigate this issue.

1776697906

Investigating

We have not received any further updates from AWS at this time. However, we are actively monitoring the outage and will provide additional information as it becomes available. Also, please continue to refer to the AWS status page for more detailed updates. https://health.aws.amazon.com/health/status

All the guidance previously included about stack migration is still relevant. Please reach out to our Support team if you have any questions.

1773922427

Investigating

We are actively monitoring the situation, but at this time there are no new updates to share. The next update will be provided once we have more information to share. Please reach out to our Support team if you have any questions.

1772662950

Investigating

We are continuing to investigate this issue.

1772620115

Investigating

Please continue to refer to the AWS status page for more detailed updates specific to AWS. https://health.aws.amazon.com/health/status

AWS are recommending that affected customers move workloads to alternate regions, and we are recommending the same.

Customers who are impacted and who cannot wait for a restoration of service are asked to:

Create a Grafana Cloud stack in an alternate region

Update clients to send telemetry to the new region, if using Grafana Alloy then you can use Fleet Management https://grafana.com/docs/grafana-cloud/send-data/fleet-management/introduction/

If your instance remains available and you have not configured your dashboards as code, then you may be able to use grafanactl to migrate dashboards https://grafana.com/docs/grafana/latest/as-code/observability-as-code/grafana-cli/grafanacli-workflows/ https://grafana.github.io/grafanactl/

We are continuing to work with our CSP at this time, and will provide updates as they are available.

1772489897

Investigating

AWS are recommending that affected customers move workloads to alternate regions https://health.aws.amazon.com/health/status and we are recommending the same.

Customers who are impacted and who cannot wait for a restoration of service are asked to:

Create a Grafana Cloud stack in an alternate region

Update clients to send telemetry to the new region, if using Grafana Alloy then you can use Fleet Management https://grafana.com/docs/grafana-cloud/send-data/fleet-management/introduction/

If your instance remains available and you have not configured your dashboards as code, then you may be able to use grafanactl to migrate dashboards https://grafana.com/docs/grafana/latest/as-code/observability-as-code/grafana-cli/grafanacli-workflows/ https://grafana.github.io/grafanactl/

We will provide updates when we have them, but we do not have an expected resolution time at this point.

1772447473

Investigating

Customers are recommended to configure a new blank stack in an alternative Grafana Cloud region and to reconfigure their clients (such as Grafana Alloy) to send telemetry to that region, Fleet Management can be used for this purpose https://grafana.com/docs/grafana-cloud/send-data/fleet-management/introduction/

1772445863

Investigating

We are updating this incident to reflect a complete outage in prod-me-central-1, due to an on-going AWS UAE data center issue. We will provide further updates accordingly.

1772440575

Investigating

We are observing write and read outage errors across all databases (metrics, logs, traces) in prod-me-central-1, due to an on-going AWS UAE data center issue. We will provide further updates accordingly.

1772439688

Investigating

We are observing write and read outage errors across all databases (metrics, logs, traces) in prod-me-central-1, due to an on-going AWS UAE data center issue. We will provide further updates accordingly.

1772439249

Investigating

We are seeing elevated write and read path errors in prod-me-central-1, due to an on-going AWS UAE data center issue. We will provide further updates accordingly.

1772433809