Delayed Cluster Operations and Degraded Performance following AWS Outage


Incident resolved in 10h22m51s

Resolved

This incident has been resolved.

1760992218

Update

We are continuing to monitor for any further issues.

1760991319

Update

Affected services are starting to recover along with AWS.

1760988630

Update

Atlas App Services is experiencing increased processing delays in the US-EAST-1 and US-EAST-2 regions along with degraded performance in Triggers updates.

We are closely monitoring the situation and the official AWS status page.

Please check the AWS Status page for the latest updates: https://health.aws.amazon.com/health/status

1760978861

Update

Atlas Data Federation is experiencing issues connecting to downstream atlas clusters.

Atlas Charts is also experiencing issues connecting to customer deployments to retrieve data to render Charts.

We are continuing to monitor for any further issues.

1760975908

Update

We are experiencing delays in provisioning and starting new Stream processors. We are also observing a slight degradation in login success rate to cloud.mongodb.com.

We are closely monitoring the situation and the official AWS status page.

Please check the AWS Status page for the latest updates: https://health.aws.amazon.com/health/status

1760974595

Update

We are experiencing delays in provisioning additional resources for Triggers apps and ability to register.

Accessing cluster data through the Data Explorer Atlas UI is also degraded.

We are closely monitoring the situation and the official AWS status page.

Please check the AWS Status page for the latest updates: https://health.aws.amazon.com/health/status

1760972714

Update

Cloud Operations remain delayed with degraded performance. Access to backing clusters is unaffected. We are also observing a degradation in monitoring visibility, with approximately 20% of metrics destined for DataDog being dropped.

We are observing signs of recovery, with provisioning speeds returning to normal in all regions outside of US-EAST-1. Provisioning within US-EAST-1 remains significantly degraded.

Next Steps: We are closely monitoring the situation and the official AWS status page.

Please check the AWS Status page for the latest updates: https://health.aws.amazon.com/health/status

1760972552

Update

AWS is experiencing an outage in the US-EAST-1 Region which is being handled as a P1 incident. This was first identified at 08:51 UTC. Cloud Operations are delayed with degraded performance. Access to backing clusters is unaffected. We are observing signs of recovery at this point in time and we're closely monitoring the AWS status page.

Please check the AWS Status page for the latest updates: https://health.aws.amazon.com/health/status

1760954847