Network Outage
The incident has been resolved.
This incident has been resolved.
Based on our initial analysis with AWS, this incident was caused by an automated certificate update for the ElastiCache middleware.
At 02:58 AM on October 29 (Beijing Time), AWS initiated an automated certificate update for our ElastiCache instances. During this process, the primary and replica nodes of the ElastiCache cluster experienced issues, preventing backend services from accessing the component.
We have raised two critical issues with AWS Support:
AWS Support has escalated these issues to their internal engineering team for a detailed root cause analysis. We will provide further updates as soon as we receive more information from AWS.
Between 08:02 and 11:52 UTC+8, some users experienced intermittent issues accessing our cloud services.
After a joint investigation with our cloud provider, AWS, we have confirmed the root cause was network instability from the carrier, Cogent. Access requests routed through the Cogent network were subject to timeouts and packet loss.
Due to several recent incidents involving this provider, AWS has proactively rerouted traffic away from Cogent to alternative network paths. This action significantly mitigates the risk of similar disruptions in the future.
Cogent Network Status: https://ecogent.cogentco.com/network-status
Following a joint investigation with our cloud provider, AWS, we have confirmed the root cause was network instability from the carrier, Cogent. This instability caused access requests routed through the Cogent network to experience timeouts and packet loss.
This is the same issue detailed in another of our incident reports: https://status.bambulab.com/incidents/s6wds2nqyght
This incident has been resolved. It lasted about 10 minutes.
This incident has been resolved.
Due to a surge in traffic on makerworld.com at 17:55 UTC, users experienced slow loading times, and some pages did not display correctly. The issue lasted approximately 20 minutes.
This incident has been resolved.