Partial Service Outage


Incident resolved in 1h55m33s

Update

Service Disruption Caused by ElastiCache Automated Certificate Rotation

Summary

Based on our initial analysis with AWS, this incident was caused by an automated certificate update for the ElastiCache middleware.

Timeline

At 02:58 AM on October 29 (Beijing Time), AWS initiated an automated certificate update for our ElastiCache instances. During this process, the primary and replica nodes of the ElastiCache cluster experienced issues, preventing backend services from accessing the component.

Next Steps & Action Items

We have raised two critical issues with AWS Support:

AWS Support has escalated these issues to their internal engineering team for a detailed root cause analysis. We will provide further updates as soon as we receive more information from AWS.

1761796189

Resolved

The incident has been resolved. We will share a detailed update shortly.

1761686678

Update

A fix has been implemented and we are monitoring the results.

1761683431

Investigating

We are continuing to investigate this issue.

1761681539

Investigating

We are currently investigating this issue.

1761679745