zkSync Releases Update on Network Downtime, Improves Monitoring System

The zkSync team released an update on yesterday's zkSync Era outage, announcing it has improved the monitoring system. The team reminded the public that the Era mainnet is still in Alpha. It cannot completely rule out occasional hiccups in the short term, but it is "constantly refining and improving the system" to decrease the number of incidents.

The zkSync Era network was down from 1:52 to 6:02 am CET yesterday. The database for the block queue failed, causing block production to halt. The database health alert did not trigger because it could not connect to the database to collect metrics. The server API was unaffected, and transactions continued to be added to the mempool. Although the team had comprehensive monitoring, logging, and alerting in place across all components, no alerts were triggered because the API remained functional. In addition, the problem occurred at around 2 am for all team members, and no one was online.

The fix itself took 5 minutes to implement. The team said it has assigned a special role to the database monitoring agents, enabling them to connect to the database and continuously gather metrics even during database issues. If a database monitoring agent malfunctions, the team will be alerted. The team added that the only long-term solution for liveness and availability is the decentralization of the sequencer, which will be a critical priority for its engineering team.
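
The failure mode described above, where a health alert stays silent because its own metrics collection fails, can be illustrated with a minimal Python sketch. This is not zkSync's implementation. The names `probe_database`, `should_alert`, `Heartbeat`, and the `accepting_writes` metric are all hypothetical, chosen only to show the idea of treating "could not collect metrics" as an alert condition and of alerting when the monitoring agent itself goes quiet.

```python
import time
from dataclasses import dataclass
from typing import Callable


@dataclass
class ProbeResult:
    reachable: bool  # could the agent collect metrics at all?
    healthy: bool    # did the collected metrics look healthy?
    detail: str


def probe_database(collect_metrics: Callable[[], dict]) -> ProbeResult:
    """Run one health probe. `collect_metrics` is a hypothetical callable
    that returns a metrics dict or raises if the database is unreachable."""
    try:
        metrics = collect_metrics()
    except Exception as exc:
        # A failed collection is reported as a result, not swallowed.
        return ProbeResult(False, False, f"metrics collection failed: {exc}")
    # `accepting_writes` is an assumed metric name used for illustration.
    healthy = bool(metrics.get("accepting_writes", False))
    detail = "ok" if healthy else "database reports unhealthy"
    return ProbeResult(True, healthy, detail)


def should_alert(result: ProbeResult) -> bool:
    """Alert unless the database is both reachable and healthy."""
    return not (result.reachable and result.healthy)


class Heartbeat:
    """Dead-man's switch for the monitoring agent: if the agent stops
    reporting for longer than `max_silence_s`, it is considered stale."""

    def __init__(self, max_silence_s: float,
                 clock: Callable[[], float] = time.monotonic):
        self.max_silence_s = max_silence_s
        self.clock = clock
        self.last_beat = clock()

    def beat(self) -> None:
        self.last_beat = self.clock()

    def is_stale(self) -> bool:
        return self.clock() - self.last_beat > self.max_silence_s
```

In this sketch an unreachable database raises an alert instead of producing no signal, and a separate watcher can call `is_stale()` to flag a monitoring agent that has stopped reporting.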

TokenInsight is dedicated to covering the most important and cutting-edge trends in the world of crypto. If you have information to share with us, please feel free to contact our email news@tokeninsight.com. Your trust will be well respected.
