Last night’s outage was caused by a default OS upgrade automation. While each Notehub infrastructure service is supported by multiple, redundant server instances, the automated upgrade incorrectly impacted multiple servers at the same time. This caused some key services used by the UI and API to go offline.
To insure the issue doesn’t recur, we have disabled this default automation. Then, to ensure we still maintain our OS, we will implement a custom OS upgrade automation to prevent redundant server instances going offline at the same time.