What Happened?
At approximately 11:15 am EST on Monday, March 2, 2026, core services in Canada experienced a system-wide downtime. The incident was caused by database connection exhaustion. The team suspects this was caused by a background process responsible for resetting user statuses (transitioning Hypercare users from “Unavailable” or “Busy” statuses back to “Available” after a set expiry date and time) generated a high volume of sustained, long-lived connections. The database has a fixed limit on permissible concurrent connections and these hanging connections saturated the pool, preventing any new requests from being processed.
Impact
All Hypercare services were inaccessible for Canadian users from approximately 11:15 am EST until 12:22 pm EST on Monday, March 2, 2026.
Resolution and Next Steps
The Engineering team restored services by manually terminating stalled connections and increased the capacity on the database pool. The team disabled the automated status reset feature to stabilize the environment and allow core services to resume normal operation. The automated status reset feature has been running intermittently in a controlled environment to ensure users status’ are reset appropriately..
To reduce the chances of a recurrence and improve our response time, the following actions are being taken:
We apologize for the disruption caused by today’s unplanned downtime. We thank everyone for their patience and continued support.