Date: August 21, 2025
Duration: 17:26 - 19:27 UTC
Status: Resolved
On August 21, 2025, our platform experienced elevated errors and high latency across several services, including sign-in and API requests. The disruption was caused by network link saturation between one of our upstream providers and our cloud hosting environment. This particular issue with our network provider created a widespread disruption of internet traffic.
During the incident, some customers encountered difficulties logging in with username/password or SSO, along with delays and intermittent errors in the API, Dashboard, Admin Portal, and Directory Sync. While the extent varied over time, a meaningful portion of requests were degraded or failed entirely.
The disruption stemmed from connectivity issues between our upstream network provider and our cloud infrastructure provider. This caused elevated latency, timeouts, and service unavailability until traffic conditions were stabilized.
17:26 UTC – Elevated errors observed and internal incident created
17:47 UTC - Root cause identified as a network issue with our upstream provider
17:52 UTC - Team begins work on mitigation of secondary impacts due to increased number of open requests, while monitoring network conditions
19:27 UTC - Problem with upstream network provider is resolved and response times return to normal
Our engineering team investigated multiple paths to remediation while closely monitoring the upstream provider’s recovery efforts. Service performance returned to normal once network stability was restored, after which we transitioned to monitoring to ensure ongoing stability.
To improve resilience and reduce the likelihood of similar issues in the future, we are:
We regret the significant impact this incident had on you and your customers. We are committed to implementing lasting improvements to ensure greater stability going forward, and will keep you updated on our progress.