The Downtime
Live website, cloud & network outage reports as they happen — plus practical uptime, DRaaS and load-balancing guides. Automatically tracked and corroborated from official status pages by Pingy.io.
Developing now
Latest reports & guides
Twilio Resolves SMS Delivery Issues to Etisalat UAE Subscribers
SMS delivery delays and failures affecting a subset of Twilio sender IDs to Etisalat network in the United Arab Emirates have been resolved.
Zoom Contact Center and Rooms Scheduling Service Degradation Resolved in U.S.
Zoom resolved a service degradation affecting Contact Center Services and Rooms Scheduling status display in the U.S. region.
Anthropic's Claude.ai Resolves Elevated Error Rate Incident
The AI chatbot service experienced elevated errors that have now been resolved.
GitHub Resolves Issues With Next Edit Suggestions and Completions
GitHub has resolved elevated errors affecting its Next Edit Suggestions and Completions features.
Twilio Experiences SMS Delivery Delays to MTN Nigeria; Service Recovering
SMS messages from Twilio to MTN network subscribers in Nigeria faced delays as the company identified and worked to resolve the issue.
Anthropic Resolves Elevated Error Rates Affecting Multiple Claude Models
Service disruption impacted Opus, Sonnet, and Haiku model variants; all systems now recovered.
Health Checks Done Right: Liveness vs Readiness vs Deep Checks
Learn the difference between liveness, readiness, and deep health checks — and how to implement each one correctly so your monitoring actually catches real problems.
Setting Realistic SLOs, SLAs, and Error Budgets
A practical guide to defining uptime targets that your team can actually hit, measure, and defend.
How to Design for Five-Nines (99.999%) Uptime
A practical engineering guide to the architecture, tradeoffs, and operational discipline required to hit 99.999% availability.
Datadog Resolves Metrics Queries Outage
Major incident affecting metrics queries has been resolved after fix implementation.
OpenAI resolves ChatGPT file upload and download errors
Service experienced elevated errors affecting file operations before full recovery.
Anthropic resolves elevated error rates across API models
Critical outage affected multiple models for roughly 85 minutes before resolution.