Cloud Service Outages Highlight Resilience Needs in Development
Summary
Hide ▲
Show ▼
Cloud service disruptions are increasingly affecting developers and DevOps teams, highlighting the need for resilience in software development workflows. Recent outages, such as those experienced by Anthropic's Claude.ai and GitHub, have underscored the potential impact of cloud service failures on development processes. These incidents have led to calls for better preparedness and resilience strategies to mitigate the effects of future outages. The reliance on cloud-based tools, including CI/CD pipelines, IDEs, and AI-assisted coding platforms, creates a systemic single point of failure. Minor outages can cascade across multiple teams and projects, halting development pipelines and delaying releases. Experts recommend implementing local-first workflows, designing fallbacks and failovers, and caching dependencies to build resilience. Overall, DevOps services have been reliable, but incidents have occurred. For example, Azure DevOps experienced 74 incidents in the first half of 2025, including a significant performance degradation lasting 159 hours. GitHub saw a 58% increase in incidents year-over-year, with 17 major outages totaling over 100 hours of disruption.
Timeline
-
25.09.2025 16:39 1 articles · 2h ago
Anthropic's Claude.ai and Console Services Experience System-Wide Outage
On September 10, 2025, Anthropic's Claude.ai and Console services, along with the associated API, suffered a system-wide outage lasting about 30 minutes. This outage highlighted the potential impact of cloud service failures on development processes and led to calls for better preparedness and resilience strategies. The outage caused users to joke about the downtime, with some comparing it to 'it's compiling' and others noting the need to code like 'a caveman.' This incident, along with other recent outages, has underscored the need for development teams to implement resilience strategies to mitigate the effects of future disruptions.
Show sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
Information Snippets
-
Anthropic's Claude.ai and Console services experienced a system-wide outage on September 10, 2025, lasting about 30 minutes.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
-
GitHub reported degraded performance across multiple services in July 2025, leading to about 4% of requests failing.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
-
The Shai-Hulud worm compromised over 500 packages in the Node package manager (npm) ecosystem, causing significant disruption.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
-
Azure DevOps suffered 74 incidents in the first half of 2025, including a performance degradation lasting 159 hours.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
-
GitHub incidents rose 58% year-over-year in the first half of 2025, with 17 major outages totaling over 100 hours of disruption.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
-
GitLab faced 59 incidents in the first half of 2025, with a total of 1,346 hours of disruption.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
-
GitLab has maintained a 99.8% uptime service-level objective, with no significant site-wide outages in the past five years.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
-
Experts recommend implementing local-first workflows, designing fallbacks and failovers, and caching dependencies to build resilience.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39
-
Development teams should focus on making workflows resilient, adding redundancies, having backups, and having alternative test-and-build environments.
First reported: 25.09.2025 16:391 source, 1 articleShow sources
- How Cloud Service Disruptions Are Making Resilience Critical for Developers — www.darkreading.com — 25.09.2025 16:39