A lot of comments are critical of overuse of the clouds or AWS specifically. But for me it boils down to a couple of more specific issues that can be improved upon.
The first thing is within the customers' grasp: let's think critically about our system design in terms of third parties. The more consolidation there is in the SaaS industry, the more we'll see business' interdependence. In other words, if there is a really compelling offering for managed services that run outside your immediate control, there will be a ratcheting of probability that on any given day something in your product is broken. I know there's an age old truism about build vs buy: only build things that comprise your business' core competency and buy the rest. Authentication is a classic example. Use a third party IDP or SSO service and focus more on your own business logic. It generally makes business sense, but I think there needs to be a bit of thought given to the resiliency within our own products around these third party integrations -- can we handle this third party's outage, can we fail over or allow for a degraded experience for our end users without totally falling down ourselves? I think for Engineering departments, there's a job to do here to advocate design and advocate for a middle ground between total reliance on an integration and super expensive full redundancy or bringing all these features in-house unnecessarily.
The other issue is lurking within AWS: global services' dependence on specific regions. It looks like with at least one of the outages, we saw an outsized impact to AWS from some core functionality within AWS being wholly dependent on us-east-1 in particular. That feels counter to what AWS architects preach themselves. I recently talked with our CTO about this and he speculated that Amazon's culture of lots of small self-sufficient teams may be partly to blame. When you strongly incentivize individuals and small teams to perform at their best and forget some non-functional requirements, you may end up with this kind of tech debt build-up that bites you unexpectedly. Could there have been a reason for AWS web console to stay up despite us-east-1 having problems if it was designed better or if they put a bit more investment into its redundancy?
I suggest you track down source material and not constrain yourself to the news summary. The article links to the original BuzzAngle Music report for 2016, which mentions this:
There were 11,489 cassettes purchased during the Holiday Season (an increase of 140% over 2015).
Take an astronaut to launch.