Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
Cloud

Submission + - Amazon's Christmas Eve Outage Teaches Recovery Lessons->

Nerval's Lobster writes: "Amazon’s explanation for the problem that took down Netflix and other sites on Christmas Eve: human error. The Web giant blamed an unnamed developer who ran a maintenance process against state data used by the company’s Elastic Load Balancers, or ELBs. That mistake cascaded into other areas. At its peak, 6.8 percent of the company’s ELBs were affected—which might not sound like a lot, but they were balancing loads across multiple servers. Netflix was forced to apologize for the outage, publicly pinning the blame on AWS infrastructure. Amazon’s mea culpa highlights two areas in which the company can improve: access to its infrastructure, and disaster recovery (even if that disaster was self-inflicted)."
Link to Original Source
This discussion was created for logged-in users only, but now has been archived. No new comments can be posted.

Amazon's Christmas Eve Outage Teaches Recovery Lessons

Comments Filter:

The steady state of disks is full. -- Ken Thompson

Working...