Become a fan of Slashdot on Facebook

 



Forgot your password?
typodupeerror
Check out the new SourceForge HTML5 internet speed test! No Flash necessary and runs on all devices. Also, Slashdot's Facebook page has a chat bot now. Message it for stories and more. ×
Facebook

Submission + - Making Facebook Self Healing (facebook.com)

djeps writes: I've used to achieve this with Nagios Event Handlers scripts and RabbitMQ. But facebook has done it for a far larger scale than my old days of sysadmin: When your infrastructure is the size of Facebook’s, there are always broken servers and pieces of software that have gone down or are generally misbehaving. In most cases, our systems are engineered such that these issues cause little or no impact to people using the site. But sometimes small outages can become bigger outages, causing errors or poor performance on the site. If a piece of broken software or hardware does impact the site, then it's important that we fix it or replace it as quickly as possible. Even if it's not causing issues for users yet, it could in the future so we need to take care of it quickly.
This discussion was created for logged-in users only, but now has been archived. No new comments can be posted.

Making Facebook Self Healing

Comments Filter:

Mr. Cole's Axiom: The sum of the intelligence on the planet is a constant; the population is growing.

Working...