Comment: Re:Why? (Score 1) 152
That "just" leaves the DNS system as a possible valid target. You know, the system that's probably more important than SSL.
Comment: Re:Who is "Versign"? (Score 1) 85
Comment: Re:Who is "Versign"? (Score 4, Informative) 85
Until late 2010, Verisign also ran the dominant SSL business. That red circle with the black digitized check at the bottom of your bank's web page? Yeah, that. The SSL business was sold to Symantec, are are trying to slowly rebrand. For the security of the internet, SSL is also kinda important.
Comment: Re:Fun at scale. (Score 1) 500
So google's doing it wrong? So is Yahoo? They shouldn't have so many computers? How about Amazon? Supercomputers are all doing it wrong?
It's called "running a successful service at internet scale". And it's a really good gig if you can get it
Comment: Re:Fun at scale. (Score 1) 500
A second specific comment
The configuration of a system is much more complex than most configuration management tools consider. The tools generally limit themselves to the list of things a "sane" person would change.
The list of things that actually affect the running of your system is much, much larger.
- Libraries. Did you hand-jam in a specific openssl version for some application?
- Programs. Did you hand-upgrade openssh on one system?
- /usr/local. Is it in the path of a shell script used to launch a service? Is everything under it managed?
- Permissions. Did someone do "chmod -r" somewhere they should not have?
If you write rules in puppet to handle all of that, your set of rules blows up to be insanely detailed, long, and completely unmanagable.
But the reinstall handles it all. In an automated, scripted fashion that allows you to easily change what you need.
Seriously people. Cobbler & similar install servers. They need to be part of any large scale host management. And since they are already there, are easy to leverage into being a large part of your large scale host management. And then reinstalling the server is the sane solution.
Comment: Re:Fun at scale. (Score 1) 500
cfengine, puppet, chef et all are in the set of acceptable solutions. And if you have per-host information you care about keeping, superior to blindly reimaging.
But why do you have per-host information? Per-host information (log files, or important data on local storage) is an inherent management pain. The best answer is to keep that to the minimum set of hosts possible, and use coarse tools on the majority. Then you're manually managing 2 hosts, and bulk managing 998. Which is a cubic ton better than manually managing most of 1000 hosts. (remote syslog is your friend.)
(Upgrade? Really? Um, no. Reinstall. Again, you have to be able to reinstall quickly and accurately. And since you can do that, why not do that?)
Comment: Re:Fun at scale. (Score 1) 500
You should.
And then, once you figure out what's wrong, you should reimage the box to fix it. Yes, I'd dead serious. Manually futzing with one box ("configuration drift") is farther up the list of reasons why things break than you would believe.
Comment: Fun at scale. (Score 1) 500
You have 1000 servers. You need to upgrade them to RHEL 6. Do you put a DVD in each of 1000 DVD drives?
NO!
You use an image server. Kickstart. Cobbler. Figure out how the new image looks like, and then pxeboot 1000 servers. That goes much faster. (to the sysadmin above, reimaging a server should take 25 minutes, most of which is spent surfing slashdot, not an hour).
So now, you've got a server that's misbehaving. One of 1000. Out of pure coincidence, honest, the one server you were manually futzing with last week, but that can't possibly be connected. Fixing that server yourself will cause more "configuration drift", and leave you with one server that's still different than the 999 other servers. And hey, that image server is still on your network. Just reimage the thing.
It's popular because it's the answer that scales. kthxbye.