With Linux Clusters, Seeing Is Believing 208
Roland Piquepaille writes "As the recent release of the last Top500 list reminded us last month, the most powerful computers now are reaching speeds of dozens of teraflops. When these machines run a nuclear simulation or a global climate model for days or weeks, they produce datasets of tens of terabytes. How to visualize, analyze and understand such massive amounts of data? The answer is now obvious: using Linux clusters. In this very long article, "From Seeing to Understanding," Science & Technology Review looks at the technologies used at Lawrence Livermore National Laboratory (LLNL), which will host the IBM's BlueGene/L next year. Visualization will be handled by a 128- or 256-node Linux cluster. Each node contains two processors sharing one graphic card. Meanwhile, the EVEREST built by Oak Ridge National Laboratory (ORNL), has a 35 million pixels screen piloted by a 14-node dual Opteron cluster sending images to 27 projectors. Now that Linux superclusters have almost swallowed the high-end scientific computing market, they're building momentum in the high-end visualization one. The article linked above is 9-page long when printed and contains tons of information. This overview is more focusing on the hardware deployed at these two labs."
Realisation about this procedure (Score:5, Funny)
Re:Realisation about this procedure (Score:2, Funny)
Re:Realisation about this procedure (Score:2, Funny)
Re:Realisation about this procedure (Score:1)
Re:Realisation about this procedure (Score:3, Funny)
uh-oh, more bad pick-up lines for Linux Geeks:
"you don't need to imagine how big my beowulf cluster is"
Re:Realisation about this procedure (Score:4, Funny)
And it's always the guys with small clusters who say that size doesn't matter?
Re:Realisation about this procedure (Score:2)
Mac OS X has similar benefits (Score:4, Interesting)
By contrast, NCSA's surprise entry in November 2003's list, Tungsten, achieved 9.82 Tflops for $12M asset cost.
Double the cost, for a Top 100 supercomputer's-worth lower performance.
And it wasn't because Virginia Tech had "free student labor": it doesn't take $6M in labor to assemble a cluster. Even if we give it an extremely, horrendously liberal $1M for systems integration and installation, System X is still ridiculously cheaper.
I know there will be a dozen predictable responses to this, deriding System X, Virginia Tech, Apple, Mac OS X, linpack, Top 500, and coming up with one excuse after another. But won't anyone consider the possibility that these Mac OS X clusters are worth something?
Re:Mac OS X has similar benefits (Score:1, Insightful)
Re:Mac OS X has similar benefits (Score:2)
Re:Mac OS X has similar benefits (Score:1)
Re:Mac OS X has similar benefits (Score:4, Informative)
Re:Mac OS X has similar benefits (Score:5, Insightful)
We at Terra Soft have just released Y-HPC, our version of Yellow Dog Linux, with a full 64-bit development environment, and a bunch of cluster tools built in.
I'm not much of a marketting drone, but being as I am part of the Y-HPC team, I had to put a shameless plug in. Bottom line is, it kicks OSX's ass any 2 ways you look at it.
Y-HPC [terrasoftsolutions.com]
Re:Mac OS X has similar benefits (Score:3, Insightful)
I've tried installing YDL on a small G5 cluster. It was a PITA to get running (3 installs before I was able to get the X server running right). And still I can't find any fan control. After 5 minutes the fans spool up to "ludicrous speed" and stick there.
I really want to like YDL. I've been talking to the folks who do OSCAR about trying to get OSCAR to support YDL. But I'm not sure how it will work out yet, at least until I can figure out how to turn down the fans!
Re:Mac OS X has similar benefits (Score:4, Informative)
Fan control was integrated into the kernel over a month ago, and is most definitelly in the first version we released last week.
We have also developed a nice pretty installer for the head node in a cluster, and wrote Y-Imager (front end for Argonne's System Imager), to automate the building of compute nodes in a cluster.
No offense taken
Re:Mac OS X has similar benefits (Score:4, Insightful)
Re:Mac OS X has similar benefits (Score:2)
Re:Mac OS X has similar benefits (Score:2, Funny)
C'mon, Steve...
Re:Mac OS X has similar benefits (Score:5, Informative)
By contrast, NCSA's surprise entry in November 2003's list, Tungsten, achieved 9.82 Tflops for $12M asset cost.
When I looked here [uiuc.edu], I found this: ``Tungsten entered production mode in Novermber 2003 and has a peak performance of 15.36 teraflops (15.36 trillion calculations per second).''
To me, that looks faster than System X, not slower.
Let's see: NCSA stands for ``National Center for Supercomputing Applications''. ``NCSA [uiuc.edu] is a key partner in the National Science Foundation's TeraGrid project, a $100-million effort to offer researchers remote access ...''
Looks as if the NCSA has a huge budget. I'd guess that ``gold-plated everything'' and ``leave no dollars unspent'' are basic specs for everythig they buy.
What can we learn about Virginia Tech? How about this [vt.edu]:
In addition to the volunteer labor, I'd guess that Virginia Tech had very different design goals, in which price was a factor. NCSA's bureaucracy probably accounted for a lot of those extra $6M they spent. Different designs and goals probably had a lot to do with the rest of the price, but I suspect that a bureaucratic procurement process was the main cause for the higher price of the Xeon system.Yes, System X and the Apple hardware is pretty neat, but don't use the price/performance ratio of these two systems as a metric for the relative worth of Linux and OSX clusters.
It's unfair and meaningless to compare volunteer labor and academic pricing and scrounging on a limited budget to bureaucratic design, bureaucratic procurement and an unlimited budget.
Rpeak, not Rmax (Score:5, Insightful)
The speed you quoted is the theoretical peak, not the actual maximum achieved in a real world calculation (like the Top 500 organization's use of Linpack).
System X's equivalent theoretical peak is 20.24 TFlops.
I'm also not indicting Linux clusters in the least; they've clearly shown they can outperform traditionally architected and constructed supercomputers for many tasks, with the benefit of using commodity parts - at commodity pricing. All I'm saying is that there's a new player here, and it's a real contender, and has done a lot for very little money...which was the whole goal of Linux clusters in this realm in the first place.
(Also, as I said, the volunteer labor model is irrelevant - let's just pretend it was professionally installed for an additional $1M, or even $2M if that would satisfy you. It's still several million dollars cheaper, and 3Tflops greater performance. These are BOTH rackmount clusters with similar amounts of nodes and processors, running a commodity OS with fast interconnects. There are differences, yes, and perhaps even differences in goals. But looking past that, price/performance for something like this is still an important metric.)
Re:Rpeak, not Rmax (Score:2)
My point is that volunteer labor is only the beginning of the price difference between the two systems. The big, federally-funded bureaucracy and the departmentally-funded state university project have very different ways of doing things, and I'm only surprised that the cost and performance difference wasn't
Re:Mac OS X has similar benefits (Score:2)
I also seem to recall that Apple gave VT an *exceptionally* good deal on the hardware -- basically at cost. Any money Apple loses on the deal is a tax writeoff as either an advertising expense or charitable contribution. If you built an identical cluster and had to pay full retail for the boxes, I guarantee you'll spend a WHOLE lot more than VT did.
The NCSA, on the other hand, is a federal agency and therefore any commodity boxes they buy a
Re:Mac OS X has similar benefits (Score:2, Insightful)
Vendors will often give away hardware in order to
Apple gave away nothing (Score:2)
Re:Apple gave away nothing (Score:2)
The initial cluster cost $5.2M.
Apple took back the Power Mac G5 tower systems that were a part of the initial cluster.
For an *additional* $600,000, they replaced it with Xserve G5s.
1100 dual Xserve G5s - today - costs $2.8M. So, with all their other stuff (RAM, storage, Infiniband interconnect, ethernet switches, etc.) the prices they've quoted since the inception of this project are more than reasonable. Additionally, VT has said publicly numerous times (no, I'm not going to search
Re:Apple gave away nothing (Score:2)
VT isn't lying, but Apple has taken a
Re:Apple gave away nothing (Score:2)
That was INCLUDED in the original $5.2M. And I even referenced all of that (interconnect, etc.) in my last paragraph.
Re:Mac OS X has similar benefits (Score:2)
Don't forget that Mellanox *donated* 24 mts9600 infiniband switches. At $58,000 a piece, you've got $1.4 million worth of equipment for free.
Re:Mac OS X has similar benefits (Score:2)
VT already switched to Xserve G5s (Score:2)
Re:Mac OS X has similar benefits (Score:5, Insightful)
Your right!
1st, System X or the "Big Mac" was thrown together so that people like us would talk about it and to get a good standing for the November 2003 top 500 list. They did an excellent job at this.
Now for some reality. The system is not yet operational.
When it was first thown together, everyone "in the know" and myself questioned how this was going to work without a reliable memory subsystem, and the VT people responded that they were going to write software to correct any hardware errors, and we said OK, whatever. Then, they said, hmm, we kinda needa a reliable memory subsystem, so lets rip out all 1,100+ machines and start over with these new Xserve boxes that have ECC memory in them.
This system has not come up yet with the new Xserves, according to their website [vt.edu].
Now, I'm going to make a comment on Linpack. Linpack, like all good benchmarks are really good at measuring that benchmark's performance. Linpack is a good benchmark, but it is also a benchmark that does not require much RAM per node to run. Some applications do need a good amount of RAM/node to run and being that RAM costs $$, the cost adds up very quickly, and the cost/cpu/teraflop goes down accordingly.
With the comparison between System X and Tungsten NCSA cluster. Personally, I don't know why the Tungsten cluster cost more because the Mac cluster has more RAM/node and each node should have been cheaper in general. The NCSA cluster uses Myrinet which I know is expensive, but I do not know that in comparison to the Infiniband equipment on the Macs. Supposedly, the Infiniband interconnects were what got System X on the top500 list with such good results, or at least that is what the head of the project told me.
Although its popular here on slashdot because many of the readers are younger and inexperienced (and have no money) that they praise anything that costs less and extra brownie points go towards an underdog like AMD or Linux, however in the real world people actually will pay extra for something to ensure that it works. Working equipment may seem superfluous to the dorm room Linux guru, but trust me, I know what its like to work with equipment that cost about $1 mil and it doesn't work. We could have gone with the 2nd bidder at $1.2 mil and it would have worked. Yes, we "saved" $200,000, but we also wasted well over $500,000 when one considers that over 50% of the equipment is faulty and many people's time has been wasted.
Yeah, VT really didn't do anything... (Score:4, Interesting)
You dropped something (Score:2, Funny)
How to write complete sentences?
Re:You dropped something (Score:1)
not suitable for the Slashdot demographics (Score:3, Funny)
I hope the poster doesn't actually expect any of us to post any meaningful comments based on having read that article, it's a lost cause.. At least on me.
Re:not suitable for the Slashdot demographics (Score:1)
Is that US or metric tons? (Score:5, Funny)
Damn! What kind of paper stock are you printing on?
Re:Is that US or metric tons? (Score:3, Funny)
Re:Is that US or metric tons? (Score:3, Funny)
Roland Piquepaille and Slashdot (Score:5, Interesting)
I think most of you are aware of the controversy surrounding regular Slashdot article submitter Roland Piquepaille. For those of you who don't know, please allow me to bring forth all the facts. Roland Piquepaille has an online journal (I refuse to use the word "blog") located at www.primidi.com [primidi.com]. It is titled "Roland Piquepaille's Technology Trends". It consists almost entirely of content, both text and pictures, taken from reputable news websites and online technical journals. He does give credit to the other websites, but it wasn't always so. Only after many complaints were raised by the Slashdot readership did he start giving credit where credit was due. However, this is not what the controversy is about.
Roland Piquepaille's Technology Trends serves online advertisements through a service called Blogads, located at www.blogads.com. Blogads is not your traditional online advertiser; rather than base payments on click-throughs, Blogads pays a flat fee based on the level of traffic your online journal generates. This way Blogads can guarantee that an advertisement on a particular online journal will reach a particular number of users. So advertisements on high traffic online journals are appropriately more expensive to buy, but the advertisement is guaranteed to be seen by a large amount of people. This, in turn, encourages people like Roland Piquepaille to try their best to increase traffic to their journals in order to increase the going rates for advertisements on their web pages. But advertisers do have some flexibility. Blogads serves two classes of advertisements. The premium ad space that is seen at the top of the web page by all viewers is reserved for "Special Advertisers"; it holds only one advertisement. The secondary ad space is located near the bottom half of the page, so that the user must scroll down the window to see it. This space can contain up to four advertisements and is reserved for regular advertisers, or just "Advertisers". Visit Roland Piquepaille's Technology Trends (www.primidi.com [primidi.com]) to see it for yourself.
Before we talk about money, let's talk about the service that Roland Piquepaille provides in his journal. He goes out and looks for interesting articles about new and emerging technologies. He provides a very brief overview of the articles, then copies a few choice paragraphs and the occasional picture from each article and puts them up on his web page. Finally, he adds a minimal amount of original content between the copied-and-pasted text in an effort to make the journal entry coherent and appear to add value to the original articles. Nothing more, nothing less.
Now let's talk about money. Visit http://www.blogads.com/order_html?adstrip_category =tech&politics= [blogads.com] to check the following facts for yourself. As of today, December XX 2004, the going rate for the premium advertisement space on Roland Piquepaille's Technology Trends is $375 for one month. One of the four standard advertisements costs $150 for one month. So, the maximum advertising space brings in $375 x 1 + $150 x 4 = $975 for one month. Obviously not all $975 will go directly to Roland Piquepaille, as Blogads gets a portion of that as a service fee, but he will receive the majority of it. According to the FAQ [blogads.com], Blogads takes 20%. So Roland Piquepaille gets 80% of $975, a maximum of $780 each month. www.primidi.com is hosted by clara.net (look it up at http://www.networksolutions.com/en_US/whois/index. jhtml [networksolutions.com]). Browsing clara.net's hosting solutions, the most expensive hosting service is their Clarahost Advanced (http://ww [clara.net]
Re:Roland Piquepaille and Slashdot (Score:2, Offtopic)
Oh, please, you give Roland WAY too much credit. He doesn't add any original content. He just copies and pastes.
Re:Roland Piquepaille and Slashdot (Score:1, Funny)
Re:Roland Piquepaille and Slashdot (Score:3, Informative)
Are you actually Roland Piquepaille? If so, that's a really neat trick to move traffic to that site. If not, then he may be thankful for your comment, after all :-)
30 accepted stories since August 29th, 2004! (Score:5, Interesting)
Re:Roland Piquepaille and Slashdot (Score:2)
Re:Roland Piquepaille and Slashdot (Score:2, Insightful)
Re:Roland Piquepaille and Slashdot (Score:2)
Re:Roland Piquepaille and Slashdot (Score:5, Funny)
Re:Roland Piquepaille and Slashdot (Score:2)
Re:Roland Piquepaille and Slashdot (Score:2)
Re:Roland Piquepaille and Slashdot (Score:2)
Whenever I submit a story, I am serving Roland and not asking for anything in return, as is every other poster, why should he be special?
I think I hav
Re:Roland Piquepaille and Slashdot (Score:2)
FYI -- Here are a couple of ways to avoid the NYT log-in:
http://nytimes.blogspace.com/genlink
http://www.bugmenot.com/
Re:Roland Piquepaille and Slashdot (Score:2)
Maybe so, my main gripe with Roland comes from Copyright issues. As the AC put in his post, Roland has a habit of copying and pasting his entire article. On one hand, he usually does give credit to where he lifted his text. On the other hand, giving cr
Wow! (Score:3, Funny)
Obligatory (Score:3, Funny)
Re:Wow! (Score:2)
When Harlie was one [amazon.com] was the first book that I recall about a computer that designed another more complex computer that only it could understand.
Maybe Harlie was a Linux cluster.
Collection of interesting visualization samples (Score:1)
Re:Wow! (Score:2)
Ok, now we just need another supercomputer to test the supercomputer the supercomputer built us to interpret the output of the supe
Re:Wow! (Score:2)
Re:Wow! (Score:2)
Big Screen! (Score:2)
Re:Big Screen! (Score:3, Informative)
I know you're joking, but since I'm the hardware architect for the LLNL viz effort, I'll bite anyway. :-)
Here's what you'll need at minimum:
Re:Big Screen! (Score:2)
Well, you could use projectors to get a seamless screen from XP's built-in multi-monitor capability. I believe that the number is 10 screens simultaneous. This provides for a 3x3 matrix and an extra for controlling the damn thing. But you'll probably only get your hands on 1024x768 (786k pixels) so 9 would amount to 7Mpixel.
You'll probably have to wait on that 35Mpixel screen if you want borderless. Otherwise, go get yourse
Regarding the story title (Score:3, Funny)
Does this mean that we don't have to just imagine a Beowulf cluster anymore?
You are correct, sir (Score:4, Funny)
Finally.... (Score:3, Funny)
You would think so (Score:3, Informative)
Re:You would think so (Score:2, Informative)
Re:You would think so (Score:2)
Fuck Roland Piquepaille (Score:5, Insightful)
Re:... Roland Piquepaille (Score:2, Insightful)
We all know that's not true.
Re:Fuck Roland Piquepaille (Score:2)
If junkbuster drops all the blogspot.com content, roland pays for the bandwidth, but doesn't get the payoff. Filtering proxies can be a very good thing...
http://www.junkbuster.com
And to think .. (Score:1)
Re:And to think .. (Score:2)
Re:And to think .. (Score:2)
Building clusters with linux is easy. (Score:4, Interesting)
Re:Building clusters with linux is easy. (Score:4, Interesting)
Computers were initially monolithic machines that effectively had a single core. By the 70's, the processing on many mainframes had branched out so that a single mainframe was often a number of seperate systems integrated into a whole (though nothing on the level we see today). By the 80's it seemed to swing back to monolithic designs (standalone pc's, ubercomputer Crays) and it wasn't until the 90's that dual and quad processing became commonplace (though the technology had existed before).
Eventually, someone will hit on a revolutionary new technology (sort of like how transistors, IC's, and microprocessors were revoloutionary) that renders current LVSI systems obsolete (optical? quantum?), and the cost/power ratio will shift dramatically, making it more economical to go back to singular (and more expensive) powerful cores rather than cheap (but weaker) distributed cores.
Re:Building clusters with linux is easy. (Score:2, Insightful)
Re:Building clusters with linux is easy. (Score:2, Informative)
When VLSI hit the market, it became cheaper to have one ultrapowerful machine, compared to having a cluster of older IC-based hardware. You got more firepower for the money. That's not to say it wouldn't still pay to combine multiple Nth Generation machines, but a great deal of the cost advantage would be lost.
Clusters exist in their current diversity because it is simply the cheapest and most effective way to create powerful supercomputers. If you have
very long article... (Score:4, Interesting)
Really... (Score:5, Insightful)
While some simulations parallelize very well to cluster environments, there are still plenty tasks that don't split up like that.
The reason clusters make up a lot of the Top 500 list is that they are relatively cheap and you can make them faster by adding more nodes - whereas traditional supercomputers need to be deisgned from the ground up.
mail address (Score:2)
Re:mail address (Score:2)
They are running a secret project about the use of supercomputers to analyze spam.
Not fair, Linux! (Score:2, Funny)
Paraview (Score:2)
Once you have your visualization cluster, decided on the CPU, the interconnect, the OS, etc., you might ask what kind of application [paraview.org] you can run on it.
Single image shared vs distributed memory in Linux (Score:4, Insightful)
Damn.... (Score:2)
Why does
There should be a highly visible disclaimer on everyone of his posts: "This link goes to an external site that is NOT the article's original site, and this external site is unendorsed by Slashdot. This external site profits from traffic generated by clicking on this link."
Someone needs to write a Firefox extension that filters any mention of his "ov
Yeah, Roland the Plogger again. (Score:2)
Re:Damn.... (Score:2)
No firefox filter needed. Just add these lines to your HOSTS file
127.0.0.1 www.blogads.com
127.0.0.1 blogads.com
127.0.0.1 images.blogads.com
Then Roland will stop getting his revenues. Ph33r d4 5145hd07 3ff3c7! MUAHAHAHAHAHA!!!
Re:Damn.... (Score:2)
I don't mind ads, especially on blogs that I support. I would want such sites to receive their ad revenue from my visits. I do mind what I believe are "sneaky" methods of getting traffic in order to get more revenue.
At the very least, Roland's posts should say "My overview of this article is located here." Saying "this overview" instead is misleading...it leads the reader to believe that the original article's authors created so
From imagining to understanding... (Score:3, Insightful)
Beautiful but Scary (Score:2)
Note: Feds, leave me alone, this was NOT a classified demonstration. Just
Graphic Card Technology (Score:3, Interesting)
Namely, it allows for graphics cards to operate better in situations exactly like this; clustered applications. As it stands, the graphics card can crunch an enormous amount of data, but is extremely poor at sending it back to the CPU & system. It's optimized for screen dumping only.
Sony's Cell is going to be absolutely crucial as a tech demo for this foresighted technology. We're heading towards a more distributed computer architecture where various specialized units pipe data between each other.
In summation,
Its my hope that eventually graphics cards will catch up and perform better bi-directionally. After that, we've got to wait another 5 years for PCI-E implementations to catch up and perform better switching (vis-a-vise multiple fully-switched x16 busses). We are moving away from the CPU for high performance computing; the cpu currently performs both control and data-processing. Graphics cards are just the first wave of the distributed architecture phenomena, Cell will be a light-year jump towards the future of computing in the intricate levels of hardware reconfigurability. there's a good powerpoint on the patents behind cell here [unc.edu].
Ultimately this will lead towards the tearing down of the computer as a monolithic device, and a rethinking of what exactly the network and os's roles are. Queue exo-kernel and DragonFly BSD debates.
Re:Graphic Card Technology (Score:2)
many of these emerging distributed technologies rely upon increased switching capabilities. ps3 has some astronomical amount of internal bandwidth**. if cpu's actually are getting significantly harder to make faster, is there any correlation to the difficulty in making cheaper faster switching? i'm a computer engineer, i know a reasonable amount about the difficulties in scaling cpu performance. but from a fabrication standpoint, i'm really not familiar with the challenges of enhanced
Scientific community is so stupid (Score:3, Funny)
I've done some of this myself with POV-Ray (Score:2)
where are the Video Cards which should go in? (Score:2)
On 07 October 2004:
"Nvidia 6800 Ultra as rare as hens' Doc Marten boots"
http://www.theinquirer.net/?article=18932 [theinquirer.net]
and on 05 December 2004 : still no 6800 Ultra available!! :
"6800 Ultra hardly available in EU $740 for the card that you can't buy"
http://www.theinquirer.net/?article=20055 [theinquirer.net]
Robert
Re:Uh huh ... (Score:3, Insightful)
Re:Uh huh ... (Score:2)
I *love* FreeBSD and I'd choose it over Linux any time.
But obviously, Linux *is not* garbage.
Since you're proving to be such a dickhead, would you please stop using any of the marvelous BSD systems [slashdot.org]? Please?.. Thanks.
--
Being able to read *other people's* source code is a nice thing, not a 'fundamental freedom'.
Re:Nice pictures, but... (Score:2)
That was my reaction, exactly. You only need to render what you can perceive at any given time. It seems that there are many within the "supercomputing" community that are more intellectually invested in justifying their cool (and expensive) toys than in finding efficient solutions to problems...