Sun Releases Starcat 305
SilentChris writes: "Sun has released the Starcat server, a beast with up to 106 processors running Unix. Anyone have an extra couple [million] bucks lying around?" They're not cheap.
Disclaimer: "These opinions are my own, though for a small fee they be yours too." -- Dave Haynie
Yeah (Score:3, Funny)
Re:Yeah (Score:2, Interesting)
http://www.sun.com/smi/Press/sunflash/2001-09/sun
-Steve
just wait (Score:5, Funny)
Re:just wait (Score:2)
And I believe your code should have been:
ass.remove(ass.getStick());
But then again, I don't pick nits
For those beowolf comments (Score:5, Informative)
This is a system that is very good at things like fluid dynamics and massive database operations. It is not a good idea if all you want to do is get to the top of the list for the SETI@Home project
Re:For those beowolf comments (Score:5, Funny)
Been there, done that. Tech here working with the StarFire used to run Seti@Home on idle systems. 64 400MHz UltraSparcs. Team Sun@Home rose rather quickly in the ranks those days, I hear. ;)
Re:For those beowolf comments (Score:3, Informative)
This would make a killer render system, assuming the renderer can handle that many threads.
This is why beowulf rendering is bad. Network performance for shared memory sucks.
With renders hitting the 2GB + mark for memory useage, do you really want a network passing that data arround.
What could happen with systems like this is that the render time vs. load time would get extremely lopsided. 30 minute loads and under a minute a frame. It would force a rethink of how the render jobs get distributed and ran.Best case would be a few of these, for each different departments render needs. But then we are talking 20+ million for rendering. That buys a lot of intel boxes.
If I was given one, I would try to use it. But I don't think I could ever seriously suggest buying one. But that is me and my particular application.
Re:For those beowolf comments (Score:2, Insightful)
I'd be worried about liquid hydrogen being a conductor. I'd rather use liquid helium. Almost as cold, and definitely non-conductive. Of course, you may end up inducing superconductivity in some of the copper in the chip and on the board, and that might be bad.
Re:For those beowolf comments (Score:2, Informative)
Re:For those beowolf comments (Score:4, Informative)
The fastest networking technologies do not approach the speed and responsiveness of a memory bus. Yet a cluster design uses networking in place of a memory bus some of the time.
If there's not a lot of data, it doesn't matter much. If there's tons and tons of data, a cluster design is inefficient.
Re:For those beowolf comments (Score:2, Interesting)
Re:For those beowolf comments (Score:2, Informative)
I'm pulled entire memory/cpu boards out of running E4500's without a problem when cpu failure occurs.
I even have had to script dynamic online-offline of processors for various load testing purposes to determine how the system scales as CPU's are added/removed
If there is one thing about the big iron I like, it is the ability to manipulate the system on a level such as this.
Now Stratus systems are even more amazing...but as they said in Conan. "That is another story"
Where 106 probably comes from (Score:3, Informative)
Given the way other Sun boxen like the E3500 work, I expect that's the 15K has 18 boards, each of which takes 3 modules, either 2xCPU of 8GB RAM.
That means that 72 CPU / 288 GB memory is 18 boards, each with 2 2xCPU modules and one 18 GB memory module, and the box is full.
Since you always need some memory, the most CPUs you can get is 17 boards w/ 6 each and one with 4. Of course, that leaves you with 8GB of memory for your 106 CPUs.
The other end is (17 x 3 + 1 x 2) 8 GB memory modules for 424 GB on a pair of CPUs
But that's just a guess...
106? (Score:1)
Re:106? (Score:1)
Re:106? (Score:2, Insightful)
Re:106? (Score:1)
Re:106? (Score:5, Informative)
There are 18 "cpu/memory" boards that hold 4 cpu's each. This brings the system up to a total of 72 cpu's and 576GB of ram.
Now, if you want an server that just does number crunching and dont care about I/O, you can then add 'MaxCPU" modules. Each module holds two additional cpu's (no memory) and occupies the hPCI module slot (a hot swap PCI case that can hold what looks like two to four pci cards). You can use up to 17 of the hPCI module slots to hold MaxCPU modules. (there are 18 pci channels on the system, and at least one must be used for accessing the boot disk).
So there ya have it, 106 cpu's and half a terabyte of ram. I think that in most cases, folks will opt to not use the MaxCPU modules and just stick to the 72 cpu limit.
Re:106? (Score:2)
Re:106? (Score:2)
Re:106? (Score:2)
Yeah, and nobody will ever need more than 640KB of RAM. My company alone has two E10Ks (the predecessor to this beast), so I think there's a decent market for these...
Re:106? (Score:1)
Clarification please (Score:1)
Actually for that price I hope it functions as a refridgerator, as well as a dishwasher, robotic maid and would provide protection during nuclear fallout.
Re:Clarification please (Score:3, Informative)
Re:Clarification please (*real* HA) (Score:2)
Given how much you can shuck and trash without powering down the whole machine, a second box is going to be for the people who really need the 'five nines' availability. Other than an extreme power failure or WTC-style disaster, it's hard to think of a situation that would require shutting down all 6 power supplies at the same time.
That's it... (Score:2, Redundant)
And is it just me or did the old "Thundercats" show just pop into your head?
Re:That's it... (Score:3, Funny)
Just watch out. I hear Debbie has quite a few lawyers.
Re:That's it... (Score:2, Funny)
store.sun.com (Score:2, Funny)
Configuration Error
1) Error calling config servlet: sunir.webdesk.common.checker.ConfigInternalExcepti on: Couldn't get sql connection
Then I tried to reload the page and didn't even get a response....
Re:store.sun.com (Score:1)
They need one! (Score:3, Funny)
Re:They need one! (Score:3, Funny)
but slashdot took it out.
*yawn* (Score:2, Informative)
Re:*yawn* (Score:1)
And will they be around in two years????
Re:*yawn* (Score:2)
Re:*yawn* (Score:2, Troll)
SGI's future prospects are also dim, it's not a safe long-term purchase.
Power! Must have power! (Score:1)
Not a good sign (Score:1)
No such luck. I wonder if we
I need one of these for home (Score:2)
I'm all for new and spiffy hardware, but some things are just too much. I woner what the margin on one of these things is... I'm having trouble seeing how they expect to sell enough of these to cover the R&D costs... Maybe they'll sell them to the NSA so that agency can be brought into the 21st century...
--CTH
Re:I need one of these for home (Score:2)
Re:I need one of these for home (Score:2)
Before I started working for Sun I drooled over the E10k, but thought I'd never see more than one in a computer room. How wrong I've been. I've since walked into datacenters and seen rows of 10k's all humming along. And I'd imagine I'll see the same thing with the 15k's in a few years.
All I want for Christmas... (Score:4, Funny)
I've been a real good gEek this year. I wrote several white-hat worms to fix IIS holes. I defended IP rights in the Linux kernel. I also mirrored the LOTR trailer.
Could I please get just one little old Starcat Server from Sun? Please make sure it is the 106 processor version with 576 GB of RAM.
I will be real good and use my idle time for SETI.
Your pal,
digital_freedom
P.S. Chocolate chip cookies are your favorite right?
Re:All I want for Christmas... (Score:2)
/. the dot in dot com?? (Score:1)
Perhaps they should use one of those bad boys for the webserver?
I think their database is not running on a Starcat (Score:1)
1) Error calling config servlet:
sunir.webdesk.common.checker.ConfigInternalExce
Couldn't get sql connection
106 CPUs? (Score:1)
According to the specifications [sun.com], it has support for up to 18 quad-processor boards. That's a total of 72 processors, so where are they getting the 106 number from?
Wow, up to 72 *hot swappable* PCI devices!
Cool (Score:3, Insightful)
I want 2 of them!!! (Score:1)
That's, of course, if Tribes 2 ever gets ported to S'low'aris. ;)
Re:I want 2 of them!!! (Score:2)
You can forget about Tribes 2 being ported anywhere...didn't Sierra close the doors? If Tribes 2 were ported, it's far more likely the x86 arch would be chosen as a target instead of the SPARC arch, as game-playing weenies don't care for anything without a BIOS.
The marketrons are going to _freak out_. (Score:5, Funny)
Re:The marketrons are going to _freak out_. (Score:2, Funny)
I think I have room on my MasterCard...
partitions (Score:4, Interesting)
As someone who does nothing with these types of systems, nor follows them, I think it's great that you can have different processor speeds using "partitions."
I wonder if memory is treated the same way... i.e., separated by "partitions," or if you also have a choice to use it as one, large unified memory resource... or, I wonder if memory can be dynamically partitioned... hmm.
Actually, now that I'm thinking about it... are all of the processor partitions considered peers? I mean, are the partitions all treated as if they were a single processor... then treated equally?
Re:partitions (Score:5, Informative)
Now, if the starcat treats domains (partitions) the same as the E10k (I haven't been to training yet on it), then each domain at minimum will consist of 4 processors and 32 gigs of ram, ie 1 processor board. Basicaly these doamins are treated as seperate boxes as far as Solaris is concerned. You configure a domain to say contain 2 system boards, and then when you load Solaris, it then sees 8 processors and 64 gigs of memory. This way you can allocate resources as the need fits. But this means it doesn't look like the virtual processor that mainframes present.
The starcat may deal with processors above 72 in a different way, but I honestly don't know at this time how it deals with them.
Hope this helps answer your question.
Re:partitions (Score:2)
Re:partitions (Score:2, Informative)
The Register has an article about the launch of the StarCat [theregister.co.uk] including a quote where McNealy said, with tears in his eyes, "God I hate my job."
Apparently, McNealy had a hard time speaking during the event, which was held in New York City, due to the death of a long-time Sun employee in the terrorist attacks on the WTC.
Re:partitions (Score:2, Informative)
I'll bet their hoping (Score:3, Interesting)
Re: They're not cheap. (Score:2, Redundant)
Since when has new (let alone the latest and greatest) Sun hardware been "cheap?" Sure, there might be some good values but it's never cheap.
And then there's the product itself. It's huge, it's fast, it's not intended for the home user, or even a medium-sized business. Let's just look at the specs, pulled from Sun's info page:
Up to 18 fifth-generation Dynamic System Domains, which are fully configurable while applications are running.
Are you trying to be funny by saying it's not cheap? Was that a +5 Insightful comment that we all would have missed out on had you not enlightened us? Are you implying that there are other manufacturers who do sell these kinds of systems for cheap?
In your defense, you do have an astonishing command of the perfectly obvious.
Starcat and Oracle ID card (Score:1)
I can't build one... (Score:2)
1) Error calling config servlet: sunir.webdesk.common.checker.ConfigInternalExcept
Hmmm... looks like their DB server is down.
Re:I can't build one... (Score:3, Insightful)
We may want to forgive Sun for being a bit slow in getting their DB server back up and pretty. A huge chuck of their support staff is helping bale out clients whose data centers got blown up out east. On the other hand, it's your brand new product, you gotta make sure it's available to be bought up. But then again, who in their right mind would just go online and buy one of these? I'd bet ALL of these are sold through meetings between sales reps and IT purchasers.
Wonder what this would've done for Final Fantasy? (Score:2)
Just thinking "could they've just slapped a few of these suckers into place instead?"
Brought TCO WAAAY down (Score:2)
Intel based hardware does NOT have great MTBF (Mean Time Between Faiulre) unfortunately but inevitably as Intel boxes are commodities built for least possible expense. As a result large Intel farms mean near constant maintainence. Soem machine is always on the verge of failure.
In contrast so much horse power in a highly reliable box means both fewer machiens to fai lAND much lower MTBF per machine.
The result is much lower Total Cost of Ownership.
Re:Brought TCO WAAAY down (Score:3, Funny)
Let's see...
Sparc 1 box, CPU pretty much part of the MB, unless you get one of those fancy Weitek things. CPU fails, tech replaces MB, no pins to bend.
Sparc 2, Sparc IPX, etc, same story.
Sparc 20, suddenly we have CPUs with pins on them. Coulda happened. Of course, that hasn't been current tech for several years.
UltraSparc line comes out...pressure fittings for the Enterprise servers, no pins to bend. Deskside UltraSparc (like E250, E450), no pins to bend, the CPU is on a card just like Intel does these days.
Ultrasparc III line comes out, big servers don't even use pressure fittings--if you lose a CPU, you get a new system board. Deskside US III (SunBlade 1000) uses a card similar to older deskside units, and has rails to line it up and a torque tool to seat it. Don't see too many bent pins there.
So, apparently you got burned once a very long time ago with a Sparc 20. Don't you think it's time to get over it?
Re:Wonder what this would've done for Final Fantas (Score:2)
Is it just me (Score:2)
1) Error calling config servlet: sunir.webdesk.common.webconfig.WebConfigException
wow...
106 procs, so what (Score:2, Informative)
Re:106 procs, so what (Score:4, Informative)
The sun system is an smp based system, everything connects to a common backplane and each board has equal access to all of the other boards. With the sgi, the speed of accessing memory on the local board or boards in the same cabinet is much faster than hits to memory in remote cabinets.
From what I can tell, Sun is planing on producing a special system board that goes into one of those 18 slots. Thus, with 19 StarCats you can create one big system with 1836 cpu's and 9.7TB of ram. (think of a system in the middle that acts as the center of a star) it will most likely be based on a COMA architecture rather than a ccNUMA. Like the SGI, memory access will depend on the distance between the requesting cpu and the storage location. The difference is that under COMA, if a cpu requests a particular bit of memory a lot, that page is either migrated or copied to a memory bank on that cpu's memory board (so if 5 cpu's all need read only access to the same bit of memory, then they can each have their own copy in a local memory bank. write updates are what make the system a pain in the ass to manage ).
Re:106 procs, so what (Score:3, Interesting)
Re:106 procs, so what (Score:2, Funny)
Re:106 procs, so what (Score:2)
Sorry, I wasn't clear. SunFire 6800's. Loads up to six system boards with 4 processors and 1GB/proc. each, at present. (plus four I/O boards)
Where would you plug one in? (Score:2, Funny)
Can you get it bundled with 12 30A 240V 100meter extention cords? (So you could plug it directly in to every panel on the block.
Nooo! (Score:2, Funny)
"Not cheap", he says. (Score:2)
Insecure (Score:3, Funny)
I thought it said Starcraft =). (Score:2, Troll)
strcat? (Score:5, Funny)
Buy online (Score:2, Funny)
"Click here to buy online"
Who the heck buys this stuff online ???!?!?!
"Mhmm I think I'll charge this one to my Discover card"
Super-Size that please? (Score:2, Funny)
Umm... Lemme see... It's not breakfast anymore is it?
No sir....
Hmmm... Can I have one of those Sun 15k servers?
Ok... 1 Sun 15k server... That will be $1,808,110.00.... Would you like to super-size that order?
Yeah sure... Why not..
Aight... That will come to $4,140,830.00
Would you like to add on the extra disk array for only $480,400.00
No thats Ok
Ok... With tax your total comes to $4,430,688.10
Please drive around to the next window
TCO (Score:2, Funny)
With 106 UltraSPARC III Cu 900-MHz processors, more than half a terabyte of memory, and fifth-generation Dynamic System Domains, the Sun Fire 15k server helps redefine total cost of ownership in data center environments
Yeah, no kidding...
SGI can do 1024 processors :-P (Score:2, Informative)
the use of this type of system (Score:3, Interesting)
(Disclaimer, I work a lot with E10Ks, so this post is written mostly from my experience with those.)
The 15K is basically just an improvement on the E10K architecture, from what I've seen and heard from Sun's SSEs. The E10K started out life as the Cray SuperServer, and was sold to Sun for a song. It's not architecturally perfect. The E10K is set up to allow individual system boards to be part of domains (aka partitions), which can make for some great scalability in the domains. I've seen tiny little one-system-board domains, and domains with 13 fully populated system boards in them.
One of the major advantages to this platform is the fact that you can hot-swap everything except the centerplane. (Of course, I've never seen a centerplane fail.) The E10K also has Dynamic Reconfiguration, where you can remove system boards from a running domain, but unless your platform is set up in a certain, specific way, this doesn't work as well as advertised. I've personally never used it. The best thing about the E10K is the use of the System Service Processor, which handles all the administrative tasks for the entire cabinet. I've heard that the SSP is now integrated into the 15K, thus eliminating the need for a separate system to perform these tasks and monitoring.
The only thing I've ever seen this class of system used for is data warehousing. No modeling, no graphics rendering, just Oracle databases. Just because it has a large number of processors, doesn't mean they're going to be suitable for every task imaginable. (I used to have a 180MHz Indy R5000, that got 68kkeys/sec in d.net. My 166MMX got something like 350kkeys/sec.) These are workhorse processors, not sports-car style processors.
Though I wonder if Sun's gotten around to fixing that nasty ecache parity error problem with their processors... Having a domain randomly crash because the parity bit on a processor got flipped is no fun when you're dealing with a large production database. I have a feeling that problem will continue to plague them in the 15K.
Re:the use of this type of system (Score:2)
Well actually the Ex500's also have DR, and of course the newer Ex800's. DR is a great idea, works great in lab environment. Honestly it's cool to see the running kernel jump from one system board to the other. But it's a product that I have never seen in the wild.
Though I wonder if Sun's gotten around to fixing that nasty ecache parity error problem with their processors... Having a domain randomly crash because the parity bit on a processor got flipped is no fun when you're dealing with a large production database. I have a feeling that problem will continue to plague them in the 15K.
Remember the 15k uses UltraSPARC III's rather than the UltraSPARC II's. And the newer UltraSPARC II's have mirrored ecache to take care of that problem. I do believe the UltraSPARC III's also have that same set up. But when the chips are down, I'd rather have the box crash to maintain data intregrity rather than spew garbage (can we say the floating point bug in the Intel Pentium?). But I think you summed it up best by saying "These are workhorse processors, not sports-car style processors."
Re:the use of this type of system (Score:2)
Speak for yourself
Need an admin? (Score:2)
Contact hubert@feyrer.de.
Re:Scheduling airplanes? (Score:1)
Re:Scheduling airplanes? (Score:5, Interesting)
"Huh? I understand that the nation's air traffic controllers may need updated equipment in light of the existing crisis, but how hard can scheduling be? I could see a use for a massively parallel monster like this in, say, flow-through or structural analysis or something, but scheduling? "
What your missing is that this isn't a matter of airtraffic control. This is a matter of determining which planes and crews to fly to which locations at what times to maximize revenue. This is a classic, big, nasty travelling salesman problem. The bigger of a beast of a machine you get, the closer you get to an optimized solution. I.E. Most passengers willing to pay this most money with the least use of resources. It's a huge problem that needs massive computational power.
Re:Scheduling airplanes? (Score:2, Informative)
Look into operations research. An early on thing you will learn in OR is the simplex method, where in you boil down a lot of your information (cost to move part A from here to there and buying B if needed, etc...) into a matrix and then use something like the Simplex Method you can maximizie say your profit, or minimize the delays, or minimize the total number of planes in the air, etc..
Now, does the world's flight system have a lot of inputs - you bet, planes, fuel, flight crew, union regulations, holidays, tons of things that you and I could probably never think of...
Now... I just wanted to post that there are ways of solving these problems without a graph - yes a graph is a lovely way for highly dynamic systems, but if you want to answer the question "What is the best schedule given these 1,000 limitations?" then look into operations research. Yes, I mentioned the simplex method, and I fully exepct following posts arguing Parametric Linear Programming or maybe some Markov Chains and their impact, but the core is still the same - if you have a set of limitations and are looking to maximumize (or minimize) something, then operations research is a fun thing!
Re:Scheduling airplanes? (Score:2)
Sort of accidently read that as "this machine is optimized to run IE". :) Don't mod me down, I submitted the story. :)
Slow but reliable (Score:2, Informative)
most AFTN Mesassages (fliight plan, met data etc) fly between airports as 2400baud. some of the faster ones use 28,8 modems, or X25
the computers are often pentium class machines (running slack 4.something) or stratus servers (hpux ones)
the point is speed is not nessasary, ensureing the message gets there reliably before the aircraft does is nessasary.
Scheduling can be NP (Score:2)
Simply put, with 10 airports and 10 airplanes, our first step has 100 possible _initial_ (opening) moves, and they permutations mushroom really quickly after that. He ended up using a small Beowulf that always had the current real world state, and this way he could shuffle scheduling on a daily basis, adapting to changes.
In comparision, our local bus network is re-scheduling, it's going to take one guru a couple of months to "zen" a new schedule.
Xix.
Re:Scheduling airplanes? (Score:2)
My current client has initals that might contain the letters L, A, and U, and they use an RS/6000SP for exactly this type of work. Nice machine. This Sun box is a direct competitor to the SP, but very different architecturally. Large scale SMP vs MPP.
Simplex Method (Score:2)
Example, to fly this plane means R fuel cost, S crew cost, T maintenance, U airport costs, V etc... Now put in this information for all possible flights, and all the passenger demand info (you'd want regular flight days, as well as holidays, etc..) and put all of this into a gigantic equations where you are trying to maximize or minimize something - maximize profits, or maybe minimize certain costs. Then you can relate this to a rather large matrix. Now so far we haven't asked for any info you wouldn't need for your graph. Then you apply one of the methods from operations research to your data set and you get either a number, or maybe a bound range of an answer - without the contruction of the graph, or all of the graph traversals...
Yes, if you want to do something on-the-fly, like have a computer fly the planes for you, and update as weather changes then you'd probably want to think about a graph. But to generate the day-to-day schedules for your planes, look into OR.
Here's the textbook I reccoment to learn Operations Research: here [mhhe.com]
Re:Come on people! (Score:1)
Or why do you think ordering of these babies is already Constrained: Demand for this product is so strong that it exceeds availability and is subject to shipment delay. If you proceed with your order, Sun will send you an e-mail with the scheduled shipment date as soon as possible. ??
Re:Not cheap? (Score:1)
The site isn't doing so well. [joesacher.com]
Re:That's great, but... (Score:2)
Re:That's great, but... (Score:2)
Re:Why do you call it StarCat? (Score:2)
Pretty simple, StarCat was it's codename in development. Just like Serengti was for the E3800, E48x0 and E6800.
Re:SGI Origin 3000 (Score:3, Informative)
Re:Up to 848 megs of cache. (Score:2)
I actually ran into this as a problem about 10 years ago. In 1992, we bought a machine from IBM that we nicknamed 'brutus'. The idea behind brutus was pretty simple: we bought the machine, and with just about every dollar we had left for the project we bought memory.
Brutus ended up with 380 Meg of ram (this was back in 1992, when most people were really happy to have 8 meg of ram and a 200 meg hard disk). It also had a single 800meg hard disk. This is where the trouble began. After loading AIX and all of the upgrades, etc, we had about 100 meg left for swap space. Unfortunately, AIX needs swap space for backing store before it will allocate memory, so we had a mondo-expensive box with about 200meg of ram that we couldn't get to.
We cobbled together enough money for a second 800meg drive, and while we were waiting (months) for it to be delivered, our IBM rep assured us that the extra memory wasn't completely wasted.
"The system will use it as a disk cache."
I pulled the extra memory boards and swapped them into a bunch of other (smaller) boxes while we waited for the extra backing store.
You may wonder what we wanted 380 meg of RAM for back then... It was a graphics lab, and some of the grad students were doing research on volume visualization. 380 meg allowed you to play with a 512x512x512 cube in ram (8 bits per voxel, 2 copies).