Intel Unveils Next Gen Itanium Processor 169
MojoKid writes "This week, at ISSCC Intel unveiled its next-generation Itanium processor, codenamed Poulson. This new design is easily the most significant update to Itanium Intel has ever built and could upset the current balance of power at the highest-end of the server / mainframe market. It may also be the Itanium that fully redeems the brand name and sheds the last vestiges of negativity that have dogged the chip since it launched ten years ago. Poulson incorporates a number of advances in its record-breaking 3.1 Billion transistors. It's socket-compatible with the older Tukwila processors and offers up to eight cores and 54MB of on-die memory."
His name was Robert Paulson (Score:4, Funny)
Re: (Score:3)
This the itanium team; but could they have chosen something a little less, er, pessimistic?
Re: (Score:2)
> This the itanium team; but could they have chosen something a little less, er, pessimistic?
Well, this IS Project Mayhem...! :-)
Re: (Score:2)
I think IBM's POWER team and AMD better check their basement for strange looking white trucks...
Re: (Score:2)
Itanium flashbacks (Score:5, Insightful)
Does anyone else cringe when they here Itanium? The early chips still give me nightmares.
Re: (Score:2)
Re:Itanium flashbacks (Score:5, Interesting)
I work with the world's foremost experts on optimizing for Itanium 2. All available compilers suck. If you are willing to invest the effort to hand tweek, you can squeeze amazing performance out of the processors. They are extremely memory bound (hence 54MB cache now on chip). It is usually faster to recalculate numerical values than to fetch stored results.
We work with large high performance computing systems/clusters. IBM Power 7 is fastest hands down for numerical work if you plan to use the crap output from the compiler directly. Recent Intel Xeon is as fast as Power 7 if you adjust all the fiddly settings and use some trial and error, but Xeon doesn't scale well for Symmetric Multi-Processing (SMP). Itanium 2 wins by a bit if you invest huge effort. Power 7 would probably be fastest overall for numerical work if we invested the same effort into optimizing that we do for Itanium. However, we don't have to invest the effort for Power 7 to be "fast enough".
Re:Itanium flashbacks (Score:4, Interesting)
AFAIK the problem with the Itanic was the Itanic was better at "embarrassingly parallel" problems. But that meant you could usually get the same (or better) performance with two or more x86 servers at a lower cost... And the x86 processors would do better than the Itanic on code that's not been optimized by super experts.
Re: (Score:2)
Re: (Score:2)
Whereas with the x86, old code could in many cases still run faster due to the processor doing the parallelization internally.
Re:Itanium flashbacks (Score:5, Funny)
| I work with the world's foremost experts on optimizing for Itanium 2....
So when your whole team orders lunch, do you get a medium pizza or a large?
Re:Itanium flashbacks (Score:5, Funny)
Re: (Score:2)
Personal pan pizza, I'm betting.
Re: (Score:3)
They order 8 personal pizzas, all to be put in the same box. Only one person at a time is allowed to remove a slice.
Re: (Score:2)
Memory bandwidth seems to be the next big bottle neck. I wonder what is the "ideal" memory to CPU ratio.
I wonder what it would be like to have a system with no real ram just cache. Imagine CPUs with 4 GB of cache in a system where all the memory above 32bits was the cache of another CPU. You could access the memory of the other CPU as the speed of RAM today. It would be a really massive MP system to be sure. Of course then you would still want some RAM even if it just for DMA IO and Video.
Yea I am sure I a
Re: (Score:3)
The cache has to be backed by *something*. Either that, or you have to have some protocol wherein when you kick the last copy of something out of one cache, you arrange for it to get stored in another, in which case it isn't really a cache at that level so much as a set of dynamically assigned addresses system-wide.
Consider a smaller system first as an example. Suppose you had two CPUs, each only one level of cache, and each with 1MB. That's 2MB total system memory. Now suppose the first CPU reads throu
Re: (Score:2)
I was thinking of Cache as being more like on board fast ram that ran at CPU speed than as cache as we see it today.
To take your database example the way I imagine it working is a CPU would send commands to all the CPUs to find the records that contain x. Each CPU would search it's own memory of records and then just transmit the records to the requesting CPU. It would take a differn't programing model that what we use today. In a way I was thinking of it as smart ram. It seems dumb that a CPU has to read a
Re: (Score:3)
Re: (Score:2)
I do understand that is why I said I wonder what the ideal trade off between memory and CPU will be. Right now such a system would be useless because caches are only a few MBs. What happens when we can put 4 GB on the die? There are very few problems that can not fit in 4GB. I do understand the register, L1, L2, L3, main memory, mass storage structure. But we are already having memory access issues and moving to more and more cores. There is just a really nice symmetry to each CPU having 32 bits of CPU spee
Re: (Score:2)
Re: (Score:2)
...If you want to be able to write fast software, I suggest you read Ulrich Drepper's What Every Programmer Should Know About Memory [akkadia.org]. It's not that long, and very informative.
It's 114 pages of not that long, but who's counting?
Re: (Score:2)
Re: (Score:2)
I'm pretty sure he meant that what is now cache on the CPU would be configured as RAM but still as fast (or faster) than the current cache.
Re: (Score:2)
He used the phrase "No real RAM, just cache." I work with embedded processors that split their memories between cache and RAM (many of the C6000-family DSP cores), and also work with processors that are all cache (most general purpose processors). The phrase "no real RAM, just cache" implies that there wouldn't be any directly mapped memory, only indirectly mapped memory (ie. "cache" by his definition).
How else would you interpret "no real RAM, just cache"?
Re: (Score:2)
The first-order reasons is distance. The L1 is closest to where the data is needed, the L2 farther away, and the L3 still farther. Its not possible to simply make the L1 larger without also increasing the largest distance and thus its latency.
So the L1 is kept small on purpose, to reduce its latency (to 2 or 3 clock cycles these days)
Trust me. The CPU manufactur
Re: (Score:2)
Re: (Score:3)
So it runs like a bat out of hell when you massage it correctly. Good for you. Know what we call a processor that nobody can write a decent software stack for? A shitty processor.
Re: (Score:3)
I work with the world's foremost experts on optimizing for Itanium 2. All available compilers suck.
I sometimes work on compilers for HPC, and this is caused by two, related, things. The first one is that no one cares. Itanium is such a small market that, even if you can get both Itanium users to buy your compiler, it's not worth the investment.
IBM Power 7 is fastest hands down for numerical work if you plan to use the crap output from the compiler directly.
POWER 7 is a pretty generic RISC design with a few CISCy tweaks. We've got 40 years and millions of dollars of research to look at when designing compilers for it. For Itanium? Not so much. It doesn't help that Itanium is so unlike everything else that it's h
LLVM and Itanium? (Score:2)
Re: (Score:2)
Re: (Score:2)
I sometimes work on compilers for HPC, and this is caused by two, related, things. The first one is that no one cares. Itanium is such a small market that, even if you can get both Itanium users to buy your compiler, it's not worth the investment.
I heard a story from a guy at Redhat that the team that maintains the Itanium port was putting together a pool to buy the last remaining Itaniums from the customers (for more than it would cost to replace them) and then throwing the Itaniums off the roof.
Announcing 5 trillion digits of Pi! (Score:2)
APL rocks when your floating point addition latency exceeds your main memory fetch latency and your programmers don't mind that the trig operators are selected by manifest constants on the left side of the circle operator.
Itanium is trying to fit the niche where SIMD is not applicable, yet arithmetic instruction density is high relative to memory transactions.
I spent too much time last night reading about big constants. y-cruncher is sick. It's also not open source, and the core algorithm (Hybrid NTT mult
Re: (Score:2)
Ultimate Computer of Failure (Score:4, Funny)
ITANIC processor ... maybe you could have the OS installed on an external drive connected via USB1.0.
RAMBUS memory
Voodoo5 video card
i can't think of a hard drive crappy enough
obviously the OS would be WindowsME.
Re: (Score:2)
Any SATA drive with the Sandy Bridge SATA-2 controller?
Re:Ultimate Computer of Failure (Score:5, Insightful)
IBM/Hitachi Deskstar AKA: Deathstar
Re: (Score:2)
Funny thing is, I had a 60GB DeskStar, and it ran non-stop for eight years powering my router/DNS/proxy box. Did I just luck out, or was it a particular revision that had the issues?
Re: (Score:2)
Oddly enough, I had an 80G that still ran up until about 6 months ago, but there were quite a few that failed. I remember that line though. It was around the time I worked in a repair shop and we sent a great many in for replacement.
Re: (Score:2)
Better option: Anything from Conner Peripherals.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
IOmega "Click" drive for removable storage/backups. It was 40MB in a CF-card sized spinning disk.
--Joe
Re: (Score:2)
vista would make it even slower then winME
and off course make those shitty IBM deathstar disks (i raid 0 as suggested) squeel in pain with all the indexing
Re: (Score:2)
Quantum Bigfoot: the name itself defies parody. It was that bad.
Re: (Score:2)
it's ititanic 2 (Score:2)
it's ititanic 2
Re: (Score:2)
I was a contractor when they were working on this "next gen" 64bit CPU everybody was excited, then later when I read about it, I couldn't understand where this new architecture would fit. Then X86/64 came out and there really seemed to be no place for it.
IMHO, don't throw good money after bad.
Re: (Score:2)
AIUI while itanium was a failure on the desktop and windows/linux server intel was successful in pulling a number of vendors (HP being the best known in the west) into using it for their unix and/or mainframe systems and sales for that purpose provide Intel with enough revenue to justify keeping it alive.
Re: (Score:2)
I knew a guy who had one once. He used to use it for drying his clothes.
The marketing hype made me cringe. It was pretty obvious that the whole thing would be a disaster from the start.
itanic relied on "good compilers" to get any performance. However, compilers that "good" will never be made, since you can't write a compiler that can predict the future.
However, you can put hardware in your CPU to reorder instructions at run time based on observations of the behaviour of the running code, speculatively ex
Just one thing... (Score:5, Funny)
Re:Just one thing... (Score:5, Funny)
This one melts right through them.
Re: (Score:2)
This one melts right through them.
And itself.
Re: (Score:2)
So it melts right after hitting the iceberg? Impressive.
Re: (Score:2)
Itanic 3: now sinking twice as fast!
Pity Pentiums can't be socket compatible (Score:2)
Instead of everytime a new one comes along a new motherboard is required. Rather kicks any CPU upgrading possibilities into the long grass.
Re: (Score:2)
AMD's processors still possess an upgrade path. I have no idea what you're talking about.
AM2->AM2+->AM3
I personally went from Windsor->Agena->Deneb without switching motherboard. Some time later, I upgraded my motherboard and kept my Deneb. I'm now thinking of upgrading my Deneb to a Thuban.
Re: (Score:2)
Given the nature of modern CPU bus specifications, your proposal would be so mechanically, electrically and logically complex that it would undoubtedly be cheaper just to buy a whole new computer system.
What MoJoKid, own Intel Stock??? (Score:2)
Re: (Score:2)
Re: (Score:2)
Additionally, if Poulson came out today, it would probably be the fastest processor in the world (4-6x the raw performance of the Itanium 9300 should put it slightly ahead of Power7).
Ok. So you are telling me that today, Power7 is almost 4-6x the raw performance of Itanium 9300? But if we wait until the end of 2012 or early 2013 when Poulson ships, it will be slightly ahead of today's Power7?
Are you trying to help or hurt Itanium with this info?
Re: (Score:2)
Re: (Score:2)
Here's a recent refresher: Tukwilla specs released early 2008 which had an initial ship estimate of Q42008, actual ship Q12010
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Marketing at it finest (Score:2)
It may also be the Itanium that fully redeems the brand name and sheds the last vestiges of negativity that have dogged the chip since it launched ten years ago. Poulson incorporates a number of advances in its record-breaking 3.1 Billion transistors. It's socket-compatible with the older Tukwila processors and offers up to eight cores and 54MB of on-die memory.
That is so ridiculous that it is not funny.
Biggest complain about Itanic was always absence of cheap versions, something companies can put on engineer' desks.
Seeing what people do around AMD64 architecture, I doubt Itanic would ever become mainstream - it would remain forever a pet platform of HP's service unit. Similar to IBM's POWER: something sufficiently incompatible so that customers can't migrate overnight to competitor's platform.
Re: (Score:2)
It may also be the Itanium that fully redeems the brand name and sheds the last vestiges of negativity that have dogged the chip since it launched ten years ago. Poulson incorporates a number of advances in its record-breaking 3.1 Billion transistors. It's socket-compatible with the older Tukwila processors and offers up to eight cores and 54MB of on-die memory.
That is so ridiculous that it is not funny.
Biggest complain about Itanic was always absence of cheap versions, something companies can put on engineer' desks.
Seeing what people do around AMD64 architecture, I doubt Itanic would ever become mainstream - it would remain forever a pet platform of HP's service unit. Similar to IBM's POWER: something sufficiently incompatible so that customers can't migrate overnight to competitor's platform.
Yes, HPUX is the only major OS for the platform in the West - but (so I've heard from Intel sales and engineering folks) Japan (specifically Fujitsu) buys a lot of these things. So do some major companies in the US - but they also write their own applications/OSes for the platform (ie, they're NOT running HPUX on it).
Re: (Score:3)
A timeless story (Score:2, Funny)
"I turn my body from the sun. What ho, Tashtego! let me hear thy hammer. Oh! ye three unsurrendered spires of mine; thou uncracked keel; and only god-bullied hull; thou firm deck, and haughty helm, and Pole-pointed prow,- death- glorious chip fab! must ye then perish, and
Re: (Score:2)
Oh, but MIPS are still being used commonly. http://www.mips.com/ [mips.com]
Re: (Score:2)
Re: (Score:2)
The company I work for makes 64-bit MIPS processors with up to 16 cores (soon 32 cores). They're optimized for I/O, networking, storage and security and lack floating point though. The nice thing with MIPS is that it's easy to extend the instruction set without breaking anything. We've added a number of instructions and have extended gcc/binutils to take advantage of it. And yes, they all run Linux.
Being a traditional RISC type platform makes it not too difficult for the compiler to optimize code for.
Re: (Score:2)
We've added a number of instructions and have extended gcc/binutils to take advantage of it.
And this is the bit that makes so many people hate MIPS. Every vendor has their own GCC / binutils fork, often based on an old version, with patches that can't be pushed upstream without breaking other architectures.
Re: (Score:2)
Actually ours is in the mainline gcc, so there's no special fork unless you want the bleeding edge support, which always takes a little while to get checked in to the main branch.
Re: (Score:2)
The MIPS architecture is alive and well from the looks of things, and not just in the realm of HPC, and it is probably the most popular RISC CPU architecture in history. I carry such a processor around with me every day in the form of a CXD2962GG CPU in my Sony PlayStation Portable, which is essentially a clone of the MIPS R4000. The PSX used a MIPS R3000-family CPU that was essentially the same as what you would find inside an old SGI Indigo. Apparently the PS2's Emotion Engine was actually a MIPS III/IV a
Re: (Score:2)
I'd buy that shit. The only thing that would lure me away from it would be a 2012 Pinto!
Re: (Score:2)
I loved my Pinto. I *miss* my Pinto.
Re: (Score:2)
I was a little surpirsed to learn it was still being produced. I thought it had quietly been discontinued. I don't personally know anyone who operates an Itanium based server.
Re: (Score:2)
Re: (Score:2)
I'd doubt there are more people on Itanium VMS than on VAX. LOL
Re: (Score:2)
Wow ... I'm shocked that VMS still exists. I haven't heard anybody mention that in a long time.
Like what? It's been so long since I've seen it, I've long since forgotten almost everything about it. I'm surprised there's cool OS features that everyone else hasn't stolen -- that is, of course, assuming they're actually useful for anybody.
Re: (Score:2)
Automatic versioning of files, for one thing. That was kinda neat - you could tell the filesystem to keep 'n' versions and could access them explicitly like hello.c;3. The OS would return the last revision if you didn't specify the version.
Re: (Score:2)
Ah, yes. I had forgotten that one ... that came in handy when I was in university.
Re: (Score:2)
Real ACLs? Real clusters, even over TCP/IP WANs? RMS? Not crashing? A filesystem that doesn't corrupt your data, even if you do crash?
'V'+1, 'M'+1, 'S'+1 ? (Score:2)
Were there features left out?
Re:Whytanium? (Score:5, Informative)
Re: (Score:2)
It is a failure simply because the cost of engineering a new processor architecture is not really recouped by the current sales. Let's take a look at the math:
1. It costs hundreds of millions of dollars to design a new CPU
Re: (Score:2)
I think your yield estimates are probably off by at least 2x, but that's not really important. Assuming your numbers are correct, a $1 Billion dollar money-maker is nothing to walk away from, even if it's "only" 4% to 8% of Intel's revenue.
I think continuing to develop Itanium would be worth it to Intel, even if they were to take a loss on it, year after year. A lot of the R&D effort that goes into making those monster chips, for example, will be directly applicable to future mass-market processors. The
Re: (Score:2)
Yah, and probably over half of the actual applications are running in PA and Alpha translations mode on HP-UX or VMS. The IA market share would be close to 0 if HP wasn't forcing everyone stuck on HP-UX or VMS to use them. Trust me, no one but HPC people (who aren't doing $/flop or watt/flop) are looking at them and saying "I really like what I see, so i'm going to get one and run my application on it".
I'm pretty darn familiar with HP-UX on IA, and I would take a good PA machine over anything with IA, any d
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Oddly, I believe Ubuntu will do you full commercial support for UltraSPARC.
Slim pickings, admittedly, but still.
Re: (Score:2)
Eerie, isn't it?
Re: (Score:2)
AFAIK, Sandy Bridge includes a GPU on die. So, of course it's going to need more interconnects.
The market for enthusiast boards is pretty small for intel, especially when you consider the sheer volume of low end crap fests OEMs plop out a year. I don't think they're trying to screw their OEM partners as the cost per-unit is probably pretty small.
I'm just curious how this is any better than integrating the GPU into the chipset.
Re:Isn't it strange... (Score:4, Insightful)
That's because the high-end server world accepts level of single-core performance the consumer world doesn't. These processors are not something you want on your PC. You want something with better memory management, way faster I/O with ram and GPU, etc. OTOH, you usually don't care about multi-processor.
But faster I/O usually means putting more things on the die (hence amd's integrated memory controllers, now followed by Intel) and having larger busses/more efficient protocols, and acting on that means changing the socket. And the north bridge, if one is left. And the memory, for a faster one. You wouldn't get enough speedup from changing the cpu alone with everything else pin-compatible to make it worth it.
Meanwhile, the itanic spends its time waiting for the ram to answer... but since you put a lot of them in the box, in aggregate they can be useful.
OG.
Re: (Score:2)
If that's the case then you're buying desktop CPUs from the wrong vendor.
Re: (Score:2)
Well there are really two choices for desktop CPUs, AMD and Intel.
AMDs next gen of CPUs (bulldozer) will apparently not work in existing boards. Sandy bridge is pretty much ruled out (unless you are a masochist) by the recall. So whichever side of the fence you go for right now you will most likely be buying into a dying socket.