Intel's Big Chip

Catch up on stories from the past week (and beyond) at the Slashdot story archive

Intel's Big Chip 282

Posted by CmdrTaco on Monday February 04, 2002 @03:45PM from the not-so-micro-after-all dept.

DeadBugs writes "News.com has an article about the size of the upcoming revision for the Itanium. The "McKinley" chip will be 464 square millimeters which would make it one of the largest ever produced. Most of this is due to the 64 bit registers and 3MB of Level 3 Cache. There is also a link to an article about "Chivano" an Itanium which will include concepts from the Alpha architecture"

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 282 Comments Log In/Create an Account

Comments Filter:

Wow! (Score:2, Insightful)

by JoeLinux ( 20366 ) writes:

Intel gets it right! Morce_Cache==Good_Thing. Was anyone else scratching their head over the 8k of level 1 cache on the Pentium 4?

Joe
- It's how you use it (Score:4, Interesting)
  
  by ouija147 ( 467204 ) writes: on Monday February 04, 2002 @04:04PM (#2951839)
  
  Way back when the 386 was hot stuff there was a series of mother boards that had a 64K of cache that was outperformed by a board that had 16K of cache.
  
  How? The 16K board cache was four way set associative. This allowed for the CPU to determine in one clock cycle if the next instruction was in cache. The 64K cache design could not always do this. Thus it was often slower. Why not make the 64K cache 4 way set associative? Cost. The overhead in silcon and motherboard space made this impossible at the time.
  
  Parent Share
  twitter facebook
  - Re:It's how you use it (Score:2, Informative)
    
    by Anonymous Coward writes:
    
    That's not what set associativity means, although you're quite correct that 16Kb of 4-way sa cache will usually be better than 64Kb of direct mapped cache.
    
    If you actually want to know what you're talking about, I'd suggest reading "Computer Organization and Design : The Hardware/Software Interface" by Patterson and Hennessy.
    - Re:It's how you use it (Score:2)
      
      by haruharaharu ( 443975 ) writes:
      
      For those who don't know, n-way set associativity means that a cache can retain n values that map to the same cache line, so a 4-way cache can hold 4 lines that map to the same cache position.
    - - Re:I have the book (Score:2)
        
        by Chris Burke ( 6130 ) writes:
        
        Obviously they aren't adding it for grins. I'm sure they did architectural studies that showed a performance gain comensurate with the cost. They probably chose that value after examing a number of possibilities and finding that to be the "sweet spot".
  - Re:It's how you use it (Score:2)
    
    by Chris Burke ( 6130 ) writes:
    
    Actually, given otherwise equivalent designs, a 4-way set associative cache is going to be slower (in terms of maximum access frequency) than a direct-mapped cache. Thus the assertion that 4-way associativity allowed the cache to be accessed in one clock is false. Maybe some other engineering feat allowed them to do this, but it wasn't associativity (because that hurt, not helped).
    
    But as far as overall performance, yes I'd be willing to bet that the 4-way cache beat the larger 1-way.
- Re:Wow! (Score:2, Funny)
  
  by Sinfamous ( 522772 ) writes:
  
  /me looks over at the powermac's 1ghz dual G4's with 2MB cache each already selling. /me shrugs. nothing to see here.
- Re:Wow! (Score:2, Interesting)
  
  by pagercam2 ( 533686 ) writes:
  
  Well more cache is a good thing an unmamafacurable chip or an overly pricey chip is really bad. The yeilds on a 464 sq mm chip will be really low, thats 21.5mm on a side assuming a square die and they ussually are. 25.4mm=inch means that this is very nearly a square inch. I'm working on a 10mm x10mm chip now and we expect a ~40% yeild just due to area. SRAM is twice as likely to fail as general logic so large caches L1 or L2 just go to reducing yeild. The larger the chip failures go up exponentially. An example of this would be you wanting to make a table out of pine so you go to the lumber yard and buy some wood, you get what you get and some pieces have wholes or knots that would ruin you table so you toss those. If you make a small table you may be able to get the table by avoiding the knots, but as the table gets bigger your ability to find a good full size peice deceases to the point that every peice of wood has at least one knot so you can never make a knot free table. If you get 20 defects per wafer and you only get 50 die per wafer you will probably get 30 chips (60% yeild) if your die increase to 30 per wafer you may only get 10 (33% yeild). If the die grows to 20/wafer you may get none (0% yeild). All wafers cost pretty much the same so yeild*wafer cost pretty much sets the cost, the lower the yeild the higher the cost. HP has done some interesting work in making chips with extra resources, such that problems can be avoided, route around the defect, they are quite a ways from making this work but this is probably what the future of silicon is going to look like. The other big factor in cost is testing and the more complex a chip the longer the test, its getting to the point that testing costs are out weighting silicon costs, so a self repair chip can help both yeild and testing issues.
  - Re:Wow! (Score:2)
    
    by svirre ( 39068 ) writes:
    
    You are basically right about exponetial drop in yield as area goes up. Some of your calculations do miss the mark however as you assume uniform distribution of defects instead of the more realistic random distribution.
    
    What intel (and any sane manufacturer who drop a significant amount of memory on die) will do is to add redundancy to regular structures. The l3 cache _will_ have redundancy, l2 most likely. l1 is more of a toss-up.
    
    Redundancy is currently a standard engineering practice (Actually, most designs use IP modules bought from specialiced memory vendors who can integrate this kind of functionality for you).
- - Re:Wow! (Score:2)
    
    by SilentChris ( 452960 ) writes:
    
    Slightly? Word (the king of exageration) runs fine on my machine when I only have 32 megs of RAM free. StarOffice's word processor does reasonably well with 16. Heck, earlier versions of Office (up until around 2000) worked with 8.
    They eat up more room on the hard drive, definitely. But in terms of memory in usage they are still extremely lightweight (as they should be).
  - Re:Wow! (Score:2)
    
    by MindStalker ( 22827 ) writes:
    
    In L1 and L2 increasing memory size, also increases latency, which is exactly what you DON'T want in onboard cache. I've seen benchmarks that have shown that with good speed main memory, a cache over 2MB can slow actually slow you down. So I'm just hoping that they have some new tech memory, as they plan on putting 3MB on it. But as this is 64bit its very likly that the size point may be much larger, where you start to loose speed.
- - Re:Wow! (Score:2, Informative)
    
    by fitten ( 521191 ) writes:
    
    This is correct. The way your software uses the cache(s) is what determines the performance. For instance, a naively written app may have horrible cache behavior even on large caches. Some study of the underlying algorithms and adjustment to the cache access patterns can speed up the app manyfold.
  - - NOT not wow! (Score:3, Informative)
      
      by plover ( 150551 ) writes:
      
      You're absolutely correct in that a substantially larger die will result in substantially lower yields (excepting any magical breakthroughs in chip fabrication, which are always possible.)
      But there are segments of today's market that are willing to pay almost any price for a high-performance chip. These people will fork over a $1000 without blinking an eye if they think it will speed up their business.
      Look at any commercial server available today. They're priced around $15000 - $20000. If chip prices go to $1000 instead of the $400 they're probably paying, that makes a difference of $2400, or about 12%, in a 4 way box. Even if chip prices went to $2000, it's a $5600 difference, or a 28% difference. If your processors are your bottleneck, then you've gained a lot of improvement for not-very-much delta in money.
      Sure, a $2000 chip is out of reach for most home users today, but there is always a market for just about anything faster they can produce.
      And there are enough crazed overclockers [tomshardware.com] out there that'll spend whatever it takes to raise their frame rates on Quake III. It'll sell. It'll also drive the market to a new standard, which also sells chips.
      - Re:NOT not wow! (Score:2)
        
        by khuber ( 5664 ) writes:
        
        Look at any commercial server available today. They're priced around $15000 - $20000
        Heh. Maybe if by "server" you mean high-end Windows workstation. A midrange Sun server like a sunfire 4800 is around $350k list, say $300k. And that's just a test machine where I work - the production stuff runs on a combination of 4500s and 6500s, soon 4800s/6800s. 6500s are over a million IIRC.
        But anyway, your point is still right on, and even more valid with datacenter-class servers, disk arrays, and so on. Paying $2k for the processor is nothing.
        -Kevin
        
        Re:NOT not wow! (Score:2)
        
        by khuber ( 5664 ) writes:
        
        A Sun e450 is a nice box for a reasonable price. A dual config is less than $30k now for a nice workgroup server.
        Yes, definitely. You can even buy a low end v880 for 30k. I was teasing a bit. We still have 220s, 250s and 420s and 450s that are more reasonably priced boxes, though I personally don't like the 2xx machines. The bigger machines are used for applications where memory and/or processing requirements are really high.
        We'll probably be getting more 880s though because they are pretty expandable and hold a lot of memory. They go up to 8 processors / 32 GB.
        These Suns, while not the fastest in town (see IBM Power4), are a proven 64 bit architecture that has been around and doing real work longer than the Itanium hype. It bugs me to see so much attention given to the Intel machines. I understand the desire for good commodity machines, but these things are going to be pricey anyway. And I could give a rip about running Windows on servers.
        -Kevin
big chip... big fan (Score:3, Insightful)

by Transient0 ( 175617 ) writes: on Monday February 04, 2002 @03:48PM (#2951725) Homepage

i wonder if the oversized chip will lead to particular cooling difficulties(i.e. standard fans and heatsinks can't cool the entire surface area)...

Share
twitter facebook
- Re:big chip... big fan (Score:3, Funny)
  
  by Anonymous Coward writes:
  
  Nope. I'm already moving on this issue, with the assistance of several eager venture capitalists.
  
  We're going to start up a business modifying Sears deep freezers, providing a means of placing a PC directly into it.
  
  Although your entire computer system will be the size of a bathtub and double your electric bill, you WILL be able to use PC's based on these new CPU's.
  
  We're also going to figure out how to work rain forest defoliation into the process.
- Re:big chip... big fan (Score:4, Troll)
  
  by fobbman ( 131816 ) writes: on Monday February 04, 2002 @04:13PM (#2951900) Homepage
  
  Actually, according to this article [theinquirer.net] over at The Inquirer [theinquirer.net] Intel can now demo a 5GHz chip using the .13 micros process that can run at room temperature. That'd be interesting to see in action if it benches as well as a 5GHz chip should in comparison to current chips.
  
  Parent Share
  twitter facebook
  - Re:big chip... big fan (Score:3, Informative)
    
    by haruharaharu ( 443975 ) writes:
    
    Intel can now demo a 5GHz chip using the .13 micros process that can run at room temperature.
    
    Big deal. It has 12 instructions, is ~2mm^2, and consumes 267mW. This looks more like research than something that you would use for real work.
- - Re:big chip... big fan (Score:5, Informative)
    
    by Sebastopol ( 189276 ) writes: on Monday February 04, 2002 @04:06PM (#2951854) Homepage
    
    Wouldn't a larger surface area allow for better cooling? Isn't that the whole principle of a heatsink in the first place?
    
    If the die uniformly heats, then yes, this is true. But that's not always the case. The latest P3's are so low power that you just need a heatsink or fan-sink, depending on frequency. The first P4s had a head spreader that sat on the back of the die and connected to the fansink.
    
    Plus heat in a die goes up/down easier then left/right because the thermal conductivity of the heatsink is much better than that of silicon, and is closer than the edge of the die. If you've got local hot spots on the die, a bigger die doesn't by you anything. The thermal properties and requirements of the heatsink are driven more by local heat density than by overall heat.
    
    Tom Pabst had a good discussion about this a while ago, but I can't remember the article's URL.
    
    Parent Share
    twitter facebook
    - Re:big chip... big fan (Score:2)
      
      by SilentChris ( 452960 ) writes:
      
      "The latest P3's are so low power that you just need a heatsink or fan-sink, depending on frequency."
      That doesn't mean they necessarily run cooler. My machine of choice (a laptop) has a SpeedStep PIII which runs blazing hot at peak CPU levels. True, this is mostly while playing Unreal Tournament ;) but the chip gets hot enough that I literally can't keep the machine on my lap without some serious pain.
    - Re:big chip... big fan (Score:2, Insightful)
      
      by Sokie ( 60732 ) writes:
      
      But if you have drastic enough temperature deltas in the die itself to make this appreciable, then there is no heatsink design that is going to be precise enough to effectively combat those hot spots. Basically you have to design your heatsink to cool the hottest part of the chip to acceptable levels. Increasing the surface area of the die (without increasing the amount of total heat output) will lower (however slightly) the highest temperature spot on the die, making it (again however slightly) easier to cool.
      
      Also, there is going to be horizontal thermal dissapation regardless of the thermal resistance of the materials involved. If I have a die that generates 1W of heat in a 200 mm^2 area, even if it is not generated uniformly, and I then increase the area to 300 mm^2 with the same total heat output value, that extra area IS going to be heated, probably substantially, regardless of whether it is generating it's own heat. That means that the overall average temperature has to go down somewhere.
      
      The points you make are however perfectly valid and relevant, but the point of the original poster was supposing that it would be larger surface area of the die that would make it harder to cool, not any new thermal hotspots. I was simply saying that there aren't many ways that increasing the surface area can make something harder to cool.
      - Re:big chip... big fan (Score:2)
        
        by Sebastopol ( 189276 ) writes:
        
        even if it is not generated uniformly, and I then increase the area to 300 mm^2 with the same total heat output value, that extra area IS going to be heated, probably
        
        The logic that causes the hotspot does not necessarily scale with the size of the die. If you slap on a 3MB cache onto the same die, you have the same heat problem. Now add to that the die shrink from process and it is even worse.
        
        I was simply saying that there aren't many ways that increasing the surface area can make something harder to cool.
        
        Well, we're picking nits here, but you originally asked why the die wasn't easier to cool because it is bigger. But judging by your second reply, your first question wasn't worded as clearly.
      - Re:big chip... big fan (Score:2)
        
        by Chris Burke ( 6130 ) writes:
        
        If I have a die that generates 1W of heat in a 200 mm^2 area, even if it is not generated uniformly, and I then increase the area to 300 mm^2 with the same total heat output value, that extra area IS going to be heated, probably substantially, regardless of whether it is generating it's own heat. That means that the overall average temperature has to go down somewhere.
        
        Sure, but that still says nothing about the hotspots.
        
        When you increase the area from 200mm^2 to 300mm^2, you aren't increasing the size of the old core, you're just tacking silicon onto it. The same core is still sitting there, unchanged. It will still produce 1W over a 200mm^2 area, it's just surrounded by more silicon. The average temp is colder because the extra silicon isn't generating heat, and thus the average is lowered. But the hotspots haven't changed their local temperature at all (it may in fact be worse due to worse heat propogation in silicon vs air, but that effect is probably negligible).
        
        And it gets worse. The article didn't quote relative power figures for the two chips, but depending on the way they did it, that new block of silicon may be -hurting-, as it's producing it's own heat. Large memory arrays are one of the worst heat producers. Being smart about how you access the array may save you from catastrophe, but it is definitely going to increase the total power output.
        
        So, basically, just having more silicon doesn't help, and having more silicon that contains high-power memory arrays hurts.
  - Re:big chip... big fan (Score:2, Insightful)
    
    by m4g02 ( 541882 ) writes:
    
    Not really, that would apply if the heatsink became biger while the chip keeped the same size, its like 1 (common chip size) - 1 (heatsink size) is 0, while 2 (new chip size) - 2 (heatsink size) also is 0, i guess the porblem isnt the heat but the backwards compatibility with other accessories... what isnt really like a porblem.
Die size war? (Score:5, Funny)

by Guitarzan ( 57028 ) writes: on Monday February 04, 2002 @03:51PM (#2951746)

Is this the start of the manly "Mine is bigger than yours" battle?

Share
twitter facebook
Nice review. (Score:2, Informative)

by Psmylie ( 169236 ) writes:

Another analyst said, "Jesus, that's big."
Straight and to the point. Nice.
Intel's new marketing strategy (Score:2, Funny)

by Stripsurge ( 162174 ) writes:

Our new CPU is so big it will CRUSH the competition... No, really. We mean it quite literally :)
- Re:Intel's new marketing strategy (Score:2)
  
  by haruharaharu ( 443975 ) writes:
  
  Our new CPU is so big it will CRUSH the competition
  
  WARNING: CPU will not dispense product.
$300 to produce? (Score:2, Funny)

by diesel_jackass ( 534880 ) writes:

if Pentiums cost $50 to produce, will these cost 6x as much as a Pentium?

hmmm... sorry Intel, I'll stick to AMD till I hit the lotto, or have some other good reason to spend money like it was going out of style.
- Re:$300 to produce? (Score:2)
  
  by E-Rock ( 84950 ) writes:
  
  This chip ain't for you or me. It's for big datacenters with processing requests that boggle the mind. Kinda like the Xeon processors.
  - Re:$300 to produce? (Score:2, Funny)
    
    by diesel_jackass ( 534880 ) writes:
    
    processing requests that boggle the mind.
    
    like if you wanted to play Q3 in a VMWare session?
    
    (or play D00M 3 at all)
  - Re:$300 to produce? (Score:2)
    
    by DarkEdgeX ( 212110 ) writes:
    
    And this justifies the mark-up HOW exactly?
    
    Going from $300 to $3000 per processor at retail seems a bit extreme if you ask me. And don't mod this as a flame, I actually LIKE Intel's work, but it's a joke how much they charge for their processors compared to how much it costs to make them.
    - Re:$300 to produce? (Score:2)
      
      by kinkie ( 15482 ) writes:
      
      Well, those chips have to be designed for instance, and since the numbers are low there's lots of markup just for that. Then there's the bad yields due to the big die-size, and marketing and the other kind of expenses a company has to pay just to survive, such as office space, computers, secretaries, Microsoft software, Linux softwa^H^H^H^H^H^H^H^H^H, etc. A company which just re-sells its employees' work has often a markup of 50% just for these things. Then there's the manifacturing plants' mortage (you know, a silicon manifacturing plant costs a few billion euros), and you see how it stacks up quite fast..
      - Re:$300 to produce? (Score:2)
        
        by DarkEdgeX ( 212110 ) writes:
        
        One would assume that the $300 cited for manufacture would INCLUDE such things as R&D and Administrative expenses. Otherwise, you're not really telling people how much they cost. (And this is really besides the point-- the things you list, while accurate for a startup, don't really apply to a company like Intel with a lot of cash to burn on innovation). I'm still missing the justification of those rather large prices.
    - - Re:$300 to produce? (Score:2)
        
        by DarkEdgeX ( 212110 ) writes:
        
        Right... and this invalidates the idea that $3000 for something that takes $300 to produce is overkill how exactly? Atleast the other poster gave some reasonable ideas (R&D and so forth) but you'd kind of think those would be tallied into the final cost of production. (And would pay for themselves if the units sold in quantity at even 1.5x the production (manufacture) cost.)
L3 on die? (Score:2, Interesting)

by Soong ( 7225 ) writes:

Sounds silly to me. By the time you get out to the 3rd level of cache, on a 1GHz core, there should be enough slow down that chip to chip interconnect will be adequately fast.

Either Intel has actually put research into this and discovered that it's a good tradeoff performancewise, or they've still got marketing driven engineering and someone said "wow! over 3 MB of on chip cache!"

Any guess on the wattage? Has Intel broken 100 Watts on their upward march of hot chips?
- Of course. (Score:2)
  
  by Chris Burke ( 6130 ) writes:
  
  Eventually you'll have 128MB of DRAM on chip. Why? Because it'd be faster. Closer memory is better.
  
  Having the L3 on chip makes the same amount of sense as having the L2 on chip -- which is to say, lots. First, you can run the L3 at core clock speeds. No external bus is ever going to run as fast as pure silicon. This means that the latency is going to be much lower than for an off-chip L3. This means the average memory access time will be lower, which means better performance. Second, the bandwidth can easily be higher, since you don't have to pay with pins for extra data lines and, again, you're running at core speeds.
  
  For those programs whose working sets fit into this amount of memory, the on-chip L3 is going to blow the doors off an otherwise equal off-chip L3.
Note, this is the *DIE* size (Score:4, Informative)

by jaxdahl ( 227487 ) writes: on Monday February 04, 2002 @03:55PM (#2951777)

The Athlon chips i have are around 2-2.5 inches on a side, however, the die in the middle is quite small, i'd estimate it it be 200-250 square mm, so a 400+ square millimeter is huge, compared to that.

Anyone have any exact numbers for the chips? I didn't get a ruler out to measure it.

Share
twitter facebook
- Amd competition. more numbers. (Score:5, Informative)
  
  by leuk_he ( 194174 ) writes: on Monday February 04, 2002 @04:32PM (#2952018) Homepage Journal
  
  Now that you mention AMD. It has been roumoured last week all over the net that intel has a backup plan, an P4 with 64bit extenstions
  
  os.opinion article [osopinion.com]
  news.com [com.com]
  
  by the way, the amd hammer is expected to 105 mmm^2 [theinquirer.org] on 130 nanometer (.13).
  
  the current amd MP (palomino) has a die size of 129mm [electronicstimes.com] on .18.
  
  the original P4 has a die size of 217mm and is now at 150 mm^2.(with a bigger cache)
  
  Note that the original article does mention the 424 size is on .18 and the next generation is on .13. note that this can make a differce of a factor 2 (13^2/18^2= 0.52)
  
  Parent Share
  twitter facebook
- AMD Athlon (Score:2, Informative)
  
  by Vagrant ( 518197 ) writes:
  
  184 square mm die size (prior to Athlon 800)
  102 square mm die size (Athlon 800) ... source [earthweb.com]
  Note that this article also states that: Intel has also incorporated a substantial amount of redundant circuitry in the processor, Krewell said. Chipmakers often use redundant circuitry to boost yields. Sometimes, circuits come out scrambled on a finished chip. If the manufacturer has put in two sets of the same circuits, the chip will function properly because it can use the second set.
  You could have a dual Pentium machine and not even know it :)
  I guess this redundancy is why the chip has gone up 10% in size in the last couple of months ... (see this article) [com.com] which quotes: One of the reasons for McKinley's bigger price tag, Krewell said, is that it will cover nearly 440 square millimeters in area--or more than twice that of the Pentium 4.
- - Re:What effects die size (Score:2)
    
    by kilrogg ( 119108 ) writes:
    
    That's were flipchip has an advantage, you can more freely place your bumps (the flipchip equiv of a pad) anywhere on die. The I/O cicuitry still take up a lot of space(output buffers/esd protections), but at least you aren't stuck with empty space in the middle. Another pro is no bondwire inductance, which is good for high data rate I/Os. Main downside is the high cost of the substrate (i.e. package), this can be a big problem for high volume/low-margin chips like CPU.
    Anyway, with 3MB of onboard cache, I doubt the I/O pads are to blame for the large die size of this CPU.
Die Photo and Size (Score:5, Informative)

by rbeattie ( 43187 ) writes: <russ@russellbeattie.com> on Monday February 04, 2002 @04:01PM (#2951824) Homepage

Ace's Hardware has this bit with more information [aceshardware.com] including links to an Intel presentation.

"Slide 22 of the presentation features a die photo of McKinley. The large 3 MB L3 cache is notable, and according to the presentation, it consumes 20% less area than traditional designs and is overall 85% efficient (~70% for traditional designs)."

And here's a story with the photo [theinquirer.net] from that same article (no need to download 2.5 meg pdf...)

-Russ

Share
twitter facebook
A few minor points (Score:4, Insightful)

by mtnharo ( 523610 ) writes: <greengeek AT earthlink DOT net> on Monday February 04, 2002 @04:03PM (#2951836) Homepage

Just for some minor clarifications: The 464 mm squared is the area of the actual cpu die. Like the little square on top of an athlon. So 2 cm per side die is kind of huge for a processor. The actual processor out of the box would have to be much larger than previous models. Next, 3 MB cache sounds nice, but L3? It may be on die, but by that point the clock reduction probably makes it perform equivalently to a 256 k L1 cache, or a 512 or larger L2. Not that it won't help a lot for complicated instructions, and it's probably less expensive/difficult to engineer to hook a larger amount of cache to a slower pipeline than to add more cache deeper into the cpu's core. 64-bit cpu's will be important in the future, but only when compatible apps and OS designs become mainstream.

Share
twitter facebook
- Thanks! Where would we be without clarifications? (Score:3, Informative)
  
  by megalomang ( 217790 ) writes:
  
  Thanks for your "clarifications". You have saved us all from a life of ignorance.
  
  What you meant to say (and what the article said), is that 464mm^2 is size of the actual die size of the processor This includes the CPU and the caches. The CPU is a relatively small portion of the processor die, and noting there is 3MB of L3, the total cache may amount to 2/3 of the die size. The square on top of the athlon is also the entire processor die: cpu, caches and all.
  
  Also, L3 cache can never perform "equivalently" to L2 or L1 cache unless it runs at core speed. And I can tell you now, it doesn't -- or they wouldn't need L1 and L2. The L3 cache probably runs at something like 10 access cycles or more. It's not difficult to engineer 10 access cycles into any pipeline -- it's impossible. Which is precisely why it's not L1.
  
  I'm quite sure the engineers at Intel have done their modeling homework and determined that however fast the L4 memory may be, the L3 will improve performance by that much more.
  
  Remember, this processor is not meant to go on you or any other Joe Sixpack's desktop. It is meant to sit inside the workstations on the desks of engineers and in the racks of high-bandwidth servers. These platforms are specifically designed to run hundreds of tasks simultaneously and handle staggeringly high memory bandwidths. It has nothing to do with "complicated instructions." The L3 exists for swapping out large pages of memory in large bursts from a significantly larger sized L4 memory (think on the order of 100's of GB) from L5 memory (local drives and SANs) that has an incomprehendable virtual memory space.
  
  This has absolutely nothing to do with mainstream. I'm quite certain an OS already exists that will run on the platform. An IA-64 Linux is well under way (try http://www.linuxia64.org [linuxia64.org]) and you can bet that Compaq, HP, Dell, and Intel have put a total of more than 100x your lifetime earnings into developing software for that platform.
  
  Intel could not care less whether you or 99.9% of the /. readers out there ever buy an IA-64. They don't give a crap about your market segment, but I'm sure if you want to drop $10K+ on a IA64 workstation, be my guest. Your choices are limited. Either choose IA64 or UltraSparc. Or maybe if AMD ever gets a design win, you might get a chance to buy a Hammer box.
- Cache Design 101. (Score:3, Informative)
  
  by Christopher Thomas ( 11717 ) writes:
  
  Next, 3 MB cache sounds nice, but L3? It may be on die, but by that point the clock reduction probably makes it perform equivalently to a 256 k L1 cache, or a 512 or larger L2.
  
  A fundamental rule of building caches is that a larger cache is slower and dissipates more energy per lookup than a smaller cache. This is why multilevel cacheing exists in the first place (otherwise we'd just have a huge L1 cache - and before you mention it, due to architectural sneakiness, HP's giant L1 cache isn't really an L1 cache).
  
  So, you can't just spend the L3 area on making a bigger L2. You'd end up with a slower, hotter L2, which could easily _degrade_ performance.
  
  As long as the L3 cache is faster to access than main memory, it'll be useful for some things. Whether it'll be useful for *most* things is another issue. This depends on the "working set" of the applications you're running (how much memory they repeatedly access). I guess Intel's banking on working sets being larger than most caches.
  
  Another possibility is that they're testing the cache architecture for use in future SMT or CMP designs (both of which would have multiple independent executioin contexts running). If you're running multiple *independent* contexts, the working set grows with the number of contexts.
- - Re:A few minor points (Score:2)
    
    by Steveftoth ( 78419 ) writes:
    
    Java already runs in a 64 bit enviroment. Sparc III is 64 bit. The solaris jvm is 64-bit if you install the 64 version. (There are 2 versions, one normal 32 and a extended 64 version.)
Itanium at 1.6 GHz in 2003 ? (Score:3, Insightful)

by Utopia ( 149375 ) writes: on Monday February 04, 2002 @04:07PM (#2951861)

For the article
Madison is expected to come out in 2003 and run between 1.2GHz and 1.6GHz, according to sources.

I wonder how Intel expects people to adopt Itanium-based processors considering
that x86 processors will be running at 4GHz in 2003.

Share
twitter facebook
- Re:Itanium at 1.6 GHz in 2003 ? (Score:2)
  
  by jmv ( 93421 ) writes:
  
  AFAIK, current Itanium chips run around 1 GHz and are way faster than a P4 2.2 GHz. Did someone mention "clockspeed is not everything"? It you look at (non x86-based MPP) super-computers, I'd bet none of them runs with CPUS faster than 1 GHz, yet each processor is often an order (or two) of magnitude faster.
  - Re:Itanium at 1.6 GHz in 2003 ? (Score:2, Informative)
    
    by Utopia ( 149375 ) writes:
    
    According to SPEC CPU2000 Results [spec.org], The Itanium at 800 Mhz performances equivalent to a Pentium III 800 Mhz. In fact the Pentium III is a little faster.
    
    Do you have any other figures to substantiate your claim ?
  - Re:Itanium at 1.6 GHz in 2003 ? (Score:4, Interesting)
    
    by HiredMan ( 5546 ) writes: on Monday February 04, 2002 @08:00PM (#2952936) Journal
    
    AFAIK - Enough said.
    
    As people have pointed out the 800Mhz Itanium chips - the fastest you can buy - have an integer performance slightly less than an 800Mhz PIII.
    
    From the article: "Applications will be about one and a half to two times faster than what you get on a (current) Itanium"
    I'm assuming this is WITH the huge L3 cache in pilot systems if they are claimed actual application performance.
    
    Let's compare this to the REAL competition: IBMs Power4.
    
    IBM Power4 1.3GHz - shipping for a while now:
    SPECint2000 = 814 SPECint_base2000 = 790
    SPECfp2000 = 1169 SPECfp_base2000 = 1098
    
    Even the best Itanium reported int numbers are:
    SPECint2000 = 365 SPECint_base2000 = 358
    (Same box) SPECfp2000 = 610 SPECfp_base2000 = 526
    
    Even if the McKinley (which doesn't ship for 6 months or so) produces double the Itanium numbers (which it won't) it'll still lag the currently shipping Power4 chips.
    And with only an clock speed increase of 60% over the next three years IBM can stay ahead simply by getting the 1.8Ghz models out the door in the next 24 months. (That's assuming that the 1.6Ghz McKinleys will even outperform the current Power4s.)
    
    It looks like Intel has increased clock speed by 25% added a bunch of L3 cache and is claiming 150%-200% gain. I think Intel has a (big) dog on their hands and they're trying to dress it up. The P4 performance will probably continue to outrun their flagship "server" chip and because of AMD Intel can't afford to strangle the P4's performance as they might have been able to in the past.
    
    Intel said, "Wait for Merced." - which we did for years. Then they said, "Well, the Itanium sucks, but wait for McKinley!"
    
    =tkk
    
    Parent Share
    twitter facebook
    - - Re:Itanium at 1.6 GHz in 2003 ? (Score:3, Interesting)
        
        by HiredMan ( 5546 ) writes:
        
        They did a lot more than that. It has a shorter pipeline, higher clockrate, additional integer units, on-die L3 cache.
        
        That's true.
        
        150-200% is a modest prediction for performance.
        
        This was the prediction of an Intel representative. I can't imagine he was TOO conservative... Then again it's academic since no one is actually running software on an Itanium - who can compare their current results with future ones? ;)
        But seriously - the faster clock speed and cache (since Int operations are much more sensitive to cache changes) would account for a nice bump in performance. I'd expect nearly a 50% increase in speed simply from the changes I noted. Even if it is twices as fast then new chip arch is only reponsible for a small increase in that speed.
        
        My point is that HP decided as early as 1996 that the Merced project would never surpass PA-RISC and essentially took their marbles and went home. McKinley was an attempt to get something out of the project after it was clearly headed for failure. Intel should have known they had a dog on their hands and yet the flogged the FUD for years and after billions of dollars they have yet to deploy a compelling technology.
        
        You should also note in your SPEC marks that there's accusations that IBM "cheated" with their submissions.
        
        Thank goodness Intel has never been accussed of anything so horrid!
        
        I'm not sure on the details on it, but I was reading parts of it on www.realworldtech.com the other day.
        
        Well if it's on the Internet it MUST be true...
        Let me get this straight - because you "heard something" you can't back up I should note that IBM's officially submitted Spec results are faked? How do you figure?
        
        =tkk
        
        Re:Itanium at 1.6 GHz in 2003 ? (Score:2)
        
        by OnanTheBarbarian ( 245959 ) writes:
        
        As far as I know, the controversy about SPECfp 2000 relates to Sun's compiler, not IBM's. The usual "Your compiler does too well on SPEC" whining, nothing serious.
        
        I think SPEC needs to release a new benchmark set every 6 months, with about 50 really large programs in it each time. Ok, that's a little silly, but otherwise "gaming" the SPECs is essentially inevitable.
- Who cares about GHz... (Score:5, Insightful)
  
  by jbf ( 30261 ) writes: on Monday February 04, 2002 @04:32PM (#2952017)
  
  ... if you can't run the apps.
  
  Intel x86 is restricted to 48-bit addressing (with segment registers), and practically 64GB with modern OSes. (http://linux-mm.org/)
  
  If I want more than 64GB of addressable physical memory (which I do for some apps), then who cares if you can give me a 32-bit x86 running at 900GHz, it's not going to do diddly squat for me, since _going over the PCI bus_ for swap is going to kill me vs a 1.6GHz 64-bit processor. And since you need to go over the PCI bus just to get to a pseudo-disk stuffed with RAM, that solution is still bogus.
  
  I see your point that this isn't what Joe Blow's gonna put on his desk. But the improved address space is definately a big win, and that's assuming that they can't ramp up the clock speed in a hurry.
  
  Parent Share
  twitter facebook
- Re:Itanium at 1.6 GHz in 2003 ? (Score:2)
  
  by nosferatu-man ( 13652 ) writes:
  
  IA32 and IA64 are radically different ISAs aimed at radically
  different markets. There's nothing on any recent Intel roadmaps that
  will have Itanic replacing x86 on the desktop. Conversely, 4ghz of
  Hot P4 Action is meaningless to an application that requires more than
  4gb of process address space.
  
  A 1.6ghz McKinely ought to be a very competitive performer, especially
  on floating-point intensive code.
  
  Peace,
  (jfb)
  - Re:Itanium at 1.6 GHz in 2003 ? (Score:3, Informative)
    
    by roca ( 43122 ) writes:
    
    > There's nothing on any recent Intel roadmaps that
    > will have Itanic replacing x86 on the desktop.
    
    Which is really going to hurt them. The latest version of Everquest recommends 512MB of RAM. High-end gamers are going to need 64-bit addressing in a few years. AMD will be able to supply cheap 64-bit chips, Intel will be playing catch-up at best.
    - Re:Itanium at 1.6 GHz in 2003 ? (Score:2)
      
      by nosferatu-man ( 13652 ) writes:
      
      Good god, are you serious? 512mb RAM for a VIDEO GAME? O tempora! O
      mores!
      
      Yeesh.
      
      (jfb)
  - Re:Itanium at 1.6 GHz in 2003 ? (Score:3, Funny)
    
    by sconeu ( 64226 ) writes:
    
    4ghz of Hot P4 Action
    
    I'm sorry, but I just got the mental image of the geek pr0n site that would use this tagline!
64 bit regs is new? (Score:4, Interesting)

by gTsiros ( 205624 ) writes: on Monday February 04, 2002 @04:07PM (#2951865)

Yeah, right. Intel is the big player. Right.

My calculator's [google.com] processor has 64 bit registers. You think i'm trolling? Check it out for yourself:
google search [google.com]

There are a lot more (and more powerful) procs out there, but this one just seems more appropriate for intel bashing ;)

Share
twitter facebook
- Re:64 bit regs is new? (Score:2)
  
  by SilentChris ( 452960 ) writes:
  
  Your calculator also has a lack of cache, a variety of register instructions and compatibility with other architecture. Your point is?
- Re:64 bit regs is new? (Score:2)
  
  by DarkEdgeX ( 212110 ) writes:
  
  Yah, but how MANY does it have? The IA-64 architecture, as I recall, calls for 256 such registers. I sincerely doubt your calculator has that many (let alone the redundant sets that no doubt exist internally to speed up execution). It is dumb to say it's the first processor with 64-bit registers, though..
- Re:64 bit regs is new? (Score:3, Informative)
  
  by morcheeba ( 260908 ) writes:
  
  A quick summary of the Saturn microprocessor, for those interested...
  
  The Saturn processor is a propietary HP chip used in many of its calculators. It's generally considered a 4 bit chip (since this is the internal data bus size), but it has four 64-bit registers [zoy.org]. I think the coolest part of the chip is that each instruction can operate on various portions of these registers -- for example, only the upper nibble, or only the lowest 4 nibbles. Since this is a calculator, math is generally done in BCD format. Externally, the chip connects using an 8-bit data bus. The address bus width (and therefore the PC, too) is 20 bits wide, and each address refers to a nibble of data. Maximum addressable memory = 1 meganibbles = 512KB. Most of the calculator firmware (such as calculating the sine of a number or matrix manipulation) is interpreted RPL to allow code reuse code (to save time, and to ensure bug-free implementations)
  
  HP did a great job with this calculator, including releasing internal documenation and development tools. More info here [hpcalc.org], or use google.
  
  It's a shame that HP shut down thier calculator division. [slashdot.org]
It's the whole retro thing... (Score:2, Funny)

by Dutchmaan ( 442553 ) writes:

..perhaps we're trying to get back to the good ol' days when you could walk around inside your computer....

or maybe they're taking the term "big iron" a little too seriously..
Coffee warmer built-in! (Score:3, Funny)

by Insightfill ( 554828 ) writes: on Monday February 04, 2002 @04:20PM (#2951940) Homepage

At that size, a smallish mug should fit nicely on it. No use wasting all that heat!

Share
twitter facebook
- Re:Coffee warmer built-in! (Score:3, Funny)
  
  by sharkey ( 16670 ) writes:
  
  At that size, a smallish mug should fit nicely on it
  
  Now, someone needs to figure out how to mount it on the CD/DVD tray, so the cup-holder will be heated.
- Re:Coffee warmer built-in! (Score:2)
  
  by rlowe69 ( 74867 ) writes:
  
  At that size, a smallish mug should fit nicely on it. No use wasting all that heat!
  
  For now, maybe. According to an article [theinquirer.net] linked in an above post, the chips will be cooling down ... to room temperature! Unfortunately, this means no more hot coffee. :) Here's the quote from the article:
  
  And Intel claims McKinleys in the future will run at .13 micron and at 5GHz at normal room temperature, because of the low power circuits it will use.
  
  I don't drink coffee, but I was kinda hoping to have something to keep my feet warm. :)
That's Almost 3 bits per millimetre! (Score:3, Funny)

by Bobzibub ( 20561 ) writes: on Monday February 04, 2002 @04:26PM (#2951979)

"464 square millimeters which would make it one of the largest ever produced....due to the 64 bit registers." 464^.5=21.54mm a side.
64bits/21.54mm=2.97 bits/mm

They've GOT to start using smaller wavelengths!

Share
twitter facebook
Nothing new here - take a look at the hp-pa 8800 (Score:5, Interesting)

by Anonymous Coward writes: on Monday February 04, 2002 @04:31PM (#2952011)

http://www.lostcircuits.com/cpu/hp_pa8800

Has 3Mbyte L1 cache and 32Mbyte L2 cache and
a transistor count of 300 million.

To quote:

"The HP PA-8800 L1 cache is probably the biggest L1 that ever existed so far with separate 750 KBytes of data and instruction cache for each core. This results in no less of 4 blocks of ¾ MB density each for a total of an unprecedented 3 MB L1 cache, physically twice as much as the combined L1+L2 on IBM's Power4. Accordingly, the transistor count of the HP-PA8800 is with 300 Million transistors almost twice as high as the 170 Million transistors of the IBM Power4 and results in a die size of 23.6x15.5 mm2 or 361 mm2. The L2 cache of the PA-8800 is off-chip and consists of four 72 Mbit "1 Transistor SRAM" chips developed by Enhanced Memory Systems.

http://www.cpus.hp.com/technical_references/PA-8 70 0wp.pdf

has a roadmap of the hp-pa and Itanium chips so
really there is nothing new or exciting to report
that hasn't already been said 9 months ago.

Share
twitter facebook
- Re:Nothing new here - take a look at the hp-pa 880 (Score:3, Informative)
  
  by roca ( 43122 ) writes:
  
  The question with an L1 cache of that size is how many cycles it takes to access the cache. It's easy to make a huge L1 cache, you just pay in increased access time. It's not impressive until we know the latency numbers as well as the size.
  - Re:Nothing new here - take a look at the hp-pa 880 (Score:2, Informative)
    
    by Saidin ( 105994 ) writes:
    
    The latency is no secret. It is a 2 cycle latency cache. Pseudo 2-way set associative (you can load from an even and an odd row at the same time, but not 2 even or 2 odd)
Yes, but is Itanium going anywhere? (Score:2)

by sphealey ( 2855 ) writes:

Last I heard, Intel may have dug themselves a hole with Itanium. It's incompatibility with existing apps means that there is no desktop demand to drive economy of scale. Therefore the price isn't coming down and the price/performance is not improving faster than the older, "inferior" technology. How will they escape this death spiral?

sPh
- Re:Yes, but is Itanium going anywhere? (Score:2)
  
  by MtViewGuy ( 197597 ) writes:
  
  You are correct, alas.
  
  Technically, the Itanium architecture is a great idea, but with no real software available for the CPU (do we really have a native-register Linux distribution for this CPU?) the processor is not going to be very popular.
  - Re:Yes, but is Itanium going anywhere? (Score:2)
    
    by roca ( 43122 ) writes:
    
    Technically, the architecture is terrible. Intel bet that they could eliminate on-chip support for out-of-order execution and designed an architecture around that assumption, hoping that compilers would be able to compensate. They lost the bet. So Itanium is good for regular workloads, especially those with predictable memory accesses, but sucks hard for workloads with irregular memory access (like, say, programs that spend a lot of time traversing complex data structures).
- Re:It emulates the 32 bit instruction set (Score:2)
  
  by acomj ( 20611 ) writes:
  
  I think it does run older programs. In a simulation/emulation mode. Don't know much on how it works though...
AMD's Response (Score:2)

by Murdock037 ( 469526 ) writes:

I wonder if AMD will start some sort of "size isn't everything" initiative.

Maybe they could offer some sort of conversion system, so that consumers can easily convert between centimeters and inches, and understand that AMD's new 1.5"+ chips perform about the same as Intel's 20mm McKinleys...
So... (Score:2)

by sohp ( 22984 ) writes:

Is that an Itanium in your pocket, or are you just glad to see me?
What I've always wondered ... (Score:2, Interesting)

by felicity ( 870 ) writes:

... since Intel basically already owned the Alpha chip, and the Alpha already outperforms the standard Intel chips, and the Alpha is already 64-bit, and already has software for it, and has a long proven track record, and had design plans going out many many years to make improvements to the design ... Why is Intel spending so much time and money on this new chip which is already over budget and behind schedule?
I've never been able to figure that out.
- Re:What I've always wondered ... (Score:2)
  
  by leandrod ( 17766 ) writes:
  
  First, pride. Not invented here syndrome -- while Intel has already digested some external technology like the StrongARM, still their engineers must have some pride left -- and big ones, since the best engineers have already left and the mediocre ones tend to be the proudest. Also, it's not Intel only, but Intel and HP, so if Intel would shift everything to the Alpha HP would have to agree on that.
  
  Second, dumbness. VLIW probably isn't a good idea anyway to general purpose microprocessors, and while EPIC tries to address some VLIW shortcomings it makes for a pretty complex architecture which negates some of the VLIW proposed benefits. The picture gets still worse if you throw in IA32 compatibility.
  
  Third, they own the design, but once again the best engineers left the company. No self-respecting engineer wants to work for Intel, they have a long history of abusing employees and imposing dumb management decisions on technicians for a long time now, in their branchs all over the world.
  
  So the only sane architectures left with a future on the market are PowerPC and UltraSPARC. Sad but true.
Product names... (Score:2)

by torpor ( 458 ) writes:

"... also announced was the new 'Chicanium', which combines the 200Mnerdz power processing of the Itemium with the sexual prowess, digitally extracted, of a cheap Tijuana hooker and her little brother. Release date, 2002."

(The processor, not her little brother.)
Importance of 64-bit architectures (Score:2)

by billstewart ( 78916 ) writes:

The computer world is increasingly reaching the point that we need 64-bit addressing - the price of memory has reached the point that computers with more than 4GB of RAM are not only feasible, but becoming common, and for a couple years we've had disk drives bigger than 4GB. This means that 32-bit addresses are no longer enough - and with the 36-bit and 48-bit segment-based things that allow machines to address more memory, we're rehashing all the ugly tricks we had to play with 20-bit and 24-bit addressing on 16-bit-addressed 8086/80286 machines. Ugly, ugly, ugly!

For most people, 64-bit arithmetic isn't critical - most applications don't deal with ints larger than a billion, though us crypto people who do lots of bignum math are happy to get a 4x speedup. Otherwise, the quality of floating point implementations is likely to be more important. So it would be possible to get by with 32-bit arithmetic and 64-bit addresses, like we did with the Motorola 68000's 16-bit-ints and 32-bit registers and addresses - but that was also somewhat tacky, and led to *lots* of bugs in code that assumed ints and pointers were the same size, though perhaps we've evolved enough past K&R C that newer software won't make that mistake as often.

A real problem this time around is that the C language and its relatives really do like 32-bit integers, and many of the Unix system calls also assume 32 bits. If you make the native int/pointer sizes 64 bits, there's a lot of stuff that will probably break. What kind of experience have people had running code on DEC Alphas and other real-64-bit chips?
What about the Alpha? (Score:2)

by evilviper ( 135110 ) writes:

Of all the media bull I've heard about the 64-Bit Intel/AMD chip, I've yet to hear ONE SINGLE person claim that the new chip will be somehow better than the Alpha.

Tell you what. Since all the apps need to be fixed (or at least recompiled) to work on 64-bit processors anyhow, why not just go the route of porting everything to the Alpha? We could use this to finally get the hell away from Intel's terrible chipset.

And for all of you that think the Alpha will be dying soon, there are plenty of companies other than Compaq with Alpha products that are far better quality than Intel, and will likely be cheaper as well.

http://www.microway.com/products/ws/alpha_21164.ht ml [microway.com]
- - Re:What about the Alpha? (Score:2)
    
    by evilviper ( 135110 ) writes:
    
    Perhaps I should have qualified my statement by saying: I have never heard one competent & trustworthy source make any reasonable claim that the 64-bit Intel chips will be any better than the Itanium. Of course there are plenty of people like you out there to spread the marketing fud.
    - Re:What about the Alpha? (Score:2)
      
      by jbischof ( 139557 ) writes:
      
      I think you mean better than Alpha.
      Who would you like to be your trustworthy source? Someone from Intel? Someone from Alpha? Someone from AMD? Good luck getting a trustworth estimate of processor performance from anyone.
      
      As far as spreading marketing fud, I haven't seen any marketing on the Itanium, so I really wouldn't know what to spread.
      
      As far as Itanium being better than Alpha, nobody knows what alpha could have done, and Itanium is a processor of the future, a technology that is being hammered out. You have to give Intel credit for trying to build something as advanced and ambitious as the Itanium.
eetimes coverage (Score:2)

by morcheeba ( 260908 ) writes:

EETimes has a nice article [eet.com] with a good graphic comparing the internal workings of the Itanium vs. McKinley ... a good level of detail: 10 vs. 8 pipeline stages, differing bus widths and speeds, execution units, etc.

The article also talks about other intel innovations disclosed at the International Solid-State Circuits Conference
- Re:Large? (Score:2)
  
  by WolfWithoutAClause ( 162946 ) writes:
  
  Yes, that would be just the raw silicon. Normal chips are typically under a cm^2 in size; everything else is packaging.
- Re:Large? (Score:2, Insightful)
  
  by max cohen ( 163682 ) writes:
  
  The deal is this: larger die size chips are harder manufacture cost-effectively make because you get fewer good die per wafer. A wafer is a fixed size and costs the same to process, regardless of the number of functional die you actually get from it. So, the more die you fit on it, and the more die that actually work when you are finished processing, the lower your final selling price can be. Since the die size for this chip is much larger than "normal" and will be made in fabs that arguably will have the same defectivity rates, they will likely get less working die from a wafer than "normal" and thus their costs to the consumer will be larger.
  - Large? HUMONGOUS! Is Intel daft? (Score:2)
    
    by SysKoll ( 48967 ) writes:
    
    A wafer is a fixed size and costs the same to process, regardless of the number of functional die you actually get from it. So, the more die you fit on it, and the more die that actually work when you are finished processing, the lower your final selling price can be.
    You're absolutely right. On top of that, the yield is going to be ridiculous. See, a wafer has defects. To get a good approximation, imagine a 6 or 8-inch target on which you shoot darts. The best wafer processes give you about half a dozen defects, and boy are these wafers expensive. Each time you have a defect, the chip that is engraved on this spot will be faulty and be rejected.
    You can easily see that for a given defect density, the same wafer will have approximately the same number of bad chips (even if you split hair with the probability of getting two defects tiled by the same chip). With a small die, you can easily squeeze more good chips around defect spots. One more reason why a small die size is key to yield.
    So this chip is going to cost a freaking fortune to manufacture, especially with the bleeding edge process they are boasting.
    But wait, it does not stop here. 22 x 22 mm chips, huh? Assume that the clock tree (i.e., the tree-like circuit that distributes the clock signal in the chip) has a longest path of 10 mm. That's already the heck of a skew on the signal. And you can easily increase that longest path estimate by 30-50% because signals can't propagate in straigth lines, they have to be routed along structures. This alone guarantees the clock speed will never go as high as competing chip's frequencies.
    This is a sheer waste of engineering resources. For a processor, such a size is just not practical.
    Conclusion: This thing is a demonstrator. It will never fly. It's not meant to. And even for a demonstrator, it's too bulky.
    
    -- SysKoll
- Re:largest ever produced? (Score:2)
  
  by chancycat ( 104884 ) writes:
  
  Die size, not pachage size. The package for one of these will be massive, no doubt.
  - Re:largest ever produced? (Score:2)
    
    by mfago ( 514801 ) writes:
    
    Actually the package is shown here [theinquirer.net] and it looks pretty reasonable.
    - Re:largest ever produced? (Score:2)
      
      by gorilla ( 36491 ) writes:
      
      Package size is mainly driven by the need to get enough space to put all the connections onto it. Most of the reason for the McK being big isn't going to increase the connections - if you have 3Mb of cache you need the exact same address lines if you had 0 cache.
- Re:Humm... aren't they a bit late? (Score:2, Insightful)
  
  by ocelotbob ( 173602 ) writes:
  
  You're missing the market. These chips, at least until they get them up to X86 speeds, aren't going to be used very much in the workstation market. These chips are going to be used in such places as database servers, where the current hack of 36 bit addressing used in Intel's current high end chips, the Xenons, is starting to fall apart. 64 bits doesn't mean much performance wise - in some applications, it's slower than 32 bits, but it means a world of difference when talking about storage. Instead of having hacks to get around the 4 gig barrier, one could, in theory at least, keep an entire database in memory.
- - Re:"JESUS, that's big" (Score:2)
    
    by Anonymous DWord ( 466154 ) writes:
    
    Nonono, his package, not his die size.
- Re:Less Logic, More Cache? (Score:3, Interesting)
  
  by roca ( 43122 ) writes:
  
  The cache miss penalty is huge in IA64 because it can't reorder stalled instructions. That's one reason its performance is terrible on irregular memory-intensive applications (i.e., most server workloads). Anything that reduces the cache miss rate has got to help.
- Nothing Moore (Score:2, Funny)
  
  by ackthpt ( 218170 ) writes:
  
  Once upon a mid-day dreary, while I plodded, weak and weary,
  Through an informative article about a truly massive core,
  While I nodded, the newsfeed was slashdotted, suddenly there came a tapping,
  As if FedEx gently rapping, rapping at my chamber door,
  "Prob'ly FedEx," I muttered, "with boxes of reminders;
  reminders of the law of Gordon Moore."
- Re:Not reccomended for use at (Score:2)
  
  by MaxVlast ( 103795 ) writes:
  
  Surely it's named after Mt. McKinley [peakbagger.com], the highest mountain in North America.
  - Re:Not reccomended for use at (Score:2)
    
    by sconeu ( 64226 ) writes:
    
    And just who or what do you think Mt. McKinley is named after?
    - Re:Not reccomended for use at (Score:2)
      
      by MaxVlast ( 103795 ) writes:
      
      Right, but I suspect that the processor has a lot more in common with the big chunk of rock (other than temperature -- Mt. McKinley is quite cold) than it does with a dead imperialist/jingoist figurehead.
      
      That's like advocating equal rights on Martin Luther's birthday. Sure Martin Luther King was named after him, but still...
- Re:Straight from the article... (Score:2)
  
  by MaxVlast ( 103795 ) writes:
  
  I find that journalism like the above is often the most effective way of communicating a point.
- Re:Wow--- some of the stuff I've seen (Score:2)
  
  by Knobby ( 71829 ) writes:
  
  Maya 4 runs on x86 and PPC boxes too.. How does the ability to run Maya 4 make the Itanium powerful.. I'd be much more impressed if you'd told me that Intel had Maya 4 running on a 500MHz Itanium and it blew the doors off a dual Athalon or Dual 1GHz G4 box..
  
  I want to see numbers.. How does the Itanium compare to the Ultrasparc 3 or Power4 processors? Does it play nicely in SMP configurations like the Power4? etc..
- Re:Intel is nuts!!!! (Score:2)
  
  by the eric conspiracy ( 20178 ) writes:
  
  It is never a good idea to make a product out of research project.
  
  Well, that's a rather cold view. After all, where is progress going to come from if people don't try to make products out of research projects. Heck, we would still be using piles of stones to count our bushels of grain otherwise.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Wow! (Score:2, Insightful)

It's how you use it (Score:4, Interesting)

Re:It's how you use it (Score:2, Informative)

Re:It's how you use it (Score:2)

Re:I have the book (Score:2)

Re:It's how you use it (Score:2)

Re:Wow! (Score:2, Funny)

Re:Wow! (Score:2, Interesting)

Re:Wow! (Score:2)

Re:Wow! (Score:2)

Re:Wow! (Score:2)

Re:Wow! (Score:2, Informative)

NOT not wow! (Score:3, Informative)

Re:NOT not wow! (Score:2)

Re:NOT not wow! (Score:2)

big chip... big fan (Score:3, Insightful)

Re:big chip... big fan (Score:3, Funny)

Re:big chip... big fan (Score:4, Troll)

Re:big chip... big fan (Score:3, Informative)

Re:big chip... big fan (Score:5, Informative)

Re:big chip... big fan (Score:2)

Re:big chip... big fan (Score:2, Insightful)

Re:big chip... big fan (Score:2)

Re:big chip... big fan (Score:2)

Re:big chip... big fan (Score:2, Insightful)

Die size war? (Score:5, Funny)

Nice review. (Score:2, Informative)

Intel's new marketing strategy (Score:2, Funny)

Re:Intel's new marketing strategy (Score:2)

$300 to produce? (Score:2, Funny)

Re:$300 to produce? (Score:2)

Re:$300 to produce? (Score:2, Funny)

Re:$300 to produce? (Score:2)

Re:$300 to produce? (Score:2)

Re:$300 to produce? (Score:2)

Re:$300 to produce? (Score:2)

L3 on die? (Score:2, Interesting)

Of course. (Score:2)

Note, this is the *DIE* size (Score:4, Informative)

Amd competition. more numbers. (Score:5, Informative)

AMD Athlon (Score:2, Informative)

Re:What effects die size (Score:2)

Die Photo and Size (Score:5, Informative)

A few minor points (Score:4, Insightful)

Thanks! Where would we be without clarifications? (Score:3, Informative)

Cache Design 101. (Score:3, Informative)

Re:A few minor points (Score:2)

Itanium at 1.6 GHz in 2003 ? (Score:3, Insightful)

Re:Itanium at 1.6 GHz in 2003 ? (Score:2)

Re:Itanium at 1.6 GHz in 2003 ? (Score:2, Informative)

Re:Itanium at 1.6 GHz in 2003 ? (Score:4, Interesting)

Re:Itanium at 1.6 GHz in 2003 ? (Score:3, Interesting)

Re:Itanium at 1.6 GHz in 2003 ? (Score:2)

Who cares about GHz... (Score:5, Insightful)

Re:Itanium at 1.6 GHz in 2003 ? (Score:2)

Re:Itanium at 1.6 GHz in 2003 ? (Score:3, Informative)

Re:Itanium at 1.6 GHz in 2003 ? (Score:2)

Re:Itanium at 1.6 GHz in 2003 ? (Score:3, Funny)

64 bit regs is new? (Score:4, Interesting)

Re:64 bit regs is new? (Score:2)

Re:64 bit regs is new? (Score:2)

Re:64 bit regs is new? (Score:3, Informative)

It's the whole retro thing... (Score:2, Funny)

Coffee warmer built-in! (Score:3, Funny)

Re:Coffee warmer built-in! (Score:3, Funny)

Re:Coffee warmer built-in! (Score:2)

That's Almost 3 bits per millimetre! (Score:3, Funny)

Nothing new here - take a look at the hp-pa 8800 (Score:5, Interesting)

Re:Nothing new here - take a look at the hp-pa 880 (Score:3, Informative)

Re:Nothing new here - take a look at the hp-pa 880 (Score:2, Informative)

Yes, but is Itanium going anywhere? (Score:2)

Re:Yes, but is Itanium going anywhere? (Score:2)

Re:Yes, but is Itanium going anywhere? (Score:2)

Re:It emulates the 32 bit instruction set (Score:2)

AMD's Response (Score:2)

So... (Score:2)

What I've always wondered ... (Score:2, Interesting)

Re:What I've always wondered ... (Score:2)

Product names... (Score:2)

Importance of 64-bit architectures (Score:2)

Note, this is the DIE size (Score:4, Informative)