The Pentium IV Dissected
An AC sent us this: "In this extremely well written and technical article, the author points out the various mistakes that Intel made with the production of the Pentium IV, the fact that Intel and other manufacturers have been misleading customers about the performance of the Pentium IV, and the amount of work that will be pushed onto software developers' backs to get a piece of software to run at a reasonable speed." Beginning section readable by anybody; by the end you need to know a little more assembly language than is healthy for anyone, but excellent overall. For a Cliff's Notes version of the above, try this NYTimes article discussing the chip in non-technical terms. My guess is that most computer buyers will continue to compare only clock speeds, however.
Credentials (Score:2)
The author is Darek Mihocka, "President and Founder, Emulators Inc." according to the article. Their main product is a Mac emulator for the PC. The corp shares his ego: "Our Macintosh and Atari emulators are simply the fastest on the planet. Period." Slashdot featured another of his rants [slashdot.org] earlier this year. That said, the reason SoftMac is fastest is because it's written in assembly (and even some machine code!). When it comes to code execution speed, he knows what he's talking about.
As for the P4, read the article closely. He realizes Intel is going for a brute-force, high-clock chip (he compares it to RISC). He admits it performs faster for some tasks, just far less efficiently. He just thinks Intel should have concentrated on better design (like AMD) instead of getting the big marketing win: a new chip with a huge clock speed. What's the point? Don't spend the big money on the P4 now, since AMD has the better design and will scale better beyond 1.5GHz.
Re:Bad execution, not architecture (Score:1)
They rule... very good people to represent a company like Intel....
Re:Pentium IV is still good, but we'll have to wai (Score:1)
When the 386 came out, people like you said "dos doesn't multitask anyway, and who needs 4mb of memory?"
When the Pentium came out, people like you said "my 486 is perfectly fine, this new pentium thing is a waste"
When the Pentium Pro came out, people like you said "what a waste of money, PPro is not too much faster than Pentium"
When the Pentium II for consumers came out, people like you said, "My Pentium 133 is fine, what's the difference anyway"
The first generation of most chips is not a great price/performance combo. But the second and third generations of these chips will get progressively better and better, just as the Athlon has improved from the POS P2 knockoff that it originally was.
AMD is the winner in the consumer market right now, after years and years of going nowhere. But consumer PC market is a relatively low-margin and fickle market. Intel makes their cash in the mainstream business market. Few, if any of the big-name business pc vendors even offer AMD-based pc's, since AMD cannot deliver a consistent supply of chips.
Re:Stop Being so Biased Against Intel (Score:2)
Re:P4 is slow? Overpriced? Says who? (Score:2)
Incorrect. There is no Athlon DDR motherboard released yet, but RDR (and SDR, obviously) motherboards are plentiful. We compare what's available, not vaporware.
also, did you notice that the pentium 4 machine had a top of the line hard drive (ibm deskstar 75gxp) and video card (geforce2gts) whereas the amd machines used an older ibm hard drive and a diamond stealth 3d pci(WTF?!!?) on the ddr machine and a western digital hd + nvidia tnt2 m64 on the sdr machine?
All of this is irrelevant for SPEC, which is a CPU only benchmark.
or how about the fact that all the tests were done with an intel compiler????
Well, where's AMD's compiler then? The benchmarks are compiled with the vendor's compiler of choice. What the results mean is that with the best available compiler, the P4 performs much better than Athlon. With the average compiler, this might not be the case, but anybody who is the least bit performance conscious is going to recompile everything.
Then there's the system prices, I have no idea where you got these prices, but assuming all 3 systems use the same components except cpu+mb+ram, the prices would probably look like:
The CPU prices are irrelevant; people buy systems, not CPU's. You can buy a P4 Gateway system for $2000. I have never seen a namebrand 1.2 GHz Athlon system for less than $1500 (though I haven't been shopping for them).
so based on these figures, the p4 is OVERPRICED!
Compared to Alpha (less than 10% more performance, at quadruple the price)?
Re:Who is this guy anyway? (Score:1)
The one good thing about the P IV is that it sounds like it can be scaled to even higher clock speeds. If a 1.5GHz P IV runs like a 900MHz PIII, then a 3GHz P IV would run twice as fast. Not 3+ times as fast like you'd expect, but enough faster to make upgrading worthwhile. Once Intel hits that level, then they can start selling P IV's, and then there will be a reason for software vendors to write for it. Unless AMD scales up the Athlon equally fast...
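The arithmetic behind that estimate, as a rough sketch. It assumes, as the comment does, that a 1.5GHz P4 does about the work of a 900MHz PIII and that the same ratio holds at higher clocks; the helper name `piii_equivalent_mhz` is made up for illustration.

```python
# Assumption from the comment above: 1500 MHz of P4 is worth 900 MHz of PIII,
# and that ratio stays the same as the P4 clock rises.
P4_REFERENCE_MHZ = 1500
PIII_REFERENCE_MHZ = 900


def piii_equivalent_mhz(p4_mhz):
    """Hypothetical helper: PIII clock that would match a P4 at p4_mhz."""
    return p4_mhz * PIII_REFERENCE_MHZ / P4_REFERENCE_MHZ


today = piii_equivalent_mhz(1500)   # P4 at 1.5 GHz
later = piii_equivalent_mhz(3000)   # P4 at 3 GHz

# Speedup over the 900 MHz PIII: what the ratio predicts vs. what the
# raw clock numbers would suggest.
predicted_speedup = later / PIII_REFERENCE_MHZ
naive_speedup = 3000 / PIII_REFERENCE_MHZ

print(today, later, predicted_speedup, round(naive_speedup, 2))
```

Under those assumptions the 3GHz part lands at twice the 900MHz PIII, against the 3.33 times that the clock speeds alone would suggest.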
more cluelessness (Score:1)
Tricks such as "register renaming", "out of order execution", and "predication".
In other words, if the programmer won't fix the code, the chip will do it for him
First he's complaining that Intel's CPUs place too much demand on compilers, now he's complaining that the chip is optimizing instructions. I guess Intel can't do ANYTHING right! Also, 1) afaik, there is no ISA that allows for register rename hints, 2) out of order execution is useful for doing hit-over-miss. Compilers can't predict cache misses.
Quote:
The PowerPC G3 and G4 chips use much the same tricks (after all, all these silicon engineers went to the same schools and read the same technical papers) which is why the G3 runs faster than a similarly clocked 603 or 604 chip
G3s are NOT faster than 603es at the same clock speed. G3s use 603 based cores.
Handwaving (Score:1)
If by "specific examples" you mean "handwaving", then yes, I would agree.
On many occasions, he fails to provide or cite any code or data to support his claims. For example, in "Why the AMD Athlon doesn't suck", he claims that "The AMD Athlon has no partial register stall" while he does not state how he determined that this is the case, either from an empirical or engineering standpoint. That screams either "This part of my thesis is not important enough to support with facts or data" or "I don't
And in "The Benchmarks", he states that "Running other tests using various emulators, I found that in general the Pentium 4 runs emulators such as SoftMac 2000 SLOWER in most cases than the 650 MHz Pentium III and 600 MHz AMD Athlon." Which tests and emulators? (And why do we care about emulators since the great majority of end users don't?)
Additionally, he advocates changing
It is also rather annoying that he repeatedly states "CLOCK SPEED IS NOT EVERYTHING" while making comparisons such as "Pentium 4 fails to keep up with even the 600 MHz chips".
(And he needs an editor really, really, badly.)
Bad execution, not architecture (Score:5)
He didn't say that the overall architecture is bad. He didn't say that the P4 will lead to bad designs in the future. He said that some of the choices for the present P4 configuration are bad and that people would be better served by spending their money elsewhere. If people buy Intel chips no matter what the actual price to value ratio is, then Intel has won and the consumer has lost.
The author gives very good explanations of the limitations of the present incarnation of the P4. He also explains what he thinks needs to be fixed. With all those fixes, the P4, in a few years, will likely be a really good chip. The design isn't beyond repair, it's just flawed.
I remember the 486SX clearly - and how my father was duped by the hype. The same thing's happening here. Also, if Intel really believed the P4 was its best chip, why are the colored guys on TV hyping the P3 like there's no tomorrow? (No, that's not a racist remark. If you've seen the ad, you know what I mean.)
The bigger problem is that, even though you can get around the limitations of the P4 chip by writing a really smart compiler, the P3's and below will be around for years, so you won't necessarily be using the optimization settings in generic code. You'll likely see 'Word 2005 for the P4' and 'Word 2005 for the P3 and below', although there's nothing preventing them from being on the same DVD and the installer choosing the right version.
If you can get past some of the strong language in the article (Intel engineers are stupid; boycott Intel; etc.) you can see that he's not anti-Intel per se. He's anti-Intel's marketing guys, who seem to be running the company at the moment. The decisions made in the present P4 incarnation have to be marketing's - no other explanation holds water. You can't design the next generation chip and then deliberately cripple it. That's like having a son and then cutting off his foot to see how he gets along in the real world. I doubt engineers had much to say in the present P4 configuration.
The author provides pretty convincing proof that the best value for your money is an Athlon system, right now. I haven't seen anyone here able to refute that statement. It's the same conclusion that a couple of other people have reached. From all I've read over the past few months, I have to agree.
Re:Technical Flaw in the article: (Score:3)
It is also worth noting that to a certain extent, history is repeating itself. The Zilog Z80 was itself a clone of the Intel 8080. By the late 70's, Zilog, which was originally an upstart clone chip vendor, had overtaken Intel by building a better and cheaper product. Intel's answer to the Z80, the 8085, was largely a disappointment, much like the P4 today. Intel was forced to move to 16 bit with the 8086 (and the 'ginsu' 8088) in order to grab back the market they had lost to Zilog. Intel was successful mainly because they succeeded in selling the 8088 to IBM, which bailed them out. Zilog's 16 bit processor, the Z8000, was a failure because it was too ambitious and not at all compatible with their 8 bit designs, despite the fact that many people thought it was superior to Intel's 16 bit chips, which were largely just warmed over 8 bit designs with larger registers.
It remains to be seen how things will sort out now. For all intents and purposes, Intel's P3 and P4 look to be beaten technically and price/performance wise by AMD. Intel appears to be largely betting on the IA64 to win back the market, but unlike the 8 bit -> 16 bit transition, it is Intel who is betting on a totally new and mostly incompatible architecture for 64 bits rather than AMD, who appears to be charting a much more conservative extension of the basic x86 architecture to 64 bits. If AMD gets software support for their 64 bit architecture before Intel does, which may happen because it is less of a jump, or AMD is able to push 64 bit processors into lower pricepoint boxes quicker, which also seems doable, Intel could be in trouble. One other big thing will be whether the AMD architecture runs existing 32 bit x86 code faster than the Intel IA64 processors do. Since many people will be largely dependent on legacy applications, if AMD can offer the promise of 64 bit applications in the future and better performance for existing 32 bit apps, then Intel will really be hurting.
Be Careful (Score:1)
Re:Oh My God! (Score:1)
Boycott Intel (Score:2)
It may seem obvious to some, but that's exactly the point. Who cares if it's shoddily produced and a poor performer, it's got two very important things going for it.
1. It's got the fastest clock speeds out there.
2. It's got the Intel (tm) brand name.
The average computer user doesn't have a clue that it performs slower than a slower clocked AMD chip. They see the higher number, and assume that means it's better. Who's AMD? They don't have all those nice commercials with Blue Man Group and all, and the nice logo. Selling chips isn't really about technology as much as it is about marketing. For example, Cyrix's PR266/PR300/etc - they didn't actually run at 300MHz but they said that they performed equal to around a 300MHz processor, so they sold them as "300"'s, figuring consumers would assume that means 300MHz. That was all bs - Cyrix just couldn't keep their clockspeeds rising at the same rate as Intel, and realized that they could take advantage of the average consumer's ignorance. Intel seems to be banking on that same ignorance today; I think this line sums it all up the best:
What it boils down to is this - just like at Microsoft and just like at Apple, the marketing scumbags at Intel have prevailed and pushed sound engineering aside.
We can't allow Intel to charge a premium for poorly performing chips, nor can we allow them to lie about their ability. The only solution is to boycott the P4 and all Intel products. Buy AMD, you'll be happy you did (I am).
1.4 and 1.5 GHz? (Score:1)
Re:mod parent up (Score:1)
Re:Credentials (Score:2)
It will be interesting to see just how the next round of AMD vs. Intel pans out. Will the next AMD chip have similar clock speeds to the P4? If it does not, then regardless of performance I fear for it, because everyone except knowledgeable Slashdot types buys on a clock-speed basis alone. I know I used to, before I became really interested in this computing lark!
As another respondent says, I suppose his credentials don't matter so much when he gives evidence to back up his claims. But still, if you are not a real expert, it's good to know that he isn't just some quack, and quite useful to me! Thanks.
Re:Two things (Score:1)
Re:this guy needs a clue (Score:1)
Re:Credentials (Score:1)
Assembly and machine code are synonyms.
At last --- someone who actually read the article (Score:1)
It appears the whole article can be boiled down to these points of interest:
If the road maps hold out, Intel is screwed. (Score:2)
"AMD is hoping that the re-worked core will bring the Athlon to at least 1.7GHz by the second half of 2001. By this time the 1.2GHz Athlon CPU on 266MHz front side bus will occupy the lowest rung on AMD's performance ladder. Once the Palomino runs out of headroom, the next horse will escape from the barn."
If the road-maps of both companies can be followed, then Intel has a serious mess on its hands.
next round for the Wintel monopoly? (Score:1)
If Mihocka's analysis is correct, it could be interpreted as a ploy to perk up slow software and hardware sales. Right now we're in a market where people (consumers) are mostly satisfied with the performance of the hardware and software they have. They've gotten off the upgrade treadmill because they've found that for 90% of what they do, any PC, even older Pentiums, will perform fine provided the user has sufficient memory and video power.
Current software runs poorly on the P4 because the design is so different from the earlier Pentium family that code optimized for those chips needs to be completely recompiled or re-written for the P4. Microsoft has the compiler, it can update the compiler, rebuild all its apps (and call them Office 2004) and tell users that they should buy a P4 and new software to have the fastest performance available today. MSFT and Intel both make boatloads more money selling stuff to people who would otherwise be happy to stick with what they have.
Basically, if someone buys a P4 and finds out that it doesn't perform well with existing software, they'll be enticed to buy upgrades from MSFT. Someone buying the latest MSFT software will be enticed to buy a P4 to get the most performance out of the software.
Re:Pentium IV is still good, but we'll have to wai (Score:1)
Re:P4 is slow? Overpriced? Says who? (Score:2)
RDRAM motherboards for the P4 are not "plentiful." The only ones that exist are the Intel boards and the Asus; two motherboards do not count as plentiful.
And DDR motherboards are not "vaporware." Vaporware products are products that do not exist. DDR systems are available from places like Micron.
Well, where's AMD's compiler then? The benchmarks are compiled with the vendor's compiler of choice. What the results mean is that with the best available compiler, the P4 performs much better than Athlon. With the average compiler, this might not be the case, but anybody who is the least bit performance conscious is going to recompile everything.
The fact that the benchmarks were done with an Intel compiler shows that the results are biased toward one vendor. What if the test was done with gcc? How would the results turn out then? And as for recompiling everything to get the most performance, how are we going to get the source code to closed-source programs?
The CPU prices are irrelevant; people buy systems, not CPU's. You can buy a P4 Gateway system for $2000. I have never seen a namebrand 1.2 GHz Athlon system for less than $1500 (though I haven't been shopping for them).
of course the cpu prices are relevant, assuming 2 systems have the same monitor/case/video card/hard drive, everything else comes down to cpu+ram+mb prices. If a p4 system can be built for $2000, then the same system can be built for a little more than $1000 with athlon/sdr ram. Places like gateway just happen to be selling their athlon 1.2 ghz systems for more than they're worth to make more profit whereas they probably are barely making a profit with that 1.4 ghz p4 for $2000.
Zetetic
Seeking; proceeding by inquiry.
Elench
A specious but fallacious argument; a sophism.
Re:Pentium 4 blunder (Score:3)
That is exactly what people said about the Pentium Pro when it came out and ran 16-bit code slower than the Pentium. And look what happened. The P6 arch proved to be extremely scalable, extensible, and yes, profitable. At this early point, I see no reason to assume that PentiumIV can't repeat this.
Quality of Moderation (Score:1)
The fifth highly-moderated comment is substantial: that what's bad engineering design for this point in time may actually provide a better platform on which to build high clock speed chips a year or two out. The commenter doesn't show why AMD's currently superior chips can't gain similar speed -- but at least begins a proper challenge to the paper under discussion's very credible technical analysis of why the Intel chips currently look bad.
Historically, no one stays at the top forever; some 'underdog' always wins. Most underdogs lose, badly. But anyone who's claimed "Intel/the Roman Emperor/the British Empire/IBM/the Nazis/CBS/Rock n Roll is just too big, smart and powerful to ever yield top position" through what, afterwards, will appear sheer idiocy has been wrong. We're not only surrounded by idiocy, it infiltrates us as individuals.
But could we tone it down in our moderation, where it's now become typical /. crap to award points to whoever says "This is typical /. crap" in the most typical, /. crap way?
Re:Boycott Intel (Score:1)
Amen. I was screwed by Cyrix back with my P150. It was *much* slower than a Pentium 150. When I finally upgraded, I got an Athlon, and I couldn't be happier.
From sources close to Intel (Score:1)
First of all, there is no central point of authority. The chip developers do not have a single way to add comments or say that a certain part is bad.
Secondly, the actual Fab plant where the chips are made in Albuquerque, New Mexico is continually being constructed and it is __incredibly__ dusty around there. That is probably one of the contributing factors to the incredible dirtiness of the plant.
Intel is a shoddy operation as far as I've seen it and I suggest staying away until they have a better recent track record.
HELO #kuro5hin
------------
Re:Two things (Score:1)
I don't think it was so much a matter of cost as a matter of time to market. More silicon==lower yields==longer development time to get a manufacturable process. Intel could have included many of the axed features in the design, but it would have resulted in an extra 6 months to a year before the product could be released. Meanwhile, the Athlon would have been wiping the floor with the Pentium III and gaining market share like crazy. Intel couldn't let AMD have both the performance crown and the MHz crown.
The current Pentium IV is more of a stop-gap measure. Just watch: in a year or so Intel will release a revised version with a larger cache and some fixes for the other problems mentioned in the article.
P.S. Will the next Pentium be a P5 or a P8 (or perhaps a Sexium)?
Two things (Score:4)
Secondly, I thought the entire point of the Pentium IV is that it is focused on different areas to the PIII and others. Specifically, it is designed for a media rich environment, and was designed with the future in mind. I would guess (bear in mind, I don't have any credentials) that we won't see the best of the PIV until a year or two down the line, when compilers are properly optimised for it and people start programming with its architecture in mind. Until then, I fear we are making unfair comparisons. Just my guess!
Re:Ordering a boycott? (Score:1)
Pentium 4 blunder (Score:2)
Re:Who is this guy anyway? (Score:1)
Faa -- The whole game of the mainstream computing market is trying to introduce something new without breaking back-compatibility. It's not the programmers that are lazy -- it's the consumers, who are still out there holding onto 8086 and 80286 and i386 software with white knuckles. (In fact, one nice thing about this
Intel *has* a solution for poor speed on 'legacy' code -- it's called cranking the clock speed up to 2GHz, efficiency be damned. This is faster on legacy code than any P6 that Intel could possibly make, BTW.
Of the people for whom that isn't a good enough solution, 90% can recompile. The other 10% is either out of luck or a Quake player that just has a bug up his ass that his shiny new 1.5GHz chip isn't running at maximum efficiency.
The author of the article makes his technical points, but the guy is a crank. See his previous Slashdot appearance complaining about Apple and other Mac software vendors dropping 040 support, for example (thus making his handcrafted emulators useless, even though Mac users were more than happy to leave 68K behind).
Ordering a boycott? (Score:1)
Hmmm... (Score:2)
what about compilers? (Score:1)
which makes one wonder whether the real problem might be not the processors, but the compilers. by that i mean, the traditional c compiler doesn't really have enough information to know when to apply what optimization. consider traditional tight loop over a region of memory - c lets you implement it as a straightforward for loop on array references, an "increment pointer and dereference, until pointer reaches some value" loop, and so on.
now the problem is, the compiler doesn't have any sort of an idea of what we're trying to do. had it known we're performing bitmap manipulation with multiplication over an integer buffer, it might be able to partially unroll the loop to fill in the pipeline, or automatically insert MMX code when it sees it's appropriate - same with the parallel-floating-point-op instruction sets. but since the c compiler doesn't know what data it's moving around, or what the user is really trying to do on a macro level, it doesn't know any better than to produce a pretty much straight translation of the c code.
this way performance suffers because the compiler isn't smart enough to automatically support the subtle features of those new processors, and that in turn can be traced back to the languages such as c not retaining enough information about what is being computed to support such automatization.
does this mean we may finally start seeing a move to higher-level languages, when low-level ones fail to compile as optimally as they ought to? i hope so. but considering how much c code there's still floating around out there, i won't hold my breath.
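A rough illustration of the partial unrolling mentioned above. Python is used only to keep the sketch short and runnable; the comment is about C compilers, and the function names and the unroll factor of four are assumptions made for the example, not something any particular compiler is known to emit.

```python
def scale_simple(buf, k):
    """Straight translation: one multiply per loop iteration."""
    out = [0] * len(buf)
    for i in range(len(buf)):
        out[i] = buf[i] * k
    return out


def scale_unrolled(buf, k):
    """Same result, with the loop body unrolled four times by hand."""
    n = len(buf)
    out = [0] * n
    main = n - n % 4  # largest multiple of 4 that fits in the buffer
    for i in range(0, main, 4):
        # Four independent multiplies per iteration. On real hardware these
        # could overlap in the pipeline or map onto one packed instruction.
        out[i] = buf[i] * k
        out[i + 1] = buf[i + 1] * k
        out[i + 2] = buf[i + 2] * k
        out[i + 3] = buf[i + 3] * k
    for i in range(main, n):  # leftover elements when n is not a multiple of 4
        out[i] = buf[i] * k
    return out


pixels = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3]
print(scale_simple(pixels, 2) == scale_unrolled(pixels, 2))
```

The two functions compute the same thing; the point is that the second form is only safe to generate when the compiler can prove the iterations are independent, which is the information the comment says C does not hand over.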
Re:Ordering a boycott? (Score:1)
I'm close to being in the market for a new computer (couple more months of paying off bills and I should have the cash to start), and I was comparing prices. Even if the Pentium 3 wasn't such a dog, I wouldn't get it. There are much cheaper alternatives. That and I was absolutely amazed at how cheap hard drives are now. (Sue me, I haven't comparison shopped computers in three years.)
I'm probably going to go with an AMD Athlon. And not because of this article. With the money I save on that, I can get a bigger hard drive, a scanner and some other neat toys.
Just my 2 shekels.
Kierthos
Poorly written article (Score:1)
Re:Tom's Hardware contradicts this (Score:1)
"Give the anarchist a cigarette"
Re:mod parent up (Score:1)
IA64? No thanks... (Score:1)
I know, MHz is not everything but if you look closely at the performance difference of let's say an Athlon (highest possible clock) and an Alpha (highest clock as well) then you won't see that much of a difference except in the price.
What I mean is that the P4 is going to be clocked over 2GHz very soon and will probably reach 4GHz in the next two years. At that time, IA64 will still be trying to reach 1GHz. IA64 is the biggest flop of Intel so far so I am not too eager to put my hand on it. Besides, the only reason to want 64 bit is to be able to access more than 2GB of address space per process. I don't really see this as being necessary right now...
Re:Two things (Score:1)
My hope is that AMD takes this opportunity to make a name for themselves and convince the Dells of the world to sell computers with AMD chips too. Unfortunately AMD seems to have a knack for blowing opportunities.
Re:Bad execution, not architecture (Score:1)
I suspect the PIV is a chip waiting for a process shrink, then you will see what it is all about. Remember the ugly, nasty Pentium 60MHz in 0.6 micron? Remember how much less heat a P66 in 0.5 put out? Remember that 0.5 micron chips later hit 90/100 MHz? That will happen here too. The Intel forecast of 2GHz by 4Q01 is probably too conservative, or they are playing coy.
I think the article does have a couple of interesting points though -- Intel relied on the trace cache too much, and probably didn't notice how much of a bottleneck it would be for their execution units if they could only issue 3 instructions from the trace cache (I suspect that some design parameter got changed late in the game, because that is a pretty bad mistake that I would expect to be noticed). However, I also think the author didn't realize how hard it is to run any SRAM-type structure at 1.5GHz, especially to scale it up to bigger sizes (his entire rant about 8K versus 64K I found pretty humorous).
The other humorous thing in the article is the comparison of cycle counts. The author spends lots of bold tags on making sure we know that MHz is not the only thing, but then looks at cycle counts. Well, bub, they are representing the same thing. For a given architecture one must consider the quotient of the clock cycle counts and the clock frequency to get a realistic measure of performance. You can implement a given chip with lots of short-fast pipeline stages or fewer, bigger stages. One approach is not "better" than the other -- it's dependent on process technology and what sample set of code you use to benchmark it on. Therefore, like most aspects of processor design, it's a tradeoff.
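The quotient described above, as a small sketch. The cycle counts and clock speeds are hypothetical, picked only to show how a deeper pipeline at a higher clock can come out ahead; `run_time_us` is an illustrative name.

```python
def run_time_us(cycles, clock_mhz):
    """Wall-clock time in microseconds: cycles divided by cycles per microsecond."""
    return cycles / clock_mhz


# Hypothetical workload: the long-pipeline chip needs twice the cycles
# but runs at 2.5 times the clock.
short_pipeline = run_time_us(cycles=6000, clock_mhz=600)
long_pipeline = run_time_us(cycles=12000, clock_mhz=1500)

print(short_pipeline, long_pipeline)
```

With these made-up figures the chip that burns twice as many cycles still finishes first, so neither the cycle count nor the MHz number says much on its own.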
One thing he does get very right: I certainly wouldn't buy a PIV right now. But, I think that in 12 months everything will look OK. I also don't think that a "bad" (i.e. a little slow) first version of a chip is a reason to discount an entire architectural implementation.
Re:P4 is slow? Overpriced? Says who? (Score:2)
That's not the point. The point is, using the Intel compiler on a P4 is faster than using ANY compiler with the Athlon. Thus, if you wanted a system with the fastest possible performance, you would use the combination of the P4 processor and the Intel compiler. The P4 with GCC, or an Athlon, would be an inferior choice (for performance).
of course the cpu prices are relevant, assuming 2 systems have the same monitor/case/video card/hard drive, everything else comes down to cpu+ram+mb prices. If a p4 system can be built for $2000, then the same system can be built for a little more than $1000 with athlon/sdr ram. Places like gateway just happen to be selling their athlon 1.2 ghz systems for more than they're worth to make more profit whereas they probably are barely making a profit with that 1.4 ghz p4 for $2000.
Typically, only complete system prices are compared. It may be true that a P4 is double the price of an Athlon, but that's comparing the CPU itself. The CPU is a small part of the system cost, so an Athlon system costs more than half as much as a comparably equipped P4 system. To compare CPU prices is to magnify their actual effect.
Comparing CPU prices repeats Transmeta's fallacy: they claimed that a CPU with half the power consumption would extend the battery life of the system, when in fact the CPU was not even the main power hog in a system.
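The same point in numbers, as a sketch. The 25% share is an invented figure, used for both the price and the power case; `whole_after_change` is a hypothetical helper name.

```python
def whole_after_change(part_share, part_ratio):
    """New total, relative to the old one, when a single component changes.

    part_share: fraction of the old total taken up by that component
    part_ratio: the component's new value divided by its old value
    """
    return (1 - part_share) + part_share * part_ratio


# Assumed for illustration: the CPU is 25% of the system price,
# and the cheaper CPU costs half as much.
system_price = whole_after_change(part_share=0.25, part_ratio=0.5)

# Same shape for power: assume the CPU draws 25% of system power
# and the replacement draws half of that.
system_power = whole_after_change(part_share=0.25, part_ratio=0.5)
battery_life_gain = 1 / system_power

print(system_price, round(battery_life_gain, 2))
```

Under those assumptions, halving the CPU price takes the system to 87.5% of its old price, and halving CPU power stretches battery life by about 14%, nowhere near double.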
Re:Who is this guy anyway? (Score:2)
He put up a good facade, but in reality his article wasn't a decent analysis -- it did have its good points, but there was so much BS in there that it was hardly worth the effort.
Re:Marketing/finance pukes (Score:1)
I agree, it seems that the PIV is really a half finished product that was rushed to market to prevent a massive loss of market share.
Engineers and geeks know that will only make things worse, but marketing and many other management pukes were raised on "The Little Engine that Could." They think that if you just try hard enough the impossible will happen.
I disagree. Sure, the PIV is not an Athlon killer in terms of performance, but there are plenty of clueless pointy-haired bosses and Joe six-packs out there that will never know that they shipped out a truckload of cash for only a marginal improvement in performance. After all most of their applications won't need that kind of performance anyway.
Grabbing the MHz crown (if not the actual performance crown) is a stroke of marketing genius. It buys Intel a little time to come out with an improved PIV that really is an Athlon killer.
Re:Pentium 4 blunder (Score:1)
As for AMD -- they are the one milking i386. Sledgehammer will be the most kick-ass 64-bit chip to ever run 16-bit code on Windows ME.
Re:Credentials (Score:1)
Reference was made several times in the article about the POOR optimizations of the compiler (he kept mentioning Microsoft compilers). Seems Microsoft (according to the article) lags about 3 years behind in getting their compilers to optimize for the newest processors. So, I suppose, the answer is that the compilers are not "halfway decent".
Re:1.4 and 1.5 GHz? (Score:2)
I'd guess you have never tried to push a video clip through a Sorensen CODEC.
Re:P4 is slow? Overpriced? Says who? (Score:2)
Re:Credentials (Score:1)
No they ain't.
Modern assemblers (macro-assemblers) do memory allocation of variables for you, as well as subroutine calling. If you care about data alignment in some arcane subroutine or some weird speed up when calling a subroutine (such as leaving data on registers) then you need to write directly in machine code.
Re:I have no life... (Score:2)
Grade: Technical Info - A; Language Mechanics - C (Score:1)
"Compare this to the 8086 and 80286 whose 16-bit instructions could only use certain INSTRUCTIONS for certain operations"
should instead read,
"Compare this to the 8086 and 80286 whose 16-bit instructions could only use certain REGISTERS for certain operations."
(I emphasized the word that should be changed by capitalizing it.)
I very much appreciated the information and insights that the author provided in his article.
Re:Poorly written article (Score:1)
Re:Bad execution, not architecture (Score:1)
I dunno, the whole "BOYCOT INTEL AND ALL IT'S PRODUCTS" seemed a little biased to me
In my opinion, this guy obviously has no clue about trade offs in chip design, and needs to get off his soapbox and read more before making such moronic articles. Making yourself look stupid is never a good thing. But I guess such a pompous ass as this really thinks that because he can write some assembler and do some timings that he can take on all of Intel's chip designers.
Oh ye of little faith (and knowledge). (Score:1)
But you're neglecting what appears to be Intel's strategy. For now, push the PIII, probably until mid next year. By that time, the compilers will be ready and the PIV will probably be at 2.0GHz. Considering the fact that with optimized code the PIV outperforms anything out there (yes, including AMD) in many benchmarks _now_, I'd say that unless AMD comes up with something fast, they'll be WAY behind the performance curve by then.
The shrink to
For more information (Score:1)
www.tomshardware.com
www.anandtech.com
Who is this guy anyway? (Score:3)
Yeah, right. OK, let's address stuff in order:
1. Prime95. Prime95 right now is optimized for current processors. The author received a Pentium 4 system a couple weeks ago, and is rewriting his code right now. When the reoptimization is completed, expect a factor of two improvement.
2. Small L1 cache. The author seems to believe that a larger L1 cache is always good. What he fails to address is that larger caches are inherently slower, and going from a 3 cycle 16KB cache to a 2 cycle 8KB cache improves performance, given a fast L2 cache.
3. No L3 cache. Sure this would have been nice -- but also expensive. Given the intelligence of the i850 chipset (including memory look-ahead reads) and the bandwidth of RDRAM, it isn't really necessary.
4. Instruction decode. Hello? Anyone home? At most 1% of instructions will have to be decoded. That's the point of the trace cache. And yes, Virginia, that cache is large enough.
5. Slow rotates and shifts. That's the price you have to pay if you want a fast clock. Variable shifts are algorithmically expensive (in fact, within a factor of log log N of multiplies, but that's a different matter).
6. Etc. I could go on point by point, but the pattern remains. The author clearly doesn't understand the tradeoffs necessary when designing processors, and looks at one side without considering what it is being traded for.
My opinion is that the Pentium 4 is a very well designed processor. Not only did the designers build a processor which can be run at high speeds, they allowed themselves room to add improvements later without requiring a lengthy redesign of the entire processor. High clock speeds mean that signal flight time is a problem? That's why there are two cycles dedicated to moving data across the processor. Got extra silicon? Double the number of SSE units to allow SSE instructions to complete in half the time. Decide that you want an L3 cache? Throw one on.
Sure the Pentium 4 doesn't perform great on code not optimized for it. But neither did the 486, the Pentium, or the Pentium Pro. And which would you prefer to have right now, a 250MHz 386, or a 1GHz Pentium III?
Re:Ordering a boycott? (Score:1)
Re:Ordering a boycott? (Score:1)
Re:Stop Being so Biased Against Intel (Score:1)
Pentium IV is still good, but we'll have to wait (Score:4)
However, that's not the whole story. Intel has always introduced new chips, tweaked them, put production in gear, lowered the cost, then inundated the public with high quality, high performance, low cost processors. I doubt the P4 will be much different. When the process change (to 0.13 micron, I believe) for the P4 comes, combined with the normal bug fixes, better memory support (such as DDR SDRAM), much higher clock speeds (we're talking over 2 GHz), major production volumes, and lower prices, the result will be a screaming fast processor that will be hard to beat. The P4's main advantage (and essentially its entire raison d'etre) is that it has a whopping 20 stage pipeline. That means one thing: you can shove gigahertz down its throat like you can't do to any other processor. Sure, the P4 may not be as "tight" and efficient as some of the other processors out now (which is why it's foolish to be an early adopter), but what it lacks in efficiency it will eventually make up for in raw cycles. Right now (with all of the P4's flaws, including those that can be fixed, mind you) the P4 runs at maybe 80% of what the idealized speed of a PIII or Athlon would be at the same clock speed, but they expect the P4 to hit 2GHz by Q3 '01, which means you need around a 1.6 GHz proc. of the old style to keep up with it. And this assumes that some of the weak points of the P4 (most importantly, the horrendous memory system forced on it by the Rambus contract) remain, which won't be the case.
I'm not saying the P4 will blow everything out of the water next year (it won't), but it will be fully mature and it will be leading the pack and will be very difficult to compete with.
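The 2 GHz versus 1.6 GHz figure above is just clock speed scaled by per-clock efficiency. A minimal sketch of that arithmetic, assuming the poster's rough 80% estimate (the function name is made up for illustration):

```python
def equivalent_clock_ghz(clock_ghz, per_clock_efficiency):
    """Clock speed an 'old style' chip would need to match a chip that
    does only `per_clock_efficiency` as much work per cycle."""
    return clock_ghz * per_clock_efficiency

# The poster's estimate: a P4 does roughly 80% of the work per clock.
needed = equivalent_clock_ghz(2.0, 0.80)
print(f"A 2.0 GHz P4 is matched by a {needed:.1f} GHz PIII/Athlon")
```

The 80% number is the poster's guess, not a measurement; change it and the break-even clock moves proportionally.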
Re: (Score:2)
rewriting code (Score:3)
I'd contend that it's a fair comparison with what AMD had to put up with -- FPU benchmarks intended for two FPU pipeline chips on a three FPU pipeline system (Athlon). Were benchmarks rewritten right away? No.
Re:That's not how it works... (Score:1)
>>If you buy both Sega's and Sony's systems, both companies get what they want: your money. Sega has no reason to improve its products because you already bought one; and Sony has also no reason to improve because you also bought one of theirs as well.
You buy more of the one that seems better at the time and less of the one that seems poorer at the time.
>>If consumers don't discriminate between quality and non-quality goods (or cheap and non-cheap goods), then no competitive situation exists.
This requires the ability to discriminate, which is not at all simple. Even after a long time of running Windows, Linux, and FreeBSD side-by-side the evidence is essentially anecdotal.
>>In an ideal world, people wouldn't have any consumer loyalty at all -- they'd always vote with their money and buy whatever product is the best product available.
If it's a winner-take-all situation, the competition disappears. In particular, if it takes a second or third look to determine which is actually better, the situation is rather more complex. Competition can exist when most consumers are brand loyal if there is a vocal minority in the middle with very little brand loyalty.
Re:Creditials (Score:1)
Re:Who is this guy anyway? (Score:3)
And once apps are optimized for the P4, every Joe Casual User will have to buy one to get decent performance. I'm sure that's just a coincidence, though.
Re:Who is this guy anyway? (Score:1)
Re:Pentium IV is still good, but we'll have to wai (Score:1)
With the exception of the Pentium 4 and the 1.3 GHz P3s, which are still not really out, Intel has had few shipping problems. Millions of Celerons and P3s have shipped on time. AMD has always had difficulty shipping enough chips on time at the right price. Their new Dresden plant is their only exception.
He's myopic, only seeing the parts he touches. (Score:1)
Re:That's not how it works... (Score:1)
So you're saying it would be illegal for Sony to give away its consoles? Are you saying they can't charge $100? Just what are you saying? I think you just made that up.
can you guys PLEASE get some REAL information? (Score:2)
I have a 1.6GHz P4 computer system (prerelease, not overclocked) and an IA64 system here at work, and as *Tom's Hardware Guide* clearly points out here in their *latest* comparison:
That means that the P4 1.5 ran at 1.7+GHz without issue while the 1.2GHz Athlon only could reach 1.466GHz.
And *most importantly* the P4 is at the beginning of their production run, while AMD is straining their current clock speeds. 1.8 and 2.0GHz P4s will be out pre-fab within months, and AMD is stressing their line to do 1.2.
See for yourself [tomshardware.com]
So *please* don't flame Intel needlessly unless you have hard evidence.
As well, the IA64 architecture is *awesome*. 128 64-bit general purpose registers, an additional 128 64-bit floating point registers, and much much more. The coding that I am doing runs like 10x faster on a 666MHz IA64 than it does on an 800MHz PIII (literally!).
I don't mean to flame, but this type of I-am-going-to-spread-biased-misinformation-becaus
Re:Who is this guy anyway? (Score:2)
You can't really make blanket statements like that, any more than what you are accusing the author of doing.
Which is better depends on the miss rate and miss penalty, as well as the speed of the L1 cache. And of course the miss rate depends on what software you're running, as well as the size and organization of the cache.
If you know all the variables then you can run up the numbers, but without them you can't really make too many blanket statements.
Or you can look at benchmarks, or (best of all) you can try the systems side by side and see which really works for you, and whether the faster one is worth the extra cost, if any.
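"Running up the numbers" here usually means the textbook average memory access time formula: hit time plus miss rate times miss penalty. A rough sketch in Python, where every miss rate and penalty is invented purely for illustration (none of these are measured P4 or PIII figures):

```python
def amat(l1_hit_cycles, l1_miss_rate, l1_miss_penalty_cycles):
    """Average memory access time in cycles for a single cache level."""
    return l1_hit_cycles + l1_miss_rate * l1_miss_penalty_cycles

# Hypothetical caches: a small 2-cycle L1 that misses more often,
# and a larger 3-cycle L1 that misses less often.
small = {"l1_hit_cycles": 2, "l1_miss_rate": 0.05}
large = {"l1_hit_cycles": 3, "l1_miss_rate": 0.03}

for penalty in (7, 60):  # assumed fast L2 vs. assumed slow trip to memory
    s = amat(l1_miss_penalty_cycles=penalty, **small)
    l = amat(l1_miss_penalty_cycles=penalty, **large)
    print(f"penalty={penalty:2d} cycles: small L1 {s:.2f}, large L1 {l:.2f}")
```

With these made-up inputs the small cache wins when misses are cheap and loses when they are expensive, which is the point: neither blanket statement holds without the miss rate and miss penalty for the software you actually run.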
> 4. Instruction decode. Hello? Anyone home? At most 1% of instructions will have to be decoded.
I didn't read the article (don't do registrations, free or otherwise), but if you and the author are using standard terminology, then every instruction has to be decoded. "Decode" just means looking at the bits in the instruction and deciding what to do. Every processor has to do this on every instruction, and the fact that it's a decision process means that bits have to ripple through gates, which in turn means that time is consumed. Its complexity can indeed be a factor in a processor's speed.
--
Re:Technical Flaw in the article: (Score:1)
* The 8088 had an 8-bit bus, while the 8086 used a 16-bit bus. The 8088 had less pins, and was considerably cheaper.
I repeat again: The 8086 and 8088 were both 40-pin devices!
There was no savings of PINS or CHIP SIZE; perhaps a bit of die was saved since you didn't need bidirectional drivers on 8 more of the address lines but the chip size and pin counts were identical!
Pinouts were different, as was mentioned several times in this thread...
Big flaw: No SMP! (Score:2)
According to our own software, a 1.5 GHz P4 clocks in at just over a 1.1 GHz PIII. Not too bad in absolute terms, though there's no doubt the TBird kills it in price/performance, especially when the whole system price (including RAM) is considered. Still, I'm not ashamed to have one on my desk, I just don't want to be the one paying for it. Nothing new there - the Pentium Pro sucked at 16 bit software and cost far more, but it (and the P6 core) were still very successful.
The P4 has two decent advantages - RAM bandwidth (for those who need it), and SSE2, which is finally really useful to me. I can double and sometimes even triple the performance of all my MMX code, and that easily outstrips the Athlon. This won't apply to most code, true, but it sure makes a difference to my software.
However, 95% of all my customers don't use P4s, or even Athlons - they use dual PIIIs. 2 x 900 MHz PIII chips beats any P4 or Athlon system comfortably, and still doesn't quite break the bank :-) This, and only this, is what has kept my customer base loyal to Intel while the Athlon has been storming the castle.
Biggest flaw in the P4? No SMP! I still can't believe it. Their one big advantage over AMD in the higher end systems, the one they've been pushing to all their workstation customers, and the P4 WILL NOT DO IT. And now, of course, when AMD are finally on the verge of releasing their SMP chipset (can it be true?), Intel neatly snatch defeat from the jaws of victory, letting AMD through the gate, and locking themselves outside...
Of course, there's still the Foster, AKA P4 Xeon. It will do dual, quad and 8-way systems, and this promises to be the ultimate system for my software (I use a dual Foster too, and it is nice, no question). But at what price? It's bad enough my customers having to mortgage their homes for 1 GB or 2 GB of Rambus RAM, but to have to pay Xeon-level prices for a dual system as well is going to drive them into the welcoming arms of a waiting DDR dual Athlon.
Guess which system I'll be buying next for myself.
Namarrgon
Re:Who is this guy anyway? (Score:1)
1. When the reoptimization is completed, expect a factor of two improvement
Expectations are still estimates. The modified MPEG4 FlasK encoder numbers are certainly more accurate- and they aren't exactly promising.
2. that larger caches are inherently slower
While a larger cache isn't always good, it only has to be usually better to get more performance.
3. Slow rotates and shifts.
He wasn't arguing that Intel's shift/rotate unit was sub-par, but that using it in a solution for a partial register stall was a step back. Of course he uses the magic words "certain" and "can", but this looks like the reason he talks about slow shift/rotate as a problem.
I agree that the PIV overall is well designed for future expansion- scalability, and the L3 cache, but these things aren't here yet. By the time we have compilers optimizing for the PIV, and the option of ordering our PIV + L3 both AMD and Intel will be pushing the next level of chips.
Re:Sorry, never made it past the opening paragraph (Score:1)
"Die size" has nothing to do with the issues he presents. If you can refute the claims he makes in the *middle* section of the article, by all means do. If not, shaddap and siddown!
Re:Who is this guy anyway? (Score:1)
Re:Who is this guy anyway? (Score:2)
It may not be obvious to someone who's had one semester of logic design that the speed of a boolean circuit in real silicon isn't just a function of its depth. Issues like fan-out and trying to implement the circuit on a plane, etc. end up killing you for larger circuits. A naive, two-level circuit, though it has minimal depth, isn't necessarily the fastest in real silicon.
Consider something as simple as the parity function. It can be shown that a boolean circuit of constant depth implementing the parity function grows exponentially with the number of inputs. This is a big problem in that your inputs will be forced to each drive an ever increasing number of gates as the number of bits increases. At some point you have to alter the electrical characteristics of the circuit (ends up making it slower) or add drivers (ends up slower).
Suppose instead you allow the depth of the circuit to increase. Now the number of gates you need grows linearly with the number of inputs rather than as 2^n. It gets even better. Every doubling of the number of inputs only adds one more level to the depth of the circuit.
Exercise for the reader: in what way does the arrangement of the drivers added to the first circuit resemble the arrangement of the gates of the second circuit?
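To make the gate counts concrete, here is a small Python sketch (not from the post; the function names are made up) that models the second circuit as a balanced tree of 2-input XOR gates and reports its gate count and depth. For comparison it also prints the number of AND terms in a plain two-level sum-of-products parity circuit, which is 2^(n-1).

```python
def parity_tree(bits):
    """Evaluate parity with a balanced tree of 2-input XOR gates.

    Returns (parity, gate_count, depth). `bits` must be non-empty.
    """
    level = list(bits)
    gates = 0
    depth = 0
    while len(level) > 1:
        nxt = []
        # Pair up neighbours; each pair costs one XOR gate.
        for i in range(0, len(level) - 1, 2):
            nxt.append(level[i] ^ level[i + 1])
            gates += 1
        if len(level) % 2:  # odd one out passes straight to the next level
            nxt.append(level[-1])
        level = nxt
        depth += 1
    return level[0], gates, depth

def two_level_terms(n):
    """AND terms in a two-level sum-of-products parity circuit on n inputs."""
    return 2 ** (n - 1)

for n in (4, 8, 16, 32):
    _, gates, depth = parity_tree([0] * n)
    print(f"n={n:2d}: tree gates={gates:2d}, depth={depth}, "
          f"two-level AND terms={two_level_terms(n)}")
```

This only counts gates and levels; it says nothing about fan-out, wire length, or layout, which are the real-silicon effects the post is about.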
What bothers me the most is the contemptuous tone you used in replying to cperciva. He didn't deserve it.
Re:Sorry, never made it past the opening paragraph (Score:4)
A good editor would have removed the BOYCOTT ALL INTEL stuff or at least moved it down a bit. But I feel for the author here: he paid $4000 for a system which isn't as good as a (much) cheaper Athlon.
Crusoe watchers take note: there's a nice little summary of the Crusoe's performance and why he's very impressed with that CPU's architecture. That summary alone is worth reading.
Recursion? (Score:2)
P4 is slow? Overpriced? Says who? (Score:2)
1.5 GHz Pentium 4:
SpecINT2000: 536
SpecFP2000: 558
System price: $2,000
833 MHz Alpha 21264:
SpecINT2000: 544
SpecFP2000: 658
System price: $8,000 (???)
1.2 GHz Athlon:
SpecFP2000: 350
SpecINT2000: 458
System price: $1,500
So what is the FACTUAL basis that The Pentium 4 is slow and/or overpriced?
Still more work to be done with the media... (Score:2)
The truism that NYT is the standard bearer for print media still holds, I believe, so consider this from the article linked in the blurb:
Case in point that the open-source movement hasn't gone far enough in educating the reporters. Sure, blather technobabble all you want at them, and they'll glaze as surely as I have today here at work. But to get them to preach your stuff, you've got to make them understand that Windows isn't the only solution out there.
--
Read the damn article: wasted silicon on extra ALU (Score:2)
Re:1.4 and 1.5 GHz? (Score:2)
It is needed for work and for play:
At work:
Re:Who is this guy anyway? (Score:2)
If it is so easy, show me a variable shift which takes less than O(n log n) transistors and O(log n) stages. I sure can't work out how to do it.
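For reference, the design that hits those bounds is the usual logarithmic (barrel) shifter: log2(n) stages, each a row of n two-input muxes that either passes the word through or shifts it by a fixed power of two. A behavioural sketch in Python, assuming a power-of-two word width (the function name and the mux tally are illustrative, not a transistor-level model):

```python
def log_shift_left(value, amount, width=32):
    """Shift left through log2(width) fixed-distance stages.

    Returns (result, stages, mux_count). Stage k shifts by 2**k when
    bit k of `amount` is set; each stage is modelled as `width` 2:1 muxes.
    Assumes `width` is a power of two and 0 <= amount < width.
    """
    assert 0 <= amount < width
    mask = (1 << width) - 1
    stages = (width - 1).bit_length()  # log2(width) for power-of-two widths
    value &= mask
    mux_count = 0
    for k in range(stages):
        if (amount >> k) & 1:          # mux select line for this stage
            value = (value << (1 << k)) & mask
        mux_count += width             # the mux row exists either way
    return value, stages, mux_count

result, stages, muxes = log_shift_left(0x0000000F, 13)
print(hex(result), stages, muxes)
```

For a 32-bit word that is 5 stages and 160 muxes, i.e. n log n hardware and log n depth, which matches the cost the post is asking anyone to beat.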
Re:IA64? No thanks... (Score:2)
I remember times when Alpha was more than 3 times faster than anything Intel would build.
They also had the highest clock speed (150MHz vs. 66MHz for Intel).
But the situation has changed!
The following figures are Spec95_int and Spec95_fp
AMD Athlon/650MHz ---> 29.4 - 22.4
Alpha 21264/667Mhz ----> 32.1 - 49.0
So at about the same clock speed, Athlon is roughly as fast as Alpha for integer work, whereas it is about half the speed for floating-point operations.
But this is at the same speed. Athlon reaches way higher frequencies so the gap is very small in fp and Alpha is beaten in int. And the price difference is massive.
I have got an Alpha server, a Dual ultra-sparc workstation and a whole bunch of PCs, believe me, the speed is about the same for 1/10th of the price.
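The "as fast" and "half the speed" claims follow directly from the two Spec95 rows quoted above. A quick sketch of the ratios in Python, using only those figures (the dictionary keys are just labels chosen here):

```python
# Spec95 figures as quoted in the post: (int, fp).
SPEC95 = {
    "athlon_650": {"int": 29.4, "fp": 22.4},
    "alpha_21264_667": {"int": 32.1, "fp": 49.0},
}

def athlon_vs_alpha(metric):
    """Athlon score as a fraction of the Alpha score for 'int' or 'fp'."""
    return SPEC95["athlon_650"][metric] / SPEC95["alpha_21264_667"][metric]

for metric in ("int", "fp"):
    print(f"{metric}: Athlon is at {athlon_vs_alpha(metric):.0%} of Alpha")
```

Note the clocks differ slightly (650 vs. 667 MHz), so this is a near-equal-clock comparison rather than an exact one.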
Re:Ordering a boycott? (Score:2)
mod this up (Score:2)
Re:From sources close to Intel (Score:2)
It won't support my hardware. That's why it sucks.
It won't support my CPU either. Dunno why M$ doesn't support Sparc. But THEY SUCK. The wintel P4 Sucks too.
Re:Who is this guy anyway? (Score:2)
Which is exactly why Intel wrote their own optimizing compiler. They're even writing a Linux version, which is supposed to be undergoing a public beta test in January.
he forgot something (Score:2)
I mean how bad are things getting at Intel ? ? ?
That's not how it works... (Score:3)
Buying products from everyone doesn't accomplish this. If you buy both Sega's and Sony's systems, both companies get what they want: your money. Sega has no reason to improve its products because you already bought one; and Sony has also no reason to improve because you also bought one of theirs as well. If consumers don't discriminate between quality and non-quality goods (or cheap and non-cheap goods), then no competitive situation exists.
So if you really want to see forward progress, don't support both. Support whichever one is putting out the product you believe is most worthy of success. If you like Sega's system better, buy it; now you're giving Sony an incentive to make its system more attractive to you by being more like Sega -- which is good for you! And if you like Sony's better, buy it and give Sega an incentive to do business like Sony.
Of course, competition also requires consumers not to be very brand loyal. A lot of die-hard Linux or Windows users would be reluctant to switch operating systems even if they'd be happier with the other one. So, there's no harm in changing your "loyalty" and finding a new "adversary" (as you put it) to go up against. In an ideal world, people wouldn't have any consumer loyalty at all -- they'd always vote with their money and buy whatever product is the best product available.
Yu Suzuki
A new scapegoat is always necessary (Score:4)
No the PIV is not a great chip. Hell, it's not even a good chip. But once AMD got onto the scene, it looked like we were itching and scratching to find a way to go against the "bigger company" (Intel, Microsoft, and now RedHat notwithstanding). In 6 months, we'll have a whole new "adversary" to rile up the tech community.
Enough is enough. Yes, the PIV has flaws. Every chip has flaws. You pay extra to get just a smidgen more performance, but that's why AMD is referred to as the "price/performance leader".
However, if we don't root for Intel, and AMD suddenly takes over, who won't put their money down that we will go against AMD? I say support both (I use the same mentality in buying a Sega Dreamcast/PS2; boxed distros of Linux and Windows 2000). Without competition on both sides, even "the Man's", there will be no forward progress.
Re:Who is this guy anyway? (Score:2)
Which article were you reading? There were statistics (including cycle counts), comparisons of compiled code, and in-depth reasons for the points that were made. I am not a processor guru and so I'm not sure if they were all good reasons, but there was a large amount of technical backup for the claims that were made. Did you not read past the first section (anti-Intel invective) or the second section (a brief history of PC microprocessors)?
True, the anti-Intel bias was a little disconcerting, but that's because I think you should separate out the technical arguments from the name-calling, and consolidate all of the "boycott Intel" and "Intel engineers are idiots" at the end. Others feel differently, apparently :)
Re:P4 is slow? Overpriced? Says who? (Score:2)
cpu:              specint  specfp
amd 1.2 ghz ddr   496      420
amd 1.2 ghz sdr   458      350
intl 1.4 ghz      536      558
also, did you notice that the pentium 4 machine had a top of the line hard drive (ibm deskstar 75gxp) and video card (geforce2gts) whereas the amd machines used an older ibm hard drive and a diamond stealth 3d pci(WTF?!!?) on the ddr machine and a western digital hd + nvidia tnt2 m64 on the sdr machine? or how about the fact that all the tests were done with an intel compiler????
Then there's the system prices, I have no idea where you got these prices, but assuming all 3 systems use the same components except cpu+mb+ram, the prices would probably look like:
amd 1.2 ghz cpu: $300?
intel 1.4 ghz cpu w/128 rdram bundle: $1165
sdr ram, 128 MB: $56
ddr ram, 128 MB: $200?
asus p4 motherboard: $302
asus amd sdr motherboard: $140
amd ddr motherboard: $200?
(all prices taken from mwave.com, ddr prices estimated)
so, putting the cpu+mb+ram together, the costs are:
amd sdr: $496
amd ddr: $700
intel p4: $1467
so based on these figures, the p4 is OVERPRICED!
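The three totals above can be checked by adding up the quoted parts. A small Python sketch of that sum, using the prices exactly as given in the comment (the ones the poster marked with "?" are the poster's own estimates; the key names are invented for this example):

```python
# Prices as quoted in the comment (mwave.com; "?" entries were estimates).
PARTS = {
    "amd_cpu": 300,               # estimated by the poster
    "p4_cpu_rdram_bundle": 1165,  # 1.4 GHz P4 with 128 MB RDRAM
    "sdr_128mb": 56,
    "ddr_128mb": 200,             # estimated by the poster
    "p4_board": 302,
    "amd_sdr_board": 140,
    "amd_ddr_board": 200,         # estimated by the poster
}

SYSTEMS = {
    "amd sdr": ["amd_cpu", "sdr_128mb", "amd_sdr_board"],
    "amd ddr": ["amd_cpu", "ddr_128mb", "amd_ddr_board"],
    "intel p4": ["p4_cpu_rdram_bundle", "p4_board"],
}

def platform_cost(system):
    """Sum of cpu + motherboard + ram for one of the three systems."""
    return sum(PARTS[part] for part in SYSTEMS[system])

for name in SYSTEMS:
    print(f"{name}: ${platform_cost(name)}")
```

The sums match the figures in the post; whether the estimated DDR prices are realistic is a separate question.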
Technical Flaw in the article: (Score:2)
If I'm not mistaken, the 8086 and 8088 were both manufactured in 40 pin ceramic (and later plastic) DIP packages. There was no reduction in pin count but rather in internal drivers.
Re:Two things (Score:2)
First, I'd be extremely interested to see what this guys credentials are - its interesting to see him take the entire Intel CPU design team to task over this.
He's not taking the Intel engineers to task. He's taking the Intel marketing people to task. What, you think the Intel engineers didn't want a larger L1 cache, more execution units, etc? Of course they did. But more silicon=higher costs so you can bet that it's the marketing guys who lopped off all that extra silicon.
I would bet you that the actual Intel engineers who designed the chip would probably agree with most of this guy's points!
http://www.bootyproject.org [bootyproject.org]
Sorry, never made it past the opening paragraphs.. (Score:2)
Although I'm sure the author knows a lot about processors, he is so obviously biased against Intel (and towards AMD) that getting any information from this article is like learning about Linux from Microsoft.
What this guy needs is a good editor, and perhaps a few chill pills...
Re:Two things (Score:2)
Or would you rather just accept (with no evidence) an Intel engineer telling you "the p4 rocks, buy one today?"