AMD Delays Hammer 357
TeJarz writes "C|Net reports that their next processor (Hammer) has been rescheduled from its original Q4 release to Q1 2003. To quote C|Net: 'The delays are occurring to accommodate the release of a new version of Athlon with a 333MHz bus, said Crank. Current Athlons come with a 200MHz bus and 256KB of secondary cache.' Let's hope this doesn't get moved again."
Current Athlons (Score:3, Insightful)
Re:Current Athlons (Score:3, Funny)
Re:Current Athlons (Score:2)
I laughed so friggin hard at that! That got passed around the office REALLY fast!
MegaBizzatch. Wonderful!
Re:Current Athlons (Score:2, Informative)
Re:Current Athlons (Score:4, Informative)
is there a real difference? (Score:3, Interesting)
Re:is there a real difference? (Score:5, Informative)
With single data rate a new address can be sent every clock for all memory requests.
With double data rate a new address can be send with every other "clock", but while data transmission rate stays the same. Effectively this means transferring double data for each request, while the amount of requests doesn't change.
This isn't very serious problem, since single bytes/bus wide data aren't usually transferred, but whole cachelines of 32/64 bytes. They will generate 4/8 sequential burst requests nullifying much of the "halfclocked" address generation potential latency problems.
Ok, so why can't the addresses be sent like the data is another question which someone else with more knowledge might explain.. Maybe it would complicate things too much since the request-answer mechanism should be pipelined to accept new requests until previous requests are served. Or maybe the physical bus has some limitations, like using the same pins for address/data, which would simply make it impossible to send new addresses simultaneously (on falling edge of clock) while receiving data.
Re:is there a real difference? (Score:2, Informative)
The additional latency required to synchronize the address with the rising edge rather than either edge is negligible when you consider the total amount of time required to perform the fetch from L3 or L4. Therefore there is no need to endure the more complex design to implement this.
Most data is fetched in bursts. So there are typically 4 or more data phases per request. Consequently, there is no need for as high bandwidth for the address bus as for data.
Plus, as another post said, it reduces the power requirements. This, combined with the fact that there are typically 4 or 8 data transfers per address is why P4 has gone to QDR buses. This way, there is one address per cycle, and an entire 4-unit burst can be completed in a cycle so the address bus could theoretically be completely saturated. Once you pass QDR (to Octal DR?), you may start requiring a higher data rate on the address bus as well for performing two 4-unit bursts per cycle.
Re:Current Athlons (Score:2)
It allowed a nominal 1.89Mhz Tandy Color Computer 3 to often outrun a stock AT because while the AT was throwing wait states by the half dozen anytime it wanted to actually DO anything the 6809 was actually reading or writing from memory on better than half of it's cycles.
And since the CoCo's memory system never imposed wait states, run times were 100% deterministic. Made lots of programming jobs a LOT simpler to implement. Of course I really doubt it would be possible to attain today's speeds and remain deterministic, what with all of the cacheing that now goes on. Guess times change.
Re:Current Athlons (Score:2)
More interesting is, that it is explicitly stated in the article:
Re:Current Athlons (Score:2)
Not surprising (Score:4, Funny)
Re:Not surprising (Score:2)
266 Bus (Score:3, Interesting)
Comment removed (Score:4, Insightful)
Re:Comment non-sense (Score:2)
Re:Comment non-sense (Score:2)
I don't have the same motivation for 64 bit machines (I need them for cycle servers with big memory), but I'm just as anxious for a commoditized 64 bit platform to emerge.
-Paul Komarek
Re:Comment non-sense (Score:2)
You are thinking about this all wrong. You seem to believe that a thousand dollar AMD chip is going to perform like a mainframe. You may be seriously disappointed when you figure out that these new 64-bit chips aren't going to make your current systems obsolete at all.
Re:Comment non-sense (Score:2)
Re:Comment non-sense (Score:5, Insightful)
We need 64 bit machines to accomodate massive memory for our research. I'm really hoping the Hammer can provide a relatively inexpensive and *commoditized* 64 bit platform for us to work on, compared to existing 64 bit (workstation/server) platforms. And I want it yesterday. Actually, I want it last year.
I have no idea what the editors or submitter meant, of course.
-Paul Komarek
Re:Comment non-sense (Score:3, Interesting)
We are migrating from our Alphas to dual P4's and seeing a serious drop in performance that should not exist
Re:Comment non-sense (Score:4, Interesting)
Here are the timings. Note that these are just via "time" on GNU/Linux or a wall clock on Windows (or something -- I didn't do the Windows tests).
P4 dual Xeon 1.7GHz/gcc: 82 seconds
P3 1000/msvc: 18 seconds
Athlon C 600/msvc: 2 seconds
P3 1000/msvc, using floats and sse:
2 seconds
Alpha 667/gcc: 2 seconds
Athlon XP 1900+ 0.88 seconds
I guess the Athlon's clock was closer to the P4's clock than I recalled in my original post. Either way, the slowdown on the Pentiums can be easily seen.
-Paul Komarek
Why Pentium IVs are slow (Score:4, Informative)
If you want more in-depth numbers you can compare appendix C of the Intel Pentium 4 Optimaztion Manual [intel.com] with chapter 29 of Agner Fog's Pentium/II/III Optimization Manual [leto.net]. You can see the Athlon numbers in Appendix F of AMD's Athlon Optimization Manual [amd.com].
If you want to do number crunching with Pentium 4s your best bet is to use the SSE2 instructions/registers. You should be able to get a noticable speedup by using the Intel C++ compiler [intel.com] and telling it to use SSE2 instructions. If you want to eek out max performance you'll have to use assembly language. Though you can probably get most of the way there using the Intel C++ Compiler's SSE2 intrinsics.
I'm curious as to why your code is so much slower on a P4 than on an Athlon. The best way to find out would be to look at the assembly code that gcc is producing. You can do that by using gcc's -S option. If you'd like send me the C code and the output from -S and I'll see if I see anything obvious.
I'm somewhat paranoid about posting my email address. My paranoia seems to work, as I've received no more than the occasional spam in the last few years. My email address is my slashdot user name at woh.rr.com.
Re:Why Pentium IVs are slow (Score:2)
-Paul Komarek
Re:Why Pentium IVs are slow (Score:2)
Re:Comment non-sense (Score:3, Interesting)
Well, if it's stalling for data, your problem is probably that the P4 has a *tiny* L1 data cache compared to... uh... anything. It's only 8K, compared to the Athlons 64K. See the following URLs:
http://www.tomshardware.com/cpu/02q2/02040
http://www.geek.com/procspec/intel/nor
http://www.geek.com/procspec/amd/k7sel
It's probably also worth noting that Intel does NOT list the P4 as a "server processor". The P4 is listed as a desktop or workstation processor. Only P3, Xeon, and Itanium chips are recommended for server use:
http://www.intel.com/products/browse/proces
You might want to show that to management and reconsider your purchase of P4 equipment. Even a P3 is likely to perform better.
Re:Comment non-sense (Score:2)
And by saying that, I don't mean to imply that I think the P3 is a good choice, (I like the Athlons
Re:Comment non-sense (Score:2)
Quite honestly, I think workstations tend to be more floating-point intensive than servers. For example, how many floating-point calculations does 3D CAD software do vs. Sendmail or LDAP?
So, new PC customers should be buying "servers" for any graphics, mathematics, or scientific work. This only increases my dislike of Intel's marketing tactics.
Perhaps Intel should market the P4 as an administrative assistant's toy, and let the engineers and scientists go to Sun, SGI, HP, and IBM for real workstations?
Re:Comment non-sense (Score:2)
-Paul Komarek
Re:Comment non-sense (Score:2)
-Paul Komarek
Re:Comment non-sense (Score:2)
Darek Mihocka of emulators.com has written a whole bunch of stuff about the Pentium 4. He has examples of code that performs badly on Pentium 4, although I'm not sure how the most recent versions of the P4 would work on his code samples.
http://www.emulators.com/pentium4.htm [emulators.com]
steveha
Re:Comment non-sense (Score:2)
Re:Comment non-sense (Score:2)
There's also the issue that finding replacement sysadmins for the Alphas isn't as easy as it is for the x86 machines. Alphas aren't much different to admin, but it can be a bit of a speedbump.
-Paul Komarek
Re:Comment non-sense (Score:2)
"But judging from the benchmarks you posted further below, I question your know-how. You compare a GCC-compiled program running on a Pentium 4 to MSVC-compiled programs running on Athlons?"
I could snottily retort that I question your reading know-how, since msvc was for P-III and Athlon C, while gcc was for all the rest; but I won't.
For the tests, I used the same compilers we use for development and distrobution. We don't have time to screw with the industry's popularity contests. We do algorithmic and data structure work, aiming for 10000% speed-ups that just aren't available through compiler cleverness. The Intel compiler won't help us when compiling on Alpha, MIPS, or PA-Risc.
-Paul Komarek
Re:Comment non-sense (Score:2)
The bottom line is that Itaniums only seem to make sense for people who get them for free.
-Paul Komarek
Re:Comment non-sense (Score:2)
I can't spend all my research time optimizing for one silly cpu. My code is run on about 4 different cpus (only two different instruction sets, though) at present. Another cpu (with another instruction set) will be added soon. If Intel wants my code to run fast on the P4 Xeon, they can contribute to GCC; but I don't care. I'm happy to recommend that users of my code buy Athlons or Alphas.
-Paul Komarek
Re:Comment non-sense (Score:2)
That said, which version of gcc did you use? There seems to be vast differences between them (and certain companies seems to like 2.96.x which is NOT a valid gcc version. If gcc -v gives you the 2.96.x version, get a new gcc) and there are reports about speen increases in the 3.x series.
I was mostly curious, I really would like to see that code of yours, but I realise that you wouldn't wanna give it away. Any chance you could write some dummy code that gives the same results (as far as the P4 being slow that is)?
Re:Comment non-sense (Score:2)
The P4 used gcc 3x, while the Alpha and Athlon XP used gcc 2.96. =-) If anything, this should give the P4 an advantage.
The Athlon C and P-III results were all msvc. I don't know which version was used, because I didn't do the tests.
-Paul Komarek
Re:Comment non-sense (Score:2)
-Paul Komarek
Re:Comment non-sense (Score:2, Interesting)
There's a damn good reason I want this to come out soon. The sooner AMD comes out with Hammer the sooner Intel has some extremely serious competition. If Hammer can stand up to its hype the P4 won't look so hot, especially if Hammer ramps well in clock speed. Strong competition = lowering of prices. Also, Athlon XPs would then be pushed into the value market. So not only would Intel be forced to drop prices on their desktop and server CPUs, but AMD's old lineup would become and absolute steal. Sounds good for the average consumer, eh? Lets hope for no more delays.
-Yoweigh
some people use these for work, you know (Score:3, Insightful)
Hard as that may be to believe, some people use their computers for real work. And some of those people run into that dreaded 4G limit--4G is not a lot of memory anymore these days. And many of these people would love to have the choice of a Hammer over Itanium.
Re: (Score:2)
Re:Comment non-sense (Score:2)
This guy is quite rude, offensive and disrespectfull to others and is very arrogant. Ignore him and add him to your foe list like me. Also he has been very supportive of drm in a troll like way and feels free to flame other people who actually like their fair use rights.
Re: (Score:2)
Invested in AMD. (Score:2)
Re:Invested in AMD. (Score:2)
Really, there are only two companies positioned to provide CPUs for PCs. Intel is showing more and more signs of losing their grip on bits of the market, and AMD only really needs to gain small bits of Intel's narket share to be wildly successful.
Besides, you should never sell during a recession. Now, if you want risky products, join me and buy some Abiomed (makers of the Abicor heart). Might not work out long term, but if they work out the kinks people will pay any price...
Re:Invested in AMD. (Score:2)
Re:Invested in AMD. (Score:2)
What you're saying is true..... AMD only has to get double their current market share (about 40% of the market) to start raking in the millions...
and it's wildly undervalued, in my opinion.
I might look into this Abiomed company..... sounds interesting.... I might be a customer in the not-too-distant future!
Re:Invested in AMD. (Score:3, Interesting)
Down 7% on Intel's 2 cent per share dividend.
They'll get beaten again tomorrow.
They'll get beaten at Christmas.
They'll get beaten until Sledgehammer is released.. not Claw hammer which will have no x86-64 desktop software support right off the bat, and will have to rely solely on it's pure x86 performance.
Microsoft shafted them on the X-box because Intel paid Microsoft 200 million to use the Pentium III. Nvidia was stuck with an unused AMD integrated chipset for X-box and Nforce was born.
Intel will pay Microsoft to shaft them again. No x86-64 Windows XP for AMD despite AMD testifying on Microsoft's behalf in exchange for anti-trust testimony. AMD made an unenforceable agreement with Microsoft. To enforce it would perjure themselves.
So Intel wins again.
Until Sledgehammer arrives. Sledgehammer is a server/workstation chip and will have full support of the dominant server operating system, Linux. Microsoft must support Sledgehammer or risk losing more of their already weak server market share.
Long after Microsoft has done the work to get Windows running on the Server, Microsoft will incorporate x86-64 support into their desktop OS.
Probably about the same time as they support Hyper threading and SSE3 for Intel.
Re:Invested in AMD. (Score:2)
I heard from a guy I know (Score:2, Interesting)
Bold move CNet! (Score:2, Funny)
So now C|Net is making processors too?! (Sorry, couldn't resist...)
Re:Bold move CNet! (Score:2, Funny)
"C|Net gives this CPU 10 out of 10!"
(ok, be gentle, this is my first post, literally
AMD Delays Hammer... (Score:2, Funny)
Honesty? (Score:3, Interesting)
Everything I have seen shows that Intel is doing much better in performance and climbing. AMD claims there is no real technological reason, yet there must be. Anyone have insights? It seems that it would be prudent for AMD to issue better explanations -- how could it hurt to be honest? I want to see competition, if they are going to lag in performance, then they present no reason for people to buy. (A similarly performing Intel chip is close in price right now)
Re:Honesty? (Score:2)
The real reason (Score:5, Funny)
Good (Score:3, Informative)
Re:Good (Score:2)
Re:Good (Score:2)
I bet it would be easy to disable it by default in the bios to boot XP or Linux. Microsoft should of picked TCPA which is more open and already has XP drivers for it by IBM.
Re:Good (Score:5, Informative)
Q: Can Linux, FreeBSD or another open source OS run on "Palladium" hardware?
A: Virtually anything that runs on a Windows-based machine today will still run on a "Palladium" machine (there are some esoteric exceptions[1]). If you currently have a machine that runs both Linux and Windows, you would be able to have that same functionality on a "Palladium" machine.
The exceptions are here
[1] These exceptions include the following:
1.)Some debuggers may need to be updated to work in the "Palladium" environment, but they can still work.
2.)Some special performance tools may need to be updated.
3.)Software that writes directly to TCPA hardware will need to be updated.
4.)Memory scrub routines (at the hardware level) will need attention.
5.)Third-party crash dump software may need to be updated.
6.)BIOS mode hibernation features will need to be updated to work with "Palladium."
Its these 6 reasons why palladium is still beta and why AMD is probably waiting before releasing Hammer.
Re:Good (Score:2)
Since when [do] OPERATING SYSTEMS run on Windows? They are completly independent of software platform, all they need is hardware.
Keep in mind, that FAQ was from Microsoft. I shall explain.
From their point of view, any computer that can run Windows is a "Windows-based machine". They're aware that some oddballs insist on perverting these machines by running other experimental operating systems on them, but that doesn't change the fact that the machine was designed, built and sold to run Windows. It is, therefore, a "Windows-based machine", even if it happens to have some OSS crud running on it.
When used properly, as God and Bill intended, it's just a "Windows machine", of course. Or, in MS common parlance, a "computer".
the other side... (Score:3, Informative)
The biggest problem with current processors is that to design such devices we *have* to use dynamic logic. Ask any VLSI design engineer.. that is no joke. Infact many multipliers and dividers have to be hand edited! So delays are expected and it does reflect upon the desigers and companiesd in any way.
Before you ask.. I do now work for AMD, i work in another VLSI company, thats why i say.. its tough. Millions of gates thousands to be hand edited its a bitch.. but as they say the fruits of labour are sweet... and for AMD hammer is going to be the sweetest
Re:the other side... (Score:2)
This is true for any processor, 64-bit, 32-bit, or otherwise. If you want the last 20%-30% of the performance, it will involve hand-optimization and an ungodly amount of work.
Designing a 64-bit chip vs. a 32-bit chip, OTOH, is mainly just replicating elements, though you do need design tweaking for a few pieces that don't scale well.
Re. dynamic logic, it really depends on the process you're using and what your design goals are. There are a lot of "gotchas" that you're undoubtedly already aware of that can degrade performance in a dynamic logic system, and other tradeoffs that are made when adding dynamic components to an otherwise-static system.
As with other design choices, there's no _requirement_ to do this. You just get a performance boost for certain specialized types of structure, which can justify the headaches if that kind of structure is on your critical path.
For a 64-bit processor that doesn't use dynamic logic (last I checked), just look at the MIPS line.
Gain this, lose that (Score:2, Interesting)
Re:Gain this, lose that (Score:2)
Re:Gain this, lose that (Score:2)
On the other hand, if not, there are a lot of processors out there now that would leave AMD and Intel's new offerings in the dust.
conspiracy theory.... (Score:2)
real : MS;
int : palladium;
int * : hammer;
hmmm is it for integrating palladuim support!
};//end of struct
Sorry... couldnt help it
This is very bad news... (Score:2, Informative)
1) AMD is currently losing huge amounts of money. The hammer would have allowed them to sell at the high-performance end of the market again where the sales prices are higher and might have helped them reduce the flow of red ink.
2) The delay will badly hurt AMD partners such as motherboard and chipset vendors who have developed supporting products for hammer.
3) The hammer had a potential performance lead over Intel that will be greatly eroded by the time it finally appears.
4) Critical software development for hammer will be slowed which will slow eventual market acceptance of hammer.
5) The delay will build momentum for Itanium.
6) The delay will greatly reduce the pressure on Microsoft to support hammer and will give Microsoft the opportunity to also build momentum for Itanium. Depending on market conditions when the hammer finally appears, it is now even possible that Microsoft will never need to support hammer.
7) This delay is so serious that it creates real doubts that hammer will *ever* be a viable product.
Delays!?!?!?!? (Score:2, Funny)
The poster dies in a fit of agony...
we believed it was hammer time (Score:5, Funny)
Re:we believed it was hammer time (Score:3, Funny)
I was expecting this. (Score:2, Informative)
At least one woman was fired for making a Linux test CD and distributing it internally around the company, against that manager's wishes. Her name's on the test CD, and it was still being used inside AMD last week, but she answered too many Linux questions for people outside her department and as such was labeled "not a team player" in the internal politics. As far as I can tell, that was the most knowledgeable linux person they had anywhere near that area.
AMD makes great processors, but until they get a new motherboard testing department, they'll have nothing to put them in.
short rant and a question (Score:5, Interesting)
However, the notion of "fsb" is a little blurred with the hammer. Hammers will be directly connected to dimm banks and have integrated memory controllers, so the speed of the fsb will no longer be a determining factor in memory bandwidth. (* see mp note below) The traditional fsb to the traditional northbridge will be replaced by a "high speed" hypertransport link to a chip that connects to the agp slot, and has another (slower) hypertransport link to what could be called the south bridge. This "south bridge" will then connect the pci bus, serial ports, hard drives, usb ports, and any other devices that need to talk to the processor or main memory.
*What does this mean for MP systems? Well, that's actually the really cool part. By moving the memory controller onto the processor and providing communication between processors over a hypertransport link (3.2GB/sec for dual, 6.4GB/sec for quad and above), memory bandwidth actually increases as more cpus are added! This is in contrast to a normal MP system where as more cpus are added, there is increased competition for a fixed resource (main memory) which is already the bottleneck in many single processor applications.
That's my rant on terminology. Here's the question:
I'm no kernel hacker, and I certainly don't know anything about writing schedulers, but it seems like this would require a change in how processes are handled in hammer mp systems. In traditional mp systems, every processor has equal access to main memory. If a process gets moved from one cpu to another, there's initial overhead to do the moving, but after that it can still get to its areas in memory without any problems. On a hammer mp system, migrating a process from one cpu to another would mean that in order to access its memory it would have to reach out of its cpu's hypertransport link, into another cpu's memory controller (which may or may not be busy) and into the attached ram. Considering there would not be enough bandwidth available on the 3.2GB/sec hypertransport bus (in the case of a dp system) for both processors to reach into eachothers 166MHz DDR memory at the same time without suffering a performance hit, it seems like there would definitely be an advantage to keeping processes close to their data.
What changes would this require to scheduling and process management code, if any? Has this already been addressed, or are there people working on it in the linux kernel?
Re:short rant and a question (Score:2)
Simple Topology API [google.com] is one thread about this stuff.
Re:short rant and a question (Score:4, Informative)
Essentially this would be a NUMA system (non-uniform memory architecture). As far as I know Linux 2.6 will have support for these systems.
In a real NUMA machine there would be a hierarchy of clusters of processors. Each cluster functions a bit like a traditional SMP system, but the clusters are interconnected over "low"-bandwidth busses. This makes memory accesses across clusters slower than direct accesses into the clusters' memory.
Both the VM and the scheduler will have to know about this.
Another point with NUMA systems is the possibility of gaps in the main memory (discontinues memory). Kernel hackers are currently working on support for that (discontigmem patch, merged in 2.5.34).
hammer mp: consequences for kernel arch? (Score:2, Interesting)
This isn't a particulary new requirement. You have to be careful about selecting pages for processes today even on single CPU systems to avoid cache thrashing. Because of the way first or second-level CPU-caches map to physical memory, certain memory-access pattern lead to constand reloading of the cache, making it pretty ineffective, even worse if it wasn't there in the first place. By carefully mapping physical pages to virtual memory the OS can avoid this problem. Solaris does this, I don't know about Linux. Probably.
So, this is one new requirement for the memory management code. No problem, we just make sure all process pages belong to one particular CPU and schedule this process to this CPU only. Everything is fast and nice. Intel is doomed. Or is it? Not so fast, all this is probably a bad idea:
We can't make sure pages on the right CPU a even available. What if they are not? Give out wrong pages? This would lead to results in running time which are not reproducable. This is really bad. It gets worse. What it the right CPU is not available because it's running some other process?
Probably it's best to allocate evenly distributed pages (some fast, some not so fast) to processes and not schedule them special in any way.
Easy ;)
Re:hammer mp: consequences for kernel arch? (Score:2)
As far as existing chips go, hyperthreading also messes a lot with running times, because when you get processor time depends on when another process has cache misses.
Hammer memory isn't so rosy. (Score:2)
This is true only if the processors are running tasks with unrelated working sets (and if the data for each task is in that processor's memory).
If you have tasks that require memory managed by another processor, you have to go through the hypertransport link and the other processor's memory controller to get it. This will be _slow_. HT is decent, but nowhere near as good as a direct connection to memory, and there _will_ be delays due to arbitration on the second chip and the various buffering stages the data transfer has to go through.
So, for multiple processors working on a shared workload, you're screwed.
The only way to ameliorate this is to have very smart OS-level memory management that can duplicate shared-but-not-modified pages across multiple memory banks, and both OS and processor support for update-based coherence between the banks. The hardware support for this is a bit tricky, and the OS support will be a nightmare if the OS wasn't NUMA-friendly to begin with.
And under some cases - like tasks on multiple processors competing for access to a lock or all heavily modifying the same data page - you're screwed no matter what you do.
So, don't rejoice yet. We'll only know for sure how well this will work when we have Hammer systems on our desks.
Re:Hammer memory isn't so rosy. (Score:2)
So, for multiple processors working on a shared workload, you're screwed.
From the Hammer presentations I've seen, this is not true at all. The HT link between CPUs is 6.4GB/s, which is actually faster than the direct-attached memory (~5.3GB/s). Since the HT controllers are running at >2GHz, they introduce minimal latency.
And under some cases - like tasks on multiple processors competing for access to a lock or all heavily modifying the same data page - you're screwed no matter what you do.
I don't think this is true either. Contention for a cache line will simply bounce the line between caches, which is much faster on Hammer than on a 400MHz shared-bus SMP.
They're having clock speed issues with Hammers... (Score:5, Interesting)
How the hell do I know that??? Look where I live, take a guess...The birds outside my window know things.
C|net is making processors? ;) (Score:2)
Seems smart to me... (Score:3, Interesting)
I would imagine it would be better to release Hammer ASAP and create the 64-bit market themselves. Then again, I don't know the logistics required for such a launch, nor do I know exactly how much better, if any better, x86-64 would perform. Let's face it, not many people care about 64-bit versus 32-bit, they only know what the dork at CompUSA tells them. And if Hammers can't outscore P4's in the 32-bit apps that very short-sighted people care about, then there is really no place for Hammer in the consumer market.
From what I've heard, mostly from internet gossip, is that AMD is having problems making Hammer scale high enough to beat the P4 in 32-bit apps, although it only requires roughly 1 Hammer MHz to beat 3 P4 MHz. I've also heard that AMD is having problems making Hammers run above 800MHz. With the expected debut of the P4 at clock speeds above 3GHz, the Hammer doesn't stand much of a chance in 32-bit apps.
In short, don't expect to see Hammers until Intel manages to salvage the Itanic.
Excellent! (Score:2)
Anything with 64-bit doom is good enough for me :)
Not big a deal as people think (Score:2)
While AMD works out the bugs of their Hammer line of CPU's, don't forget that AMD still has a card to play in terms of CPU competition with Intel: the Barton-core Athlon CPU due later this fall.
Unlike the Athlon CPU core designs since the original Thunderbird-core Athlon's, the Barton-core Athlon sports a larger 512 KB L2 cache on the CPU die, which will offer dramatic performance increases, especially with memory-intensive programs. Remember, the current Thoroughbred-core Athlon CPU rated at 2600+ already has reached parity with the Intel Pentium 4 2.53 GHz part, and that's with only 256 KB of L2 cache on the CPU die and using DDR266 DDR-SDRAM! What will the Barton-core Athlon do?
Good news to me, kinda. (Score:2)
Why? Because I'd like to get a Barton CPU for my next computer. I'm already in the waiting game for the NV30 and (to a much lesser extent) Serial ATA, so putting a better CPU on the list isn't a big deal.
Why not Hammer? Because I know better than to buy a first generation CPU with first generation motherboards. Barton is just a mild revision to a 4 year old CPU core, and the motherboards are now hitting their 6th generation (KT133, KT133A, KT266, KT266A, KT333, KT400).
For those who need the speed, power, and addressibility of a 64-bit chip this announcement sucks, but for those just looking for a faster current generation chip it's not entirely bad.
I was afraid of this -- the Osborne 2 effect (Score:2)
Re:Who wants to bet... (Score:2)
Do you think its called Wintel because they couldn't figure out how to spell AMD?
Re:Who wants to bet... (Score:2)
Re:Who wants to bet... (Score:2)
Re:Who wants to bet... (Score:2)
FWIW, that went out with the K6-III at the latest. None of the Athlons or Durons I've installed have had the Windows logo on them in any manner--not printed, not engraved.
Re:Prefection (Score:2)
Depends on what you're doing.
The P4, especially configured expensively, has a kickass memory subsystem on the motherboard (dual-channel anything will score high on bandwidth-bound tests). The fact that the Athlon doesn't has hurt its relative benchmark results even more than the speed war has.
I still love the Althlon, and I still avoid the P4 on (personal) principle, but a faster memory bus is a Good Thing for AMD.
The Hammer will live or die on this too. We don't have a real-world test of how well its memory subsystem works yet. The NUMA scheme for multiprocessor systems also gives me pause (without migration/copying of non-local pages, it'll bog down like crazy under certain conditions).
A well-performing Hammer will push AMD back into prominence. I strongly suspect that they're at least partly buying time to tune the core.
Re:Prefection (Score:2)
Re:Prefection (Score:2)
Re:200 MHz? (Score:2)
Best... comment... ever.
Re:To anyone complaing because they have old syste (Score:2, Troll)
dude - if you've got an Athlon2000+ that's "patiently waiting around" then you must have bought the thing when it was brand new -- and paid a hell of a premium to let it sit doing nothing. same chip's probably half price now. i can almost buy your "my 550 celeron runs everything i need!" story
Re:To anyone complaing because they have old syste (Score:3, Insightful)
I do happen to agree with not freaking out about a processor release date. But do realize that people are excited about this cpu for a reason.
BTW many of us here were using computers and programming before you were born. Your only 20 for Pete's sake.
Re:Well now what ... (Score:2, Informative)
I could see them having trouble with the memory controller. It supposedly runs at full clock speed and handles the memory requests from all processors (in a multi-processor system) and the AGP bus etc without intervention from the CPU itself. Hence the reason for a 2GHz controller even if you're only running 333MHz DDR.
Re:Real reason for delay (Score:2)
Was it because of the humor article "Is Your Son a Computer Hacker?" where it said:
"If your son has requested a new "processor" from a company called "AMD", this is genuine cause for alarm. AMD is a third-world based company who make inferior, "knock-off" copies of American processor chips. They use child labor extensively in their third world sweatshops, and they deliberately disable the security features that American processor makers, such as Intel, use to prevent hacking. AMD chips are never sold in stores, and you will most likely be told that you have to order them from internet sites. Do not buy this chip!"
Any info would be appreciated, since adequacy.org redicects to kiro5in. What's AMD's beef?
Re:FUK AMD (Score:2)