Is Prescott 64-bit?
unassimilatible writes "According to The Inquirer, Intel's new Prescott has 64 bit instructions lurking inside. Could really rain on the parade of those who thought the new Athlon 64's would be supreme - especially when you look at Intel's price roadmap. Don't run out and buy an Athlon 64 just yet..."
Itanium is a full fix to the problem. The horrendous x86 ISA is completely replaced by an explicitly-parallel (EPIC) instruction set that has all the nice properties of a RISC machine (easier to compile for, less stress on the memory system as you get 128 registers, easier for the machine to decode the instructions as they're fix format and don't require RISC conversion, etc.). The problems with it are:
1. You need a compiler that "knows" how to bundle instructions effectively (a VLIW-compiler). GCC clearly isn't there yet (it's not uncommon for the intel compiler to beat gcc by 30->50% when running computationally-intensive stuff)
2. Being completely different than x86, it can't be very efficient at emulating x86 programs.
AMD partially fixes the problem by extending the x86 ISA to 64 bits, *and* adding 8 general purpose registers. Because they just extended the ISA, running old code is just as fast. Furthermore, new code can benefit from from the extra 8 registers, and run even faster.
For the short term the Opteron is a pretty impressive chip, but I really don't see how AMD is going to stay on Moore's curve with such a shitty instruction set architecture.
P.S. Clearly 32 bits can only address 4GB of RAM, and for *some* servers more addressing space buys you something. But I'd say they are a very small minority.
The Raven
As far as I know, the UltraSPARC made its debut in 1995, while the first 64-bit Alpha from DEC was announced in 1992. 64-bit MIPS and PA-RISC chips were probably sometime between those two dates. See here.
Ubi dubium, ibi libertas.
Athlon chipsets sucked rocks for a long time, and were really unstable. But VIA finally got their act together, I think with the KT133A.
AFAIK, other than stomping on occasional driver bugs, Athlon chips have been pretty excellent ever since. I have an Athlon 1900+ on an ASUS A7V333 that's rock solid, and a new Athlon 2500+ on an Nforce2 board that's not quite as solid, but which is still pretty good.
I'd like to see some improvements on the NForce2 chip stability. It's not all the way there yet, in my opinion. But the VIA chipsets are extremely solid.
You do realize that there has been no such thing as a "CISC processor" since the Pentium Pro came out. Underneath the X86 bytecode VM, Pentium IVs, Athlons, etc. are highly advanced RISC cores with multiple concurrent execution units.
The main reason that the huge expensive power sucking Itanium scrapes out a small lead in benchmarks over X86 CPUs is because of its expensive huge power sucking cache.
You're guess is basically on the right track. I don't want to violate any NDAs, but let me just say that the AAA and AAS opcodes will now support Unicode.
As the author of the article, I had to REALLY make things vague. The people involved would be hurt badly by Intel if their names got out. Some of the situations that were told to me make it quite apparent who was leaking. That was as specific as I could make it :(.
-Charlie
gotta compete. Intel had to come with something better(cost effective) than Athalon64. If there was no competition, we would be still using 8088/6
LOL, Intel is actually their largest competitor. Every time they release a new chip guess who they are primarily up against? People who are running other Intel chips.
Without AMD though, I'm sure Intel would keep their new chips at higher price points for a bit longer and milk the power user crowd for a little more money.
Intel x86 CPUs can already address 36 bits of physical memory, which should be enough for the next few years.
Intel doesn't do on-board memory controllers because they got burned by Timna.
A really good example of what you are talking about is the G5. It simply extends an efficient architecture to 64 bits. Other than upping the memory limit, it does precious little to performance. The chip in 32 bit more is about as fast as 64 bit, and only starts to show a difference when memory useage gets large.
As for AMD, you can see the effect by running a program in 32 bit mode, then running a 32 bit program recompiled to take advantage of the registers in 'compatibility' mode. There is quite a difference.
-Charlie
Don't run out and buy an Athlon 64 just yet...
To anyone that 64 bits might make a difference for, they're steering clear of Intel, who has stated they're not going to focus on that desktop market for another 5 years. So all this article amounts to is Prescott FUD to support Intel's (misguided) roadmap.
Disclaimer: I own some AMD stock and I do my Unix development on Mac OS X.
However, the compact legacy CISC instruction set does conserve on instruction cache space. This offsets much of the cost of the conversion logic. Moreover, it allows custom optimizations for the exact architecture du jour without affecting binary compatibility.
And any way you look at it, you have to read from memory a lot more often with 8 "general purpose" registers than 32 real GPRs, which is what most sane CPUs have.
Many modern X86 CPUs have more than 32 real GPRs which are utilized by register renaming. Like quantum mechanics, the processor state for any given instruction is smeared out over time and space, and the CPU is operating on many instructions simultaneously. The number of visible registers just doesn't matter as much as it would seem on the surface.
Itanium doesn't have to do this, PowerPC doesn't have to do this, no modern ISA requires this nonsense.
They will when somebody figures out the next architecture trick that doesn't match the assumption of the designers of their ISAs. Take a look at history; remember when MIPS stood for "Microprocessor Without Interlocked Pipeline Stages"? What did the R4000 introduce? Could it be - interlocked pipeline stages? Exposing CPU implementation details to the software is not something that wears well over time.
It doesn't just "scrape out a lead" in floating-point benchmarks, it absolutely destroys the x86 competition.
That's because the FPU has not been very important in the X86 market up to this point. Business and multimedia apps just don't need it. If AMD or Intel put their efforts into an X86 with ultimate FPU performance, it could match or beat the Itanium.
I suspect that Intel took advantage of the huge schedule delays in the Itanium to throw in more FPU horsepower because it had to beat the consumer-grade chips on something.
And oh yeah, its running at what, half the clockspeed of the P4? If Itanium had the same economies of scale behind it at this point, there would be no competition.
As I said, the cache and memory architecture is the primary factor in the performance of CPUs today. Clockspeed, instruction set, registers ... who cares? Everything that's not cache is only a small fraction of the die size.
All of that hardware architecture stuff is a red herring. Worrying about those non-issues has caused the Itanium schedule to slip nearly a decade while they desperately tried to write a C compiler that could statically wring out performance from their brittle concurrent execution model without the benefits of the run-time statistics information available to the X86 code translators.
I didn't consider timing when I wrote the story, or any of it's predecessors. Silly as I am going to the A64 launch tuesday. Anyway, I have been chasing this story since the chip-architect articles. The timing was unfortunate, but it wasn't an Intel plant, that much I can assure you.
For about 3 months, I have known there was 64 bit functionality there, but I didn't have enough to prove it to my own satisfaction. I chased leads, interviewed people, and got that info.
The fact that IDF brought me into close proximity with a ton of sources was the thing that got me so much info so quickly. There was only one thing from Intel directly, the rest were from third parties supporting the chip. If IDF had happened last January, I probably would have gotten the info then.
-Charlie
You can order amd64 systems from places like appro and Penguin Computing right now, with decent sized collections of 64-bit applications provided by popular distributions such as SuSE. Let's not forget that the amd64 CPU's can run ia32 binaries at speeds faster than many ia32 CPU's and on a system with an amd64 kernel allow for more aggregate address space consumption across processes and the ability to install tremendous amounts of physical memory for buffers and cache even if individual processes can only take advantage of a few gigabytes.
With other groups like the Debian project well underway in their amd64 porting efforts, you can expect thousands of popular applications built for the amd64 platform. There's tons of software available for amd64 already, and you can bet by the time that AMD releases their "Athlon64" or whatever they're targeting the low-end market with, there will be even more.
According to Le Inq, Prescott takes more than that.
Now these may have been taken from a roadmap that I really should not have seen, but you can see that the 100w number is a bit conservative. The next few generations are specced to narrow the gap between min and max power usage, but not lessen it. Depressing.
http://www.theinquirer.net/?article=11588
-Charlie
I'd like to inform you that any modern processor in a laptop will run hot. If you don't believe that to be the case, I invite you to run a p3 or p4 laptop on your lap for several hours.
Tell me, Amsterdam Vallon, what broke on your AMD college computer? Unless it was a defect with the construction of an AMD processor, your point will prove irrelevant. I'm using an AMD processor right now, and my Windows 2000 machine got a virus thanks to IE and broke. That's not AMDs fault. My old motherboard needed a flash upgrade to use an XP 1800+. That's not AMDs fault. My hard drive was an old 1Gb and after years of service, died. That's not AMDs fault. Furthermore, if you managed to crush your core, or if you installed inadequate cooling or did a substandard installation initially in any way, you cannot blame AMD. They make processors. Installing a P4 with some sub-par Aladin chipset motherboard by PC-CHIPS, a 100 watt power supply, an IBM DeathStar hard drive, cheap ram made in some communist country and a Socket 7 heatsink will result in your machine breaking as well.
For the record, I'm on my fourth Athlon. I've used the chips without problems, upgraded without a hitch, and run the new chip without problems until I decided to upgrade again. My next machine is undoubtedly going to be an Athlon 64 as a result of the quality I've witnessed.
It's been a long time.
To be more accurate, Intel didn't do it first, but rather the NexGen 5x86 did it first. AMD bought NexGen, in part b/c their K5 sucked ass. and it was the NexGen engineers (in part at least) who made the K6 as well.
So, it wasn't just 6th gen processors that had RISC emulating CISC.
--
tabris
I also agree with you about RSE being a mess - but stackable registers (similar to register windows in Solaris) is a very effective mechanism for reducing memory accesses. It does make out-of-order execution a living hell, but in the end it all comes down to stressing the memory less, as RAM doesn't follow Moore's curve ...
The Raven
x86 machines have had 48-bit address spaces for years. Some of them even bring out a few more pins, so you can address more than 4GB of memory. It's even supported by both Linux and Windows. You can't have more than 4GB per process space, but you can have more than 4GB in the machine. Works fine.
Mike Magee STARTED The Register, and still owns part of it.
He left due to an internal disagreement and started The Inquirer.
Get som facts before you babble my friend!
Maurice W. Hilarius Voice: (778) 347-9907
88000 was a Motorola RISC processor from the late 1980's. Perhaps you mean Intel i860? (Intel's late 1980's RISC processor.)
You don't upgrade to get a faster CPU. Not these days. You upgradee new board supports SATA RAID, which will give you a performance
for other reasons -- your old motherboard is maxed out for RAM, and
you need more. Your old motherboard is USB1.1 and you want 2.0. You
could get an expansion card, but you've only got one slot left and you
really wanted to add IEEEwhateverthenumberisforthattrademarkedbus.
Th
boost for disk-intensive applications. And so on and so forth.
Do you go for a faster CPU while you're upgrading? Well, sure.
Nobody wants to buy a new computer with the same MHz number as the
old one, for psychological reasons if nothing else. But unless you
raytrace animations for a living or something, it's probably not the
thing driving you to upgrade.
Cut that out, or I will ship you to Norilsk in a box.