AMD Opteron Due In April
updog writes "Here's an article from Infoworld claiming that the new 64-bit AMD Opteron is ready to launch on April 22. Some of the notable features of the new chip are an address space capable of addressing up to 1 Terabyte of memory, the ability to link up to 8 processors without any external chips, and backwards compatibility with existing 32-bit applications ..." PapaFSmurf, meanwhile, links to a disclaimer-heavy article posted at amdboard.com which says that 64-bit Athlons may arrive in June rather than September as previously expected.
Imagine how fast THE INTERNET is gonna be now!
TWICE as fast, at least!
If moderation could change anything, it would be illegal.
Barton isn't enough to keep AMD going against Intel until Sept. Simply not gonna happen, and I think they have seen that coming and are trying to head it off by launching the Athlon64 closer to its originally planned release.
Once the Athlon64 is available and people are building systems using it, AMD just stole back the "King of PC processors" title and in a BIG way.
Natural != (nontoxic || beneficial)
Affordable x86 64-bit servers for the masses, this is going to revitalize AMD and really put it on the map as a serious challenger to Intel. I hope Chipzilla wakes up and sees that its incredibly expensive and backwards-incompatible Itanic 2 chips are the result of engineers developing for themselves instead of developing for the needs of their customers. Finally, AMD will be able to court the high profit business market, though I fear that they might alienate their hardcore enthusiast consumers in the process...
AMD's enchanted April
AMD's 64-bit alternative
By Tom Yager March 14, 2003
After years of hype, the AMD Opteron 64-bit processor will debut in April. The company and its shareholders might curse the rotten timing, but the current contracted market is actually the perfect setting for AMD's new technology. While other chipmakers scramble to adapt, AMD seems to have designed current business challenges and priorities into its architecture. Considering how long Opteron has been in engineering, AMD is either very smart or very lucky. Opteron may be an opportune solution for customers looking to consolidate their servers and reduce operating costs.
The advantages of AMD's new design are many. The most talked-about feature is the CPU's support for 64-bit applications. Unlike previous 64-bit processors, Opteron implements the full x86-32 instruction set. Software that runs on a Pentium III or AMD Athlon now will run unmodified on Opteron. Opteron-based servers will likely spend the majority of their time running the 32-bit Windows and Linux programs that businesses use today.
Software written to exploit Opteron's 64-bit capabilities will break through the barriers that prevent the x86 from running extremely demanding server and technical applications. A vastly expanded address space (up to 1TB of physical memory), a larger set of high-speed registers, and new instructions will take affordable servers to a higher level of performance. Running in 64-bit mode, an Opteron application can crunch through mountains of in-memory data and perform blazingly fast data transfers to network and storage devices.
Unlike other x86 processors, the Opteron CPU has the inherent ability to link up to eight processors without specialized chips. Every processor has three HyperTransport bus controllers for fast connections to other CPUs and devices. Instead of using an external memory controller, which complicates system design and adds latency, AMD links memory directly to each CPU. The design has plenty of headroom to accommodate faster memory and I/O devices. The only speed limit is the 19.2GB per second capacity of each chip's combined HyperTransport channels, which exceeds the top speed of the most capable PC server bus.
In systems that require more than eight processors, Opteron will rely on external chipsets to provide communications between CPUs. The fact that HyperTransport is already on the chip simplifies the engineering. Systems running two- and four-CPU configurations -- which account for most x86 server sales -- will ship in 2003. How soon larger systems appear depends entirely on market demand.
Answering critics
The chief criticisms leveled against the platform by Intel and critical analysts -- mainly that Opteron is immature technology and that Microsoft is dragging its feet porting Windows to it -- will prove groundless. The well-respected and thoroughly debugged Athlon x86 processor is the foundation of the Opteron chip. The remarkable HyperTransport bus that AMD uses to tie Opteron chips to each other and to I/O devices is already in widespread use. The DDR (double data rate) memory that AMD has chosen for its first implementation is inexpensive and readily available. AMD's chipset implements standard PCI-X and AGP ( Accelerated Graphics Port ) peripheral buses. System manufacturers and customers will have relatively few adjustments to make.
The Windows question is slightly trickier, but it isn't an issue Intel should press too hard. Yes, the sole 64-bit version of Windows runs on Itanium and Itanium 2. However, Microsoft has repeatedly stated that it strongly prefers AMD's 64-bit architecture to Intel's. Opteron is not stuck in the same spot as Intel at the launch of Itanium. Intel had to wait for Microsoft to announce its Itanium-specific port of Windows . Opteron already runs 32-bit Windows at full speed, while other 64-bit CPUs must use emulation to run most Windows software. Microsoft's engineering task, which it needn't hurry to accomplish, is to
Duron? Opteron? Who comes up with these? They sound like types of rubber, or possibly fuel additives...
Actually, more people than you'd think. Most modern processors are limited to a 4 GB address space and one has to "hack" around this limitation to achieve larger spaces. The reason why I say more people than you'd think is that the 1 TB space would allow one to map any location on the largest hard drives available today to a virtual memory location. There would be a *tremendous* performance boost in doing this as opposed to other contemporary file management schemes.
The only thing the article references is that Newisys is leading the charge, but I don't see anything I would consider a source. Racksaver is claiming that evaluation units are available now, but mostly there's just a lot of Opteron Server Evaluation signups. Does anyone have any real information? Cost? Non-evaluation release? Anything?
AMD.com
No, I ma not making a joke. Yesterday, Friday March 15 2003 I bought my new Soyo KT400 Dragon motherboard and a AMD 2600 Athlon with 333 bus. I have been cutting edge for 24 hours.
I never will win.
On a more positive note, any body know of motherboards for these monsters yet?
Great people don't need people to complete them, great people complete other people. -- Matthew Pawlikowski.
The release date of April 22 was released a long time ago
(ie: January 21, 2003, just incase you didn't get the picture)
To make a pun demonstrates the highest understanding of a language
The big question is whether or not Palladium will be built into this chip. Anyone know?
The Opteron is being launched on the 22nd of April. It was code named Sledgehammer, and is what Newisys and others are using. It is the server version of Hammer.
Athlon 64 is coming out in September. It is the desktop and mobile version of Hammer that was codenamed Clawhammer.
ignorance is bliss. googlefiberatx.com
The width of the machine is not the number of threads it's able to run at a given time (SMT like P4), but the number of instructions you can execute in a cycle (in a parallel fashion, of course). The difference between x86 and Itanium is that on Itanium the instructions are "explicitly parallel", i.e. the compiler bundles instructions together and the processor knows it can execute them in parallel, while on x86 the processor is smarter and determines itself which
micro-ops (actually it's rops for AMD) it can execute in parallel.
You're very likely going to see speedups on 32 bit code, simply because Opteron is twice as wide as P4 (and this has nothing to do with the size of the operands, which probably is not going to matter as much).
As far as the 64 apps are concerned - Linux already runs on it.
The Raven
here
and Athlon64 boards here
If you use a source-based distribution, like Gentoo, everything in your system will be compiled for your 64-bit architecture when it's installed. You'll be able to take advantage of your new 64-bit architecture right from the get-go.
Windows is still going to use the swap file.
Since the advent of 32 bits processors there's been a huge amount of research trying to shrink the space taken by data-structures and similars. I've read of techniques devoted to save a couple of bits in Java classes for instance. Now we get addresses 64 bits long... I guess we can forget all of that... Any thoughts guys? Is it really worth trying to squeeze bits in object code if every 5/10 years processor technogy makes a leap?
And how many apps for 64 bit exist in the market?
And how many 32-bit OSses and apps were there back in 1985, when the 80386 was released? At the time it was released, it was treated by most users as an even faster 8086. It took ten years before a semi-32bit OS was accepted mainstream, and on top of that another 7 years before every sold PC had a full 32-bits OS. The success of the 386 was in its backward compatibility, and so will the success of x86-64 and the failure of the Itanium as mainstream-cpu be.
There are previous posts repeating this, but there's one problem with the theory: That taking advantage of the 64-bit architecture from the get-go will actually give a speed benefit. It also assumes that all of the source code is '64-bit clean', although that is likely not going to be much of a problem.
The problem is that, sure, everything is compiled into 64-bit mode. Fine. But can the compiler optimize the 64-bit code as well as it can optimize the older 32-bit code? Will the compiler make good use of the extra registers? I'm willing to bet that, for the first while, 64-bit AMD compilers will generate slower code than their 32-bit counterpart. (For 99% of all applications -- those which do NOT need more than 2 GB of memory).
And, of course, there is my major question: What kind of context switch and/or process latency can we expect from the Athlon64 & Opteron? I realize nearly all hard real-time apps don't need anything this powerful-- most good engineers will just put a microcontroller in to handle the hard real-time, and buffer things enough so that it doesn't matter that the workstation isn't hard real-time; but it does have a serious impact on other aspects of system responsiveness, as well as overall system performance for a microkernel architecture (such as HURD, Darwin, or QNX).
For that matter-- how will the Opteron's context switch time compare to other 'server' processors, like the UltraSparc, POWER4, Itanium, and, for good measure, PowerPC? Most of the arguments I've seen about modern x86 having a horrible context switch time don't seem to hold up to benchmarks I've seen-- where identically-clocked PowerPC and Pentiums take nearly the same time (and hence nearly the same number of clock cycles) to context switch...
-- Sometimes you have to turn the lights off in order to see.
Simple... The Opteron is designed for server use and has the massive cache etc while the athlon64 will be priced for home use.
Depends.
Linux has been running on 64-bit architectures for some time now, so 64-bit'ness isn't a new thing. And with distros like Debian that support everything from m68k up to Alpha a large portion of the issues have been taken care of already. Debian 'sarge' currently has 10058 packages, all of which compile and run (I'm not going to say all are 64-bit bug free, that would be stupid).
And also most Open Source apps are used on Solaris, Tru64, HP-UX, etc which are 64-bit. Windows may have a big transition ahead of it, but for the rest of us it'll be just like any other motherboard upgrade =)
However, Michael Dell has historically been Andy Grove's bitch...
I dount Dell will ever use an AMD chip, even if it means losing sales.
The article did include all details of all 64-bit Intel CPUs which are backwards compatible with x86 CPUs without emulation.
I am TheRaven on Soylent News
No, the ceiling is not stupid.
Limiting the physical memory to 40 bits reduces the cost of building other system components, such as chipsets and motherboards, dramatically. Further, 1TB of RAM is sufficient for the current market. That is 1000 1GB parts, to give it some perspective.
As customers begin to approach 1TB requirements, AMD only has to implement more lines. No need for any segmentation hacks. The ISA needs no modification. This is a pragmatic and wise design decision.
64 bits of address space is still very useful without having actual RAM to back it up. It means you can map large quantities of storage into RAM directly. For example; if you have 10TB of disk, you can map all of it into a single virtual address space and address it with simple offsets. Obviously this is useful for large databases.
Maw! Fire up the karma burner!
Cost.
These chips (for the first few years *at least*) will be low end or mid range small servers. Small in that they won't be competing with Sun Fire 15K servers which themselves only support physical memory of up to 576GB. No one will need those extra 24 memory address lines, so why build chips or motherboards with them?
Another way of looking at it is real-estate.
Currently 1GB dimms are the normal upper end, with a few 2GB dimms around.
At that rate, can you imagine the motherboard that could actually reach the 1TB RAM limit? 512 or 1024 dimm slots?....
Would that fit into the ATX form factor?
Actually, it makes a lot of sense. Digital did the same thing with the Alpha. The architecture supports 64-bit addresses, but the current implementation does not. Who actually needs 64-bit addresses now? I suspect that anyone who does is not going to be using x86 hardware. Software can be written now which can take advantage of 64-bit addresses, and it will not need to be recompiled for newer versions of the series.
I am TheRaven on Soylent News
The benefits should be substantial. Besides the normal benefit of havin 8 CPU's you also get massive memory-bandwidth. Each prosessor has a memory-bank of it's own with 128bit mem-bus to it. But they can also use the mem-banks of the other CPU's (altrough there is a latency-penalty when doing that. But the latency is pretty good when compared to other architectures). So you could say that 1 Opteron has 1 x 128bit mem-bus, 2 Opterons have 2 x 128bit mem-bus, 4 have 4 x 128bit bus and 8 have 8 x 128bit mem-bus.
Lesbian Nazi Hookers Abducted by UFOs and Forced Into Weight Loss Programs - -all next week on Town Talk.
Manufacturing ICs is not an exact science. Very small impurities can render a die, or a part of a die unusable. The manufacturer can either throw these away, or disable the broken part and sell it as a crippled version. Intel's Celerons were just the P2/3s that had failed part of the cache tests. A lot of the difference between the Athlon64 and Opteron is cache size. A second is SMP support. The Opteron has 3 HyperTransport controllers on die that allow it to communicate with other CPUs with no 'glue' architecture. The Athlon64 will only have one, so any Opterons which have one defective HT controller can be sold as 2-way SMP parts, and those with 2 defective HT controllers will be Athlon64s (those with 23 defective HT controllers will be marketed as paperweights).
AMD has only developed a single CPU, and it will only manufacture a single CPU. It will market this as the most expensive Opteron. All the other versions are simply failed versions of this, with the broken parts disabled.
As an aside, you actually could enable the other half of the cache on some Celerons by mutilating the CPU a little. It was a particularly bad idea, since errors in cache tend to make your system rather unstable, but it could be done...
I am TheRaven on Soylent News
Blockquoth the poster:
Your caution in this is reasonable, but don't forget that the GNU community is using GCC, which supports a gazillion of architectures, so:
So I would expect that the code GCC produces for x86-64 will be comparable to that produced for i386, and probably faster from the beginning, because of the extra registers. The extra size might affect some cache critical programs, but with the default integer size still being 32 bit, I wouldn't worry too much about that.
Vasilis Vasaitis
Late readers: please moderate at Newest First, with a low threshold, to promote late writers.
Just a word to root for the ISO standard terminology,
which disambiguates the base-10 and base-2 scale
prefixes: 2^40 bytes is a tebibyte.
Also, although the Hammer implementation may be
limited by physical address lines, I believe the
AMD architecture uses a 48-bit address space,
for 256 tebibytes of addressable memory.
-I like my women like I like my tea: green-
You will probably need a 32bit glibc. 32bit code runs in a different processor mode (think the V86 mode used in dosemu)
I compiled everything from source for my Alpha, not one problem with 64-bit cleanliness. That was a problem in the early 90s. There are enough 64-bit chips around now, that every serious application has been compiled on one at some time.
GCC is already ready for the Hammer chips. If compiling for x86-64 you get code generated that can make use of mmx, mmxext, 3dnow, 3dnowext, sse, and sse2. It actually prefers those instructions over the old x87 functions by default.
I've heard that the Hammer is 2.5 times faster per clock than the lastest Athlons in 32-bit mode, and faster still in 64.
I WILL be building a machine around 2 of these chips as soon as the first Tyan board ships with PCI-X slots.
And you're getting this pricing for the Opteron from where? Everything I've been hearing has the low end stuff -=undercutting=- the current Xeon pricing, while still running more than standard Athlons/P4's.
Geez.
Its going to be really sad how quad motherboards are still going to cost at least $800, even though they've got to be downright trivial to make compared to modern quad system. No central switching logic, just interconnect buses between processors! PCB and sockets aint that expensive, there's really no excuse. But its going to happen anyways.
Speak nothing of the many-thousand dollar eight processor boards.
Damn cushy profit margins.
The Abit BP6 was my introduction to low cost SMP. Now I've got a craving for more, but I dont think its going to happen. Even thought it could.
Maybe someone will get smart and make a enthusiast board. I seriously doubt it though. Not when there's bigger fish to fry. How long is it going to take for someone to realize that although less profitable, there will be untapped demand for non-server class quad systems.
Myren
In the article, it said "...AMD links memory directly to each CPU", which right off the bat makes me think NUMA (Non Uniform Memory Access). NUMA machines are made up of nodes, which have a processor or two, and a bank of local memory. But one CPU can access any memory in the system, it just takes longer if it isn't local - hence the name NUMA. Did anyone else get that impression?
Gotta get me one of these!
I don't think this is true. The Opteron will be replacing AMD's current Multi-processor offering (Which is the Athlon MP) and the Athlon will replace the uni-processor offerings (Athlon XP.)
There will probably be some "High End" Opteron chips with large cache, but overall the chips will most likely be priced for market share.
I do agree with the part about the main competitor being Itanium, but not price-wise. Ultimately I believe that Itanium will offer better performance, but at the price of compatibility and cost.
I can't wait to have a 64-bit workstation!
- It's not the Macs I hate. It's Digg users. -
What makes the Opteron a server chip is the presense of three hypertransport links, the bus used for communication between multiple CPUs and other components such as the motherboard chipset. The Athlon64 will have only one. This is important since hypertransport, unlike say PCI, uses point-to-point links. The AGP and PCI bridges could be on separate hypertransport links and in theory we could see things like gigE controllers directly attached to the hypertransport bus.
Also, last I heard, the Opteron will use Dual DDR memory, while Athlon64 will have to make do with single-channel DDR. Recall that both Hammer chips (SledgeHammer, aka Opteron, and ClawHammer, aka Athlon64), have the memmory controller integrated onto the CPU.
For both of these reasons, the Opteron and Athlon64 sockets are incompatible (Socket 754 vs Socket 940). There's an old review with plenty of information here
Oh man, I'm so excited about:
gimp2.0_x86-64 provides gimp
gimp2.0_x86-32 provides gimp
yes, the little things in life...
-l
Help cure AIDS, cancer, and more. Donate your unused computer time to worldcommunitygrid.org. Join Team Slashdot!
Opteron is AMD's first real entry into the server market. AthlonMP was never truly meant for anything other than workstations or very low level entry servers. Itanium is Intel's offering for very strong, high performance servers. Their Xeon chip is the high-end workstation to mid-range server chip. All indications are that AMD will be targetting low to mid range servers with the Opteron.
Intel's point is that they don't believe anything other than high-end servers will use 64-bit chips effectively. AMD's point is that anyone can use whatever they like. A dual Opteron with 2MB L2 will most likely be targetted against dual Xeon machines. AMD will try to offer a better price point and the ability to run 64-bit applications to potential customers in their attempt to win partners. The launch of the Opteron had best go off MUCH better than the launch of the AthlonMP if AMD hopes to make it in the server business. Already weary of new products, big businesses will be looking for any excuse not to go with an Opteron. Even Intel has trouble convincing long-time customers to invest in new products, as evidenced by their dismal Itanium 1 launch. If the chipset and board problems we saw in the Tyan AthlonMPs creep up in any of the Opteron boards, AMD's cash cow will be seen in the business community as little more than the 'roo meat at McD's. That, potentially, could end up being the nail in AMD's coffin.
They have a lot riding on this launch, so let's all hope it goes off without a hitch. If it does, I think Xeon processors will be collecting dust within a year's time while Opterons slowly replace what's currently in the workplace. In very, very few circumstances will a company look at Opterons as an alternative for Itaniums. In terms of performance? Who knows; we haven't see benchmarks on production Opterons yet. If it's everything we're told it is, it may very well outperform the Itanium 2s vis-a-vis.
-- "Government is the great fiction through which everybody endeavors to live at the expense of everybody else."
Ohh, very cool. I was still under the impression that Athlon64's would be single-processor systems. In that case, I can't wait for a Dual Processor, 64-bit workstation!
As for price, they will have to be cheap to stay competitive with Intel's 32-bit offerings. The new 64-bit core of the Opteron/Athlon64 will replace the entire line of AMD's processors, so there will be cheap Athlon64's in time.
I just hope that the AMD chips aren't used solely as "Fast 32-bit" processors; I would hope that Microsoft and other software developers keep to the promise to have 64-bit versions of windows and other packages. Once again Microsoft will be developing Windows for multiple platforms; it used to be x84, MIPS, PPC, and Alpha. Now it will be x86, x86-64, Itanium.
Of course, Linux already has x86-64 support, but for mainstream acceptance we need everything to support it, even Windows. (ie- let's not turn this into a Linux vs Windows fight.)
- It's not the Macs I hate. It's Digg users. -
The so-called "very low end server" market makes up 90% of all x86 servers sold. Most x86 based data centers, albiet Linux or Windows, use single or dual-processor servers in load-balanced clusters. Most of these servers run Pentium 3 processors.
These servers handle most of the workload, while a small few "high end x86" servers can handle database things. Sometimes, the database sits on non-x86 platforms.
The AthlonMP is a higher-performance alternative to the dual-Pentium 3 server/workstation, and can even perform well against a dual-xeon box.
It's hard to know where AMD is trying to position this processor, but it seems to me that the new processor core is positioned to compete with Intel on all fronts. Perhaps the processor will perform so well and be so much cheaper then the Itanium that it will bring very-high performance computing to places that could normally not afford it. We don't know; we'll have to wait and see.
- It's not the Macs I hate. It's Digg users. -
Nothing that I've seen on the 'net even hints that the Opterons will be that much of an increase in speed over the Athlon that exists today running at near the same clockspeed. The best figure that I've seen is only 20% improvement in speed in 64-bit mode because of the new registers.
The best database server comparison I've seen in a while is http://www.anandtech.com/IT/showdoc.html?i=1747 which shows that while the AthlonMP can keep up with the non-HT Xeons that are clocked near its PR rating, the Xeons are walking off from that point just by virtue of clockspeed, even before you add in HT.
Remeber, 64 != 32x2 in this case. Clock for clock, the Opterons are probably not going to be *that* much faster than an Athlon on many things. Personally, I'll be impressed if we see a 33% speed improvement at the same clock speed and I would bet that we could attribute a lot of that to FSB speed increases over the AthlonMP.
The Opteron's ace probably isn't going to be a huge performance increase. It's going to be that it can address a lot of memory.
Nothing hints to this? What about this very same slashdot article? They are giving the Athlon64's the "PR" ratings of Barton-core AthlonXP's at a much lower clock speed. This seems to be an indication of a pretty good speed increase in just running 32-bit software.
When running 64-bit software, it will depend on how heavily the software utilizes the extra data width. What type of application. And so forth.
It all depends on your perspective. Perhaps in your case, addressing 20GB of RAM is important to you. For me, 4GB is more then enough for any application I've used besides large database applications.
For me, a 20% speed increase out of the box before 64-bit software comes into play is quite acceptable. IMO, as long as the Athlon64 remains competitive with the Pentium 4 in terms of performance, then I'll love buying and using AMD chips now and in the future.
Besides, can you see of any other way the mass-market of computer users is going to be able to get their hands on 64-bit computers? I'm more excited about higher performance desktops then the high end server market.
- It's not the Macs I hate. It's Digg users. -
They really shouldn't have double the memory foot-print. Code size increases only slightly (from an average of 3.4 bytes per instruction to 3.8 bytes per instruction). Integer size remains the same. Only long integers and pointers double in size. You'd probably get more of a noticible growth by compiling with GCC with inlining enabled.
A deep unwavering belief is a sure sign you're missing something...
Sorry, the uptime for Opterons will be worse than any 64-bit competitor. Why? you may ask. Basically, there are two reasons: ut2k3 and CS, both of which run natively on x86-64. So, what will all vigilant admins do? Just a guess.
When 386 came out we already had exhausted 20bit adress space we had and trying funny hacks with accessing the memory thorugh the keyboard controller chip (not really but A20 thing was close enough.) And 32 bits merely 4096ed it. The transition from 32 bits to 64 bits will increase the addressable space 4 billion times, which is a million times higher than four thousand of 386. That should make this transition quite a bit more durable.
Gentlemen, you can't fight in here, this is the War Room!
AMD's Opteron processor will be offered in three varieties:
Opteron 1xx series - Single processor workstations/servers
Opteron 2xx series - Dual processor workstations/server
Opteron 8xx series - Up to 8-way SMP servers
The first two are pretty directly targeting the market that the Xeons currently sell into, but the last one starts to touch on the Itanium's market. The Xeon tends not to scale well beyond 2 processors, and in fact, most Xeons won't work at all in anything more than 2-way systems (only the Xeon MP is certified for use in 4+ processor systems).
As for price, AMD hasn't announce any prices, but I think it's quite reasonable to assume that the 1xx series of Opterons will be quite reasonably priced (probably not significantly more than an AthlonXP or P4), the 2xx series will be priced similar to the Xeon (about 10-20% more than a P4 of the same clock rate), while the 8xx chips will cost about the same as a Xeon MP ($1000-$2000).
Of course, when it comes to servers, the processor is only one relatively small part of the equation, and so far all the Opteron servers I've seen have been pretty high-end systems, so I don't expect them to be cheap.