Slashdot Mirror


Linus Has Harsh Words For Itanium

Anonymous Coward writes "As a follow up to the earlier story "Intel: No Rush to 64-bit Desktop"... In words that Intel are likely to be far from happy with, the Finnish luminary has stuck the boot into Itanium. His responses to some questions on processor architecture are sure to be music to AMD's ears. Linus, in an Inquirer interview concludes: "Code size matters. Price matters. Real world matters. And ia-64... falls flat on its face on ALL of these."" Of course, Linus works for a chip maker ;)

149 of 787 comments (clear)

  1. But it's still a year away, isn't it? by More+Karma+Than+God · · Score: 3, Insightful

    Not to mention the fact that most home users won't see a 2X performance boost from 64 bits.

    --
    Go here to create your own Slashdot dis
    1. Re:But it's still a year away, isn't it? by Waffle+Iron · · Score: 4, Insightful
      Not to mention the fact that most home users won't see a 2X performance boost from 64 bits.

      Most home users are going to see a performance drop from 64 bits. 64-bit code needs 8 bytes to hold every pointer. This will serve to eat up more cache and memory bandwidth, which are already major bottlenecks for any CPU.

      Unless you have a program that actually needs to work on more than 2G of data at one time, 64 bits buys you nothing but extra time waiting to move around millions extra of zeroed out upper bytes.

      Some people need that much data in memory at once, but the average home user today doesn't. People typically mention video, but if there's one thing that's easy to stream in and out of a smaller memory space, it's multimedia data.

    2. Re:But it's still a year away, isn't it? by KewlPC · · Score: 4, Informative

      64-bit code needs 8 bytes to hold every pointer. This will serve to eat up more cache and memory bandwidth, which are already major bottlenecks for any CPU.

      The only thing this eats up is cache; because the system has a correspondingly wider data bus, there isn't a hit in memory bandwidth (unless the designers are trying to be cheap bastards and give a 64-bit CPU the same data bus width you'd use for a 32-bit CPU). And most 64-bit CPUs have a lot of cache.

      And as for what kind of applications you potentially need several gigabytes worth of memory for, there's scientific processing and the like.

    3. Re:But it's still a year away, isn't it? by Waffle+Iron · · Score: 4, Informative
      The only thing this eats up is cache; because the system has a correspondingly wider data bus, there isn't a hit in memory bandwidth (unless the designers are trying to be cheap bastards and give a 64-bit CPU the same data bus width you'd use for a 32-bit CPU)

      Ever since the 8086/8088 duo, the bus width of a CPU has been decoupled from its word size. For a long time, the external bus width of (non Rambus) 32-bit CPUs has been wider than 32 bits. This works because the memory unit fetches entire cache lines. The CPU designers could be less cheap bastards today and bring out 32-bit CPUs with 256-bit wide busses if they wanted to.

      And most 64-bit CPUs have a lot of cache.

      You could put a lot of cache in a 32-bit CPU. You could put a small cache in a 64-bit CPU. In fact, the biggest difference between high-end and low-end CPUs is just the size of their caches.

      To be fair, the current Itanium has an enormous cache that uses the vast majority of the die size and dicates its price and power consumption. It's logic core really isn't that big. If you embedded an X86 core in all of that cache, you'd get a very fast chip. If you teamed up an Itanium core in a Celeron cache, you'd get Celeron-level performance. 64 bits has little to do with it; you're mostly paying for cache and bandwidth when you buy high end CPUs.

    4. Re:But it's still a year away, isn't it? by EvilTwinSkippy · · Score: 4, Informative
      News flash...

      The pentium architecture has been loading 64 bits of memory at a time since the PII. They have to because that is the only way the RAM has a chance in hell of keeping up with the processor. Basically they load 2 instructions at once, and have them execute at double the speed of the RAM. (That's also part of why you get such a kick in the pants when you optimize with the -mcpu=i686 flag in gcc.)

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    5. Re:But it's still a year away, isn't it? by hayriye · · Score: 3, Funny

      No, 32 bits was twice faster than 16 bits, and 64 bits is twice faster than 32 bits. (Quotation from "Practical Handbook of IT Managers")

  2. Obsessive by snack-a-lot · · Score: 3, Insightful

    Not only does he work for a chip maker, he's like totally obsessed with the i386 architecture. I guess it's what he cut his teeth and and he's going to stick with it. But to think that no-one else has a use for it is very short-sighted.

    It'll probably still make it into the kernel, though. I mean, alpha and sun architectures are in there, so ...

    1. Re:Obsessive by leviramsey · · Score: 2, Informative

      Seeing as Intel was (still is?) a major backer of Red Hat, I'd imagine that Red Hat's kernel hackers have already ported it and will see to it that support for Itanium makes it into the release kernel.

    2. Re:Obsessive by petong · · Score: 3, Informative
      Red Hat already has an ia-64 release that costs like $700.

      But you can always download Debian for the ia-64 architecture for free...

      --
      Libation.com - Fine wine and beer

    3. Re:Obsessive by dildatron · · Score: 3, Informative

      Mandrake also has an ia64 port of their distrobution. I just installed it a few weeks ago (version 8.1 I believe).

      RedHat does as well, but their installer would lock up at the end of the install every time, with no errors in the install log. I installed Mandrake after I could not get RedHat to install.

      This was on a first generation (lion) itanium.

      --


      If you had nuts on your chin, would they be chin nuts?
    4. Re:Obsessive by Pharmboy · · Score: 4, Interesting

      Not only does he work for a chip maker, he's like totally obsessed with the i386 architecture. I guess it's what he cut his teeth and and he's going to stick with it. But to think that no-one else has a use for it is very short-sighted.

      He works for a company that doesn't build chips with the i386 architecture. Its emulated in firmwear, "code morphing" is what they call it. Its slightly slower than hardware but its worth the trade for power consumption.

      I am betting he has worked with plenty of morph code, creating virtual cpus, subsets of the i386 chip, or different completely. This is akin to designing hardware, in software.

      I can't see how him working for Transmeta hurt his understanding of processors. Seems like it would actually enhance his understanding.

      --
      Tequila: It's not just for breakfast anymore!
    5. Re:Obsessive by JWSmythe · · Score: 4, Informative

      It looks like all the big Linux distributions have gotten together to support the IA-64 Linux development.. This was the first hit on search "Linux IA-64" on google.


      http://www.linuxia64.org/

      Working distributions date back from 03/2000

      Straight from their page:

      IA-64 Linux Distributions
      # Caldera Systems (initial release 8/4/00) Download at ftp.caldera.com/pub/OpenLinux64
      # Debian (initial release 8/10/01) Download at www.debian.org/ports/ia64
      # Red Hat (initial release 5/17/00) Download at ftp.redhat.com/pub/redhat/ia64
      # SuSE (initial release 6/13/00) Download at ftp.suse.com/pub/suse/ia64
      # TurboLinux (initial release 3/13/00) Download at www.turbolinux.com/ia64.html

      Their short list of representative companies include: Caldera Systems, CERN, Debian, Hewlett Packard, IBM, Intel, Linuxcare, NEC, Red Hat, SGI, SuSE, TurboLinux, and VA Linux Systems.

      If you search their site, you'll see a few emails from Linus in their mailing list archives, so he's obviously involved at least to a degree (I couldn't imagine him not being involved). I dare say he's educated in the matter, and would know all the in's and out's of say putting together an OS. :)

      I'm sure support will be included eventually.. Well, maybe not.. I know Linux will run on SGI, DEC Alpha, ARM (I'm running Linux on a Compaq iPaq with an ARM CPU), so maybe they'll leave it as a patch and let folks do seperate distributions.

      I guess it's all in how widely used a processor is.. Not the average Joe has an SGI, Alpha, or Itanium at their house. (I'll keep quiet about the 150Mhz SGI Indy that we use as a doorstop).

      --
      Serious? Seriousness is well above my pay grade.
  3. ... AND FOR THE RECORD by YOU+ARE+SO+FIRED! · · Score: 3, Informative

    This is from the Linux-Kernel mailing list, not an Inquirer interview. Here is the post.

  4. All in one by Thalias · · Score: 2, Insightful

    Wow, just about everything under one topic. Linux, AMD, and Intel. So by this we are going to have 64-bit processors soon, is that what I'm hearing? Or will this turn out to be like most computer issues and come out a few years from now?

    1. Re:All in one by gearheadsmp · · Score: 2, Informative

      Well, you kind of forgot how MS server 2003 doesn't have support for x86-64, but has support for Itanium II. Can't leave MS out of the fray ;)

  5. Linus too Harsh by enigma32 · · Score: 4, Insightful

    Now, we all know that the Itanium isn't everything it's cracked up to be, and I think none of us at are wrong in blaming intel for coming out with a lousy product....

    But, isn't one of those situations he mentions in the interview (namely, running a large database server) what this chip is designed to be doing?

    As I recall, the IA64 isn't designed for the desktop user... In fact, desktop users probably don't even need 64 processing for a number of years still....

    Yet we're attacking Intel for making the chip to fit it's niche?

    Perhaps we need to be more fair in the context of the usefulness of the chip, instead of considering it in all contexts and criticizing it based on that?

    1. Re:Linus too Harsh by TerryMathews · · Score: 2, Interesting
      In fact, desktop users probably don't even need 64 processing for a number of years still....


      Desktop users need at least 64 bit memory registers real soon now. DDR memory is fairly cheap ($100/GB in 512MB increments). 32 bit processors top out at 2GB of RAM without resorting to trickery.

      Plus, when it comes to processors, technology develops features and then software develops around said features, not vice versa. Once there is a 64 bit desktop processor, software companies will develop 64 bit software.

      Heck, using VS .net2 or whatever it will be called, all you might have to do is add an option to your build configuration.
      --
      -- Terry
    2. Re:Linus too Harsh by Dynedain · · Score: 5, Interesting

      As I recall, the IA64 isn't designed for the desktop user... In fact, desktop users probably don't even need 64 processing for a number of years still....

      I need more than 4GB RAM (3.5 if I want it stable) for video editing and 3D rendering.

      AMD is developing their 64bit chipsets with the desktop market in mind, as well as the server market. Intel has forgone the desktop, which will turn out to be a huge blunder. Especially when its already a determined fact that the 32bit emulation mode on AMDs line slaughters the 32bit mode on the Itanium.

      --
      I'm out of my mind right now, but feel free to leave a message.....
    3. Re:Linus too Harsh by Amiga+Trombone · · Score: 5, Interesting

      But, isn't one of those situations he mentions in the interview (namely, running a large database server) what this chip is designed to be doing?

      Sure, but it doesn't really do it significantly better than some of the more common RISC architectures (Sparc, Power, Alpha), and it's a lot more expensive.

      As I recall, the IA64 isn't designed for the desktop user... In fact, desktop users probably don't even need 64 processing for a number of years still....

      Probably not, but a lot more desktops get sold than high-end servers. If AMD manages to get a toe-hold on the desktop with their 64-bit solution, the chances are a lot better x86-64 will migrate up the food chain than ia64 will migrate down.

      Perhaps we need to be more fair in the context of the usefulness of the chip, instead of considering it in all contexts and criticizing it based on that?

      Well, that's the point. How useful is it really? What compelling reasons are there for using it in place of a x86-64 on the low end, or something like Power or Sparc on the high-end? All things considered, it really isn't a bad chip. But it is a solution in search of a problem.

    4. Re:Linus too Harsh by angst_ridden_hipster · · Score: 5, Funny

      Good God, man, haven't you ever heard of polygon reduction? Bump mapping? Image mapping?

      It's hard to believe you *really* need all of that RAM. Then again, I haven't done 3D in years.

      When I was a CG guy, we dreamt of bus speeds above 66MHz. We couldn't even imagine having more than 32M RAM. And we thought it was reasonable to wait two days for a 2k image to render...

      --
      Eloi, Eloi, lema sabachtani?
      www.fogbound.net
    5. Re:Linus too Harsh by be-fan · · Score: 3, Insightful

      Actually, no. All of Intel's non-server chipsets top out at 2GB. The reason is that 4GB isn't possible without PAE thanks to stuff (BIOS, APIC, AGP memory, etc) that have to be mapped at the top end of the 4GB. Since 3GB is a rather weird size for main memory (hard to fill on dual-channel chipsets) 2GB is pretty much the reasonable max. PC2100 DDR (optimal for dual channel P4's) run about $50 for 512MB, or $200 for a GB. That's about what I paid for 64MB for my PII-300, and is quite a reasonable amount to budget for memory in a high end or even mid-range machine.

      --
      A deep unwavering belief is a sure sign you're missing something...
    6. Re:Linus too Harsh by angst_ridden_hipster · · Score: 4, Funny

      Ohboy, that's going to look really harsh when you read it after Slashdot removed my fake HTML tags reading "RANT" and "OLD MAN VOICE" around some of the content.

      preview, preview, preview...

      --
      Eloi, Eloi, lema sabachtani?
      www.fogbound.net
    7. Re:Linus too Harsh by penguinboy · · Score: 3, Informative

      While the systems can use 4GB of RAM, applications can't have the entire 4GB. RAM must be split into two segments - OS and Apps - usually a 2/2 or 1/3 split.

    8. Re:Linus too Harsh by tupps · · Score: 3, Insightful
      The thing is there has been so much hype recently about 64bit that people will assume that they need 64bit (of course it will twice as good as 32bit ;-)

      Good marketing beats good engineering!

      --
      Go out and get sailing!
    9. Re:Linus too Harsh by Dynedain · · Score: 2, Informative

      guess what? CAD users have been the driving force in high-end workstations for quite a while now....current machines still aren't sufficient enough to do near-photorealistic design in real time....and they won't be for anytime soon. Untill then, this niche market (if I'm an anomoly, why is Autodesk such a huge developer and Microsoft's biggest supporter?) will continue demanding better

      --
      I'm out of my mind right now, but feel free to leave a message.....
    10. Re:Linus too Harsh by ryanvm · · Score: 2

      You're an anomaly because you use your computer to do video editing and 3D modeling, not because you use AutoCad.

      I've been around AutoCad users for the last 6 years (machine shop and city government), and I've concluded that most people using AutoCad aren't using it anywhere near its potential.

      Besides, Windows can barely keep track of 4GB of RAM as it is. I can guarentee you that Windows will be shitting all over itself for the first 18 months after Win64 is released.

    11. Re:Linus too Harsh by MonaLisa · · Score: 3, Insightful

      > Sure, but it doesn't really do it significantly better than
      > some of the more common RISC architectures (Sparc, >Power, Alpha), and it's a lot more expensive.

      This is bullshit. The Itanium2 slaughters the Ultrasparc 3 by factors of 3-4, and is 25% or so better than POWER4, and is a LOT cheaper than either one of these. You call the Alpha cost effective? It can cost nearly $100K for a loaded dual processor Alpha box, and the dual Itanium2 is still faster for close to an order of magnitude less money. I have 6 Itanium2 machines and more than 30 p690 POWER4 machines at my disposal and develop computationally intensive code on all of the above and more every day. You can buy a 900Mhz Itanium2 workstation for under $5K. The Pentium4 is a better deal at under $2K, but it is about half the performance of the cheapest Itanium2, and you're stuck at 32 bits. Anyone that would buy a high-end workstation will recompile, so emulation mode performance is absolutely meaningless.

    12. Re:Linus too Harsh by Jeff+DeMaagd · · Score: 2, Interesting



      Sure, but it doesn't really do it significantly better than some of the more common RISC architectures (Sparc, Power, Alpha), and it's a lot more expensive.

      Uh, Itanium 2's really aren't _that_ expensive, are they, at least compared to the lot you stated? I thought that an HP I2 workstation could be had for $8000, which isn't too bad for a 64 bit workstation. New Alphas certainly aren't cheap either, I have no idea about the latest Sparcs and Powers.

    13. Re:Linus too Harsh by puetzk · · Score: 2, Informative

      if you want to allow fast syscalls (trust me, you do) you need to keep the kernel mapped permanently to cheapen the context switch from app to kernel. You also probbly want to separate physical memory (mapped into the kernel space directly) and virtual space, so that you can have swap and mmap'ed files. You also probably need to keep some address space to map I/O devices into, and some for DMA buffers (unless you really want to give up DMA to get that last memory). What with all the memory on modern video cards, mapping them (to say nothing of the AGP window) is pretty huge too.

      So, unless you want to rewrite a lot of stuff, and throw performance completely down the toilet, you need most of that 4GB address space for things other than app VM space. The current linux split is 1G/3G (1 gig to map physical ram into the kernel and store kernel data structures), 3G of total app address space into which devices, files, swap, or physical ram pages can get mapped. You can also set linxu up for 2/2 I think (which gives you more physical ram at the expense of what each app can use) or 4G/PAE (which takes the performance hit and separates the app and kernel the apps get all 4G themselves, and the kernel uses PAE to map up to 32G in a separate way). But the performance hit is very significant unless your app uses almost no system calls or I/O (device I/O has to get copied around into lowmem for this case).

      --
      The Matrix is going down for reboot now! Stopping reality: OK. The system is halted.
    14. Re:Linus too Harsh by CapnFreedom · · Score: 2, Interesting

      But, isn't one of those situations he mentions in the interview (namely, running a large database server) what this chip is designed to be doing?

      Sure, but it doesn't really do it significantly better than some of the more common RISC architectures (Sparc, Power, Alpha), and it's a lot more expensive.


      Do you have any data to back that up?

      According to this press release, the current record for the TPC-Cbenchmark on a 32-way machine is held by an NEC server with Itanium 2's.

    15. Re:Linus too Harsh by linux11 · · Score: 2, Interesting

      The computer industry has come to expect that Morre's law not only applies to processor speed but also to memory. Anotherwords, there has been an a doubling in the amount of memory approx. once every 18-24 months. A home/small office computer in 1979 contain approx. 64K of memory. Now, 24 years later, it is expected between 12 to 16 iterations will have taken place. In reality, the industry is between 13-15 iterations where several workstations come with 512 Megs of RAM upgradable to 1 Gig and Dell is already shipping workstations with up to 1.5 Gig of RAM. While you might argue if anyone needs that much memory, such an arguement should also question if anyone need a 2Ghz or 3Ghz computer. As computer speed increases, the amount of memory also needs to increase otherwise the usefulness of the increase speed will exceed the amount of data available to take advantage of it. The bottom line is that if the data is not available in memory then the additional clock cycles should be considered as nothing more than additional NOOPs.

      So, given the current trend of increase in memory on workstations, it would not be surprising if there is demand for workstations to exceed 4GB of memory by 2006. Aside from the ever cludgy PAE work-around, Intel does not seem ready to deliver such a workstation. At the same time, IA64 is not preferable to proven architechures such as SPARC or PPC for large databases and does a poor job in performing as being backward compatible with IA32. Thus, it does not fit well in a niche in the current server enviroment and will not be ready soon enough when the workstation enviroment demands it.

      What I find the most interesting that no one seems to be discussing is that MicroSoft seems to be trying to remove their dependence on Intel. MicroSoft already showed that they no longer are dedicated to Intel when they provided support for AMD's 3D-Now. With their push to move from Win32 to .Net IL-code and the purchase of Connectix, they may be positioning themselves for the possiblity of releasing a Windows for PPC. This will provide MS an architechure to build from with a more proven track record than IA64 and without several of the limitations of IA32. At the same time, the purchase of Connectix would allow them to provide legacy compatiblity with IA32 code.

  6. wow by BigBir3d · · Score: 5, Funny

    Linus being opinionated and brash? Never!

  7. Re:Itanium 2 is great by Goalie_Ca · · Score: 5, Insightful

    "but speaking strictly from the technological point of view"
    I think that was his point. It's great technically but it sucks in the real world. If its not practical its a shitty architecture IMHO.
    I also think the x86-64 is a more viable solution as well.

    --

    ----
    Go canucks, habs, and sens!
  8. How to improve x86 by MBCook · · Score: 4, Interesting

    Now I'm no programming guru, but it seems to me that the x86-64 architecture is a great one. In fact, the only thing that I could see being done to improve it would be to add more general purpose registers. I believe that the new registers are all GP (IIRC), but I think that makeing them ALL GP (even the older ones) would be good, and maybe bring up the number of registers to a good round 32 or something. Am I missing something glaring wrong? If you're going to toss out all of the x86 stuff (like ia-64), I think you should be able to emulate it in hardware about as fast as current x86 processors can. When Apple switched to PPC, couldn't they emulate 68k code about as fast (or at least faster than 1/2 the speed of) the fastest 68k chips?

    --
    Comment forecast: Bits of genius surrounded by a sea of mediocrity.
    1. Re:How to improve x86 by _typo · · Score: 4, Informative
      but I think that makeing them ALL GP (even the older ones) would be good, and maybe bring up the number of registers to a good round 32 or something. Am I missing something glaring wrong?

      Well, the only reason why the other registers aren't GP on x86 is that there are instructions that use them implicitly. If you don't care about these instructions you can use them as regular registers.

      As an example the EDI register is used by the SCAS* instructions as a pointer to memory. If you don't care about the instructions that use this register like that you're free to do regular operations on the EDI register, it has no limitations on what you can do with it.

      You're right to say that there are few registers though. Before I learned x86 I learned MIPS and there you got all the glory of 28+ GP registers. In the simple examples we did I never needed to push and pop from the stack.

      --

      Pedro Côrte-Real.

    2. Re:How to improve x86 by VAXman · · Score: 2, Interesting

      Did you read the source for the Inquirier article?

      Linus spends a lot of time debunking the "more registers is better" myth. X86 implementations have been addressing this issue for a long time, both by register renaming and by having extremely fast L1 data caches (esp. on P4). Adding more registers will not help speed up code much at all - and anyways requires a recompile and won't help improving legacy code.

    3. Re:How to improve x86 by be-fan · · Score: 4, Informative

      What you're missing is that x86 chips have a ginormous amount of internal rename registers (128 in a P4). The bump to 16 *visible* registers in the Athlon-64 is to allow the compiler optimizer to give more information to the CPU about variable usage. I'm guessing that AMD found that more than 16 visible GPRs really didn't help the compiler's allocation routines any.

      --
      A deep unwavering belief is a sure sign you're missing something...
    4. Re:How to improve x86 by PeterM+from+Berkeley · · Score: 3, Informative

      I attended an information session by someone from AMD at UCB. It was my understanding from his presentation that the tricks they were using to get up to 16 registers without compromising the ability to run existing 32-bit code made it impossible to get past 16 registers.

      They would've liked to have 32 registers, but it simply couldn't be done in a backward-compatible way.

      If you want more information on this, and more than a guess, AMD has much information up on its website.

  9. VAX is definitely the best by fozzy(pro) · · Score: 4, Funny

    The best architecture is still VAX. Clearly string operations at the processor levels is what any procesor needs to be the best and fastest ;}

    1. Re:VAX is definitely the best by snStarter · · Score: 4, Insightful

      For the expensive memory environment for which it was designed the VAX was fabulous. And it was designed to be scalable as well.

      You can snicker at the CISC VAX architecture, but it ran multi-user in less RAM than many processors today have CACHE. Remember 2 MB of RAM was a lot when the 11/780 was introduced. 600 MB drives were considered HUGE and were the size of washing machines.

      Its scalable architecture let a copy of VMS from the lowliest processor be physically mounted on the most capable and boot just fine.

      It had BCD instructions too, not just string.

      But Gorden Bell got a lot more right than he got wrong. And the compact and orthogonal instruction set of the VAX looks pretty good today.

    2. Re:VAX is definitely the best by SN74S181 · · Score: 2, Funny

      Hell, I ran multiuser with five users on my Altos 586 (not a pentium, it was an 8086 machine that had five terminal ports). It was an 8086 machine with 512K of RAM and (get this) Microsoft Xenix (from before SCO).

    3. Re:VAX is definitely the best by EvilTwinSkippy · · Score: 2, Interesting
      We hate the damn thing, but a vax from 1986 is STILL running our digital planetarium projector.

      And you know, it doesn't do a half bad job. If only someone around here know vax enough to do anything BUT the planetarium shows...

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    4. Re:VAX is definitely the best by hughk · · Score: 2, Informative
      The VAX architecture did end up being subsetted and some of the string instructions were dumped (to be implemented via emulation, or in-line code generated during compilation). However, block moves stayed.

      The main point about the VAX arcitecture is that there was very close liasion between the OS architects and the hardware developers, the result being a secure operating system that worked well with little reources.

      Interestingly enough, VMS did get ported to the Alpha, and some of the OS level MACRO-32 assembler code ended up being compiled for the Alpha. Some of the biggest apps still run on OpenVMS Alpha, and I await with trepidation the port to Itanium.

      --
      See my journal, I write things there
  10. Just a little reminder by NedTheNerd · · Score: 2, Informative

    the fact that linus works for a chip maker doesnt really matter because he dosn't develop the chips. he gets paid there to develop the linux kernel.

  11. the return of "worse is better" by banky · · Score: 4, Insightful

    Worse is better

    although the original essay talks about Unix and the LISP machines, it just keeps being true. Linus talks about the "charming oddities", well there you go: worse is better. Try for perfection, and the real world will eat you alive.

    I also think he's right about the masses being what matter; I think Intel is still thinking about the data centre, not Joe Sixpack, with Itanium.

    --
    ZOMG I WOULD LOVE TO KNOW ABOUT YOUR FEELINGS ON MACINTOSH VERSUS WINDOWS, VI VERSUS EMACS, AND HOW YOU'RE NOT A DORK
    1. Re:the return of "worse is better" by On+Lawn · · Score: 4, Interesting


      One of my favorite all-time quotes from a flame war was about this topic.

      "While they sit in ivory towers, the mongols are multiplying in the hills. Soon, the towers will lay waste and the hordes will have moved on victorious."

      Of course he was talking about the ivory tower of PERL, and how the TCL was going to become the dominant force in scripting language. But I've loved the allegory ever since.

      --------------------
      OnRoad: Becuase hacking funner with a hacksaw.

    2. Re:the return of "worse is better" by Anonymous Coward · · Score: 2, Insightful

      With each new generation Intel says, "This is for servers, not for desktops." This was true with the 80286 & '386, but they ended-up on desktops. "For servers, not desktops," is an expression of Intel's
      wish to get into servers. 64-bits are already on alot
      of desktops, but they aren't Intel's chips.

    3. Re:the return of "worse is better" by BeBoxer · · Score: 2, Insightful

      Of course he was talking about the ivory tower of PERL

      PERL is an ivory tower? Wow, that must've been some flame war. Last I checked Perl stood for "Practical Extraction and Report Language", and I'm pretty sure that "Practical" is not a member of the set "Ivory Tower". It's a great quote, I just wouldn't every guess is was about Perl and Tcl. x86 vs. RISC maybe. But not Perl, unless you consider it one of the hordes. :-)

    4. Re:the return of "worse is better" by EvilTwinSkippy · · Score: 2, Funny

      Gotta remember to put some oregano on my words for when I have to eat them.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    5. Re:the return of "worse is better" by Fnkmaster · · Score: 4, Insightful
      Geez, some people really take things too literally. If they looked at Perl and said "we can make something worse than this", they are some sick, sick fucks.


      Perl IS the "worse is better" language.


      Tcl goes so far to the "worse" end, it comes back around the circle at "utterly fucking miserable".

  12. Intel is irrelevant anyway by d00dman · · Score: 3, Interesting

    As much as we depend on intel to push cpu manufacturing techniques to new heights, they have fallen down in the desktop market anyway. Ive lost count on how many new units they've added for poor lowlevel optimizers to keep up with. This with the slap in the face of reduced instructions per tic in the p4 so they could juice up the multiplier and sell "faster"mhz cpu's at double the price is more than enuf for me to stop watching them. Im far more interested in the new power5 coming out of IBM for a 64bit architecture to pay attention to. BTW, what ever happened to alpha 21364? is a 64bit cpu really newsworthy?

    1. Re:Intel is irrelevant anyway by EvilTwinSkippy · · Score: 2, Interesting
      Back in '98 when I was an EE major, we studied the IA-64 in our computer architecture class. It didn't take long for me to realize that damn thing was going to fly like a brick.

      My first model was to try to calculate Easter on assembler. Sure all of that data can sit in a register, with room to spare. The problem is getting the INSTRUCTIONS in and out. Since the damn thing is so superpipelined, you get to drop in one instuction, wait, drop an address to the register, wait, load data into register, wait, ...

      I developed my EASTER test after a disasterous presentation where I had to demonstrate a processor architecture I had designed. It worked great for vector math, but it crawled on everything else...

      Now if they are hiring dropout EE majors to design their next chip, I'll be happy to give them my resume.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
  13. Transmeta and x86-64 by neuroslime · · Score: 2, Informative

    "It's worth noting that Torvalds' employer, Transmeta, has licensed x86-64 so he is likely to have access to Hammer hardware." This sounds really interesting. Any ideas what it means?

  14. Chip Maker by LongJohnStewartMill · · Score: 4, Funny

    Of course, Linus works for a chip maker

    And if trends continue, it could be Old Dutch.

  15. Of course, Linus works for a chip maker... by StevenMaurer · · Score: 4, Interesting

    So he is more likely to know what he's talking about.

    Personally, I'm getting a bit tired of all the inane cynicism that passes for reflective commentary in modern society. While it's true that the world has its villians, it is more true that people often just hold opinions irrespective of their economic interest. I for one, trust that Linus is among these favored many.

    (Not joking this time)

    1. Re:Of course, Linus works for a chip maker... by drinkypoo · · Score: 2, Insightful
      Personally, I'm tired of people who don't realize that cynicism is a good way not to get boned up the ass by someone you thought was your friend, and expect us all to be loving hippie peaceniks.

      Now sure, Linus seems to be a great guy, and I'm not saying he isn't, but he still has his slant to things, conscious or not, deliberate or not, malicious or not. So it does pay to keep all things in perspective. He appears to be more benevolent than most, a competent programmer, and a personable individual, but he is also an employee of transmeta, and that affects what he says or doesn't say.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  16. the benifits of 64bit processors? by nailchipper · · Score: 2, Insightful

    what will the 'masses' do with a 64-bit processor? the best reason to move up to 64bits is to increase maximum memory, and althought memory is now cheap, its not that cheap!

    32bit processors can have up to 4GB of RAM. The most memory i know someone to have is 1GB, and computers most often come with half of that, 512MB. We still have a long way before we hit the 4GB ceiling (a long while!).

    I am actually a tad worried for AMD, since they plan on coming out with the x86-64 pretty soon. And i dont know who will actually buy it (or need to buy it).

    64bit processors belong where they are most needed, specialized machines.

    --


    what is nailchipper?
    1. Re:the benifits of 64bit processors? by u19925 · · Score: 4, Insightful

      i have used several PCs/Sparcs and in all of them, before discarding, I have upgraded memory to 2x to 4x times originally it came with. We don't need more than 4 GB now; but if you were to buy a computer now, you need to make sure that you would be able to upgrade ram to 4x. This means if your need is greater than 1 GB, then 32 bit system is not suitable for you. At present, all my home and work machines (new ones) are ordered with 1 GB. This is at the border of 4x memory expansion possible with 32 bit system. So in a year or so, I will definitely not buy another 32 bit machine.

    2. Re:the benifits of 64bit processors? by questionlp · · Score: 2, Informative

      There are 32-bit x86 processors that can handle more than 32-bit memory addressing, the Intel Xeon processors come to mind (which can address up to 36-bits)... the only problem is that the application and OS needs to support windowing or PAE (Physical Address Extension) to allow use of > 4GB of memory.

      The only problem with windowing and use of PAE is that there is a long delay (from the processor's point of view) to shift the window compared to accessing something within that window. On the bright side, the delay isn't nowhere as bad as having to go to virtual memory and paging files.

    3. Re:the benifits of 64bit processors? by silas_moeckel · · Score: 2, Informative

      Hrm I'm running at least 2 on a workstation (4 512 meg sticks) and as much as I can cram into a server (PIII servers have 1 gig chips cheap enough) Cmon a modern video card has 128 megs of ram on it with the exception of RamBus ram is cheap comparitivly. Run under Linux and Ram can be one of the largest speed ups out there I runn about a 1 gig used memory heap on my workstation and another 2 gigs that ends up beind drive cache for a mid size scsi raid and that cache makes all the difference in the long run.

      --
      No sir I dont like it.
    4. Re:the benifits of 64bit processors? by TFloore · · Score: 2, Interesting

      Benefit? That's easy... anyone that owns a digital camera.

      Because you'll be buying a new one eventually... and in a couple years, you'll be buying 20megapixel digital cameras for $400. In other words, all your favorite geeks will have one.

      And they'll want to edit these pictures on their computers. You wouldn't think that would need so much memory...

      But.

      Try it with 10 pictures at once, cause you want to see which has the best overall shot of everyone in your group standing in front of your friend that had a cool wipeout skiing.

      You make a couple minor changes in 7 of the pictures, adjust color balance, change brightness, red-eye reduction, stuff like that. Your image editing app of course has multiple undo. (Don't they all?)

      You copy a few of the images around, shoving them through the clipboard.

      4 copies each (multiple undo, remember) of 10 images each at 80MB per image (32-bit color on 20megapixels...) that's... 4x10x80MB = 3.2gig of ram you just used.

      Just for editing 10 pictures from your $400 consumer digital camera.

      Now do you see why you want a 64-bit cpu that handles more than 3gig of ram for a single process?

      (Yes, I purposely assumed that a simple "adjust color balance" requires a complete seperate copy of the image for undo, and it probably doesn't... but you get the point now, don't you?)

      --
      This is my sig. There are many like it but this one is... Oops. Frank, I've got your sig again! Where's mine?
  17. Itanium is a pain to optimize by Billly+Gates · · Score: 4, Interesting
    This is the problem with gcc and the architecture in general. Its very had to optimize code with it. Even with Intel's compiler optimized code is only marginally faster if any compared to a top of the line pIV. Gcc obviously will perform alot worse then Intel's own compiler and its the only one Linux can compile with.

    Sun has an interesting( biased) peace on Itanium. If I were buying a server I would avoid Itanium like the plauge. It is possible that Intel could even cancel the whole project and leave customers high and dry. Not to mention software availability is a problem.

    I prefer the risc architecture. I like the idea of keeping things simple and efficient which is alot like structured programming. VLIW does not follow this ethic.

  18. hmm... by kennyj449 · · Score: 2, Insightful

    Can't say I disagree with Linus's logic, but I don't know if this was that great of a decision politically-speaking. It might not matter, but if anything linux *needs* support from big players like Intel and vice versa in order to grow. This won't necessarily hurt, but I doubt it can help matters on the Intel front.

  19. Re:Where's the Harsh Words for Transmeta? by Anonymous Coward · · Score: 3, Insightful

    >>They delivered a revolutionary product.

    It's not 'revolutionary', if there is no revolution.

    People toss this word about like it means 'incremental change'. The Industrial Revolution was a revolution because it entirely changed the way people live and work. How is anything Transmeta done even remotely close to something of this level? It's not.

  20. Take it with a boulder of salt. by caouchouc · · Score: 3, Interesting

    The Inquirer.com isn't exactly a bastion of responsible reporting.

    It doesn't look like an interview took place at all. It looks like they took some choice quotes out of context from the kernel development mailing list to spur some pageviews.

  21. Re:When is the... by sweetooth · · Score: 3, Informative

    If I recall correctly the Crusoe processor is 128bit . It is simply executing 32bit code through "code morphing"

  22. You know it. by grub · · Score: 4, Funny


    Netcraft confirms it: Itanium is dying.

    One more crippling bombshell hit the already beleagured Itanium community when Slashdot confirmed that Linus thinks Intel dropped the ball with Itanium. Itanium now powers 0.00% or all servers. Coming on the heels of a Netcraft survey which plainly states that Itanium has gained absolutely NO market share. This reenforces what we've known all along: Itanium is collapsing in complete disarray.

    You don't need to be a Kreskin to predict Itanium's future. The writing is on the wall: Itanium faces a bleak future, in fact there won't be any future at all because Itanium is dying. Intel has dumped millions into Itanium, red ink flows like a river of blood.

    All major surveys show that Itanium has steadily held its ground at 0.00% use while millions of other processors are produced daily. If Itanium is to survive at all it will be among CPU dilettante dabblers and hangers-on. Nothing short of a miracle could save Itaniu, at this point in time. For all practical purposes, Itanium is dead.

    --
    Trolling is a art,
    1. Re:You know it. by MBCook · · Score: 5, Funny
      What are you, mad man? Last month the Itanium powered 0% of all servers on the internet. This month it powers 0% of all servers on the internet.

      0% * 10,000% = 0%

      Therefor, the Itanium grew 10,000% last month. The Itanium is a major hit! Get your numbers straight man. Geeze.

      :)

      --
      Comment forecast: Bits of genius surrounded by a sea of mediocrity.
  23. Re:the reason the Itanic is a bomb.. by umofomia · · Score: 2, Informative
    Trouble is Windows runs on i386 and uh uh uh, it runs on i386 and uh uh uh. Well, it only runs on souped up versions of the 80386, and I'll bet it'll never run on anything else.
    Completely untrue. Microsoft has builds of Windows in ia64 and even AMD's 64-bit architecture. They certainly will be released with Windows Server 2003.
  24. This is probably a troll, but... by tunah · · Score: 4, Informative
    Slang:

    Mickey-mouse == poor quality, inconsistent

    Outfit == organization, company.

    --
    Free Java games for your phone: Tontie, Sokoban
  25. Re:Itanium 2 is great by Billly+Gates · · Score: 3, Informative
    "Itanium 2 is a great architecture/....


    What the hell are you smoking? I want some.

    Every risc archeticture with the exception of the sparc3 performs better. Especially IBM's power4 and the upcomming power5.

    Also there is more then speed when comparing architectures. Itanium is a terrible platform to write compilers for. Alot of optimizations which are tradionally done in the chip at runtime itself must be set by compiler options. Not all of it can be done efficiently like this.

    Speedwise Alpha is getting old now but still is the fastest chip around untill the power5 comes out this fall. For coding and optimization, Mips is the best cpu around.

  26. but the chip is a BARGAIN! by TheGratefulNet · · Score: 4, Funny

    look here:

    pricewatch

    almost $3000 for the chip. wow, and for so many mhz, too...

    --

    --
    "It is now safe to switch off your computer."
  27. Re:Why 64 bit by grub · · Score: 5, Funny


    Actually that's a good question. I think chipmakers should slow down a bit and enjoy life. Perhaps meet halfway with a 48 bit chip...

    --
    Trolling is a art,
  28. what matters by 1nv4d3r · · Score: 3, Funny

    Code size matters. Price matters. Real world matters

    If only on-chip instruction set morphing mattered...

    (sorry, but it's true...he's living in a glass house on this one.)

  29. Re:Code size? by hpa · · Score: 5, Informative

    Code size matters because *cache* isn't cheap. Worse, you can't make L1 cache arbitrarily fast without slowing down your chip big time.

  30. Re:Code size? by ChadN · · Score: 2, Informative

    Number 2 (make cache bigger) is easier said than done, and works against number 1 (cost).

    --
    "It's overkill, of course. But you can never have too much overkill." - Anonymous Slashdot Coward
  31. Re:Code size? by maugt · · Score: 2, Insightful

    Memory is cheap, but the architectures to try and contain all of the memory certainly are not. Try telling your fortune 100 customer that they need to go buy a bigger sun/hp box because you need 80gb of ram to run your application and they start making you reduce memory consumption.

    The "memory is cheap" thing is silly, and just encourages people to be lazy and not layout data structures in the optimal way, and/or use efficient data containers.

  32. Itanium2 is the fastest floating-point processor by mrm677 · · Score: 4, Informative

    Check the latest SPEC CPU benchmarks. The Itanium2 has the fastest floating-point score and is no slouch in the integer tests either. It will improve. Linus will eat his words in a few years.

  33. Re:Itanium 2 is great by Anonvmous+Coward · · Score: 2, Insightful

    "I don't care what Linus says about it."

    Makes ya curious why anyone should care about what he says, duddn't it?

    I'd rather hear about what people can do than hear people complain about what they can't do. Why? Because you buy hardware to suit your needs, not suit your needs to your hardware.

  34. Re:Most home users run Windows. by Anonvmous+Coward · · Score: 4, Funny

    "Did anybody notice a 2X performance boost moving from Windows 3.1 on 16bit MS-DOS to a nominally 32bit Windows 95 OS?"

    I did. There was so much less time in between crashes that I learned to move quickly!

    Well, that fud filled anti-MS joke should earn me a Karma point or two.

  35. Re:Why 64 bit by Billly+Gates · · Score: 3, Insightful
    The memory limit is soon approaching and this is a nice thing to have.

    Would you really want to return to the dos himem.sys, memmaker, extended and expanded memory, and autoexec.bat hacks again? Sure they were not needed for the first several years of DOS when people had only 512 kb of ram but the situation changed quickly. Its this is what first turned me off from Microsoft. If I had 8 megs of ram and had 6 free why couldn't I run dune2? Do I not have a 32-bit chip? I had to create a custom boot disk with autoexec.bat just to run the game. That is screwed up.

    A Hammer is nice just like a 386 was nice to have run 16-bit software. They were particularly usefull in Windows3.11 since it actually had 32-bit disk access while everything else was 16-bit. The hammer is fast at running 32-bit software and is easily upgradable if customers want to add ram. They do not understand techno mumble jumbo. Its not like you can explain the base of 2 math when Joe just wants to purchase a 4 gig ram stick and wonders why Windows wont recognize all the ram.

  36. Re:AMD by Xerithane · · Score: 2, Informative

    [AMD] Was recently considering leaving the CPU business altogether

    Uh.. what? AMD can't leave the CPU business. That would leave them with.. Flash memory. We all know how much revenue that brings in for them.

    You have any links to support this claim?

    --
    Dacels Jewelers can't be trusted.
  37. Re:AMD by Anonvmous+Coward · · Score: 2, Informative

    AMD Was recently considering leaving the CPU business altogether."

    Um when was that? The only thing I recall was a Slashdot article with a misleading headline...

  38. Re:Code size? by VAXman · · Score: 2, Insightful

    To quote Linus: "And I further bet that using a native distribution (ie totally ignoring the power and price and bad x86 performance issues), ia-64 will work a lot worse for people simply because the binaries are bigger. That was quite painful on alpha, and ia-64 is even worse - to offset the bigger binaries, you need a faster disk subsystem etc just to not feel slower than a bog-standard PC."

    Yeah, RISC workstations always seemed sluggish to me for interactive use. Not sure if it's really due to the increased time to load binaries, or some other optimization issue.

  39. Inquirer does not do the post justice by NullProg · · Score: 4, Informative

    The read from theinquirer.net is all wrong. The slashdot story line is also wrong. It does not state at all what it implies. Here is the link to what Linus actually wrote:

    http://www.ussg.iu.edu/hypermail/linux/kernel/03 02 .2/1909.html

    Now, I agree with Linus on the PPC MMU issue. Can anyone tell me what he means by "baroque instruction encoding"? I have been doing x86 and 68k assembler for a long time, I have never heard of this.

    Enjoy,

    --
    It's just the normal noises in here.
    1. Re:Inquirer does not do the post justice by wmshub · · Score: 2, Informative
      Can anyone tell me what he means by "baroque instruction encoding"?

      "Baroque" in this case means "overly complicated, usually in a bizarre way." If you ever wrote an x86 assembler or disassembler, you would know exactly what Linus was talking about.

    2. Re:Inquirer does not do the post justice by leroybrown · · Score: 4, Funny

      Can anyone tell me what he means by "baroque instruction encoding"?

      well, you know what they say: if it ain't baroque, don't fix it.

      i am going straight to hell for that one.

      --
      Founder, Americans Allied Against Alliteration
    3. Re:Inquirer does not do the post justice by cbiffle · · Score: 5, Informative

      Probably (IANLT) he's referring to the various prefix encodings and variations for instructions. From my x86 manual, "Machine language instructions...vary in length from 1 to as many as 13 bytes. ...There are over 20,000 variations."

      Now, granted, that rather large number probably includes different target registers, but compared to (to use your example) the 68k, the x86's encoding format is just -weird-:
      16-Bit:
      -An opcode. Either 1 or 2 bytes.
      -Some flags and/or target register. 1 byte, optional
      -Displacement. 0-2 bytes.
      -Immediate. 0-2 bytes.

      32-Bit:
      -Optional address size prefix byte.
      -Optional operand size prefix byte.
      -Like above, but with 0-4-byte displacement/immediate and optional scaled index byte.

      Now consider the fact that many opcodes implicitly reference registers. Decoding this instruction set by hand would be a royal bitch, and it's exactly the sort of configuration that RISC targeted for demise.
      However, Linus makes a good point in the e-mail, which is (paraphrased) that the x86 encoding is basically a very good compression algorithm for its code. While the RISC machines that use 32 or 64-bits for every instruction may be more regular, their code does tend to be larger.

      The ironic thing, in my mind, is that the IA-64's encoding is in many ways -more- baroque than the x86s! Instructions in bundles, bundles in groups (or is it the other way around? I never remember), flags at the end to specify how to interpret the instructions before -- it's an interesting take on VLIW, in that it doesn't specify the number of execution units, but YUCK. :-)

  40. Re:Obsessive - You've got it wrong by MBCook · · Score: 5, Insightful

    Linus isn't saying he won't let it in. He's simply saying that the thinks it's not a good arch based on technical merit. He'll let it in. He never said he wouldn't. He's just saying he doesn't like the way the chip was designed (what choices they made, etc).

    --
    Comment forecast: Bits of genius surrounded by a sea of mediocrity.
  41. Re:Itanium 2 is great by Anonymous Coward · · Score: 5, Insightful

    What are YOU smoking?

    Optimizations done at compile time are far better than optimizations done at runtime. At compile time, more is known about the structure of the program, where the flow of the program will be going, and more time intensive optimizations can be done than ones done in realtime in the cpu.

    Itanium is slower right now, but as compilers with optimizations tailored to it come out, it has the potential to kick every RISC processor's ass. The reason for this is that RISC processors are bogged down by doing the optimizations at runtime that Itanium doesn't have to care about. This means the Itanium will have the same or less stalls and more efficient use of the processor.

    Go read up on compiler optimizations to see why cutting out the middleman of instruction sets is a good thing.

  42. And Itanium prices are just insane! by MtViewGuy · · Score: 5, Interesting

    I think the problems with the Itanium boils down to this:

    1. The CPU's are insanely expensive. They make the majority of x86-architecture Intel Xeon CPU's look like a bargain.

    2. Where are the server applications that take advantage of the Itanium CPU? They're not exactly widely available, to say the least.

    3. Programming for Itanium is still a somewhat iffy proposition.

    Meanwhile, AMD's Athlon 64/Opteron offers these advantages:

    1. The CPU will definitely NOT be insanely expensive to purchase.

    2. Programming for the AMD x86-64 architecture is not going to require kiboshing a bunch of legacy programming tools and starting from scratch--it is a straightforward process to convert today's programming tools to take full advanratge of the x86-64 native mode.

    3. Because the programming tools are so readily available, both operating systems and applications for the Athlon 64/Opteron will be available widely by the time the new AMD CPU's are finally released for sale. Already, UnitedLinux is porting Linux to run in x86-64 native mode, and Microsoft is very likely readying versions of Windows XP Home/Professional and Windows 2003 Server that will run in x86-64 native mode.

    Meanwhile, Intel supposedly has a 64-bit x86-architecture CPU codenamed Yamhill that has developed. However, given we don't know how Yamhill implements 64-bit x86 instructions Intel will have to do some VERY serious convincing to Linux kernel programmers and to Microsoft to write Yamhill-native code--and Intel is far behind the AMD efforts.

    1. Re:And Itanium prices are just insane! by MtViewGuy · · Score: 2, Insightful

      I have to disagree with your assessment about the Pentium CPU at the time it was introduced.

      Remember, the Pentium CPU runs x86 architecture code natively, so it did not require an expensive starting-from-scratch mentality to take fully advantage of the CPU like you have to do with the Itanium CPU. In short, programs that ran on the 80486 CPU could run on the Pentium CPU with no modifications.

  43. Re:Code size? by mrm677 · · Score: 2, Insightful

    Code size matters because *cache* isn't cheap. Worse, you can't make L1 cache arbitrarily fast without slowing down your chip big time.

    This is irrelevant with the trace cache in the Pentium4. Instructions are decoded into micro-ops, by "traces" which are sequences of executed ops, and stored in the cache. Compact x86 CISC instructions are not stored in the trace cache.

  44. Re:Most home users run Windows. by addaboy · · Score: 3, Insightful

    No, because win95 is a piece of shit OS, but you point is valid. AMD should take this opportunity to stick to Intel however. Intel's been playing this game with comsumers' mind that the bigger the number the better the processor. Turnabout is fair play and I hope AMD takes this opportunity to bombard the computer buying public with 64 bit ads. I'd love to see Intel's answer to that, "uuhhhhh, bigger is better only if it says Intel Inside, yeah that's the ticket!"

  45. which architectures? by Anonymous Coward · · Score: 2, Informative

    MIPS is behind Itanium in performance. HP-PA is behind Itanium in performance. SPARC3 is behind Itanium in performance. SPARC64V is behind Itanium in performance. Alpha has higher specint but lower specfp. Power has higher specint but lower specfp. Both major current IA32 processors have higher specint, but they are slaughtered on specfp.

    That's without even mentioning TPC or Java benchmarks which make Itanium look just as good or better.

    1. Re:which architectures? by PurpleFloyd · · Score: 3, Insightful

      Ooooh, benchmarks. Any company can pick and choose good benchmarks for their chip; you aren't even giving numbers. I want to see some real-world numbers, preferably ones that relate to the Itanium2's ability to handle non-preprocessed code (as other posters have mentioned, trying to work with anything dynamic throws all of Itanium's fancy explicit parallelization out the window) Put up or shut up.

      --

      That's it. I'm no longer part of Team Sanity.
    2. Re:which architectures? by blair1q · · Score: 2, Interesting

      Wrong. They design chips to improve scores on benchmarks. Otherwise you wouldn't be able to tell they got better.

  46. Re:Why 64 bit by nomadic · · Score: 2, Funny

    they were not needed for the first several years of DOS when people had only 512 kb of ram

    Wha? I would have given my right arm for 512kb. Mine had 128k. Next step up had 256k. Geeze, 512k? We'd have been in happyland...

  47. Re:Itanium 2 is great by lingqi · · Score: 3, Interesting

    wow, but you know why they call it "power4" and "power5" though.

    the damn things have THOUSANDS of pins (like 8000+ on some iterations, iirc?), and drains MASSIVE current - as in, the kind that makes your dual O/C'd athlon look like an LCD clock.

    I think IBM's power4/5 chips are as well "unsuitable for real world" as well, but for some different reasons. That's not to say they won't be put into some niice servers though - and that's the point, itanium2 wasn't meant for desktop (for at least a while) anyway, and I think in their world they play some different rules.

    --

    My life in the land of the rising sun.

  48. Re:Itanium2 is the fastest floating-point processo by mrm677 · · Score: 2, Informative

    The SPEC benchmarks are real-world. That's the point of them, and they've been used over the last 10 years to judge the real performance of a processor.

  49. Re:Most home users run Windows. by captredballs · · Score: 2, Informative


    Don't forget that Itaniums are clocked far lower than P4's. The difference is that Intel doesn't plan on marketing 64bit chips to the consumer for a couple years, while AMD has their sights set earlier due to the expected lifespan on the Athlon-family and that their future is bet on 64bits.

    I guess the main thing to note is that the P4 will be around for a least two years longer, where you can't say the same thing about Athlon family, at least at the high end.

    Also coming into the picture is that Apple may have 64 bit workstations in ~ a year.

    --

    I suppose I'm not too threatening, presently, but wait till I start Nautilus
  50. Re:Where's the Harsh Words for Transmeta? by heptapod · · Score: 2, Funny

    Yes, revolutionary.

    Just like the Segway.

  51. Any of you read Fortune? by AxelTorvalds · · Score: 4, Insightful
    The Feb 17th issue broke it down nicely. You can read it on their web site. Basically. The conventional wisdom is that there are exactly 2 players in the 64bit arena: IBM and Intel. IBM isn't jumping on the Itanic either, at least not in any big way other than building some low end servers with it.

    AMD is the wildcard. If x86-64 is the bomb and takes off like AMD is betting on it. Intel lost the 64bit war for many years. IBM and maybe even Sun will quietly (well sun doesn't do jack shit quietly) push x86-64 for the low end while IBM POWER4 and POWER5 and POWER6 down the road run the big end.

    Basically Intel needs something like Sun to jump on it IA64 to really give it some credibility and they don't sound real eager to. IBM sounds like they are down for the fight. Alpha, MIPS, PARISC are all pretty dead; long term and relatively speaking. Meanwhile, if Intel doesn't get on the shit quick then they'll have to support x86-64 too and that's the real death blow to IA64.

  52. Re:Itanium2 is the fastest floating-point processo by dpletche · · Score: 3, Insightful

    SPEC scores tell me almost nothing useful. The code to run SPEC benchmarks is emitted by tricked-out compilers whose whole purpose is to emit hand-crafted assembly code specifically tuned to run those SPEC benchmarks. It doesn't tell me anything about how well common programs and subsystems perform at common tasks. You might as well buy a family car based on the quarter-mile time at the racetrack for a like-model car with a supercharger and dangerously-tweaked ignition timing, burning 120 octane racing fuel.

    In five years, if the Itanium isn't a huge success, will you eat your words?

  53. Anyone remember the Pentium Pro? by a-freeman · · Score: 3, Interesting

    Back when it was released, it was roundly maligned for offering shitty performance for Win95 users. "Buy a Pentium 233MMX" all the magazines screamed.

    Well, the PPro turned out to be one of the best chips of its day, and the 200Mhz version performed within 5% of the Pentium II 300mhzs that were released 18 months later. I still have dual-PPro system running my CVS/MP3/print/etc. server.

    Linus may be a god in the linux software universe, but I wouldn't discount Intel on this just yet.

    1. Re:Anyone remember the Pentium Pro? by warrior · · Score: 3, Interesting

      There comes a time when you have to chuck backwards compatibility in the name of progress. The problem is we've got all this existing software that's been tailored for x86. Right now we're experiencing similar problems in the automotive industry, which is much more stodgy than the semiconductor biz. We're stuck with ancient internal combustion behemoths because of people's unwillingness to accept change. Likewise, we're stuck with cpu tech that's relatively much older than our crappy auto tech given the pace of the semiconductor industry.

      Let x86 go! Live in the now! Itanium is a great CPU. Sure, the first iteration sucked, but look at it in the same light that you view electric automotive designs. Now take a step back and look at Itanium II. Itanium II is currently the leading performance CPU for floating point code by a small margin over Power 4 ( which, I might add, costs 2-4x more than an Itanium system ). In four months time Intel is said to be releasing a frequency bump to Itanium II with even more L2 cache. IA-64 performance scales almost linearly with frequency! (and no, you don't even have to recompile to reap the benefits of increased frequency) When Dell saw that the Itanium II performance rumours were true, they did a 180 and are now playing catchup to other vendors. IBM has hedged their bets by building Itanium II systems, Sun is dying, and Opteron will be DOA.

      --
      Intel transfer the difficult from Hadware to software, for get more power, programmer need more technology. -- chinaitn
  54. Even better... by ubugly2 · · Score: 3, Funny

    ...The shipping is free.

  55. Re:Itanium 2 is great by Waffle+Iron · · Score: 4, Insightful
    Optimizations done at compile time are far better than optimizations done at runtime. At compile time, more is known about the structure of the program, where the flow of the program will be going, and more time intensive optimizations can be done than ones done in realtime in the cpu.

    As time goes by, computer languages are trending towards more dynamic behavior. This tends to favor things like JIT compiling and linking into already running programs. Fewer people are going to be able to afford the luxury of spending hours to preprocess their code to fit into an extremely static ("explicitly parallel") hardware model. This will be especially true when chip makers treat their rocket science static compilers as a separate profit center.

    Not to mention, the CPU is the one that is actually in the position to know what optimization is needed right now based on the currently running data set. Given that there is usually a several year lag between the latest CPU developments and widespread compiler support, I'd go for a CPU that knows how to do its own tricks. (Hasn't the Itanium architecture been nailed down for almost a decade now? And we're still waiting on better compilers for it?)

  56. x86 will win...and that's too bad by Pemdas · · Score: 4, Interesting
    x86 -- it's cheap, fast, available, and compatible. That's why it consistently wins in the marketplace.

    That doesn't mean it's the best solution. Merely the one that's going to win. Architecturally speaking, x86 is one of the biggest loads of crap to come along since...well...hmmm...I can't think of anything crappier off the top of my head.

    Extreme register pressure. Segmentation models that make you want to retch. Hacks (PAE, anyone?) that leave any sane designer gibbering incoherently.

    If you read the thead, Linus' main argument seems to be "to get good performance, all the other architectures have had to do complex things in hardware, so there's no real hardware simplification in going with a 'better' architectural design. Plus variable length opcodes are a natural cache optimization!"

    I respect Linus a great deal, but he's talking out of his ass here. I agree that IA-64 may be best relegated to some academic's wet dream, but just about any of the major RISC architectures are big wins over x86. Intel and AMD have worked miracles with x86 to get it to run fast, but at a staggering engineering cost. The teams working on RISC chips tend to be a fraction of the size to come out with a high-performance chip. If the RISC houses had an engineering team of comperable size (and access to the same bleeding edge lithography processes) it would easily be worth an extra 25% in performance, minimum.

    If you look in the embedded world, just about anything that requires serious embedded performance is RISC based (MIPS/ARM, mostly), simply because it decreases the engineering work involved by an order of magnitude. Plus, writing low level software for just about any RISC chip is loads easier than for x86.

    Unfortunately, x86 is here to stay for the foreseeable future. Intel killed Alpha, not by buying it, but by doing a great job of pushing cheap x86 performance to the same level as Alpha, often surpassing it in later years. The same thing is happening to the other workstation-class RISC vendors, and, honestly, to Itanium, too. I don't see any reason to believe the march to x86 hemogeny outside the embedded world is likely to slow anytime soon.

  57. Uh... it's already there by Anonymous Coward · · Score: 2, Informative

    ia64 is in the mainline kernel. At least Debian and Red Hat have released, stable distributions for it. Red Hat even sells support for it.

    ia64 is "in there" as much as alpha and sparc, even if it isn't quite as well tested.

  58. Re:Why not a 69-bit chip? by ubugly2 · · Score: 2, Funny

    If you overclocked it it would eat itself..

  59. Re:the reason the Itanic is a bomb.. by wideBlueSkies · · Score: 3, Informative

    NT was built on the i860 first, then ported to the i386 arch. More accurately, MS engineers emulated the i860 untill the chip was ready.

    MS did this to make their new OS more or less platform independant. They didn't want to get 'stuck' on the x86.

    Slashdot story here . Article here .

    --
    Huh?
  60. TPC would say differently by Yankovic · · Score: 2, Informative

    The second highest rated TPC box in the world is running Itaniums...

    http://www.tpc.org/tpcc/results/tpcc_perf_result s. asp?resulttype=noncluster

  61. Listen to Torvalds about making money by jbischof · · Score: 4, Interesting
    because he knows.

    Linux made him ... oh wait nevermind.

    Transmetta makes a lot of ... oops there I go again.

    Intel is a company that time and time again proves it knows how to make money. It may not always support the crowds it should (like /. readers and superusers) but they are still making money.

    Sure there are lots of difficulties going to a new ISA. Especially at the server level. And yes Itanium has had some performance problems, especially in its first revision, but then again when was the last time you saw a company produce a 1st generation microprocessor and have it do well?

    IA64 offers tons of advanced ILP concepts and OS concepts that, when correctly implemented, can increase performance drastically. (if your looking for examples, data speculation, control speculation, predication, registers with kernel access only, rotating register files, a much larger register set, etc).

    The problem may be, it puts a lot of complexity into the Compilers, and compiler technology isn't good enough for Itanium yet.

    But then again, what do I know, Linus has made more money than I have. I just like arguing the other side while everyone else screams about how the Itanium will die.

  62. ARRGH! by pr0ntab · · Score: 2, Insightful

    Let me repeat this one more time:

    NO GAMING CONSOLE IS 128-bit (nor will they be 256-bit)
    The PS2 is a 32-bit system. It has a 32-bit wide address space and word space. It happens to have a quad-word SIMD execution unit. By this logic, the MMX-enabled pentium is also 128-bit.

    Okay... got that out of my system.

    What the 64-bit address space WILL do is make OS design simpler. This is an important win for developers. I understand OS start-up times will be vastly improved because applications, libraries, etc. will all be able to load at static addresses in memory, all precomputed. It'll also make database-as-filesystems easier to implement.

    Forget gaming machines, this is BIG stuff, a big step, and Intel is foolish to ignore it.

    --
    Fuck Beta. Fuck Dice
  63. A Slashdot Sin... by SleezyG · · Score: 3, Informative

    I know it's not very nerd-like to say that Linus is wrong and that AMD sucks, but in the case of the Itanium, that is exactly how I feel. Intel/HP's Itanium architecture is perhaps the most advanced processor to hit the market and has tremendous potential (from a Computer Architecture point of view). Because it's so new, its performance will be aweful, but shall improve with time. Anyone remember the SuperSparc? It performed horribly and was soon replaced by the UltraSparc. As will the Itanium II replace the Itanium.

    As for the emulation/legacy code argument, I say screw it. gcc is already ported to IA-64. And as a Linux user, most of my favorite open source programs can be ported with little difficulty.

  64. I completely disagree by Edmund+Blackadder · · Score: 4, Insightful

    First of all it is not very smart to try to reduce code size by putting complicated instructions in the processor architecture.

    A succesfull architecture may be used for 20 years, and there is no way you can know which complex instructions will be most usefull/popular in several years. And when you start making upgraded chips for a design, these complex instructions will be a real pain in the ass.

    The x86 architecture is a perfect example - it is a mess and many of its instructions are not used at all. The x86 is succesful because the way history played out - it was put on the first pcs, and the incredible numbers of precessors sold allowed intel to put more development money into that architecture than any body else was able to put into theirs. And large initial investments, and large sales numbers mean that individual chip prices can be lower.

    Nevertheless, the alpha and some of sun's chips can still compete with intel in the server environment, with much smaller investments and worse production technology. That basicly shows the weakness of the x86 architecture.

    When you have multiple pipelines and multiple stages per pipeline the size of your chip will grow exponentially to the number and complexity of your instuctions. Eventually adding more pipelines will be pointless and then you are reduced to adding cache as the only way you can improve your architecture.

    For a Risc architecture, multiple pipelines will cost less overhead and more can be used. Processor performance can be increased by adding more pipelines without having to increase speed.

    Intel has the money and the clout to make a succesful risc architecture. It is brave of them to do it, but from an engineering point of view it is the only right thing to do.

    AMD will support x86 because they do not have the clout to force a new architecture on the world. It is a completely understandable policy, but then again will result in worse performance (unless their engineers are somehow much more brilliant than intel's).

    Of course the real world matters and in the real world almost everyone uses x86. But if someone can change that it is intel.

  65. Re:Itanium 2 is great by redgren · · Score: 2, Informative

    They have so many pins because it is not a single cpu. It is an MCM (multi-chip-module). Each Processor "Brick" contains up to 8 CPU cores.

    Current draw is around 250-300A for an MCM. Alot? Hell yeah, but your average athlon XP pulls about 35A. 8 x 35 = 280.

    So, not so big a difference.

  66. Re:Linus holding on to his security blanket? by squiggleslash · · Score: 5, Informative
    I just don't get RISC chips. Why they want to remove things that make programming easier is beyond me.
    It's not stuff that "makes programming easier" that gets removed in RISC, its the more complex try-to-do-everything instructions which are a pain to implement in silicon and which ultimately can just as easily be done via a sequence of simpler instructions that may well, for a programmer who's actually programming at that level, be easier to understand.

    Early on the chief advantage of the approach was that you could use the freed silicon for things like extra registers, and that's exactly the approach taken by Acorn (now ARM) and the PowerPC range. Would you prefer to have eight registers and a single byte copy-block instruction, or 64 registers and have to replace that copy-block instruction with (*gasp*) three simpler instructions?

    (Actually, I guess that depends on how good your cache is. There's no such thing as a free lunch)

    --
    You are not alone. This is not normal. None of this is normal.
  67. Linus.. by irabinovitch · · Score: 2, Insightful

    Ya, but he works for trasmeta. If he were trying to pimp the company he works for he'd be pushing some Transmeta chip not AMD's stuff. Then again I could be wrong and there could be some connection between AMD and Trasmeta or some "The enemy of my enemy is my ally" type of deal.

  68. Microsoft will decide the outcome of this battle by RelliK · · Score: 4, Insightful

    Without Windows for x86-64, AMD is dead. No, Linux will not save it. However, the moment Microsoft releases Windows for x86-64, Itanic is history. The market will overwhelmingly favour x86-64 because of the much lower price (I expect at least 3-4 times lower, cosidering that the Itanic CPU alone sells for over $3000), and perfect backwards compatibility. Itanic's ia32 support is so pathetically slow that it may as well not exist, so a move to Itanic requires you to replace _all_ your software, which ain't cheap, while x86-64 allows you to do incremental upgrades. So, taking simple economics into account, Itanic will go the way of that ship and AMD will emerge the winner... provided there is a version of Windows for x86-64. Without that there is no point of talking about "64 bit desktop" market because it just won't exist. So what is Microsoft doing?

    --
    ___
    If you think big enough, you'll never have to do it.
  69. Re:Itanium 2 is great by Tailhook · · Score: 3, Insightful

    (Hasn't the Itanium architecture been nailed down for almost a decade now? And we're still waiting on better compilers for it?)

    That's the key isn't it? Itanium demands breakthroughs in compiler technology. Will this happen?

    I dunno.

    --
    Maw! Fire up the karma burner!
  70. That's just the beginning. by Rimbo · · Score: 3, Insightful

    Sometimes, just becomes someone HAS an economic interest in something, and IS interested in seeing something fail/succeed, does not automatically invalidate the point he/she makes. Linus didn't just put forth an unsubstantiated rumor or point of view; he backed his points up with facts and reasoning. If he is biased, show facts and reasoning to counter the bias, or else you are no better than the FUD-mongers when you write him off.

  71. Linus is too young to remember by imnoteddy · · Score: 3, Informative
    From the article:
    Torvalds wrote that Intel had made the same mistakes "that everybody else did 15 years ago"
    when RISC architecture was first appearing.

    RISC first showed up on the commercial radar screen almost twenty years when MIPS Computer Systems
    was formed. But people at Stanford (and Berkeley, IIRC) had been publishing papers about
    RISC for four or five years before that, and people at IBM were working on it even before that.

    And the CDC 6600 was a RISC machine in the 1960s. If you don't believe me, ask Cray's Chief Scientist Burton Smith.

    In seeking the unattainable, simplicity only gets in the way. -- Alan Perlis

    --
    No electrons were harmed creating this post, though some may have been subjected to electrical and/or magnetic fields.
  72. Re:When is the... by sweetooth · · Score: 3, Interesting

    Yeah, that's a given. I was just trying to point out that the Crusoe is already a 128 bit processor. So you won't be seeing a 64 bit Crusoe any time soon. Maybe a Crusoe executing instructions intended for an X86-64, but that's just an extension of the code morphing software. Even then I think the new astro chip would be more likely for that application since it looks to be meant for high density low cost blade servers with more punch than the Crusoe.

  73. Why the AMD logo? by Psykechan · · Score: 2, Funny

    Some guy from Transmeta badmouths Intel's new processor and Slashdot files this under AMD?

    I know that AMD has something to gain here but shouldn't this be under a different topic? Maybe when it gets reposted it'll be correct.

  74. Uggh....what does really matter? by zerofoo · · Score: 2, Informative

    Who is Itanium good for? Who is G4 or Power4 good for? What is X86 good for?

    That's like asking what is a saw, hammer and screwdriver good for...they each have an application.

    All these architectures have their good points and bad points. I've written sparc and x86 assembler and I can't say that they are better or worse than each other....just different.

    At this point the hardware is MOOT. Unless algorithms get significantly better soon, the hardware won't matter. Sure, we'll get mega memory address space with any 64-bit architecture, but what does that get you? More memory address space? Big deal...so you've got big memory space...that won't make NP=P any time soon.

    -ted

  75. x86-64 is not a simple recompile by TFloore · · Score: 3, Insightful

    It is still a full port, if you want to get the benefits of the 64-bit architecture. If you want to keep running 32-bit x86 code, don't even bother recompiling. But don't make the mistake of thinking that switching 32-bit x86 code over to x86-64 is a simple re-compile.

    It is still a port, with all that is included in that awful word.

    Do you understand how little 64-bit safe code there is that runs on 32-bit x86 systems? Most of the linux kernal is already 64-bit safe, because it has been ported to so many other 64-bit architectures already. And it still wasn't a simple "just recompile it".

    Speaking specifically to C programs here, porting from 32-bit to 64-bit is not a fun process. A variable declared as "int" switches in allocation size. This is good and bad.
    fread (fp, sizeof(int), &var); //(forgive me if I have the parameters backwards, I'm doing this from memory. And notice that I'm a bad programmer, I didn't check the return value.)
    Congratulations, you just killed all your existing data files. And if you happened to read a 32-bit pointer from that data file (any structures that you write directly that contain a pointer write a pointer... you'll throw the pointer value away when you read the structure back in, but you still have to read the proper data size), and then assign a pointer to it... Oh, you're going to have all sorts of fun playing with that.

    Yes, this may only be an issue with "bad" C code that assumes it will ever only run on a 32-bit platform... That probably covers 99% of all x86 C code out there, for any OS you care to name.

    Don't pretend it will be easy moving from 32-bit x86 to x86-64. For most programs, I assure you, it will be non-trivial. Anything that does direct memory allocation will have to be checked very carefully. Anything that does binary file i/o will have to be checked very carefully. Oh, and anything that uses "magic" numbers will have to be checked... Have you ever used an if conditional for an int of the form
    if (i == 0xFFFFFFFF)
    congrats, you just assumed 32-bit for your architecture.

    64-bit clean code is the exception, not the rule.

    --
    This is my sig. There are many like it but this one is... Oops. Frank, I've got your sig again! Where's mine?
    1. Re:x86-64 is not a simple recompile by psamuels · · Score: 2, Insightful
      Speaking specifically to C programs here, porting from 32-bit to 64-bit is not a fun process. A variable declared as "int" switches in allocation size. This is good and bad.

      You had me until that part. While it is technically possible for a compiler and runtime environment to use a 64-bit int size, I am not aware of any that actually do. Precisely because of portability problems with legacy code. Instead, the common convention (which I know is used in HP-UX, Digital Unix, Linux and IRIX, and probably most other 64-bit platforms as well) is that an int is 32 bits, and a long int is 64 bits, as are all pointer types. (This is known as the LP64 model.)

      The three biggest 32-bit-isms / portability problems I've seen: (a) assuming you can stuff a pointer value into an int (correct answer: either use a union, or (as seen in the Linux kernel) a long int); (b) [not strictly a 32-bit-ism] ignoring alignment issues which may exist for anything bigger than a char; (c) not handling sign extension properly in implicit casting (on 32-bit platforms such casting often doesn't change the word size).

      --
      "How can you claim that you are anti-crack, while still writing a window manager?" — Metacity README
  76. Re: *I* need 64-bit to use more RAM for... by Frobnicator · · Score: 4, Interesting
    You say you need more than 4GB for video editing and 3D rendering?

    Sorry while I rant, but you just stomped on one of my nerves. (Unless your comment about neededing that much RAM was a complaint about Adobe or their direct *cough* compeitors -- sucks to be you.)

    <Old Geezer Mode> In one case, not long ago, a fellow lab-rat Eric Mortenson had sold his research and tools to Adobe, but part of the poorly-written agreement said that he couldn't upgrade his work station. So he finished his Ph.D on a 386 with 32-MB of RAM, while the rest of us in the lab were using Pentium 3's, DEC Alpha's, and various SGI boxs. Eric's algorithms ran great on the newer PC's even though he couldn't develop them on the new boxes. Other with Adobe (NOT on that web site interestingly enough) needed the DEC Alphas (64-bit machines) with scads of memory and much more running time to do a similar implementation of Eric's algorithms. </Old Geezer Mode>

    3D rendering doesn't take that much RAM. As a 3D graphics researcher and developer, I have worked with models where individual objects were multi-gigabytes (meshes+textures and volumes) but even then, having 1GB of RAM was more than enough for us to reach 20-30 FPS realtime on a box with NT4 and first- and second-generation 3D cards. Software rendering with very realistic detail was a little slower (3-5 fps) but was fine for writing movies. Progressive geometry & texture transmission, continuously calculated view-dependant detail levels, and other current and not-so-current research would solve the memory problems in 3D. Don't believe me? Go to Visualization 2003 and see if the leading researchers are finding RAM as their primary bottleneck. It is a bottleneck of course, but processing speed, caches, and the system BUS limitations are far more troubling.

    As for video editing, you only need enough memory for the tools, a few frames, and whatever operations you are performing. In every case that I've had to do video editing, I've seen two classes of tools -- those that take gobs of memory and try to copy the entire video clip into RAM and end up thrashing for memory -- and those that intellegently figure out what is needed and use only the memory needed for the app.

    An example of the first, an Adobe AfterEffects rendering a simple math function over time was only able to render 30-seconds because it wanted to buffer the AVI file in memory and ran out of RAM (2GB) after a several-hour rendering. An example of the second, a simple home-brew compositor that used the Windows multimedia API to write the AVI to disk -- the same machine and the same set of images required about 45 minutes to render the entire clip.

    So instead of saying:

    I need more than 4GB RAM (3.5 if I want it stable) for video editing and 3D rendering.

    I would suggest you say " I need to buy tools that are properly designed and implemented for my class of computer. "

    Frob.

    --
    //TODO: Think of witty sig statement
  77. Re:Microsoft will decide the outcome of this battl by mudrat · · Score: 2, Insightful

    It would be very risky for Microsoft not to provide a version if windows for x86-64. Microsoft are already facing major competition in their server market from free *nix. If they allowed the competition access to free reign on a very fast and powerful architecture, they would be taking a major risk.

    Microsoft need to weigh up development costs against the risk the *nix/x86-64 will become very popular and decide whether its worth their while to compete with a version of Windows. I would guess it is.

  78. Sometimes it affects point of view by Galvatron · · Score: 2, Interesting
    I don't think many were proposing that Linus was attempting to manipulate us for his own ends. However, sometimes the work you do can influence your thinking.

    Take my grandfather for example. He worked as a transportation lawyer, back in the bad old Interstate Commerce Commision days. The ICC was created to regulate railroad monopolies, but was eventually coopted by the railroads to keep out trucking competition. In order to establish a new shipping route, you needed to prove to the ICC that there was a "need" for this new shipping route. Clearly, it was an absurd, anti-competitive system. My grandfather retired shortly before the industry was deregulated. However, to this day, he still believes that the ICC was a good thing, because being dependant on its existance for a job made him a believer.

    The point is, when you have an economic interest in something, it can start to affect how you think about things. The wealthy tend to want tax cuts, and the poor tend to want spending increases, but most of them are probably not conciously supporting those positions for their own selfish ends. They truly believe that what's good for them is the right thing for everyone, it's a natural justification process for humans, and I wouldn't think less of Linus for the tricks his subconcious might play on his mind

    --
    "The question of whether a computer can think is no more interesting than that of whether a submarine can swim" -EWD
  79. Re:Linus holding on to his security blanket? by 4024490502 · · Score: 2, Interesting

    Would you prefer to have eight registers and a single byte copy- block instruction, or 64 registers and have to replace that copy-block instruction with (*gasp*) three simpler instructions?

    You obviously know much more about cpu architecture than I do, so perhaps you could help me with something I don't totally understand: I don't doubt that RISC architechture is simpler, but isn't the executable code for a RISC cpu much larger than the code for a non-RISC cpu? For some things that can be done in one instruction on a CISC cpu, it takes several on a RISC. This would seem to me to take both more code for a program and be more intensive on the system bus by transfering the extra instructions. Am I wrong?

    --

    Why is this moist???
  80. Depends on your point of view if Itanium sucks by demachina · · Score: 5, Interesting

    The key point about Itanium is that it is a horrible general purpose processor but it is a serious contender to be very good processor for supercomputing. It has very good floating point performance and the EPIC architecture is designed to be very good on Fortran, especially vectorizable Fortran which is very prevelent in HPC applications. What Linus said is correct in the context of Itanium as a general purpose processor, but its doesn't give Itanium the credit its due as a floating point supercomputer which is the only place its going to sell and is what it was designed for.

    It will probably never be very good for most C and C++ apps. Pointer aliasing in particular will give the Itanium compiler fits. Unless you manually tell the compiler there are no two pointers accessing the same memory the compiler can't safely or effectively pack the parallel instructions in the VLIW and that is the essential to good performance in VLIW.

    You do have to really question the sanity of some execs at Intel and HP for spending the staggering sums they've spent on Itanium. Supercomputing just isn't big enough a market for them to have any chance to recoup their investment in our lifetime and they aren't going to sell it in to the mass market as Linus said.

    For a general purpose 64 bit processor to run existing C and C++ applications AMD is going to win hands down. But as many have noted its not likely most people are going to really need a 64 bit processor anytime soon so Intel will probably do just fine selling 32 bit x86 processors for a while.

    --
    @de_machina
  81. Not quite by Rui+del-Negro · · Score: 2, Informative

    There are two issues here:

    1. There is no difference in the speed it takes to transfer data, because the bus is wider. There is also no difference in the time it takes to process data, because registers are also wider. There is a decrease in cache performance (because addresses take up more space). All other things (CPU design, clock speed, etc.) being equal, this hit would be of about 5%. It would only apply to programs running in 64-bit mode, though (the Hammer can still run in 32-bit mode, and can use 8, 16 and 32-bit pointers even in 64-bit mode, in certain instructions).

    2. AMD's x86-64 Hammer doesn't just increase the register size to 64 bits. It adds several new registers, that can (with minor adjustments in the compilers) give a pretty good speed improvement (I'd say about 10% for the same clock speed, although this will depend a lot on the specific program). It also improves the prefetch and adds SSE2 support (one of the few areas where the P4 has an edge). This should give the Hammer approximately a 20-25% improvement over an Athlon XP at the same clock speed (more, if SSE2 is used).

    RMN
    ~~~

  82. Re:Linus holding on to his security blanket? by norton_I · · Score: 4, Insightful

    Yes, RISC programs tend to be longer (sometimes considerably) than CISC programs. There are actually two (main) reasons for this. First, as you mentioned, you need to replace single instructions (usually ones that do memory-to-register operations) with multiple operations (such as a load followed by an math operation). Second, the instructions themselves tend to be longer, since most (all?) RISC archetectures have exclusively 4-byte opcodes - usually something like 1 byte for the opcode, a few bits for flags, and the remaining for up to three arguments. (register numbers or immediate values). CISC archetectures have varying length opcodes, some a signle byte and some several bytes.

    There are a couple of mitigating factors here. First, compilers are usually not very good at using some of the more complicated combined instructions, so they go unused, inflating CISC code to match RISC code. Second, careful optimization of RISC code can identify repeated or unecessary memory operations, and eliminate them. When the memory operations are tied to the arithmetic (or other) operation, that is not possible. Finally, since RISC archetectures generally have large register files and all registers are equivelent, fewer operations are needed to shuffle bits around to where they can be worked on, whereas i386 has a lot of operations that only work on certain registers (though far less than earlier incarnations).

    I used to be a big fan of Intel cutting life support on the 386 archetecture, because RISC is so obviously cleaner and nicer. However, I have started to believe the AMD hype about x86-64, which is basically along the lines Linus talks about here. RISC vs. CISC doesn't really matter any more, and the i386 archetecture is not so bad. If you A) add some more general purpose registers, B) eliminate most of the remaining register usage restrictions, and C) Ditch the worst (looking and performing) FPU on the market in favor of almost anything else, you have yourself a very servicable archetecture. Extend the registers and addressing to 64 bits, and you have something that has a lot more room to grow. That is what the x86-64 is, and despite all the rumors that Intel has their own 64 bit extension to x86, if they don't actually release soon people will start to adopt x86-64 and they will be in the unenviable posistion of having AMD dictate the future of their product line.

    I have heard frequently that something like only 5% of the transistors on the PPro core were tied to the "i386ness" of the core. I assume with the P4 that number is even less. It seems then that the instruction set is not as big of a deal as we would like to think.

    The thing that puzzles me about ia64 is: if the whole point is to "make the compiler do it", and none of the fancy instruction reordering is done in silicon, why is it so expensive?

  83. Does it run Windows? by BagMan2 · · Score: 2, Insightful

    That's the real key. Now days, recompiling well written software for different CPU's is trivial, provided the OS API is the same. If Windows runs on an Itanium well, I can likely just recompile my software and be done. If it can emulate 80x86 well enough to let me run old Windows programs, that's game-set-match.

    I realize /. doesn't like the view of the world through Microsoft colored glasses, but that's the reality that is out there. If it runs their software quickly, users couldn't care less about what the CPU type is, and that includes high-end server applications as well.

  84. Re:the reason the Itanic is a bomb.. by psamuels · · Score: 2, Interesting
    Windows was pathetically slow on those machines, in comparison to them with Digital Unix, or in comparison to my Pentium 133Mhz workstation. Even in comparison to Pentium 166Mhz machines running BSDi 3.0, the Alphas were very slow, even though the Alphas were stuffed full of memory.

    I've never run NT on Alpha, but my understanding is that Microsoft took the easy way out and ran NT basically as a 32-bit OS. Linux, OTOH, fully supports 64-bit and has done so ever since the Alpha port matured. Microsoft finally had to bite the bullet and fix their 32-bit-isms when then came out with Win2k for ia64, but that was years later.

    The Alpha also gets you on unaligned memory accesses - if you and your compiler are not careful, you can force some very slow OS traps. I wouldn't be surprised if your applications were slow partly because of this.

    I was never particularly fond of the DEC Alphas. I didn't like Digital Unix much at all, and the power-that-be wouldn't even let me consider Linux on them.

    Hmmm, what's not to like about Digital Unix? I always thought it was quite nice, as proprietary Unixes go. It was slower than Linux, due to its microkernel layer.

    --
    "How can you claim that you are anti-crack, while still writing a window manager?" — Metacity README
  85. Re: *I* need 64-bit to use more RAM for... by Dynedain · · Score: 2, Interesting

    When I say 3D rendering I don't mean openGL/DirectX rendering....I mean full raytraced, reflection/refraction with global illumination, complex shaders, etc. With scenes as complex as I'm working with, I'm still looking at rendertimes of upwards of an hour per frame.

    And as for video editing. Take a 1920x1080 (max HDTV) clip in raw 1 targa per frame format, add gradients, filters, masks, particle effects, 3d camera movements and lighting, and tell me you can buffer more than a few seconds in RAM. Don't believe me? Go download the demo version of Combustion from Discreet and try it.

    --
    I'm out of my mind right now, but feel free to leave a message.....
  86. A really good CPU benchmark summary by 3770 · · Score: 3, Informative


    Go here for a really good summary of current CPUs.

    --
    The Internet is full. Go Away!!!
  87. Re:Linus holding on to his security blanket? by TheLink · · Score: 3, Informative

    Ask anyone who has done assembly language programming on x86 and a decent CISC and x86 will always lose out too.

    But the x86 has evolved a lot since the bad old days. You could regard the ugly stuff as vestiges of a primitive form and stick to saner modes.

    A larger code size can be a significant disadvantage nowadays. Imagine CISC as compressed RISC opcodes. The current situation is the CPU is VERY much faster than the RAM or even the 2nd level cache. So it's not a big deal to have to decompress (decode/expand to RISC) instructions in the CPU. You gain overall processing throughput that way.

    As long as that situation remains, larger code size is a significant issue. It means fewer programs in memory.

    True RISC processors you talk about are declining. Most are becoming more pragmatic. Which is what Linus is talking about.

    --
  88. No Power4 or 5 due in Macs by Senjaz · · Score: 2, Interesting

    There is no chance of seeing a Power 4 or 5 in an Apple machine. They are IBMs high end server processors.

    The PPC970 however is a different matter. Based on the Power 4 core with AltiVec, minus the on chip level 3 cache and multiple cores (though going back to multiple cores is a possibility when they improve their fabrication to 90nm from 130nm)

    --
    Don't blame me - this .sig had steal me written all over it.
  89. Re:didnt intel sell some rights....... by vidarh · · Score: 2, Interesting

    Intel was founded 1968 and AMD in 1970, so what was your point about AMD being "so young"?

  90. Re:Itanium 2 is great by roca · · Score: 2, Insightful

    I'll elaborate. Itanium is statically scheduled, so a "good" compiler has to know in advance the order in which instructions will complete. This is impossible when you're doing random memory accesses some of which will hit in cache but you don't know which ones.

    Every other modern CPU can do out-of-order execution so a "good" compiler doesn't have to know exactly the order in which instructions will complete.

    That is why there are "good" compilers for most architectures, but not for Itanium.

  91. Re:Itanium 2 is great by dr2chase · · Score: 2, Informative

    Itanium's problems were visible from the moment the architecture appeared. It is, and was, an architecture that should excel at running Fortran programs, which are much more easily optimized than code written in C, C++, or Java. Compilers written ten years ago should be able to do a decent job compiling Fortran to Itanium with only a modest amount of porting work. Problem is, people aren't just running Fortran on Itanium.

    The apparently-dynamic nature of current programs (that is, the intractability of statically analyzing them) has been coming for years. Ten years ago I spent my time studying the inner loops of SPEC benchmarks, and even then the typical inner loop of a C program was the instructions:

    compare X with a value
    branch out if equal
    load indirect through Y to get Y'
    load indirect through Y' to get X
    branch to top of loop.

    If Y (and Y', and Y'', etc) don't address memory in cache, you're hosed. Static prediction algorithms used in some of the first RISC chips (HP-PA, e.g.) work as well as any other on this loop, but you don't know that you're done until you load all the data and compare it. The loop cannot run any faster per iteration than the latency of the memory that happens to hold the data (Cache is King).

    Object oriented programming, whether accomplished with an OO-TM programming language, or just a structure full of function pointers, is about the same can of worms (internally, the processor is caching the last location of the indirect branch, so it is not substantially different from prediction of conditional branches).

  92. Re:Itanium 2 is great by Agent_Basilisk · · Score: 2, Insightful

    Itanium 2 in benchmarks are faster than any other processor used for the same thing, it even kicks the Power 4's ass! Itanium given time will have software support and better compilers that are more optimized for the Itanium architecture. Intel had many problems getting a working processor and even more trouble trying to get support. Intel backs the Linux community and then we see now that support being thrown out the windows BY the Linux community. does this make sense? Itanium yes, on paper has been around for years, yes there were lots of problems, yes it doesn't run 32-bit code well (remember Intel's first shot at RISC with the P6 processor known as the Pentium Pro? It didn't work with 16-bit code well as there wasn't a consumer OS at the time (Windows 95) that could use only 32-bit code. This chip worked well on 98/NT/2000/XP/ME and Linux/BSD). so the moral of the story is, give it a little time, support it, and it will shine :D . sig, Twat's that all about?

  93. Re:Itanium 2 is great by John+Bayko · · Score: 2, Interesting
    Optimizations done at compile time are far better than optimizations done at runtime.
    Er, you're aware that one does not exclude the other?

    Compile time optimizations can take more of the program structure into account, so can be really big wins when they're done well. But they can't take into account program behaviour. For example, the compiler can only tell that there's an if statement in the middle of a loop. The CPU can tell that 99% of the time it is false, so it can skip it and only go back if needed (profiling optimizers can do something similar, as can recompilers and translators - like Java HotSpot JVM and the Transmeta CodeMorpher (Linus works on that), which will both stop and optimize if a block of code is executed a lot).

    The compiler is also nearly clueless about the structure of the CPU. Although the compiler can assume a CPU has a floating point unit, it has no idea if it has two that can execute in parallel, or how long the pipelines are. Modern CPUs can re-order instructions to match the actual hardware. The hardware can even add registers by "renaming" them - the x86 instruction set has eight general purpose registers, but the CPUs usually have dozens, so if a group of instructions share a register but don't share data, the CPU can give two independent registers the same "name" as far as the instructions are concerned, and execute both groups at the same time.

    Yes, a compiler might assign registers more effectively, but only if you knew before hand what CPU was going to be used.

    Compilers can group instructions together in such a way that the CPU is more likely to detect the right patterns. In this case, the optimizations are "implied", and a smart CPU will pick up on them and run faster, a dumb CPU won't, but it normally won't hurt performance. This means that you can have both high level, compiler optimizations as well as low level, CPU optimizations that the compiler could not perform.

  94. Re:Where's the Harsh Words for Transmeta? by John+Bayko · · Score: 2, Insightful
    How is anything Transmeta done even remotely close to something of this level?
    It has the potential. Revolutions aren't always noticed right away, but the idea of translating the instruction from one CPU to another, and running almost the same speed as the original, is something that has never been done both commercially and successfully before.

    The implications of this are more profound than it first seems. First, this removes the requirement that CPUs be particularly compatible with anything. In fact, the some of the Transmeta CPUs aren't compatible with each other - yet they all run the same programs.

    This frees up the design incredibly. For comparison, the Transmeta CPUs (which Linus writes code morphing software for) and the IA-64 (which he thinks is crap) are both VLIW architectures, with about the same issue width and number of registers, and so on. They are more similar to each other than either is to the x86.

    However, the Transmeta CPUs leave a lot of things like the ordering of instructions and handing of exceptions exposed to software. The code morpher takes care of those jagged edges, making them disappear - as a result, the CPU implementation is very simple but very effective.

    In comparison, the IA-64 needs to explicitly specify what instruction combinations are allowable in the input packets, needs to store exception information (there are at least 128 bits that do nothing but remember if an exception happened when using a register), and so on. The result is that the sharp jaggie bits now look like soft jaggie bits, yet the machanisms needed to keep the whole thing consistant bogs it down. I don't think an IA-64 could be implemented at all in a CPU as simple as Transmeta's.

    The second important thing is that the CPU is not tied to a single instruction set. I think Transmeta has made a mistake in not including support for other instruction sets, but it has demonstrated that it can. I remember reading that the first demo showing a Transmeta CPU running Doom was written in x86 machine language, except for the inner loop which was written in Java bytecode. Every iteration the CPU+software switched instruction sets without missing a frame.

    If Transmeta is aiming for the low power, embedded marketplace, the dominant player there is ARM, not x86. If they could offer the ability to mix popular embedded ARM code with popular desktop x86 code, they might have a real winner there.

    But as yet, the revolutionary aspects of the Transmeta designs are unexploited, and mostly even unnoticed. But I'm convinced that even if they don't do it, it will get done another way, whether it's Java, .NET, some Open Source project (Parrot?), or something that's not even noticed yet (Tao Group Elate?). But I think they are revolutionary.

  95. Re:Itanium 2 is great by Mikeytsi · · Score: 2, Insightful

    Because then the University can't take the code and sell it at a profit. It's pretty standard practice that all work you do, including new, patentable ideas, are considered "work for hire" and are owned by the school. "Existing project" and "OSS" tends to mean, "we can't exploit this to get money".

    --
    I've been called a "Fucking Dick" by better people than you.