Slashdot Mirror


AMD Talks About Internal Benchmarks for Opterons

ggruschow writes "AMD's CTO says their 2.0-Ghz Opteron (aka Hammer) beat a 2.8-Ghz Xeon (P4) on both SPECint2000 and SPECfp2000 tests, but was mixed against an Intel 1-Ghz Itanium 2 (details at ExtremeTech). IBM predicted "conservative" 1.8-Ghz PowerPC 970 scores, which fall in the middle of the pack (sweet for OS X). It's probably not a coincidence that AMD's news comes so soon after Gartner said x86-64 would fail. Even if Intel loses the performance crown again, their upcoming mobile processor is looking pretty spiff with its recently announced 1MB of cache. Sounds like next year might finally bring a worthy upgrade for my 486dx4-160."

109 of 295 comments (clear)

  1. *sigh* by chefren · · Score: 2, Informative

    Who cares what processor is slightly slower or faster than others? You need at least a 10% difference in overall system performance to notice anyway.

    Darn, I missed fp by thinking...

    1. Re:*sigh* by tcdk · · Score: 5, Insightful

      For straight CPU intensive tasks it matters.

      But for 99% of normal peoples taskes 10% whont matter.

      But it's the edge and it has to be somewhere and it has to move.

      My rule is that I upgrade when I can get a cpu that is twice as fast as my old one for about 1000dkr (130$/).

      Thats possible right now (I've a 850Mh celeron), but I need a new motherboard, which kind of changes the rules.

      --
      TC - My Photos..
    2. Re:*sigh* by SomeGuyFromCA · · Score: 2, Informative

      The conversion rate is $1 == 1.02

      So it's fairly close.

      --
      if the answer isn't violence, neither is your silence / freedom of expression doesn't make it alright
    3. Re:*sigh* by dimator · · Score: 2

      But for 99% of normal peoples taskes 10% whont matter.

      "Doctors say that Nordberg has a 10 percent chance of living, though there's only a 50 percent chance of that. "

      --
      python -c "x='python -c %sx=%s; print x%%(chr(34),repr(x),chr(34))%s'; print x%(chr(34),repr(x),chr(34))"
    4. Re:*sigh* by tconnors · · Score: 5, Informative

      For straight CPU intensive tasks it matters.

      But for 99% of normal peoples taskes 10% whont matter.


      10% never matters. We regularly run simulations here that take a month. What is 10% on top of a month? 3 days. If you have already been waiting 30 days, what does another 3 matter? It probably corresponds to the weekend anyway.....

    5. Re:*sigh* by tcdk · · Score: 5, Insightful

      How many people are "we"?

      If you are ten people, one of them could be fired, by your argument, without anybody noticing.

      Let me turn it around - how many procent do you need before it matters? 12? 15?

      But I agree, one can't upgrade everytime theres a 10% speed increase. One has to do the cost/benefit thing carefully first (and then ignore the c/b and just spend, spend, spend - the only way to get the economy back on track ;)

      --
      TC - My Photos..
    6. Re:*sigh* by Anonymous Coward · · Score: 2, Insightful

      I am a researcher, and I must say I would generally appreciate a 10% boost in performance.

      Basically, I find that I sometimes get into situations analogous to going to a bus stop. If you can get there in 8 minutes instead of 10, usually it makes no difference, but sometimes you can catch the bus when otherwise you would have to wait for another one.

      What this means in my work, is that I might miss out on an open slot in the LFS batch queue. Or for a job that lasts several days, a few hours can make the difference between being able to present results at the meeting this week instead of having to wait until next week.

      So I am glad to get 10% additional performance... but if I'm spending my own money on it, I'm probably not willing to spend any more than 10% additional dollars to get it.

    7. Re:*sigh* by secondsun · · Score: 2

      Does thermal weapon mean ANYTHING to you?

      --
      There is nothing wrong with being gay. It's getting caught where the trouble lies.
    8. Re:*sigh* by sql*kitten · · Score: 3, Insightful

      10% never matters.

      On the contrary, if you can get by with spending 10% less on equipment (the other way of looking at this) than that can make the difference between being a solvent, viable company and everyone being out of work.

      You're at a university, so you are under no commercial pressure to deliver. I mean, once you're past undergrad assignment deadlines, research gets written when it gets written, right? You can't rush science, maaaan, pass the bong. But in the real world, there are real consequences, and 10% could make a real difference to computation-intensive jobs.

    9. Re:*sigh* by Syre · · Score: 2

      I don't think speed is so much the issue for Intel.

      What they really want to do is to come out with a new architecture that no one can copy.

      AMD is still making use of old licensing deals with Intel that go back to the 80s and basically allow them to use x86 microcode etc.

      If Intel can get Itanium adopted, AMD is SOL... Itaniam will be a bitch to reverse engineer, and is not covered under any of those old pesky licensing deals.

      Sure, Intel is trying to advance the architecture, but the reason they're willing to spend whatever it takes to get Itanium accepted is because it removes all direct competition.

      As usual, the business world is more cynically motivated than it seems...

    10. Re:*sigh* by cheezedawg · · Score: 2

      AMD's Athlon XP is clocked at well over 2Ghz now

      No, AMD just barely hit 2.0 GHz with the Athlon 2400+ (the fastest Athlon available now), and the much hyped release of the 2700+ and 2800+ that were announced about 2 months ago has been delayed until the end of November.

      --
      "The defense of freedom requires the advance of freedom" - George W Bush
    11. Re:*sigh* by 4of12 · · Score: 2

      Let me turn it around - how many procent do you need before it matters? 12? 15?

      Reminds me of the criterion I heard for how much of a pay increase is needed to induce people to leave their existing job for a new one.

      IIRC, 10% wasn't enough. People need 15-20% increases to motivate the trouble of switching.

      Not that computer speed and pay are really comparable...I think Bill G. is the only one whose pay has kept up with Moore's Law. Mine hasn't.

      --
      "Provided by the management for your protection."
    12. Re:*sigh* by NanoGator · · Score: 2

      I metamoderated in your favor as well. Mods need to learn not to be so literal about 'off-topic'.

      --
      "Derp de derp."
  2. 486dx4-160? by acehole · · Score: 5, Funny

    You're weak my friend ;)

    You've got no holding power... hell i've still got my Commodore 64 with accoustic coupler modem, and i'll hold onto it until I see something worth spending money on...

    --
    Be you Admins? nay, we are but lusers!
    1. Re:486dx4-160? by figjamjam · · Score: 2, Funny

      Bah!!

      You youngins with your new fangled machines. I'll never give up my UNIVAC.

      Goto go and replace some valves now. See ya.

  3. Yawn, wake me when it ships. by Gldm · · Score: 4, Insightful

    Benchmarks are nice and all, but I'm getting kinda tired about hearing how great a CPU benches for about 6 months before I could even buy one with a sack full of money.

    Not that I'm not excited about 64bit CPUs on the desktop, I could really find a use for one (I've got something interesting that likes to malloc more than 4GB sometimes).

    --

    Introducing the new Occam Fusion! Now with sqrt(-1) fewer blades!

    1. Re:Yawn, wake me when it ships. by Anonymous Coward · · Score: 2, Funny

      "I've got something interesting that likes to malloc more than 4GB sometimes"

      Woah, you do open a lot of porn with mplayer!

    2. Re:Yawn, wake me when it ships. by Anonymous Coward · · Score: 3, Funny

      I've got something interesting that likes to malloc more than 4GB sometimes

      Mozilla?

    3. Re:Yawn, wake me when it ships. by RealAlaskan · · Score: 2
      >>I've got something interesting that likes to malloc more than 4GB sometimes

      >Mozilla?

      No, he said interesting.

      (Disclosure: This was posted using Mozilla.)

  4. Re:AMD sucks! by Anonymous Coward · · Score: 2, Interesting
    Well, I kind of agree.

    I want lots of cache and extreme memory bandwidth. As CPUs are getting faster and faster, both the lack of cache and memory access are seriously limiting the performance of current PC architectures. Yet, not even Intel seems to be interested in improving those areas. In fact, with P4 Intel actually cut the amount of cache.

  5. MOD PARENT UP!! by megaduck · · Score: 2

    Damn, I tried to mod "funny" and it entered it as "overrated". Stupid wheel mice.

    --
    This .sig for rent.
  6. boring... by hatchet · · Score: 2, Insightful

    Noone cares about few % performance gain anymore. And even if Opteron would be much faster, people wouldn't care much simply because you can't buy it. Pentium4 is better because you can get it *NOW*

    If you need new computer, buy it (NOW!), otherwise don't buy anything until you need it.

  7. How much is adequate? by Deton8 · · Score: 4, Interesting

    Oviously there is a market for super-fast processors to those of us on /., but aren't we at a point where currently available processors are fast enough for more and more user segments? What I mean is, people who do Word and Excel were happy along about 800 MHz and ordinary CAD people like me don't need more than about 2 gig. There are only two guys in my organization (running VHDL simulations day in and day out) who have any need for faster processors. Will we soon get to a point where the total market size of gamers and /. people will not pay for another processor spin?

    1. Re:How much is adequate? by Duds · · Score: 3, Insightful

      But people were saying this back when that guy's 160 wasn't laughably slow.

      He can browse with it, why does the home user need more? That with linux or winNT and memory would do everything average Joe wants.

      The answer is A)marketing B)keeping up with the Jones' and C)Because there IS always something new for people to do.

      You won't stop CPU dev, there's always someone who could use it or some Redmond based multinational doing something to make it needed.

      No-one NEEDs more than a P100 tops. They CAN find a use for it though and that'll never changed. The reason can be summeried thusly.

      "Hey Ma, look at what this fancy computer can do!"

    2. Re:How much is adequate? by GnomeKing · · Score: 3, Insightful

      Your forgetting one important factor in the computer-upgrade cycle

      microsoft.

      Sure, current computers will run word of 2 years time without (m)any worries, BUT, "innovation" has bumped up the required specs for every single windows/office release

      Of course its not just microsoft which bumps up required specs, but their the driving force behind most hardware upgrades

      As processors get faster, software gets both lazier and "smarter"...
      lazier 'cause theres less optimization and "smarter" 'cause, for example, 15 years ago no one would have ever implemented some of the stuff thats present in todays computers (fex image thumbnails in explorer)

    3. Re:How much is adequate? by joto · · Score: 2

      Nah. Is free software any better? I sure wouldn't want to run Gnome, KDE, OpenOffice or Mozilla on an old PC (My old Cyrix 586 with 48MB ram is completely unusable for webbrowsing. But I can still use old wordprocessors, older versions of gcc, older games, etc without problems. Of course, the web is the only thing I can't get an old version of.

    4. Re:How much is adequate? by ottffssent · · Score: 5, Insightful
      No-one NEEDs more than a P100 tops


      Yeah, but only in the way than no-one NEEDs modern medicine, central heating, or citrus fruit during the winter.

      On the other hand, I NEED faster than a Duron/600 for:
      sending messages in ICQ (yup, sending a message is O(n) or O(n^2) - not sure which) with n the number of messages in your scrollback
      Encoding MP3s - I spent over 2 hours this afternoon switching CDs every 10-15 minutes.
      Recording TV - I can only record to divx at quarter VGA or less
      Using Mozilla the way I want (with 20-50 tabs open at a time and 128M of RAM cache)
      Using an encrypted filesystem (unless win2k's implementation is just horribly inefficient)
      Opening / manipulating 500M images

      Sure, I could plop an XP2200+ in here, but I spent $50 on the original CPU and I'm unwilling to spend more on another until Hammer comes out. A dual Clawhammer should be about 10-20x as fast as my current machine depending on app - a most satisfying upgrade.
    5. Re:How much is adequate? by TobyWong · · Score: 2

      One severe case of scurvy and you will change your tune about citrus fruit in the winter!

      =P

      --
      - Toby
    6. Re:How much is adequate? by joto · · Score: 2
      Yes, but do you find much on the web that renders adequately in netscape 3? Or is even rendering at all?

      Can you even use e.g. hotmail in netscape 3? Can I access my bank with it? Can I deliver my tax reports with it? Can I use it for popular shopping sites like ebay or amazon? Can I read the online phonebook provided by my phone-company, or the zipcode-catalog by the postal office? If all I can do is to surf the useless majority (unfortunately) of the web (like slashdot), then it wouldn't do me much good.

      And that is why I claim that the web is nowadays one of the major reasons to upgrade. Sure, I can still type letters in WP5.1 (actually, I kind of like it), and lotus 123 is still an excellent spreadsheet. And I remember fondly running linux, XFree86, emacs and gcc (yes I did have some trouble) on an old 486sx25 with 8MB of ram and 20MB HD. But the moment I want to access something useful on the web with that machine, I'm lost. And so I am with my old Pentium-clone.

  8. C64 ? by stud9920 · · Score: 5, Funny
    hell i've still got my Commodore 64 with accoustic coupler modem
    You overcomfortable rich kid ! A C64 is just a toy with loads of eye candy. I am still doing it with my Difference Engine 2.0 and IPoAC (IP over avian carriers). More would just be superfluous luxury. Besides, shouldn't you have typed your message in all caps ?
    1. Re:C64 ? by roundand · · Score: 4, Funny

      IPoAC (IP over avian carriers)

      That would be RFC1149, right?

    2. Re:C64 ? by joto · · Score: 2
      Well, you've got the most expensive networking equipment I've seen then.

      For IP over avian carriers to work, you need: a printer, preferably to microfilm, a scanner, preferably from microfilm, OCR software, and lots of avian carriers. Seems to me it would be far beyond the capabilities of the difference engine. What computer do you use to feed your difference engine the IP-protocol messages?

    3. Re:C64 ? by salimma · · Score: 2

      Ooh where did you get that prototype? AFAIK the product line was discontinued :p

      --
      Michel
      Fedora Project Contribut
  9. Benchmarks... by e8johan · · Score: 5, Interesting

    Benchmarks are as bad as statistics. They measure nothing but how much you can tweak your CPU and compiler to fit that specific benchmark.


    I would say that AMD may have an advantage for being more backwards compatible than Itanium, but I also feel that it is time for a change!


    All major CPU manufacturers make proper RISC CPU already so why don't we find them in our ordinary computers? It is because the Windows codebase cannot simply be recompiled for a new target but has to be ported function by function (painful assignment, to say the least). Perhaps they can reuse 3/4 of the code, but still, there is a whole lot or rewriting and verification to do.

    I have worked in a Tru64 environment (running Alpha CPUs) and I was surprised of how easy it was to get 95% of the Linux apps to properly compile and run. I didn't try to get Linux it self running but I had gcc running and that was enough.

    What I'm trying to say is that the open source movement has proven that one can write portable code successfully and that it is time to make a hardware change. The serial ATA and AGP solutions from the PC are good enough, so is the PCI bus (lots of peripihals available) so I wouldn't change that, but simply make the standard computer run multiple RISC CPUs and a proper multi-threaded OS that can take advantage of that and then you'll have a performance boost that would make P4 look like a bicycle compared to a F1 car (ok, perhaps a Porche, but still, an F1 does 0-200kph in
    While I'm at the subject. As we have bochs, it would still be possible to run Windows in a VM, no matter what platform we use, so all M$ users could be happy, or do as ACorn did (does), have a PC as a extension card, i.e. run a PC natively in a window, just used the *fast* RISC CPU for any real work.

    1. Re:Benchmarks... by radish · · Score: 2


      RISC is no panacea, there's no real reason why a RISC box is inheriently faster (in real world use) than a CISC one - they're just different architectures.

      The real reason wintel is still CISC is not Windows itself (NT4 for example is already ported to Alpha) but all the third party apps - people want to be able to run the xyz app they bought 5 years ago on their new box. This is why Intel is having fun trying to get their new non-backwards compatible architecture accepted widely.

      Oh and gcc isn't a Linux app, of course it was easy to recompile for other platforms. That's kind of the point of gnu.

      --

      ---- Den ene knappen er powerknapp, den andre er Bender voice knapp "Bite My Shiny Metal Ass"

    2. Re:Benchmarks... by e8johan · · Score: 2

      I know that gcc isn't a Linux app, what I was trying to say is that applications *developed* in a Linux environment easily ports to other platforms. Even though the endianess and variable sizes may differ. This is due to good coding. Keep that up!

    3. Re:Benchmarks... by e8johan · · Score: 2

      There is also the problem with other apps (intended for Windows or even (shiver) DOS). These apps are developed as propetary software and will be hard to simply re-compile.
      WinXP even has a compatibility mode to run older apps since the system has been so badly designed from the start.
      As for the pricing. The prices would drop if major suppliers started supporting them. This is what I want: the major players (i.e. "the computer industry") should realize that it is time to make a platform switch before we dig our selves even deeper into this pit of horror (i.e. x86 architecture).

    4. Re:Benchmarks... by vrt3 · · Score: 2
      I'm sure plenty of x86-ities have crept in since they realised no one was asking for cross-platform support, of course.

      And on the other side of the pond, Linux was in te beginning not intended to run on anything besides x86. It has turned out otherwise, and Linux now runs on quite a lot of platforms.

      --
      This sig under construction. Please check back later.
    5. Re:Benchmarks... by RealAlaskan · · Score: 2
      And I believe you can get motherboards for other CPUs with AGP and sATA.

      If you can point me to a link for PPC or MIPS motherboards with PCI busses, which use AT or ATX power supplies, I'd be very happy.

      You're just going to have to pay four times as much. x86 systems are cheap because millions of people buy them ...

      There's some truth in that. I've found motherboards for Alpha's, but they cost over $1000, so were a little hard to justify. The problem was that the manufacturer didn't want his motherboards competing with his assembled machines, I think.

      ... WinNT was originally intended to be cross-platform. 3.51 was released for x86, PPC, Alpha, MIPS, and I think one other.

      Alpha, MIPS and x86 I know about. PPC I hadn't heard about. Was that for IBM's RS6000 workstations? I don't think PPC support was still there by NT4.something.

    6. Re:Benchmarks... by Magila · · Score: 2

      All major CPU manufacturers make proper RISC CPU already so why don't we find them in our ordinary computers?

      Someone already pointed out that Macs use RISC CPUs but in fact all modern x86 chips are really RISC cores with a translation layer from x86->RISC. Also most compilers optimize to a very RISC-like subset of x86. So you see, x86 has managed to evovle so it has most of the advantages of RISC plus the all important legacy support. This sort of thing is how x86 has managed to survive so long and why that's not nececarily a bad thing.

  10. Clawhammer by Perdo · · Score: 5, Informative

    Clawhammer (Athlon) has a single 16 bit wide hyper transport bus.

    The workstation Sledgehammer (Opteron) has two 16 bit busses

    The server Sledgehammer (Opteron) has three 16 bit busses

    The spec results are as follows:

    Spec_int

    PIII1G 426
    G4 1ghz 306
    G5 937 (IBM PowerPC 970)
    2.8Ghz p4 1010
    XP 2800 933
    Itanium 1Ghz 810
    Power4 1300 804
    Clawhammer 2.0 Ghz 1202

    Spec_fp

    PII 1Ghz 426
    G4 1Ghz 187
    2.8 Ghz p4 947
    XP 2800 782
    Itanium 1Ghz 1356
    Power4 1300 1169
    Clawhammer 2.0Ghz 1170

    Opteron??? Higher than clawhammer considering the multiple hyper transport busses 1/2 mb L2 (compared to clawhammer's 256/512 l2) and dual on chip DDR memory controllers compared to Clawhammers single memory controller

    Bootleg Powerpoint Presentation:

    http://130.236.229.26/download/misc/AMD-Opteron. pp t

    and

    http://a26.lambo.student.liu.se/download/misc/AM D- Opteron.ppt

    Read the Show notes! AMD failed to edit them out

    Filename is AMD-Opteron.ppt google search it.

    Includes a system that is an Opteron workstation dualed with a clawhammer that still presents itself as a single proc system. The clawhammer acts as a math co-processer :)

    --

    If voting were effective, it would be illegal by now.

    1. Re:Clawhammer by Halo1 · · Score: 2

      Exactly. Given that programs which are highly optimized for AltiVec can already achieve higher performance on the current crop of G4-based Macs than on much higher clocked x86 compatibles (even though these Macs have a quite limited memory bus architecture that doesn't allow the processor to realise its full potential), I think a PPC970 with a 900MHz bus will be able to achieve monstruous performance numbers in some cases.

      Couple that with the fact that large parts of Mac OS X are AltiVec optimized (lots of functions from the standard C library like memcpy, the OpenGL framework, the CoreAudio framework, ...), I think the real world performance of (Apple) systems using this processor may lie quite a bit higher than what you would expect from just looking at the SPEC scores.

      --
      Donate free food here
  11. Re:486 160 mhz? (History lane) by zensonic · · Score: 5, Informative

    Your konwledge isn't that good. The fastest 486 in terms of Mhz was the Amd 5x86 - 133Mhz (4*33Mhz) chip. That chip easily overclocked to 160Mhz (4*40Mhz). In terms of pentium performance (integer wise) it was equivalent of a P75 at 133Mhz and of a P90 at 160Mhz (give or take a few percent)

    In terms of performance the fastest chip that fitted in a socket 3 was the Cyrix 5x86 120Mhz, which (again speaking of integer performance) was equivalent of a P100.

    --
    Thomas S. Iversen
  12. I hope Hammer will fix the rc5 crippled speed!! by Anonymous Coward · · Score: 5, Informative

    I hope THIS mask rev of Opteron (Hammer) chip will be faster than January 2002 PowerPC G4 chips.

    Currently, according to the RC5 benchmarks AMD is far slower than dual cpu macintoshes (half as fast). (source available for cor rc5 loops for most processors). RC5 was silently completed in June or so but a bug went unnoticed for a couple months, but the contest is over. They measured performance in units of "Mac poerbooks" in their press releases.

    The Mac Dual 1 Ghz g4 is faster than all existing dual AMD motherboards in RC5 benchmark by almost 100%.

    21,129,654 RC5 keyrate for dual 1 Ghz g4 system ! And Now apple sells dual 1.25 Ghz stock which would be even faster.

    A dual 1800+ AMD MP gets only HALF as many as a Mac! 10,807,034 rc5 keys !

    Funny "Mhz myth" there showing itself I guess... Apple now is selling even FASTER machines but with smaller caches and less fast read-write ram (it now uses DDR on newest boxes).

    And the macs are using low power g4 chips meant for microcontroller usages with very little predictive branching and a simple 7 stage RISC pipeline depth. (macs complete many many instructions per cycle though, unlike Pentiums).

    The mac I mentioned uses a 2 MB L3 cache and no AMD MP dual cpu boards I know about have any L3 cache at all, so maybe that is whay some common macs are over twice as fast, its not just altivec meager tweaks to rc5. AMS have similar , but less mazing vector ops.

    Another reason the mac might be over twice as fast as an amd dual mp board is not just the 2MB l3 cache but the fact that mac can read and write to a cold page of memory simulatneously FASTER than any AMD MP designs which are biased for linear access and streaming. Many memory scatter benchmarks show this too. Appels newest DDR-RAM machines might not offer this feature though.

    So basically, will the new Hammer systems be able to get close to speed for RC5 and other crypto tasks as the RISC based Powerpcs?

    I really want to know. And I am so sad to see Slashdot reduced to fanboys modding down anything discussing tech subjects like this as "flames" all the damned time. This post is all informatinve and factual and my reason for asking is genuine.

    http://www.research.ibm.com/journal/rd46-1.html has 5 LARGE technical articles on how the POWER4 chip was designed... in PDF form too. Even if you do not appreciate the Power4 (which apple is using a dual-core version of in many months) you might want to read these PDFs because they are all about chip design.

    They put the floating point on the corners of the chip die to help spread heat, etc. Hundreds of interesting facts and pictures on at that site.

    Top500.org lists Power3 dominating the cluster speeds of the top 500 computer clusters for memory+float speed. Power4 will soon start appearing in that list as well as the "lite" version with only 2 MB of cache instead of 4,6, and 16 MB.

    Plus the new chip apple will start using announced yesterday, will have SIMD "VMX" or Velocity Engine added (Moto calls theirs"altivec").... only 90% of altivecs hundreds of opcodes will be offerred though.

    With Pricewatch showing cheapest 800Mhz Itanium bare cpu at almost 8 THOUSAND dollars, and 3.5 thousand for the old itanium 700 Mhz, it does not take a financial genius to see why apple's workstations are selling so well nowadays.

    1. Re:I hope Hammer will fix the rc5 crippled speed!! by Toraz+Chryx · · Score: 2

      "only 90% of altivecs hundreds of opcodes will be offerred though."

      Source?

      Altivec is 162 instructions, and the Microprocessor forum brief on the GPUL stated "over 160 instructions"

    2. Re:I hope Hammer will fix the rc5 crippled speed!! by hyoo · · Score: 3, Insightful
      I remember this IRC log from a while back. In a nutshell, they said that the PowerPC architechture (namely AltiVec) is well suited for RC5 since it has nice hardware bit rotates, and RC5 uses rotate A LOT.
      [acidblood] More registers available (32 in the PowerPC versus 8 in MMX and SSE2), plus 128-bit wide registers (MMX is only 64-bit wide), and the existence of a hardware vector rotate instruction in Altivec, which isn't available in MMX and SSE2.
      Is RC5 a useful benchmark if it mainly tests the bit rotate performance? Does Intel/AMD really care if their RC5 keyrate is low? Are you going decide which CPU to get next based on bit rotate performance?
    3. Re:I hope Hammer will fix the rc5 crippled speed!! by Colin+Bayer · · Score: 4, Funny

      Are you going decide which CPU to get next based on bit rotate performance?

      On the PC, all of my work was so slow. That repeated multiplication and division by powers of two took forever. That's why I got a Mac, which has great shift left / shift right performance! Now I have more time to ogle the secretaries from the water cooler.

      I'm Colin Bayer, and I'm an accountant at Arthur Andersen.

      --
      Want Linux games? HERE.
    4. Re:I hope Hammer will fix the rc5 crippled speed!! by Jay+Carlson · · Score: 3, Funny
      I hope THIS mask rev of Opteron (Hammer) chip will be faster than January 2002 PowerPC G4 chips.

      [...] The Mac Dual 1 Ghz g4 is faster than all existing dual AMD motherboards in RC5 benchmark by almost 100%.

      [..] Funny "Mhz myth" there showing itself I guess... Apple now is selling even FASTER machines [...]

      I can see the new "Switch" ad now (white background, jerky cuts):

      "I'm a network administrator and so are my friends" "We steal computer power from our employers, at school, wherever we can find it, to run this Are See Five thing"

      "Peace, love, and strong crypto"

      "So I noticed the Apple computers were pretty fast at kicking out keyblocks" "I had to have one"

      "Say it with me: Brute-force known-plaintext attacks" "That's what makes a computer cool"

      "If I'm going to spend a few thousand dollars on a computer, it's gotta be the best at at least one thing"

      "Hi, I'm Anonymous Coward. I'm a crack user."

      [Apple logo]

      Cmon. The estimated SPECint numbers are wonderful news. They're a lot closer to reflecting what most of us do with these machines than key-agile stream ciphers. Beating up x86 weenies with the RC5 key rates will just make them buy a couple of $400 Athlons to stick in the closet and gloat about price/key/sec performance. (That's counting electricity too.)

    5. Re:I hope Hammer will fix the rc5 crippled speed!! by Milican · · Score: 2

      Well here is a benchmark of the RC5 speeds for various processors. Yes the PowerPC does kick some major arse. Why is the question, and here is the answer. Anyway, long story short I heard there is a nice barrel shifter in the PowerPC that makes them excellent candidates for the RC5 client. So as they said in the second link the RC5 contest is not a good benchmark for performance. Although, it is sweet how fast the PowerPCs cores are!

      JOhn

    6. Re:I hope Hammer will fix the rc5 crippled speed!! by acidblood · · Score: 4, Informative
      I suggest you read the distributed.net Slashnet forum, where I explain why the G4 performs faster than x86 processors. Summarizing:
      • RC5 is completely parallelizable, so you could theoretically do as many simultaneous operations as you have execution units on your processor, as long as there's enough registers to mask memory load latency. Obviously, there's many more registers on PowerPC architectures than on x86.
      • The distributed.net core uses the Altivec SIMD extension on the G4, which has a useless rotate instruction, which serves absolutely no purpose that I know of on anything other than RC5 encryption. So I see Intel's point in not including a rotate instruction in SSE2: bit rotation is a completely useless operation except for RC5. Did I make my point clear enough? However, that makes it difficult to use SSE2, given the limited amount of registers available, coupled with the need to emulate a rotate instruction by means of shifts, ORs and an additional temporary register.

      It must be clear that, if Intel had included an SSE2 rotate op, the P4 would easily beat a G4, not at the same clock speed, but given that a G4 can't scale as well as a P4 it wouldn't matter anyway.

      Hammer can't get any better on RC5 without an instruction set overhaul. Athlons already do pipelined scalar integers rotates in 1 clock cycle, it's impossible to beat that.

      Also, please do not generalize G4's distributed.net RC5 speed to a ``PowerPC superiority in crypto tasks,'' because it makes me want to laugh really hard at your cluelessness. SIMD is completely useless in real-world crypto applications: when you use a cypher in Output Feedback mode, which is how stuff is done in the real world when you're encrypting data instead of trying to break keys, you need to know the output of the last crypto operation to mix in the next operation. It should be obvious that you can't do operations in parallel now, so SIMD becomes useless and the Athlon goes back to being faster than the G4 at the same clock rate, and of course much faster on commercially available speed rates.

      Oh, and the larger cache you mentioned has absolutely ZERO effect over RC5 performance. RC5 memory usage for each key being encrypted/decrypted is:
      • number of bits in key rounded to the next 32-bit multiple (64 bits in RC5-64, 96 bits in RC5-72)
      • number of cyphers round plus one, times 8 bytes (12 rounds in the RSA Secret Key challenge equals 104 bytes)
      • 8 bytes for two temporary variables, which hold the plaintext before encryption and the cyphertext after encryption, or the cyphertext before decryption and the plaintext after decryption.

      As you can see, even if you take into account loop control variables and whatever else, it boils down to less than 150 bytes per key. You could probably fit a 60-wide superscalar core on the P4's measly 8 KB L1 cache.
      --

      Join the NFSNET. Our prime goal is making little numbers out of big ones. http://www.nfsnet.org/

    7. Re:I hope Hammer will fix the rc5 crippled speed!! by fault0 · · Score: 2

      And altivec optimizations don't help rc5 much either. It's a common myth that rc5 is completely altivec optimized.

      Anyways, rc5 is an extremely poor benchmark, as it only tests which processor runs the rc5 algorithm the best, which is quite limitied by nature anyways.

    8. Re:I hope Hammer will fix the rc5 crippled speed!! by Graff · · Score: 2
      and I'm an accountant at Arthur Andersen.

      Wonder how many will get that.
      About 5 to 10...
    9. Re:I hope Hammer will fix the rc5 crippled speed!! by falzer · · Score: 2

      I've used bit rotate operations for bitmaps and multiplications/divisions by powers of two. They can also be used in some cases for serial transmission of data. They're not completely useless.

    10. Re:I hope Hammer will fix the rc5 crippled speed!! by randombit · · Score: 3, Informative

      The distributed.net core uses the Altivec SIMD extension on the G4, which has a useless rotate instruction, which serves absolutely no purpose that I know of on anything other than RC5 encryption.

      I'll admit I don't know Altivec too well. But I can pretty much guarantee you that a SIMD rotate instruction would be fairly handly on a reasonable number of crypto algorithms (RC6 and MARS come immediately to mind). Assuming it's doing what I figure it's doing based on your statement.

      BTW, SIMD is useful in some crypto algorithms. In particular, I'm thinking of UMAC16, which was designed to be used with MMX or AltiVec. Yes, it most sitiations it's hard or impossible to run the high-level operations in parallel (though you can with Counter mode and when decrypting CBC -- they can both be done infinitely in parallel). And some algorithms do have operations internally that can be implemented with SIMD (mostly by design).

    11. Re:I hope Hammer will fix the rc5 crippled speed!! by red_dragon · · Score: 2
      That's why I got a Mac, which has great shift left / shift right performance!

      This is a great time to sing the Programmer's Cheer:

      Shift to the left!
      Shift to the right!
      Pop up, push down,
      Byte, byte, byte!

      --
      In Soviet Russia, Jesus asks: "What Would You Do?"
    12. Re:I hope Hammer will fix the rc5 crippled speed!! by acidblood · · Score: 2
      Sorry, but you are even more clueless than the first poster I replied to.


      Ever hear of OBJECT CODE? It is stored in cache too!


      The core functions for block cypher encryption are very small and fit in L1 data caches. Besides, if your code is so bloated you need to fetch it out of L3 cache or main memory, then performance appears to be the least of your concerns.


      Ever hear of CBC mode crypto? CBC is parallel for any algorithm of crypto by definition. CBC is ideal for pgpdisk for example.


      A small cut-n-paste job from the Handbook of Applied Cryptography by Menezes, van Oorschot and Vanstone:

      Algorithm CBC mode of operation
      INPUT: k-bit key K; n-bit IV; n-bit plaintext blocks x[1] ,..., x[t].
      SUMMARY: produce ciphertext blocks c[1],...,c[t];
      1. Encryption: c0=IV. For 1<=j<=t, c[j]=E(c[j-1]^x[j],K).

      So, it appears c[j] depends on c[j-1] for encryption. What were you saying about parallelism anyway?

      Oh, and don't even try to mention CFB, it's not parallelizable either.


      learn more about crypto.

      also rotation is essential in lots of computer operations, it is not worthless. In crypto for example rotation isuseful for not just kracking, but performing the encryption.


      Heh, how do you think a block cipher is cracked? Since you don't have a clue, let me explain it to you: the plaintext is encrypted with a possible key and the cyphertext is examined for a match with a known ciphertext, or a less-than-random composition (for instance only ASCII printable characters.)

      Maybe it isn't time for you to learn some crypto?


      the fact that your totally off base and incorrect post got a +5 mod is indicitive of all that is wrong with people like yourself (unskilled naysayers and amd fanboys)


      Would you mind pointing out where the post was off base and incorrect? All you have shown up to now is your cluelessness.
      --

      Join the NFSNET. Our prime goal is making little numbers out of big ones. http://www.nfsnet.org/

  13. Re:Browsing by Jugalator · · Score: 2

    Hey, he can probably even run Quake 1 with a decent framerate on minimum window size.

    I did.

    --
    Beware: In C++, your friends can see your privates!
  14. Pentium 4s have no shared cache. uni-processor by Anonymous Coward · · Score: 2, Informative

    Pentium 4s have no shared cache. uni-processor designs only.

    If you want DUAL cpus, or more, you have to go mac or AMD to get speed per dollar.

    and macs are twice as fast as the fastest AMD for rc5 benchmarks.

    a pentium 4 is a heatwasting joke once you start using 2 or more cpus.

    Apple is only selling dual cpu machines now. And when the dual core Power4 ships in 8 months or less, they mught be offereing 4 cpus economically as a stock product, even if they do not, many 3rd party dual cpu board suppliers for macs exist, such as Sonnet Technologies.

    1. Re:Pentium 4s have no shared cache. uni-processor by Toraz+Chryx · · Score: 2, Informative

      1) Xeon
      2) Macs won't be shipped with POWER4's in them, they'll _probably_ be shipped with PowerPC 970s (which are effective single core Power4's + VMX)

  15. WRONG! RISC "ordinary computers" exist! by Anonymous Coward · · Score: 3, Informative

    WRONG! RISC "ordinary computers" exist!

    You wrote "why don't we find them in our ordinary computers"!

    In fact I am using one as I type this. It was built in 1996 (yes nineteen ninety six) and has a 800 Mhz G4 accelerator in it from Sonnet.

    Its my "internet" machine, I use other RISC machines for programming not wired to any external networks.

    It runs a wonderful version of Microsoft Office at full speed (RISC) and launches MS word in 2 seconds cold. (yes two seconds to flashing cursor).

    no intel emulation needed.

    its called a Macintosh

    millions of macs exist and millions of macs use one or more risc processors and almost no mac people I know ever wnat to emulate a pc running windows EVER if they can help it.

    RC5 and other benchmarks are twice as fast on standard macs than AMD, and Pentium 4s have no multi-cpu board designs...

    If you want to run thousands of high end commercial shrink wrapped products in RISC you can, but only on macintosh. And they run very well in the new Jaguar 10.2 (though faster in 8.6).

    1. Re:WRONG! RISC "ordinary computers" exist! by Corporate+Troll · · Score: 2, Informative
      • Mac's are PC's... PC is just an abbreviation for "Personal Computer". Use x86 instead.
      • They give back to the BSD community. Not everything of course (look, giving Aqua under the BSD license is suicide), but I'm pretty sure that PPC machines (and thus macs) are better supported now by OpenBSD and NetBSD. That is contributing back! Don't forget that they don't even have to because it's the BSD license which damn well allows you to take the code and keep it for yourself.
      • Apple's are now pretty generic computers with standard PCI, standard RAM, only the CPU is different. Back in the day they did tricks with ROM's etc...but those are gone by now.
      • High pricing? eMac: G4 700Mhz, 40Gig HD, 1Gig RAM, NVidia GeForce2 MX, CD-R/RW for 1600Euro. Ehm... I think that's pretty good bang for the buck! And waaaaay prettier than any beige box you can get.
        I personally own an iBook, and a comparable Dell was really about the same price. I agree that the dual G4's are a bit pricy but look at the prices of a nice Dual Proc Dell workstation fully equipped and then we'll talk again. Oh, and then don't forget that Macs last longer.
        Always compare prices of Apple computers to Dells, Compaq's, etc. Don't start with the idea: "I can build something better cheaper", I know that, you know that, but it's a different market.
    2. Re:WRONG! RISC "ordinary computers" exist! by e8johan · · Score: 2

      I'll comment you one point at a time.

      * I know what PC means, but I used in such a way that I though that it was clear what I was saying. Please do not use spelling misstakes and such as an argument (it often happens at /.)

      * The BSD license lets them take code and do what ever they want with it, but that does not make it a *good thing*.

      * Generic computers now - yes, but I said that they had a history of doing things their way (which generally yeilds more expensive hardware).

      * High pricing - YES. I'd say what you show is expensive, even compared to a Dell or a Compaq. I do not see apperance as a reason for buying a computer (I have mine in a closet). Concerning Macs lasting longer, could it be because the development of new models is slower?

    3. Re:WRONG! RISC "ordinary computers" exist! by Corporate+Troll · · Score: 2, Informative
      • Excused.... I'm nitpickin on it anyway. I just don't like it that people say that Macs are no personal computers
      • It doesn't make it a good think, but they have been nice and contributed back. This is a major difference in comparision to what Microsoft does.
      • Yes, they sucked. They play nice now. Look at IBM, they sucked in the late eighties, now they rock. Companies change. Perhaps in 10 years we'll all love Mircosoft around here.
      • I don't know where you live... But I just went to the website of Dell, and configured a x86 that is (except for the CPU) equivalent to a Mac with Mac OS X:
        • Dimension 2300 Value
        • Intel® Celeron® processor 1.7Ghz
        • 1024MB 133MHz SDRAM (2x512MB)
        • 40GB ATA-100 Ultra DMA
        • Dell E772 17'' (15,9'' VIS)
        • 48x CD-Burner (CD-RW)
        • Dell stereo speakers 206
        • 10/100MB netwerkkaart
        • 56K V.90 PCI Data/Fax Modem
        • Dell Movie Studio I (IEEE 1394)
        • Microsoft® Windows® XP Home
        • Microsoft® Works 6.0
        • Total Price including VAT: 1706,10 EUR /
        Everything I selected Extra, was fair because the Mac comes standard with it. People just find Macs expensive because Macs come with everything and a kitchen sink. This machine is equivalent to the G4 I described. Yes, except the CPU, yes, I know.
        Perhaps model development is slower, but I don't think it is that much an issue.
    4. Re:WRONG! RISC "ordinary computers" exist! by e8johan · · Score: 2

      "Yes, they sucked. They play nice now. Look at IBM, they sucked in the late eighties, now they rock. Companies change. Perhaps in 10 years we'll all love Mircosoft around here."

      I still don't want an IBM stationary. I can use their laptops as laptops are bound to have quirks anyway. I'm just worried since (to my knowledge) there are no custom Macs (i.e. just one manufacturer of boards).

      I live in Sweden. I just poped by www.komplett.se and picked one of their standard computers:

      Box: AVANTECH Medium Tower - Skruvlöst Kabinett m/300W

      Processor: AMD Athlon XP2100+ 1.733 GHz 266 MHz bus - Socket A (Palomino) processor

      RAM: DDR-DIMM PC2100 256MB DDR

      Motherboard: MSI KT3 ULTRA2B Moderkort Socket A VIAKT333, ATA/133, ljud, ATX, USB2.0

      HDD: IBM Deskstar 80GB IDE 7200RPM - ATA/100 120GXP

      Graphics card: Asus V8420 GeForce4 Ti4200 64MB DDR. - AGP, (V8420/TD) DVI, Tv-Out, Retail.

      CD-Burner/CD-reader: Asus CD-brännare IDE 40x/12x/48x CRW-4012A, Intern (FlextraLink)

      DVD-Player: Asus DVD -spelare IDE 16x/48x (DVD-E616)

      Soundcard: Soundblaster compatible

      Speakers: Creative Högtalare SBS250 2 active speakers, White box

      Network card: CNet Kort 10/100 Mbps PCI - TP only Davicom Chipset

      FDD: Nec 1,44MB

      Screen: Hansol 19" CRT 920P TCO-99

      Keyboard, Mouse & Mousepad

      Microsoft Windows XP Home (Svensk)

      3 years warranty and free telephonesupport

      This for only 12999SEK (around 1275Euro). This gives me a few hundreds to play with to get the Movie Studio and Works.

      As for selecting extras for the PC to make it as good as the Mac. I had the same discussion with an Atari owner a few years back (I too am an Atari owner and user). He claimed that it was cheap as an extra MIDI interface would cost so and so much for a competing brand.

    5. Re:WRONG! RISC "ordinary computers" exist! by e8johan · · Score: 2

      I give up this flamewar.
      I like the new macs, but I feel that they are more expensive that PCs^H^H^Hx86s.
      What I wanted to say was that the mainstream computer aught to be a RISC machine. If the platform is a Mac, that is OK. The problem is that finder is propetary and that there are no (or few) producers of hardware except Apple.

    6. Re:WRONG! RISC "ordinary computers" exist! by afidel · · Score: 2

      Ok I am generous and will give the G4 a 2X multiplier when compared to an Athlon, so 1.4Ghz equivilant. Now for about $800 I can build a Athlon 2000+, 40GB hdd, 1GB DDR ram, Nvidia GF4 4400, cd-rw, sb audigy pc with 15" monitor. For half the money I have a pc that beats the mac in every way except for software (I love OSX). Don't get me wrong I like macs again for the first time in over a decade, but price/performance leaders they are not (and probably never will be Apple has a lot of development costs for their awsomely designed systems that they need to spread over a fairly small number of units). Oh yeah, laptops are the one area where Apple is competitive because the white box makers can't make laptops and therfore drive the price down, it also happens to be one of the few areas where any of the pc makers other then Dell can make any money.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    7. Re:WRONG! RISC "ordinary computers" exist! by Halo1 · · Score: 2
      You really don't get it. Darwin == the BSD code they "took from the community". Do you really think there's so much BSD code in Aqua? Or in the Finder? Or their OpenGL framework?

      Additionally, they also work on GPL code (gcc being the most known) and of course also give back things there (even if they didn't want to, they'd have to). The result is that gcc 3.x has much better PowerPC code generation, from which Linux and *BSD on PPC machines can benefit greatly as well.

      Finally, they also opensourced several things they wrote from scratch, like their CSDA and ZeroConf (RendezVous) implementations.

      --
      Donate free food here
    8. Re:WRONG! RISC "ordinary computers" exist! by ncc74656 · · Score: 2
      "Pentium 4s have no multi-cpu board designs..." Really?

      Intel seem [sic] to think otherwise.

      Where are the dual Socket-478 motherboards? They don't exist...if you want Intel MP, you get to fork over the big bucks for Xeons. Hope you brought some Vaseline...you'll need it.

      --
      20 January 2017: the End of an Error.
    9. Re:WRONG! RISC "ordinary computers" exist! by ncc74656 · · Score: 2
      And please, don't use no-name computers. Compare to Dell, Compaq et al.

      Why not? Nobody with any intelligence buys name-brand desktops. Would you pay more money to get inferior components? I wouldn't. Want a dual Athlon MP 1900+ with decent amounts of memory & disk, a decent video card, and Win2K for somewhere around $1600? Dude, you're not getting a Dell!

      --
      20 January 2017: the End of an Error.
  16. Windows XP by droyad · · Score: 5, Informative
    I hear that people are saying it would be difficult to port Windows XP to RISC chips (and new 64bit arch). This infact is not true. In the Windows NT family there are 2 features that make it easy:

    1) It's mostly written in c/c++
    2) The HAL (Harware Abstraction Layer) contains most of the platform specific code. As I understand it the kernel does not actually handle the hardware directly

    Ofcourse I can see it going like this:
    1) Apple, Intel, AMD and Moterola put forward new Chip designs
    2) They ask MS to support it with their OS
    3) MS picks Intel

    --

    $vi any_article_on_iraq
    :s/iraq/microsoft/gi
    :s/Weapons of mass destruction/Windows/gi
    :s/Axis of evil/Redmond/gi
    :s/In this post september 11 climate/Service Pack 1/gi
    :s/Bush/Linux/gi
    :wq

    1. Re:Windows XP by Anonymous+Conrad · · Score: 2, Informative

      MS have been supplying developers (like myself) with 64 bit SDKs for at least 6 months, and migration information (i.e. recommendations for writing portable code) for at least 12 months.

      *Way* longer than that.

      In late 1999, MS shipped a crippled 64-bit compiler in their platform SDK for syntax/portability verification. They began shipping a functional compiler and libraries six to nine months later. My then employers (a network card manufacturer) used to get weekly or fortnightly pre-release builds of Win2k and I'm fairly sure they had Itanium builds up to November 1999 or so - when they just stopped. We didn't have itanium hardware anyhow.

  17. Re:486 160 mhz? (History lane) by jpt.d · · Score: 2

    I got one of those 133mhz chips to go upto 180mhz on a tomato board (pci :-) by trying different jumper configurations. Stable? Wouldn't know, didn't run it long enough.

    --
    What we see depends on mainly what we look for. -- John Lubbock Now search for that bug slave!
  18. 486dx4-160 by clinko · · Score: 5, Funny

    486dx4-160? No wonder you crazy linux folks hate windows. You haven't bought a computer since 1995.

    1. Re:486DX4-160 by Phosphor3k · · Score: 2

      Or AMD's "5x86-133", which was a 486-133mhz that performed similarly to a P5-75MHZ in integer apps.

    2. Re:486dx4-160 by cobar · · Score: 2

      My only issue is that XP doesn't like to do NAT (aka share it's internet connection) for more than one network at a time (802.11b and 100Base-T). I've fought with it before without satisfaction, so I'll probably just replace all the 100Base-T with wireless.

      You could just run your 100base connector out to the hub and then to a wireless access point. May cost more than wireless PCI cards, but why subject yourself to the speed penalty for computers that aren't moved around.

  19. Re:None will be successful in MY house by meringuoid · · Score: 2

    Palladium would have been Microsoft's price for x86-64 Windows. If MS develop that OS, then end users are going to see what Hammer can really do. If they don't, and everyone just runs XP in 32-bit mode, then all you have is a fast Athlon.

    Early models will be able to deactivate Pd, anyway. When it becomes hardwired, that's the day I start looking at Apple and ARM.

    --
    Real Daleks don't climb stairs - they level the building.
  20. Computational Power by Ocelaris · · Score: 3, Insightful

    I think the point of getting more powerful processors is not just for everyday use, but increasing the overall computing power in the world. Imagine getting back the results from Folding@Home in a week, rather than a couple years... sequencing genomes etc... There are very valid purposes for computationally powerful machines, just because WE don't know of any (in our daily lives), doesn't mean that there aren't any (hehe, agnostic argument).

    If someone were to say to me, that the number of kids on computers today doing the things they do was not directly related to computational power, I wouldn't believe them. The more power, the further the abstraction from what computers really are underneath, hence the broader user base.

    If my old computer that my mom uses were 100x as powerful, it would be smart enough to go look online as to why it's having errors printing, and I'd never have to venture out of my cave in the basement :-) Good enough reason for me.

  21. Old hardware, old software and efficiency by XNormal · · Score: 5, Insightful

    At work I've got a 49000 line Microsoft Visual C++ project that compiles in 5.5 minutes on a 1700 MHz Pentium 4. That's right, about 150 lines per second.

    Turbo Pascal used to compile at thousands of lines per second on machines with a clock nearly two orders of magnitude slower that tool several cycles per instruction instead of running several instructions per cycle.

    Before you say something like "hey, but moderns compilers have optimizations yadda yadda" perhaps I should mention that this compilation time was with no optimizations and features like updating browser files disabled. With optimization it's even slower.

    We're talking about four orders of magnitude difference in efficiency here. It's not all the compiler's fault, of course. The libraries and code use complex templates and multiple levels of definitions that make the compiler work much harder.

    At each one of these layers someone probably said "It's OK if this is 10 times slower. It's easier to write and maintain, I'm more productive (or lazy) and the CPU is fast enough". Each one of these decisions may be justified *in itself* but they add up (or rather multiply up) to a 1/10000 difference in efficiency. Slowing the edit/compile/debug cycle reduces programmer productivity and code quality. Reduced code quality to more code bloat and even slower edit/compile/debug cycle and so on.

    Damn, it's depressing.

    --
    Stop worrying about the risks of nuclear power and start worrying about the risks of not using nuclear power.
    1. Re:Old hardware, old software and efficiency by Tony+Shepps · · Score: 2

      "Slowing the edit/compile/debug cycle reduces programmer productivity and code quality."

      I guess, if you're compiling to figure out where your missing semi-colons are. Try working on a project where you can't tell whether your code works until there's a full build, and a full build takes 24 hours. You write quality code at that point, because you have to work top-down. No more write-compile-debug-write loops.

    2. Re:Old hardware, old software and efficiency by Junks+Jerzey · · Score: 5, Insightful

      Turbo Pascal used to compile at thousands of lines per second on machines with a clock nearly two orders of magnitude slower that tool several cycles per instruction instead of running several instructions per cycle.

      Object Pascal (Delphi) still compiles that fast, only now it does include optimization (maybe not as hardcore as some C compilers, but still pretty good). Borland used to advertise speeds of 800,000 lines per minute, back in the day when a 266MHz Pentium II was a hot machine. For most projects, the compilation speed is *zero*. For medium sized projects, it's in the "barely perceptible" range (as in maybe 1/30 second). Very, very impressive.

      Why is it so fast? There are a variety of reasons, in rough order of importance:

      1. There are no header files. All exported identifiers are in the "interface" section of the main source file.
      2. Interface information is always precompiled into a lean format, so there's no need to #include giant files (kind of like having all headers always be precompiled).
      3. There's no preprocessor.
      4. "Object" files are stored in a lean "almost linked" intermediate format, rather than traditional, bulky object formats. This makes the linker a very simple and fast affair, but linking can be the slowest part of building a C++ project.
      5. The compiler, linker, and build manager are all in one executable, so there's no loading programs during compilation (typically for C++, make is loaded first, the compiler is loaded for each source file, then the linker is loaded at the end; yes, disk caching helps here).
      6. Object Pascal is generally a cleaner language than C and C++, so parsing and optimization are easier.

    3. Re:Old hardware, old software and efficiency by borgboy · · Score: 3, Interesting

      1. According to Borland, the name of the language is now simply "Delphi." This changed as of the release of Delphi 7.
      2. Borland C++ and Delphi use the same machine code generator engine, so the optimizations are largely the same. The performance is largely the same. As you said, Delphi is single pass, and parses a good bit faster.
      3. For those of you out there saying "huh? Pascal??? No one uses THAT??!?!" Guess again. It is used a lot more than you might think, typically by small, lean shops with insane deadlines like mine.

      --
      meh.
    4. Re:Old hardware, old software and efficiency by ncc74656 · · Score: 2
      At work I've got a 49000 line Microsoft Visual C++ project that compiles in 5.5 minutes on a 1700 MHz Pentium 4. That's right, about 150 lines per second.

      Something must be seriously ate-up on your machine. I have a ~20000-line MFC project in VC++6. On a dual Athlon MP 1900+, I get three EXEs and two DLLs each in debug and release builds in about 50 seconds. On an Athlon XP 1600+, the compile time increases a little bit to 65 seconds. I know the P4 is a slower processor than the Athlons I'm running, but it shouldn't be that much slower. (If I had my old 1.0-GHz Athlon set up, I'd benchmark the build on that for sh*ts and grins.)

      --
      20 January 2017: the End of an Error.
    5. Re:Old hardware, old software and efficiency by captaineo · · Score: 2

      Are you using STL? You may want to try STLport, or even write your own template containers.

      On my project I switched from the GNU STL to STLport, and compile time went down by half. Then I wrote my own containers and compilation speed went down by a factor of FIVE!

      (my containers are templates to be typesafe like STL, but I have very carefully moved most of the code out-of-line instead of stuffing it all in the headers)

      C++ templates are a real double-edged sword. They give the compiler tons of room for optimization, but they do it by presenting it with basically ALL your code at once.

  22. And another ten, and another ten... by Kjella · · Score: 5, Interesting

    I don't pretend to feel the difference between 2.0GHz and 2.1GHz. I don't "feel the difference" when going from a HD with 3x20gb platters to 2x30gb platters. I don't feel the difference between PC3200 and PC2700.

    But I do feel it when I upgrade from an outdated system to a new one. And to know what kind of performance I could get for a reasonable* (*as defined by me ;) ) price, I do need to know what the state of the art is.

    Maybe that isn't relevant to you, maybe your 486 / Pentium / Duron / Space heater does what you want it to when you check your email and type up your word document, but not for all of us. I know a few tasks where I'd like 4gb+ of memory, solid-state SATA drive and a multi-GHz proc+, or a dual, for that matter.

    Large strides are best made one small step at a time. This is just another one of them.

    Kjella

    --
    Live today, because you never know what tomorrow brings
    1. Re:And another ten, and another ten... by Junks+Jerzey · · Score: 2

      I don't pretend to feel the difference between 2.0GHz and 2.1GHz.

      Although these days I don't feel the difference between 1.0GHz and 2.0GHz, and I'm a software developer. I think that riles up some of the hardware fanboys, but it's true.

  23. Gartner by Ender+Ryan · · Score: 2
    Gartner says a lot of things. Didn't they say Linux would fail a couple years ago? Then didn't they recently publish something else saying Linux would make great strides this/(last?) year?

    It's just mind boggling that people take them seriously...

    --
    Sticking feathers up your butt does not make you a chicken - Tyler Durden
  24. more backwards compatible than Itanium, but by dpilot · · Score: 2

    I won't argue about a change from X86 being desirable either but....

    IMHO Itanium just isn't the way to go. By some measure if X86 is warty, then Itanium most closely resembles Ben Grimm in his best orange. By other measures perhaps IA64 is a cleaner architecture, but it's proving to be a sonofagun to write compilers for. To me that portends a somewhat moribund future with a highly complex compiler on a highly complex architecture. Even incremental improvements, other than clock speed and cache size ramping will be difficult.

    --
    The living have better things to do than to continue hating the dead.
  25. Everyone, look AWAY from the clock speed. by Neil+Watson · · Score: 4, Interesting
    I think the industry has to stop being blinded by clock speed. Before you can improve the speed of the chip there are still bottle necks on the motherboards (e.g. PCI bus, Disk controllers). Also, there is the problem of power consumption and heat.

    I think a better approach for the future are smaller less power hungry modular CPUs. We've all seen the evidence of the clusters that makeup super computers. What if all standard computers came with 4 CPUs that used the same power as the P4 today? What if, instead of buying a newer faster computer, you could add CPUs like expansion cards but, at a reasonable price?

    1. Re:Everyone, look AWAY from the clock speed. by fitten · · Score: 2, Interesting

      Today, SMP code usually requires the code to be written to take advantage of multiple CPUs. There are compilers out that can do some automated threading (and have been a while) but many threaded applications are threaded by hand. Basically, we'd need better compilers and OSs to go along with those computers than we have now -- compilers that can make runtime decisions on how many threads to fork/etc and OSs that can report system resource reports accurrately to the programs.

      That being said, your term "power" is heavily overloaded here... I'm sure you can put 4 G4 processors into a box and the total (electrical)power usage of the 4 G4s would be comparable (or less) than a P4. If you are talking about four processors that are basically 1/4 of the computational power of a P4 (so four of them equal a P4), some applications will still need higher 'power' so that they can finish in times comparable to today. To paraphrase an old saying, a process is only as fast as its slowest thread =)

    2. Re:Everyone, look AWAY from the clock speed. by doom · · Score: 2

      Neil Watson wrote:

      I think the industry has to stop being blinded by clock speed. Before you can improve the speed of the chip there are still bottle necks on the motherboards (e.g. PCI bus, Disk controllers). Also, there is the problem of power consumption and heat.

      Not to mention NOISE. But this sounds good to me, I'm with you so far.

      I think a better approach for the future are smaller less power hungry modular CPUs.

      Now you're kind of losing me. If the bus is the bottleneck, shouldn't you be going after the bus? (If you want to improve your disk controller, that's fairly easy for most of you: just switch to SCSI.) Maybe it's time to dump the current PC architecture entirely... maybe stick with the double-bus design, but instead of ISA and PCI swtich to PCI and whatever the next generation is?

      But granted that clusters of low power processors sound nifty (transmeta?). It wouldn't surprise me if this is the wave of the future in server designs (the place where I used to work, the IT guys had to keep telling people "yes, we have physical rack space to put in another server, but we're maxed out on our power allocation").

    3. Re:Everyone, look AWAY from the clock speed. by drinkypoo · · Score: 3, Informative
      If people were doing more threading or planning to actively run more processes at once, then SMP would be more attractive. Unfortunately too few applications make use of multiple processors, and too few operating systems provide relocatable threads.

      P4 hyperthreading will hopefully get people into threading. Athlon will have slick four way and eight way multiprocessing with hammer when it finally rolls out. Halfway to 2003. I'm a student so I won't be buying until it comes out... That's what you get for delaying to add palladium you bastards.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  26. Ehhh... by fireboy1919 · · Score: 3, Insightful

    Back when Pascal was prevalent...wait that never happened.

    Anyway, twenty years ago people didn't write thing modularly like they do today so recompiles were of a bigger piece of the project.

    Now we use modularity, so code is broken up into much smaller pieces. A recompile need only be the file you're working on - the other 50 of them can just stay compiled as they are. Obviously 'make' was developed specifically to optimize the decision of what needs to be recompiled.

    Sure, it is much, much slower. But linking takes very little time, and compile time has been cut way down by previous compiles - almost enough to make up the difference (although, I admit, not quite). Still, you're comparison is not the best - Pascal hardly has the powers available to a bigger programming language, and since its only been academic, not as much effort has been placed in making the compiler really smart (and therefore slower). Perhaps you should talk about Fortran '77?

    --
    Mod me down and I will become more powerful than you can possibly imagine!
  27. paradigm shift... by john_uy · · Score: 4, Informative

    i think the new release of hammer lines will be very difficult for amd. intel is one step ahead. if you see right now, they are already announcing next generation product lines in all fronts. like banias in cpu, ultra low voltage and integrated chips for small devices, extremely high speed chips for network devices.

    i believe intel has shifted its focus in the battle of the desktop cpus. while amd is just playing catch up, intel now is already looking at what consumers will benefit from. maybe intel has realized that the speed today is an overkill for majority of today's needs. they are just speeding up their chips to keep up with moore's law.

    but look at their products, right now, they are focusing on making things smaller, lightweight, ultra low power consumption, low heat devices, integration. the future is not on desktop computers requiring very high speed cpu but mobile devices such as phones, pda, tablets, etc. intel will be a clear winner (if only i have humongous money so i can buy intel stocks at discount.)

    they have good engineers that produce good results. right now, they are already producing better chipsets for their server product lines, maybe a few years, they will no longer rely on broadcom's serverworks.

    they are also picking up on their storage chips. from all the raid controllers in the market, i hardly see a card that does not have an intel 960 i2o processor or their new ixp processors.

    their network and communication is very dynamic. like introducing 10gigabit products today (even with the downturn of telecoms.) enabling encryption and decription at 10gb/s is no joke. maybe a few years from now, we will see intel as chips in those network gear from cisco, et al.

    they are now focusing on wireless integration. few years from now, capacitors and resistors will be in a silicon chip. it is the future, and they are very lucky to realize that. when the economy recovers, intel will clearly be a winner.

    and for the server, i would want to say this. i believe amd will produce good cpu. but that is just half of the story, amd is not emphasizing any good chipsets/system to come with it including support pci-x at 133mhz with hotplug slots, interleaved memory with chipkill(tm), good server management, good integration.

    (as one who decides what to purchase in a server,) amd must make a lot of effort before i will take them seriously. their cpu is not enough for me to get their system, yet.

    let's just wait and see, but i see that intel will always be a step ahead. now for amd, the challenge is to be at par or even be ahead of intel.

    --
    Live your life each day as if it was your last.
  28. Re:kids nowadays... by nelsonal · · Score: 3, Informative

    Your joke reminds me of the ancient Egyptian symbol for a large number. It was a man with his arms upraised as if saying it's incomprehensible. I think it was used for numbers larger than 1000 or 1000000.

    --
    Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
  29. Re:AMD sucks! by Jim+Norton · · Score: 2
    If you want to cough up the bucks you could have bought an Alpha 21364 with a s*itload of on-die cache, but Intel bought them.

    Intel bought DEC? :)

    --
    -- Jim
  30. RISC and CISC now the same thing by Hoser+McMoose · · Score: 2

    Ok, maybe they aren't quite the same thing yet, but the lines between the two have REALLY blurred.

    Just take a look at any modern RISC processor. Chances are it has several hundred instructions, ie they sure haven't "reduced" that instruction set by any significant amount. Than if you look at any modern CISC processor, you'll find that they just decode instructions into RISC-like ops internally. End result? The difference between RISC and CISC is REAL small these days.

    If you read about the design of the Power4 vs. the Athlon, you'll see that essentially ALL of the basic building blocks are the same, it's mainly just a matter of how many of those blocks there are and how they all fit together. If anyone thinks that the Power4 is so fast clock for clock vs. the Athlon is because of it's instruction set, they probably just haven't looked to see that this chip has tons of execution units, HUGE cache and a shitload of bandwidth. All things that could potentially be added to a chip like the Athlon if the economics of such would fit.

    Now, this isn't to say that x86 isn't without it's flaws, but most of those flaws are rather minor and have been worked around in compilers for years. The two biggest problems are the small number of registers and the stack-based floating point units. Well, Intel's SSE2 can now mostly replace the old floating point unit for the majority of tasks (though it typically isn't used as such yet), and AMD's upcoming Hammer/Operaton will double the number of registers available.

    1. Re:RISC and CISC now the same thing by e8johan · · Score: 2

      I'd always prefer a RISC CPU since the instruction set is more general. In RISCs there are usually only general purpose registers (i.e. no cx for loops, etc.) which yeilds less complexity both in the hardware and in the compilers.

      Since x86s now days are RISCs with a CISC shell, why not simply remove that extra layer of complexity and simply introduce a plain RISC architecture.

      If you want to know how *bad* the x86 is, simply try too boot of a floppy and enter protected mode. You enter the CPU in 16 bits mode, have to fiddle with some special reigster, make sure to take a jump and then you're in.

    2. Re:RISC and CISC now the same thing by be-fan · · Score: 2

      As for registers, I was very interested to find that a modern P4 maps the 8 x86 registers to 128 internal registers. Compare this to a G4 which only has 48 internal registers (32 visible, 16 rename).

      --
      A deep unwavering belief is a sure sign you're missing something...
  31. Re:486 160 mhz? (History lane) by Jim+Norton · · Score: 2

    Well, they weren't lying about the clockspeed. Incidentally, I noticed large gains in applications which were dependant on floating point performance while using that processor (as opposed to a friends 486dx4-100) as well as higher floating point in benchmarks. It wasn't as fast as a P90 ... not quite (there were upgrade chips based on that brand of CPU which claimed to be equivalent to a P75)

    I had one for many years. I'd say it would compare favorably to a stock P75.

    --
    -- Jim
  32. heh.. by Suppafly · · Score: 2

    Sounds like next year might finally bring a worthy upgrade for my 486dx4-160

    I love it when people who never used prepentium systems try to talk like they did.. Everyone knows that a dx4 ran at 100mhz.

  33. Hammer delayed further? by Namarrgon · · Score: 3, Interesting
    Tech Report are reporting a story at the Inquirer which quotes AMD indicating it has "changed its roadmap schedule".

    They're saying that Barton will be here 1Q03, Sledgehammer is due 1H03, but now ClawHammer may be delayed until 2H03!

    Arghh. I thought the point was to do a 64 bit CPU without requiring an Itanium schedule...

    --
    Why would anyone engrave "Elbereth"?
  34. Re:AMD sucks! by fitten · · Score: 2, Insightful

    What do you want large caches for? Large caches aren't simply a trump card to be played to magically make all your applications faster. Misusing a cache can be detrimental to performance unless your cache is big enough to fit the entire application and all its data. Algorithmic enhancements can be used to make a huge difference... and in some situations, I can get higher performance out of a processor with 1/4 the cache size of another processor where a poorly written version of the application runs on certain types of data. All that being said, yes, larger caches will improve the execution of the majority of software (and certainly of algorithmically tuned software) but it is not the end-all, be-all solution.

  35. Mobile needs to run cool. by Steveftoth · · Score: 2

    The ultimate mobile processor should have a power saving mode that runs slower and won't burn your lap. My main prob with laptops is that you can no longer use them on your lap. They run too hot. This is of course due to the CPU, RAM and hard drive (maybe cdrom if spinning). But the CPU is on the most and runs the hottest of all those. They only put 4200 or 5400 rpm HDs in those machines so the HD can't get as hot as the CPU seems to get.
    Course it should also have a mode that burns through the case, but gets you those extra fragging frames on Q3 :)

  36. dx4-160 explained by Indy1 · · Score: 2

    the amd 486dx4-120 (which ran at 40mhz *3 ) was a great overclocker, and a whole LOT of people overclocked em to 40mhz * 4 = 160mhz.

    --
    Lawyers, MBA's, RIAA? A jedi fears not these things!
  37. PPC is not a great example of RISC by be-fan · · Score: 4, Informative

    I hate it whenever Mac-heads point to PPC and show how its such a great example of RISC that runs "all you're programs 2x as fast as the fastest Pentium4!" In all reality, the PowerPC line (not necessarily the POWER line) are very unimpressive. These days, a 1.25 GHz Alpha can still hold its own against a 2.5 GHz P4 in terms of floating point power. Yes, the same Alpha that has been neglected for the last half-decaded whose design has stagnated since the 21264 and whose process technology is antique compared to AMD's and Intel's. But the Alpha still keeps kicking x86 in the head. Yet, the PowerPC, running at the same 1.25 GHz, backed by the dual giants Motorola and IBM, built with leading edge copper fab technology, the second most common desktop RISC architecture (after x86 :) shipping in every single Apple computer isn't even competitive with the P4. Damn you DEC! Damn you to all hell!

    --
    A deep unwavering belief is a sure sign you're missing something...
  38. Re:486 160 mhz? (History lane) by ncc74656 · · Score: 2
    In terms of performance the fastest chip that fitted in a socket 3 was the Cyrix 5x86 120Mhz, which (again speaking of integer performance) was equivalent of a P100.

    I still have one of those kicking around on a Biostar 8433UUD...it's not currently installed in anything, though. For $350 (processor & motherboard) in late 1995/early 1996 (?), it was a deal. It outran a P5-133 Packard Bell at work (not too surprising, since the Packard Bell had no L2 cache and sh*tty onboard video vs. the #9 Motion 531 I had at home). The only downside was that the 40-MHz FSB of the 5x86-120 meant that the PCI bus had to be underclocked (to 26.67 MHz) to keep things stable. I suppose I could've tried running the processor at 133 MHz (4x33) instead of 120 (3x40), but slower access to the L2 cache would probably have made performance about the same.

    --
    20 January 2017: the End of an Error.
  39. Memory intensive benchmarks by AaronW · · Score: 2

    What I want to see is how it handles memory intensive benchmarks. I think this may be where it will shine, with the DDR interface built directly into the processor, thus eliminating latency and bottlenecks imposed by the north bridge.

    The other big advantage most people seem to forget is the amount of memory addressing capability. Where I work, we have racks of Linux X86 servers with 6GB of memory each. While there are hacks to go beyond 4GB, it gets kind of ugly. With Opteron, addressing 6GB or more of memory is not a problem.

    Also, with their Hypertransport bus and supporting multiple processors, the amount of memory scales with the number of CPUs.

    -Aaron

    --
    This post is encrypted twice with ROT-13. Documenting or attempting to crack this encryption is illegal.
  40. Hammer NOT delayed further by Namarrgon · · Score: 2
    The Inquirer have "clarified" their earlier statement:

    Clarification: AMD asks us to point out that Hammer schedules haven't slipped from its previous advice, as we originally suggested in this article. A spokesman from the company told us that desktop versions of Hammer are still planned to ship (for revenue) in Q1 2003 with systems on shelves at the turn of Q1 2003, not the second half of 2003 as we stated.

    --
    Why would anyone engrave "Elbereth"?
  41. Re:How do you do that? by captaineo · · Score: 2

    No. The containers consist of an unsafe "core" part that works with void* pointers, wrapped in a typesafe template.

    The idea is roughly like this:

    struct CoreListElement {
    struct CoreListElement *next;
    };

    class CoreList { // note: append() is implemented out-of-line
    void append(struct CoreListElement *elemnt);
    };

    template &lt typename T &gt
    class List {
    struct ListElement : public CoreListElement {
    T value;
    };

    CoreList core;
    void append(const T& t) {
    core.append((CoreListElement*) new ListElement(t));
    }
    };

    I also have Array<T> (vector), Map<T>, etc. These work almost exactly like STL containers, except most of the code is out of line so compiles are MUCH faster and binaries are MUCH smaller.

    My containers have the added advantage that you can embed ListElements in a structure or class to avoid allocating extra memory when inserting into a list or hashtable.