Slashdot Mirror


Intel - Market Doesn't Need Eight Cores

PeterK writes "TG Daily has posted an interesting interview with Intel's top mobility executive David Perlmutter. While he sideswipes AMD very carefully ('I am not underestimating the competition, but..'), he shares some details about the successor of Core, which goes by the name 'Nehalem.' Especially interesting are his remarks about power consumption, which he believes will 'dramatically' decrease in the next years as well as the number of cores in processors: Two are enough for now, four will be mainstream in three years and eight is something the desktop market does not need." From the article: "Core scales and it will be scaling to the level we expect it to. That also applies to the upcoming generations - they all will come with the right scaling factors. But, of course, I would be lying if I said that it scales from here to eternity. In general, I believe that we will be able to do very well against what AMD will be able to do. I want everybody to go from a frequency world to a number-of-cores-world. But especially in the client space, we have to be very careful with overloading the market with a number of cores and see what is useful."

16 of 548 comments (clear)

  1. We've heard that before. by GundamFan · · Score: 5, Insightful

    I don't doubt an "8 core" desktop will exist in the near future. Then again he has a point... we won't likely need it.

    --
    I don't give a damn for a man that can only spell a word one way.
    Mark Twain
    1. Re:We've heard that before. by kfg · · Score: 5, Funny

      I don't "need" an R sticker and turbo sound synthesizer either, but they sure make my FIAT 500 go faster.

      The Little Mouse that Roars!

      KFG

    2. Re:We've heard that before. by mrxak · · Score: 5, Interesting

      I frequently run as many as 8 programs at a time, sometimes more, but I seriously doubt each program would know what to do with its own core. With my two-CPU set-up, I find RAM to be almost the biggest limiting factor (although with 2GB, I've never actually run out). There's really no need for 8 cores until my brain is able to take multitasking to the next level, doing many many complex tasks that would gain benefit from (essentially) unlimited CPU power for each program.

      They say the biggest bottleneck of any modern computer is its user...

    3. Re:We've heard that before. by rbgaynor · · Score: 5, Funny

      eight is something the desktop market does not need

      So is he the only person on the planet who has not tried the Vista beta?

      --
      "Good things don't end with eum, they end with mania or teria." - H. Simpson
    4. Re:We've heard that before. by Mayhem178 · · Score: 5, Funny

      And that will almost let you run Oblivion at max settings.

      --

      "You will pay for your lack of vision..." - Emperor Palpatine to Ray Charles

    5. Re:We've heard that before. by guaigean · · Score: 5, Insightful

      The home consumer market isn't exactly the goal for technology like this intiially, and the price won't be inline with home consumers anyhow. This is the kind of stuff used in High Performance Computing, as a single computing node can maintain large amount of CPU performance with no transfer between nodes. 2GB is nothing in the HPC world, and 8 cores get filled up fast. While it may be easy to assume "I can't fill 1 CPU, what would I do with 8"? you have to remember that there are people out there running huge simulations, which could very easily use up many thousands of CPU's.

      Utility is in the eye of the user.

      --
      Microsoft Sucks, F/OSS Rocks. I get mod points now right?
    6. Re:We've heard that before. by samkass · · Score: 5, Insightful

      Isn't this the same thing they said about 64bit chips?

      Good point... yes, Intel said this about 64bit chips, and they were right. Almost nobody needs 64bit chips. But now virtually all chips are 64bits, wasting a lot of die real estate and engineering effort because of the perceived benefits driven more by AMD's marketing than reality. It's quite possible 8 cores could end up in the same boat-- AMD pushing it for no valid technological reason and Intel being forced to follow suit.

      --
      E pluribus unum
    7. Re:We've heard that before. by synx · · Score: 5, Interesting

      Interestingly enough as you recompile for 64 bits, you also need more memory as you get more memory. Now your memory alignment is now 8 bytes not 4 bytes, and your pointers are much later.

      I'd like to take a moment to rail against most commonly accepted forms of parallel education. I'm sure you were taught about threads, critical sections, semaphores, shared memory, etc.

      These are all inherently dangerous and difficult to program concepts. Write some application that is flexible and can run with N threads - usually this is hard, the best solution from Java-land is the concurrency toolkit which defines units of work which can be parallelized by a thread pool.

      However, there _is_ another way. "CSP" - communicating sequential programs. This is a method of writing naturally parallel systems that do not have the disadvantages of all of the above. (Standard concurrency debugging suggestion in java: "make the method synchronized") A practical example of this is the programming language Erlang. Ericsson invented this language to write high performance telco gear. Their ATM switch line is written in it. In Erlang you have many many 'processes' (not traditional OS processes, but defined in the VM) which cannot share memory - the only way they can communicate is via async messages. You can build a synchronous call on top of async messages pretty trivially (after all, all syncronous network protocols are based on IP which is asynchronous). You never have to worry about memory stomps, or critical sections. You _do_ have to design your applications differently, but it is most definitely worth it.

      Another interesting thing about this is your applications naturally parallelize. The "R11" release was just put out, which included SMP support. The previous versions would only use 1 CPU, but this version will use all your CPUs, which means if you have multiple processes ready to run, they'll run on as many CPUs you have! Instant SMP support, no redesign, no RECOMPILE necessary.

      This kind of language technology is what is necessary to get us to the next level. A similar thing is possible with Functional languages such as OCaml, Haskell, etc.

      I've been working in the industry for 5 years and I'm currently working on a Erlang project. My company was fairly conservative in terms of languages, there was a standing order (until about 2000) "no C++".

  2. Silly Perlmutter by ackthpt · · Score: 5, Funny

    If the home user can justify (even indirectly due to demands of the operating system or changes in software architecture) 4 cores then 8 is immenently logical. Seems some minds at Intel are falling back to the dubious position they held regarding home users never needing 64 bit CPUs. Then again, maybe they're just playing dumb and are slaving away, burning midnight oil by the drum, to make 8 and 16 core processors.

    Three Cores for the Clippy, but I don't know why,
    Seven for the Vista kernel which is defect prone,
    Nine for for Bloat which will make the cooling fry,
    One for the Screensaver to toil alone,
    In the Land of Redmond where Marketing lies.
    One Core to rule them all, One Core to find them,
    One Core to bring them all and in the darkness bind them
    In the Land of Redmond where Marketing lies.

    --

    A feeling of having made the same mistake before: Deja Foobar
  3. Neither four nor eight. by MarkByers · · Score: 5, Funny

    I think there is a world market for maybe five cores.

    --
    I'll probably be modded down for this...
  4. Re:well, by man_of_mr_e · · Score: 5, Insightful

    I think he was talking about the foreseeable future.

    1 core is really enough for most users. 2 cores is enough for most power users. 4 cores will be enough for all but the most demanding jobs. Workstations are different, however and are not usually considered part of the "desktop". For example, I could see 3D artists using 4 or 8 cores easily. In fact, there's simply no such thing as a computer that's "too fast" for certain purposes.

    The issue, though, is one of moderation. Why would a desktop user want 8 cores, which are drawing insane amounts of power, when they're not even utilizing 4 to full advantage? Word processing, accounting, and surfing the web don't need any of this. Games? I can imagine in 10+ years we'll have some photo-realistic 3D games that run in real-time, but the vast majority of the work will likely be handled by GPU's and won't need 8 cores to deal with it.

    I simply cannot fathom a purpose for 8 cores for any "desktop" application that isn't in the "workstation" class.

  5. Do all cores have to be smart? by spyrochaete · · Score: 5, Interesting

    I recently read about a 1024-core chip for small devices like cell phones Each core ran on a simplified instruction set and specialized in a certain task like muting the microphone when incoming sounds are too quiet, smoothing text on the low resolution screen, and other minute tasks. Individual cores could be placed in low power sleep mode until the software dictated a need for that instruction set.

    Is it possible to couple CISC and RISC cores on one die? Is this how the math coprocessors of the 386 era worked? This sounds like an ideal solution to me since nobody needs 4 or 8 cores to be fully powered and ready to pounce at all times.

  6. Re:Question. by Aadain2001 · · Score: 5, Informative
    It depends on which memory bottleneck you are talking about. There is a memory hierarchy in computers, with the fastest also being the closest to the processor, the level 1 or L1 cache (usually split into separate data and instruction caches). This is then tied into a much larger, but slower, L2 cache (combined instruction and data lines). Some processors use an L3 cache, but not many these days. Current processors have L1 and L2 directly on the chip. If you see those die pictures they show off to the press, the largest areas of the chip are the caches. Finally, the chip can go across the front side bus and access the main system memory, which is very large compared to the L2 and L1 caches, but much slower in terms of number of cycles to access.

    So which bottleneck are you refering to? The new Core 2 Duo chips of Intel's share the L2 cache and, as far as I can tell from the reviews I have read, this setup works very well. Both chips can share data very quickly or when executing a single sequential program one of the cores can use all of the L2 cache (which in the Extreme Edition verion is up to 4MB!). Or are you refering to the main memory? It is possible for both cores to need to access the main memory at the same time, but modern pre-fetching and aggress speculation techniques reduce how often that occurs and the timing penalties when they do occur. And of course, the larger the L2 cache the more memory can be stored on the chip at once, reducing the need to access the main memory very often. According to Intel's own internal testing, they had a very hard time using all of the bandwidth the current front side bus and memory offers, which means the main memory shouldn't be a bottleneck.

    So what is the bottleneck you are refering to?

    --
    Space for rent, inquire within
  7. 6 Coors enough by CrazyJim1 · · Score: 5, Funny

    The 6 pack has been tried and true, why try and stuff an additional 2 Coors into it.

  8. I'll be the first to say it... by Junior+J.+Junior+III · · Score: 5, Funny

    640 cores ought to be enough for anybody.

    --
    You see? You see? Your stupid minds! Stupid! Stupid!
  9. Yes and no : depends on the brand by DrYak · · Score: 5, Informative

    Not quite exactly. Things depends on the brand.

    For Intel that's exactly the case :
    With current intel architecture, memory is interfaced with the NorthBridge.
    With multicore and multiproc systems, all chips communicate to the NorthBridge and get their memory access from there.
    So more cores and processors means same pipe must be shared by more, and there for memory bandwith per core is lower.
    Intel must modify their motherboard design. They must invent QUAD-channel memory bus, they must push newer and faster memory types (that's what hapenned with DDR-II ! They needed the faster datarates, even if those come at cost of latency), etc...

    But the more their pursue in this direction, the more latency they add to the system. Which in the end will put them in a dead end. (Somewhat like the deeper pipe of their quest for Gigahertz put them in dead-end of burning-hot and power-hungry P4).

    For AMD that's not quite the same :
    With the architecture that AMD started with the AMD64 series, memory is directly interfaced with a memory controller that is on-die with the Chip.
    The multiple procs and the rest of the mother board communicate using a standarized HyperTransport.
    The rest of the mother board doesn't even know what's hapenning up there with the memory.
    And with the advent of HyperTransport-plugs (HTX) the mother board doesn't even realy need to know it.
    Riser cards with Memory-And-CPU-Both-of-Them (à la Slot 1) is possible (and highly anticipated, because it'll make possible a much wider possibility of specialized accelerators to be plugged than currently with AM2 socket)

    The most widely publicised advantages of this structure are the lower latency.
    But this also makes it easier to scale up memory bandwith : Just add another on-board memory controller and voilà you have dual-channel. That was the differences between first generations of entry-level AMD64 (Athlon 64 for 7## socket : one controller - single channel, Athlon FX for 9## socket : 2 controllers, dual channel).
    by the time 8 cores processors come out and if CPU riser-board with standart HTX connector appears, nothing will prevent AMD to just build riser board designed for 8 cores chips with 4 memory controllers (and Quad-channel speed). Just change the riser board, memory speed will scale. Mother board doesn't need to be re-designed. In fact, same mother board could be kept.
    And this won't come at the price of latency or whatever : the memory controller is ON the cpu die, and must not be shared with anything.

    In fact, that's partially already happening :
    In the case of multi procsystems, instead of all procs sharing the same pipe thru the NorthBridge, each chips has it's own controller going at full speed.
    And this memory can be shared over the HT bus (albeit with some latency).
    It's basically 4 memory controllers (2 per proc) working together. Acheiving quad-channel alike shouldn't be that difficult.
    Specially when Intel is pushing the memory standart to chips with higher latency : asking for more bandwith in parallel over the HT-bus won't be that much penalizing.

    So I think AMD will be faster at developping solutions to scale against higher number of cores than Intel, due to better architecture.

    Maybe, it's not a coincidence that AMD is working on technology to "bind together" cores and present them as single proc to not-enough SMP-optimized software, and that at the same time Intel is telling who ever wants to listen to them that 4 cores is enough, 8 is too much. (Yeah, sure, just tell it to the database- and Sun Niagara people. Or even to older BeOS users. This just sounds like "640k is enough for everyone")

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]