Slashdot Mirror


Ask Slashdot: Is the Gap Between Data Access Speeds Widening Or Narrowing?

New submitter DidgetMaster writes: Everyone knows that CPU registers are much faster than level1, level2, and level3 caches. Likewise, those caches are much faster than RAM; and RAM in turn is much faster than disk (even SSD). But the past 30 years have seen tremendous improvements in data access speeds at all these levels. RAM today is much, much faster than RAM 10, 20, or 30 years ago. Disk accesses are also tremendously faster than previously as steady improvements in hard drive technology and the even more impressive gains in flash memory have occurred. Is the 'gap' between the fastest RAM and the fastest disks bigger or smaller now than the gap was 10 or 20 years ago? Are the gaps between all the various levels getting bigger or smaller? Anyone know of a definitive source that tracks these gaps over time?

92 comments

  1. The gaps are getting smaller by NotInHere · · Score: 2

    The distance between the "fastest" and "slowest" gets larger and larger, but the gaps are getting smaller because things like SSDs fill them.

    1. Re:The gaps are getting smaller by Anonymous Coward · · Score: 1

      The distance between the "fastest" and "slowest" gets larger and larger, but the gaps are getting smaller because things like SSDs fill them.

      Except the bus it (and everything else) is connected to hasn't kept up. Try building a computer to solve real problems, and you'll quickly learn this.

    2. Re:The gaps are getting smaller by Anonymous Coward · · Score: 0

      The problem is that the weakest link in a chain determines the strength of the chain. Thus slowest device causes everything to wait for it. This is referred to as a bottleneck.

      The next issue is that we have some issues with transistors and switching. Transistors have what is called a slew rate, this determines how long it takes a transistor to switch from it's on state to it's off state. But what happens when you switch the transistor on and off too fast, well you will end up with your transistor behaving more like a resistor, as it will be unable to react fast enough. The problem is that for most silicone based transistors this occurs somewhere around 5GHz.
      So why are most processors closer to 4 GHz and slower? Well that has more to do with chaining this effect. To get transistor logic you need several transistors, all in series. (like a chain) Now each of these transistors take a certain amount of time to switch on or off and we need to wait for the slowest part of the transistor chain. (Yes, i know it's more complicated than that but I don't want to give you a lesson on digital logic) Since we have had processors that can run at close to max speed for nearly a decade now we have had to work around this limitation.

      The solutions:
      Handle more data at a time; 32 bit upgraded to 64 bit (while not always faster there are cases where there will be a significant speed boots), Multi threading (less wasted time), Multiple cores (let the computer do more than one thing at once (only usable when one operation does not depend on another).
      Increase amounts of faster memory; Cache increases, more ram, etc...
      Faster technologies replace slower; Floppy disk -> CD-Rom -> USB flash drive, HDD -> SSD, etc...

      So to answer your question, it's both yes and no. We cant go much faster with the current processor technologies, though we do have some processing alternatives, but we do have many ways around the limitations of current technologies as well. In the end it's still faster just maybe not in the way you expect.

  2. Can do basic maths by psinet · · Score: 1

    ...but not just because someone else is too lazy to do it themselves. Soing the maths would have taken less effort than writing this /. fill-piece.

  3. We don't really know by Anonymous Coward · · Score: 0

    There is currently a real gap gap.

  4. Does it matter? by i.r.id10t · · Score: 2

    Does it matter? Fast CPU, fast RAM, fast disks is like having no speed limits on every race track in the world - but in order to get from track to track you have to go on the interstates or perhaps back country roads (PCI bus, etc). Sure, each component is fast and getting faster, but the way those components connect to each other hasn't changed all that much...

    --
    Don't blame me, I voted for Kodos
    1. Re:Does it matter? by Anonymous Coward · · Score: 3, Interesting

      I disagree, the interconnecting paths may not be keeping up as fast but look back at the original ISA bus or before that S100. The PC/AT 16 bit slots more than doubled the speed of the original ISA. Specialized video busses were all the rage not that long ago. PCI has grown to become PCIX and multi lane. For us greybeards, that a serial bus operated faster than a parallel bus remains one of those great mysteries.

    2. Re:Does it matter? by Wycliffe · · Score: 2

      Yes. It matters. Price also matters. If the price/performance gap is eliminated then major architectural changes are both possible and likely. We're already seeing this with tape vs harddrive where people are just archiving to an external harddrives instead of tape. Think about what would be possible if RAM was cheaper than harddisks and also non-volatile. Then everything could be in ram and even "saving to disk" becomes a thing of the past. Likewise, if SSDs becomes just as fast as ram, then why differentiate between ram and harddrive at all, just have "memory" which can be used or reused however you like.

    3. Re:Does it matter? by Anonymous Coward · · Score: 1

      Yeah, things like this matter. I was once told by a person who worked for Microsoft that this person was assigned the task of overseeing some research for an upcoming Xbox system. The research involved having prototype units run various pieces of code to see which piece of code ran the fastest. It was rather assumed that the different versions of the code were equally compatible, coming up with the same result. However, automated benchmarking was measuring speed.

      See, it doesn't even matter which piece of hardware is in the faster category. We're looking at having machines calculate the precise math of exactly how much one thing is faster than another, and that information is being used during the development process.

      This will continue to matter as technology keeps exploring the limits provided by concepts like parallel processing, virtualization, and so forth.

      Plus, some categories may change over time. For instance, when I got my first computer, a hard drive was much faster at handling data than a 14.4kbps modem, and even then I didn't have a 14.4kbps modem like my friends. I had a 2.4kbps modem. ATA 100MB/sec was way faster, and I learned that local storage is much faster than communications. But fast forward to later, and the numbers of a hard drive (e.g., SATA 3.2 is 16Gbps) may not be nearly as fast as some of the higher end networking gear (40Gbps). So all kinds of things are subject to change, even things that have been true for a long time.

    4. Re:Does it matter? by godrik · · Score: 2

      That is so untrue. What are you talking about!?
      PCI-express is much faster than the late AGP, PCI and ISA buses, you get over 15GB/s from one GPU to an other one. SATA and SCSI have had a good run, but are being remplaced by PCI-express for high throughput devices. In clusters, Infiniband decreased latency massively compared to ethernet, and gives you bandwidth of over 100Gb/s.

      So surely, the interconnect is a factor slower than devices, but that's pretty much always has been the case.

    5. Re:Does it matter? by Anonymous Coward · · Score: 0

      Does it matter? Fast CPU, fast RAM, fast disks is like having no speed limits on every race track in the world - but in order to get from track to track you have to go on the interstates or perhaps back country roads (PCI bus, etc). Sure, each component is fast and getting faster, but the way those components connect to each other hasn't changed all that much...

      You should be modded up and the idiots that replied to you should be modded down because they obviously have no clue where bottlenecks in a computing system exist. Perhaps they should spend the time to build a HPC system and learn something about real computing. The bus is more often the bottleneck than anything connected to it in modern computing systems.

    6. Re:Does it matter? by sexconker · · Score: 2

      fast disks

      I showed your mom my fast disks, but she only had an ISA port.

    7. Re: Does it matter? by Anonymous Coward · · Score: 1

      PCIE. Not x. X died a while ago.

      Serial outperforming is simple. Multiple serial busses with switching on each end. They're independent but they send packetised data in parallel.

    8. Re:Does it matter? by peragrin · · Score: 1

      wait until they start sticking sad'd on the ram bus

      --
      i thought once I was found, but it was only a dream.
    9. Re:Does it matter? by drinkypoo · · Score: 2

      I had RAM on the 8-bit ISA bus back in the PC-1 days, but back then the CPU was so slow that wasn't a serious impediment. Today, however, we have specialized (and seriously fast) memory buses, and we have massive interconnects between the CPU and the chipset, and we have large numbers of PCI-E lanes. High-speed SSDs are designed to sit right on a useful number of PCI-E lanes. These days a PC actually does have massive bandwidth, in a way it never did before; it was crippled between basically the time PCs got into double-digit MHz speeds and the time when PCI-E became the prevalent bus.

      In my current PC, the only stuff connected to a PCI bus (which is chained off a PCI-E bus) is IEEE1394 and some of my USB ports. Everything else connects directly to PCI-E, down to my audio hardware. I have overclocked dual-channel memory... yet benchmarks prove that this makes basically no performance difference. (Still, it's got an XMP profile, why not use it?) Bandwidth is not something which concerns me in my PC any more.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    10. Re:Does it matter? by Khyber · · Score: 2

      " The bus is more often the bottleneck than anything connected to it in modern computing systems."

      Not even close. Quite often it's the underlying architecture itself causing the bottlenecks.

      Example, Intel's latest and greatest Xeons FUCKING SUCK. Why? Because their internal architecture to deliver data across cores is gimped beyond belief. You can run 2 CPU x 4 GPU, 4 CPU x 2 GPU, but you can't do 4 CPU x 4 GPU. Meanwhile, I've got far older AMD systems that run 4 CPU x 4 GPU without a problem.

      --
      Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
  5. WTF post, come on kids by Anonymous Coward · · Score: 1

    This could literally be answered with three google searches. '2015 l2 cache speed' 'ddr4 speed' '2015 ssd speed'

    1. Re:WTF post, come on kids by jeffb+(2.718) · · Score: 3, Funny

      Ancient scrolls of dubious provenance hint darkly that DDR4 was not the first inhabitant of the RAM slots we consider so permanent. Debased cultists still sometimes mutter chants mentioning "PC100", or even uncouth syllables such as "korr"...

    2. Re:WTF post, come on kids by Anonymous Coward · · Score: 2, Interesting

      This could literally be answered with three google searches. '2015 l2 cache speed'

      This /. article, plus one called "Casino lock flooring [...] play casino online" which 404s in Norwegian when you click on it.

      'ddr4 speed'

      As it turns out, a whole bunch of really technical reviews on DDR4 memory, plus 'scope/test gear for testing DDR4 bus access. At least partially, potentially, useful, if you're prepared to wade through a bunch of dense stuff.

      '2015 ssd speed'

      A whole bunch of MacBook reviews/unpaid ads, followed at the end by a Toshiba and a Kingston paid ad (sucks to be them, should have paid more).

      So, ummm, like most LMGTFY trolls who think they're way smarter than they are, the actual results are far less useful than just asking someone who knows what they're talking about for the answer. (Whether /. counts as that is an entirely different matter, though there's usually at least one person who appears to know what they're talking about and will answer the question usefully and honestly without being a smug stuck up prick.)

    3. Re:WTF post, come on kids by Anonymous Coward · · Score: 0

      This could literally be answered with three google searches. '2015 l2 cache speed' 'ddr4 speed' '2015 ssd speed'

      But Timmy! Each of his fucking posts seems to get more retarded than the last. He must have some sort of degenerative disease, I mean seriously, how can you get older and yet so much dumber.

    4. Re:WTF post, come on kids by Anonymous Coward · · Score: 0

      Seriously, that was one hell of an autistic reply.

    5. Re:WTF post, come on kids by Anonymous Coward · · Score: 1

      (Whether /. counts as that is an entirely different matter, though there's usually at least one person who appears to know what they're talking about and will answer the question usefully and honestly without being a smug stuck up prick.)

      So, you're ruling yourself out for an answer then?

    6. Re:WTF post, come on kids by HiThere · · Score: 2

      Yes, he was ruling himself out as unable to answer. So am I. And it would take a *LOT* more than a Google search to answer. I lean towards agreeing with the people who cite bus speed as the limiting factor, but I'm not sure, and there could certainly be special circumstances where something else was the limit.

      I *do* know that it's not an easy question to answer, and that any answer is going to depend for its correctness on a presumed workload. (Some things are CPU bound, and don't even use much RAM. Other things are IO bound, and make you think your disks are thrashing. Most things are somewhere in between.)

      But the original question was "has the gap between fast-small memory and slow-large memory gotten larger, smaller, or stayed the same. Even that's an oversimplified question, as it doesn't deal with persistent RAM. (I'm dubious about the value of that, but some people used it to advantage in the days of core-memory.) Also ignored were the questions of relative cost. If you pay ENOUGH there are lots of exotic technologies...and I have no idea of the tradeoffs.

      So much better to get the answer from somebody who actually knows the area. It's not simple.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    7. Re:WTF post, come on kids by TheOneFreeman · · Score: 1

      I found information on PC100 but what's korr?

    8. Re:WTF post, come on kids by jeffb+(2.718) · · Score: 1

      Phonetic rendering of "core".

    9. Re:WTF post, come on kids by TheOneFreeman · · Score: 1

      Thanks, does this refer to some older form of RAM too?

  6. Going Backwards by Anonymous Coward · · Score: 0

    Memristers can be layed out in a large grid which might be the future, but I really don't see the point of talking about the evolution of the hard drive into the cpu.

  7. What are you really asking? by jeffb+(2.718) · · Score: 3, Insightful

    I'm not sure what a historic timeline of these ratios (not "differences", please) would gain you.

    These ratios can have a big impact on what algorithms and implementations you choose to maximize performance. I suppose if, say, the ratio of RAM to disk speed increased by a factor of 10 over the decade before last, then decreased back to its original ratio in the last decade, it might be worth trawling through some old papers (or old source trees) to revisit lessons learned in the earlier period -- but that seems like a bit of a stretch.

    If you're just curious, it shouldn't be too hard to generate timelines of CPU cycle speeds, cache and RAM latencies and bandwidths, disk performance, and so on. But really, each of those has enough factors that a simple "ratio" would probably conceal more than it illuminates.

    1. Re:What are you really asking? by hydrodog · · Score: 1

      I for one would like to see a historic timeline of absolute numbers for CPUs, memory, and mass storage. But that is not so easy to do. I have found little snippets here and there on Wikipedia, but not even a single master list of CPUs, let alone more hardware. There are master lists of CPU benchmarks but not spanning generations and radically different CPU sizes obviously. Here's DDR3 RAM: https://en.wikipedia.org/wiki/... DDR4, not really there in Wikipedia, though there are some articles that talk around the subject: http://www.extremetech.com/ext... A discussion of the LINPACK benchmark: http://www.netlib.org/utk/peop... History of hard drives, plenty of data but not complete and not tabular: https://en.wikipedia.org/wiki/... If you know where to get tables of the raw data without an enormous amount of work, I'd like to see it.

  8. Writing a paper by Anonymous Coward · · Score: 3, Insightful

    for a CS or IT class?

  9. Is the Gap Widening Or Narrowing? by fleabay · · Score: 3, Informative

    Yes, yes it is.

    1. Re:Is the Gap Widening Or Narrowing? by David_Hart · · Score: 1

      Yes, yes it is.

      It depends on your diet and how much exercise.... oh, you mean tech, not your waist line....

    2. Re: Is the Gap Widening Or Narrowing? by Anonymous Coward · · Score: 0

      Gotta have gap

  10. As with most "X OR [almost all NOT X]" questions.. by Anonymous Coward · · Score: 0

    ... the answer is almost certainly "yes."

    Then again, the gap could be the same as it was decades ago. But my hunch is the answer is yes, "the Gap Between Data Access Speeds [is] Widening Or Narrowing".

  11. The memory wall by Anonymous Coward · · Score: 1

    is still in effect. Every memory speed increase is perforce dwarfed by speed increases above in the hierarchy. The ongoing stall in increasing CPU clock speeds has obscured the nature of the problem, but we'll never go back to the 1980s when CPUs were not fast enough to cause memory hot spots.

    No one has attacked the problem successfully except for architects who have designed split-phase memory transactions for loads and stores along with the capability for many of them to be simultaneously in flight. This requires lots of silicon for management, and significant investment in compiler technology to be effective, a cost that almost no one has been willing to bear.

    1. Re:The memory wall by Anonymous Coward · · Score: 0

      It occurs to me that some may never have read this seminal little note:
      http://www.researchgate.net/publication/2429866_Hitting_the_Memory_Wall_Implications_of_the_Obvious
      We spend far too much time in computer science re-inventing what other people have known for a long time.

    2. Re:The memory wall by Anonymous Coward · · Score: 0

      Yep read it an weep.

      Long time ago I realized the size of bulk memory, dram, flash, and magnetic storage was outstripping access times. You could see that in the nineties. 18 months and the size doubles, but the access time decreases 20% which means the amount of time it takes to go through the systems memory increases. Programmers were like, 'yay more memory!' not really getting that increasing the memory footprint means slower.

      I suspect moves to sideline Java and other traditional garbage collected languages is due to their tendency to create vast numbers of objects on the heap doesn't work well on architectures where cache misses totally wreck your performance. Plus multi-core systems where having to synchronize the state of the world, also wrecks performance.

    3. Re:The memory wall by queazocotal · · Score: 1

      My first computer was a ZX81.
      This had a 16K RAM pack, with 250ns RAM.
      This could read a random byte of RAM in 470ns.
      My current system can access a byte of RAM in around 50ns.

      That is a factor of ten improvement, when RAM size has increased a million fold.
      Caches ammeliorate this.
      But they do not eliminate it.
      #cachemissesmatter

    4. Re:The memory wall by Anonymous Coward · · Score: 0

      So your ZX81 could read every byte in ram in 7.7ms
      Your current machine, say it has 4GB of ram, 107 seconds assuming 64 bit reads.

  12. lolwut? by kuzb · · Score: 2

    "Everyone knows that CPU registers are much faster than level1, level2, and level3 caches."

    I'd argue that most people don't even know what a CPU register is, never mind what it's faster than.

    --
    BeauHD. Worst editor since kdawson.
    1. Re: lolwut? by Anonymous Coward · · Score: 0

      Agreed. I bet not even every employee at Intel knows.

    2. Re:lolwut? by freeze128 · · Score: 2

      And apparently, now everybody knows that 15 minutes can save you 15% or more on car insurance. Well, Screw that! We need to get the word out. CPU registers are faster than level 1 cache! Suck it, Geico.

    3. Re:lolwut? by Anonymous Coward · · Score: 0

      But how else is warren buffet supposed to get his money?!

    4. Re:lolwut? by Anonymous Coward · · Score: 0

      And apparently, now everybody knows that 15 minutes can save you 15% or more on car insurance. Well, Screw that! We need to get the word out. CPU registers are faster than level 1 cache! Suck it, Geico.

      This is awesome. 10/10

  13. Re: by Anonymous Coward · · Score: 0

    Personally, We've seen the clock rates stall essentially for 15 years. Instead slight of hand has been used through multi-core tech. A brute force measure until they fix the main issue. Clock speeds at the bus level without having to use multipliers.

  14. Too complicated to answer by gman003 · · Score: 2

    Originally, there was CPU registers, and memory. Then there was registers, memory, and disk. Then there was registers, SRAM cache, memory, and disk. Then there was registers, L1 cache (on CPU), L2 cache (on mobo), DRAM, and disk. Then the L2 moved onto the CPU. Then there was L3. Then SSDs were added between RAM and disk. Now some chips have an L4 cache on the CPU package (but not the CPU die).

    Oh, and there's a difference between latency and bandwidth. DRAM latency has not significantly improved over time, particularly compared to DRAM bandwidth.

    And with multiple cores, some levels are core-specific while others are not. You can even have a bizarre situation where L1 cache is per-core, L2 cache is shared between two cores, and L3 cache is per-CPU (in SMP setups, that means main RAM is the first level shared among all cores).

    1. Re:Too complicated to answer by Blaskowicz · · Score: 2

      And IBM has made external memory controllers with 16MB L4 cache in each of them, called "Centaur".

    2. Re:Too complicated to answer by Anonymous Coward · · Score: 0

      That's for Power processors, real men use mainframes. Their external memory controllers also have some amount of L4 cache, 480MB to be exact, a modest increase of a factor 30 over the puny Power memory controllers.

    3. Re:Too complicated to answer by Blaskowicz · · Score: 1

      Well I've checked and it seems three z13 CPU share 480MB L4, and one CPU ("Storage Controller") can have 480MB to its own.
      Three full POWER8 would have 128MB L4 each, or 384MB if you add them up.
      You can add up cache figures like that but it seems suprisingly close.

    4. Re:Too complicated to answer by Blaskowicz · · Score: 1

      can't

    5. Re:Too complicated to answer by Anonymous Coward · · Score: 0

      There were a couple of additional steps: registers, memory, and paper tape; registers, memory, and magnetic tape of various types (ranging from audio cassettes to Big Iron). Granted, there was some overlap between use of tape and use of disks (floppy or otherwise).

      Remember ExaTape?

  15. Objectives are different. by hcs_$reboot · · Score: 1

    For a number of years, we were in the technological progress era, we're now in the commercial progress era.

    --
    Slashdot, fix the reply notifications... You won't get away with it...
  16. Depends by Anonymous Coward · · Score: 1

    I think that a lot of good IT folks are looking at where the bottlenecks are within the technologies available and making implementation decisions in a much more detailed way. Right now, and has often been the case, the bus becomes the bottleneck for a lot of applications. Sure there are memory bound, and I/O bound problems that still see the gaps between memory speed and disk read/write speeds as an issue, but there are far more problems that are handicapped due to the system bus speed being hobbled, comparatively. One prime example being problems that could be solved by GPGPU systems that have to swap a lot of data across the system bus to an expansion bus and get hung up doing so. The same could be said for coprocessors like the Xeon Phi. Bus speeds have always lagged other technologies that interface with them, and that's really a concern for most computing problems. Overall, I'd say that the gaps are getting smaller, but there is still a lot of work to be done to integrate the various protocols and technologies used in the end-to-end computing environment in order to make things more streamlined from a data throughput point of view.

  17. RAM latency is not getting much faster by hkultala · · Score: 4, Informative

    The latency of RAM is improving very slowly, only something like 2x-4x improvement in last 20 years.

    Only the bandwidth of the memory is growing faster, and that's just because they have been putting more dram cells in parallel, always doing bigger data transfers and having faster memory bus.

    Same is true for hard disk drive speed, the rotation speeds dictates the random access latency and the rotation speed of average hard disk has only gone up from 4200 or 5400 to 7200 rpm in the last 20 years, meaning only 1.7 or 1.33 times improvement in random access latency

      Though replacing hard disks with flash-based SSD storage has improved latency by a huge margin.

    1. Re:RAM latency is not getting much faster by jonbryce · · Score: 1

      And CPU clock-speed hasn't really increased at all in the last 10 years. We are still at just over 3GHz. Of course we have had massive performance improvements in other areas.

    2. Re:RAM latency is not getting much faster by Anonymous Coward · · Score: 0

      You haven't seen Intel's i9. They just haven't revealed it because they have 0 competition from AMD and their i7 beats the pants off AMD..

    3. Re:RAM latency is not getting much faster by Anonymous Coward · · Score: 0

      I'm a time traveler and I've already seen the i860 and i960 which are retro-purist RISC processors!

    4. Re:RAM latency is not getting much faster by Robear · · Score: 1

      Replacing hard drives with SSDs still leaves another bottleneck. The disks have to connect to the cpu somehow. If they are internal (as in a home pc), then you only get a few disks, but they connect at PCIe speeds. If you need more disks, you go to a SAN. But then you're putting your disks at the end of *network* latencies; there's definitely a wall there. You can't cache your way out of the transmission delays on the SAN... Other solutions are used which essentially move the software and/or data closer to the hardware.

      --
      French - The lingua franca of Europe!
  18. Gaps are smaller by candude43 · · Score: 1

    There are orders of magnitude difference in the access times. Disk access is measured in milliseconds; memory access is measured in nanoseconds; register speed in picoseconds. Improving disk access by even 1 millisecond closes the gap.

  19. CPU Speed is crazy as well by Anonymous Coward · · Score: 0

    Another thing to take into account is CPU speed.
    A Xeon5560 (Nehalem) at 2.8Ghz will execute 89.6 instructions per nanosecond.
    or, about 89,600,000 instructions per milisecond. how many miliseconds does it take to access flash?

    math: nehalem = 4 core * 8 single precision per clock = 32 ops per clock * 2.8ghz = 89.6 GigaOp = 89.6 ops per nanosecond

    Geek note: yes, I am aware of FLOP calcs. yes, I am aware of SIMD. yes, I am aware of out of order and all the fun stuff in the CPU, but we have to have SOMETHING to measure

  20. Disk *seek* times are still very slow by DrJimbo · · Score: 1

    A modern cheap laptop from today is faster than a Convex supercomputer I was using 20 years ago. But in that same time span disk seek times dropped from 15 milliseconds down to about 7.5 milliseconds which is only a factor of two.

    --
    We don't see the world as it is, we see it as we are.
    -- Anais Nin
    1. Re:Disk *seek* times are still very slow by drinkypoo · · Score: 1

      A modern cheap laptop from today is faster than a Convex supercomputer I was using 20 years ago. But in that same time span disk seek times dropped from 15 milliseconds down to about 7.5 milliseconds which is only a factor of two.

      Manufacturers were exploring expensive strategies to speed up seek times, like multiple armatures, independent arms, etc. Then hybrid drives and massive arrays became things and there was no more reason to bother trying to improve seeks on spinning rust. We're all just hoping that it will be cost-effective to stop using it sometime soon.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  21. Yes, it does it matter by amplesand · · Score: 1

    Yes, it does it matter.

    For example, if you want to sort objects which does not fit at one level, the sorting may spill over to a next level of much slower memory. This imbalance of speed is relevant regardless of the interconnect speed limits, as a delta as such is the culprit.

    There are some ways to improve on that.

    From Wikipedia > Sorting_algorithm > Memory_usage_patterns_and_index_sorting
    https://en.wikipedia.org/wiki/...

    That is one example of a solution to a problem that matters.

  22. The gaps are still there. by Anonymous Coward · · Score: 3, Interesting

    20 years ago main memory was 10-14 ns, instruction cycle time was 2-4ns (Cray)

    Guess what? it still is.

    Memory has grown, it has gotten cheaper.

    What HASN'T changed? Access to memory. That is how Cray got its speed - instead of a single port to memory, it used a crossbar switch - 4 ports for each processor. 1 instruction bus, 2 input data busses, and one output bus; even I/O got its own port to memory; all with overlapping address/data cycles.

    The effect was that all of main memory worked at the speed of cache, thus the CPU had no need to waste silicon on cache memory - and the entire system ran full speed.

    What slows down the current systems? Memory access. Most systems only have a single port to main memory. Some servers and "high performance" desktops have dual ported memory. Yet even dual ported memory access is slow when you have to share it among 4/8 cores... plus I/O (which isn't dual ported). Interrupt latency on PCs is really horrible. Still only 15 IRQs? and have to share them? No direct vectoring? Forced interrupt chain actions? Even the old PDP 11 with ONE interrupt request line allowed direct interrupt vectoring (64 basic vectors) to reduce overhead.

    There hasn't been much innovation in architecture in over 20 years.

    1. Re:The gaps are still there. by Anonymous Coward · · Score: 0

      Err, we're not still using 8259s, buddy. APICs are a little better.

    2. Re:The gaps are still there. by business_kid · · Score: 2

      I am a dissenter from Moore's law. There are physical limits and we are near them. There is a physical limit on fab size, charge time, propagation time, signal speed, current and heat. There are also pricing issues: Would you pay $300 extra for faster ram? It could probably be done by using cache. I doubt if users would pay.
      If we could fit the entire system on a chip, Then you could speed up. But choice goes out. No choice at all.

    3. Re:The gaps are still there. by Anonymous Coward · · Score: 0

      No - because you STILL can't get data in or out.

      so it remains SLOW.

    4. Re:The gaps are still there. by rubycodez · · Score: 1

      there has been MUCH improvement to computer architecture, but most slashdotters think only desktop PCs and desktop PC-esque server exist.

      a mainframe has a thousand or more channels to memory

    5. Re:The gaps are still there. by Morpf · · Score: 1

      And even if you had a trillion channels to main memory, you'd still only improve bandwidth and keep the same latency.

    6. Re:The gaps are still there. by Anonymous Coward · · Score: 0

      Actually APIC (the A must stand for awkward IMO) are worse. For example they easily lose edge triggered interrupts (there are enough Linus' rants about it on the kernel mailing list to prove my point. Besides that the APIC programming is awfully complex, some versions using a specialized bus cause lots of race conditions between APIC programming and CPU interrupt responses.
      The real progress are MSI coming with PCIe.

    7. Re:The gaps are still there. by Anonymous Coward · · Score: 0

      There hasn't been much innovation in architecture in over 20 years.

      True, current Intel processors cane easily be regarded as descendants of the Pentium Pro. The only time they tried something new was with the P4, and we all know how it turned out.

      Oops sorry, I forgot the Itanium, but he market has decided that a single architecture from effectively a single vendor is enough. I seriously disagree with this: monoculture is bad, has always been bad and will always be bad. But other architectures (Sparc, Power, Z) are now niche, and ARM has not made a dent in laptops, desktops and servers

    8. Re:The gaps are still there. by Anonymous Coward · · Score: 0

      To eliminate latency performance penalty, we have out-of-order cores.

    9. Re:The gaps are still there. by Robear · · Score: 1

      Right, and bandwidth is more important for most workloads. Gotta move more bits around in one go if you've got (relatively) more latency to deal with in each generation, otherwise the newer chips just wait for i/o faster than the older ones did...

      --
      French - The lingua franca of Europe!
  23. Re:As with most "X OR [almost all NOT X]" question by Hognoxious · · Score: 1

    It's unlikely (but still possible) that it's staying the same.

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  24. A gap not normally considered by VernonNemitz · · Score: 1

    Each memory address is normally associated with 8 bits of data (not counting correction bits). But processors nowadays routinely consume 64 bits at a time. That means getting the data from 8 different addresses simultaneously. Things would be simpler if they put all those 64 bits at one address --if every single address had 64 bits of data associated with it. In the previous processor generation, gobbling 32 bits at a time meant accessing 4 different addresses simultaneously, and the total accessible address space of the processor was essentially 30 bits instead of 32 bits --while you were allowed to access 4 addresses starting at Address Zero, you were not allowed to access 4 addresses starting at Address One or Address Two or Address Three. It could have been allowed if every address had had 32 data-bits associated with it. With 64-bit processors today needing to access 8 addresses at a time, the total effective address space is 61 bits instead of 64 bits (still a huge number, I know). Anyway, my main reason for writing this is, wouldn't memory run slightly faster if it didn't have to access all the data from 8 addresses simultaneously, but instead just got 64 bits, nicely parallel from any one address?

    1. Re:A gap not normally considered by crunchy_one · · Score: 1

      Good idea. So good, in fact, you're getting close to how it's actually done: data is moved in parallel in bulk. For example, when your program accesses an 8-bit byte, the 256-bit (or larger) chunk (called a cache line containing it gets read from DRAM into cache. There is no address space sacrifice because once the cache line is read, additional logic selects the desired byte from the cache line using the low-order bits of the address.

    2. Re:A gap not normally considered by Mr.CRC · · Score: 1

      Memory busses already transfer more than one octet at a time, in fact, more than the 64-bit architecture size for typical implementations of x64. Having the effective address space for 64-bit words be 61 bits isn't really much of a problem. Who is going to have nearly 2.15GiGiB of memory (attached to one CPU) any time in the next century?

      I program the TI C2000 series 32-bit microcontrollers, where the 16-bit word size can be a significant headache when trying to deal with 8-bit IO streams.

      So I'd opt to keep the 8-bit memory addressing granularity, or else what are you going to do with all the 8, 16, and 32 bit data? Are you going to store 8-bit data with each byte in one 64-bit memory location? That's obviously silly. Without 8-bit addressability, to get an 8-bit stream to pack each memory cell, you have to do a bunch of algorithmic work (shifting each octet into place in a 64-bit register, making sure things line up where they belong, etc.), whereas with 8-bit addressability, this isn't so difficult.

  25. Shed some light by transfire · · Score: 2

    This article can shed some light on it: http://www.dba-oracle.com/t_hi... Looks like RAM is the laggard.

  26. Calculating is easy by guruevi · · Score: 3, Informative

    You can look up the specs easy.

    Back in the 80286 days, there was not even an L1 cache however the memory and ISA bus ran at CPU speed 8-20Mhz. Hard drive latency was ~65ms.

    In the 80486 days L1 cache was introduced and L2 was sometimes available in (very) expensive modules. I remember buying 256kb for the same price as 16MB RAM. The L1 caches ran (if I remember correctly) at CPU speed, 1 cycle. However the bus speed started to slow down compared to the CPU. The VLB ran at CPU bus speed ("local" bus) and was often used for graphics but PCI (an inferior bus) ran at 33MHz so for anything over 33MHz, we started needing dividers. The RAM ran at 80-120ns so it started being slower than the CPU bus. Hard drive speeds were however up to ~30ms.

    In the Pentium age memory slowed even farther compared to the CPU bus. Now it took several cycles to access memory, buses ran even slower (still PCI mostly, eventually PCI-X (133MHz?) until PCI-e (serial buses running) came along. Hard drive speeds went up to ~15ms

    In modern age, L1 caches have slowed even further requiring 4 cycles for L1 cache and up to 30 for L3 caches. RAM is even slower access with bus speeds about a quarter of a single CPU but sometimes 16 CPU's need to share those lanes. Peripheral bus speeds however have gone up and PCIe 3.0 is now directly integrated into CPU 80486 VLB-style. Hard drives have latencies of 10ms (we have a mechanical issue there) still but even cheap SSD's can go down to ~1-2ms.

    --
    Custom electronics and digital signage for your business: www.evcircuits.com
  27. SIZE MATTERS! by Anonymous Coward · · Score: 0

    Datasize has increased in all things. It's 1 thing you can't really offset as it's reality. It gets slower. Data grows. Even in code you use initially declaring large values (double onwards etc.) & going from 8->16->32->64 bit in programs you use (this is a hand-optimization I often do, say vs. std. widestring down to shortstr when possible, from integer to shortint for example, it helps on disksize too, not only memory potential consumption) + they get larger on disk too in program doing it. Pickup times from mechanical disk increase (I offset it with compressed executables lessening disksize area used) - not so bad with SSD or on live ramdrive storage (software or hardware). It offsets disk access to load/store cycles, especially if compressed data files on filesystems are used for data & what I do in exe compacting for programs themselves above). Lots of little hairs that addup to a moustache in mechanical diskbound world (largely storage).

    ---

    Elevator algorithms on harddrives help offset it. Good move in hardware for that but still, see above.

    Bufferbloat congestion was unexpected iirc as well online as another factor to mitigate that 'backfires'.

    APK

    P.S.=> The "infamous they" say "Size matters" - & in this case, YES, it does for hdd access which is still the most prevalently used, & striping/spanning disks, shortstroking them, or caching of all forms notwithstanding (not knocking the latter UNLESS aging problems arise, causing stale data & there are those, see above), on hdd's, even @ diskfarms for storage for huge commercial sites? No questions asked it does... apk

  28. Self-edit/add: Forgot to note indexing... apk by Anonymous Coward · · Score: 0

    " The "infamous they" say "Size matters" - & in this case, YES, it does for hdd access which is still the most prevalently used, & striping/spanning disks, shortstroking them, or caching of all forms notwithstanding (not knocking the latter UNLESS aging problems arise, causing stale data & there are those, see above), on hdd's, + indexing in databases vs. HUGE files,

    See subject: I forgot to note it as it helps vs. large diskmass for internal parsing of records too.

    APK

    P.S.=> Nitpicking myself today... apk

  29. 2nd self-edit/add: Forgot to note defrag... apk by Anonymous Coward · · Score: 0

    " The "infamous they" say "Size matters" - & in this case, YES, it does for hdd access which is still the most prevalently used, & striping/spanning disks, shortstroking them, or caching of all forms notwithstanding (not knocking the latter UNLESS aging problems arise, causing stale data & there are those, see above), on hdd's, + indexing in databases vs. HUGE files PLUS COMPACTING DATABASES (removing blank bloat spots containing NO DATA) even @ diskfarms for storage for huge commercial sites? No questions asked it does... apk" - ME

    See subject: I forgot to note it as it helps vs. large diskmass for internal parsing of records too.

    APK

    P.S.=> Nitpicking myself today 2nd TIME again (had to, leaving no stones unturned) - since WHEN YOU SPEEDUP THE SLOWEST PART, the entire system speeds up - datasize REALLY affects HDD's, but when you come right down to it? It affects even in-memory operations as well, especially in RAM limits are hit & paging, yes FROM RAM to disk, begins... apk

  30. 3rd self-edit/add: Forgot dbcompact+dedup... apk by Anonymous Coward · · Score: 0

    " The "infamous they" say "Size matters" - & in this case, YES, it does for hdd access which is still the most prevalently used, & striping/spanning disks, shortstroking them, or caching of all forms notwithstanding (not knocking the latter UNLESS aging problems arise, causing stale data & there are those, see above), on hdd's, + indexing in databases vs. HUGE files PLUS COMPACTING DATABASES (removing blank bloat spots containing NO DATA) as well as DEDUPLICATION/NORMALIZATION on disk and in db's both, AND HDD defragmenting to offset excessive headswing movement instead picking up files as a single piece in 1 pass even @ diskfarms for storage for huge commercial sites? No questions asked it does... apk" - ME

    See subject: I forgot to note it as it helps vs. large diskmass for internal parsing of records too.

    APK

    P.S.=> Nitpicking myself today 3rd TIME again (had to, leaving no stones unturned) - since WHEN YOU SPEEDUP THE SLOWEST PART, the entire system speeds up - datasize REALLY affects HDD's + internal file parsings (not as bad due to index seeks), but when you come right down to it? It affects even in-memory operations as well, especially in RAM limits are hit & paging, yes FROM RAM to disk, begins... apk

  31. assuming you mean SSDs, this exists already by Chirs · · Score: 1
  32. Don't forget filesystem efficiency! by Anonymous Coward · · Score: 0

    Not only is the underlying physical technology getting better, but the software (aka filesystem) utilizing that hardware is also becoming more efficient. The likes of ZFS and ext4 are far better than predecessors (UFS or ext2/3). No troll-o, but I think NTFS and FATx are static in performance across hardware revisions.

    1. Re:Don't forget filesystem efficiency! by mnslinky · · Score: 1

      Not only is the underlying physical technology getting better, but the software (aka filesystem) utilizing that hardware is also becoming more efficient. The likes of ZFS and ext4 are far better than predecessors (UFS or ext2/3). No troll-o, but I think NTFS and FATx are static in performance across hardware revisions.

      Gah, forgot to log in.

  33. Wait till filesystems for ramdisks/ramdrives by Anonymous Coward · · Score: 0

    See subject: Gets better minus geometries used for filesystems for mechanical disks & the structures it demands ramdrives/ramdisks do NOT need - this redesign will lend to far smaller overheads on the logical filesystems end of things & make them even FASTER...

    * That's coming soon...

    (So what's already "the street dominator" in diskspeeds is about to get another jump in its lead beyond access/seek advantages it has now...)

    APK

    P.S.=> It's pretty exciting knowing it's going to happen & boost the slowest part of a system (SSD, fast as it is over HDD, is still the laggard)... apk