Slashdot Mirror


Hard Drive Capacity Confusion, Lucidly Explained

mrklin writes "James Wiebe of wiebetech.com has written a clear example of how hard drive capacity is calculated (PDF file) by hard drive manufacturers (base 10) and OS (base 2). He failed to name how the capacity should be described, though."

13 of 482 comments (clear)

  1. Big whoop by attemptedgoalie · · Score: 3, Interesting

    In the grand scheme of things, drive capacity issues seem to revolve around lawyers more than consumers.

    I wish that the major manufacturers would stop putting 1 BIG drive in the system, and put 2 normal sized ones in and MIRRORED.

    As somebody who gets blasted by customers when they failed to do their backup, an out of the box, pre mirrored system would be far better for the consumer than properly labelling those lost 200 MB.

    Sorry, that's my partially related rant for this evening.

    --
    My mom says I'm cool.
  2. In light of common sense by dnotj · · Score: 2, Interesting

    Why are we seeing another article on this very issue?

    Everyone understands HD manufacture's measuring systems. Failing that, we could just have billy fix up windows to overstate drive capacity to all windows users and they would never know any better.

    --
    No more Micro$oft bashing from me. Its like bashing at the special olympics.
  3. Strange by autopr0n · · Score: 3, Interesting

    I think it's a little odd that he claimed that Hard drive makers have "Always" done this. I very specifically remember advertisements for hard drives being "One Billion Bytes" (with like a 14 point small print letting us know that it was indeed 1000000000 bytes). After that "billion bytes" became gigabytes and the font became smaller.

    I've also heard that for some drive makers "gigabyte" means 1^20*10^3 (i.e. one thousand megabytes) and things like that.

    --
    autopr0n is like, down and stuff.
  4. Re:Base 2 by EvanED · · Score: 4, Interesting

    Ah, and therein lay the crux of the matter. The problem is that *everywhere else* kilo-, mega-, etc. prefix units (to stop the megapolis argument) they denote powers of 10. A megavolt is a million volts. A kilometer is 1000 meters. A gigahertz is a billion hertz. Only in computer science have people redefined the units to refer to anything other than powers of 10. *That* is what the debate revolves around, and that is what is IMO the mistake of people early on. The solution is to make kilobytes officially be 1000 bytes (as the IEC has) and use a different unit for the powers of two.

  5. Re:Does it matter anymore? by |deity| · · Score: 5, Interesting
    Even the article states that you are losing 10% of the capacity you would expect. I think 10% is significant enough to complain about.

    The author at one point in the article says that operating systems have historically not documented how size is counted. Like the engineers at a drive manufacturing company aren't smart enough to know that if you calculate a kilobyte in base 2 you are going to calculate a megabyte, or gigabyte in base 2.

    Yes if you are smarter then your average computer user, which is to say smarter then a really dumb rock you should know that what's reported on a drive is not the actuall size.

    It still hacks me off. It's like a soda manufacturer deciding it's ok to redefine an ounce so that they can claim that their drink is larger then it is or just use a smaller container and claim it's still the same size.

    Does it matter, yes and it will matter more as storage capacity increases.

    If you use a computer it does all calculations in binary, it only makes sense for the capacity of the drive to be calculated in binary.

    --
    Environmentalists are their own worst enemy. ~tricklenews.com
  6. Re:Does it matter anymore? by OverlordQ · · Score: 4, Interesting

    Like the engineers at a drive manufacturing company aren't smart enough to know that if you calculate a kilobyte in base 2 you are going to calculate a megabyte, or gigabyte in base 2.

    That's where the standard agrument fails, because mega, kilo, giga, terra, et al are base 10 prefixes not base 2.

    --
    Your hair look like poop, Bob! - Wanker.
  7. Re:So how do you say it? by Molina+the+Bofh · · Score: 2, Interesting
    A more complicated question would be how to correctly pronounce ordinal numbers in any base. For instance, try to correctly read the following sentences:

    I need to clear the 101th bit of that byte.

    1 4m l33t. 1 4m d4 b0mb. 1 hax0red da 0xF49B5Cth byte 0f dat file.

    0o1232 the number of the beast. (a music by Iron Maden)

    Good morning, kids. Please open your history books in the CDXXXVIIth page.

    --

    -
    Roses are #FF0000, Violets are #0000FF, find / -name '*base*' |xargs chown -R us && mv zig greatjustice
  8. Re:Does it matter anymore? by Bi()hazard · · Score: 4, Interesting

    This is a big issue for those who use RAID arrays based on intercahngeable hard drives. This is a common practice among large corporations, and drive manufacturers' nonstandard descriptions of sizes make it very difficult to mix manufacturers within an array.

    Buying from company A gives you 120GB=120 billion bytes, and buying from B gives you 120GB=128,762,169,664 bytes. If we have an array of 10 disks at the larger size and swap one out for the smaller size, the disks cannot be treated as interchangeable anymore, and the array loses much of its efficiency, or is forced to waste the extra space on the larger drives.

    The bottom line is that this costs money. Companies are locked into using one supplier and must pass up opportunities for good deals. The lack of flexibility and occasional screw ups by interns who don't check which drive is which uses up the IT department's time.

    Nobody really cares whether a GB is 1 billion or a funny number that comes from base 2, but a lot of people with a lot of money care whether 1 GB from company A equals 1 GB from company B. One of these days the industry will have to standardize.

    It's just as bad as monitor sizes-they measure those at funny angles and have different sized black margins around the viewable area. Just a couple months ago a manager here ordered a new 19 inch monitor and was so annoyed by the margins that he sent it back to be replaced. We gave him an old, lower quality monitor with the settings adjusted to minimize the margin. Some guy in IT took the new one home with him, and wrote it off as trashed defective equipment.

  9. Re:Does it matter anymore? by Anonymous Coward · · Score: 1, Interesting

    "If you use a computer it does all calculations in binary, it only makes sense for the capacity of the drive to be calculated in binary. "

    No, it does not. Eg. How many bytes are there in kilobyte? Anwer: 1024 not 1000. Thus we should not use the prefix 'kilo' as 2^10 because it is defined as 10^3. The SI-system works fine, we should not let people who still measures stuff with someones thumbs and feet ruin it.

  10. Re:Base 2 by vrt3 · · Score: 2, Interesting

    Every time this issue comes up, and somebody proposes to abandon the use of K = 1024, people react saying that would be bad, because K = 1000 doesn't fit with power-of-2-based storage in computers.

    That may very well be, but that's not the point. We can still use binary prefixes. But it doesn't make any sense to use the same prefix with different meanings. It makes perfect sense to use different prefixes, and to use different symbols for them. That way there is no confusion.

    I never understood this really. CS is, of all fields, a field where it is important to be unambiguous. One byte wrong, or even one bit, and the computer doesn't understand it, or misunderstands it. Yet where it comes to defining storage units, we hijack the established 'kilo' and make it mean 1024 instead of 1000. Not even always: 1kbps is never 1024 bits per second, always 1000 bits per second.

    I say, where it makes sense to use binary prefixes, let's do it. And let's be clear about it. The current 'Look ma, I'm using binary prefixes but I made them look exactly like the usual decimal prefixes so I can create confusion and ./ religious wars' should be stopped. And I think it will: ifconfig, for example, already reports bytes transmitted/received as GiB, MiB, ...

    --
    This sig under construction. Please check back later.
  11. KiB, MiB, GiB by SLi · · Score: 2, Interesting

    Somehow I get the feeling that it's mostly Americans who refuse to accept that kilo=10^3, mega=10^6, giga=10^9. (Please read on before moderating as troll/fb.)

    I guess that's because you aren't used to kilo, mega and giga, except to the (incorrect) power-of-2 definitions. To someone who lives pretty much anywhere else in the world (ie. where metric units are used), kilo has always been 10^3 and mega has always been 10^6. Well, except in most fields of CS (but not telecommunications or HDD capacity).

    What's happening is that several different fields of science are slowly starting to overlap, and suddenly there's real confusion: for someone, kilo=10^3, for someone else it's 10^3 EXCEPT in some cases it's 2^10.

    This source of confusion should be fixed now when it's still possible. It may seem to this audience that Computer Science == Life (and most of you probably don't need to think about data in terms of telecommunications) and therefore you think kilo=2^10 is standard, but for a huge majority of people it simply is not so.

    Kilo has always been 1000 and will always be 1000. It's us the computer people who have made a mess of it, and we're also responsible for cleaning it up.

    1. Re:KiB, MiB, GiB by MikeBabcock · · Score: 3, Interesting

      I have to agree here. I've been using computers since the early 80's and the "kilo" or "mega" notation was well understood then to be an approximation, at least in my circles, to their decimal prefix equivalents.

      "A kilobyte in a computer is 1024 bytes only because in base-2 it is simpler to count in 1024's than in 1000's"

      That said, and everyone learned that back when people had to learn about computers (instead of growing up with them), this approximation is *still* just an approximation.

      Just because you grew up thinking a kilo meant 1024 because you're in a non-metric country doesn't mean a kilo means 1024. It means your predecesors didn't bother using a different name for a different number (back when "the world will never need more than maybe 10 computers").

      Mebi is available now ... use it; point out to the rest of the world that MB is inaccurate and should mean 1000*1000 bytes, that MiB in fact *means* 1024*1024 bytes and this will solve our confusions within a generation.

      --
      - Michael T. Babcock (Yes, I blog)
  12. Re:Does it matter anymore? by sketerpot · · Score: 2, Interesting
    They both need to switch, as do all of us. The powers-of-two prefixes are practical for computers, and they are what OSes use (although yes, they should use the proper prefixes). If the hard drives use decimal prefixes, that's misleading and not optimal: there are the people used to the nonstandard prefixes who will think that the HD has more space than it really does, and there are the people who want the HD space specified in the units they're used to using with the OS. What I'm trying to say is, even if the HD manufacturers are in the right with their prefixes (and I know they are), they should still switch to the standard binary prefixes.

    Mmmm, standards compliance....