Hard Drive Capacity Confusion, Lucidly Explained
mrklin writes "James Wiebe of wiebetech.com has written a clear example of how hard drive capacity is calculated (PDF file) by hard drive manufacturers (base 10) and OS (base 2). He failed to name how the capacity should be described, though."
With storage prices falling through the floor, does it matter to anyone except whiny nerds whether the byte counts are done in base 10 or base 2?
In the words of William Shatner, "Get a life!"
Our computers are binary, so the hard drives that we put in them should be measured using the binary (Base-2) representation.
Build me a microprocessor containing transistors that switch between 10 different voltage levels before you continue with your Base-10 tomfoolery.
Share Files To Large To E-mail
As far as ordinary users (i.e. anyone who doesn't have to deal with TLBs, memory pages, disk sectors and the like) are concerned, there's really no reason left to use binary units; 2^9 bytes per sector, 8 sectors per filesystem block, etc. are all low-level conveniences that the user shouldn't have to even notice. Though I personally am too used to the binary units to switch easily, the vast majority of users probably wouldn't even notice the difference, aside from their computers finally reporting the right size for their hard disks. Granted, overcoming the huge momentum for binary units will be difficult, but one could always consider it practice for getting the USA to accept metric.
This is not a matter of base-10 vs base-2... a base-10 number is written as "2875" for example. A base-2 number is written as "10100110". A base-16 number is written as "8A3F0"...
This is a matter of UNITS used - like inches vs. feet, or in this case GiB vs GB.
Geez, get the terminiology right...
About two years ago there was a debate about this. Can't remember the details of that debate. Maybe it was when those "mebibytes" were introduced. I still say now what I said then.
I think there should be "short megabytes" and "long megabytes", and the same for gigabytes. Like this:
Then all we need is to get hard drive manufacturers and OS vendors to state whether they are using short or long tons, er, gigabytes.
As to abbreviations, take Donald Knuth's suggestion. Use the capital letter twice to suggest binaryness. 1 MMB = one long megabyte; 1 GGB = one long gigabyte. I like this much better than the now-standardized MiB men-in-black abbreviation for long megabytes (which are still not called long megabytes in the standard, they are called mebibytes, which sounds silly and no one uses it).
Who's with me?
Sunlit World Scheme. Weird and different.
Those are too hard to pronounce. Who not just distinguish them by prefixing the metric ones with the word "metric", as we do with tons and metric tons.
kilobyte = 1024 bytes
metric kilobyte = 1000 bytes
Oh man, that just brought back memories. A bunch of geeks sitting at Round Table pizza for a BBS party all trying to get the highest in base 2 decimals. 512, 1024, 2048, 4096, 8192, etc. all shouting to be heard.
No wonder you'd never see a woman at those parties, must have scared them off. of course, nowadays, you see women geeks much more often, thank God.
1f u c4n r34d th1s u r34lly n33d t0 g37 l41d Capitalization really works: i helped my uncle jack off a horse
The only relevant issue is the meaning of words like kilobyte, megabyte, and gigabyte. Wiebe describes how you can arrive at two different answers for drive capacity depending on how you define the word "gigabyte," but does so completely uncritically. For example, he describes the drive manufacturer logic and writes that "the drive's claim of 123.5 GB is verified with this simple mathematical formula." But the issue is what the word "gigabyte" means, and the formula presented sheds no light on the word's conventional usage or etymology. I personally was raised to use these terms to correspond the numbers that are powers of two. Wiebe doesn't give me any point of reference to shed light on whether it's reasonable to use the meanings drive manufacturers do. (Of course I already know the answer, but that's beside the point.)
Wiebe uses some other odd logic, exemplified in point 3.7. He writes that the consumer was never cheated, because a drive advertised as having a capacity of 123.5GB had just that in "decimal based" capacity. This is a bizarre way to characterize the complaints. Consumers who believe they were cheated aren't claiming they didn't get 123.5GB for any definition of the word gigabyte. They're claiming they didn't get 123.5GB by the conventional definition of the word as commonly used in connection with computers. In my view, they're right, although I don't personally get too upset about it.
Oh, ye - so you want:
1 kg = 1024 g
1 metric kg = 1000 g
1 km = 1024 m
1 metric km = 1000 m
Thanks, but no thanks.
Hey Jimmy, assuming you're using FAT32 as your XP filesystem, which uses 73.8 MB of space for every gigabyte, not just 73.8 megs one time, that adds up to roughly 8,856MB of space used for the filesystem. Which on a labeled 123.5 GB drive, leaves you with roughly 115GB of space! Wow! The HD manufacturers were right!
The OS *do* use a negligible amount of drive space in these days with 100+ GB hard drives. And you're confusing file systems with operating systems. Just because an OS allow you to use a file system that waste resources, doesn't mean the OS itself use a lot of drive space.
Beware: In C++, your friends can see your privates!
...he ignores the fact that HD manufacturers are happy using bytes which are 8 bits, all the while flaunting the established convention that MB/GB refers to binary megabytes and binary gigabytes. Why don't they specify the size of their HDs in bits?
But personally I strongly reject this "kibibytes" attempt at CS revisionist history. Stick with what CS people have been using as measurements for decades, I say,
Why shouldn't CS people stick to what the rest of the sciences have been using for decades, that "kilo" means 1000? This CS thing of making "kilo" stand for 1024 is an attempt at revisionist history.
There's always another perspective.
No, because "kilo" is, in fact, a metric prefix. So a simple kilobyte should have its standard meaning as the SI unit prefix implies. You might however call the other one "non-metric," "binary," or "bastard" :-)... problem is, no one will use such terms. It is understood as an unwritten rule that anything suffixed with "byte" implies that the prefixes "kilo," "mega" etc. refer to 2^10, 2^20 etc. factors. Even more interesting, I suppose the "general public" doesn't even know how much a gigabyte is anyway, you might as well call it a hogshead - the simple rule is that 20 Quux is less than 40 Quux, whatever the unit may be... so, since people will only compare the size of their hard drive (please, spare the obvious jokes here...) to that of other hard drives (and not memory or whatever), it would be good enough to ensure that at least for a given type of storage medium, all manufacturers calculate their unit prefixes the same way.
I would think that any greater change (like writing MiB or MMB vs. MB) will only create more confusion. I just remember when, a couple of years back, German computer manufacturers were forced to specify things like floppy disk sizes and screen diagonals not only in inches, but in centimeters - ever tried to buy an 8.3 cm floppy? That just doesn't work. The computer business just has its own weird set of units, but in fact no one really cares (except for maybe some nit-pickers going for the law suit), and a change of prefixing would, while being scientifically correct, not serve any good purpose. (Before you say "then we Americans can continue using pounds and miles too!" - that's a totally different question in my opinion that bears issues of "compatibility" and "ease of use" etc. etc.)
Or we could just beat the hard-disk manufacturers with a stick until they understand that most people expect 1 kilobyte to be 1024 bytes :P
You are out of touch. If you conducted a scientific survey of 100 random adults who own PCs and asked them:
"How many bytes are in a kilobyte?" you really think that more than 50 would answer "1024"?
I'd be surprised if more than 10 did, personally.
100% of the non-geek population equates kilo with base 10, not base 2.
This issue reminds me of a practice used in another industry. The auto industry commonly reports horsepower and torque for their cars as measured at the engine's crank/flywheel vs at the wheels. While the measurements themselves are an accurate reflection of an engine's general performance alone you typically do not just buy an engine, you buy a system which is the car. When the engine's performance in measured within the context of the car--meaning at the wheels--then the truth is revealed. That revelation shows, on average, a loss of 10-20% when power is measured at the wheels vs the crank. Which spec do you think a manufacturer is going to release?