Slashdot Mirror


To Keep Track of World's Data, You'll Need More Than a Yottabyte (wsj.com)

An anonymous reader shares a report: In 10 or 15 years, Dr. Brown, who is head of metrology at the National Physical Laboratory in the U.K., anticipates the amount of computerized data worldwide will exceed 1 yottabyte in size, and without expanding the list of prefixes, there will be no way to talk about the next great chunk of numbers. Even worse, dilettantes could fill the void by popularizing glib prefixes such as bronto or hella -- terms that have already won fans. Without professional intervention, Dr. Brown fears, the next numerical prefix could become the Boaty McBoatface of weights and measures.

[...] For the record, there is an argument to be made for adopting a prefix like bronto: giga and tera are based on the Greek words for "giant" and "monstrous." Why not make bronto, named for the brontosaurus, official, perhaps along with tyranno, stego, colosso or even yeti? Dr. Brown is sympathetic to the argument but unconvinced. Instead, he proposes four prefixes that adhere to recent naming conventions [Editor's note: the link may be paywalled; an alternative source was not available.]: ronna and quecca for octillion (27 zeros) and nonillion (30 zeros), along with ronto and quecto for their fractional counterparts, octillionth and nonillionth. Like the latest sanctioned prefixes, Dr. Brown's proposals are loosely related to Latin and Greek words for numbers (in this case, nine and 10). And like most of the prefixes, his suggestions end in "a" or "o." But the process of expanding, or even amending, the official measurements is lengthy.

81 comments

  1. JiggaByte by Anonymous Coward · · Score: 0

    JiggaBytes

    1. Re:JiggaByte by Tablizer · · Score: 1

      A Future, Back To The, reference this is, no?
        - Yotta

    2. Re:JiggaByte by Anonymous Coward · · Score: 0

      Begun, the reference wars have.

      Run, you fools.

    3. Re: JiggaByte by Anonymous Coward · · Score: 0

      It's "Fly you fools."

      You're welcome.

    4. Re: JiggaByte by Tablizer · · Score: 1

      It's "Fly you fools."

      I'm a wingless being, you insensitive clod!

    5. Re: JiggaByte by Anonymous Coward · · Score: 0

      As god as my witness, I thought turkeys could fly!

    6. Re: JiggaByte by Tablizer · · Score: 1
    7. Re:JiggaByte by Anonymous Coward · · Score: 0

      Lottafaginabyte.

    8. Re:JiggaByte by Anonymous Coward · · Score: 0

      A Future, Back To The, reference this is, no?

        - Yotta

      Well well well... let us see how many Yottabytes we need, after the big tech firms, takes down all of the porn (to save space) :-)

      Even if the internet does not have any weight, there is still only so much that can be contained in that little black box, know as the internet

  2. Somewhat arbitrary what we call data by ganv · · Score: 2

    It has always seemed a bit arbitrary to label something as "the world's data". You could always add the history of every cache on every processor on the planet to your definition of "data" and have a much larger number.

    1. Re:Somewhat arbitrary what we call data by Tablizer · · Score: 0

      World? Instead of "Libraries of Congress" as the reigning standard unit of Big Info, it's now "Putin's Harddrives".

    2. Re:Somewhat arbitrary what we call data by ShanghaiBill · · Score: 4, Funny

      It has always seemed a bit arbitrary to label something as "the world's data".

      A yottabyte is 1e24. That is more than 100 terabytes per human.

      You could always add the history of every cache on every processor on the planet to your definition of "data" and have a much larger number.

      640 yottabytes ought to be enough for anyone.

    3. Re:Somewhat arbitrary what we call data by apoc.famine · · Score: 2

      I know, right? I've been backing up /dev/random for years now, and I'm not sure when I'll be done. I think part of the problem is running the checksum, but I'm not sure.

      --
      Velociraptor = Distiraptor / Timeraptor
    4. Re:Somewhat arbitrary what we call data by ganv · · Score: 1

      That is hilarious. Please mod up. It highlights much better than my first comment the insanity of trying to quantify the "worlds data".

  3. Bigger by Anonymous Coward · · Score: 0

    Buttloada and Assloada

    1. Re: Bigger by Anonymous Coward · · Score: 0

      I for one welcome the Boaty McBoatface Byte!

  4. Whats than in terms of by AHuxley · · Score: 1

    NSA and GCHQ spending per year?

    --
    Domestic spying is now "Benign Information Gathering"
    1. Re:Whats than in terms of by Anonymous Coward · · Score: 0

      $5 on your mom, each. $10 total. Balance outstanding, she was ok with it at the time, no further contact recommended.

  5. Yottabyte? by Anonymous Coward · · Score: 1

    Yottabyte? That's a lotta byte!

  6. Fake need? by Tablizer · · Score: 2

    Why make a new prefix for each power of ten unless (and until) it really is used often? Just make a generic term, such as "24th order of magnitude". In fact, I believe that's already used. We can even have a shorthand: "24 oom bytes". To remember it, think of a cow mooing in reverse.

    1. Re: Fake need? by Anonymous Coward · · Score: 0

      I likk yerr bot hole in da heeted pewl

    2. Re:Fake need? by viperidaenz · · Score: 4, Insightful

      They're not.
      they're making new prefixes for every third power of 10.

      If a consensus isn't reached relatively soon, the whole "billion" thing will happen again. it's been defined as both 10 to the 9th power and 12th power.

    3. Re:Fake need? by AHuxley · · Score: 1

      In terms of one standard NSA data storage facility?

      --
      Domestic spying is now "Benign Information Gathering"
    4. Re: Fake need? by Tablizer · · Score: 1

      I likk yerr bot hole in da heeted pewl

      Here ya go...

      Well, no heat, but wait until summer.

    5. Re:Fake need? by Anonymous Coward · · Score: 0

      Then 1,000 Yotas should be a Fuckton.

    6. Re:Fake need? by Anonymous Coward · · Score: 0

      Or prepend "kilo", "mega" etc. to yotta.

    7. Re:Fake need? by AmiMoJo · · Score: 0

      Wrong wrong wrong.

      A yottabyte is 2^80 bytes. It's always every 10th power of 2. Anyone who tells you otherwise is wrong.

      1000^8 is only 0.827 YB.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    8. Re:Fake need? by Anonymous Coward · · Score: 0

      A yottabyte is 2^80 bytes.

      No. A yottabyte is not 2^80 bytes, you fuckwit. A yottabyte is 10^24 bytes.

      2^80 bytes is called a yobibyte.

      https://en.wikipedia.org/wiki/Binary_prefix#yobi

  7. StuffShirtBytes by Tablizer · · Score: 1

    Without professional intervention, Dr. Brown fears, the next numerical prefix could become the Boaty McBoatface of weights and measures.

    What's wrong with that? The rejection of "Boaty McBoatface" was a stuffed-shirt reaction. Going with that name could have helped increase funding even via increased awareness.

    BoatyBytes, McFaceBytes, sounds fine with me. The existing names are already silly, or at least magnets for jokes.

    1. Re:StuffShirtBytes by Anonymous Coward · · Score: 0

      I suggest calling it "Phil"

    2. Re:StuffShirtBytes by Tablizer · · Score: 1

      Or "Trilo": TriloBytes.
       

    3. Re:StuffShirtBytes by Anonymous Coward · · Score: 0

      I think the humour potential should be a key selection criterion.

      1 vanilli = 2000 guys who can't sing

    4. Re: StuffShirtBytes by Anonymous Coward · · Score: 0

      As one of the many Phillipâ(TM)s in the world, and on behalf of all of us: Fuck You

  8. If there was only some scientific notation by Anonymous Coward · · Score: 0

    Some method of scientific notation that could be use?

    Nah nobody ever thought of that.

    Anyway I am not impressed until we are talking about Googol® bytes

  9. Horseshit by Anonymous Coward · · Score: 0

    and without expanding the list of prefixes, there will be no way to talk about the next great chunk of numbers

    Well, other than using powers of 1,024 (or powers of 1,000 for the pedantic types who are unfamiliar with base 2.)

    1. Re:Horseshit by Waffle+Iron · · Score: 3, Insightful

      Well, other than using powers of 1,024 (or powers of 1,000 for the pedantic types who are unfamiliar with base 2.)

      I'm familiar with base 2. So I know that hard drives typically allocate blocks of size 2^9 or 2^12, and I know that there is nothing else in a hard drive related to powers of two.

      Which means that insisting on using powers of 1024 notation is like demanding that we count everything related to the NBA in base 5, since basketball teams have 5 members.

      (Actually, power-of-1024 notation is even worse than that, since it uses a *mixture* of various mutually incompatible 1024 powers combined with decimal fractions, all of which makes Roman Numerals look practical by comparison.)

  10. Idiotic obsession with... by Anonymous Coward · · Score: 1, Insightful

    ... ancient languages.

    When we define the words and terms that are yet undefined, we can start fresh. We don't need to be chained to the past. And why not have numbers that sound cool to say that we can associate with things people know about it? This cult of ancient and dead languages is pretty disturbing. Since the naming convention is based on latin words for numbers is arbitrary in and of itself.

    1. Re:Idiotic obsession with... by taylormc · · Score: 1

      The advantage of using ancient languages - Greek in this case, BTW, not Latin - is that it allows a common vocabulary for use among speakers of many different mother tongues. Just as "petabyte" is founded on Greek "pente" ("five"), "yottabyte" is founded on the Greek "okto" ("eight"); so the next iteration would most usefully be founded on Greek "ennea" ("nine").

  11. Useless data by Anonymous Coward · · Score: 0

    A third of the population is comprised of morons who are objectively ill suited to life in a modern society. Whatever data they produce or is gathered on them is useless.

  12. May I suggest... by Major_Disorder · · Score: 1

    The Hellabyte.
    In honour of the great profit... Eric Cartman.

    --
    First law of people: People are generally stupid.
    1. Re:May I suggest... by Anonymous Coward · · Score: 0

      You realize this stupid suggestion is specifically mentioned in TFS, right?

    2. Re:May I suggest... by Anonymous Coward · · Score: 0

      > In honour of the great profit...

      You loose!

    3. Re:May I suggest... by Major_Disorder · · Score: 1

      > In honour of the great profit...

      You loose!

      Unless I did it just to mess with people.

      --
      First law of people: People are generally stupid.
    4. Re:May I suggest... by Anonymous Coward · · Score: 0

      If we're making it up anyway, I say we should be talking about bits... insert joke for the hellabit.

  13. Weighing a planet, one milligram at a time. by geekmux · · Score: 1

    There's a reason we created terms like "ton" to describe considerable weight. Childrens electronic toys can hold multiple Libraries of Congress these days, so let's stop pretending that "mega-ultra-giga-bazillion" is going to impress anyone.

    Hell, if we're gonna get stupid about this, then why not measure each individual bit? I'm sure Mathy McMathface can get piss drunk on new number names with an 8x power factor.

    Yes, there's a lot of data in the world. We get it. Now perhaps we can grow up and create a reasonable unit of measure.

  14. Biggest by Anonymous Coward · · Score: 0

    Muthafuckabyte

  15. Yodabyte by Anonymous Coward · · Score: 0

    Damn much storage my basement has!

  16. 1 yottabyte does not justify a new prefix by Anonymous Coward · · Score: 0

    We don't need a new prefix until we get to 1000 yottabytes, and that will take a little while :-)

    I like the suggested prefixes because they abbreviate nicely into R and Q for the big ones, and r and q for the small ones. These haven't been used yet.

    They messed up with the original prefixes, using kilo (abbreviated k) and milli (abbreviated m) for 1000, because that meant they couldn't use m for micro. We don't want to make that mistake again, do we? So that wipes out tyranno (clashes with tera) and hella (clashes with hecto).

  17. SI unit? by Gabest · · Score: 1

    Yottitatard? Yottard? Yottetard?

  18. Byte my shiny metal exponents by az-saguaro · · Score: 3, Interesting

    The names must apply to all forms of measures and metrics.
    But, if the Bureau of Geeks and Nerds has its say, the names will be:

    whata-byte
    abigga-byte
    onthisa-byte
    myassa-byte
    heybitchdont-byte

    On the serious side, the current system requires us to remember three names or prefixes for each triad (each 10^3).
    For example:
    one-million or one-millionth, versus one mega-meter versus one micro-meter. Million-mega-micro-.
    one-thousand or one-thousandth, versus one kilo-liter or one milli-liter. Thousand-kilo-milli-.
    For Europeans and others speaking Latin or Romance languages, the cardinal number names may be closer to the multiplier-divider prefixes, but it is still a cumbersome system.

    For the higher order new numbers, why not make them with a uniform naming convention.
    For instance, the common root name, then tillo- and tetto-.
    Examples:
    10^27 = one octillion trees, one octillo-meter, one octetto-meter.
    10^30 = one nonillion beans, one nonillo-newton, one nonetto-joule.
    10^39 = one dodecillion electrons, one dodecillo-farad, one dodecetto-ohm.

    Instead of having unique initials as abbreviations, such MB, mm, cm, km, Gb, etc., try this, using "D" for "decade":
    My new computer has 4 of 10^27 byte chips = 4-D27B of memory.
    The distance to so and so galaxy is one nonillion meters away, or D30m away.
    Or, something like that.

    It just seems too cumbersome to remember too many contrived names and disparate prefixes for ever bigger numbers that no one can really comprehend or has the time to recall in the middle of a sentence that is meant to be fluent.

    1. Re:Byte my shiny metal exponents by Anonymous Coward · · Score: 0

      I'm just hoping her breast size is 4-D44D.

    2. Re:Byte my shiny metal exponents by Anonymous Coward · · Score: 0

      They haven't reached a prefix for numbers so large that it's appropriate to designate the yomamma prefix yet?

    3. Re:Byte my shiny metal exponents by AmiMoJo · · Score: 1

      My new computer has 4 of 10^27 byte chips = 4-D27B of memory.

      Computer memory is always powers of 2 though. Everything is built around that, such as the way the MMU works, and changing to powers of 10 would create huge complexity in the circuits for no benefit.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    4. Re:Byte my shiny metal exponents by Anonymous Coward · · Score: 0

      Why not TGB ?

    5. Re:Byte my shiny metal exponents by mlheur · · Score: 2

      2^10 = 1KiB
      2^20 = 1MiB
      2^30 = 1GiB
      2^40 = 1TiB
      2^50 = 1PiB
      2^60 = 1EiB
      2^70 = 1ZiB
      2^80 = 1YiB

      now we add...

      2^90 = 1NiB (ninobyte)
      2^100 = 1DiB (decabyte)
      2^110 = 1LiB (levenbyte)
      2^120 = 1WiB (tWelvebyte)
      2^130 = 1BiB (because B looks like 13 in the right font/print)

      or just stop using prefixes and go full maths on it. e.g. "there are 3.250 x 2^98 bytes of storage"

  19. Garbage In, Garbage Out by Anonymous Coward · · Score: 0

    No data is better than big bad data... unless you're into fake news, in which case you can massage the big bad data any way you want to make anything seem true.

  20. "Shanghai" Bill is a known liar many times over by Anonymous Coward · · Score: 0

    Bill got caught lying 12-25 times repeatedly stating "Blood plasma is sterile" and then later that "The Chinese Govt does not directly censor Chinese citizens" and other absolute bullshit head-in-ass retard-level lies. You're not trustworthy.

    You are not a source of information that anyone should or even could trust, knowing your dishonest history. Sorry. That's what accountability means when you get caught lying repeatedly, over and over, even after directly corrected.

    You're a liar, Bill.

  21. Doctor Brown? Doctor EMMET Brown? by Anonymous Coward · · Score: 1

    1.21 jiggawatts, at 88 mph.

  22. Did I miss something? by Anonymous Coward · · Score: 0

    I thought that, after exabyte, SI basically went to using letters of the alphabet in reverse. They already have zettabyte and yottabyte, so xonobyte is up next?

    How about the proposal of "tooie" bytes to help increase the confusion? 10 "tooiebytes" is 2 to the 10th decimal power, 2e10, or 1 kilobyte. Bonus points for SI vs binary conflation on multiple fronts!

  23. Re:Doctor Brown? Doctor EMMET Brown? by antdude · · Score: 1

    Great Scott! This is heavy. :P

    --
    Ant(Dude) @ Quality Foraged Links (AQFL.net) & The Ant Farm (antfarm.ma.cx / antfarm.home.dhs.org).
  24. Some suggestions for the next level by Anonymous Coward · · Score: 0

    Here's my suggestions for the next as-yet-unnamed level after yottabyte

    fuckton-byte
    ds-byte (in honor of the death star bc it was pretty fuckin' big)
    and of course can't help but mention bytey mcbyteface

  25. Not bronto by Anonymous Coward · · Score: 0

    Bronto means thunder.

  26. Human Brains Solve This Problem by Anonymous Coward · · Score: 0

    By forgetting things that aren't important. Is where I ate 10 years ago on a Thursday in April really important or can they drop that row from their database? Part of big data is knowing when the data has become too big.

  27. And here I was by Anonymous Coward · · Score: 0

    I thought they'd Done The Right Thing defining the zetta before the yotta, so you could count down at need. You know, X..., W..., V..., etc. Funny how this "scientist" hasn't caught on to that.

  28. Wrong approach by Anonymous Coward · · Score: 0

    Using the same scale is wrong approach. Scale must be changed. Eg. a Zetta Byte (ZB) could become a unit. Then you have a kilo ZB. Mega ZB. Giga ZB, etc... Let's call them quads! Giga quads! :)

    Meanwhile... isn't energy required to store that much data (YB) enough to boil the worlds oceans a thousand times over?

    https://hbfs.wordpress.com/2009/02/10/to-boil-the-oceans/

  29. NSA by Anonymous Coward · · Score: 0

    NSA: "Hold my beer."

  30. But, he USES existing prefixes! by sabbede · · Score: 1
    "[extraneous nonsense that follows no convention] for octillion (27 zeros) and nonillion (30 zeros)"

    What's wrong with Octilabyte and Nonilabyte?

    Besides nothing.

    1. Re:But, he USES existing prefixes! by dkman · · Score: 1

      They make sense. That's what's wrong. If we don't over complicate the hell out of it then people might be able to understand it.

      Sad, but true.

      --
      I refuse to sign
  31. Doc Brown wants a new unit by Anonymous Coward · · Score: 0

    Hang on, Doc Brown wants a new term for something large? How about a "Great-Scottabyte"

  32. Should have spelled it by JudgeFurious · · Score: 1

    ...Yodabyte

    --
    Appended to the end of comments you post. 120 chars.
  33. How about ... by Anonymous Coward · · Score: 0

    BRONDO-byte?

  34. Thousand being 10^3 was the big mistake... by Anonymous Coward · · Score: 0

    As a kid I read books that used the english quantity of a billion, i.e. one billion == one million million. I realized then that the common usage for names for quantities was arbitrary and sucked.

    I felt a more rational system would be like this:

    1 - One (fine)
    10 - Ten (fine)
    100 - Hundred - Fine - we use a new word when you have X of X quantity
    1000 - Wrong - it should be ten hundreds
    10000 - Should be the real thousand - meaning one hundred hundreds. - Use a new word for X of X quantity
    100000 - Ten "real thousands"
    1000000 - Hundred "real thousands"
    10000000 - Ten hundred "real thousands"
    100000000 - Million Following the rule new word for X of X quantity - where X is 10000 * 10000
    100000000 100000000's would be the new word "billion"

    I know that looks weird, but it is way more logical and makes sense.

  35. Its the computers that are wrong by barius · · Score: 0

    We should just switch to base-10 computers, it might be easier than getting computer science majors to agree on a naming convention.

  36. I vote for ... by mark_reh · · Score: 1

    kaijubytes ... because they do!

  37. the Internet has already reached Yottabyte scale by Anonymous Coward · · Score: 0

    just goes to show the vast chasm between the classified high-side world, and the public open source low-side world. according to certain documents written by the NSA's Chief Research Scientist back in 2012, he projected the data from the Internet and all "sensors" would reach 1 Yottabyte by 2016. and his documents clearly expressed the NSA's intention to record all of it.

    Bluffdale operates at Yottabyte scale. 16 Yottabytes to be more precise.

  38. Here's my list by dcooper_db9 · · Score: 1
    I know I'm late to the discussion but I've been thinking about this for a long time. This combines a base number and a order of magnitude number. It can be translated backward into existing terms and the principle can be used for much larger numbers than I list here:
    • 1E+027: k^9: koennea (k^ennea)
    • 1E+030: k^10: kodeca (k^deca)
    • 1E+036: M^6: mohexa (m^hexa)
    • 1E+042: M^7: mohepta (m^hepta)
    • 1E+045: G^5: gopenta (g^penta)
    • 1E+048: M^8: mocto (m^octo)
    • 1E+054: M^9: moena (m^ennea)
    • 1E+060: M^10: modeca (m^deca)
    • 1E+063: G^7: gohepta (g^hepta)
    • 1E+072: G^8: gocto (g^octo)
    • 1E+081: G^9: goennea (g^ennea)
    • 1E+090: G^10: godeca (g^deca)
    --
    I do not block ads. I do block third party scripts.