Slashdot Mirror


To Keep Track of World's Data, You'll Need More Than a Yottabyte (wsj.com)

An anonymous reader shares a report: In 10 or 15 years, Dr. Brown, who is head of metrology at the National Physical Laboratory in the U.K., anticipates the amount of computerized data worldwide will exceed 1 yottabyte in size, and without expanding the list of prefixes, there will be no way to talk about the next great chunk of numbers. Even worse, dilettantes could fill the void by popularizing glib prefixes such as bronto or hella -- terms that have already won fans. Without professional intervention, Dr. Brown fears, the next numerical prefix could become the Boaty McBoatface of weights and measures.

[...] For the record, there is an argument to be made for adopting a prefix like bronto: giga and tera are based on the Greek words for "giant" and "monstrous." Why not make bronto, named for the brontosaurus, official, perhaps along with tyranno, stego, colosso or even yeti? Dr. Brown is sympathetic to the argument but unconvinced. Instead, he proposes four prefixes that adhere to recent naming conventions [Editor's note: the link may be paywalled; an alternative source was not available.]: ronna and quecca for octillion (27 zeros) and nonillion (30 zeros), along with ronto and quecto for their fractional counterparts, octillionth and nonillionth. Like the latest sanctioned prefixes, Dr. Brown's proposals are loosely related to Latin and Greek words for numbers (in this case, nine and 10). And like most of the prefixes, his suggestions end in "a" or "o." But the process of expanding, or even amending, the official measurements is lengthy.

32 of 81 comments (clear)

  1. Somewhat arbitrary what we call data by ganv · · Score: 2

    It has always seemed a bit arbitrary to label something as "the world's data". You could always add the history of every cache on every processor on the planet to your definition of "data" and have a much larger number.

    1. Re:Somewhat arbitrary what we call data by ShanghaiBill · · Score: 4, Funny

      It has always seemed a bit arbitrary to label something as "the world's data".

      A yottabyte is 1e24. That is more than 100 terabytes per human.

      You could always add the history of every cache on every processor on the planet to your definition of "data" and have a much larger number.

      640 yottabytes ought to be enough for anyone.

    2. Re:Somewhat arbitrary what we call data by apoc.famine · · Score: 2

      I know, right? I've been backing up /dev/random for years now, and I'm not sure when I'll be done. I think part of the problem is running the checksum, but I'm not sure.

      --
      Velociraptor = Distiraptor / Timeraptor
    3. Re:Somewhat arbitrary what we call data by ganv · · Score: 1

      That is hilarious. Please mod up. It highlights much better than my first comment the insanity of trying to quantify the "worlds data".

  2. Re:JiggaByte by Tablizer · · Score: 1

    A Future, Back To The, reference this is, no?
      - Yotta

  3. Whats than in terms of by AHuxley · · Score: 1

    NSA and GCHQ spending per year?

    --
    Domestic spying is now "Benign Information Gathering"
  4. Yottabyte? by Anonymous Coward · · Score: 1

    Yottabyte? That's a lotta byte!

  5. Fake need? by Tablizer · · Score: 2

    Why make a new prefix for each power of ten unless (and until) it really is used often? Just make a generic term, such as "24th order of magnitude". In fact, I believe that's already used. We can even have a shorthand: "24 oom bytes". To remember it, think of a cow mooing in reverse.

    1. Re:Fake need? by viperidaenz · · Score: 4, Insightful

      They're not.
      they're making new prefixes for every third power of 10.

      If a consensus isn't reached relatively soon, the whole "billion" thing will happen again. it's been defined as both 10 to the 9th power and 12th power.

    2. Re:Fake need? by AHuxley · · Score: 1

      In terms of one standard NSA data storage facility?

      --
      Domestic spying is now "Benign Information Gathering"
    3. Re: Fake need? by Tablizer · · Score: 1

      I likk yerr bot hole in da heeted pewl

      Here ya go...

      Well, no heat, but wait until summer.

  6. StuffShirtBytes by Tablizer · · Score: 1

    Without professional intervention, Dr. Brown fears, the next numerical prefix could become the Boaty McBoatface of weights and measures.

    What's wrong with that? The rejection of "Boaty McBoatface" was a stuffed-shirt reaction. Going with that name could have helped increase funding even via increased awareness.

    BoatyBytes, McFaceBytes, sounds fine with me. The existing names are already silly, or at least magnets for jokes.

    1. Re:StuffShirtBytes by Tablizer · · Score: 1

      Or "Trilo": TriloBytes.
       

  7. Idiotic obsession with... by Anonymous Coward · · Score: 1, Insightful

    ... ancient languages.

    When we define the words and terms that are yet undefined, we can start fresh. We don't need to be chained to the past. And why not have numbers that sound cool to say that we can associate with things people know about it? This cult of ancient and dead languages is pretty disturbing. Since the naming convention is based on latin words for numbers is arbitrary in and of itself.

    1. Re:Idiotic obsession with... by taylormc · · Score: 1

      The advantage of using ancient languages - Greek in this case, BTW, not Latin - is that it allows a common vocabulary for use among speakers of many different mother tongues. Just as "petabyte" is founded on Greek "pente" ("five"), "yottabyte" is founded on the Greek "okto" ("eight"); so the next iteration would most usefully be founded on Greek "ennea" ("nine").

  8. Re: JiggaByte by Tablizer · · Score: 1

    It's "Fly you fools."

    I'm a wingless being, you insensitive clod!

  9. May I suggest... by Major_Disorder · · Score: 1

    The Hellabyte.
    In honour of the great profit... Eric Cartman.

    --
    First law of people: People are generally stupid.
    1. Re:May I suggest... by Major_Disorder · · Score: 1

      > In honour of the great profit...

      You loose!

      Unless I did it just to mess with people.

      --
      First law of people: People are generally stupid.
  10. Weighing a planet, one milligram at a time. by geekmux · · Score: 1

    There's a reason we created terms like "ton" to describe considerable weight. Childrens electronic toys can hold multiple Libraries of Congress these days, so let's stop pretending that "mega-ultra-giga-bazillion" is going to impress anyone.

    Hell, if we're gonna get stupid about this, then why not measure each individual bit? I'm sure Mathy McMathface can get piss drunk on new number names with an 8x power factor.

    Yes, there's a lot of data in the world. We get it. Now perhaps we can grow up and create a reasonable unit of measure.

  11. SI unit? by Gabest · · Score: 1

    Yottitatard? Yottard? Yottetard?

  12. Re: JiggaByte by Tablizer · · Score: 1
  13. Byte my shiny metal exponents by az-saguaro · · Score: 3, Interesting

    The names must apply to all forms of measures and metrics.
    But, if the Bureau of Geeks and Nerds has its say, the names will be:

    whata-byte
    abigga-byte
    onthisa-byte
    myassa-byte
    heybitchdont-byte

    On the serious side, the current system requires us to remember three names or prefixes for each triad (each 10^3).
    For example:
    one-million or one-millionth, versus one mega-meter versus one micro-meter. Million-mega-micro-.
    one-thousand or one-thousandth, versus one kilo-liter or one milli-liter. Thousand-kilo-milli-.
    For Europeans and others speaking Latin or Romance languages, the cardinal number names may be closer to the multiplier-divider prefixes, but it is still a cumbersome system.

    For the higher order new numbers, why not make them with a uniform naming convention.
    For instance, the common root name, then tillo- and tetto-.
    Examples:
    10^27 = one octillion trees, one octillo-meter, one octetto-meter.
    10^30 = one nonillion beans, one nonillo-newton, one nonetto-joule.
    10^39 = one dodecillion electrons, one dodecillo-farad, one dodecetto-ohm.

    Instead of having unique initials as abbreviations, such MB, mm, cm, km, Gb, etc., try this, using "D" for "decade":
    My new computer has 4 of 10^27 byte chips = 4-D27B of memory.
    The distance to so and so galaxy is one nonillion meters away, or D30m away.
    Or, something like that.

    It just seems too cumbersome to remember too many contrived names and disparate prefixes for ever bigger numbers that no one can really comprehend or has the time to recall in the middle of a sentence that is meant to be fluent.

    1. Re:Byte my shiny metal exponents by AmiMoJo · · Score: 1

      My new computer has 4 of 10^27 byte chips = 4-D27B of memory.

      Computer memory is always powers of 2 though. Everything is built around that, such as the way the MMU works, and changing to powers of 10 would create huge complexity in the circuits for no benefit.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    2. Re:Byte my shiny metal exponents by mlheur · · Score: 2

      2^10 = 1KiB
      2^20 = 1MiB
      2^30 = 1GiB
      2^40 = 1TiB
      2^50 = 1PiB
      2^60 = 1EiB
      2^70 = 1ZiB
      2^80 = 1YiB

      now we add...

      2^90 = 1NiB (ninobyte)
      2^100 = 1DiB (decabyte)
      2^110 = 1LiB (levenbyte)
      2^120 = 1WiB (tWelvebyte)
      2^130 = 1BiB (because B looks like 13 in the right font/print)

      or just stop using prefixes and go full maths on it. e.g. "there are 3.250 x 2^98 bytes of storage"

  14. Doctor Brown? Doctor EMMET Brown? by Anonymous Coward · · Score: 1

    1.21 jiggawatts, at 88 mph.

  15. Re:Horseshit by Waffle+Iron · · Score: 3, Insightful

    Well, other than using powers of 1,024 (or powers of 1,000 for the pedantic types who are unfamiliar with base 2.)

    I'm familiar with base 2. So I know that hard drives typically allocate blocks of size 2^9 or 2^12, and I know that there is nothing else in a hard drive related to powers of two.

    Which means that insisting on using powers of 1024 notation is like demanding that we count everything related to the NBA in base 5, since basketball teams have 5 members.

    (Actually, power-of-1024 notation is even worse than that, since it uses a *mixture* of various mutually incompatible 1024 powers combined with decimal fractions, all of which makes Roman Numerals look practical by comparison.)

  16. Re:Doctor Brown? Doctor EMMET Brown? by antdude · · Score: 1

    Great Scott! This is heavy. :P

    --
    Ant(Dude) @ Quality Foraged Links (AQFL.net) & The Ant Farm (antfarm.ma.cx / antfarm.home.dhs.org).
  17. But, he USES existing prefixes! by sabbede · · Score: 1
    "[extraneous nonsense that follows no convention] for octillion (27 zeros) and nonillion (30 zeros)"

    What's wrong with Octilabyte and Nonilabyte?

    Besides nothing.

    1. Re:But, he USES existing prefixes! by dkman · · Score: 1

      They make sense. That's what's wrong. If we don't over complicate the hell out of it then people might be able to understand it.

      Sad, but true.

      --
      I refuse to sign
  18. Should have spelled it by JudgeFurious · · Score: 1

    ...Yodabyte

    --
    Appended to the end of comments you post. 120 chars.
  19. I vote for ... by mark_reh · · Score: 1

    kaijubytes ... because they do!

  20. Here's my list by dcooper_db9 · · Score: 1
    I know I'm late to the discussion but I've been thinking about this for a long time. This combines a base number and a order of magnitude number. It can be translated backward into existing terms and the principle can be used for much larger numbers than I list here:
    • 1E+027: k^9: koennea (k^ennea)
    • 1E+030: k^10: kodeca (k^deca)
    • 1E+036: M^6: mohexa (m^hexa)
    • 1E+042: M^7: mohepta (m^hepta)
    • 1E+045: G^5: gopenta (g^penta)
    • 1E+048: M^8: mocto (m^octo)
    • 1E+054: M^9: moena (m^ennea)
    • 1E+060: M^10: modeca (m^deca)
    • 1E+063: G^7: gohepta (g^hepta)
    • 1E+072: G^8: gocto (g^octo)
    • 1E+081: G^9: goennea (g^ennea)
    • 1E+090: G^10: godeca (g^deca)
    --
    I do not block ads. I do block third party scripts.