Slashdot Mirror


Share The Pi!

freedumb writes "From this article in Nature: "Two mathematicians have now taken the first step towards proving that pi contains not a single message but every conceivable message, meaningful or not."" Actually, it's a discussion concerning whether "that all strings of the same length appear in pi with the same frequency: 87,435 appears as often as 30,752, and 451 as often as 862, a property known as normality."

26 of 380 comments (clear)

  1. Badass compression algorithm? by defile · · Score: 5

    Soo.. if pi contained every possible message (ie, was truly random), couldn't you in theory find a specific position where pi prints out say, the Max Payne ISO, and distribute that position to friends?

    Then, said friends, start calculating pi from that offset (wasn't there a story on slashdot about calculating any N digit of pi without having to calculate the first N-1 digits). Voila, kickass compression.

    Of course, the small snags here are:

    • Searching pi until you find that right position that matches your Max Payne ISO, which could be located on the far end of infinity.
    • Distributing what could be a multi-trillion digit number to your friends.

    But once you get over these boring details, pi-based-compression can make for some very neat applications

    1. Re:Badass compression algorithm? by The+Raven · · Score: 4

      It wouldn't work. With a completely random (normal) data set, the address of any particular string of numbers is of equal length to... the particular string of numbers! Thus, the average distance into pi of a four digit number... is a four digit number. I don't really care to do the exact math, but the end result is that the number of bits you wish to find and encode the address of would, on average, require an address with an equal number of bits.

      Raven


      And my soul from out that shadow that lies floating on the floor

      --
      "I will trust Google to 'do no evil' until the founders no longer run it." Hello Alphabet.
    2. Re:Badass compression algorithm? by localman · · Score: 4
      Distributing what could be a multi-trillion digit number to your friends.

      Easy! All you have to do then is search pi for the multi-trillion digit number and then send it's offset. If that offset is still to long you can just do it again until you ended up with, like, a single digit!

  2. Re:Useless Pi Fact by wirefarm · · Score: 3

    Oddly, my ICQ number is at position 19724810 counting from the first digit after the decimal point.
    What're the odds of *that*?
    ;-)
    Cheers,
    Jim in Tokyo


    Have no clue about firewalls?

    --
    -- My Weblog.
  3. Random bits that are in Pi somewhere by BigKahuna · · Score: 3

    Why concentrate on just pi? If they show it's true for all trancendental numbers, they've got it for pi, e, etc.

    Can pi appear in pi anywhere? I guess not, since that would mean that pi repeats. Could e be in pi? I suppose if e was in pi, and pi was in e, then pi would be in pi, which I guessed earlier it couldn't be. But, maybe I'm wrong and there's a loophole since if pi contains itself, there's an infinite recursion going on.

    If there were an infinite number of monkeys typing on an infinite number of keyboards would they eventually produce all the works of Shakespeare? Not exactly - they would produce them immediately (as quickly as a monkey can type). That's infinity for you. The lazy 8. It goes on and on...

    --
    BigKahuna
    1. Re:Random bits that are in Pi somewhere by Moogoo · · Score: 3
      Why concentrate on just pi? If they show it's true for all trancendental numbers, they've got it for pi, e, etc.

      My guess is as good as yours, but probably the reason they focus on pi is that pi is very old and very basic; it's one of those things that the ancient Greeks thought about. e, OTOH, is a little younger and arises from a more difficult problem.

      Can pi appear in pi anywhere? I guess not, since that would mean that pi repeats.

      Nope, pi can't appear directly; that is, pi can't look like 3.1415...31415... Think about it like this: if pi contained a copy of pi starting at the n-th digit, then the (n + m)-th digit of pi would be the same as the m-th digit of pi for every m. And then pi would be the same as (first n - 1 digits) + 10^(-n) * pi. This gives pi * (1 - 10^(-n)) = first (n - 1) digits of pi, which in turn gives pi = [first (n - 1) digits] / (1 - 10^(-n)), which is a rational number.

      I suppose pi could appear in pi in a slightly more complicated way though. For instance, it could be interleaved with other stuff, i.e. pi = ...3*1*4*1*5*... where the *'s are other digits.

  4. Re:Normality by SamIIs · · Score: 3

    That's not quite true. There exist numbers which have a PATTERN, but don't actually repeat.

    For instance, .101001000100001000001...

    There's certainly a pattern (1 zero then 2 zeros then 3...) but the number never repeats.

  5. source code for windows? by ankit · · Score: 5

    this is the latest : microsoft sues pi for containing the complete source code to windoze.
    btw, the code starts at position 4200394298 (in the binary expansion of pi), and continues for well, as long as anyone ccan read the stuff...

    --
    Don't Panic
    1. Re:source code for windows? by glebite · · Score: 4

      So, the ancients were right - there are no new things under the sun!

      All messages in PI exist as the ultimate expression of prior art...

      Coolness...

      --
      I donate all spillover Karma to the charity of my choice... Ada was still a babe despite what people may say...
  6. Re:Useless Pi Fact by BlueUnderwear · · Score: 3
    > Big deal...since Pi is an irrational number, and never ends, at some point there is a string of 5,646,498,765 8's all in a row.

    Not necessarily so. If you define a (decimal) number as follows:

      • it starts with 1. after the dot, each digit at a prime number position is 1, and 0 otherwise
    The number would be 1.01101010001010001...

    • The number would have no periodicity (because prime numbers become rarer and rarer), so it woule not be a rational number.
    • It never ends.
    • But still, by construction, it would not contain a single 8, much less a series of 5,646,498,765 eights.
    Thus proving that never ending irrational number does not necessarily contain all strings. Btw, irrational numbers are always never ending, or else they would be a fraction whose denominator would be a large power of ten. Think about it.
    --
    Say no to software patents.
  7. Re:Useless Pi Fact by BlueUnderwear · · Score: 3
    > This is a bullshit proof. First, you define your number not to contain any 8s, and then you say "see, it doesn't contain any 8s!"

    If you did any math in your youth, you'd know that this is a perfectly valid way to do a proof. It's called "coming up with a counterexample". If somebody claimed that all prime numbers were odd, it would be perfectly valid to point out that 2 is both prime and even. Discarding the proof because "you purposefully picked 2 to show me wrong" is invalid, as this is the whole point of the proof.

    Likewise, in this case DNS-and-BIND claimed that all infinitely long irrational numbers have necessary a long sequence of 8's in them. I refuted his claim by showing him a number which had no 8 at all inside. Now, what exactly is your problem with my refutation of that claim?

    > But that doesn't tell us anything about wether or not there is a string of 5,646,498,765 8s in pi or any other irrational number in decimal.

    You're right on that, but nobody claimed the contrary. Saying "not all irrational numbers have a long string of 8's in them" is not the same thing as "no irrational numbers have a long string of 8's in them".

    It's like in real life: "not all pies are cream pies" (or expressed differently "it is a pie, so it has to be a cream pie"): indeed, there are also apple pies...

    But that doesn't mean that "no pies are cream pies" (i.e. cream pies don't exist): indeed Bill Gates was hit by one in the face...

    --
    Say no to software patents.
  8. Re:New cult... by BlueUnderwear · · Score: 5
    With your username, you should know one egregious example of funny strings in Pi at funny positions:

    42424242 at position 242424.

    Oddly enough, according to the pi search page, the same string can be found again at position 1404114, which is also below 100000000. On a normal pi, you'd expect a single occurrance of 42424242 below 100000000, and at a completely random position...

    --
    Say no to software patents.
  9. Re:Normality by BlueUnderwear · · Score: 5
    > I was always taught that because pi is infinite (i.e. never ends), it must repeat itself somewhere in itself

    Cannot be, or else it would repeat itself an infinite number of times, cyclically, which would make it a rational number.

    --
    Say no to software patents.
  10. New cult... by Zaphod+B · · Score: 3

    It will start a whole new branch of numerology dedicated to finding entire new holy books... the Book of the Damned, I Microsoft, II Microsoft, the letter of BOFH to the Great Unwashed, and, of course, the source code to Office (which will take up the space between 2^8 and 2^40906) ...


    Zaphod B
    --
    Zaphod B
    When duplication is outlawed, only outlaws will have /bin/cp
  11. Re:Normality by Mister+Attack · · Score: 3
    The article states specifically that the researchers are working in binary. The property they are looking for to prove normality is a property of a binary number. The base-10 numbers they gave were probably just examples that "normal" people would understand.

    So if anything, they are proving normality to base 2^n, NOT base 10^n. And it may actually be that their proof is general enough to show normality in all bases - the article is not clear on that point.

  12. Goin' on an MP3 hunt... by 4/3PI*R^3 · · Score: 3

    Ok, if an average MP3 is 5MB (5,242,880 bytes) then the odds of finding a specific MP3 using a sequence of random numbers is 1/256**5,242,880 (since 8-bit bytes have 256 possibilities). This is about the same as 1/10**12,626,113 (since our base 10 numbering system gives each digit only 10 possibilities)

    P(MP3)=1/10**12,626,113 (really close to 0)

    Thus the odds of not finding a specific 5Mb MP3 using a sequence of random digits is 1-P(MP3)

    ~P(MP3)=1-1/10**12,626,113 (much closer to 1)

    Since the longest known expansion of Pi is about 500 Billion digits (500,000,000,000) there are 499,987,373,888 consecutive strings of 12,626,113 digits contained in the known expansion. So the question is, what is the probabilty that at least one of these is the specific 5MB MP3 we are looking for.

    An easier question is to ask what is the probabilty that we won't find the specific 5MB MP3 in a 500 Billion digit expansion of Pi.

    The probability that any specific 12,626,113 digit substring of the 500 Billion digit expansion of Pi is not the 5MB MP3 we are looking for is ~P(MP3). So the probability that every one of the 499,987,373,888 possible 12,626,113 digit expansions is not the 5MB MP3 we are looking for is ~P(MP3)**499,987,373,888.

    P(~MP3)=~P(MP3)**499,987,373,888

    So now that we know the probability that a specific 5MB MP3 file is not contained in the 500 Billion digit known expansion of Pi, we can calculate the probability that we can find at least one instance of a specific 5MB MP3 as ~P(~MP3)=1-P(~MP3).

    ~P(~MP3)=1-P(~MP3) =1-~P(MP3)**499,987,373,888 =1-(1-P(MP3))**499,987,373,888 =1-(1-1/10**12,626,113)**499,987,373,888

    Hmmm... I think I'll go by a lottery ticket.

  13. Other stuff that could be found by Dr_Cheeks · · Score: 3
    C'mon; there's gotta be some other moderators out there with funny/insightful points going spare that could be well used on the parent post : )

    Other possible things that we could find -

    • Trademarked names, e.g. Disney(TM), Microsoft(TM), McDonalds(TM), etc.
    • A complete recording of Master Of Puppets in MP3 format.
    • Various satirical jokes about George W.
    • A complete DivX of Star Wars : Episode 3 (found 1 month before the movie is released). Or for that matter, a better Episode 1.
    • A circle, drawn in "1"s, when the number is examined in base 11 (nod to the late Carl Sagan).
    --

  14. Irrationality by sigwinch · · Score: 3
    Not being a troll, but I still don't see the big deal about one irrational number.
    'Irrational' means literally 'cannot be written as a ratio'. This doesn't necessarily mean that the digits are random. You can have numbers like
    3.44333444443333334444444...
    that are irrational, but whose digits are trivially deterministic. Boring.

    Then there are the 'dirty' irrational numbers like pi and e that seem to have random digits. The research mentioned has moved a big step closer to proving that the digits of pi don't just seem random, they truly are random (at least in the sense that all possible combinations occur).

    The part that'll really blow your mind is that somebody found an equation that tells you any binary digit of pi you want, without having to calculate any of the other binary digits. (See here.) That is why people are excited by the conjectured normality of pi: if normal, it produces all possible strings of bits from a trivial deterministic equation. This mixture of randomness with order is at the heart of many interesting questions in chaos theory, computational theory, and cryptography.

    --

    --
    Kuro5hin.org: where the good times never end. ;-)

  15. Well, I know one thing this will mean... by Antaeus+Feldspar · · Score: 3

    And that is a pain in the neck for everyone in comp.compression.

    There is a frequent fallacy among those who almost understand how compression works, that works like this:

    "Wait a minute! I bet that every set of digits that someone could be trying to encode can be found somewhere in the digits of pi! Therefore, we can compress any sequence by simply reducing it to the number of digits in the sequence, and the offset in the digits of pi where an identical sequence begins!

    The assumption, of course, is that the number of digits and the offset can be encoded in a form that will be smaller than the original sequence. There is nothing to warrant that assumption. The fact is that the number of possible inputs that a lossless compression method can handle places lower bounds on the average length of its outputs. This means that no lossless compression method can achieve a lower average length for its outputs than would be achieved by simply numbering them all with the non-negative integers.

    In fact, 'compressing' a sequence of digits into a (length, offset) pair will do substantially worse, since there are multiple (length, offset) pairs that will correspond to a given digit sequence; for instance, "1" could be encoded as (1,0) or (1,2). This duplication means that (1,2) is essentially wasted, since it could be representing a sequence that currently has a longer representation.

    Lossless compression methods need to be used in conjunction with models: some criteria that separates the data we will want to compress from the vast majority of files, about which we do not care. The accuracy of this model affects how many of our inputs we can actually compress, and its precision affects the average compression ratio.

    --
    If people are to respect the law, perhaps the law should begin by respecting the people.
  16. Re:Normality by BadDoggie · · Score: 3
    This has happened before, when mathematicians realised they had been basing proofs on a couple assumptions which themselves had never been proven. You can read about it in Simon Singh's "Fermat's Last Theorem", an extremely readable and enjoyable look into both Fermat and mathematics in general.

    A teacher, a physicist and a mathemetician are having drinks together in a Scottish pub when the teacher looks out the window and sees a white sheep. The teacher says, "There are white sheep in Scotland". The physicist looks out the window and declares, "There are sheep in Scotland; we have already detected and confirmed white ones." The mathematician says, "In Scotland there is at least one sheep, at least one side of which is hite."

    No, I didn't pull that from Singh (it's there, though). It's an old mathematician's joke but it's true. The most anal Rainman you've ever seen is incredibly chaotic compared to mathematicians (at least when they're working on a proof or theorem).

    woof.

    "No ma. You don't have to worry about Code Red. Yes, I know CNN told you that you do. Ma, do you run a Web server? No, Netscape is a browser, not a server. Yes, there's a difference. You don't want to know. No, and the Internet didn't die last week, either..." -- my side of a phone call two nights ago.

  17. Craziness with transcendental and imaginary #s by proxima · · Score: 3

    Trying to imagine why every n digit number shows up the exact same amount of times is hard to imagine at first. But then, once you think about it, on an infinite scale, it would seem to attest to Pi's true randomness.

    On a side note, I had a Calc II professor awhile back that wrote on the board:

    e^(i*Pi) = -1 (of course, using the real symbols).

    Then, he proved it. I have the proof written down in a notebook and I even managed to work through the final parts of the proof (it uses a standard solution for finding e^(i*A*X) without using it. If anyone is really interested in seeing it, I can post it (in rough ascii math =) For those of you with TI-92s that don't believe me, type it in. That magical machine can do more than I give it credit for sometimes.

    Anyway, I just thought it was absolutely incredible that you could mix the two most popular transcendental numbers with the imaginary number (square root of -1) and spit out plain old -1.

    --
    "The universe seems neither benign nor hostile, merely indifferent." --Carl Sagan
  18. Re:Every message? by junklight · · Score: 4
    If thats the case then the DeCSS algorithum can be denoted by the formula for PI and the decimal position.

    Who they going to sue then - the universe. Or perhaps they are going to require licenses to use PI instead...

  19. Ban the circle! by Telal · · Score: 5

    If pi has all conceivable messages, pi must contain all of the US military's secrets, DeCSS, kiddie pr0n, violent and explicit sexual films beyond anyone's imagination and much much more. It must therefore be banned. When you get the death penalty for circle possession, don't say I didn't warn you...

    1. Re:Ban the circle! by MarkusQ · · Score: 3
      If pi has all conceivable messages, pi must contain all of the US military's secrets, DeCSS, kiddie pr0n, violent and explicit sexual films beyond anyone's imagination and much much more. It must therefore be banned.

      It must also contain all finite length MP3s. Therefore under the DMCA it already is banned.

      The sad part is, I'm not joking. The DMCA is so absurdly broad that you could easily raise a cogent case for using it to ban the concept of Pi for this very reason.

      -- MarkusQ

  20. Re:What about the length between the messages? by Gary+Yngve · · Score: 3

    A specific string of k bits long will on average occur every 2^k bits of a random bitstream. (For example, it will on average take about 1024 coin tosses to get 10 heads in a row.)[0] The reasoning I have in mind for this relies on the fact that coin tosses are independent events. The digits of pi are certainly not independent random events because we can calculate them. However, because of normality behaves macroscopically much like independent events, k precise bits will occur on average every 2^k bits of pi. I imagine the actual proof is highly nontrivial, but handwavy entropy arguments convince me that this is true.

    [0] As an interesting sidenote, many of the "streaks" in sports coincide roughly with the streaks that probability dictates. Baseball hitting streaks aren't necessarily because the hitter is in the zone... they may be because probabalistically, they are due to hit in 30 games in a row if their average is .320 and they play 1000 games (made-up numbers).

  21. roll your own PI by beanerspace · · Score: 5
    For those of you with a spare machine and time on your hands, here are some links that show you how to calculate your own value for PI: