Slashdot Mirror


A Mathematical Proof Too Long To Check

mikejuk writes "Mathematicians have generally gotten over their unease with computer-assisted proofs. But in the case of a new proof from researchers at the University of Liverpool, we may have crossed a line. The proof is currently contained within a 13 GB file — more space than is required to hold the entirety of Wikipedia. Its size makes it unlikely that humans will be able to check and confirm the proof. The theorem that has been proved is in connection with a long running conjecture of Paul Erdos in 1930. Discrepancy theory is about how possible it is to distribute something evenly. It occurs in lots of different forms and even has a connection with cryptography. In 1993 it was proved that an infinite series cannot have a discrepancy of 1 or less. This proved the theorem for C=1. The recent progress, which pushes C up to 2, was made possible by a clever idea of using a SAT solver — a program that finds values that make an expression true. Things went well up to length 1160, which was proved to have discrepancy 2, but at length 1161 the SAT returned the result that there was no assignment. The negative result generated an unsatisfiability certificate: the proof that a sequence of length 1161 has no subsequence with discrepancy 2 requires over 13 gigabytes of data. As the authors of the paper write: '[it]...is probably one of longest proofs of a non-trivial mathematical result ever produced. ... one may have doubts about to which degree this can be accepted as a proof of a mathematical statement.' Does this matter? Probably not — as long as other programs can check the result and the program itself has to be considered part of the proof."

19 of 189 comments (clear)

  1. To long, didn't check. by fleabay · · Score: 5, Funny

    TL;DC

    1. Re:To long, didn't check. by Garridan · · Score: 5, Informative

      Funny thing about this. They've checked it. Actually, their "check" of this proof is many of orders of magnitude more rigorous than when, for example, a reviewer "checks" a math paper for errors before firing off a positive review. Nondisclaimer: I'm a mathematician.

    2. Re:To long, didn't check. by SydShamino · · Score: 4, Funny

      The neat part is that, if you take the first bit of each byte of the proof and string them all together, you get a complete HD MPEG copy of The Matrix.

      --
      It doesn't hurt to be nice.
    3. Re:To long, didn't check. by Garridan · · Score: 5, Insightful

      Mathematicians are supposed to be able to think at a higher level of abstraction than most other folks. Any mathematician who claims that 'this is too much for a human to check' is an idiot. It's not too much. We understand how computers work. They're way less error-prone than humans.

      1) Verify the proof that the verification algorithm works.
      2) Obtain several independent simple, portable implementations of said verification.
      3) Run said implementations on proof certificate on a variety of hardware.

      Trust the math, and where it comes to the hardware and software, trust but verify. Too long to check without aid of a computer? Sure, I'll buy that. But you'd have to be an idiot to want to check this proof without a computer. Why is this news? (actually, the result in discrepancy theory is wonderful, and I'm very happy to see it here on Slashdot... but massive computer proofs are truly nothing new)

    4. Re:To long, didn't check. by maxwell+demon · · Score: 5, Funny

      So you say the real reason why they cannot check the proof is that they would violate the DMCA by doing so?

      --
      The Tao of math: The numbers you can count are not the real numbers.
  2. wow by Anonymous Coward · · Score: 5, Insightful

    less space than wikipedia? that sounds large.

    wtf?

    1. Re:wow by HaZardman27 · · Score: 5, Funny

      I guess we've moved on from using "Libraries of Congress" as a unit of data size. I wonder how many "less than Wikipedia"s worth of data the NSA has?

      --
      Apparently wizard is not a legitimate career path, so I chose programmer instead.
  3. the beginning, not the end by EngineeringStudent · · Score: 5, Interesting

    it is the beginning of AI-science, not the end of human science.

    Science requires testable, provable, repeatable. If a human cannot understand the proof then he cannot participate in the science. This is likely to be referred to as an "early" version of machine-exclusive science.

    1. Re:the beginning, not the end by Kufat · · Score: 5, Insightful

      I'd hesitate to call one big for loop "AI." The interesting part of the proof is the reduction to SAT, and that's easily understood by mathematicians. The computer part is a straightforward and dull brute force search.

  4. After 9.5gigs by jellomizer · · Score: 5, Funny

    In the results there is the following statement.
    "As any idiot can plainly see"

    --
    If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    1. Re:After 9.5gigs by QilessQi · · Score: 4, Funny

      I have it on good authority that one of the steps of the proof is "???", followed by "PROFIT!".

  5. Paging Mr Fermat... by UdoKeir · · Score: 5, Funny

    I have discovered a truly marvellous proof of this, which this DVD is too small to contain.

  6. Grad students? by EvilSS · · Score: 5, Funny

    "Its size makes it unlikely that humans will be able to check and confirm the proof."

    I thought that's what grad students were for: endless mind-numbing labor. "Here, check this and have it back to me in 30 years or so."

    --
    I browse on +1 so AC's need not respond, I won't see it.
  7. Less space than Wikipedia by BlueMonk · · Score: 5, Insightful

    less space than is required to hold the entirety of Wikipedia

    I'd venture a guess that this is not unique and that every mathematical proof to date takes less space than Wikipedia. Did they mean more space?

  8. Re:prove that the program works by ClickOnThis · · Score: 5, Informative

    Prove that the algorithm works. That's your proof.

    Gödel and Turing make strong cases that proving the algorithm works for some inputs that are correct proofs doesn't count as proof it will work for all correct proof inputs. So no, even if you "prove the algorithm works" it is not the same as a rigorous mathematical proof.

    You're comparing apples to oranges (and lemons.)

    If the algorithm can be proved correct (within whatever axiomatic system you're using) then it's correct. The End.

    Gödel's incompleteness theorem shows that certain statements about axiomatic systems can be true but cannot be proved. That doesn't mean you can't be certain of something that is in fact proved (subject of course to the axioms.)

    Turing's halting problem is a statement about limitations in the ability of algorithms to examine other algorithms. Again, it doesn't mean you can't prove that an algorithm is correct.

    --
    If it weren't for deadlines, nothing would be late.
  9. Canadian Prime Minister would say... by jayveekay · · Score: 4, Funny

    "A proof is a proof. What kind of a proof? It's a proof. A proof is a proof. And when you have a good proof, it's because it's proven."

    Jean Chretien, former Prime Minister of Canada

  10. Oh, so that's what Beta is for by Tenebrousedge · · Score: 4, Funny

    Editor? This is Slashdot.

    You forgot to finish with the kick into the pit of death.

    But what if GP is already using Beta?

    --
    Those who advocate genocide deserve every protection afforded by law, and none afforded by common human decency.
  11. SAT is not a brute force loop by Mask · · Score: 5, Informative
    As someone nearing the completion of his Ph.D. in a subject close to SAT I can say that SAT does not resemble "one big for loop", not a bit. A modern SAT solver can solve problems with millions of variables and hundreds of thousand clauses. In contrast, a brute force for loop would require O(2^N) iterations where N is in the millions, which is like eternity. As an exercise, please try to write a trivial solver that can handle even 100 variables.

    Also, unlike what you may think, a SAT proof is not a list of "I tried a=1 and it did not work out, and this is the proof that a=0". A standard SAT proof deduces new clauses from the original problem by applying the resolution rule repeatedly. The newly deduced clauses reduce the search space and, if the problem is unsatisfiable, the solver ends up with the empty clause, which is always FALSE. The proof is a collection of resolution steps that lead to FALSE.

    SAT solvers are AI at least since:
    • 1. They employ search (not unlike chess game).
    • 2. They have non-trivial heuristics (not unlike chess game).
    • 3. The heuristics evolve and improve over the course of a run.
    • 4. They are able to deduce new clauses from the original problem.
    • 5. Many solvers employ a lot of smarts to simplify the problem even before starting search.

    SAT is clearly NP complete, and clearly the existence of good SAT solvers is not a proof that P=NP. This means that there will be relatively small problems that SAT solvers won't be able to solve. On the other hand, most real-world problems have a hidden structure which SAT solvers are able to find and use to their advantage.

    1. Re:SAT is not a brute force loop by Kufat · · Score: 5, Interesting

      Yeah, I'm familiar with SAT solvers and the fact that they aren't REALLY full brute force; I oversimplified it a bit for the Slashdot crowd. Might have gone a little too far on the "lies to children" scale, mea culpa.

      My point was that anyone with high school level math experience can understand the basic problem of boolean satisfiability; I was trying to draw a distinction between problems that are beyond human comprehension and those that are merely beyond human time and ability, with huge SAT instances falling into the latter category. Shouldn't have glossed over the details quite as badly as I did.