Text Compressor 1% Away From AI Threshold
Baldrson writes "Alexander Ratushnyak compressed the first 100,000,000 bytes of Wikipedia to a record-small 16,481,655 bytes (including decompression program), thereby not only winning the second payout of The Hutter Prize for Compression of Human Knowledge, but also bringing text compression within 1% of the threshold for artificial intelligence. At 1.319 bits per character, this result makes it likely that the next winner of the Hutter Prize will reach the threshold of human performance (between 0.6 and 1.3 bits per character) estimated by the founder of information theory, Claude Shannon, and confirmed by Cover and King in 1978 using text prediction gambling. When the Hutter Prize started, less than a year ago, the best performance was 1.466 bits per character. Alexander Ratushnyak's open-sourced GPL program is called paq8hp12 [rar file]."
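For anyone checking where the 1.319 figure comes from, here is a minimal sketch. It assumes the sizes quoted in the summary and treats each input byte as one character. The variable names are illustrative only and are not taken from paq8hp12.

```python
# Sizes quoted in the summary (the compressed size includes the decompressor).
INPUT_BYTES = 100_000_000
COMPRESSED_BYTES = 16_481_655

# Starting point quoted in the summary for when the prize began.
START_BPC = 1.466

# Bits per character: compressed bits divided by input characters,
# assuming one byte per character.
bpc = COMPRESSED_BYTES * 8 / INPUT_BYTES

# Relative improvement over the quoted starting figure.
improvement = (START_BPC - bpc) / START_BPC

print(f"bits per character: {bpc:.3f}")
print(f"improvement since the prize started: {improvement:.1%}")
```

The first line of output should match the 1.319 quoted above after rounding to three decimals.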
... does it run linux?
Will this work on Linux?
This entire thread just points out how stupid I am on the Grand-Nerd scale of things ;-)
So if we compress Google, we will give birth to Skynet? How the fuck does a compression program == AI?
If you mod me down, I will become more powerful than you can imagine....
Note that you didn't eliminate the spaces - you changed their form into requiring the next character to have a "preceding character is a space" bit.
That adds 1 bit to every character. Lowercase letters plus numerals (36 characters) already need 6 bits each, so you are now at 7.
In the long run it probably comes out slightly ahead for that text sample: an explicit space costs 6 bits per word, the flag costs 1 bit per letter, and the average 'word' length is under 6 characters. But it doesn't do much.
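A quick sketch of that trade-off, assuming fixed-width codes and the 36-character alphabet from the comment above. The function names are made up for illustration, and the per-word accounting (one space per word) is a simplification.

```python
import math

ALPHABET_SIZE = 36  # lowercase letters + numerals, as in the comment
BASE_BITS = math.ceil(math.log2(ALPHABET_SIZE))  # 6 bits per symbol

def bits_with_space_symbol(word_len):
    # Each word is followed by an explicit space symbol.
    # 37 symbols (36 + space) still fit in 6 bits.
    return BASE_BITS * (word_len + 1)

def bits_with_space_flag(word_len):
    # No space symbol; every character carries one extra
    # "preceding character is a space" bit.
    return (BASE_BITS + 1) * word_len

for n in range(1, 9):
    a = bits_with_space_symbol(n)
    b = bits_with_space_flag(n)
    print(f"word length {n}: space symbol {a} bits, flag bit {b} bits")
```

Under these assumptions the flag scheme is cheaper for words shorter than 6 characters, breaks even at exactly 6, and loses beyond that, so the saving on typical English text is a bit or so per word at best.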
One of these days, I should actually create an account here, I'm really just an Anonymous Lazy Lurker Person...
Well, if you're gonna do all that, you may as well be creating AI. Which I guess is the point.