1st International Longest Tweet Results
Dr_Evil6_6_6 writes "Slashdot had a story about the 1st International Longest Tweet Contest last month, and the winners have just been announced." The winner is impressive.
← Back to Stories (view on slashdot.org)
FTA:
:= K // Initialize accumulator := a % N // Extract lowest-order digit from accumulator := U + d // Convert base-N digit to 32-bit character := c // Store 32-bit character in message := (a – d) / N // Subtract lowest-order digit and shift accumulator right
:= 0 // Initialize accumulator := M[i] // Extract 32-bit character := c – U // Convert 32-bit character to base-N digit := (a * N) + d // Shift accumulator left and add digit
"Todd Lehman says:
March 25, 2010 at 10:14 pm
The maximum number of bits you can squeeze into a tweet using 32-bit non-Unicode characters is 4339.
Below is an algorithm which uses large-integer arithmetic to carry out operations which encode and decode a 4339-bit value. Perhaps someone can develop this into a working program.
Constants:
Let V = 2 = 2147483648 be the number of non-negative integers representable in 31 bits.
Let U = 10FFFF + 1 = 110000 = 1114112 be the number of valid Unicode code points.
Let N = V – U = 2147483648 – U = 2146369536 be the base for numeric conversions below.
The encoding and decoding algorithms below operate on the principle that 4339 / logV 139.97 and that ceil(4339 / logV) = 140.
Encoding algorithm (K M[140])
(1) Let K be any 4339-bit integer
(2) a
(3) for each i in {0..139}:
(3a) d
(3b) c
(3c) M[139-i]
(3d) a
(4) Output M[140]
Decoding algorithm (M[140] K)
(1) Let M[140] be an array of characters representing a message to decode
(2) a
(3) for each i in {0..139}:
(3a) c
(3b) d
(3c) a
(4) K = a"
Anyone care to translate?
my karma will be here long after I'm gone
I don't get it.
If they ask what can be arbitrarily stored in the 4339bits available then there you can store 4339 arbitrary bits. It's a rule of compression. If they are asking for an English language compression program there are plenty better out there. Also if the goal is compression of English text and they aren't including the program size in the tweet then the competition can easily be cheated using a dictionary in the program that can be looked up.
At the winner it's not a particularly good compression algorithm. It doesn't even seem to take bayesian probability of characters into account. I can't see any arithmetic coding (mathematically the perfect entropy encoder) either.
Except for the fact the algorithms he has submitted have NOTHING to do with compression, and are just a method of mapping the 4339 bits into the allowable Unicode character set over 140 x 32 bit character "slots", i.e. encoding / decoding only.
With 4339 bits, hell in theory the longest actual tweet you could make is 2^4339 of any single character you choose, using the 4339 bits just to represent a (very large) counter of how many times to repeat the character.
Considering that 2^4339 is approximately 10^1305, and there are probably only 10^82 atoms in the whole universe, that's one bloody long tweet.
Is impressive another word for fucking stupid?
I missed that memo i guess...
Hey boss! you're pretty impressive today!
http://www.steike.com/code/useless/zip-file-quine/ ...infinite compression.
45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
Ah, so someone has finally determined the absolute breadth of the Twittersphere. If the world ran on tweets, maybe we wouldn't ever need more than 64kB of memory.
Solution - Just tweet the following picture of a swimming fish:
".`.`..`.>"
Given that 1 word is 16 bits, and a picture is equal to 1,000 words, :-)
that makes my above tweet 16,000 bits of information (fitting
several pictures in a tweet may extend this further)
(.)(.)
(.Y.)
d^_^b
48000bits!
Long tweet is looooooooooong.
I eventually gave in and read TFA; they actually describe the winning algorithm...in the contest description. The contest was just to implement it (sort of). And (apparently) no one attempted to use the valid unicode characters as well. They just avoided them (like the contest bloggers) because they weren't sure that there wasn't some arbitrary string of characters that would mess up the message.
I suppose that the contest could continue on that basis alone: how many more bits can you encode by using the printable characters, without choking on arbitrary data? It is more likely that it will vanish in a poof of apathy instead, since no one bothered to do that this time.
Twitter is down? I don't know what Ashton Kutcher is having for lunch!
I DON'T KNOW WHAT ASHTON KUTCHER IS HAVING FOR LUNCH!!!!!!
ALL THAT HAS BEEN AND ALL THAT WILL BE IS GONE FOREVER! REPENT SINNERS! REPENT!!!!
these characters here to bypass bullshit filter adfasdfasdf dfasdfasdf asdfasdfasdf asdfasd fasdfasdf asdf