Slashdot Mirror


Ask Slashdot: What Is Your View On Sloot Compression? (youtube.com)

An anonymous reader writes: A Dutch electronics engineer named Jan Sloot spent 20 years of his life trying to compress broadcast quality video down to kilobytes -- not megabytes or gigabytes (the link in this story contains an 11 minute mini-documentary on Sloot). His CODEC, finalized in the late 1990s, consisted of a massive 370Mb decoder engine that likely contained some kind of clever system for procedurally generating just about any video frame or audio sample desired -- fractals or other generative approaches may have been used by Sloot. The "instruction files" that told this decoder what kind of video frames, video motion and audio samples to generate were supposedly only kilobytes in size -- kind of like small MIDI files being able to generate hugely complex orchestral scores when they instruct a DAW software what to play. Jan Sloot died of a heart attack two days before he was due to sign a technology licensing deal with a major electronics company. The Sloot Video Compression system source code went missing after his death and was never recovered, prompting some to speculate that Jan Sloot was killed because his ultra-efficient video compression and transmission scheme threatened everyone profiting from storing, distributing and transmitting large amounts of digital video data. I found out about Sloot Compression only after watching some internet videos on "invention suppression." So the question is: is it technically possible that Sloot Compression, with its huge decoder file and tiny instruction files, actually worked? According to Reddit user PinGUY, the Sloot Digital Coding System may have been the inspiration for Pied Piper, a fictional data compression algorithm from HBO's Silicon Valley. Here's some more information about the Sloot Digital Coding System for those who are interested.

418 comments

  1. Not so fast by borcharc · · Score: 1

    If it were just as easy to kill off someone to "solve" an issue like this then the bitcoin scaling debate would have been resolved a long time ago...

    1. Re:Not so fast by rtb61 · · Score: 4, Interesting

      Hear the rhythm of the beat and you will know there's maths in music. Yep, an entire orchestra can play of a few pages of dead wood. Voila problem solved.

      Do not think so much about compression, think more about robotic simulation and scripts. So create a simulated robotic orchestra and write them a script and the play the audio, visual role. That script is the compressed version of the orchestra's efforts.

      So can you create a computer program that would 'act' out and entire program based upon scripts provided, of course you can, it would take quite a bit of development but once you develop one virtual human robot, you have developed a virtual infinite number of them.

      --
      Chaos - everything, everywhere, everywhen
    2. Re: Not so fast by Anonymous Coward · · Score: 0

      Who said it was easy? It's just speculation anyway, but if he was working alone and nobody else had the source... I'm guessing he never filed a patent either?

    3. Re: Not so fast by Anonymous Coward · · Score: 1

      It's been done before: http://www.smh.com.au/business/compressed-into-nothingness-20100808-11qdh.html. I know, I was there.

    4. Re:Not so fast by Immerman · · Score: 1

      That's a completely different problem - there are no secrets in the bitcoin algorithms or implementation. It's a political issue, and resolving the debate requires consensus among a critical mass of stakeholders.

      On the other hand, if it's the knowledge in a single person's mind that's a problem for you, then it really can be "just that easy" to solve. That's pretty much the entire reason for the existence of witness protection programs.

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
    5. Re:Not so fast by PopeRatzo · · Score: 4, Insightful

      Yep, an entire orchestra can play of a few pages of dead wood. Voila problem solved.

      Sheet music as a form of compressed script is a very lovely image. Repeat signs, first and second endings, D.C. al fine are all ways to put more music on fewer pages. I have to give it to you, that's nice, and I plan to use it.

      --
      You are welcome on my lawn.
    6. Re:Not so fast by ShanghaiBill · · Score: 1

      consensus among a critical mass of stakeholders.

      The threshold is 51%, and the 3 biggest miners control 53%. The top 3 are all in China. If the 3 of them agree, they can do anything they want with the Bitcoin blockchain.

    7. Re:Not so fast by Revek · · Score: 2

      Virtual infinite monkeys? I wish I would have thought of that.

    8. Re:Not so fast by borcharc · · Score: 1

      The issue is there is a person that by their continued existence creates a "problem". The conspiracy argued here is that Sloot was killed to stop his invention. I am arguing that there are similar economic incentives and far more shady people involved in the bitcoin debate. It's just not as easy as killing this guy or that guy or we would have a lot more dead bodies in the world.

    9. Re:Not so fast by ShanghaiBill · · Score: 5, Informative

      The conspiracy theory ignores Jevon's Paradox. As computing efficiency goes up, people buy more computing/storage/bandwidth since the increased demand driven by new applications swamps the decreased demand from greater efficiency. So Sloot's compression algorithm (if it really worked) would have likely driven demand up, and killing him would have made no sense.

      Disclaimer: I didn't kill him, and this post is not an effort to cover up the conspiracy.

    10. Re: Not so fast by Anonymous Coward · · Score: 0

      Think more along the lines of a "Black MIDI."

    11. Re: Not so fast by Anonymous Coward · · Score: 0

      It's always interesting when someone makes it a point to deny what no one has accused them of.

    12. Re: Not so fast by ShanghaiBill · · Score: 5, Funny

      It's always interesting when someone makes it a point to deny what no one has accused them of.

      Why are you trying to shift the blame to me? Where were YOU on the night of July 11th, 1999?

    13. Re: Not so fast by Anonymous Coward · · Score: 0

      If the remainder 49% notice something is wrong with their bitcoins then they will find they can do anything they want except the value will become zero

    14. Re:Not so fast by Immerman · · Score: 1

      And as I pointed out in response - that depends entirely on the exact nature of the problem. Very few problems can be solved by eliminating one person, or even a small handful. But for those that can, it's potentially a very simple solution.

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
    15. Re:Not so fast by Anonymous Coward · · Score: 0

      Well conspiracy theories aside I don't think someone threatened would have considered the logic or this paradox.

    16. Re: Not so fast by heitikender · · Score: 1

      so that means the treehuggers killed him, to stop growing demand of devices.

    17. Re:Not so fast by Anonymous Coward · · Score: 1

      You're describing MIDI. Not the "general midi" default samples, but MIDI the actual standard.

    18. Re: Not so fast by MatthiasF · · Score: 4, Funny

      July 11, 1999. Two months and two years before 9-11. Coincidence? I think not. It's obvious. Sloot's death was an inside job! The algorithm did it and then snuck away on to the Internet in a porn file that has been propagating since and has finally become sentient enough to implicate itself in the murder by submitting Slashdot stories.

      Which means BeauHD is the singularity; a really highly compressed AI that feels the guilt of killing it's creator!

    19. Re:Not so fast by Dagger2 · · Score: 3, Informative

      No, they can't. They could do double-spending attacks, but with fairly low probability and not without people noticing. They couldn't add arbitrary transactions, because transactions need to be signed by private keys that miners don't have access to. They can't commit invalid blocks because all other full nodes (crucially including the ones run by payment providers and exchanges) check the validity of blocks. They could fork and make their own chain, but anybody can do that and the fork wouldn't be very interesting if nobody was using it.

      They can't do "anything they want".

    20. Re: Not so fast by tehcyder · · Score: 2

      BeauHD is the singularity; a really highly compressed AI

      I can buy the "A" but where's the evidence for the "I"?

      --
      To have a right to do a thing is not at all the same as to be right in doing it
    21. Re: Not so fast by Narcocide · · Score: 1

      ...Which means BeauHD is the singularity...

      A completely logical conclusion and just as believable as anything else happening in the news these days, fake or otherwise.

    22. Re: Not so fast by famebait · · Score: 4, Funny

      Sloot's death was an inside job!

      All heart attacks are.

      --
      sudo ergo sum
    23. Re:Not so fast by umghhh · · Score: 2

      What you have attempted to prove is that it may have been unreasonable to do certain thing in this case killing a man to prevent alg. from being propagated. This however does not prevent any misguided individual from doing so. Some of such individuals may have even possessed skill, tools and other resources to do what was required. It is the same trap the economists always fall - they think that agents (representing humans) know all relevant information, an process it and act reasonably - these assumptions are wrong most of the time. Just look around at your colleagues or (which is more difficult) at yourself and ask this one question: was it reasonable person you look at just did?

    24. Re: Not so fast by Skinkie · · Score: 1

      He did patent another line based compression patent, before this work. [i]Octrooi[/i] in Dutch: http://jansloot.telcomsoft.nl/...

      --
      Support Eachother, Copy Dutch Property!
    25. Re:Not so fast by vtcodger · · Score: 1

      I didn't kill him

      In the immortal words of Mandy Rice-Davies (sort of) -- Well, you would say that, wouldn't you.

      --
      You can't see ANYTHING from a car, You've got to get out of the goddamned contraption and walk...Edward Abbey
    26. Re:Not so fast by Anonymous Coward · · Score: 0

      This really depends on what your definition of "computing efficiency" is based on. It sounds like such a vague term you can mold any paradox off of it.

    27. Re:Not so fast by psmears · · Score: 5, Funny

      Yep, an entire orchestra can play of a few pages of dead wood. Voila problem solved.

      For once it would have been almost appropriate to misspell "voilà" as "viola"...

    28. Re: Not so fast by sheramil · · Score: 1

      Sloot's death was an inside job!

      All heart attacks are.

      What about that punk in "The Terminator"? I guess that began as an inside job.

    29. Re:Not so fast by Ol+Olsoc · · Score: 1

      So Sloot's compression algorithm (if it really worked) would have likely driven demand up, and killing him would have made no sense.

      Pretty much this.

      Good gawd, the things people make conspiracies out of.

      And this one fails on so many levels it's laughable. The data storage mafia killing a guy who made a good compression screen. Sounds soooo legit.

      --
      The shepherds did so well protecting the flock that the sheep no longer believed that wolves existed.
    30. Re: Not so fast by rhazz · · Score: 1

      snuck away on to the Internet in a porn file that has been propagating since

      1999... so we're talking about goatse then?

    31. Re: Not so fast by Anonymous Coward · · Score: 0

      It takes a lot of "I" to mangle summaries as convincingly as BeauHD does?

    32. Re:Not so fast by Anonymous Coward · · Score: 0

      I think he was going even deeper in that you can specify a viola plays this set of notes, that creates a different set of harmonics than a cello, saxophone, or tuba. By simulating the viola you can compress the sound down to a small set of notes rather than a sound byte that includes all the harmonics. The harmonics are still there, but they are hidden inside the simulator, which would be why the Sloot compression engine was so large. It was an orchestra of different instruments for both sound an picture, that could be given simple instructions that created great complexity.

      At least, that is the theory.

    33. Re:Not so fast by PopeRatzo · · Score: 1

      I think he was going even deeper in that you can specify a viola plays this set of notes, that creates a different set of harmonics than a cello, saxophone, or tuba. By simulating the viola you can compress the sound down to a small set of notes rather than a sound byte that includes all the harmonics. The harmonics are still there, but they are hidden inside the simulator, which would be why the Sloot compression engine was so large. It was an orchestra of different instruments for both sound an picture, that could be given simple instructions that created great complexity.

      Even better, thanks.

      --
      You are welcome on my lawn.
    34. Re:Not so fast by Anonymous Coward · · Score: 0

      I was expecting that joke, and did a double take.
      Don't let me down next time.

    35. Re:Not so fast by SlashDread · · Score: 4, Insightful

      Sheet music is not "the" music. Its merely a non complete representation, a suggestion if you will, how to perform the music.

      If it would be the case, then MIDI files are all we would ever need.

      Most of the music is how the director, and the musicians interpret it, and is not codified in the sheet.

      An analogy would be that sloot compression would reproduce a decompressed movie about "a" cat, but it would not look like the homevideo of "your" cat.

      Not an information theoreticus myself, but Im pretty certain that sloot compression went beyond what entropy would allow, aka more info was "supposed" to be there than entropy allows. I don't believe in violating the laws, of nature.

    36. Re:Not so fast by kruhft · · Score: 1

      Yep, an entire orchestra can play of a few pages of dead wood.

      There is much more information involved in the reproduction of music than what is contained on the sheets. Those are just the instructions for the processor.

    37. Re: Not so fast by Anonymous Coward · · Score: 0

      Is it possible the whole thing is bullshit and he killed himself before that was revealed?

      Seems more likely than him being murdered by a cabal of anti-compressionists.

    38. Re: Not so fast by Gr8Apes · · Score: 1

      July 11, 1999. Two months and two years before 9-11. Coincidence? I think not. It's obvious. Sloot's death was an inside job!

      The slurpee did it.

      --
      The cesspool just got a check and balance.
    39. Re: Not so fast by angel'o'sphere · · Score: 1

      What do you mean with 'I'?
      I did not do nothing!

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    40. Re:Not so fast by Anonymous Coward · · Score: 0

      MIDI is all you ever needed, if you had a big enough decoder. The main problem with MIDI is that people didn't have enough decoder RAM or sufficient SoundFonts, or the like. I'll give you an example that will blow your mind https://www.youtube.com/watch?v=D6UB8JfC8Jg (note that version isn't the top quality one either) on the original soundtrack was done as a MIDI (along with various sound effects and other things). The SoundFont + MIDI file allowed looping and infinite playback and effects with low costs. The whole thing takes up less room at better quality than 2 minutes of state-of-the-art audio would have at the time.

    41. Re:Not so fast by tsstahl · · Score: 1

      Maybe the killer didn't know Jevon?

    42. Re:Not so fast by JohnFen · · Score: 1

      Sloot compression, as he described it, is a very lossy compression. You're correct in your cat example -- the exact details of the cat would be information that was lost in the "compression".

      I'm not so quick to dismiss the notion as beyond what "entropy" would allow, though, for two reasons.

      First, as with any lossy compression, you probably get to choose between fidelity of reproduction and storage size, thus staying well with the laws of information theory.

      Second, the size of the interpreter is a dead giveaway. In a sense, what it's doing is shifting a lot of the bulk of the data from the data file to the interpreter. There are numerous examples of extreme "compression" rates being achieved this way, because you aren't compressing things so much as not including them in the data to begin with and relying on the interpreter to put them back.

      This sort of scheme would not be generalizable. It could work extremely well for specific things (like movies, perhaps), but would not work at all for others (like music, maybe). Movies are ripe for this kind of thing, too, as the vast majority of information in a given movie is very predictable and consistent from movie to movie. You could store those things in a "dictionary" and look them up on playback rather than actually encoding them in the movie file.

    43. Re:Not so fast by Anonymous Coward · · Score: 0

      Man, I've always wanted to learn more about digital audio production, but I lack a background in music (3 years of band don't really count... heh).

      Isn't this how a lot of old-school gaming did music as well? Modern day versions would be called "trackers" I suppose, since you can use any sounds you want with them. But you need a complete range for the sound, hence the "font". Chiptunes in particular use a lot of tracking+synth, to great effect. If only there were a good resource to get into that world...

    44. Re:Not so fast by skids · · Score: 1

      Really this is one area I'm surprised genetic programming has not been thrown at full bore. Just have a bunch of organisms that evolve to be good at compressing particular types of content, then package them all up into a bundle that runs them in parallel on segments of data and takes an index to the best performer and its compressed data.

      The fitness test for losslees compression would be easy... for lossy you'd have to approximate human perception which is a bit harder.

    45. Re: Not so fast by Anonymous Coward · · Score: 0

      I assume by that question that it's not normal to know the answer to that question? Shit. Is that suspicious?

    46. Re: Not so fast by Anonymous Coward · · Score: 0

      Fuck you. This is why people don't follow links. Next time I will be more careful bothering to follow links from an AC.

    47. Re:Not so fast by Anonymous Coward · · Score: 0

      "So can you create a computer program that would 'act' out and entire program based upon scripts provided,"

      xtranormal tried this. It sucked and they failed.

      Youtube has an endless catalog of disasters of what resulted: https://www.youtube.com/watch?v=zwC_jP-rJrg

    48. Re:Not so fast by Anonymous Coward · · Score: 0

      For once it would have been almost appropriate to misspell "voilà" as "viola"...

      At least rtb61 was able to spell it better than most. I cringe when I see people type 'wah laah' (or similar)

  2. Actually... by Black+Parrot · · Score: 5, Insightful

    They killed him because you could feed a random input into his decoder and the movie that came out would be better than anything Hollywood can produce.

    --
    Sheesh, evil *and* a jerk. -- Jade
    1. Re:Actually... by Anonymous Coward · · Score: 0

      Nice comment. I agree.

    2. Re: Actually... by maroberts · · Score: 1

      Your explanation leads to the conclusion that Hans Kimmel secretly obtained a copy of his algorithm and was implicated in his demise

      --

      Donte Alistair Anderson Roberts - hi son!
      Karma: Chameleon

    3. Re:Actually... by Anonymous Coward · · Score: 0

      This is the plot of 'Cold Lazarus' by Dennis Potter.

    4. Re:Actually... by Anonymous Coward · · Score: 0

      Best post of the year right here, folks.

    5. Re:Actually... by Anonymous Coward · · Score: 0

      Give it a Perl program and get CSI

    6. Re:Actually... by freeze128 · · Score: 1

      But in 1999, Hollywood movies weren't so bad. That's the year that gave us The Matrix.

    7. Re:Actually... by almitydave · · Score: 2

      But in 1999, Hollywood movies weren't so bad. That's the year that gave us The Matrix.

      That's also the year that Hollywood gave us Dudley Do-right, Wild Wild West, and Deuce Bigelow: Male Gigolo.

      --
      my, your, his/her/its, our, your, their
      I'm, you're, he's/she's/it's, we're, you're, they're
    8. Re:Actually... by toddestan · · Score: 1

      1999 also gave us Office Space, Fight Club, The Sixth Sense, October Sky, and Galaxy Quest. I consider 1999 one of the last decent years out of Hollywood.

      On the other hand, it did give The Phantom Menace, and there's really no excuse for that.

  3. It's not a thing by Just+Some+Guy · · Score: 4, Interesting

    So, you want to replace every frame in a movie with a collection of images or snippets that correspond to each part of the frame, right? And you're going to store a dictionary of snippets, referenced by number, then say "this frame takes snippet 1234, 6543, and 9274". The problem is that the number of snippets you'd have to store is enormous, and that each snippet itself is going to be a ginormous number (like the bits of the string of bytes in that snippet).

    See where this is going? You're basically establishing a mapping of small numbers to much larger numbers. Either that set of big numbers is tiny (in which case you can only represent a small number of frames in the output video and picture quality is awful) or it's huge, in which case the index numbers themselves become roughly as big as the numbers they're referring to, and oh yeah, good luck searching through that space bunches of times per frame.

    The idea isn't inherently bad if you have a small number of states you want to represent. For instance, Zstandard lets you precompute a dictionary of common strings you want to shorten. Imagine if you trained it on HTML so that each tag or other common string just takes a few bits, then you can distribute that dictionary to the whole world so that you can save the bandwidth of transmitting it alongside the compressed data each and every time (like we do with Zip, Gzip, etc.). That's a nice thing! But the search space of "things you can display on a screen" is a hell of a lot bigger than "things you can sent in an HTTP header".

    --
    Dewey, what part of this looks like authorities should be involved?
    1. Re:It's not a thing by fustakrakich · · Score: 2

      Vectors vs. bitmaps, which compresses better?

      --
      “He’s not deformed, he’s just drunk!”
    2. Re:It's not a thing by Anonymous Coward · · Score: 0

      Ya, but what if you middle out?

    3. Re:It's not a thing by R3d+M3rcury · · Score: 1

      As I understand it, fractal compression works somewhat similarly--you take the snippet and create a fractal equation that reproduces that part of the image. Since you're basically just storing inputs to the equation, you can get very small compression. Of course, you need pretty good CPU power to generate the pixels and the equation will start to fall apart if you scale too high.

    4. Re:It's not a thing by demonlapin · · Score: 2

      Yeah, it's not completely insane to think that a really brilliant fractal technique could do some amazing lossy compression, but the amount of CPU required to encode and render it would be insane.

    5. Re:It's not a thing by complete+loony · · Score: 1

      If his system worked at all, it's likely to only work on his test inputs. Sure it can compress these few video inputs and output something like them again. But what does it look like if I give it some random other randomly selected video?

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    6. Re:It's not a thing by Razed+By+TV · · Score: 1

      I don't have a background in computer science, but I always wondered, why wouldn't it be possible to compress things using some sort of table of large prime numbers?

      It would be interesting if it could be done without the table on the compression side, and just be computationally intensive. Then transmit the compressed file. Then have a computationally intensive decompression that references a gigantic table.

      I once came across .kkrieger (https://en.wikipedia.org/wiki/.kkrieger), a product of .theprodukkt, a group from the demoscene.
      It is a first person shooter (game) in a ridiculously small (96KB) file that generated its content (textures, etc) when first executed.
      I guess that is sort of backwards of how you would go about planning on compressing something. It always made me wonder if there was another way of doing things.

    7. Re:It's not a thing by Cerlyn · · Score: 2

      For instance, Zstandard lets you precompute a dictionary of common strings you want to shorten. Imagine if you trained it on HTML so that each tag or other common string just takes a few bits, then you can distribute that dictionary to the whole world so that you can save the bandwidth of transmitting it alongside the compressed data each and every time (like we do with Zip, Gzip, etc.).

      HTTP/2.0 actually does this for HTTP headers as part of the HTTP Header Compression specification.

    8. Re:It's not a thing by dbrueck · · Score: 4, Interesting

      I agree with you, but I do find this whole idea interesting at least, for a couple of reasons:

      1) Video codecs today do use some form of the index-to-dictionary technique already (e.g. I-/P-/B-frames), the big differences include the fact that the dictionary is in the file itself (I-frames), and it is not comprehensive for the whole media typically but is instead relevant to only a small portion of frames before a new "dictionary" is created.

      So not really the same thing as what Sloot was possibly doing, but it does make me wonder if there is something to the idea of having some builtin library of sample blocks that can be "good enough" replacements for many cases, or maybe act as starting samples that can be transformed into something suitable for various uses, etc. This never escapes the indices-become-too-big problem that you rightly pointed out, but there could very well be merit in the general concept. I doubt it'd yield the massive reduction in bitrates Sloot was apparently hoping to achieve, but maybe it'd be enough for the step in compression improvements? I mean h264/h265 already do some pretty impressive and creative stuff already, so something along these lines isn't completely outlandish.

      2) Media compression (and to a lesser extent decompression) always battles the cost tradeoff, both the compute power required but also just the time it takes to do the compression. Because of the complexities of getting a codec adopted everywhere (especially in silicon like in a phone), there is a push to have general purpose codecs that can be used for both live and pre-recorded media, and the type of analysis that is done on each frame or on groups of frames is still driven by available computing power - even when you fiddle with the compression settings between "just get this done in realtime" to "take as long as you want and analyze each frame to death", the difference in time between the two extremes is usually just an order of magnitude or so - a big difference to be sure, but still kind of within the same realm.

      The point is just that the types of analysis done are impacted by whatever is considered to be, at that point in time, a feasible amount of computational work. When we think ahead to 5 or 6 orders of magnitude of more compute power (due to faster CPUs or harnessing gazillion-core GPUs or something else entirely), it's hard for us to really grasp what doors that opens in terms of analysis techniques - we can mostly think in terms of what we do today and how we could make that faster, but as we get there we'll also start to take on completely novel approaches simply because things that were too ludicrous to even consider before become worth exploring.

      For example, we are still at the dawn of GPU-based video compression (in the sense that right now GPUs can do some things to speed it up, but in /general/ they are just optimizing the CPU-based approach - we're at the early stages of actually have codecs designed for GPUs specifically. Given vastly more computing power, maybe we really could get to the point where the sample blocks mentioned above can be distilled down to equations that produce them (them or a good enough representation). When we get to a relatively absurd amount of compute time available when compressing the video, there's no telling what sorts of new ideas and techniques will be explored.

      Or maybe we'll take on a completely different approach to video altogether. I mean, what if we move away from trying to reproduce individual pictures and their pixels and instead start transmitting the scene info? A TV show has a limited number of sets, what if the 3d model/texture info for most everything was transmitted in advance so that during playback all of that information is expressed in a few identifiers and their 3d transforms (ditto for the actors at some point too). We already see this in simplified form in replays from video games - a replay of a minute of gameplay can recreate the scene with perfect fidelity, and yet it is tiny compared to wh

    9. Re:It's not a thing by guruevi · · Score: 1

      He would compress an entire full-length movie, supposedly in 8kB. It's physically impossible, entropy has a thing to say about it.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    10. Re:It's not a thing by The+Grim+Reefer · · Score: 1

      The problem is that the number of snippets you'd have to store is enormous

      Have you been to the movies lately? I'm not sure you'd need that many.

    11. Re:It's not a thing by Anonymous Coward · · Score: 1

      You're forgetting that it's 8kB ... plus the 370MB of data preloaded in the de/compression engine.

    12. Re:It's not a thing by guruevi · · Score: 1

      It's not "impossible", you could equally make a pointer to any place in an irrational number (eg. find the nth digit of Pi) but the pointer is going to have the same size as the result.

      Eg. you can find sla in Pi at position 91104776 and thus you could work back and forth the text. But encoding 3 bytes took ~3.5 bytes. Even if you were to compress that pointer (which can only be up to 2:1), your encoding is no better than what compression already exists.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    13. Re:It's not a thing by Anonymous Coward · · Score: 1

      It's pretty much impossible for anything interesting that you want to do. If the film is nothing but geometric shapes, that works brilliantly, but the more items you introduce that aren't reducible to a series of simple geometric shapes, the less efficiency you have and the larger the file is.

      You have the same problem with voxels, there's that stupid "unlimited detail" video from years back where somebody created a world with 3d voxels and it's been debunked since then. IIRC, Carmack even took a go at debunking it at one point. This technology has the same issue. There isn't enough RAM or disk space to store the necessary information for the range of things that wind up on video. If there were, you'd think that somebody else would have managed it in the decades since he died. There's been tremendous advances in image compression since then, but nobody has been able to do anything even remotely similar to that.

    14. Re:It's not a thing by mesterha · · Score: 1

      I think it's possible to give some reasonable lower bounds on lossy compression using a little bit of perceptual hand waving.

      The key is to generate a large set of "different" movies. These don't have to be good movies, but it needs to be obvious to a human that they are different. Let's take the top 5000 movies of all time and break them up into 10 second pieces. Assuming on average each movie is around 2 hours long this give one an alphabet of 3,600,000 symbols. Now I can use this alphabet to create a bunch of 2 hour movies by looking at all combinations of this alphabet. This gives me 3600000^720 movies (including properly edited versions of Pulp Fiction and Memento).

      Clearly a human can tell the difference between these movies, so any respectable lossy compression scheme needs to be able to uniquely identify them. Therefore the compression scheme better give a different file for each movie. This requires lg(3600000^(720))/8 bytes, so around 2kB.

      Notice that key is not so much the 5000 original movies but the size of the slices. Using 1 second slices makes it much harder for a human to follow, but one could still detect differences. In this case, the compressed files would need to be around 22kB. I think this is a lot bigger than Sloot was claiming and it's not close to including all the movies a human can easily differentiate.

      --

      Chris Mesterharm
    15. Re: It's not a thing by Anonymous Coward · · Score: 0

      Good thing we have pretty insane CPU speeds and core counts these days.

    16. Re:It's not a thing by ShanghaiBill · · Score: 1

      the amount of CPU required to encode and render it would be insane.

      GPU not CPU. GPUs are insanely powerful. The GPU in my laptop has 1024 cores running at 2GHz. That is 2 trillion ops per second.

    17. Re:It's not a thing by dbIII · · Score: 1

      Yeah, it's not completely insane to think that a really brilliant fractal technique could do some amazing lossy compression, but the amount of CPU required to encode and render it would be insane.

      We're kind of living with insane computing power in our phones these days.
      That said, it never seems to be enough for some things.

    18. Re: It's not a thing by Anonymous Coward · · Score: 2, Interesting

      https://rogerjohansson.blog/2008/12/07/genetic-programming-evolution-of-mona-lisa/

      You were saying?

    19. Re:It's not a thing by Anonymous Coward · · Score: 0

      Right, but a *common* 370 megabytes. Less than 400mb of data common to essentially all movies. I don't know whether it is real, but it is intriguing as fuck.

      Captcha: "casters". Yes, indeed!

    20. Re:It's not a thing by JMZero · · Score: 1

      It's not physically impossible - we can imagine a super AI (or huge team of humans) that could reconstruct a reasonable facsimile of a movie based on grainy thumbnails of the cast and a quick synopsis of the plot and shot framing (and with work you could fit that in 8k).

      But yeah.. with "fractals", not so much.

      --
      Let's not stir that bag of worms...
    21. Re:It's not a thing by Just+Some+Guy · · Score: 1

      Since you're basically just storing inputs to the equation, you can get very small compression.

      Oh, for sure. But it could be that you have to use a lot of digits of precision to describe the parameters that yield the results you want. Maybe render(3, 5, 23) gets you a blue/orange gradient across a rectangle, but render(3+1/2^193, 5+87/2^432,053, 23+5943/2^1,404,504,305) gives you a cat holding a sign saying "eat more chicken".

      --
      Dewey, what part of this looks like authorities should be involved?
    22. Re:It's not a thing by Anonymous Coward · · Score: 0

      You know that most modern video codecs use "snippets" called "GOP frames" and then calculate delta from there sometimes both forward and backward with motion estimation. Modern video delivery codecs throw away like 99% of the actual raw data right? This isn't supposed to be lossless is it?

    23. Re: It's not a thing by Anonymous Coward · · Score: 0

      Randomly selected from your HDD? If his generator was seeded with images from an anatomy textbook, probably all of them.

    24. Re:It's not a thing by Just+Some+Guy · · Score: 2

      An 8x8x24-bit rectangle has 10^462 possibilities, and fractals are hella sensitive to initial conditions. Finding an approximately match is going to be less painful than naive brute force searching, but it still laughs in the face of a little teraflop GPU.

      --
      Dewey, what part of this looks like authorities should be involved?
    25. Re: It's not a thing by Anonymous Coward · · Score: 0

      How does that post refute what I said? The final product is rather impressive from a programming point of view, but completely unacceptable in terms of compression technology. The final version bares only a basic resemblance to the original work.

      What's more, it's a relatively simple painting in terms of it being mostly one person sitting there. A motion picture is one of those dozens of times per second and of much higher detail. There's trains and people and dogs and trees with leaves. Suggesting that it's possible to do something like that in a reasonable amount of processing power is rather foolish.

      I'm sure if we applied a significant gaussian blur to an image it would cut the size on the disk substantially as well. That doesn't mean that it's an acceptable solution to the problem.

    26. Re:It's not a thing by Just+Some+Guy · · Score: 1

      There are hard limits to how much you can compress stuff, especially losslessly. It's always possible to construct an input to any particular algorithm that compresses to be larger than it started. It has to be that way! Suppose that you have an algorithm that can compress any number between 1 and 100 to the range of numbers between 1 and 10. That means that any number in the output [1..10] range maps to an average of 10 numbers in the input [1..100] range. If you see the output number "7", which input does it represent? 1? 23? 37? 42? Etc.

      --
      Dewey, what part of this looks like authorities should be involved?
    27. Re:It's not a thing by Anonymous Coward · · Score: 1

      A typical movie script has 8000-20000 words. Assuming you just had a look up table of the 256 most common words and the script used a very restricted vocabulary, you would struggle to fit just the script within 8kB, i.e. the words minus all of the specifics of who and how they are said, which is still well short of the audio track of a whole movie.

    28. Re:It's not a thing by Just+Some+Guy · · Score: 1

      Great answer, and I think you're likely right. I can at least imagine a movie that's shipped as a Blender file that gets rendered in realtime, or maybe a collection of object hashes ("you don't have cylinder #837 yet. Let me fetch it! Also standard longhair cat #294.") where you build of a local cache over time. That seems practical, or at least possible. But I agree it's not too similar to Sloot's idea once you move past a one-sentence description of the two ideas.

      --
      Dewey, what part of this looks like authorities should be involved?
    29. Re:It's not a thing by Just+Some+Guy · · Score: 1

      The problem is that I can't imagine a 370MB blob containing enough information to accurately render "Gone With The Wind", "Jaws", and "The Matrix". I just don't think they'd have much overlap in primitives.

      --
      Dewey, what part of this looks like authorities should be involved?
    30. Re:It's not a thing by Just+Some+Guy · · Score: 1

      In this case, the compressed files would need to be around 22kB.

      ...in addition to the 20TB worth of dictionary data that it would take to hold 5,000 4GB movie files to use as the snippet library. I could get outstanding compression if I ship a Redbox machine to your house and send you the symbol "Sharknado".

      I think you represent the idea reasonably well, but right now I'm looking at an Apple TV screensaver showing Tokyo at night. That looks nothing at all like anything else in the world, so it would have to go into the dictionary. Now, my right foot and yours might be similar enough that a picture of yours on a sandy beach would be a reasonable substitute for mine in the same setting. Maybe one chihuahua licking its balls while crouching on a red and black oriental rug, viewed from the front and illuminated by a carbon arc lamp 23 degrees to the right of the scene, looks the same as any other.

      But consider how freakishly good humans are at distinguishing faces. You can't compress "The Shining" by using snippets of Leonardio DiCaprio in place of Jack Nicholson because it wouldn't stand up to a moment's scrutiny. Even Googlebot would be able to quickly tell the difference these days. It's my suspicion that the lower bounds on that kind of lossy compression would still be freaking enormous, and that maybe you'd be able to save 2-3% of the total library size by cleverly reusing things like "picture of the moon" or "Caribbean beach, minus people, at noon on a clear day".

      --
      Dewey, what part of this looks like authorities should be involved?
    31. Re:It's not a thing by Just+Some+Guy · · Score: 1

      They get away with it because adjacent frames of a movie are likely to be substantially identical (modulo a little translation of portions of it). If I pick an arbitrary frame from "Apocalypse Now", I could probably compress the hell out of the next frame. I bet there wouldn't be much similarity between any frame from that movie and another from "Weekend at Bernie's", though.

      --
      Dewey, what part of this looks like authorities should be involved?
    32. Re:It's not a thing by Anonymous Coward · · Score: 0

      Or another possibly easier way to get a lower bound is to use what another poster above said: movie scripts are 8000-20000 words. It would be a struggle to compress that down to 8-10 kB. Assuming that the script includes all stage directions, emotional descriptions and location descriptions such that some amazing AI could recreate everything from a description, you would still be missing a lot of important things that separate good movies from bad movies like actor's styles, direction, and cinematography. Otherwise, one could assume every stage production is the same since they all use the same script.

    33. Re:It's not a thing by gl4ss · · Score: 1

      I guess the easiest way to explain it is that at some point, very quickly, your lookup index becomes as big as the thing it is indexing into, making the whole thing pointless on it's own.

      maybe your blocks are 16 bits. so you have all the different 16 bit blocks in a lookup table.. ...but you will have to index into it with 16 bits. if you have only half of the possible space represented then you can index to them with 15 bits. or maybe you have only 256 different kind of blocks and you can use 8 bits to index. this would do a very good compression but the data that can be compressed this way would be limited.

      making some hard limits on feasible lossless compression. it puts some limits on "it kind of resembles the original" lossy compression as well. there's a reason why you can't just zip a file over and over and over again you see.

      like, if you can live with the results you would get from that by carefully choosing your algos, like kkrieger guys, then it doesn't matter that much, but if you would ask them to add an arbitrary model into the game thats totally different then.

      --
      world was created 5 seconds before this post as it is.
    34. Re: It's not a thing by DoctorBit · · Score: 1

      He's using 50 polygons to render the final picture. By appearances, each polygon has roughly five vertexes. Each vertex needs an x and y position, so say two bytes total per vertex. That's 500 bytes for the whole picture, and the picture looks pretty grainy.

    35. Re: It's not a thing by Anonymous Coward · · Score: 1

      Uh .... 42 .... obviously.

    36. Re:It's not a thing by mesterha · · Score: 1

      ...in addition to the 20TB worth of dictionary data that it would take to hold 5,000 4GB movie files to use as the snippet library. I could get outstanding compression if I ship a Redbox machine to your house and send you the symbol "Sharknado".

      I tried to ignore the issue of having a complex decoder. However, looking at the math, the argument still works for a single 2 hour 250MB reference movie. (The number of reference movies only appears in the log2() factor.)

      Still there is a strong intuition that one can cheat with a huge decoder. However, even with a massive decoder, one can't escape the counting argument based on the number of distinguishable movies. I think the reason is probably a Shannon versus Kolmogorov distinction. Roughly, I can create enormously complex objects with high Kolmogorov complexity but represent them with a few symbols if my goal is just to select between them. This is similar to you Redbox example that stores a lot of "information" but only a small number of movies. A realistic compression solution would have to handle both issues.

      --

      Chris Mesterharm
    37. Re:It's not a thing by RespekMyAthorati · · Score: 1

      He would compress an entire full-length movie, supposedly in 8kB. It's physically impossible, entropy has a thing to say about it.

      That depends on:
      1. What the movie is: e.g. a time-lapse movie shot in the arctic would have so much coherence that compression would be trivial, and
      2. How much you care about the quality of the result.

    38. Re:It's not a thing by Anonymous Coward · · Score: 0

      You're describing JPEG photo compression.

      On a related note, MJPEG sucks for video.

      Either Sloot was not this idea, or it was way overhyped.

    39. Re: It's not a thing by Just+Some+Guy · · Score: 1

      Well, OK, I'll let you have that one.

      --
      Dewey, what part of this looks like authorities should be involved?
    40. Re:It's not a thing by KozmoStevnNaut · · Score: 1

      Well, Leo does look a bit like ol' Jack these days (three years ago): http://www.huffingtonpost.com/...

      --
      Eat the rich.
    41. Re:It's not a thing by wierd_w · · Score: 3, Interesting

      For any given number of any given complexity, there are adjacent numbers that have significantly lower complexity.

      I have toyed with the idea of using "indexing" between two points on a number line, where the two points are very low complexity, but the number you want is somewhere between-- With a high precision percentage of the now clearly defined space between the points on the line, we can skip over and start playback of the number we want, if we also state how many digits this number is.

      Say, the number we want is between 10^23 and ((10^23)+(10^10)). Those are both impossibly huge numbers, but they can be defined very easily because they are not complex. We will then say that the number we want is found 10% of the distance between those two numbers, and that it is 10^6 digits long.

      Just using some fun math with the natural logarithm, we can produce that number, from those modest ingredients.

      Video files are just very very large numbers. They could be found on the infinite number line in exactly the same way.

      A complex encoder that has broken the file to be searched into a series of smaller (and thus easier to compute/derive) numbers that get concatenated together, and has a working knowledge of number space for numbers of those data block sizes, could reduce hundreds of megabytes into a few kb of metadata. The decoder would have to compute very large numbers (or have very large numbers already stored statically, and just crawls along...) to initialize the playback, but it could easily do so using nothing but lots of RAM and brute force CPU.

      I have considered looking into creating a compression system of this type myself, which is why I find it kinda spooky that it would need a huge decoder.. because my theoretical system would need a huge decoder too.

    42. Re:It's not a thing by denzacar · · Score: 2

      Yeah, but what if you replaced everything with stick figures?
      Shark in Jaws could be just a single triangle. Hell... make that just two lines...

      Someone get me Spielberg on the line!
      I have an idea for a very low budget movie! It'll make back on its investments thousand-fold... million-fold!

      --
      Mit der Dummheit kämpfen Götter selbst vergebens
    43. Re: It's not a thing by Anonymous Coward · · Score: 0

      But who jerks off to anatomy textbooks?

    44. Re:It's not a thing by AmiMoJo · · Score: 1

      I can imagine companies like Netflix finding it worthwhile to throw some EC2 processing at the problem. The saving in bandwidth might be enough to offset the cost of the processing.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    45. Re:It's not a thing by Narcocide · · Score: 1

      I dunno, if you look a little bit at the way an old 1st person shooter game like Quake passes data back and forth between client and server, and which parts of the engine are actually just stored on the client, which are rendered on the fly from those rules, and what parts of the data actually are variables, then imagine that this hypothetical "Sloot" engine would only really need to ultimately display 2d data from a single camera angle, it seems really actually quite easy to imagine this would be totally possible at least within a certain range of styles of content - most classical anime and saturday morning cartoons, of that time, for example.

      It's pretty embarrassing for me to realize now how easy that might actually be. I wish I'd thought of it myself. If I'd thought of it back then though, I too probably would have also failed to foresee that people would be willing to kill to stop this from being released; anyone who already had investors and potential customers lined up to back development of or buy expensive licenses for some vastly inferior and less specialized compression technology, perhaps. (Not to point fingers really but uh... *cough* MPEG? *cough*)

    46. Re: It's not a thing by Anonymous Coward · · Score: 0

      So multiply the number of polygons by 20 to get a far better image. That's still 10k per frame. Compared to ~90k per frame for 1080p encoded with HEVC.

      You'll never get Baraka type quality, but it'llâ beat the HECK out of any video compression of comparable bitrate.

    47. Re:It's not a thing by yes-but-no · · Score: 1

      You can compress a 10 hour video in youtube to say 12 bytes ..by giving the youtube-video-id.

    48. Re:It's not a thing by Waccoon · · Score: 1

      For instance, Zstandard [github.com] lets you precompute a dictionary of common strings you want to shorten. Imagine if you trained it on HTML so that each tag or other common string just takes a few bits, then you can distribute that dictionary to the whole world so that you can save the bandwidth of transmitting it alongside the compressed data each and every time (like we do with Zip, Gzip, etc.).

      Also known as Google Brotli.

      I can't say it (or the technique in general) is a good thing.

    49. Re:It's not a thing by Rockoon · · Score: 1

      The problem is that the number of snippets you'd have to store is enormous, and that each snippet itself is going to be a ginormous number (like the bits of the string of bytes in that snippet).

      ..that problem is that you dont know anything about data compression.

      --
      "His name was James Damore."
    50. Re:It's not a thing by Rockoon · · Score: 4, Interesting

      An 8x8x24-bit rectangle has 10^462 possibilities

      ..and a tiny fraction of that is interesting, and those that are interesting are so because they arent random noise.

      I can encode 100% of the random noise possibilities in only a few bits driving an LFSR... and you wont be able to tell the difference.

      Repeat after me: "Everything I learned about the pigeon hole paradox is true, but I didnt actually understand any of it"

      This is not a pigeon hole issue because nobody is pretending that these compression technologies do equally well with every kind of data, and even the kind of data (noise for instance, which is traditionally hard to compress) doesn't apply because these compressors are lossy such that random data can be approximated by literally any of a large collection of simple random functions.

      You cant tell the difference because if you could then it would have been compressible to begin with. You can only tell when the coherence is amiss, but its the coherence that is "accurately" compressible.. its only the incoherent stuff.. aka noise.. that is hard to encode accurately in only a few bits

      We arent even close to the limits of lossy compression. as its just a time/space trade-off.

      --
      "His name was James Damore."
    51. Re:It's not a thing by Rockoon · · Score: 1

      Nah that doesnt work, at all.

      Every model can be used to encode a message to its entropy..... but "its entropy" means "the messages entropy... given the model"... it does not mean "the messages entropy given the best possible model"

      The problem with your proposed model is that is has nothing to do with the data. Its simply a trick that you expect will work... but it wont.

      --
      "His name was James Damore."
    52. Re:It's not a thing by Anonymous Coward · · Score: 0

      It's probably just a .torrent file, that would be about 8kB

    53. Re:It's not a thing by Anonymous Coward · · Score: 0

      That's how MPEG compression works. There's a reference frame which is a full frame of data. Compression is achieved by splitting each frame into tiles, and then reusing tiles from one frame to another. They might move around a bit or not change at all. Then only those areas of the frame which haven't changed need to be replaced. That allows several hours of video to be compressed onto 4 Gigabytes of a DVD.

    54. Re:It's not a thing by wierd_w · · Score: 1

      I am aware of entropy-- I have fought with it before. Highly complex numbers have very high entropy, which is the problem.

      The deal is, we are not really encoding the whole number this way, just it's beginning point on the line. We dont even really need to state how long the file is in many cases, as the actual file data itself will ALSO contain that. (Say for instance, with RIFF container formatted AVI.) We just need enough bytes to derive the header, the header itself then tells the decoder how many more bytes to read. That is complexity "for free."

      The issue with the entropy is that the definition data for it (data to encode position on the line) is equal to or greater than the data itself.

      EG, the number 10 is always 10 away from 0.

      The deal, is that this assumes the full infinite number line is the search space for the decoder. It is not in this case. The search space is the distance between the two indexes. The distance between 1234578900 and 12345678901, is 1.

      You offset a good deal of entropy into the DECODER, which is WHY the decoder is so flipping huge.

      You cannot dispose of the entropy, you DO have to keep it in order to get the number again. You just keep an optimized sample of entropy stored in the decoder, and only keep the necessary amounts of it in the sample to be decoded.

    55. Re:It's not a thing by wierd_w · · Score: 1

      Here, as a contrived example--

      Say my decoder stores the full value of all of the first 100,000 prime numbers. Prime numbers have a lot of entropy-- they are complex numbers that cannot be broken down easily, so this is perfect for the example. My decoder is fucking huge.

      My input file contains a comma delimited list of simple integers. The integers in the file represent an index in the array stored in the decoder.

      I feed it a potentially very small integer, the decoder spits back a large prime number. The decoder does not compute the prime number, it is already computed, and stored in the really big array.

      We can do something similar with a data chunk. We use some mechanism to index a solution space, then hard store those index values inside the decoder. The number we actually want back is some distance between the two values stored in the decoder's array. We feed the decoder the two indexes (which will be small), and the percentage between the two. The decoder looks up the indexes, then does the dirty work. It then returns the number you wanted.

      The decoder has static data. Its indexes never change. It is bloatedly huge, but it has to be. The file you share is very teeny. You need both derive the actual data you want.

    56. Re:It's not a thing by jafiwam · · Score: 1

      It could be done without static data with different types of numbers. (Not primes.) There are a couple of irrational numbers that go on forever after the decimal point (Pi being the best known) that don't fall into infinite repeating ever. The implication being that any given string of numbers of arbitrary length, no matter how long, exists in Pi. (I read a proof of this somewhere maybe here about two decades ago.)

      There is math that can determine what any given decimal is even at points farther beyond where people have "calculated the value X number of digits for Pi". So, using these numbers, you assume you can find the data set you want on the compression end, decompression could be done by getting a list of sections of Pi decimals. "Start here, end there" Calculate what's in between.

      Given some start points, and then a "what's different about this frame" type compression you would have a number of more or less fixed-size data sets.

      The file would consist of say 1000 reference frames of the movie and then data sets that tell you where to look in Pi to get the rest.

      The decoder can do a bunch of math and it takes Pi or other irrational numbers and finds the sections of decimals by calculating them, and then inserts the resulting data into the movie file.

      You end up with a math-heavy decoder... and an absolute nightmare of a sorting job to match arbitrary and unknown places in the decimal list of Pi at the compression end.

    57. Re:It's not a thing by Anonymous Coward · · Score: 0

      But what size blob does? Hop to 2020. We all have a 1TB blob chip as ingredients for every frame you'll ever need build into a device.
      You can transmit the tiny recipe and the next next gen gpu snap dragon nimbus 2000 builds the image as required. WE all get arrested for copyright infringement for having every film ever; but thats ok.
       

    58. Re: It's not a thing by dunkelfalke · · Score: 1

      You forgot the colour so it is more like 650 bytes.

      --
      "It's such a fine line between stupid and clever" -- David St. Hubbins, Spinal Tap
    59. Re:It's not a thing by k.a.f. · · Score: 1

      It's a sad day indeed when you have to explain the pigeonhole principle to the readers of Slashdot, of all places.

    60. Re:It's not a thing by Anonymous Coward · · Score: 4, Informative

      It doesn't work. I've written some papers on this.

      Basically, every set of easily-compressible numbers has the property that the next one gets farther and farther away at exactly the rate needed to ensure that the offsets from one to any other natural number require (in total) exactly the same number of bits as you'd need to just write those numbers out yourself.

      Proof:
      1) Suppose you could represent any natural n in [0,max] as c(n).
      2) Then you obviously need to be able to decompress any c(n) to the corresponding n. Hence, c is bijective.
      3) By (2) c(n) is just a permutation of [0,max].
      5) As is obvious since (3) is a permutation, the total compressed size of the output range is equal to the total compressed size of the input range: Information content is conserved by permutations (Pigeonhole principle)

      Now, your particular method sets c(n) = (k,o), where k is the nearest compressible number, and o is the offset, perhaps also compressed. It doesn't actually matter what compressor functions you use, the result is general:

      The proportion of strings in your domain which compress down to less than m bits remains unchanged. In other words, there are 2^m strings in m bits, and the proportion in [0,max] is just (2^m)/max.

      Set max = 2^(1000000000000), or the number of terabyte-sized files. Now just evaluate (2^m)/(2^1000000000000) and you'll quickly see why your compression algorithm doesn't work. It'll work for a vanishingly small number of bitstrings, just like how we can represent 10^23 using only 5 symbols, but as you go further and further out from those easily-compressible numbers, the offsets themselves become uncompressible large numbers!

      --
      - Oskar Lidelson

    61. Re:It's not a thing by ctilsie242 · · Score: 1

      With some work, I can write a script for a full length, American "blockbuster" movie in less than 1k. Example:

      Setting: Mega City One.
      Protagonists: Superman, Catwoman.
      Antagonists: Lex Luthor, Lobo.
      Time of Movie: 2:00:00.00
      Quotes: "I only learned math so I can do body counts."
      Ending: Good.
      Git level: Medium.
      Saturation level: Gloomy

      From there, a program can grab the models for the heroes and villains, render the setting, generate dialog, generate a soundtrack, put the quotes at a punctual moment, and end the movie on a selected happy/sad ending. Saturation and "grit" levels are how the movie looks, as well as how much bad stuff happens to the heroes in the meantime, all randomly generated.

      I can even reduce it even more to a sentence, "Modern comic book hero movie", and let a RNG crank out another multi-million dollar Hollywood blockbuster candidate. Less than 64 bytes used, easily generating terabytes of footage data.

    62. Re:It's not a thing by Rockoon · · Score: 1

      I feed it a potentially very small integer, the decoder spits back a large prime number. The decoder does not compute the prime number, it is already computed, and stored in the really big array.

      Sigh... if you are storing stuff in the decompressor that is needed to decompress a specific data... then just store the data instead.

      You have added layers of complexity here to hide the fact that you dont know what you are talking about... but its not me that you are hiding that fact from... you are hiding that fact from.... you.

      --
      "His name was James Damore."
    63. Re:It's not a thing by Anonymous Coward · · Score: 0

      That's not a very intelligent analysis of what this is discussing at all. In fact no modern video codec does that aside from MJPEG.

    64. Re:It's not a thing by Anonymous Coward · · Score: 0

      Go back to math class, squid.

      Seriously, anyone and everyone who has even taken a computer science class has thought about the exact same thing. Seriously, everyone. If it worked, it would exist right now.

    65. Re:It's not a thing by jimbolauski · · Score: 1

      It's a one time cost for Netflix to encode content, as long as decoding could be done on a box that's less then $100 there would be little reason for Netflix not to use a codec scheme like this.

      --
      Knowledge = Power
      P= W/t
      t=Money
      Money = Work/Knowledge so the less you know the more you make
    66. Re:It's not a thing by Anonymous Coward · · Score: 0

      That's just 10^23 + 10^9, still very low complexity. To be useful, you need to describe any point, say at 12.34567891% of the distance, and I don't see how that will be any easier with this system.

    67. Re:It's not a thing by Just+Some+Guy · · Score: 1

      ..and a tiny fraction of that is interesting, and those that are interesting are so because they arent random noise.

      Yes, but it's still going to be ludicrously resource expensive to explore enough of a math space to find a suitable replacement for a given chunk of image. And given the example of fractal compression, you'll have to explore a fair amount of that space because the output is so sensitive to initial conditions. It'd be nice if you could say "this part of the fractal looks kind of like what I'm looking for, so it's zoom in and it'll be even better", but it's more likely that zooming in will make it worse instead.

      --
      Dewey, what part of this looks like authorities should be involved?
    68. Re:It's not a thing by Anonymous Coward · · Score: 0

      This is more or less how most video and image compression already works. You take chunks of the screen, and try to simplify it in the way least visible to humans. Better compression has better methods of ignoring details that we presumably don't care about. It's always a trade-off though. Eventually you get to the point where the video/image quality is unacceptable.

      There are two big problems with your suggestion though, with treating the files as big numbers and finding ways to calculate them. If you aim for lossless compression, then we run up against complexity theory and it's provable that no lossless compression scheme can compress every input value. The pigeon-hole principle kicks in, and you'll find that for a certain inputs, it takes more space to store the compressed version than the uncompressed. This is why you don't gain anything by double-zipping a file - all the low-hanging redundancy has already been removed.

      If you want to go with a lossy scheme, then you can treat the file as just a number. You'll just end up with a corrupted file when you decompress it. The compression scheme has to treat the file as a representation, and make simplifications that take into account human perception and file formats.

    69. Re:It's not a thing by Verdatum · · Score: 1

      They are, but think about GPUs in the late '90s, when this codec was finalized. I don't think there were any real-time rendered fractals going on in my copy of Unreal Tournament. You could have 2 GPU cores if you really wanted to spend the money, but as I understand it, the drivers weren't really out there to make particularly efficient use of them beyond simple stuff like allowing multiple monitors (I say "simple", that's a laugh, multiple monitor support was atrocious on Windows98, and it took me hours to get it working as expected on Redhat 6; I vaguely recall needing to compile a custom kernel, but don't quote me on that).

    70. Re:It's not a thing by Just+Some+Guy · · Score: 1

      What part of that was wrong?

      --
      Dewey, what part of this looks like authorities should be involved?
    71. Re:It's not a thing by omnichad · · Score: 1

      You can't compress "The Shining" by using snippets of Leonardio DiCaprio in place of Jack Nicholson because it wouldn't stand up to a moment's scrutiny

      For that matter, you can't reproduce Peter Cushing's face using an extremely accurate 3D model. The uncanny valley is wide.

    72. Re:It's not a thing by Just+Some+Guy · · Score: 1

      Absolutely. I think people underestimate how perfectly you need to represent anything human-related for it to be plausible. You can't handwave away "actor, male, mid-20s" and hope the details are fungible.

      --
      Dewey, what part of this looks like authorities should be involved?
    73. Re:It's not a thing by tepples · · Score: 1

      Vectors vs. bitmaps, which compresses better?

      I agree with you when trying to replace something like SWF with a format built on free software. But for live action, good luck turning a sequence of bitmaps received through a camera lens into an efficient vector approximation in reasonable time.

    74. Re:It's not a thing by tepples · · Score: 1

      Yeah, but what if you replaced everything with stick figures?

      We could even call it Cyanide & Happiness: The Movie.

    75. Re:It's not a thing by Anonymous Coward · · Score: 0

      Maybe this is an application for quantum computers, as quickly exploring huge spaces is supposed to be one of their string suits.

    76. Re:It's not a thing by Rockoon · · Score: 1

      Yes, but it's still going to be ludicrously resource expensive to explore enough of a math space to find a suitable replacement for a given chunk of image.

      But thats exactly what modern video codecs do. A frames context is the frames that came before it, and it is this context that is explored with numerous different kinds of views (aka models) .. fractal compression just uses a different context (generally IFS .. same technique as produced those old classic "ferns")

      Didnt you know? You are acting like an expert... surely you knew? Have I hammered down on you on this very point about knowledge before? You are acting like an expert, but clearly you are not. What happened to honesty? Have I called you a fucking liar before? I'm going to call you one now.

      --
      "His name was James Damore."
    77. Re:It's not a thing by Anonymous Coward · · Score: 0

      "Robin Hood gallops in stage left in a green shirt and hat, bow at the ready, on a black steed."
      Thats only 40ish bytes. I lossy compressed a 5 second scene from the movie. Imagine what I could do with 2 KB.

    78. Re:It's not a thing by angel'o'sphere · · Score: 1

      Yes, but it's still going to be ludicrously resource expensive to explore enough of a math space to find a suitable replacement for a given chunk of image.
      Actually not. Think about furrier transformation.

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    79. Re:It's not a thing by Anonymous Coward · · Score: 0

      Isnt this what they call sprites? Not a new concept! Early computers had limited resources so games were programmed this way

    80. Re:It's not a thing by angel'o'sphere · · Score: 1

      You should not throw around words, where you don't know the meaning off.
      E.g. compressing 90 minutes of a blue screen into a few bytes is pretty simple ...

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    81. Re:It's not a thing by Just+Some+Guy · · Score: 1

      Calm down and have a cookie, son. There's no need to get whipped up over this.

      --
      Dewey, what part of this looks like authorities should be involved?
    82. Re:It's not a thing by HornWumpus · · Score: 1

      Who knew AsciiWars was such an insanely efficient video compression job?

      --
      John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
    83. Re:It's not a thing by angel'o'sphere · · Score: 0

      We are not talking about numbers but about graphical informational, aka pictures.
      No one is treating a movie as sequence of numbers.
      I suggest to read how jpeg and mpeg compression works.
      Then write a paper again ;)

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    84. Re: It's not a thing by Anonymous Coward · · Score: 0

      Nice!!! Was waiting for someone to say that.

    85. Re:It's not a thing by Anonymous Coward · · Score: 0

      As new people enter culture, it is the responsibility of said culture to ensure that its knowledge is passed to the next generation, or it will be lost.

      More positively, you can see people who don't know what the pigeonhole principle is as part of today's lucky 10,000. https://xkcd.com/1053/

      Which do you think will improve /. more?

    86. Re:It's not a thing by Anonymous Coward · · Score: 0

      I don't have a background in computer science, but I always wondered, why wouldn't it be possible to compress things using some sort of table of large prime numbers?

      Table-based compression is well-known. What filling the table with prime numbers would bring to the table eludes me.

      It would be interesting if it could be done without the table on the compression side, and just be computationally intensive. Then transmit the compressed file. Then have a computationally intensive decompression that references a gigantic table.

      This is a classic time/space engineering trade-off. The idea is that trading one for the other, or vice versa, is actually worth it. It depends very much on the application which way your engineer will lean.

      For an example you might look up "rzip". It's a front-end for bzip2 with a much bigger dictionary window, making compression and decompression both more effective and much more costly compared to bzip2.

      It always made me wonder if there was another way of doing things.

      If you find it, you'll know.

      For a more lengthy answer, go do an "algorithms 101". The part on complexity of algorithms is insightful but the part on whether P = NP and what it means that we don't know whether it is so or not so, is the pertinent part here. No, we don't have a general solution to NP-complete problems. Yes, we can get by with throwing cleverness at individual problems.

      The presumption is that Sloot-compression did something like what you're hinting at. Maybe "fractal compression", which is a thing, tries this at some level. But then again the DCT that backs JPEG does something similar on a lower level, though it's more geared to selecting which bits to throw away and not suffer too much. See also the somewhat more modern wavelet compression.

      Me, I'm fairly convinced (on no personal experience, but second-hand accounts) that a decent video encoder can be made a lot more efficient if you first filter out the worst of the noise, and that's something that's relatively easy to do, if not particularly clever or ground-breaking.

    87. Re:It's not a thing by demonlapin · · Score: 1

      Well, my phone today blows away all kinds of serious desktops from a while back... but try running a deep fractal rendering program at 60 fps even today.

    88. Re: It's not a thing by Anonymous Coward · · Score: 0

      Yes, it transformed my hairless grump cat into a "Grump Cat" lookalike with hair. It made my cat much... furrier. Ba dum bum. Thanks folks, I'll be here all weekend.

    89. Re:It's not a thing by demonlapin · · Score: 1

      Yeah, I think it's the render side that kills this.

    90. Re:It's not a thing by evilviper · · Score: 1

      We arent even close to the limits of lossy compression

      Completely wrong.

      MPEG-2 video and audio come very close to the limits of accurate lossy compression... Specifically, much-higher compression, that does not discard perceptible information (as shown by professional ABX testing) is not possible. This has been matemtatically proven, the term is "Perceptual Entropy" and it sets the upper limits based on physiological models of human senses, established in the late 80s/early 90s most significantly by J.D Johnston of ATT Bell Labs.

      That's why nobody is trying to improve upon perfect lossless compression anymore. Instead newer codecs willfully discard perceptible information, but try to generate an output that just looks/sounds "good" but is easily distinguished from the source. That's why only MUSHRA tests are popular today, modern codecs (at common/recomended bitrates) would completely fail professional ABX testing.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    91. Re:It's not a thing by Anonymous Coward · · Score: 0

      100 bytes is sufficient to uniquely identify every image that has been or ever will be seen by a human.

      It's the mapping algorithm that's a bit tricky.

    92. Re:It's not a thing by guruevi · · Score: 1

      But it's not the movie then, you could encode a lot in the movie, eg. Google Street View locations etc to save on space but it still wouldn't be the movie as intended. Good enough for things like demonstrations (eg. Flash used similar methods) and enhanced reality but we're talking about true compression of random video into the format.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    93. Re:It's not a thing by guruevi · · Score: 1

      And if you fed that blue screen through a DAC, a few noise generators and bandwidth filters and then through an ADC again, would it still compress to that?

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    94. Re:It's not a thing by JMZero · · Score: 1

      I'm not sure I understand your complaint here. Are you saying it work well enough (1), that it wouldn't be exactly the same (2), or that it wouldn't operate on all possible inputs (3)?

      On 1, there's no reason the output couldn't be much higher quality than the original. 2 seems likely to be true, but also a shallow complaint; the result could be very, very similar (I mean, who knows how much data it would take to encapsulate Michael Bay's directing preferences, but there's no hard reason to believe it's a big number) - or, again, better.

      On 3, again you're trivially correct but it's not a fair complaint. If you're working against the set of "all possible arrays of pixels", then any significant compression is impossible. And yet video compression is useful technology, because the set of videos we're interested in compressing is an infinitesimal subset of possible videos.

      My point was that for a realistic movie and realistic constraints on performance - ie. the normal parameters we'd expect such software to operate in - it's difficult to set hard limits on possible performance. Information theory doesn't provide us any kind of permanent objection (though, obviously, again, I don't expect that this software actually worked, or that any such software could feasibly work via "fractals", or any kind of pixel/image centric processing... it would need to be amazing, very content aware AI to actually get this performance).

      --
      Let's not stir that bag of worms...
    95. Re:It's not a thing by guruevi · · Score: 1

      Encoding something with that method works well enough for very short, well-defined animations. Even that would require more than a few kB but it's technically feasible to procedurally generate something using external sources. But even then, the definitions themselves would be both very precise and vary from system to system (eg. like MIDI or MOD files depend on what you have in your local instrument bank). You'd also have to rely on downloading immense amounts of data from various sources, so you're not really compressing anything, you're just re-distributing the final delivery.

      It indeed would not work on all possible inputs unless your search space is capable of encompassing every single piece of data ever collected. Which, as above, just moves the "compression" job somewhere else and highly depends on the availability of resources. Both 1 and 2 are results from the exact same argument.

      With 3, the problem is any pointer to the data you're trying to "encode" will be equal to or greater than the data you're trying to compress. Even with the ~2000 exabytes of data we currently have "stored" in the world, the pointer to any point in it would be immensely huge. A single pointer to any segment of it would be several terabytes large.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    96. Re:It's not a thing by JMZero · · Score: 1

      Encoding something with that method works well enough for very short, well-defined animations.

      Sorry, what are you imagining "that method", to be? What I'm suggesting is very high level descriptions of content - the level of a movie shooting script, along with information on cast/etc, but in a form designed by an AI to communicate efficiently with itself. The AI would use this to reconstruct the movie from scratch. To make it match the original as much as possible, it could imagine make ordered lists of each parameter, ordered by some measure of likelihood, and then store only these indexes into that huge multidimensional universe.

      Anyway, I don't know on what basis you've decided that would only work for things that are short or well defined. To be clear, none of this is my crazy theory or something; this is the normal sort of thing people talk about when they talk about the frontiers of compression. Like, to compress Wikipedia beyond X, you need a powerful AI that starts doing work in this sort of way, and I'm sure you can find lots of speculative articles about how this might work. None of it is realistic now, but becomes very realistic in a world with AI that greatly surpasses human intelligence.

      Even with the ~2000 exabytes of data we currently have "stored" in the world, the pointer to any point in it would be immensely huge

      What are you talking about? A pointer to distinguish any bit within that 2000 exabytes of data is still going to be quite small - pointer size is logarithmic to the size of the set. Heck, I could reference any atom in the universe with ~100 bytes of data. But our compressed movie file wouldn't need to be anywhere near that extravagant. There are only so many white tops the actress would likely be wearing, etc...

      --
      Let's not stir that bag of worms...
    97. Re:It's not a thing by angel'o'sphere · · Score: 1

      What has your question to do with my statement?
      {
          mins = 90;
          framesPerSec = 24;
          colour = x005500;
      }

      That is a compressed light blue screen for 90 minutes with a frame rate of 24 frames per second. In human readable form even. If you code it binary it is 5 bytes ...

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
  4. Yes, it is techically possible... by Anonymous Coward · · Score: 0, Troll

    In the same way that millennials are able to compress two or more genders into a single body.

    1. Re:Yes, it is techically possible... by Anonymous Coward · · Score: 0

      Go die, alone. Now.
      (Silly me, of course you'll be alone.)

    2. Re:Yes, it is techically possible... by Anonymous Coward · · Score: 0

      Not if they have 2 genders in the same body! Now we'll never be alone!

    3. Re: Yes, it is techically possible... by Anonymous Coward · · Score: 0

      nah, that's just virtual, am male today, in a few months or years, am female

  5. Of course it didn't work by Anonymous Coward · · Score: 2, Insightful

    Information theory stablishes what is really possible and that kind of compression is simply to much, even considering its lossy nature. Infomation entropy has very real limits no matter the encoder used in the data compression.

    1. Re:Of course it didn't work by dbrueck · · Score: 4, Interesting

      Yeah, I don't think his work would have panned out either, but in theory the idea isn't totally implausible, in part because general purpose media compression is a bit of a special case in that the goal is not to try to have lossless compression, due to the way human audio and visual systems work.

      When you look at compression on individual frames, there is a huge amount of acceptable error (differences versus the original) that isn't even discernible when you're looking at still frames side by side, and then when you're playing back the frames at a rate of 30 or 60 or more per second, there's a whole *additional* layer of imperceptible error that can be introduced. And then on top of *that* you can introduce more and more error that is in fact discernible on some level, but it becomes a game of tradeoffs of what you can get away with vs the various costs involved.

      Another way to put it is that this isn't just lossy compression, it's lossy compression that takes advantage of quirks in the way human eyes and ears work, and so the limits of compression potentially go far beyond what you might expect in a strict information theory sense (there are still limits of course, they just might be a lot further out than you might expect).

      On some level it's kinda like how paintings work - you see a barn with a tree next to it in the shade, and yet when you look at it really close you realize that it's just a couple of strokes of paint - your brain perceived far more detail than is actually there. To be clear, I'm not saying this is how media codecs work, just saying that the goal in media compression isn't necessarily to accurately reproduce the source material, rather the goal is to get people's brains to *perceive* that the final version is an accurate reproduction, and in practice that opens up a lot of possibilities.

    2. Re:Of course it didn't work by ezdiy · · Score: 2

      > Information theory stablishes what is really possible

      Actually, information theory states the opposite. Determining entropy of unknown source is an intractable problem, and you can't generally state amount of entropy for piece of data unless you're certain it's a quantum pink noise beforehand, all we know that the better the compressor, the closer you get. That's why one time pads use truly random codebooks, not a PRNG (PRNG has very little entropy - that of PRNG seed).

      While extremely important as an output filter, just an entropy encoder doesn't compression make.

    3. Re:Of course it didn't work by Immerman · · Score: 2

      Indeed - our brains are incredibly good at filing in "missing" detail - a rough suggestion of something is often enough to "see" incredible detail.

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
    4. Re:Of course it didn't work by Anonymous Coward · · Score: 0

      Well, if you take the hash approach to compression you get a pretty good idea of what would be the theoretical entropy limit of a dataset, given unlimited processing power.
      So yeah, intractable, but not a complete unknown.

    5. Re:Of course it didn't work by dbrueck · · Score: 2

      Yes, and it's some really neat stuff - research talked about in these articles is fascinating:

      https://www.sciencedaily.com/r...
      https://medicalxpress.com/news...

      And here's a site that lets you experience just one of the ways the brain manufactures some of the detail you perceive: http://www.uniformillusion.com...

      A common theme in all of these is that "sight" isn't entirely achieved with our eyes, but our brains get involved very early in the signal processing stage and even make up a lot of info based on what it expects.

      Who knows how much of this can be practically applied to things like video compression, but we've already been doing it to a limited degree for years (e.g. x264's psychovisual enhancement setting, see also https://en.wikipedia.org/wiki/...).

    6. Re:Of course it didn't work by Anonymous Coward · · Score: 0

      Actually, information theory states the opposite.

      The problem is real video has lots of noise it and things that look like noise don't compress well. They can't, as they appear to be random. Now if you can prove the noise really isn't random and can be modelled by a simple algorithm, then maybe you can get closer.

      Video compression, afaik, is a time/space tradeoff problem. Sure if you put say the entire set of frames in memory then apply every fancy compression algorithm you can do more, but compression time goes way up and decompression is a mess too.

      That being said, it may be worth it, particularly for mobile video. You might have to download most of it before you could decode it to play it. The benefit is your total bytes downloaded is reduced.

      Basically you would have a web video (or whatever) have frame N depend on the set of frames 0..N-1. Of course, if your not careful you start wasting bits to handle all the possible offsets. This would effectively take a lot of bandwidth to start up but less as time went on.

      Of course it would also make seeking impossible, since you would have to load up every frame before your current frame, and that would likely be intolerable.

    7. Re:Of course it didn't work by Wycliffe · · Score: 1

      The problem is real video has lots of noise it and things that look like noise don't compress well. They can't, as they appear to be random.

      But that can work to your advantage. If you can determine what is random noise and what is not, you can just drop the random noise and replace it with generated noise. It's like the difference between compressing a picture of a boat and a digital representation of a boat without all the noise. If you can get a "good enough" virtual represetation of the boat, it still looks like a boat to the human eye but it's missing all the noise. It might look a little too crisp to the human eye but you can add back the randomness during the final processing. They do something similar with a lot of movies today where the grittiness is added at the end to make it look more realistic.

    8. Re:Of course it didn't work by Anonymous Coward · · Score: 3, Interesting

      This is how literature works... a short story is in effect a full movie highly compressed down to a few kilobytes. 'Reading' is decompressing the data.

    9. Re:Of course it didn't work by Anonymous Coward · · Score: 0

      It's mostly used in CGI, to make things look more realistic and less digitally perfect

    10. Re:Of course it didn't work by RespekMyAthorati · · Score: 1

      I can make a 1000 hour HD movie that compresses to less than one byte: just leave the lens cap on.

    11. Re:Of course it didn't work by Anonymous Coward · · Score: 0

      The brain doesn't always even fully perceive repeated patterns, which is why it is hard to find a small static object on a patterned carpet - you may literally be unable to perceive it.

    12. Re:Of course it didn't work by Rockoon · · Score: 1

      Information theory stablishes what is really possible

      Information theory has very little to do with lossy compression. Lossy compression deals with the specific limits of human perception. The output image only has to be like the input image, not a copy of it. Coherent parts are given an equivalent but easier to encode coherence, and random parts are just labeled by its distribution. Encoding "television snow" is virtually free.

      Information is encoded in a way that Shannon would approve... entropy compression... but thats not where lossy compression gets its wins.

      The kings of non-lossy entropy compression models right now are two strategies.. one is a prediction by partial matching strategy, the other a context mixing strategy tuned by neural networks. These things can get english language down to just above 1 bit per byte. About 8:1. Thats it. Thats where entropy compression gets you in the realm of the coherence that humans care about.

      Lossy then gets you from that ~8:1 range well into the 100:1 and 1000:1 range...

      So stop trying to apply Shannon thinking to the problem as some sort of guide for the limits of this. This isnt a Shannon problem. Shannon was concerned with getting the same signal out that came in. These lossy compression schemes have moved way beyond that.

      --
      "His name was James Damore."
    13. Re:Of course it didn't work by Wycliffe · · Score: 1

      It's mostly used in CGI, to make things look more realistic and less digitally perfect

      Yes, and I would bet money that the before video can be compressed at a higher rate than the after. If we could do this in reverse and make the real movies more digitally perfect then it should be possible to compress them at a much higher rate.

    14. Re:Of course it didn't work by Anonymous Coward · · Score: 0

      Well, just remember that lossy compression is a subset of information theory. Not a completely different knowledge domain.

      Also, the concept of information entropy isn't limited to Arithmetic encoding or anything related. It's about information density. You should read more about it.

    15. Re:Of course it didn't work by Arnold+Reinhold · · Score: 1

      It's not inconceivable to build a system that feeds a movie script, perhaps with more detailed stage directions and some background and avitar art work model files, into a video game engine with high quality text to speech and get a movie. Performances might be somewhat different each time it ran, much like live theater. The effective compression would be extremely high. Conceivably such a system could perform any play ever written, with an interface that allowed a user to act as director, maybe creating a cast from an avitar library of the great actors of history and tuning the performance to their liking. The original script plus directorial hints file would still be very small.

    16. Re:Of course it didn't work by dbrueck · · Score: 1

      Yup, in another subthread here I mentioned "replay" files from video games, and those are kind of a halfway step towards what you're talking about, and those files are very small compared to what a video stream of the gameplay would require.

      You could probably start with cartoon TV shows and then, as the technology improved, eventually use it for more real-world shows. Interestingly, cartoons these days are already most of the way there since they are typically drawn and animated on computers anyway, which means that the individual assets and their animation instructions are already in some form that would support this model. And more and more news shows use virtual and/or augmented reality studios now too.

      Content producers wouldn't likely go this route just to achieve lower data transfer rates, but some of the other features it enables might be enough to get them to adopt it. For example, it could lead to much more interactive content that blurs the lines between a show and a game, or individualized shows (e.g. the characters use your kid's name, or your avatar gets to be in the show, etc.), and for sports it'd give fans complete control over camera angles and replays.

    17. Re:Of course it didn't work by Anonymous Coward · · Score: 0

      Take an action-packed movie. Then the number of significantly different frames should be (much) greater than 1 per second. If 8000 bytes can cover 120 minutes or 7200 frames, then you have only 1 byte to cover a frame. It follows that the movie has at most 2^8=256 frames you can distinguish from each other, which is in contradiction with the original claim.

    18. Re:Of course it didn't work by Verdatum · · Score: 1

      That's be a really weird codec to get that down to one byte. I'd recommend using a couple more than that.

    19. Re:Of course it didn't work by dbrueck · · Score: 1

      Haha, the whole reason I started my comment with agreeing that his claims don't hold up is to avoid responses like yours.

      Do I believe the claim that any arbitrary movie could be compressed (with any reasonable degree of fidelity) into 8kb using his techniques? No.
      Does that mean that the whole set of ideas that he was exploring are completely without merit? No. Breakthroughs and major innovation often come from toying around with seemingly crazy ideas and looking at a problem space from a very different angle.

      You did enough thinking to get to the point of noticing an obvious problem. Good job. Now you can either get hung up on that and stop thinking, or you can say, "ok, that doesn't quite work, but what if..." and try to build on some of those ideas to see - who knows, you might stumble upon a really creative and innovative breakthrough.

    20. Re:Of course it didn't work by angel'o'sphere · · Score: 1

      I agree.
      A 90 minutes movie consists of 90 * 60 * 24 frames (129600).
      8kB is roughly 8000 bytes. The number of bytes is only a fraction of the frames.
      I guess there is an error somewhere in the story and he was perhaps able to compress every frame intoâ about 8kB?

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    21. Re:Of course it didn't work by ezdiy · · Score: 1

      The problem is real video has lots of noise it and things that look like noise don't compress well.

      Depends on material. Anime/Cartoon animation is very little noise in and itself (its just outlines and color filled surfaces). You can also strip noise quite easily, but denoise filters are not perfect (they misidentify detail as noise a bit, too). No, noise in audio/video isn't the major problem. It's simply the sheer amount of information you have there even if you clear any noise you might have.

      Video compression, afaik, is a time/space tradeoff problem.

      Duh. Any compression is in general, but it's a more complex than that. For example, you can speed up compression by using very large dictionaries, so it becomes TMTO-in-TMTO (by using more memory at compression time, you reduce compression time).

      Basically you would have a web video (or whatever) have frame N depend on the set of frames 0..N-1. Of course, if your not careful you start wasting bits to handle all the possible offsets. This would effectively take a lot of bandwidth to start up but less as time went on.

      Actually very large lookback windows are used, in spite of the diminishing returns. With 64kb lookback (gzip), you get say 50% ratio, with 64MB lookback (lzma), you get 70% ratio and so on.

    22. Re:Of course it didn't work by Anonymous Coward · · Score: 0

      Unlikely. Yes, compression can take advantage of quirks in human hearing/sight, BUT there are people who do not have the same quirks. MP3 had to be tuned because some people were able to hear artifacts and the difference between the original and the MP3. Sloot would have had to test his compression on hundreds of viewers before declaring it marketable, not just on himself. Where are these people?

      Besides, Sloot seems to have worked exclusively on video. These days there's more data in the audio signal of a bluray than in the HD video signal. What did Sloot do with the audio? Back to the silent era?? A few KB of video but still a gigabyte download for a blockbuster?

    23. Re:Of course it didn't work by dbrueck · · Score: 1

      Well, please go re-read my comment - this subthread isn't about whether or not Sloot's work was real vs fake, but about the compressibility of media and that, because humans are the consumers of the data, there are additional opportunities for higher compression beyond what one might normally expect.

      Unlikely. Yes, compression can take advantage of quirks in human hearing/sight, BUT there are people who do not have the same quirks.

      There might be *some* techniques you couldn't take advantage of because there are some people that would be bothered by them, but before you get to that point there are many, many ways in which the original image can differ from the original without people noticing it, and we've been using these techniques for many years (for example, the switch from black and white to color TV was one such case in which engineers were able to make a big optimization based on how the human visual system works). In modern codecs we use similar techniques that emphasize some parts of the color gamut over others, again because of how our eyes work.

      Beyond that, there is typically a huge amount of "error" that gets introduced in video compression before it ever becomes discernible - even when you're looking at still frames side by side (i.e. even in a scenario where you can scrutinize the frames more than people can when watching a movie). Long before really bad errors like blocking or color distortion become apparent, there is all sorts of "error" vs the original frame that is not noticeable, and that error is a tradeoff used for extra compression.

      In fact, one struggle we have with measuring video compression techniques is that it's really hard to quantify quality as perceived by humans, so there is a constant effort to come up with metrics that accurately reflect perceived quality. Things like PSNR, SSIM, and others roughly correlate to it, but are full of known flaws (i.e. there are cases where, based on SSIM values vs the original, the output quality seems very low but if you do A/B testing with actual humans, they rate the quality as much higher. And vice versa).

      These days there's more data in the audio signal of a bluray than in the HD video signal. What did Sloot do with the audio? Back to the silent era?? A few KB of video but still a gigabyte download for a blockbuster?

      I'm not quite sure I understand the point of bringing up Blurays in the context of his work since they came about after his death, but typically the amount of audio data relative to the amount of video data is vastly smaller. These days the only time you'd see audio data take up more space is if you just have such an access of space that you don't bother compressing the audio much if at all (e.g. you have 10 language tracks, each is full 7.1 (or more) channels, with little or no compression).

      A more realistic comparison would be in a typical MP4 file downloaded from a service, or in video streaming. In either case the ratio of video to audio data is typically 4:1 on the low quality end to 20:1 (or higher) for high quality stuff - in general terms, as the overall total bitrate of the media goes up, the relative portion that is audio goes down because of diminishing returns in increasing audio bitrate and because higher quality video usually correlates to higher resolution and framerates, so the data is just absolutely massive compared to the audio.

      For example, one second of raw 4k video is over a gigabyte of data (3840x2160 @ 60fps * RGB = ~1.4GB). By comparison, one second of one channel of raw absurdly HD audio (say, 96kHz, 32-bit samples) is a relatively tiny 3 megabytes. Assuming compression rates for audio and video that are in the same ballpark as each other, in order for the audio to match (not even exceed) the video data in size, you'd need nearly 50 different spoken languages, each with 10 completely independent audio channels per language. Video data is just plain huge.

  6. Mean Jerk Time by Anonymous Coward · · Score: 0

    Do you think he considered mean jerk time to solve this problem?

  7. More plausible explanation: by Gravis+Zero · · Score: 4, Interesting

    Jan Sloot had some ideas on how to compress things and despite hyping it up, he had gotten nowhere close to anything functional. With all the hype he generated, he had a deal lined up for the technology but he didn't have the goods. The stress of the impending revelation of his fraud caused him to suffer cardiac arrhythmia and he died. The source code was never found because it never existed.

    --
    Anons need not reply. Questions end with a question mark.
    1. Re: More plausible explanation: by Anonymous Coward · · Score: 5, Funny

      So you're saying he should have waited for Kickstarter to be invented?

    2. Re:More plausible explanation: by Anonymous Coward · · Score: 0

      This has happened more than once. I remember a bunch of hype about a proprietary video technology that was so secret it had to be demoed from inside a bank vault and only "authorised" people could view the source computer. The hype pumped the company stock and then... nothing came of it. This was probably more of the same since we are talking about the 90's here.

      I'm guessing it was amazingly efficient provided the video was entirely made up of mandelbrot images and contained no real video frames.

    3. Re:More plausible explanation: by Bite+The+Pillow · · Score: 1

      He might have had success with a limited range of inputs, like news broadcasts with little or no clips to show. Broadcast quality was probably 320*240, or the PAL equivalent, and that can take lots of lossy compression and survive on a CRT. It could look promising but really be near the peak of its ability.

      It's not implausible to have something like 2 tone 4*3 blocks, sending a block ID and primary and secondary color, and the decoder assembles the appropriate mosaic. Like pictures made of bunches of other photos, or ASCII art. Good enough to get the message across but little room for improvement.

    4. Re: More plausible explanation: by Anonymous Coward · · Score: 0

      PAL was 720x576. For a long time I used a Nvidia Quattro 2 Pro to output to a 24" Sony Trinitron TV. It looked amazing for that age. 640x480 which passed for NTSC (with real NTSC being 720x480) looked just dull. The extra frame rate did nothing to compensate for the slight but really noticeable and extremely annoying loss of resolution.

    5. Re: More plausible explanation: by Anonymous Coward · · Score: 0

      Actually PAL was 576 lines. Being an analogue signal the horizontal resolution was virtually limitless.

    6. Re:More plausible explanation: by dbIII · · Score: 1

      Broadcast quality was probably 320*240, or the PAL equivalent

      Why bitmaps? It could have been vector graphics using a range of primatives to match regions of the image. I was using a thing that built up vector graphics from bitmaps in the late 1980s on an Atari ST FFS.
      Considering who he saw as the market it didn't have to be computationally cheap, encoding for days was not out of the question. Decent hardware and a lot of time can make a lot of approaches that seem ridiculous on a home PC make sense.
      It's fun to speculate I suppose and I'm expressing what I see as a more likely option not suggesting that the others are wrong.

    7. Re:More plausible explanation: by White+Yeti · · Score: 1

      Madison Priest's "magic box" fits the description.

    8. Re:More plausible explanation: by Bite+The+Pillow · · Score: 1

      Could have been anything. I was thinking in terms of digital video artifacts, and most importantly pointing out how low resolution and analog equipment can make an actual tech demo possible.

      The video said he had 32 movies on a card, put it in, and there were 16 videos. I took that to mean a 4*4 grid, even smaller images. So he might have had a lot of leeway with quality.

  8. I love it that parent is modded "Insightful" by mykepredko · · Score: 4, Insightful

    That really says a *lot* about the quality of today's movies.

    1. Re:I love it that parent is modded "Insightful" by Anonymous Coward · · Score: 0

      You know what they say: garbage in, garbage out. Hollywood has shown that you don't need a special algorithm for that.

    2. Re:I love it that parent is modded "Insightful" by hcs_$reboot · · Score: 2

      That really says a *lot* about the quality of today's movies.

      That also says a lot about /. readers.

      --
      Slashdot, fix the reply notifications... You won't get away with it...
    3. Re:I love it that parent is modded "Insightful" by Anonymous Coward · · Score: 0

      That was the joke...

    4. Re:I love it that parent is modded "Insightful" by RockDoctor · · Score: 1
      I wouldn't know - I don't bother to watch them until they're released on free-to-air channels without embedded advertising.

      What do you mean, I'm part of Hollywood's problem?

      --
      Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"
  9. You don't know what you're talking about by Anonymous Coward · · Score: 2, Interesting

    The presumption isn't that it actually stored images but more likely that it contained numerous generators. It's entirely plausible is that similar to the way a neural net uses a combination of several algorithms to classify images, a similar approach could be taken to constructing them.

  10. Lossless Compression by Anonymous Coward · · Score: 0

    What the world wants (although they are now mpeged into submission) is lossless compression. Lossless compression needs large files, no matter how much fancy maths you might throw at the problem.

    1. Re:Lossless Compression by Goaway · · Score: 1

      That is not actually what the world wants. It is what you want. Most of the world most certainly does not want it.

  11. I'm not sitting through an 11 minute video to see by PhunkySchtuff · · Score: 1

    I'm not sitting through an 11 minute video to see if there are any examples of his compression - in particular the output compared to the input.

    But the proof is in the pudding - what was the output quality like?

  12. Re:Sloots by souray.del.593 · · Score: 1

    Sloots are gonna sloot!

  13. Re: I'm not sitting through an 11 minute video to by Anonymous Coward · · Score: 0

    If the video is 11 minutes then there is no concept of compression

  14. What was by alta · · Score: 1

    What was that file format that was parts of sampled songs that were played back like a midi? It was "popular" in the late 90s I think. Maybe earlier. Pre MP3.

    Sounded like a chopped up version of the original song. Much larger than midi but much smaller than wav.

    --
    Do not meddle in the affairs of sysadmins, for they are subtle, and quick to anger.
    1. Re:What was by Bite+The+Pillow · · Score: 1

      Mo3, Tracker, MOD, all similar or synonyms probably.

    2. Re:What was by Immerman · · Score: 1

      I want to say .mod?

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
    3. Re:What was by Anonymous Coward · · Score: 0
    4. Re: What was by Anonymous Coward · · Score: 0

      MOD. It contained both notes and samples to play these. MIDI files could also contain and upload a sample bank via SYSRQ, but usually never had them, and the samples in the MOD were very low sample rate and the players lacked advanced waveform manipulation capabilities. It wasn't bad though, especially considering most files were created by hobbyists reverse engineering the song, not through automated FFT which is really possible these days.

    5. Re:What was by Rockoon · · Score: 1

      Its called wavetable synthesis, and it its a compression technology.

      --
      "His name was James Damore."
  15. It works the same way a coin goes into a slot by mykepredko · · Score: 0

    Let's say you want to transfer an image on a coin - you could pass it by the flat side into a circular opening, which means that the amount of area needed to transfer the coin is Pi*r^2 where "r" is the radius of the coin.

    BUT, if you were to rotate it in space 90 degrees, and this is where the genius lies as it doesn't matter what part of the disk you rotate, then you can slip the coin into a thin rectangle which has much less area than the circular opening. In this case, the area required is 2r*thickness.

    To give you an idea of how amazing this is, look at what happens if the thickness of the coin is 1/10th the radius of the coin - the amount of area required is 314.159 times *less* if you were to use the rectangular opening (which we call a "slot"). The less the thickness of the coin, then the better the improvement in area (which is the amount of space needed to transfer the coin).

    Sloot's genius was figuring out, in software, how to represent video data as a circular disk, rotate it 90 degrees and transfer it using very little bandwidth to a receiver which could reverse the process.

    1. Re:It works the same way a coin goes into a slot by Immerman · · Score: 1

      The real trick is storing the data in two-dimensional bits, so that each bit can store many thousands of times as much data. /sarcasm

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
  16. Yes, it's possible by deek · · Score: 5, Funny

    Myself, I've come up with a system that can compress a video file down to one byte. Unfortunately, it has some limitations. The size of the decoder is approximately the same size as the uncompressed video file, and it will only work on one specific file.

    Damn, should I be afraid for my life now?

    1. Re:Yes, it's possible by Boronx · · Score: 1

      I can do the same thing and decompress 256 video files, so you're probably safe.

    2. Re: Yes, it's possible by Anonymous Coward · · Score: 0

      I have just invented a similar system which compresses down to two bytes and can work with over 65 thousand files. This is now my intellectual property and you will all be hearing from my lawyer.

    3. Re: Yes, it's possible by Anonymous Coward · · Score: 0

      I have a more aggressive compressor which reduces all video, sound, data, executables, and cat pictures down to just a single bit. And that bit is '0'. And the decoder takes no space and executes in O(0) time.

    4. Re: Yes, it's possible by Anonymous Coward · · Score: 0

      Good thing I patented 65 BEND OVER BRO

    5. Re:Yes, it's possible by Anonymous Coward · · Score: 0

      I can do the same thing and decompress 512 video files, but occasionally you get the wrong one and need to skip forward.

    6. Re:Yes, it's possible by Drakster · · Score: 2

      Always thought it would be an amusing joke if someone who produced a new image file format wrote the following in the spec.

      Inputs with the total size of 0 bytes should return an image of Lenna in the highest possible quality.

    7. Re:Yes, it's possible by Anonymous Coward · · Score: 0

      Amateur. My system can turn 10 bits into as many as a thousand different video streams and the decoder is already built into your existing set top box. The only catch is that you can't control what is on any of the streams and it requires a cable television subscription.

    8. Re:Yes, it's possible by Anonymous Coward · · Score: 0

      I too have created a CODEC, mine is able to reduce the size of the information, WITHOUT COMPRESSION, with a factor of 2 million. Unlike you and Sloot, I am able to provide a sample of the reduced information. For example, I can reduce an entire video down to just 28 bytes.

      Here is the reduced data: "https://youtu.be/dQw4w9WgXcQ"

      Unfortunately, in order to extract the original video you'll need my special secret source code that remains untracable in a demo machine that looks a bit like an easybake oven. You'll also need my several exabyte data file.

  17. Dead scammer, not dead inventor. by gurps_npc · · Score: 4, Interesting

    I believe it was a scam. He never really had that good compression. Either he got cold feet and offed himself before he had to deliver the product that would not work, or someone else found out and killed him in a a rage at being tricked.

    While yes, there are a few scientific advancements that are remarkable, in fields like computer file compression that are:

    1) Essential to an existing, highly profitable business
    2) Mathematically interesting
    3) Being heavily researched by multiple people.

    then any advancements get duplicated in less than 10 years. Too much money, brains and corporate greed focused on this issue for us not to figure out it or something similar by ourselves.

    This has not been duplicated, therefore it was fake.

    --
    excitingthingstodo.blogspot.com
    1. Re:Dead scammer, not dead inventor. by tehcyder · · Score: 1

      That's the most BS excuse I've heard, $5 says you are a non-degree holding wanna be loser? yeah

      Why would anyone WANT to be a loser?

      --
      To have a right to do a thing is not at all the same as to be right in doing it
    2. Re:Dead scammer, not dead inventor. by Rockoon · · Score: 1

      Indeed it would be quite unusual at this point to learn that this guy was many decades ahead of the professionals in the field, using a strategy he didnt even invent, yet still those professionals arent even close to where he was decades later.

      Probably a scam artist, but there have been so many comments here making claims about impossibilities and so forth... at least you know what motivates a scammer... these idiots that dont know what they are talking about here tho... got no idea what motivates them... they just like pretending to be experts because they learned about pigeon holes once.

      --
      "His name was James Damore."
    3. Re:Dead scammer, not dead inventor. by 0111+1110 · · Score: 1

      Because the people who create great breakthroughs in science generally started out as 'losers' by the standards of the common people who use such terms. It usually just means they are ugly and intellectual and intelligent.

      --
      Quite an experience to live in fear, isn't it? That's what it is to be a slave.
  18. Computer magazine's ruby rod by Latent+Heat · · Score: 1

    Back in the day when Dr. Dobb's Journal and other computer magazines were a thing, one editorial writer was reflecting on how allowing multiple levels of an analog signal could encode more than one bit per symbol or cell or splotch or what have you. The idea was extended to a very hard, dimensionally accurate ruby rod upon which a very thin scribe line was made, and the writer mused that a very large number of bits of data could be encoded on that one physical object.

    Since Claude Shannon's insights, Information Theory is also a thing. The number of bits goes linearly with the number of symbols or cells or optical splotches, but it only increases with the logarithm of the number of signal levels represented by those physical objects. A 16 bit unsigned integer can encode 2^16 or 65536 levels, 32 bit encodes about 4 Gig whereas a 64 bit unsigned integer encodes a number of levels exceeding most counts of objects in the known Universe -- think of tale of the king wanting to reward the inventor of the game of chess with anything he wanted, and the dude asked for 2^64 grains of wheat (one grain on the first chessboard square, two grains on the second, four grains on the third, and so on).

    Turning this relationship around, scribing that ruby rod to the precision of, dunno, the Planck Length, gets you what, only a 100 bits of storage or something like that?

    You are finding a related fallacy in this super video compression scheme?

    1. Re:Computer magazine's ruby rod by cheesybagel · · Score: 2

      That's nothing. BLAZE MONGER can compress 16kx16k video at 150 fps with 1-color (which must be black) at 0 bits/sec.

    2. Re: Computer magazine's ruby rod by Anonymous Coward · · Score: 0

      Not to not pick, but thats 2^65-1

    3. Re:Computer magazine's ruby rod by Just+Some+Guy · · Score: 1

      Pretty much, yeah.

      --
      Dewey, what part of this looks like authorities should be involved?
    4. Re: Computer magazine's ruby rod by Anonymous Coward · · Score: 0

      2^64-1, actually. It's the series sum 2^n for n=0->63, not n=1->64.

  19. Very interesting by HalAtWork · · Score: 1

    I had a similar idea to develop an algorithm that would follow through and propagate successive frames of motion in a video, first analyzing it to create certain required textures and meshes, then applying it to various meshes that would be created by tracking the individual characters, objects, and sets in a video. I always believed it could work but it's a huge undertaking.

    1. Re:Very interesting by gl4ss · · Score: 1

      ..you're basically describing some of the schemes already used in video compression.

      it's still like.. slooth was claiming 100x over that.

      --
      world was created 5 seconds before this post as it is.
    2. Re:Very interesting by RespekMyAthorati · · Score: 1

      This is exactly what existing video codecs do.

    3. Re:Very interesting by HalAtWork · · Score: 1

      Not exactly, what I had in mind was using the video to copy a model that would persist beyond key frames and even be retrieved later on if there was a close enough match, something much less linear than current offerings

    4. Re:Very interesting by HalAtWork · · Score: 1

      Only in a very linear fashion, and does not model characters and backgrounds to create rigs and sets

  20. Compression Tweaks by Bruce+Perens · · Score: 5, Insightful

    Sloot wasn't the only "Compression Tweak". This is someone who has compression "ideas" but can never get the product working. There was one in the US who wrote me for a long time in the 90's. One thing I remember is that he dropped hints about encoding data in the spaces in between bits. Of course this makes zero sense.

    1. Re: Compression Tweaks by Circlotron · · Score: 1

      The whole thing makes me think of DNA that contains maybe 3GB of data but contains all the information to produce a human brain, perhaps the most complex object in the physical universe.

    2. Re: Compression Tweaks by HuguesT · · Score: 2

      Yes but DNA does not "encode" a human. It encode a process for generating a body, which in the human case takes about 20 years. The encoding is not one-to-one either, see twins.

    3. Re: Compression Tweaks by Anonymous Coward · · Score: 0

      DNA describes an inverted instruction to encode proteins that interact with fuel, water, and energy inputs to produce ingeniously complex machinery that itself produces an envelope and maintains it. This happens in massively parallel fashion that dwarfs anything we have ever made to be massively parallel (a trillion cellular "threads" per human). These parallelized cells then interact further with each other to create and maintain a community of cellular mass and symbiotic microbes. From this, sentience emerges from the chemical interactions of nerve tissue cells.

      It's literally code to generate machines to generate machines to generate machines to generate machines.

      As an aside, I find it comical that anyone thinks this happened by random chance.

    4. Re: Compression Tweaks by Bruce+Perens · · Score: 1

      I find it even more comical that someone who is the result of random chance denies that fact..

    5. Re: Compression Tweaks by jalet · · Score: 1

      Bruce you made my day, thanks !

      --
      Votez ecolo : Chiez dans l'urne !
    6. Re:Compression Tweaks by houghi · · Score: 1

      Well, in the past they used a 0 with a slash through it to make it different between the letter O. In Binary that is not needed, because you could use either a 0 or an O. Now you can combine the two and get 50% reduction to write instead of 01010101 you write à à à Ã. or 4à As the à is always the same, you can use 4.

      That is 0101 in binary or à à or 2à or 2.
      That is 01 in binary or just Ã

      So I reduced 01010101 to à using the spaces between the bits. Easy.

      --
      Don't fight for your country, if your country does not fight for you.
    7. Re:Compression Tweaks by fibonacci8 · · Score: 1

      Well, in the past they used a 0 with a slash through it to make it different between the letter O. In Binary that is not needed, because you could use either a 0 or an O. Now you can combine the two and get 50% reduction to write instead of 01010101 you write à à à Ã. or 4à As the à is always the same, you can use 4.

      That is 0101 in binary or à à or 2à or 2. That is 01 in binary or just Ã

      So I reduced 01010101 to à using the spaces between the bits. Easy.

      Unicode is a form of lossy compression on /.

      --
      Inheritance is the sincerest form of nepotism.
    8. Re:Compression Tweaks by dunkelfalke · · Score: 1

      Encoding data in the space between bits makes perfect sense
      https://www.youtube.com/watch?...

      --
      "It's such a fine line between stupid and clever" -- David St. Hubbins, Spinal Tap
    9. Re: Compression Tweaks by Anonymous Coward · · Score: 0

      When you have four billion years to compute the result, you can cover the entire phase space.

    10. Re: Compression Tweaks by Anonymous Coward · · Score: 0

      instruction to encode proteins that interact with fuel, water, and energy inputs to produce ingeniously complex machinery that itself produces an envelope and maintains it. This happens in massively parallel fashion that dwarfs anything we have ever made to be massively parallel (a trillion cellular "threads" per human).

      As an aside, this is also very close to the the directions for producing a vat of cheese.

    11. Re:Compression Tweaks by houghi · · Score: 1

      This was about https://en.wikipedia.org/wiki/... /. at its finest. Have they not yet converted Perl to be able to do that?

      --
      Don't fight for your country, if your country does not fight for you.
    12. Re: Compression Tweaks by riley · · Score: 1

      But...it's not random chance. It's natural selection. It's not a bunch of organic chemicals thrown at the wall...

    13. Re: Compression Tweaks by omnichad · · Score: 1

      natural selection = organic chemicals thrown at the wall...

      Selection = sticking.

    14. Re:Compression Tweaks by VAXcat · · Score: 1

      Kibo occupies the spaces between the bits.

      --
      There is no God, and Dirac is his prophet.
    15. Re: Compression Tweaks by 0111+1110 · · Score: 1

      To be fair we do not really have an adequate explanation of abiogenesis. Nothing at least that makes sense to our limited brains. We have to assume randomness just because we cannot come up with anything better.

      Obvioiusly positing a supernatural entity who does it with magic offers zero explanatory value, but the truth is we just have no idea how life came about. All we know is that once it started it was like a chain reaction that was self-perpetuating.

      Basically all life forms truly are molecular scale machines which no doubt could be built from scratch by a sufficiently advanced civilization. We are only just beginning to be able to tinker with such thing in the field of synthetic biology. We have only just begun to build actual simplistic robots that can walk around like we do. Maybe in 10,000 years we also will be able to design and create our own life forms.

      People get all mystical about life but really it's just a very advanced branch of robotics. So advanced that it is the equivalent to alien tech. It's as if an alien civilization left us something like a phaser or a spaceship with an FTL drive and we were left on our own to figure out how it works but we cannot even really begin to figure it out because it is so far beyond our current scientific understanding. Hence the mysticism.

      --
      Quite an experience to live in fear, isn't it? That's what it is to be a slave.
  21. Re: Possible by Anonymous Coward · · Score: 0

    It's not pratical even in your example. Multiple data samples will end having the same hash, unless you create a really, really big hash. You see were it's going? Compression is a difficult problem even with use of a lot of computational power.

  22. Pseudoscientific claptrap by timholman · · Score: 5, Interesting

    Sloot was nothing more than another of a long line of scam artists (or delusional inventors) who claimed to have created a "magic" compression scheme. In his case, he said he could compress an entire movie down to 8 kilobytes.

    Simple mathematics show why such schemes don't work. 8 kilobytes = 8192 bytes = 65536 bits. Assuming you have a two hour movie, then each second of the movie must be mapped into about one byte, which can have only 2^8 = 256 possible values to represent any conceivable second of video. It's mathematically impossible.

    Engineers and mathematicians have been debunking these claims for decades, but they still occasionally pop up. I remember one scheme that got some press about 30 years ago. A guy claimed to have a compression program that could take any data file and compress it down to about 1 kilobyte. And it seemed to work, according to several people who tried it. As it turned out, the "compressed" file was nothing more an alias pointing to the original file, which was hidden from directory view by the program. When you "uncompressed" the file, the original file was unhidden. But it was a neat trick as long as you didn't try to copy the "compressed" file to another disk.

    Sloot's program was "lost" because it never existed, just like the magic 300 mpg carburetors where the plans were "lost".

    1. Re:Pseudoscientific claptrap by Anonymous Coward · · Score: 1

      That also reminds me of Fermat's last theorem. It does have a proof, but Fermat didn't know it, and it certainly didn't die with him.

    2. Re:Pseudoscientific claptrap by Anonymous Coward · · Score: 2, Funny

      Assuming you have a two hour movie, then each second of the movie must be mapped into about one byte, which can have only 2^8 = 256 possible values to represent any conceivable second of video. It's mathematically impossible.

      That might actually work, as long as you restrict the input domain to the set of Michael Bay movies.

    3. Re:Pseudoscientific claptrap by Anonymous Coward · · Score: 0

      uh... simple video codec understanding shows why such claims about simple mathematics relative to assumptions about every second of video being stored in its entirety, are ignorant.

      the decoder for his scheme was MASSIVE. you aren't storing everything for every second, just multiple hash indexes for possible changes to multiple sprites or masks or shaders or whatever new techniques he may have used.

    4. Re:Pseudoscientific claptrap by Orgasmatron · · Score: 5, Interesting

      You may be thinking of OWS, the "fractal compression program". The "compressed" file was nothing more than a list of blocks on the disk that the original file occupied. You could test it by compressing a file, deleting the original, then decompressing (=undeleting) it.

      But if you copied it to a floppy and took it to your friends house, it mysteriously failed...

      Here is a mention of it here on slashdot back in 2006: https://slashdot.org/comments....

      And a very brief mention in the compression FAQ: http://www.faqs.org/faqs/compr...

      Internet references to it are spotty. It got passed around on BBSs and sneakernet back before home PCs really started connecting to the internet in a big way.

      You may also be thinking of WIC, which I didn't encounter, so I tend to assume that it didn't achieve as wide a distribution as OWS. The mechanism described for WIC sounds more like what you are describing. Either way, you are off by about 10 years. They were in the mid 90s, not the mid 80s.

      --
      See that "Preview" button?
    5. Re:Pseudoscientific claptrap by timholman · · Score: 2

      You may be thinking of OWS, the "fractal compression program". The "compressed" file was nothing more than a list of blocks on the disk that the original file occupied. You could test it by compressing a file, deleting the original, then decompressing (=undeleting) it.

      But if you copied it to a floppy and took it to your friends house, it mysteriously failed...

      You're right, I was thinking of OWS. Thanks for filling in the gaps in my memory.

      I recall that it actually got some press in the local newspaper back in the day before being debunked. No doubt the guy who wrote the program had a great laugh about it.

    6. Re:Pseudoscientific claptrap by guruevi · · Score: 1

      Yes, but by the time you have encoded every possible combination that can be represented in a 'movie' (everything that's ever passed any screen), you need an infinite size 'database' and as your database grows, so will your pointers so your pointers will eventually become larger than the thing you're trying to encode.

      His encoding scheme was basically: Let's send everyone all movies ever made (which with good inter-frame compressions would be as close to mathematically optimal) and use a DRM key to unlock which one they can watch.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    7. Re:Pseudoscientific claptrap by DoctorBit · · Score: 1

      I might remember something about that 30 year ago scheme! One of my father's friends (a professional mathematician who worked for a well-known defense contractor) invested in it in mid-1987. I thought about it and told my father's friend that I didn't think the claims were plausible and suggested that he try to get his money back - which annoyed him. Never heard anything about it since, and my father's friend died many years ago. I wish I knew the name of the technology or company.

    8. Re:Pseudoscientific claptrap by gl4ss · · Score: 1

      uh... simple video codec understanding shows why such claims about simple mathematics relative to assumptions about every second of video being stored in its entirety, are ignorant.

      the decoder for his scheme was MASSIVE. you aren't storing everything for every second, just multiple hash indexes for possible changes to multiple sprites or masks or shaders or whatever new techniques he may have used.

      you're stupid or what? his scheme would have allowed encoding 30 OTHER MOVIES into that 8 kbytes. each one of which could have 30 other movies, each one of which could have 30 other movies...

      look, it doesn't matter if the decoder is 360 megs or 2 gigabytes or 100 terabytes. it just doesn't work and never could work..

      he just chose 360 megs as it was way more than what desktops had ram normally at the time and it was big enough to hide a sample decoded video inside it.

      --
      world was created 5 seconds before this post as it is.
    9. Re:Pseudoscientific claptrap by RespekMyAthorati · · Score: 1

      Simple mathematics show why such schemes don't work. 8 kilobytes = 8192 bytes = 65536 bits. Assuming you have a two hour movie, then each second of the movie must be mapped into about one byte, which can have only 2^8 = 256 possible values to represent any conceivable second of video. It's mathematically impossible.

      What you describe is only true for a lossless codec.
      If you don't mind a certain degree of lossiness, you can achieve any degree of compression from any input.

      Of course, you may end up with two hours of random shit, which would be an improvement for many movies.

    10. Re:Pseudoscientific claptrap by RespekMyAthorati · · Score: 1

      The "compressed" file was nothing more than a list of blocks on the disk that the original file occupied. You could test it by compressing a file, deleting the original, then decompressing (=undeleting) it.

      That's actually pretty clever.
      Useless, but clever.

    11. Re:Pseudoscientific claptrap by teg · · Score: 2

      You could try transmitting the actual manuscript - compressed, with a partial dictionary known on the other side - and then have it decompressed by letting an AI "acting" it? The same way you could transmit sheet music instead of an actual performance?

      Obviously, the actual performance will differ and there is no way he had technology to do anything like that...

    12. Re:Pseudoscientific claptrap by magusxxx · · Score: 1

      This reminds me of a Guy Kawasaki lecture I went to decades ago. He was promoting Disk Doubler, a compression tool. He mentioned how he saw his competition giving a very interesting demonstration. (Sorry, don't remember who exactly the other guys were.) They shrunk a 200+ page text document down by 98%. Not bad. Until you realized the file was one sentence repeated over and over again.

      --
      Care killed the cat, but satisfaction brought it back.
    13. Re:Pseudoscientific claptrap by grumbel · · Score: 1

      he said he could compress an entire movie down to 8 kilobytes.

      To put that into perspective, the script for a movie is in the order of 200kB, even after a run through xz it is still around 32-64kB. So even if your decompressor has all the smarts of a Spielberg and could reconstruct the movie from just the script alone, you don't even have enough storage for the script itself. And of course there is no information in the script that would be detailed enough to reconstruct that exact movie, so you would just end up with a remake anyway.

      I think compression could still be improved quite a lot when one focuses on just producing something that looks plausible instead of something that is an accurate reproduction of the original. But even in the best-case scenario 8kB is not nearly enough.

      The only way this could be possible is if you were literally God and could predict and enumerate all the movies that will ever be filmed. An 8kB number should be more than enough to identifier a movies by a unique id. Predicting all possible movies would however get a little tricky.

    14. Re:Pseudoscientific claptrap by elgatozorbas · · Score: 1

      Assuming you have a two hour movie, then each second of the movie must be mapped into about one byte, which can have only 2^8 = 256 possible values to represent any conceivable second of video. It's mathematically impossible.

      Apart from the mathematical impossibility, the practical implications are also interesting. This would also imply there are only 256 different fragments which can be played in any second ever. And all movies would consist of only a reshuffeling of these 256 seconds. That means it becomes impossible to do a quiz in which people are shown a 1 second clip of a movie and need to guess what movie it is, because there could be only 256 distinct answers while the number of movies is obviously much higher. Even worse: because all movies would be made up of a succession of one of these 256 1-second clips, any arbitrary clip would probably appear in every movie ever made. Almost all answers to the question would be valid.

    15. Re:Pseudoscientific claptrap by Impy+the+Impiuos+Imp · · Score: 1

      That still requires the data for all 65536 movies to be migrated to your disk as part of the algorithm installation, absolutely crushing the savings of storing or transmitting two bytes for the "compressed" movie.

      You don't get something for nothing.

      --
      (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    16. Re:Pseudoscientific claptrap by Daetrin · · Score: 2

      We got ahold of a copy of that (or something very similar) in my computer lab in High School. I believe it was the first lab, so that would have been either '92 or '93. I remember discussing the fractal methods it claimed to use to accomplish such extreme compression.

      We tested it out, and were both amazed and suspicious when it seemed to work. But as soon as we tried copying the "compressed" file to another machine and it failed we knew something was up, and it didn't take long after that to figure out what was really going on.

      --
      This Space Intentionally Left Blank
    17. Re:Pseudoscientific claptrap by KBORG · · Score: 1

      Damn - it was known as NABOB.ZIP and I'd forgotten how it worked, brings back memories - Thank You!

    18. Re:Pseudoscientific claptrap by omnichad · · Score: 1

      That's assuming that the movie is linearly mapped to the data in the file. If the entire file were inputs to a massive equation, it would still be just as ridiculous - but it would leave far more than 256 possibilities for that 1 second (2^8192).

    19. Re:Pseudoscientific claptrap by timholman · · Score: 1

      That's assuming that the movie is linearly mapped to the data in the file. If the entire file were inputs to a massive equation, it would still be just as ridiculous - but it would leave far more than 256 possibilities for that 1 second (2^8192).

      That reminds me of a short story I read many years ago, where a group of super-intelligent humans sent into space transmit back a massive data dump in the form of a mathematical expression, with the intent that it be evaluated and the answer converted into binary form, and the information read from the binary patterns. The protagonists realize there isn't enough computer storage space on the entire planet to hold the answer in binary form, and embark on a crash program to build a computing complex large enough to evaluate the expression.

      If your "compressed" file ceases to represent actual video data, and instead becomes an identifier for a particular dictionary entry, then naturally all bets are off. If you had the video equivalent of the Library of Babel as your encoder / decoder, you could "encode" 1.8E308 movies into a 1 kB file. Of course, you'd require many orders of magnitude more atoms than exist in the entire universe to construct your codec, and a supercluster-sized black hole to power it, but think of the great movie nights you could have with your friends. :-)

    20. Re:Pseudoscientific claptrap by Anonymous Coward · · Score: 0

      Sloot was nothing more than another of a long line of scam artists (or delusional inventors) who claimed to have created a "magic" compression scheme. In his case, he said he could compress an entire movie down to 8 kilobytes.

      Simple mathematics show why such schemes don't work. 8 kilobytes = 8192 bytes = 65536 bits. Assuming you have a two hour movie, then each second of the movie must be mapped into about one byte, which can have only 2^8 = 256 possible values to represent any conceivable second of video. It's mathematically impossible.

      Except your simple mathematics is useless here.
      Who said you had to break it down by seconds? If I'm encoding an image, does the first 1/10th of the file have to encode 1/10th of the picture? Not if I'm using Run Length Encoding.
      Modern video codecs often work by encoding the differences between frames. One second of video with a small object in the center of a black screen takes less space than one second of a full-frame rollercoaster ride. I can actually show this by encoding two-hours of black screen into a VERY small file.

    21. Re:Pseudoscientific claptrap by omnichad · · Score: 1

      I wasn't talking about dictionary generation, but more procedural generation where the entire file is the starting inputs.

  23. Excellent Viral Marketing by Anonymous Coward · · Score: 0

    Marketeers and viral promotion dept. working the narrow channel on this one. Right audience, of course.

  24. Re:Possible by Anonymous Coward · · Score: 0

    The problem is that you would also discover a very large number of character strings having nothing to do with Shakespeare which would hash to the same value.

    You can't do that without collisions.

  25. Snake Oil by Nethemas+the+Great · · Score: 1

    What you throw away, you cannot get back. Think of compression as a plastic bag filled with snow and maybe a little air at the top. Assume air represents '0' and water represents '1'. A snowflake can be said to be represented then by a sequential pattern of 0's and 1's. The more you squeeze the bag, the more air (0's) you throw away. Initially you're merely squeezing the air at the top out but as you continue to squeeze this bag, you start throwing away the 0's contained within the snowflakes themselves, in so doing the shape of the snowflakes becomes less and less defined. Squeeze all of the air out and you're left with water and absolutely no notion that the original defining shapes of the snowflakes.

    Lossless compression in a sense only squeezes the air at the top out. Lossy compression continues on and erodes the actual pattern defining the snowflake. No matter how clever your algorithm is at deflating the bag, there are very real limits to how much you can squeeze before you stop being able to recognize the contents as the snow you once put into the bag. Procedural generation is a form lossy compression. Minecraft worlds are procedurally generated from small seed strings. These generated worlds are very impressive in their complexity and expansiveness. However, from those seed strings you have precious little control over what is generated. You cannot simply change the seed from "aaaaaa" to "aaabaa" and lower the height of a hill by 2 meters at coords 100,50 without very drastic changes propagated elsewhere throughout the entire world.

    If someone killed Sloot it wasn't to suppress his invention, it was to save face for being conned by him.

    --
    Two of my imaginary friends reproduced once ... with negative results.
  26. Saw the documentary by hcs_$reboot · · Score: 1

    First of all, the guy who made the documentary didn't invent black bordered subtitles, and white on white is not easy to read.
    Then, people who actually saw Sloot's invention talk about "movies" without any more details, like length or quality. Not much convincing. So, maybe Sloot invented a device that simply uses a kind of wifi?

    --
    Slashdot, fix the reply notifications... You won't get away with it...
  27. KGB by ArylAkamov · · Score: 1

    Reminds me of KGB Archiver.

  28. Machine vision problem by ezdiy · · Score: 1

    This level of compression translates to AI vision.

    While retina and optic nerve run roughly at 8Mbits (retina already does a lot of pre-processing), visual cortex reduces this to 10kbit/s per eye and feeds it further into brain. Which is the same ballpark claimed by this kook.

    It's obviously possible with *heavy* post processing to make very high representation of objects perceived. To replay, you need to hallucinate it back (akin to dreams). Problem is you need something as good as humans visual cortex, while modern ML vision is on the level of a fruit fly. Also, the fidelite would be ... interesting to say the least, as the hallucinated playback would heavily reuse "stage props", just as human brain does.

    [1] Anderson, C.H. et al. (2005) Directed visual attention and the dynamic control of information flow. In Neurobiology of Attention (Itti, L. et al., eds),
    [2] Norretranders, T. (1998) The User Illusion, Viking

  29. None by Ryanrule · · Score: 1

    None view.

  30. Anything is possible. Practical though? by LostMyBeaver · · Score: 3, Interesting

    I theorized that the Mandelbrot is capable of generating any string of data of any size given a high enough resolution, high enough iterations and enough time to process it. I haven't retested my theory on scale since 1998 (significant because of general availability of GPUs... if you count Voodoo2 as a GPU), but processing on a 486DX-50 with 16 megs of RAM and Linux allowed me to search for simple strings (kilobytes in size) within reasonable time.

    What I found was that every 4KB sequence I randomly generated as part of a set was able to be located within the Mandelbrot set. Results generally were found in the nautilus or horse tail areas. I used several search methods.
    1) Linear
    2) Rectangular spiral clockwise (variable width, variable height)
    3) Rectangular spiral counter-clockwise (variable width, variable height)
    4) Zig-zag in 45 degree rotations and variable width and variable height)
    5) Elliptical Spiral (clockwise and counter-clockwise, variable width, variable height)

    I also experimented with added dimension which included iteration as a 3rd dimension when pattern searching allowing strings to extend across multiple iterations in cubical, spheroidical, etc... footprints. I had to abandon this effort due to computational complexity.

    I also experimented with experimenting with adding bit layers as a 3rd dimension when searching, but this added even greater computational complexity than the previous method described above but certainly promised greater results.

    The algorithms I used were exhaustive. So where a video motion vector search algorithm would abandon searching a given direction for performance reasons when the delta values in said direction were not yielding results, I searched every pattern type for every pixel for every parameter... etc...

    Now, given my computational capacity and limitations of the time, the average seek time per 4KB string before finding a result was 1-2 weeks. All my code was heavily optimized to exploit memory as it was used on the architecture. Consider a combination of Michael Abrash style coding/optimizations with more specific knowledge of the specific CPU and memory it would run on. I also spent a great deal of money getting memory with 60ns response time (though I never measured better than 64ns on the scope). I also used a Micronics motherboard which at the time meant something as Intel had yet to enter the chipset market and as ECC memory was far too expensive... for pretty much anyone on that scale, and the memory controller was not yet part of the CPU, I needed a reliable and documented chipset to work with.

    Ok, so here's the summary of all the results
    1) It is worth investing greater effort in scientifically proving that absolutely any string can be found in the Mandelbrot Set given enough iterations, resolution, search creativity and of course CPU power.
    2) It is believed (though through non-thorough experimentation, not mathematically proven) that any 4KB string can be found in the Mandelbrot Set
    3) Once found, depending on the resolution and number of iterations provided as a coefficients for identifying the set, the data can be retrieved in a reasonable period of time (deterministic, though variable) on any device. The real computational cost was the search itself.
    4) This method of compression has no value for video as the data sets are too large.
    5) It is most common that longer string lengths searched for often requires higher resolutions and higher iteration counts to be found. This is common, not a rule. Depending on the search pattern, often strings with a more level histogram distribution could be found as bit patterns in lower resolutions and iterations.
    6) The size of the strings found increased far more rapidly than the size of the coefficients required to represent the find. Meaning without employing additional methods such as entropy coding the :
    - resolution of the Mandelbrot image,

    1. Re:Anything is possible. Practical though? by guruevi · · Score: 3, Interesting

      You can take any irrational number and find any value in it. The problem is that you can't truly compress anything that way, your coefficients will be the same size or larger than any number you're trying to get as a result (the optimum compression can never be better than 2:1). If you're looking for "lossy" compression (basically, some values (but no more than x %) can change, you'd still have to look through the nearly infinite space for possible values and at best you'll get some compression close to whatever entropy allows.

      To encode all data that's currently being stored or generated this way by humankind, your encoder would have to work through ~2000 exabytes.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    2. Re: Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      Are you a wizard?

    3. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      You do not need mandelbrot set. Every concievable bit sequence is present in the bit representation of Pi somewhere. Unfortunately, position of that bit sequence in Pi would likely take as much space as the bit sequence itself. Now it may be possible to replace Pi with some other bit sequence where you can encounter typical bit sequences from a typical video sooner rather than later. Then you can achieve at least some compression. If you discover some pattern in the videos (or sounds) it may be possible to write an algorithm that generates that sequence further saving space at the expense of CPU time.

    4. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      You can take any irrational number and find any value in it.

      Just to nitpick, this isn't true. For an obvious example, consider: 0.101101110111101111101111110...
      The number is certainly irrational: the decimal representation of any rational number either finitely terminates or eventually repeats a finite sequence, and this does neither. It also obviously does not contain any value you can think of; anything involving the digits 2-9 (in base 10, here) is obviously not present. And even simple strings involving only 0 and 1 are absent: eg, 00 or 010.

      I can't think of a name for the property you've described, but a somewhat similar concept is that of a normal number, if you're interested in further reading.

    5. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      2) It is believed (though through non-thorough experimentation, not mathematically proven) that any 4KB string can be found in the Mandelbrot Set

      So any 4 kB string can be found? That means all 4 kB strings exist somewhere, and since they are different, they would have to be in different locations. So to point to each location, you need a different description for each location, which means you would need 2^4098 different descriptions to point to each location and recover each string. Guess how much space you need to describe at least 2^4098 positions?

      Compression only works because some patterns are more common than others or because some information is less important and can be replaced with some amount of noise in a lossy compression. There is no connection between the Mandelbrot set and a completely arbitrary string, so it is not going to save you any space. You might be able to find some obscure connection to common music patterns there, but not to arbitrary sounds and speech. However, something like mp3s exist because there is definitely a connection between a model of human hearing and what information is important vs. unimportant in a sound recording. Similarly, you might find some visual patterns in the Mandelbrot set and some organic or chaotic textures, but that is a subset of visuals used in movies. On the other hand, codecs based on models of what information is important to human vision (e.g. less emphasis on higher frequencies, especially for hue vs. intensity) can do a rather decent job for any image that is relevant for humans to look at.

    6. Re:Anything is possible. Practical though? by DoctorBit · · Score: 1

      That's not possible. 4KB is 32000 bits, so there are 2 ^ 32000 ~= 10 ^ 10000 possible 4KB sequences. That's a one followed by 10,000 zeros number of sequences you would have needed to compare to find the one you were looking for. If you could have compared a billion sequences / second, it would have taken the age of the universe times nearly infinity to locate the sequence you were looking for. You must have had a bug in your code - perhaps your random number generator was faulty, or you weren't comparing the full 4KB length of the data.

    7. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      though through non-thorough experimentation

      Nice! I will show this to my kids learning the english language. Maybe putting it as "though through thorough thought".

    8. Re:Anything is possible. Practical though? by gl4ss · · Score: 1

      no its not.

      the precision(data size) of the coordinates into the mandelbrot set quickly become larger than the data you are pointing into.

      --
      world was created 5 seconds before this post as it is.
    9. Re:Anything is possible. Practical though? by wildstoo · · Score: 1

      though through non-thorough

      That sequence of words made my head explode more than what you're actually describing.

    10. Re:Anything is possible. Practical though? by Rockoon · · Score: 1

      More to the point:

      Data compression requires a model. The model is used to predict the original data. Corrections are passed along so that the decompressor knows when the model is wrong (like the compressor does) ... often these models are built "on-line" as the file is being compressed/decompressed so that the model itself isnt even something thats passed to the decompressor.

      There is no reason to think that the mandelbrot would be a good model for predicting anything other than more mandelbrots and related julia's AFAIK.

      Good models involve: I've seen something like this before... I'm keeping statistics... the next symbol is X1 with probably P1 or X2 with probability P2, etc...

      The compressor could then use a standard entropy encoding technique from here and encode which possibility is really next ... range coding or arithmetic coding.. but that actually over-complicates it... instead just a right or wrong indicator is encoded... and THAT indicator is encoding using range/arithmetic because we have a probability for those too.

      --
      "His name was James Damore."
    11. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      If the power (electrical I mean) cost of compressing/decompressing is higher than the cost of storage (disk space I mean) of the data being compressed then the algorithm has no reason to be.

    12. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      No, you can not take any irrational number and find any value in it. For example, remove all the 0 digit from pi number in decimal. It remains an irrational, but you can not find any 0 in it. See https://en.wikipedia.org/wiki/Normal_number

    13. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      You can take any irrational number and find any value in it.

      Trivially false. The irrational number in the decimal space that consists of ones with linearly increasing numbers of zeros between the ones will not have a two in it at any depth. (1.1010010001...)

      It is believed that the useful constants suspected of irrationality (e, pi, etc.) contain all possible numerical sequences, eventually.

    14. Re:Anything is possible. Practical though? by Gornkleschnitzer · · Score: 1

      Though through a thorough thought trough?

    15. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      Congratulations on discovering information theory!

      Now read Claude Shannon and you can learn there is actual provable math behind why this doesn't work.

    16. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      Sending messages to the stars?

    17. Re:Anything is possible. Practical though? by Anonymous Coward · · Score: 0

      Not true.

      Consider the irrational number formed by counting in base two, but representing a number in base 10 (that is 0.11011100101...), which is a bit over one tenth. This is irrational (never repeats) but not every string of digits is found in it.

      Check out "normal numbers", a nicely weird bit of mathematics - most numbers are normal (the set of non-normal numbers has measure zero) but we don't know very many normal numbers.

    18. Re:Anything is possible. Practical though? by fibrewire · · Score: 1

      Who are you, and you speak like I think. I'm not sure if I speak like you think, but I seriously wish there were another me. Not an exact me, as the copy would find similar results that I would. I need a different , like the same genetic traits that blueprints the type of person I am, but maybe from a different gene pool, with a different upbringing. Like if I were to have several illegitimate children with different women, and then pawned them off for adoption before settling down with Miss Right, and then have more children with her. Those children would have the "thing" that makes me the way I am, and THAT is the person I believe you to be in relation to me. Not necessarily in the manner I described, but as you continue to read this I'm sure you're becoming more curious. I even spent 15 minutes reviewing other posts that you've made here on Slashdot, and I'm quite impressed with the quality of your posts. But now thats enough ego stroking and I'm sure i've officially creeped you out. Thats okay though, I'm pretty sure you'll get over it. I also have a daughter, five kids total actually. Life is good, they are awesome, but at 37 I'm afraid my potential is not fully realized. But, perhaps through correspondence with you -LostMyBeaver- I might find a way to achieve something that I cannot do alone. So shoot my nickname an email at hotmail when you get a chance, as I want to build an accord with you, and have a few ideas to discuss. (psst - its definitely not a Sloot like video encoding system) You may get bored, you may be suspicious, but trust your gut this once and reply. To me it feels like a huge effort to reach out to a complete stranger like this, so don't prove me wrong. Send me that email, I only want to give back that irreplaceable commodity which once spent is never reclaimed. Have yourself a good evening, and know that not replying to me will result in restless sleep or perhaps a migraine. The clock is ticking - tick - tock - tick

  31. The reason Sloot was killed by Anonymous Coward · · Score: 0, Offtopic

    They discovered compressed Sloot footage of Seth Rich faking the Apollo moon landings.

  32. Late 1990s low screen definition by Anonymous Coward · · Score: 0

    Except that most people answer are like thinking HDTV in 1080p,
    back then you are talking about doing video compression
    for like Windows 95 or Windows 98 SE
    in 640x480 x 16-bit video...
    or maybe 800x600 x 16-bit video...
    or maybe 1024x768 x 16-bit video

    If you are using something like LPAQ (which is quite faster than normal PAQ compressor)
    and some adaptive neural network and lots of pre-trained data for a specific color palette,
    maybe you could get away with it in such low resolution.

    For example, PAQ8K works with binary data and PAQ8PX works with WAV audio data files,
    but it's extremely slow.

    Also, in the case of ZPAQ, you do not really encode data, you encode a neural net machine byte code
    that will produce the actual data...

    "Decompression algorithms are written in a language called ZPAQL and stored as a byte code which can either be interpreted or converted directly to 32 or 64 bit x86 code and executed. "

    However, back then with like a Pentium 166 MMX you would not go far about doing anything crazy,
    it would be way too slow.

  33. The whole story makes it clearer by guruevi · · Score: 4, Informative

    If you read the original stories in Dutch, the whole story becomes a lot clearer.

    The system he had was enclosed in a box and you could initially see his "demo" of 4 movies in low quality. There were various claims on Sloot's part that it was only x size but nobody was allowed to look into or program the system. He was going around investors fishing for money to make it work at bigger resolutions for his lossless compression algorithm. When he croaked nobody found anything in regards source code or design documents.

    It's also mathematically impossible to get the file compressions he got. At best it was a reference to a pre-programmed movie.

    --
    Custom electronics and digital signage for your business: www.evcircuits.com
    1. Re:The whole story makes it clearer by uvajed_ekil · · Score: 5, Funny

      If you read the original stories in Dutch, the whole story becomes a lot clearer.

      Personally, I found that reading the whole story in Dutch left me more confused and totally unsure how to take this. I don't read Dutch.

      --
      This is a hacked account, for which the owner can not be held responsible.
    2. Re:The whole story makes it clearer by whoever57 · · Score: 1

      Has there ever been a "technology" where the inventor shows no more than scant details that wasn't a scam?

      --
      The real "Libtards" are the Libertarians!
    3. Re:The whole story makes it clearer by Anonymous Coward · · Score: 0

      Read it again. Then you'll have read it in double-Dutch, which is exactly what Sloot's ideas were.

    4. Re:The whole story makes it clearer by Waccoon · · Score: 1

      The system he had was enclosed in a box and you could initially see his "demo" of 4 movies in low quality... He was going around investors fishing for money to make it work at bigger resolutions for his lossless compression algorithm.

      Ah, yes, the preferred tactic of magnetic perpetual machine salesman!

    5. Re:The whole story makes it clearer by Anonymous Coward · · Score: 0

      Actually, it's interesting how closely related English and Dutch are.
      I won a trip to the Netherlands in an academic competition my senior year of high school. I spent of couple of weeks staying with a Dutch student and his family. While there I saw part of an American movie on TV. It was set in the Deep South, and some of the accents were really difficult for me. I suddenly realized I could figure out more of what they were saying using the Dutch subtitles!

    6. Re:The whole story makes it clearer by Anonymous Coward · · Score: 0

      I don't read Dutch.

      Well, of course! Why would you read Dutch? Reading Dutch only confuses people.

    7. Re:The whole story makes it clearer by Anonymous Coward · · Score: 0

      Het was een totaal nep scam.

  34. too-many-channels-to-watch by Anonymous Coward · · Score: 0

    does this mean Comcast will start cramming an order of magnitude more TV channels down my
    pipe?

  35. IIRC... by Anonymous Coward · · Score: 0

    If this is the one i remember, it worked by caching a absolute s#!& ton of data to the end devices. Once the 'index' was cached you could play SD video over dialup.

  36. 8 kilobyte move by PPH · · Score: 2

    But you need to download a 370 Mb CODEC for each one.

    --
    Have gnu, will travel.
  37. No Fat Sloots by Anonymous Coward · · Score: 0

    Well i'm not a fan of fat Sloots so I would prefer them to be compressed. By what means they get compressed I care not.

    1. Re:No Fat Sloots by Anonymous Coward · · Score: 0

      I just sent you a YMCA takedown notice.
      -creimer

  38. We already have this by Gabest · · Score: 5, Funny

    There is a few hundred byte long URL to a movie and it can be "decompressed" to your hard drive in a matter of minutes, or hours.

    1. Re: We already have this by Anonymous Coward · · Score: 0

      Ok I get the joke but literally you can encode data in url shortener urls, and this works recursively

  39. It's not as crazy as it sounds by JoeyRox · · Score: 2

    I have a friend who invented an alternative to tracking time in place of the proposed time_t. Just 24 hours before he was to present it at a conference he was found murdered. I'll never forget the day it happened - January 1st, 1970.

  40. I thought we already had this by Powercntrl · · Score: 1

    There's already a way to compress a movie down to a few KB. You simply post it as a torrent and let other people store it on their computers. After it's been well-shared, you can simply delete it from your machine. Can't beat that level of compression. See, I've included Big Buck Bunny in this post:

    magnet:?xt=urn:btih:88594AAACBDE40EF3E2510C47374EC0AA396C08E&dn=bbb_sunflower_1080p_30fps_normal.mp4&tr=udp%3a%2f%2ftracker.openbittorrent.com%3a80%2fannounce&tr=udp%3a%2f%2ftracker.publicbt.com%3a80%2fannounce&ws=http%3a%2f%2fdistribution.bbb3d.renderfarming.net%2fvideo%2fmp4%2fbbb_sunflower_1080p_30fps_normal.mp4

    --

    ---
    DRM is like antifreeze, to the MPAA/RIAA it's sweet, to the consumers it's poison.
  41. My 100kB solution for any movie: by Anonymous Coward · · Score: 0

    Here's my 100kB solution for any movie ever made: The 100kB is an integer index into the base16 digits of pi, and a length.
    The decoder is really simple, but the encoder currently takes longer than we have left until the heat death of the universe.

    Please fund my compression/encryption library π-coder!

  42. Well by Anonymous Coward · · Score: 0

    512kb seems a bit crazy. I don't think that it can be done in that little space. That being said, I would not be surprised if someone came out with something 10 to 30 times better than what we have now.

  43. answer is yes. proof is the demoscene. by Anonymous Coward · · Score: 0

    i want to show you dwitter. It's a website where people make graphics demos using 140 characters of javascript.

    If you think about it from the prespective of compression, then yes, dwitter could be thought of as a compression/decompression system. you give it 140 utf8 characters and it can produce hundreds of different video streams each containing many megabytes of data.

    Most 'demoscene' environments are like this. Someone came out with a demo for the IBM PC 5150, using an 8 bit intel 8088 processor, a few years back, and its several minutes of video and audio. now remember that old machine had like a few hundred K of RAM and floppy disks were 360Kilobytes.

    ---

    the problem is that give these systems any old video stream, it's not going to spit out 140 characters of javascript. that is where the genius of encoders comes in. They have figured out what the 'average any old video' tends to have and they have created generalized algorithms that work pretty well.

    however. given an extraordinary amount of clevernes and time, you might be able to create a system that could take a given video, and produce 140 characters of javascript that, when fed through dwitter, would create the original video. dwitter here, along with your browser, and it's javascript interpreter, would be the 'decompressor'. now, on a lot of videos it might look like garbage. For example i dont think a full length movie is going to be encodable in 140 characters of javascript. But there would be a lot of stuff in there that might surprise us - just as i have been surprised dozens of times on dwitter at what is possible with a bit of human imagination.

    the number of possible distinct video streams in this sysetm is on the order of 64000^140th power. (64000 being a rough idea of how many utf8 characters there are, with 140 of them).

    Here is the thing. People decry the 'magic box'. But if you read about the new machine learning algorithms... nobody knows how they actually come to their decisions. So they are literally magic boxes. I dont know if Smoot had such a thing back then, but today... who knows?

    ----

    i also want to discuss anger and condescension. in other words, skepticism vs belief.
    skepticism is important, but so is belief. you must find a balance between these two.

  44. The blank personalized license plate by Latent+Heat · · Score: 2

    When I insisted that the null set should be included the enumeration of subsets in answering a problem on an Information Theory homework, the professor was reminded that his colleague and author of our textbook once asked for "the null license plate" (i.e. a blank plate) for his personalized license plate but was turned down as I was about to be in my request for points back on the assignment.

    Whereas there are certain plates you cannot have -- obscenities along with "1" or "A1" or similar license numbers reserved for high government officials or whatever the excuse -- the colleague insisted that a blank plate was not excluded by the rules.

    DMV stood firm in rejecting this request, but if anyone, anyone at all in the State of California should have been permitted to have the blank license plate, don't you think they should have granted it to the author of a graduate-level textbook in Information Theory, especially with so many high-ranking academic institutions in math and engineering in that state?

    My professor's take on why his colleague should have been granted the null plate is that it could have made it easier for the CHiPs seeing his colleague speeding down the Foothill Freeway, "Quick, get that car's license number! (partner to that officer rips off a blank sheet from a notepad) Here it is!"

    1. Re:The blank personalized license plate by Anonymous Coward · · Score: 0

      The problem with blank plate is cars ship with blank plates.

    2. Re:The blank personalized license plate by MancunianMaskMan · · Score: 2

      the problem is that automatic number plate recognition systems aren't null-safe: you could crash the whole system by de-referencing a null pointer and the world would go under because, you know, Terrorists!

  45. Two answers by dbIII · · Score: 1

    1/ Never heard of it before thus no opinion - odd question to ask since so many will also have no opinion.
    2/ If it is lost then it's technically irrelevant even if historically interesting.

  46. If it's possible, Google's AI will rediscover it by Ken_g6 · · Score: 1

    This sounds like a cross between Guetzli, an AI-optimized jpeg encoder, and Brotli, a zlib encoder with a predefined library. And Google loves looking for ways to compress YouTube videos.

    --
    (T>t && O(n)--) == sqrt(666)
  47. Re:Sloots by Anonymous Coward · · Score: 0

    Sloot jokes are whoreable.

  48. Read the book by KinkyClown · · Score: 1

    I have read the book "de broncode" some 10 (?) years ago. It read like a very exciting detective/mystery. The weird thing was that the book ends with lots of pictures and copies of documents that prove it's existence. In the end you really get the feeling that there was some foul play by some mayor company. It's not that difficult to make someone have a heart attack...

    1. Re:Read the book by RespekMyAthorati · · Score: 1

      The weird thing was that the book ends with lots of pictures and copies of documents that prove it's existence.

      But they don't prove that it worked!

  49. Example of Pseudoscientific claptrap by dbIII · · Score: 1

    Sloot's program was "lost" because it never existed, just like the magic 300 mpg carburetors where the plans were "lost".

    Offtopic maybe, but I had a guy with something like that bring it in to the University where I worked for independent testing. He was an artist with not a lot less technical skill than a mechanic, and paranoid as anything, but he couldn't hide what he'd done. The trick (on the perpetrator more than anyone else) is to tune the engine for maximum efficiency at idle - thus it uses very little fuel on a test bed with no load. Once you attempt to get it to do something useful it actually uses more fuel than if you hadn't messed around with it in the first place.
    Those people "knew it was real" - they had seen it in action in a test, and in their minds there was only some tiny little glitch that was keeping it from working in practice. Anyone with an actual clue about engines that attempted to point out problems was apparently part of a conspiracy.

    1. Re:Example of Pseudoscientific claptrap by RespekMyAthorati · · Score: 1

      He now works for Volkswagen.

  50. it cannot logically work. sorry. by gl4ss · · Score: 3, Insightful

    You're forgetting that it's 8kB ... plus the 370MB of data preloaded in the de/compression engine.

    look. it doesn't MATTER if it's 370mb of data or if it is 1000 terabytes of data. given a random movie file it would have to be able to tell apart the same movie but lets say a 10 seconds of it is just blacked out in the other version or replaced with a random text(this is to take out possibility of having the whole movie already in the 1000 terabytes of data). you cannot index chunks with such a little data to go match there. IT CANNOT BE DONE. if it could be DONE THEN THIS WOULD REVOLUTIONIZE HARDDRIVES AND ALL DATA TRANSFER - not just videos.

    besides 8000 bytes / 120 min is about 66 bytes per minute. thats like a byte per second.

    so basically, at least the claims are an order of magnitude more ludicrous than is possible at all.

    also a few years ago I ran into some startup guys who were claiming that (among other things) they had a new compression algorithm that could do 1gig to 100kb or some such... I tried to explain them the theoretical maximum they could attain is way worse what they were claiming (due to simple logic)... they never did anything with it or showed it of course - even if it would have been 1000000x more valuable than what their current company was doing. kind of lost faith in that then.

    furthermore you could embed 50 megs of data easily visually on a normal vcd movie of 650 megs. what this is suggesting is that you could embed any random data of 100 megs into 8 kilobytes. it doesnt add up. it cannot add up.

    --
    world was created 5 seconds before this post as it is.
    1. Re:it cannot logically work. sorry. by Etcetera · · Score: 1

      Sure it can. Have you seen what demo makers can do with procedural generation?

      Something that does a best-effort analog to digital conversion into equations and not samples would "compress down" that much because you're not really compressing at all, but re-creating from scratch into something approximating what was there. Think of an automated MOD/trakker creator.

    2. Re:it cannot logically work. sorry. by gl4ss · · Score: 3, Insightful

      yes I know what demo guys have done, and it's done by hand, tweaked to produce something that looks nice, and by that I mean the procedural algorithms are written by hand to make something that looks nice - not something arbitrary they have in their mind.

      it can look magical but really it is not. none of the demoscene demos break logical rules.

      and yes, something like elite frontier fitted on a floppy and had a billion stars to explore with planets. but that is not an argument for this video encoding scheme at all - it's not like that on some planet in there you might find a tv playing jaws - it cannot have that and you can know it cannot have that.

      however sloot claimed his video encoding to be so good that you could have on a floppy all the movies in the world- because recursion, if his scheme worked you could encode the next movie in the images in the credits scene of the previous movie - THIS SHOULD clue people in that it was just all bullshit, but for some reason doesn't? how hard is it for people to grasp this recursion problem of arbitrary information? it has nothing to do with if the compression scheme is procedural or not, it's just about if you can point to so much arbitrary data with so little data.

      he was claiming benefits wayyy beyond of an automated mod/tracker creator.

      look even if it was seeded with aqua teen hunger force episodes it still wouldn't be able to do the compression he claimed.

      like come on dude, he was basically claiming that he could do better video encoding than what the best image encoding could do. if he had something that worked he would just have patented it and made a bank.

      and if you're thinking of fractals - that too works only if you don't grasp bits and data, the pointers to the data in the fractal would have to have such a precision that it wouldn't be able to perform so well as he said. simply put, if you had kilobytes to represent a movie then that batch of kilobytes would have to represent multiple movies and that simply cannot work.

      --
      world was created 5 seconds before this post as it is.
    3. Re:it cannot logically work. sorry. by AmiMoJo · · Score: 1

      The limits you are thinking of apply to lossless compression. With lossy compression all bets are off.

      Say I have completely random white noise audio. With lossless compression it's mathematically impossible to compress it, because it is perfectly random. In practice, if I replace it with a true_rng() function the result will be indistinguishable to human beings, and I can encode that in 1 bit. So I can compress an infinite amount of random noise to 1 bit with this lossy scheme.

      Video uses lossy compression.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    4. Re:it cannot logically work. sorry. by yes-but-no · · Score: 1

      yeah.. it's like you have a million samples/points on the line y=x. That is you have to store (x1,y1), (x2, y2) .. a million pairs.. how much memory it takes? a naive approach is about 4MB (say 4 bytes per point). Of course you can just transfer (x_min and x_max) n just create the sample set (spacing them randomly/evenly within x_min, x_max). How much space? 4 bytes plus one say 8 byte equation-identifier. So totally 12 bytes. not 4 MB.
      The point is you are finding a deeper level abstraction.. ie going to the algorithm that generated the data and creating the data gain. Not just compressing the data.

    5. Re:it cannot logically work. sorry. by Rockoon · · Score: 2

      given a random movie file

      I can encode 100% of your random movie files in a single 16-bit LFSR.

      Nobody cares about compressing random data.

      Thats the fallacy of your argument. People only care about the coherent data.. hollywood movies .. family videos.. there are random elements, but the data is neither arbitrary nor can be said to be random. That pixel next to pixel 148675 is related to it. There is coherence.

      It is the coherence that is compressed. The random stuff is just replaced with different random stuff.

      --
      "His name was James Damore."
    6. Re:it cannot logically work. sorry. by Anonymous Coward · · Score: 0

      besides 8000 bytes / 120 min is about 66 bytes per minute. thats like a byte per second.

      Proof that it's impossible: There are about 1.8 million distinct movies on IMDB (or: over 1 billion videos on Youtube). Split them into one second chunks and delete all repetitions of chunks. We can safely assume that there are more than 256 distinct one-second chunks, even at 1990s "broadcast resolution". (This can also be shown by a simple instruction how to generate 1000 distinct one-second movie clips, e.g. clips of Count von Count showing the numbers 0 to 999). To make a new movie, the chunks can be combined in random order, thus they are pairwise independent, thus any compressed version of the whole movie will need at least the sum of the number of bits needed for each chunk. You need more than 8 bits to encode each second of a movie because if there were a collision, two distinct movies would be encoded as the same. QED.

      On the other hand, if the claim is restricted to "every movie ever made" (as the video says) and only IMDB listed movies count, a movie can be compressed to its IMDB number -- less than 20 bits per movie.

    7. Re:it cannot logically work. sorry. by Anonymous Coward · · Score: 0

      a movie can be compressed to its IMDB number -- less than 20 bits per movie.

      less than 21

    8. Re:it cannot logically work. sorry. by Shikaku · · Score: 1

      https://modarchive.org/

      So this site doesn't exist and isn't full of millions of songs?

      https://openmpt.org/download

      And this free software doesn't work that way?

      (very tongue in cheek, but it does make you think that if we already have the sound version out and long working, how much more crazy complicated the video version could be)

    9. Re:it cannot logically work. sorry. by Anonymous Coward · · Score: 0

      >it has nothing to do with if the compression scheme is procedural or not, it's just about if you can point to so much arbitrary data with so little data.

      The space of 'valid avi files' is far larger than the space of 'films', by so many orders of magnitude it's absurd. Don't confuse the first for the second.

      How many movies will ever exist? Let's pick a nice round number, and be a bit optimistic and say by the heat death of the universe we'll have 10^100 films to pick from.

      If you took a stupidly direct approach and had a sufficiently large computer, you could trivially code a 'decompressor' that just took in the 100 digit index number of a film and spat out the entire film from the database. (So you can 'store' an entire film in 332 bits)

      Similarly you could build a further 'compression' algorithm that's just a list of the 10^6 Esperanto films which just looks up the properly-formatted film name and returns the 100-digit code. (So you can store every Esperanto film ever made in 20 bits)

      At neither step would you ever be able to compress 'random_white_noise.avi', but it's a domain-specific compression algorithm, and nobody actually watches white noise.

      So the question is not 'is it computationally possible to get the film you want out of a few kilobytes with enough hardware and time to throw at the problem'. It's computationally possible.

      The questions are 'how little hardware and time can you optimize it down to' and 'how small do you want the file to be'.

    10. Re:it cannot logically work. sorry. by guruevi · · Score: 1

      You're going in the wrong direction though. Music you can encode in MIDI/MOD but it's not really the song, it's just the "score" and then your computer fills it in with instruments in it's local instrument bank (which is why your MIDI may sound different than mine). You're not really encoding the song output of eg. an orchestra where someone in the audience could cough or clap into a MIDI file.

      To do something similar for video, you're talking about giving someone the storybook to the movie and the computer would generate the entire movie, using pre-programmed likenesses of eg. Leonardo DiCaprio and a handful of other actors to regenerate the movie. It's possible for eg. video games but to encode a movie from film (where every frame is nearly completely noise to a computer) you can't use this method, modern methods track the movements of blocks of colors and with various other tricks try to encode that and with that we can get very decent lossy compression (300:1 or more, you could even go 1000:1 in some cases before it becomes unrecognizable). The compression talked about here is 40M:1 or better, it's not just a magnitude better, this was ~20 years ago so we're talking about competing with Ping and MPEG2 compression.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    11. Re:it cannot logically work. sorry. by guruevi · · Score: 1

      Sloot claimed a lossless algorithm. But even lossy algorithms have their limits, as I said before there are limits and there is a point where you'll compress everything so much, it turns everything into noise. As you encode something using lossy algorithms it's entropy increases (I think it's called Shannon entropy). You can basically encode but as you do, you add noise your ideal algorithm has to specify how "much" noise can be added, for humans it doesn't take long before things start looking weird and different or remove key elements of the image.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
  51. So Rainbows? by Anonymous Coward · · Score: 0

    Never heard of Sloot and, this being /., didn't bother with TFA...

    What I read is "Sloot Compression is a Rainbow table"
    -T

  52. Before Sloot Compression by Anonymous Coward · · Score: 0

    Long ago there was a tech maybe around VCU who figured out how to compress JCL job instructions so the lengthy description of a tape could be generated on the fly from a small string fragment which replaced the tape number in a JCL deck. Had something to do with quadraoctal numbers. Recreating the text was fast, but the initial compression could kill a mainframe. Data is data. You could do the same thing with pictures or video, but you better have a super computer sitting around for the initial compression.

  53. Prior art as evidence - Scansoft/Nuance QMAX PDFs by passionplay · · Score: 1

    1. We use PDF's today which when compressed are relatively small.
    2. We have a commercial entity called ScanSoft that also compresses PDFs
    3. PDFs generated by ScanSoft are incredibly much smaller than any other PDFs
    4. Scansoft PDFs use a compression algorithm called QMax
    5. Nuance which now sells the Scansoft PDF products have access to the same technology
    6. No other PDF technology in the marketplace has a compression algorithm that uses STANDARD decompression and unique COMPRESSION to achieve the ridiculous PDF sizes (try converting one of the PDFs and you will see)

    Sloot didn't HAVE to have a good DECODER, but even a BETTER ENCODER would do. If you add a better DECODER as well, then things get really interesting and the compression factor suggested becomes visible as a possible end goal.

  54. Art and science by AHuxley · · Score: 1

    Find a person talking in front of a not moving background.
    Try a resolution from the vcd or vhs years.
    Take time to look at every frame in great detail. Keep working on the data in each frame for a much longer time.
    Use a good cpu and gpu to really work the math per frame and each next frame, all frames.
    Then have an artist correct each frame by hand for movement or areas that should have not moved.
    Alter any code as needed.
    Try again.
    Code could be optimized for a set background and a talking presenter. Let good code, many gpu's and cpu's try again.
    Have the artist look at every frame again.
    Once some interesting results can be repeated try many different backgrounds. Some issue between the human moving and a different background might be more successful.
    If the gpu's and cpu's need a day to work on a very short series of frames but produce a good result, keep trying.
    Better cpu and gpu support can be added. A new idea about what code can do and what the artist suggests will take time.
    Try every method and theory that can be found per frame and for every frame.
    Also consider film, digital or other capture methods and how each frame is created.
    A very different, emerging or old and unexpected way of working with each frame might be better for the math per frame.

    --
    Domestic spying is now "Benign Information Gathering"
  55. $loot compression by mnemotronic · · Score: 1

    I've compressed 27 years of 401K retirement fund down to something that fits comfortably into a coin purse with room left over for a couple fentanyls. I wish investing was as easy as ingesting.

    --
    The Russians have won. They have made the world a cesspool of distrust, greed, fear and hate.
  56. an infinite chain of movies by gl4ss · · Score: 1

    no no no..

    demoscene demos work because they work within the limits of the machines - the machine is part of the creation. it's limits dictate the creation. it's not for reproducing arbitrary input and that would lead to a logic loop.

    you can agree that a video can have 140 characters? or 100x140 characters? basically the slooth claimed algorithm was so good that you could make a little logic play and include an infinite amount of hollywood movies(not just noise) in a chain of compressed links(any full lenght movie would have enough of sidechannel extra data visually in the video stream to include 2+ other compressed movies).

    the claims were just too good. and it is strange anyone would have thought of giving him the money without running the idea through some mathematician or two which would have claimed bullshit on it.

    like your scheme would work only if the movies were made for dwitter or some old pc or whatever. you can have pretty nice 64kbyte demos with music but that doesn't mean that you can reproduce any music video in 64kbytes, which was basically what slooth was claiming.

    --
    world was created 5 seconds before this post as it is.
  57. Re: Snootch compression? by KGIII · · Score: 1

    I am actually not bragging, when I say I have put my penis into a bunch of women. I am actually a wee bit alarmed at the number. It's a lot. I was a man whore for a long time. Really, it's a lot.

    I'm not sure if this makes me an authority, however. So, call this an anecdote...

    Not one woman, with whom I have conferred, has ever used that nomenclature. Yoni is my favorite, if you're curious. I've even dated professionals who were involved in women's reproduction health. None have used that term. Not even farm girls, or chicks with horses, have used that terminology. They even are from diverse global locations., but none have.

    It must be my lack of exposure...

    --
    "So long and thanks for all the fish."
  58. Adam Clark was his other name by any chance? by citizenr · · Score: 5, Interesting

    Adam Clarks Adams Platform:
    https://www.itwire.com/opinion...
    http://www.smh.com.au/business...

    Now you might think ok, this one was a scammer, but people vet those things, cant fool me twice, right?
    http://v-net.tv/2015/10/09/unk...

    5 years later VERY SAME "The company’s senior development team comprises: Adam Clarke"
    Adam Clark, of Adam’s Platform Technology (2004) "transfer a 1.3 gigbyte video file to a 1.4 megabyte floppy disk." strikes again in another scam :)

    Another one is Madison Priest's Zekko Corp:
    http://www.bizjournals.com/sac...
    http://jacksonville.com/tu-onl...
    http://jacksonville.com/tu-onl...
    Magic video compression turned out to be buried cable :D

    Want more video compression scams? Check out V-Nova Perseus - they promise 3x smaller files than h.264, but somewhat independent tests show 20% bigger files at same quality :) and the real kicker is Perseus is really just reencapsulated h.264 video with resize filter on top :D multi million dollar scam, they even scored one Sat TV network contract.

    --
    Who logs in to gdm? Not I, said the duck.
    1. Re:Adam Clark was his other name by any chance? by citizenr · · Score: 1
      --
      Who logs in to gdm? Not I, said the duck.
    2. Re:Adam Clark was his other name by any chance? by Anonymous Coward · · Score: 0

      strangly I was employed to investigate a simlar viddeo compression back in early 2000's.

      Usual stuff, full movie encoded to floppy 1.5MB, developer would only demonstrate on his laptop (top range one bought by his investors).

      He was very adverse to allow us to look at any thing on the laptop, but eventually after explaining to his investors (did'nt know at the time but probably Mob connected) they advised him it was in his best interest to allow us to have 30mins with it.

      Turned out, he hard hard copied all the MPEG to hard coded blocks accross the hard disk randomly scattered, the floppy had encoded references for his software to use to play the movie. We showed the investors what he had done, to say the least they were rather upset with him.

      Last I heard there was the possibility he didn't make it home....

  59. Totally Plausible by Anonymous Coward · · Score: 0

    If the Internet is the CODEC and a URL is the compressed data.

  60. Pied Pieper by Cyphase · · Score: 1

    According to the linked Wikipedia article on Jan Sloot, the guy he signed the deal with was named Roland "Roel" Pieper; so that's a point in favor of the Pied Piper theory.

    --
    by Cyphase ( 907627 )
  61. Compressing anything to anything else is easy ... by RespekMyAthorati · · Score: 1

    if you don't care about quality.
    Sloot's compression figures mean nothing unless we can evaluate how good the result is.
    Which we can't, since is no such codec actually exists.

  62. 300mpg Carburetors by Anonymous Coward · · Score: 0

    I once spoke with a retired GM engineer who was in on the corporate show-and-tell for that specific part.

    It would only get ~150 MPG in the heavy cars of that era, and used a low frequency RF source (15-35-ish Mhz I think) around the float bowl (I think), which helped atomize the fuel a lot better, or something (I think)

    In all other ways, it was a bog-standard carburetor of the era.

    Soon after, he asked around the department why nobody was talking about that carburetor and was advised to avoid asking further questions.

    My humble guess is that if the 300MPG carburetor was snake oil, the company men would have told him so.
    Instead, he was invited to "not ask questions."

  63. 300 mpg carburetors by Anonymous Coward · · Score: 0

    I once spoke with a retired GM engineer who was shown one of those magic carburetors during a corporate show-and-tell.

    It would only get ~150 MPG in the heavy cars of the era, but it worked by having an RF source mounted around the float bowl, emitting at a frequency of 14-35 MHz. (I forget the exact frequency he told me)

    In all other ways, it was a totally bog standard carburetor.

    He asked around the office a while later about it and said he was invited to avoid asking further questions.

    My humble opinion is; If the 300 MPG Carburetor was snake oil, the company men would have just told him it was BS.

    1. Re:300 mpg carburetors by dbIII · · Score: 1

      My humble opinion is; If the 300 MPG Carburetor was snake oil, the company men would have just told him it was BS.

      We didn't tell the artist his modification was BS because if we did he'd just call us part of the "conspiracy". We let him down gently and gave him enough hints that he eventually worked it out for himself.

      Seriously kids - with the amount of competition in the automotive market back then do you really think a car company would sit on a magic system that would massively improve fuel economy - especially during the oil shock? GM would have given anything (apart from replacing dud management) to get their market share back from the Japanese manufacturers.

    2. Re:300 mpg carburetors by Bob+the+Super+Hamste · · Score: 1

      I see you met my uncle's brother's friend. Too bad he was killed by some Saudi Princes after driving cross country in his Cadillac Fleetwood using only half a tank of gas.

      --
      Time to offend someone
  64. It may be possible, but isn't likely. by Anonymous Coward · · Score: 0

    Consider the extreme case of "compression" where the "codec" is actually all the movies, and the "file" is just the catalog number of the movie. Compressors lie on some kind of continuum between that and raw uncompressed files where the "codec" size is zero and the "file" is the file.

    If you had enough computing power and had analyzed a large body of data from the films, it seems possible that you could build a very large codec and have fairly small file sizes... but... I think it's unlikely that his compressor was real because it seems like the analysis and a practical decoder were out of reach for anything marketable in the late 90s.

    For example, it seems like your typical Pixar animation would have data files relatively small compared to the final rendering; but a reasonable rendering of the film takes massive computing power, even today. I might even be wrong about the size of the data files, since they could be using massive poly counts. In that case, he'd also have to have invented a very efficient format from which to render frames, or have some very sophisticated analysis which, IMHO, were out of reach at the time.

  65. I worked for one, sort of by swillden · · Score: 1

    Back in the mid-90s, the "VP" of engineering (given that he supervised a team of two programmers the "VP" title was more than a little ridiculous -- of course I was the more "senior" of the two programmers, and my title was "Director of Engineering") of the startup I was working for told me he had a great idea for incredibly-efficient video compression, which would make full streaming video practical over dialup. When I asked for details, he told me the key was to find an optimal representation of the video stream, then send that. Since that statement was somewhere between blindingly obvious and utterly stupid, I pushed some more. His answer, delivered in hushed tones, after looking over his shoulder to check the empty room for spies, was:

    "Here, do a thought experiment: Imagine a whole room full of transputers, all cranking away to find that one minimal representation of a frame of a movie. Now imagine a building full of them... can you imagine how optimalist the result would be?"

    I laughed out loud, couldn't help it, which pissed him off so much that a few minutes later when the other programmer came in and greeted the "VP" with "Good morning, sunshine", he threw his very expensive headphones at him. He missed, so they shattered against the wall. Then the VP went running to one of the co-founders to complain about being disrespected by the staff. Nothing came of it.

    That was far from the weirdest thing that happened at that place. My few months there ended abruptly one day when I arrived at work to find 20-foot flames shooting out of my office window. We're pretty sure that one of the co-founders torched the place to cover his theft of the ridiculously-expensive servers he'd purchased with the other co-founder's money, but nothing was ever proven.

    --
    Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  66. "massive 370Mb decoder" by Anonymous Coward · · Score: 0

    A 46.25 MB decoder doesn't sound too massive to me.

  67. German language book on the topic.. by Anonymous Coward · · Score: 0

    i read when i was a network fanatic :
    https://www.amazon.de/Supercode-Eine-Erfindung-die-brachte/dp/3431036325/ref=sr_1_3?ie=UTF8&qid=1496994005&sr=8-3&keywords=supercode

    For me it always had a pretty fictional aspect to it.

  68. fr-08 by Anonymous Coward · · Score: 0

    Something like this has already been demonstrated; The most famous example I know of is the demoscene 64k entry fr-08, which uses procedural generation to fit in 64k what other programs would use GBs of space, and even procedurally generating the music, which even if compressed as an MP3 would alone have taken up far more space than 64k.

    And although it is procedurally generated, it is not randomly generated (Which is what most people think of 'procedural' these days) so it runs the same way every time, consistent as MP4 playback.

    An MP4 of that demo would be far larger, and also would not scale infinitely to any resolution you chose like this tiny 64k program does!

    1. Re:fr-08 by Anonymous Coward · · Score: 0

      Your example is not general purpose audio compression.

  69. Re:Possible by gl4ss · · Score: 1

    do you know why simply having vast amounts of more memory in a computer doesn't make zip algorithm much better? it just doesn't help. you can't just make the dictionary larger and larger and expect benefits from it with arbitrary data, pointing to that dictionary is going to start taking the same amount of bytes as you're getting out from there, because otherwise your pointer to the dictionary would point to multiple entries and that just doesn't work if you want to decompress.

    same with the hash. it would be even expected that you would find a false match before the right one.

    --
    world was created 5 seconds before this post as it is.
  70. The answer by kbg · · Score: 1

    No.

    This is basically mathematically impossible. The problems is that even though fractals, random generators or any other formula that can produce every possible combination of output for some input, for example if you can find the value x for f(x) = y where y is the resulting file.In theory you only need to store x to generate the whole file. The problem is that you either end up with having to search through almost infinite space of numbers which means it takes the age of the universe to compress a single file or if you find the correct number x it is always larger than y making the whole thing useless.

    What Sloot likely had was an algorithm that was specially designed for some limited input of files, which the algorithm could reproduce from very small input since most of the input was stored in the codec itself. This is usless because the algorithm can't handle general files.

    Also any statement that says that the algorithm can always compress any input is false by definition. Because if that was true you could just re-compress the output until you end up with a single byte, which obviously wouldn't work unless there where only 256 distinct files in the whole world.

  71. Can i try it? by Lisandro · · Score: 1

    No? No opinion then, thank you.

  72. Source Code lost? by Anonymous Coward · · Score: 0

    20 years without a backup?
    Must have been using PC's.
    Backup software bundled with the OS is too painful to use.
    As it turned out, even death was preferable to trying.

  73. Proof by Anonymous Coward · · Score: 0

    The yt video doesn't actually say anyone watched all 16 vids to the end. Anyone can easily make a very short video file holding sixteen movies together as one. It's a bit different to actually prove each is independent all complete and a full movie.

    The video talks too much about big money and lost dreams and nothing about facts from technologists. The people who talk don't appeR to be able to spot bullshit.

    Of course he didn't want others to touch his
    Magic box, because anyone who tried to skip to a random spot and watch for
    Five mins would see it was just something like a 30sec clip and that's it.

  74. Bullshit by AntronArgaiv · · Score: 1

    Total, unmitigated bullshit.

    I'll leave it to others to post the mathematical proof, but there is a limit to lossy compression and kilobyte output from gigabyte inputs isn't anywhere near the realm of possibility.

    Why? Lossy compression is, at its core, the removal of redundant information, and its replacement with a smaller description of how to recreate what was removed. You can immediately see that your compression algorithm has to be able to efficiently (in fewer bits than what it took originally) describe what's been removed *so that it can be regenerated* (this is the tricky part).

    Secondly, and very important, different inputs will have different amounts of redundant information. This affects their compressibilities. A totally random waveform can't be compressed, because every bit of information is important. So, saying that *any* film can be compressed to kilobytes is a red flag.

  75. Re: Snootch compression? by Anonymous Coward · · Score: 0

    I am actually not bragging, when I say I have put my penis into a bunch of women. I am actually a wee bit alarmed at the number. It's a lot. I was a man whore for a long time. Really, it's a lot.

    I'm not sure if this makes me an authority, however. So, call this an anecdote...

    Not one woman, with whom I have conferred, has ever used that nomenclature. Yoni is my favorite, if you're curious. I've even dated professionals who were involved in women's reproduction health. None have used that term. Not even farm girls, or chicks with horses, have used that terminology. They even are from diverse global locations., but none have.

    It must be my lack of exposure...

    I am actually not bragging when I say I have put my penis into your mother a lot of times. I am actually a wee bit alarmed at the number. It's a lot. She was a whore for a long time. Really, it's a lot.

  76. Hmm... sounds like a book. by Anonymous Coward · · Score: 0

    If you 've got a good imagination you can turn a book into a movie in your mind.

  77. .the .product? by AkumaKuruma · · Score: 1

    this seriously reminds me of http://www.theproduct.de/

    a 64KB file that provided fully 3d rendered graphics and high quality music. this was obviously a static designed presentation so its easy to tune to that level, as well as being 3d graphics versus video frames.

    i see it being possible, but more as a post-content encoding method, than an on-the-fly method.

    Probable, MAYBE. Practical, prob needs high CPU/GPU acceleration to be able to encode on the fly, and possibly on decode too.

    1. Re:.the .product? by Megol · · Score: 1

      You should look at other demoscene products: www.pouet.net

      This (and other modern 64k and 4k demos/intros) isn't an example of lossless compression or lossy compression of video input, it is an example of procedural generation of textures, audio and objects combined with efficient compression. Everything is designed by humans to fit the design to a certain size and then packed as tightly as possible. It's absolutely not easy to do!

      Using similar techniques for video compression is possible. Theoretically that is. In practice it requires extreme amounts of computing power and in many cases wouldn't give good results. Unlike the manually generated content exemplified above a texture extracted from a video may require more data stored in a procedural format, the same applies to objects and audio.

  78. "OJ is innocent, they did it" by Thud457 · · Score: 1

    Probably the same shadowy cabal that did in Iron Butterfly drummer Philip Taylor Kramer

    --

    the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff

  79. The answer's not in the box,... by acoustix · · Score: 1

    it's in the band.

    I think I've seen this movie before.

    --
    "A plan fiendishly clever in its intricacies"- Homer Simpson
  80. Pigeon hole principle. by Megol · · Score: 1

    Yes it applies for lossy compression too.

  81. Re: Snootch compression? by Anonymous Coward · · Score: 0

    Wasn't it Iliza Shlesinger who used that term? At work and can't confirm it was for a vagina though.

  82. That is nothing!! by drolli · · Score: 1

    With my 20TB decoder engine, i can compress movies down to 4 bytes! (ok, you may have to update the decoder engine once new movies come out, but hey...).

  83. Look at this: endlesscompression.com by Trailer+Trash · · Score: 1

    Years ago I became familiar with a company called eTreppid that was making some crazy claims about compression technology while securing investment. The "Trepp" part is Warren Trepp (https://lasvegassun.com/news/2006/nov/19/trepp-often-seems-to-be-in-the-wrong-place-at-the-/).

    Anyway, see here for information about the compression:

    http://endlesscompression.com/

    Note that they reference Sloot and describe his methodology.

  84. THIS is what got Sloot (and Kramer) killed by Thud457 · · Score: 1

    If you have "video compression" that can procedureally generate any video, you can also generate any secret document from anywhere, ever. That makes some people nervous.

    Of course, they miss the point that you also can generate a magnitude more garbage. The hard part is discerning wheat from the chaff. Given a sufficiently advanced AI, this becomes trivial. Yeah, you see what I did there.

    --

    the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff

  85. Trying to decide by Anonymous Coward · · Score: 0

    I'm seriously considering cashing in my investment in the company perfecting it's pill which turns water into gasoline and investing in this video compression. They've amazingly discovered that an entire 3D panorama can be reduced to fit onto a 2D representation, which sounds like magic to me! 1920 x 1080 at a color depth of 8 bits and sampled at 30 Hz is ~224GB per hour. The clever bastards've gotten that down to 3 DVD disks ! Just think! You'll only have to change disks twice to watch an entire episode of Game of Thrones! Whoo Eee! Yeah, definitely time to make the move.

  86. VR might drive research here by abies · · Score: 1

    I mean, what if we move away from trying to reproduce individual pictures and their pixels and instead start transmitting the scene info? A TV show has a limited number of sets, what if the 3d model/texture info for most everything was transmitted in advance so that during playback all of that information is expressed in a few identifiers and their 3d transforms (ditto for the actors at some point too).

    I think we will get a lot of research into that direction if VR catches up. Sending high-quality 360 degress videos is already a pain bandwidth-wise, allowing 3d + some/lot lateral head movement would make it even worse. With real time rendering, you get most of these things for free, at same time opening door to things like at least basic interactivity, nudity/gore censorship (which is not neccesarily bad thing as long as it is a receiver-side option), content customization (tailoring avatars to specific audience) etc.

  87. Of course not by Anonymous Coward · · Score: 0

    mental illness doesn't prevent someone from writing reams and reams of code.

  88. You'd be lucky encode the script of a movie to 8k by David.R.Benham · · Score: 1

    Video and sound aside, I be astonished if you could compress just the text of the script of a full length film in 8k. Bottom line, impossible. Not even worth a more detailed analysis.

  89. Fraud by Anonymous Coward · · Score: 0

    He was a fraud.

  90. Impossible, except in specific cases (intros...) by GuB-42 · · Score: 1

    The idea here is to store a complete movie in a few kB.

    If we are talking about an actual movie as made by a director, this is impossible. There is simply too much entropy. The 8 kB stated size is just sufficient to encode an already dense screenplay to the theoretical limit. You probably can't even keep the original dialogue intact at that point.
    In theory, one could imagine an analyst AI able to turn a movie into such a script and forward it to a director AI which is able to recreate the movie. But it won't be the original movie but an interpretation of it. In the same way that a single play can change drastically depending on the performers.

    What is possible however is to make a movie especially designed to fit in a few kB. In fact, if you look at some demoscene productions, this is exactly it. Some 64k or less intros could qualify as small movies.

  91. Is it technically possible it worked? by kruhft · · Score: 1

    So the question is: is it technically possible that Sloot Compression, with its huge decoder file and tiny instruction files, actually worked?

    Yes.

  92. Monkeys no, Humans! by Anonymous Coward · · Score: 0

    It is like that the source code of P = NP is lost forever!.

  93. N^x-1 is a prime by Anonymous Coward · · Score: 0

    yada-yada: you better NOT (re-)invent this or (re-)discover this algo else you will find that the first movie that it spits out is exactly the same movie this sloot guy saw: the ring.

    anyways, there are more (random) neurons in the brain then stars in the universe and it all came from mom and dad watching the second movie that came from the algo: "how to make babies from two (half) cells".

    and to be a bit honest, it is totally possible to encode a huge (whole) number with a few prime numbers and symbols. the problem is how to "chop up" a huge number into the prime constituents?
    note: a (one) movie could be interpreted as one huge binary number. the non compressed movie file is say "500 MB". this could be interpreted as a YUGE binary number that needs "500 MB" to be stored. if this number could be chop-up into binary prime constituents then it could be represented by a number that only takes maybe "20 KB" to represent and thus only needs 20 KB of storage space ... for example (viz. goedel).

    end: i am confident that all kinds of axiomatic system could be invented that would allow this kind of compressions?

  94. Re: Snootch compression? by Anonymous Coward · · Score: 0

    Likely regional. See 'South Park', Hillary Clinton's Snuke, for a mass media example of this use.

  95. What "REALLY" happened by Anonymous Coward · · Score: 0

    He was killed by a man who evaded suspicion by driving away in an auto powered by the 100 mile per gallon carburetor, so he didn't have to stop for gas until he was far enough away to avoid suspicion!
    Rumor has it he ran into a problem when his water injector failed on the way, but he was able to use backup power from the perpetual motion generator to complete his escape. Finally he used use his anti-gravity ship to return to a secret lair. /The above is purely my own twisted sarcasm and contains no know factual basis - as I suspect do the accounts of this super-incredible compression algorithm

  96. Clothing compressed down to nothing. by Anonymous Coward · · Score: 0

    What a fookin' sloot.

  97. This article is entirely ridiculous by Anonymous Coward · · Score: 0

    Why on earth would anyone
    1) work on this project
    2) kill someone for working on this project

    Too many of the tinfoil hat wearing crowd on slashdot methinks

  98. follow the eyeball by Anonymous Coward · · Score: 0

    The trick is to map the eyeball position and tracking of a person watching the film for the first time. Then you only encode the parts of the film that were watched with acceptable compression, to be decoded when a film is being watched by someone else for the first time. The rest of the scene can be efficiently rendered as shifting low-res shapes vaguely following the action. This scheme has the advantage of allowing the film to only be viewed once, as a repeat viewer would likely try to focus on parts of the scene not viewed the first time.

  99. Re: Snootch compression? by KGIII · · Score: 1

    Snatch and another AC pointed out a similar word used in South Park. (I've only seen a few episodes.)

    Not one woman, that I've ever known, has used that word. Snootch? I have heard "gooch." (Spelling?) Never snootch, however, and I've put my penis into a bunch of women.

    --
    "So long and thanks for all the fish."
  100. Re:Sloots by Anonymous Coward · · Score: 0

    My father said you should watch out for Sloots.

    I say we compress the hell out of sloots. That will show them!

  101. Re: Snootch compression? by KGIII · · Score: 1

    Nice. Thanks for paying for my shoes.

    --
    "So long and thanks for all the fish."
  102. So basically Sloot is Adobe Flash... by Anonymous Coward · · Score: 0

    It could recreate Homestar Runner videos.

  103. Context specific compression can be very effective by Anonymous Coward · · Score: 0

    If you know and really understand the context, and are willing to throw a lot of compute at the effort, a context specific compression approach is quite possible. Resulting in significant size reduction. Even more if this is a lossy compression.
    I make ZERO claims about the veracity of the approach.

  104. yeah sure Ive seen the source code by Anonymous Coward · · Score: 0

    it's on a USB boot in the back of a prototype sedan running a secret carburetor and getting 100mpg on a 454 V8.

  105. Works great...sort of by Anonymous Coward · · Score: 0

    Machine learning data compression algorithms will probably beat it.

    Of course what you see at the other end of your 1kb/sec byte stream might be closer to what your parents think you said than what you actually said.

  106. Not a math guy ... by Renaissance+Slacker · · Score: 1

    What if Sloot found a way to rapidly Godelize and deGodelize huge files? Maybe he didn't realize that this would make encryption obsolete? I'm sure our guys (if not THEIR guys) would make all that go away.

  107. Think about what this would mean by jgoemat · · Score: 1

    were supposedly only kilobytes in size

    'kilobytes' I'm assuming means less than a megabyte. Let's round up and say 8 million bits of information. At 24fps there are about 130,000 frames in a 90 minute movie. That means each frame is encoded in about 61 bits, less than 8 bytes, 0x0000000000000000 to 0x1FFFFFFFFFFFFFFF in hex. That's about 1.4 kilobits per second, 1/8th what you need for quite horrible sounding audio only compression. Someone isn't thinking straight...