Slashdot Mirror


Will Compression Be Machine Learning's Killer App? (petewarden.com)

Pete Warden, an engineer and CTO of Jetpac, writes: When I talk to people about machine learning on phones and devices I often get asked "What's the killer application?". I have a lot of different answers, everything from voice interfaces to entirely new ways of using sensor data, but the one I'm most excited about in the near-team is compression. Despite being fairly well-known in the research community, this seems to surprise a lot of people, so I wanted to share some of my personal thoughts on why I see compression as so promising.

I was reminded of this whole area when I came across an OSDI paper on "Neural Adaptive Content-aware Internet Video Delivery". The summary is that by using neural networks they're able to improve a quality-of-experience metric by 43% if they keep the bandwidth the same, or alternatively reduce the bandwidth by 17% while preserving the perceived quality. There have also been other papers in a similar vein, such as this one on generative compression [PDF], or adaptive image compression. They all show impressive results, so why don't we hear more about compression as a machine learning application?

All of these approaches require comparatively large neural networks, and the amount of arithmetic needed scales with the number of pixels. This means large images or video with high frames-per-second can require more computing power than current phones and similar devices have available. Most CPUs can only practically handle tens of billions of arithmetic operations per second, and running ML compression on HD video could easily require ten times that. The good news is that there are hardware solutions, like the Edge TPU amongst others, that offer the promise of much more compute being available in the future. I'm hopeful that we'll be able to apply these resources to all sorts of compression problems, from video and image, to audio, and even more imaginative approaches.

59 comments

  1. Isn't Pied Piper already doing that? by Ecuador · · Score: 3, Funny

    I thought the Pied Piper platform already uses ML to improve compression, right?

    --
    Violence is the last refuge of the incompetent. Polar Scope Align for iOS
    1. Re:Isn't Pied Piper already doing that? by Anonymous Coward · · Score: 0

      What's their Wisemann score?

    2. Re:Isn't Pied Piper already doing that? by Anonymous Coward · · Score: 0

      The real question is does Mike Judge own the patent?!?

  2. Patent It and Get Rich by Anonymous Coward · · Score: 2, Insightful

    Perhaps you should get a patent for this vague description of a mathematical expression. We have learned with the right Supreme Court justices, you can circumvent established case law.

  3. Talk to other people by houghi · · Score: 1, Interesting

    When I talk to people about it, I get a "Shut up Nerd. Don't try to weasel out of the real issue. It is your turn to pay a round."

    --
    Don't fight for your country, if your country does not fight for you.
    1. Re:Talk to other people by Anonymous Coward · · Score: 0

      When I talk to people about it, I get a "Shut up Nerd. Don't try to weasel out of the real issue. It is your turn to pay a round."

      Also, I worked in video compression. It is boring. What's on tap here?

      Sure, I can watch shitty cable and say "Oh my, look at those blocking artifacts!" but after a while who cares?

  4. Weird startup pitch by Anonymous Coward · · Score: 0

    - Hey phone manufacturers, increase CPU performance 20x and then license our ML compression algorithm for a 17% in space savings!
    - Thanks, but we'll increase storage space by 17% instead.
    - Oops, haven't thought of that. Have a nice day good sir.

    1. Re:Weird startup pitch by Anonymous Coward · · Score: 0

      Compression for video delivery is not mainly for saving storage space (if anything you could compress videos better on the server side), it's for bandwidth.

    2. Re:Weird startup pitch by Anonymous Coward · · Score: 0

      I guess you're right about video delivery. Given an HD video rate of 3.6 GB/h, an ISP price of 0.20 $/GB, and an electricity price of 0.12 $/kWh, to get 17% savings on bandwidth you should be willing to use hardware consuming up to 0.17 * 3.6 * 0.20 / 0.12 = 1.02 kW.

      But the author might be wrong to think that these benefits apply to compression in general. For other applications (transferring a wedding video from a phone to a desktop), only storage space matters.

  5. NB by Artem+S.+Tashkinov · · Score: 1, Informative

    A quick remark: this could theoretically work for content delivery but it will be unsuitable for video archival where picture fidelity and lossless transfer are paramount.

    1. Re:NB by wed128 · · Score: 1

      Different tools for different jobs -- most video compression is lossy.

  6. Brloh:n by Anonymous Coward · · Score: 0

    There’s your compression.

  7. First Thing AI will do by Anonymous Coward · · Score: 0

    Compress itself into a singularity.

    caption : compel

  8. Voice recognition is improving rapidly? by OneHundredAndTen · · Score: 1

    So the article claims. Where is it improving rapidly? I interact with the Google Assistant and Alexa on a regular basis, and they seem to be just as limited and non-discerning as they have always been. I still have to speak slowly to them, while articulating carefully. And it still is the case that it does not take much in the way of background sound to throw them out of kilter. Thus, where is voice recognition improving rapidly?

    1. Re:Voice recognition is improving rapidly? by Waffle+Iron · · Score: 2

      To rival humans at voice recognition, these assistants would need to do at least two things that actual humans already do:

      1: Constantly listen in on your life (and possibly watch it with a camera), so that it can maintain a real-time context to interpret any ambiguous verbal information. Waking up the assistant only after it hears its name is not sufficient because without context you need to be extremely clear to establish what you're talking about. I've noticed that even talking to people, when you switch the topic to a totally new subject, it often requires a "handshake" where you tell them what you're going to talk about and then they acknowledge that they're on the same page with you.

      2: Have the assistant interrupt the human immediately in mid-sentence when it doesn't understand something, with something like "Huh?" or "What?". Real people do this so often that we don't even notice it, but right now it would seem incredibly rude if a bot did that.

    2. Re:Voice recognition is improving rapidly? by angel'o'sphere · · Score: 1

      I still have to speak slowly to them, while articulating carefully.
      You make the same mistake Newton testers made.

      You try to adapt till the machine "understands" you ... but the machine is actually trying to understand you without adapting. With your "articulate carefully" (and changing that articulation daily) you are just a moving target for the machine.

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
  9. Re:nn is amazing with some images by Rei · · Score: 2, Interesting

    It should be obvious that neural nets are great at this sort of thing. That's how our brains record and recall events. They're not registering a stream of pixels or waveforms and zipping them up, they're registering chained concepts. Every time we remember, we piece these concepts back together, so yes, there is a lot of "imagination" filling in data in even our most detailed memories. But just like that can work for our brains, it can work for computers.

    --
    "What is the difference between a Ponzi Scheme and an Investment Bank?" -- Jon Stewart
  10. Re:nn is amazing with some images by OneHundredAndTen · · Score: 1

    "Through out"? Really? Well, at least you can spell "through".

  11. "preserving the perceived quality" by QuietLagoon · · Score: 4, Interesting

    Preserving perceived quality by which metrics? Comcast recently moved to downgrading the quality of the HD cable programming it provides. Some people see a significant problem with the downgraded video, especially during action video such as sport events. Yet Comcast says that ~the perceived quality is the same.~ So I ask again, how is "perceived quality" going to be measured? And by whom? By those who want to push out the new technology for monetary gain, or by those who are subject to the inferior results of the new technology?

    1. Re:"preserving the perceived quality" by Anonymous Coward · · Score: 0

      Preserving perceived quality by which metrics? Comcast recently moved to downgrading the quality of the HD cable programming it provides. Some people see a significant problem with the downgraded video, especially during action video such as sport events. Yet Comcast says that ~the perceived quality is the same.~

      So I ask again, how is "perceived quality" going to be measured?

      Humans literally grade outcomes when testing lossy compression algorithms. Perception is too human specific and subjective to fully entrust to algorithms.

      The trick of course is having professionals with interests aligned with reality doing the work. If you rely on PR/marketing hacks employed by cable companies making determinations you've got problems.

      For Comcast the perception of quality is weighed carefully against your cable bill. More and more people are perceiving they in fact are being ripped off and as a result running with much haste away from Comcast

      When you call to drop cable they will say you have some insanely fast Internet package that costs the same as whatever you were paying for Cable itself. Don't get pw0n3d by Comcast retention scams, get that shit knocked down to the lowest tier.

    2. Re: "preserving the perceived quality" by Anonymous Coward · · Score: 0

      You mean they are in the process of reducing quality to charge higher fees for higher quality in the future. Got it.

    3. Re:"preserving the perceived quality" by Matheus · · Score: 1

      Reality of the business world: Yes of course they are going to try to deliver less for the same or more $. Capitalism blah blah blah...

      That being said: Although the article uses the phrase "perceived quality" they aren't necessarily differentiating between that and "real" quality. The sentences work just fine when the compression is lossLESS meaning they can accomplish that increase in quality (resolution / bit depth) at same bandwidth or reduce bandwidth for same quality by that definition.

      That doesn't mean they won't choose a more aggressive lossY format to further milk you for your dollars but that's not implied or necessary within the bounds of this article's premise.

    4. Re:"preserving the perceived quality" by ShanghaiBill · · Score: 1

      NN compression is inherently lossy. For instance, denoising autoencoders filter out "salt-and-pepper" noise in an image, so obviously you aren't outputting the original.

    5. Re:"preserving the perceived quality" by antdude · · Score: 1

      So, what did Comcast say after those?

      --
      Ant(Dude) @ Quality Foraged Links (AQFL.net) & The Ant Farm (antfarm.ma.cx / antfarm.home.dhs.org).
  12. Re:nn is amazing with some images by Anonymous Coward · · Score: 0

    Yeah I saw that 10 microseconds after hitting submit, typical me.

  13. Say what now by SuperKendall · · Score: 2, Interesting

    I think you are more than a little confused if you think the Supreme Court has anything to do with patents...

    This is what partisanship does to your brains kids. Don't be partisan, learn how systems actually work, and offer thoughtful critiques instead of running around in a blind panic saying things that are outright wrong everywhere you go.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
    1. Re:Say what now by Aighearach · · Score: 1

      Wow, you didn't know SCOTUS has anything to do with anything? Yikes. Imagine how long you've been on the internet, too. And yet.

    2. Re:Say what now by SuperKendall · · Score: 1

      Wow, you didn't know SCOTUS has anything to do with anything?

      Here's how that read to normal or informed people:

      "Yarble blargh FRONOB FRONNOB FORNNOB!!!!!!!!!!!!!!!!"

      Yeah, keep working at it man. Someday you'll be intelligible, even if still ignorant of actual law.

      I'll let you have the last response because FORMNOB!

      --
      "There is more worth loving than we have strength to love." - Brian Jay Stanley
    3. Re:Say what now by Anonymous Coward · · Score: 0

      "I think you are more than a little confused if you think the Supreme Court has anything to do with patents..."

      Here you go:
      https://writtendescription.blogspot.com/p/patents-scotus.html

      BTW, for the other Search starved and Brain dead:

      "Article I Section 8 | Clause 8 – Patent and Copyright Clause of the Constitution. [The Congress shall have power] “To promote the progress of science and useful arts, by securing for limited times to authors and inventors the exclusive right to their respective writings and discoveries.”"

      This was incredibly important for the time; it's just trivia now. But it's why we have a Library Of Congress instead of a Library Of the United States, and why the Patent and Copyright Offices are under Congressional control, rather than that of the Executive branch.
      And just who is it that arbitrates between the two on Constitutional issues?

    4. Re:Say what now by Aighearach · · Score: 0

      Here's how that read to normal or informed people:

      That's a nasty type mismatch error. Are you saying that compiles for you, or do you just not understand your own words? That would certainly be average, or informed to the average level. But yeah.

      Protip: People who don't understand the basic vocabulary of the subject... are not well informed. And no, I really do not care if people who wear their base ignorance on their sleeve can understand my words. I don't type it out for them, I type it out for people who matter. It doesn't bother me if it is only two or three people out of the thousands that read it.

    5. Re:Say what now by Anonymous Coward · · Score: 0

      You code node.js. That makes you an expert on like, everything. Please write us some javascript to cure cancer because you read on the internet somewhere that it's like, you know, that easy for nerds who code web apps. They're like, you know, the smartest people around.

  14. There doing ML with audio compression too. by Anonymous Coward · · Score: 0

    I saw a paper on audio stream compression recently that was doing good compression. The quality was not quite there yet.

  15. Re:nn is amazing with some images by Anonymous Coward · · Score: 0

    Of course, he'll blame it on his Chinese copy of gramma, rly?

  16. Average file for average compression. by Anonymous Coward · · Score: 0

    I, too, am looking forward to a better age of compression.
    Current compression algorithms compress a fairly average way based on a fairly typical use-case for each file type.
    This leads to situations where certain structures in some files straight-up don't compress well at all.
    You can tell this when you chain multiple compression systems together that you can usually shave off some many Xbytes. And that's using yet another average compressor on top of another.

    But as you say, we really need solid accelerators for hardware compression. It's been talked about for years but now we are finally getting to the kinda core-counts needed to make it at least half-way feasible at the business level.
    Even ray-tracing is beginning to become practical in some scenes. In a few major generations of processor design, it'll be even more common.
    Exciting times in computing hardware are ahead. Good industry to be in right now before it gains some real momentum. Specialized processors are making a comeback!

  17. The 1000x celeb compression results are hilarious. by Anonymous Coward · · Score: 0

    Compare the compressed and original picture. They have just put another guy that happens to be in DB and looks kinda similar.
    Imagine now Comcast reduced plan contract: Netflix is permitted provided that every actor looks like Brad Pit.

  18. Re:nn is amazing with some images by Jeremi · · Score: 4, Funny

    It should be obvious that neural nets are great at this sort of thing. That's how our brains record and recall events. They're not registering a stream of pixels or waveforms and zipping them up, they're registering chained concepts.

    Video codec A: Han shot first!
    Video codec B: Really? That's not how I remember it.

    --


    I don't care if it's 90,000 hectares. That lake was not my doing.
  19. Options by jd · · Score: 3, Interesting

    1. Caching.

    Remember, all that matters is the bandwidth of the most constrained point you traverse. And that's typically going to be near the servers, as point-to-point connections are highly inefficient.

    If you place frequently-accessed content downstream, much nearer the recipient, you eliminate the need to use any of the pipes along the constricted regions.

    Experiments I did on this in the 1990s, when the problem was at its worst, showed that you could get a 60-fold improvement in quality of experience, well beyond anything compression can achieve.

    2. Multicast

    Most people are used to a few seconds delay before streaming starts. If N people request the same content over a 5 second period, then delaying the first person by 5 seconds won't be perceived as abnormal. You're now transmitting one copy per path. This is an ideal way to populate the aforementioned caches, it would be useful for server-based content only if no caches exist.

    3. Fractal compression

    Technically, wavelet. Used in the BBC's Schrodinger codec. Produces far better compression than typical codecs.

    4. Better pipes

    Most of the rest of the world is already operating at bandwidths between 100-10,000x that common in the U.S., with U.S. cable companies have either prosecuted those offering higher speeds or driven heavy equipment through their cables. It is time to stop accepting this as the cost of doing business.

    The U.S. should mandate 50 gbps to the home, the highest speed available elsewhere on a large scale. If you're going to be the best, you have to be the best. The U.S. should not be an also-ran. A fat tree is impossible, at those speeds. Realistically, I doubt you could get a block to handle more than 200 gbps. Which is adequate.

    Tier 1 is fine, just have a mesh network. Current SDM transmission rates are 111 tbps, which means you can support 555 blocks of houses off a single seven core cable. GMING can supply the info on how you'd actually get that to work on a metro level.

    You're really not going to need to do a whole lot of compression at those speeds.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    1. Re:Options by Aighearach · · Score: 1

      The problem with upstream caching is that it is a type of man-in-the-middle attack, and there is a strong tendency to abuse it.

    2. Re:Options by yes-but-no · · Score: 2

      compression is good for various other reasons like storage; there is a trade off CPU versus network bandwidth between sender and receiver. It's related to trading code versus data (generator vs look-up table). If you can have a small algorithm to generate the nth prime number, you don't hv to carry around a huge table of primes.
      For various applications the inter-node data traversal is the costly operation (fancy eg an intelligence in a far away star wants to send information, it's best it compresses it and may be send the decoding code/algorithm along).
      One reason why platform like twitter survived is because it forced humans to compress our thoughts into a few lines instead of a whole page of prose like a blog/fb post.

    3. Re:Options by wisnoskij · · Score: 1

      > all that matters is the bandwidth of the most constrained point you traverse. And that's typically going to be near the servers, as point-to-point connections are highly inefficient.

      What universe do you live in. The bottleneck is always at the users end. Businesses, enthusiastic fans, just sites in general have no problem with bandwidth. They can pay for bandwidth 10 times over with a few ads. But the consumers are often stuck with 10GB per month plans. It does not matter where the data is coming from, the only thing that matters is the size.

      Until ISPs start charging for long distance data, where the data is stored is a non-issue.

      --
      Troll is not a replacement for I disagree.
    4. Re:Options by wisnoskij · · Score: 1

      > The U.S. should mandate 50 gbps to the home

      I don't understand why bandwidth keeps increasing so fast. Who needs to watch 2000 youtube videos simultaneously? Outside of owning some business or operating some site, what is the point of anything over a couple Mbps? Are sites even compatible with 50 gbps? Will even the Behemoth that is steam allow you to download the latest 100 GB game in 16 seconds?

      --
      Troll is not a replacement for I disagree.
  20. fuknrdts by Anonymous Coward · · Score: 0

    fukrdts udntndmchnlrng

    1. Re:fuknrdts by Anonymous Coward · · Score: 0

      I can't read this. And we do need machine learning, dammit!

  21. Re:nn is amazing with some images by Anonymous Coward · · Score: 0

    "Through out"? Really? Well, at least you can spell "through".

    To be fair, "out" is also spelled correctly. ... but I sadly could not find the referenced quote.

    Also sadly, as "AC" I will never see any follow-up post. As you were..

  22. To study and research more. by Anonymous Coward · · Score: 0

    Let's differentiate classifier vs compressor.

    There are 2 kind of compressors: lossy vs lossless.

  23. Compression == Encryption by Anonymous Coward · · Score: 0

    Yup tey are the same thing. The difference is what is the seed key. Is it publicly known or not. That's it. So will AI build in back doors to itself to allow for recoverable keys. Who controls that feature?

    1. Re:Compression == Encryption by wed128 · · Score: 1

      Theyr'e very much not the same thing.

  24. Soon. by Anonymous Coward · · Score: 0

    Screenplay -> Neural Network -> Motion Picture

    I mean, let's be honest, if you tell me to imagine "a white room and Brad Pitt is standing in the middle of it wearing a blue jacket", does it really matter that in my mind he looks like he did in Fight Club and in your mind he looks like he did in Meet Joe Black? Look at how much data we saved by sharing text instead of an image.

  25. Compression - No : Debug - Yes by Anonymous Coward · · Score: 0

    Meh, compression. AI, machine learning, network technologies. et. al.; the heavy lift for machine learning will tackle friction points that consume hard money, real brain power and lay waste to precious resources - Debug

    A reverse revolution against entrenched momentum, human language and bridgeway to complexities

  26. No by Anonymous Coward · · Score: 0

    Skynet is Machine Learningâ(TM)s killer app

  27. Re:nn is amazing with some images by Anonymous Coward · · Score: 0

    Once you get above a Weisman score of 5 or 6 the visible effect on the application becomes negligible. Nothing to see here.

  28. The never-ending march of technology. by ColaMan · · Score: 3, Interesting

    Most CPUs can only practically handle tens of billions of arithmetic operations per second,

    Got to wonder how a computer scientist from 30 years ago would react to this casual statement. I'm going with something along the lines of, "Great Scott!"

    --

    You are in a twisty maze of processor lines, all alike.
    There is a lot of hype here.
  29. Nope by Anonymous Coward · · Score: 0

    Strong AI will be ML's killer app. Perhaps both figuratively and literally.

  30. Compassion? by Anonymous Coward · · Score: 0

    I first read is as "compassion" such disappointment at realising my mistake....

  31. plenty of ML killer apps already by Anonymous Coward · · Score: 0

    translation, OCR, Facial Recognition, speech-to-text, text-to-speech, personal assistants, chess/go/any-other game, predictive typing, energy management, ...