A Fictional Compression Metric Moves Into the Real World

← Back to Stories (view on slashdot.org)

A Fictional Compression Metric Moves Into the Real World

Posted by Unknown on Monday July 28, 2014 @08:31AM from the best-thing-since-sliced-scatterplots dept.

Tekla Perry (3034735) writes The 'Weissman Score' — created for HBO's "Silicon Valley" to add dramatic flair to the show's race to build the best compression algorithm — creates a single score by considering both the compression ratio and the compression speed. While it was created for a TV show, it does really work, and it's quickly migrating into academia. Computer science and engineering students will begin to encounter the Weissman Score in the classroom this fall."

3 of 133 comments (clear)

Min score:

Reason:

Sort:

Re:Bullshit.... by gweihir · 2014-07-28 08:59 · Score: 4, Insightful

There is no possibility for a useful single metric. The question does obviously not apply to the problem. Unfortunately, most journals do not accept negative results, which is one of the reasons for the sad state of affairs in CS. For those that do, the reviewers would call this one very likely "trivially obvious", which it is.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Re:Useless without measure of lossiness/distortion by viperidaenz · 2014-07-28 09:12 · Score: 4, Insightful

In the TV show only lossless compression was being considered, so MP3 would fail.
Re:Bullshit.... by nine-times · 2014-07-28 09:20 · Score: 4, Insightful

Can you explain in more detail?
I'm not an expert here, but I think the idea is to come up with a single quantifying number that represents the idea that very fast compression has limited utility if it doesn't save much space, and very high compression has limited utility if it takes an extremely long time.
Like, if you're trying to compress a given file, and one algorithm compressed the file by 0.00001% in 14 seconds, another compressed the file 15% in 20 seconds, and the third compressed it 15.1% in 29 hours, then the middle algorithm is probably going to be the most useful one. So why can't you create some kind of rating system to give you at least a vague quantifiable score of that concept? I understand that it might not be perfect-- different algorithms might score differently on different sized files, different types of files, etc. But then again, computer benchmarks generally don't give you a perfect assessment of performance. It just provides a method for estimating performance.
But maybe you have something in mind that I'm not seeing.