Dropbox Open Sources New Lossless Middle-Out Image Compression Algorithm (dropbox.com)
Dropbox announced on Thursday that it is releasing its image compression algorithm dubbed Lepton under an Apache open-source license on GitHub. Lepton, the company writes, can both compress and decompress files, and for the latter, it can work while streaming. Lepton offers a 22% savings reductions for existing JPEG images, and preserves the original file bit-for-bit perfectly. It compresses JPEG files at a rate of 5MB/s and decodes them back to the original bit at 15MB/s. The company says it has used Lepton to encode 16 billion images saved to Dropbox, and continues to utilize the technology to recode its older images. You can find more technical details here.
Even if I cross-compile for my 2MHz TRS-80? Amazing!
This headline sounds a lot like a press release from Pied Piper, the fictional company in the TV show "Silicon Valley".
Nice to see Pied Piper is finally getting their tech out there. Their startup has been so rocky, especially since they started feuding with Hooli.
does any one have knowledge about how this compares to other compression algorithms? also wonder if they are releasing this because they have lepton2 or whatever now?
Is this actually a "middle-out" compression, or is that just a joke? Do we know what the Weissman score is?
Tip-To-Tip compression really works!
No wonder everyone thinks that is silly.
It can both compress *and* decompress.
I'm all for companies open-sourcing cool algorithms. But not a great choice on the name. There are already several products out there called 'Lepton'. There's a software CMS, and also FLIR's thermal sensors are branded 'Lepton'. (Worth noting - Lepton IS an actual word so it probably won't qualify for Trademark protection. But an Apple Music vs. Apple Computer like scenario is not impossible to conceive.)
I worked as a part-time assistant in Data Structures and Algorithms course 10 years ago in Helsinki University of Technology.
JPEG is a lossy compression algorithm. It does not preserve the image. It creates these blocks of image data and then compresses them using Huffmann encoding. Same encoding is used in zip-files.
Dropbox's algorithm uses these same blocks JPEG algorithm produces (meaning, that the information is still lost in compression), but uses a clever way to compress them and ditches Huffmann encoding entirely.
So, the old process was:
1. Encode image into coefficients (lossy)
2. Encode coefficient blocks with Huffmann encoding
The new process is:
1. Encode image into coefficients (lossy)
2. Encode coefficient blocks with Lepton
Pfft.. too little, too late. JPEG is "good enough" and I don't want a huge clusterfuck of incompatibility problems with my libraries.
already compress much better?
We've been here before. JPEG2000, webp, BPG, JPEG XR. There are many formats that are superior to JPEG. And look - none of them caught on!
Why? Because JPEG, though far from the best modern algorithms could offer, is still 'good enough' for most purposes. It's also supported by every web browser, photo viewer, image editor, mobile phone, camera, digital picture frame, slideshow maker and every other thing that might need to process an image. A new format, no matter how superior, cannot offer the same ubiquitous support - and without that support it will never become widely used enough for developers to spend time including support for it.
We can't even get rid of MP3, and there are more formats than I can count both open and proprietary that could so everything MP3 does but better.
So a program named literally "Lepton under an Apache" that happens to also, confusingly, be an open source license (*and* a program)?
Okaaaaaaay.... ...Took me like a minute to figure out it was saying
"...dubbed 'Lepton,' under an Apache open-source license..."
and from a drunk, no less. A dead drunk, at that.
Phil Zimmerman.
Look it up.
Everybody should use DOKAN and ENCFS on cloud shares such as DropBox.
Compress that :)
I feel bad for our boys....
Dropbox' software is called "lepton". There is an image-processing library called Leptonica — could someone comment on the relationship, if any?
In Soviet Washington the swamp drains you.
All hail pied piperer.
The guy from piped piper
https://github.com/danielrh
So this means instead of getting 5 GB free storage, I should get 22% more if I'm storing JPEGs, so I get 6.1 GB free storage now? ;)
Better known as 318230.
It is because the idiots in JPEG 2000 committee did everything to keep people, especially web browser development teams away from that excellent format.
Now 4K monitors and ultra resolution phones around, watching web developers struggle with 5-6 different files of same photo, I really feel pity. That was a solved problem, both multiple bandwidth& resolution and the compression rate.
There is a reason we deal with JPEG files today, ask JP2 committee. Even MS stayed away from it fearing the patents.
Much of their compression comes from they dont use full 32 bit floats or integers to store the discrete cosine transform coefficients, but variable bit length numbers which can be squished more tightly. I didnt read the paper deep enough to study how efficient this bit hacking is machine operations. There might be few clever tricks there. Bit hacking was more common in the early days of computers when core memory was very expensive. I recall Woz had some clever way of compressing color and shape graphics in the Apple II.