Slashdot Mirror


A New Web Image Format

MrP- writes: "BetaNews is reporting that a company called LizardTech has developed a new image format for the Web called DjVu." Apparently, it differentiates between forground and background components of an image, and compresses each appropriately. Good idea, but I'm skeptical of improvements (especially because they say it's "20 times faster then gifs" -- which measure compression in terms of speed? And they also say it compresses faster then pdf, but pdf isn't really an image format). No Linux support. And I don't see any source code on the format, so don't expect it to get a lot of support on any major Web sites, regardless of the compression.

154 comments

  1. Re:Yeah, but... by xcedrinod · · Score: 1

    Nude: girls naked. Naked: guys nude. hehe

  2. Re:Boycott DjVu - Use PNG ! by Vincent+Bernat · · Score: 1
    Which is more important - open standards or better technology?

    Open technology.

    Even if this company ports its format on a variety of OS, I won't be able to use it in my own products, I won't be able to use it with my favorite browser, etc. See what happened with GIF.

  3. Slashdot, June 28 1998 by plaa · · Score: 1

    New Image Compression Algorithm claims 1000:1 ratio

    Talk about posting the same story twice...

    --

    I doubt, therefore I may be.
  4. Why are people so stupid? by GoRK · · Score: 4

    DjVu is almost three years old. It was developed by AT&T not LizardTech. LizardTech just bought it about 8 months ago. It is not, was not, and will never be designed to replace PDF, GIF, JPEG, or PNG. I have been using DjVu as the core of a web based document management system for over a year and a half. It is absolutely bar none the best, fastest, and most cross platform way to go from paper->web out there.

    Look at the ways to get a scanned document on the web:

    1) GIF, PNG, JPEG: Large filesize or bad rendering. If I need to send a 300dpi page to a web browser, the browser isn't going to let a user pan and zoom on it and it certainly won't print it correctly. JPEG is the only one of the three formats that actually has a place to store the document DPI regardless.

    2) PDF: Creating a pdf from a scanned image means either encapsulating a lossy or losless image in a file or doing OCR and risking unreliable information.

    DjVu regularly achieves compression ratios of 1200:1 or more at very very acceptible quality. There is a IW44 fractal compressed background layer and a loslessly compressed foreground layer. The information is progressive also. As the file downloads the foreground shows, then the background, then the color information loads. Example documents on the DjVu website have shown entire 300 dpi full color sharper image catalogs compressed to fit on a floppy disk.

    Btw not only are djvu plugins available for windows, macintosh, linux, and solaris. Let's not forget HP-UX and IRIX. How's that for covering the bases? If youre not supported, you can write your own for your particular flavor of UNIX.

    Geez get it straight.

    ~GoRK

    1. Re:Why are people so stupid? by Tet · · Score: 2
      JPEG is the only one of the three formats that actually has a place to store the document DPI regardless.

      Perhaps you're unaware of the PNG pHYs chunk, then, which lets you specify the physical resolution of the image.

      --
      "The invisible and the non-existent look very much alike." -- Delos B. McKown
  5. no linux support by trueimage · · Score: 1

    in the main post, it claims no linux support, and not that i support this format, they do have a linux plug-in viewer, just no authoring/encoding software for linux :o)

  6. Re:-1 redundant by drinkypoo · · Score: 1
    I really can't see much market, and very little application for this compression. On-the-fly compression of images for web download would be redundant, since a png would be smaller than this format, so the speedy on-the-fly compression of uncompressed images is pointless.

    No, it isn't. Think about how handy it would be to let someone dial in their own image quality, or do it for them of course. Some poor bastard on a 28.8kbit/sec modem would be able to set it to super low quality, which would look somewhat lousy but be quickly navigable.

    This would be handy in a system with CPU to spare but limited storage, like a 200mhz+ embedded webserver using flash memory to store images. Then again, you'd have to store them compressed there, as well, but at least you wouldn't have to store N different versions of the file for users with various different quantities of bandwidth.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  7. Let's get all worked up about nothing! by Hard_Code · · Score: 3

    Did anybody even follow the LizardTech link? Right on the front page is a link to a page describing DjVu. The whole product ("image format") seems geared towards scanning in Real Life documents and presenting them online. If you *read* the page it explains why it claims it is faster (first downloads high contrast data, then photographs and graphics, clarifying the image as it goes) and smaller (some wavelet kung foo). I don't see anywhere where they are pitching this as competition for gif or png, so everybody put down the flamethrowers. This is a very small niche product for digitizing and presenting real life mundane documents.

    --

    It's 10 PM. Do you know if you're un-American?
  8. Re:Some facts off the top of my head... by ledogar · · Score: 1

    This should do a good job of a lot of the things that JPEG (even at high-quality q) is currently failing on for us: photographs that contain a lot of bluescreen / greenscreen; pencil / pen artwork, etc. With JPEG, I get nasty haloes around foreground elements; these are very visible on a flat background such bluescreen or paper. JPEG also forces a mottled effect on the bluescreen due to color quantization I believe. DjVu appears to reduce or eliminate these undesireable side effects. I will be following its progress with interest (as long as it doesn't get the kiss of death from proprietary interests...)

    --
    Charlie Ledogar, Media Software Engr., Lucasfilm Ltd.
  9. Re:-1 redundant by dbarclay10 · · Score: 2

    Compression speed DOES matter. First of all, many things on the web are compressed on the fly - you just don't see it. gzip is used, as far as I know, for most of it. Partly due to its modest processor usage. You try serving up a million pages all compressed using something like bzip2 on a rackmount server with only two processors - you'll quickly melt the metal.

    Dave
    'Round the firewall,
    Out the modem,
    Through the router,
    Down the wire,

    --

    Barclay family motto:
    Aut agere aut mori.
    (Either action or death.)
  10. Comments from a wavelet expert by coult · · Score: 1
    As a Ph.D. applied mathematician whose research specialty is wavelet compression methods, I have some comments.

    The website clearly indicates that this software is designed for images that come from scanning papers documents that contain both text and graphics. The algorithm is supposed to recognize the text and store it as a separate layer (but still as an image) from the rest of the image. Furthermore, the image can be transmitted in such as way that the text and significant features of the image are transmitted first, followed by the bits representing less important features later - a technique known as progressive image transmission (among other names).

    I certainly believe that they use wavelets to do this. In fact, the hot wavelet method of the 90's, EZW (Embedded-Zero tree Wavelet) compression allows for exactly that: to compress an image in such a way that the more significant bits come first, followed by the less significant bits in order of significance. Picking out text that is layered on top of a background image is relatively easy with wavelets: just pick out the really large coefficients in the wavelet expansion, since those most likely correspond to parts of the image where large jump discontinuities occur. This can all be done automatically, of course.

    --

    All is Number -Pythagoras.

    1. Re:Comments from a wavelet expert by paulwomack · · Score: 2
      The BIG idea underlying this is that text requires high spatial resolution (say 300 DPI, 1 bit) with measly colour resolution. In the case of a page of bloack text on white paper the colour "pixel" could be the size of the page!

      The photographs need moderate colour AND spatial resolution (say 24 bit, 72 DPI). This is a very old idea. Colour print systems in the 70's used it. What's neat is the idea of "gating" the lower resolution colour data through the higher resolution "shape mask" to get the properties of both, at least for real world pages.

      Appropriate (and quite well known) compression techniques are then used on the 2 channels.

      The wavelet technique you describe sounds like it would get the frequency separation concept in a more "implicit" way. DjVu is very explicit. BugBear

      --
      Ignorance is curable. Stupid is forever.
  11. Please read and think before post by smolix · · Score: 1
    DjVu was not developed by the Olivetti and Oracle labs in Cambridge. Instead the guys who did it were at Redbank, NJ.

    Some of the members of the team include Leon Bottou, Yann LeCun (the guy who was one of the few inventors of Backprop), Yoshua Bengio, Patrick Haffner, Patrice Simard, and Larry Rabiner. I know those guys and they're very good. Don't know why ATT ultimately sold this product, though.

    As far as the text compression goes - it works by clustering individual characters into subgroups and using the latter as a highlevel compression scheme plus encoding the differences. So you could even 'edit' the text (they did it but didn't release it for some reason). Anyway, it's pretty cool stuff.

  12. PNG popularity by -kevin- · · Score: 1

    On an off note, its nice to see that PNGs are becoming more widespread, probably 12%-20% of sites that I've seen recently are using PNGs for all the images

  13. a jpeg-like codec by slashdot-me · · Score: 2

    Here's my own image compressor. It's just like jpeg but I had to trim the source a bit to fit inside the teeny-tiny submission box. I hope you can still understand it.

    The codec emits a byte stream considerably larger than the raw image (4x) but this stream is easily compressed by bzip2 or gzip. It reads and writes portable gray map (.pgm) files (both ascii and binary). If you want color I suggest you transform the image into YUV colorspace and compress each plane seperately. The color planes should be downsampled 4:1 (2x2 blocks) and compressed at very low quality (less than 15).

    Build instructions:
    Save the code as 4a.c
    gcc -O2 -ansi -Wall -o en -DMAGICK=1.0 4a.c -lm
    cp en de

    Running the program:
    convert image.gif image.pgm
    cat image.pgm | ./en 50 | bzip2 > image.4a
    cat image.4a | bunzip2 | ./de > image2.pgm
    convert image2.pgm image2.gif

    To make a pgm from a jpeg use:
    djpeg -grayscale -pnm image.jpg > image.pgm

    Hints:
    Try quality settings from 15 to 200.
    Bad input causes core dumps.
    The image is cropped if the dimensions aren't divisible by 8.

    ====================

    /* 4a by Ryan Salsbury */

    #include <string.h>
    #include <stdio.h>
    #include <stdlib.h>
    #include <math.h>

    #define D(a,i) a=(i); while (a--) {
    #define i int
    #define d double
    #define szi sizeof(i)
    #define szd sizeof(d)
    #define gc() getchar()
    #define pc(c) putchar(c)
    #define w while
    #define v void
    #define L goto

    #define _9 2+7
    #define _27 3+24

    #define di(F,A,B,C,G,E) \
    v F(i j,i h,G* M,E* S){ d O = 0.0; i x,y,u,q,s,t; D(t,h/8)D(s,j/8)D(y,8)D(x,8)\
    D(q,8)D(u,8)O+=M[u+q*j]*(A?ca[u]*ca[q]*(lQ[u+q*8 ]-23.5)*LQ:1.0)*co[B*u+C*x]*\
    co[B*q+C*y]; } } S[x+y*j]=O*(A?1.0:ca[x]*ca[y]/(lQ[x+y*8]-23.5)/LQ) ;O=0.0; } } M+=\
    8; S+=8; } M+=j*7; S+=j*7; } }

    i nm()
    {
    i c = gc(), a = 0;
    L0: if (c > 32) L L1;
    c = gc();
    L L0;
    L1: if (c - 35) L L2;
    L3: if (c == 10) L L0;
    c = gc();
    L L3;
    L2: if (c <= _9*_27 || c >= _27*_9) return a;
    a=a*10+c-48;
    c = gc();
    L L2;
    }

    v rp(i* x, i* y, d** X)
    {
    d K, *V;
    i h, m, p, G, H, I = gc()*gc();
    *x = (p=nm())&~7;
    *y = nm()&~7;
    m = nm();
    *X = V = malloc(256 + szd **x **y);
    K = MAGICK;

    D(G,*y) D(H,p)
    if (I==4240) h = gc();
    else h = nm();
    *V++ = h*K - 128.0;
    } V-=p-*x;
    }
    }

    v wp(i x, i y, d* V)
    {
    i n;
    printf("P2\n%i %i\n255\n", x, y);
    D(n,x*y)
    i l = *V+++128.0;
    l = l&~0xff ? l>>31?0:255 : l;
    printf((n%20)?"%i ":"%i\n", l);
    }
    pc(10);
    }

    d co[64], ca[8], LQ;
    char* lQ = "%! %,9CK\"\"#(.IJF##%,9HRG#&*0Ca[L'*7GQtoY,5FN\\pweAN Zao~}mUehkvlok";

    di(fw,0,1,8,d,i)
    di(rv,1,8,1,i,d)
    i main(i argc, char* argv[])
    {
    char T[32];
    i x, y, *pi;
    d* ui;

    d p = 0.1963495408494;
    d pp = 0.3535533905933;
    D(y,8) D(x,8)
    co[x+y*8] = cos((2*x+1)*y*p); }
    ca[y] = y?0.5:pp; }

    if (argv[0][strlen(argv[0])-1] == 'n')
    {
    LQ = 100.0 / ((argc-2) ? 70 : atoi(argv[1]));
    rp(&x, &y, &ui);
    pi = malloc(x*y*szi);
    fw(x, y, ui, pi);
    printf("%i\n%i\n%f\n", x, y, LQ);
    fwrite(pi,szi,x*y,stdout);
    }
    else
    {
    x = nm();
    y = nm();
    LQ = atof(fgets(T,32,stdin));
    ui = malloc(x*y*szd);
    pi = malloc(x*y*szi);
    fread(pi,szi,x*y,stdin);
    rv(x, y, pi, ui);
    wp(x, y, ui);
    }
    return 0;
    }

  14. Re:Dumbass. by meadowsp · · Score: 1

    Sorry, I did take a brief look at it, didn't see that part and did let myself be goaded by our glorious leaders.

    I don't know if "dumbass" is neccessary though. I was refering to the fact that the W3C seems to have a preference for PNG so perhaps we should stick to their standards.

  15. Proliferation of file formats by loki4eng · · Score: 1

    I work for a bunch of architects, engineers, and designers who could probably use a good image compression scheme. But I'm not going to tell them about this. Why? Because the proliferation of formats can become a huge pain for little sysadmins like me, and enough is enough.I am so tired of troubleshooting "but I can't convert myfile.bozoimage into myfile.gonzoimage". I realize standardization is a pipe dream, but I will not encourage this kind of thing. (Same for document formats - Screw Corel, Microsoft, and everybody who has to make a proprietary document format)

    --
    It's nota my planet, monkey-boy - Dr Lizardo.
  16. Re:Reminds me of FIF by flip-flop · · Score: 2

    I'm willing to bet those weren't in fact FIF files, but FITS files. FITS (flexible image transport system) is the de facto standard for astronomical images these days.

  17. Re:Reminds me of FIF by jovlinger · · Score: 2

    The problem is compressing them. The ideas behind decompressing fractals are easy; use the self similarity inherent in pictures (a tree looks much like another tree, so build a picture of a forest by layering trees and then correcting the errors).

    But figuring this out ammounts to exhaustive search. Apparently IFS had decent heuristics that got decent compression in only a few minutes. I think this is what was licenced, and that the file-format was open? This was a while ago, so I'm filling in the gaps in memory by guessing.

  18. I thought PNG did that effectively? by 1nt3lx · · Score: 1

    Perhaps I'm just ignorant, but I thought PNG was supposed to replace GIF when Unisys started seeking royalties? Slashdot advertised the "convert your web page's GIFs to PNG" contest or whatever it was. (Ironically, Slashdot still has MANY Gifs!)

    Secondly, isn't compression becoming less of a concern since more and more people are stepping into the broadband arena?

    It seems to me like the product makes sense, but it would have been much better received 10 years ago or a year ago when Unisys was going insane.

    Yes, we have no spoon.

    1. Re:I thought PNG did that effectively? by Evil+Grinn · · Score: 1
      (Ironically, Slashdot still has MANY Gifs!)

      Maybe the advertizers demand that their ads run as GIFs not PNGs.

    2. Re:I thought PNG did that effectively? by flynt · · Score: 1

      grep the homepage for gif, you'd be surprised...

  19. Better yet (OT) by Loundry · · Score: 2

    Can it detect the difference between art and pornography?

    --
    I don't make the rules. I just make fun of them.
    1. Re:Better yet (OT) by Bob+McCown · · Score: 1

      Yea, it knows it when it sees it...

  20. Re:Reminds me of FIF by Nailer · · Score: 1

    The comapny was Iterated Systems. The technology is still being developed, and highly advanced - they even have a fractal moving image compressor that can, [according to a workmate who did a thesis on compression - take with a grain of salt if you like, because I can't back it up] fit eight minuted of better-than-VHS-quality film on a floppy disk. He saw it at a trade show, allegedy. But believe what you will.

    The licensing fees for FIF and other iterated technologies are huge. And the weird thing is, if they were open source, Iterated would be in much better straights than they are now. We'd all be using infinitely zoomable, highly comrpessed images [both GIF, JPG, and PNG are web standards due to seeming freeness], and despiute the fact they'd loose the revenue, they'd be known as the number one player in town for fractal image creation tools. And people would but their software over the other utilities which use the same Open Source image format, because they had the fastest algorithms, extra features, or other competitive advantages.

    Just a thought.

  21. a rebuttal by Shoeboy · · Score: 1

    I have to address a point in the betanews article:

    The Internet has become a place where images rule, and visually pleasing Web sites reign over the simpler, more text-based sites.

    The man has obviously never been to asciiboner.com. Anne Marie should check this site out too, so as to get an idea of what to expect on our wedding night.

    --Shoeboy

  22. Misleading by Happosai · · Score: 3

    The BetaNews article is actually very misleading - it is confusing two of LizardTech's products.

    DjVu is a document format (like PDF), not an image format, and the techniques mentioned in the article refer to compression of documents.

    LizardTech do have a compressed image format. This is called MrSID, and uses completely different techniques for compression.

    I would be interested to see an independent comparison between MrSID and PNG - unless there are huge advantages of using the proprietry MrSID format over the OS PNG format, I don't predict much of a future for MrSID on the web (although it would seem that LizardTech are touting it more for internal use rather than for general distribution anyway).

    [Happosai]

    1. Re:Misleading by web_angel_tr · · Score: 1

      Right

      They are tring to juggler a few things. Like they say a 2.5GB Tiff file is only 3MB big in DjVu. I have never seen a tiff file on a web page. It would be nice to see how it compares with ... let's say jpg. Even the approach to compare this to formats is absurd.


      --

      --
      There is no such thing as gravity. The Earth just sucks.
  23. DjVu has Linux support by Lemuel · · Score: 3

    I don't know why the summary on Slashdot says "No Linux support". LizardTech has both decoders and encoders available for Linux.

    Also the summary picks on LizardTech's use of speed as a feature. While this isn't a standard measurement, it is a way to tell people that you will get your images faster because the files are smaller. That's not a big crime. They also do talk more specifically about their format producing smaller files, so they do understand real measurements. BTW, while it is possible that they say elsewhere that DjVu compresses faster than pdf, what I saw was that the documents download faster, not compress faster.

    The Slashdot write-up complains about LizardTech's comparison of DjVu with pdf, pointing out that pdf isn't an image format. True, but the LizardTech description refers to DjVu as "DjVu for Documents", and their web page describes why DjVu is good for documents. Images seem to be just part of the data they need to handle.

    Finally, I haven't seen any source for Acrobat either, but it is very popular on the Internet, so lack of source won't necessarily keep LizardTech from succeeding with DjVu.

    Is DjVu actually any good? I have no idea. Slamming the product with incorrect and misleading comments doesn't help one decide, though.

  24. Here's some prior art from 1994 for you by goingware · · Score: 2
    This sounds remarkably like an image compression algorithm and associated file format that I invented while I was at Medior in San Mateo in late 1994 - I actually implemented it in 1994 and it got used in some shipping multimedia CDROM products such as the Men are From Mars, Women are from Venus CDROM in 1995.

    My algorithm was lossless and what strikes me the most is the speed - my algorithm was notable for the speed of decompression, and we used it in particular just because it was so much faster than GIF, which was a significant advantage when you were scanning through lots of images on CDROM. While I only implemented it for 8-bit indexed images I felt it would likely work fine for any bit depth or color space.

    Medior was later purchased by AOL and renamed to AOL Productions. I think AOL Productions isn't around anymore but lots of old Medior people still work for AOL, for example former Medior President Barry Shuler is a high-level exec at AOL.

    Basically, my invention worked by dividing the image up into lots of little subregions and encoding each pixel in a given subregion in the minimum number of bits required to encode the number of colors that occurred in that region.

    For example, in an image that was black on top and white on the bottom, I'd have two subreqions with zero bits each and a single element color table that was either black or white.

    If it was snow - black and white pixels randomly intermixed - the whole image would be one bit with a two-element color table containing black and white.

    The big trick was to divide the image in a good way, in such a way that the whole image was reduced the most and the size of the data required to reconstruct the regions wasn't too big. In practice I found it worked OK to start with lots of small fixed-size squares then merge adjacent squares that had similar color schemes.

    I wrote a document for Medior that described what I invented in great detail and what I predicted this could be made to do. What I actually got it to do in practice was not nearly what it was capable of, but this was because of the limited time available to implement it.


    Michael D. Crawford
    GoingWare Inc

    --
    -- Could you use my software consulting serv
  25. Re:-1 redundant by BeanThere · · Score: 1

    PNG's are good, BUT:

    • They *still* aren't supported properly on the most used web browser (IE prompts me to save them to disk rather than open them and display them, perhaps this is fixed in Me, I don't know. I've also had problems with the alpha channels in IE, as well as the gamma correction stuff - they displayed very dark). Obviously this is typical MS sloth at work, and has nothing to do with PNG itself ..
    • I've never managed to get PNG files to approach JPG's in terms of compactness. Obviously JPG is lossy, so it can always have an advantage - but the fact of the matter is, for the vast majority of image distribution that takes place, a certain amount of information loss is acceptable (JPG's look perfectly good to 99% of people anyway, so why would these people want to download bigger files in PNG format?) (Most people probably aren't even aware that JPG's are lossy, hence the large of amounts of ultra-crappy looking porn on newsgroups, from people saving and resaving images just to STUPID things like adding a huge blue border around an image.)

      Software for creating/reading them (end-user software as well as API's) still isn't as common and widespread as it *should* be. These things take time, yes, but they are slowed by factors like popular browsers not supporting PNG's properly anyway.

    Don't get me wrong though, I would like to see PNG becoming the dominant format. But PNG seems to forever be stuck in the "early adopter" phase ..

  26. Re:Actually quite an old product by armb · · Score: 1

    http://www.djvu.com/cgi-bin/products/products.pl
    lists DjVu Solo 3.0 (encoder for single pages only) as a free download. Could you use that?

    http://www.djvu.att.com still has source for the reference library for people wanting to write viewers, but an encoder would be harder - I guess there might be patent issues too.

    --
    rant
  27. Re:-1 redundant by jovlinger · · Score: 2

    Like so many other factors, speed is one that does matter if in the extreme, but once it's good enough, it has very little importance. Size is arguably more important, because every byte saved is saved every time you use the file, while time saved is only saved during compression, which is often batched.
    So for time, order of magnitude seems to be the differentiator, while size has a linear importance.

  28. Re:Boycott DjVu - Use PNG ! by DocBill · · Score: 1

    Obviously, this person is not very well informed.
    Linux has always been the primary development platform for DjVu products. Typically the release cycle is Linux and other unix platforms, then Windows, and finally Mac.

    You'll also find the 2.2 DjVu Reference Library is available with GPL licence. The 3.0 Reference Library is scheduled to be released shortly...

    http://www.lizardtech.com/products/djvu/referenc elibrary/DjVuRefLib.html

  29. The old thing again? by Ashran · · Score: 2

    I hope that the GIF thing doesnt happen again..
    Wait till it's a "standart" then ask for license fees.
    The idea sounds pretty nice, but I'd like to see quality / compression ratio comparisions with hard numbers ;)

    --

    Before you email me, remember: "There is no god!"
    1. Re:The old thing again? by Ashran · · Score: 1

      The spelling is because in German its Standart and not standard =)
      Sometimes this kinda stuff just happens.

      --

      Before you email me, remember: "There is no god!"
  30. Re:Actually quite an old product by jovlinger · · Score: 1

    http://slashdot.org/article.pl?sid=older/980628231 4218&mode=nested

    That was almost 2.5 years ago. The reason they compare against pdf is that the target application seems to be text; compress the black and white text with one-bit RLE, and the background with JPG sort of deal.

  31. Re:Boycott DjVu - Use PNG ! by DocBill · · Score: 1

    First time I used PNG, I thought the same thing.
    Actually, it is just that PNG has a far more
    complicated interphase for selecting options.
    Once you select the right options, PNG will compress on the average about 10-20% smaller than
    a GIF. I've seen as high as 50% smaller. The
    only thing I've seen that is better for lossless
    compression hasn't been released publicly yet.
    When it is, I'll be happy to tell people about it.

  32. Boycott DjVu - Use PNG ! by meadowsp · · Score: 1

    Come on people, let's stick to open standards here...

    1. Re:Boycott DjVu - Use PNG ! by Mickey+Squid · · Score: 1

      It doesn't do animation, it's closed and it doesn't even compress down as small as gif. Have any of you acturally tried this thing? It sucks. It's like that jpeg 2000. It's not even as good as the old jpeg but they still managed to fool enough people to turn it into a standard. It's just a railroad job and they are winning.

      --
      All you need in this life is ignorance and confidence -- and then success is sure. Mark Twain
    2. Re:Boycott DjVu - Use PNG ! by WindowsTroll · · Score: 1

      Which is more important - open standards or better technology? If someone were to develop a superior technology - and made easily available to everyone - should users refuse to use it because it wasn't an open standard? Why would anyone choose to use something that was commonly known to be inferior when better alternatives were available?

      If the DjVu format is good and if they want it to gain acceptance, then they will create versions of the product for popular OS's. Given the number of servers that run on Apache/linux, it would only be a matter of time before tools were created that could be used for this platform.

      --
      "Microsoft has made computing accessible to a population who would otherwise not be able to use computers" - B. Kernigha
  33. Reminds me of FIF by Tony+Hoyle · · Score: 3

    A few years ago a company came up with a compression which was actually rather good, using fractals. It was called FIF. They made the mistake of greed ovtaking common sense and tried to charge for a license to write compressors for it... The result - when is the last time you saw an FIF file?

    If these guys don't have an open format they will simply go the same way.

    1. Re:Reminds me of FIF by Nailer · · Score: 1

      You can also grap the current viewer and a FIF Plugin for Adobe Photoshop [Win16, Win32, and Macintosh only] from Altamira Group

    2. Re:Reminds me of FIF by mazur · · Score: 1
      The result - when is the last time you saw an FIF file?

      Last time? There's never been a first time, for me. This is the first I ever heard about it, and I'm not exactly a newbie.

      Stefan.
      It takes a lot of brains to enjoy satire, humor and wit-

      --
      The truth shall make you fret. (Ankh-Morpork tImes motto)
    3. Re:Reminds me of FIF by JurriAlt137n · · Score: 1

      MS used to use FIF for Encarta, Art Gallery and a bunch of other multimedia reference titles - I expect they've gone to JPEG since they now use DVDROMs and don't care so much about space or quality.

      Hmmmm, nah. Some jokes are too easy....

      --

      People replying to my sig annoy me. That's why I change it all the time.
    4. Re:Reminds me of FIF by CdotZinger · · Score: 1

      The result - when is the last time you saw an FIF file?

      Last night on Nova, they showed a small labful of astronomers/cosmologists looking at distant supernovae (trying to find a type 1A, I think), and the files they were inspecting (visually) were .FIFs. I only noticed because I was surprised at how good-looking the rough incoming images were, and got really, really close to the tv (so close I could see the Squant Nebula in Sector FUFME). So, .FIF hasn't failed; it's found a small-fame/big-$ niche.

      --
      Your mouth is like Columbus Day.
    5. Re:Reminds me of FIF by Howie · · Score: 1

      I know - I bit my tongue too.

      --
      "don't fall into the fallacy of believing that Perl can solve social problems. Maybe Perl 6 can, but that's a ways off"
    6. Re:Reminds me of FIF by fatphil · · Score: 1

      Last thing I heard (a year ago?) IS had completely given up on fractals.

      I worked for a DSP company, which was based next to an engineering (and sport) oriented university.
      The number of Ph. D. students who used to come in and muck around with their fractal image code using our DSP boards was amazing. And the results? Garbage. In order to get any decent results the CPU input for compresion was prohibitively high. Not impossible, just impractical.

      Barnsley was too tight arsed, serves him right that it flopped.

      FP

      --
      Also FatPhil on SoylentNews, id 863
    7. Re:Reminds me of FIF by Enahs · · Score: 1

      No, those were FITS files--a totally different format.

      --
      Stating on Slashdot that I like cheese since 1997.
    8. Re:Reminds me of FIF by Howie · · Score: 1

      The result - when is the last time you saw an FIF file?

      MS used to use FIF for Encarta, Art Gallery and a bunch of other multimedia reference titles - I expect they've gone to JPEG since they now use DVDROMs and don't care so much about space or quality.

      The interesting thing with FIF was that because of the way it's done, you can zoom in to arbitrary depth and get a smooth image - not much detail, but a smooth image.

      --
      "don't fall into the fallacy of believing that Perl can solve social problems. Maybe Perl 6 can, but that's a ways off"
    9. Re:Reminds me of FIF by American+AC+in+Paris · · Score: 2
      I agree that licensing mania was a deciding factor in the fall of FIF, but as I recall, there were other issues at play, as well--one of these was that compression times tended to skyrocket the higher level of compression you chose. Another was that certain types of images would compress better than others--that is, a picture of a tree worked far better than a picture of a skyline because of the shapes involved in the images, because the fractal encoding worked better on the "natural" shape of the tree than it did on the straight-edged, blocky skyline.

      Last I used it (on a 486 laptop, mind,) I could compress a JPG image in about five seconds to the FIF's 150 seconds or more. The images were comprable in quality--FIF did a visibly better job on some photos, but not so much of a difference that it justified the extra effort of encoding them.

      Again, this is all from pretty old experience; the compressors may be far better now, and there's also the fact that an open source format/non-licensed technology approach would have likely resulted in better compressors. Any more recent info on how FIF fares, performance-wise?

      $ man reality

      --

      Obliteracy: Words with explosions

  34. Has this been patented yet? by sid_vicious · · Score: 1
    If it's been patented commercially already, no thank you. The last thing the web needs is to get wrapped up in another Unisys/GIF problem.

    And how the heck does it figure out what the "foreground" and "background" components are, anyway? If I have to go in and mask out the background before saving, then forget it.

    --
    If it ain't broke, it doesn't have enough features yet.
    1. Re:Has this been patented yet? by kyz · · Score: 2

      And how the heck does it figure out what the "foreground" and "background" components are, anyway?

      As it only works on scans of paper documents, "background" is bland paper-texture, perhaps coloured. You can easily use an edge-detection filter to work out which is which.

      --
      Does my bum look big in this?
    2. Re:Has this been patented yet? by TicTacTux · · Score: 1

      >And how the heck does it figure out what the "foreground" and "background" components are

      You know, background is "0" while Foreground is (not background). If the new format is accepted by the community, they'll try a color version... :-)

      --
      Use The Source, Luke!
  35. They got it from AT&T by dne · · Score: 4
    1. Re:They got it from AT&T by SEWilco · · Score: 4
      And we discussed it here on Slashdot:
  36. Re:Some facts off the top of my head... by Anonymous Coward · · Score: 5
    Gary got is mostly right.

    I am one of the four persons who created DjVu in the first place. The events took place in AT&T-Labs Research between 1997 and 1999.

    1. There is Linux support. Just go to the download page and select the Linux platform. Most of DjVu was first coded under Linux.
    2. After the Lizardtech deal, we set up a "non commercial site" named DjVuZone. It contains general information, benchmarks. links, a searchable digital library, etc.
    3. There is source code. Lizardtech recently had the good idea to relicense version 2 of the DjVu reference library under the GPL. We have the corresponding online documentation on DjVuZone. We are just waiting for the release of version 3 to redo that part of the site.
    4. DjVu combines several new technologies including new approaches to arithmetic coding (Z'-coder), new compression methods for textual images (Soft pattern matching, JB2), new wavelet method (IW44), and new ways of combining them together. The current implementation is geared toward compressing scanned document images in 24 bit colors around 300 dpi (raw size is 25MB) and typically packs them into 50-60KB. Neither TIFF, nor JPEG, nor JPEG-2000 nor Fax-G4 can do that. None of these technologies will let you realistically view such documents over the web. DjVu can.

    Hope this helps :-).

    - Leon Bottou, AT&T-Labs Research.

  37. There IS some source code by moogla · · Score: 1

    ...unlike the original story commentary above stated. It's on AT&Ts site Right Here and you can download and tinker with it. But read the AT&T Source Code License first; you can only distribute patches to the distribution if you change anything. Also, they don't give you all the encoding code, omitting the background/foreground image seperation and the lossy JB2 wavelet encoder that handles bitonal images (some of their best examples of compression!). They actually suggest someone should make a GIMP plug-in for all this. And because they give you the rest of the JB2 back-end, they're practically begging for someone who's read the literature to bridge that gap.

    That was real nice of them.

    --
    Black holes are where the Matrix raised SIGFPE
  38. -1 redundant by buttfucker2000 · · Score: 4

    The article has it about right. Png is vastly superior - excellent lossless compression, sometimes better than lossy methods (plus technical features like a full alpha channel), and, most importantly for its dominance over gif, it is unencumbered by patents or closed source algorithms.

    Speed of compression is not a factor in compression - otherwise we would use bmps or xpms, which have zero compression time - because they're uncompressed. Size matters. Speed doesn't.

    I really can't see much market, and very little application for this compression. On-the-fly compression of images for web download would be redundant, since a png would be smaller than this format, so the speedy on-the-fly compression of uncompressed images is pointless.

    And in any case, modern PCs are more than powerful enough to almost transparently display well compressed images, so a simpler format is about 10 years out of time.

    If it was open source, it could perhaps have a market in replacing things like xpms, which are used in games for processing speed, but even it was, the benefit would be marginal, since hard disk space is, relative to image size, almost infinite, so compressing them slightly wouldn't make much difference - and for download those images would be gzipped anyway.

    --
    Free Anne Tomlinson!!
    1. Re:-1 redundant by Fizgig · · Score: 1

      JPEG is a lossy compression format.
      PNG and GIF are lossless compression formats.

      The reason JPEGs are smaller is that they don't have all the information in there. So on the size front it's no surprise that JPEG "wipes the floor with" PNG. They just look crappier.

    2. Re:-1 redundant by Chagrin · · Score: 1

      -1 redundant? This should be -1 doesn't-know-wtf-he's-talking-about. Comparing DjVu to PNG is like comparing GIF to Jpeg -- each format serves a very different purpose.

      --

      I/O Error G-17: Aborting Installation

    3. Re:-1 redundant by wjr · · Score: 2

      You're missing the advantage that MRC-based image formats like DjVu (and TIFF-FX and eventually JPEG-2000) have: by separating continuous-tone content like photographs from monotone (spot colour) high-frequency content like text, you can achieve really good compression while maintaining image quality. You don't want to do lossless compression of high-resolution photographic images - even with the best lossless compressor, the files are still huge. And if you do lossy compression of mixed content (text over images), you'll get really lousy results unless you separate the different types of image content.

    4. Re:-1 redundant by harmonica · · Score: 3

      DjVu is for scanned documents. There is no major accepted file format in use for this kind of data. This will be a huge market once bureaucracies around the world start digitizing their tons of documents. OTOH, DjVu is there for quite a while already and I don't see it having succeeded. Plus, when I installed the plugin under IE 5 a year ago, it was in some dubious beta state. Not nice to work with.

      Lossy / lossless image compression types. You cannot compare PNG tolossy schemes. PNG cannot beat a lossy method because the goals are different. Lossless: Compress as small as possible (but the exact original must be restoreable). Typically, the algorithms that throw more resources (CPU and memory) at it are better. Lossy: For a given file size, reach the best quality. You can easily beat PNG with a lossy scheme by simply choosing very bad quality.

      Open source. There are a few programs out there. Try TIC. It's GPL'd and beats JBIG-1 by about 40 percent on scanned images, according to the website.

      Resources: Image Compression Resources, The Data Compression Library.

  39. Re:Browsers by Tim+C · · Score: 2

    Not so fast - they have a plugin available for Netscape and IE at:

    http://www.lizardtech.com/cgi-bin/products/desc. pl?tsb=443224

    Cheers,

    Tim

  40. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  41. Re:Will it help me get goatsex down the pipe faste by gle · · Score: 1

    I hope it'll reduce it to zero byte

    ____________________

    --
    Ni!
  42. Re:Why would I use this? by Flavio · · Score: 1

    Flash? Hell, no! check out my home page and see how many images and plugins I've used (hint: none).

  43. Re:It is Official: You are Stupid by moogla · · Score: 1

    DjVu is applicable for any raster image. It just happens to be good at seperating high constrast regions (text) out of the image. And they may be referencing the name AT&T gave to the whole technology, while LizardTech chose to split it up into two distinct groups.

    --
    Black holes are where the Matrix raised SIGFPE
  44. Re:Whatever! LizardTech has the market cornered! by IntlHarvester · · Score: 1

    I'll second that MrSid is pretty cool. (I found it by way of some old railroad maps at the library of congress. So, if you are looking for some free content, there you go.)
    --

    --
    Business. Numbers. Money. People. Computer World.
  45. ?! by bludwulf · · Score: 1

    I have to question if CmdrTaco even went to the web site. There are SDK's for linux/win/solaris, and they measure the speed in terms of PDF because DjVu is a contender to PDF: you can add hyperlinks to documents you scan in.

  46. This exists already! by multipartmixed · · Score: 2

    As I understand it, it is possible to set different compression factors (from lossless to very lossy) on different polygonal regions of a JPEG file.

    Since JPEG is already fairly-well supported, why not use it? The only thing that needs to be changed is the image-creation software; it needs to know how to detect the difference between background and foreground content. Image-creation software also has a huge advantage here; layers (and their relative z-Indexes) give great hints as to what is, and what isn't, "background"

    --

    --

    Do daemons dream of electric sleep()?
  47. Re:Some facts off the top of my head... by edinho · · Score: 1

    This post looks legit to me despite the AC status. I.e., you can believe it. Cheers, Ed. (Only people from AT&T Labs-Research write "AT&T Labs-Research" :-)

  48. Re:I saw this working at Seybold by Angst+Badger · · Score: 2
    I've worked with MrSID, too, and I have to agree that for the intended applications it's much nicer than PNG. The ability to rapidly and progressively zoom into an image is very nice and implemented in such a way that selective zooms can be delivered on the fly over the web without downloading the entire image at once. The demo pieces I looked at were mostly historical maps -- the Library of Congress uses MrSID extensively -- and it was a great way to examine a document that was originally 4 feet across within the confines of a monitor. DjVu was also interesting, and definitely outshines PDF for scanned documents, but MrSID is LizardTech's real marvel.

    I wish people would get off the "if it's not OSS it must suck" bullshit attitude. LizardTech has some really well-designed software, and if it bothers you that they're proprietary (and IMHO, grossly overpriced), then bigod go out and clone it. Having an OSS equivalent of MrSID and DjVu, especially in a form that would be usable to the point-and-drool crowd, would be a very strong selling point for getting Linux into the MS/Adobe-dominated document archiving market.

    --

    --
    Proud member of the Weirdo-American community.
  49. Re:goat sex by kazzuya · · Score: 1

    Not only you are an Anonymous Coward (if you have something to say come out and speak up !) but you also had to post this off-topic bullshit 2 times to get it right. Next time don't write anything and use the Preview button to make sure nothing actually spilled in the Comment box 8)

    Thank you

  50. Um.... by ozbon · · Score: 1

    If this graphics format is so great, why aren't they using it on their own site? Everything I saw was in .gif or .jpg. DOH!

    --
    I say we take off and nuke it from orbit. It's the only way to be sure...
    1. Re:Um.... by flynt · · Score: 1

      It is not a graphics format, it is a document format. The article was wrong.

  51. It IS available for Linux by wilton · · Score: 2
    You CAN get a DjVu plugin for Linux:


    Have a look at:


    http://cgi.netscape.com/cgi-bin/pi_moreinfo.cgi?PI D=11473

    Will

    --
    per mere, per terras
  52. djvu is not new, it's OLD. by TheLink · · Score: 1

    It's AT&T's technology.
    http://djvu.research.att.com/

    Even mentioned in slashdot LONG ago (and we know how timely slashdot is ;) ).
    http://slashdot.org/article.pl?sid=99/03/29/0316 220&mode=thread

    http://slashdot.org/article.pl?sid=older/9806282 314218&mode=thread

    Do a search on slashdot for djvu for more links.

    Link.

    --
  53. What's wrong with the old encoder? by Andy+Dodd · · Score: 2

    Um, you can't arbitrarily change a license for software (that you have already licensed.)

    If you have a free open-source encoder, you can continue using it for no cost. No $2000 fee.

    --
    retrorocket.o not found, launch anyway?
    1. Re:What's wrong with the old encoder? by Suydam · · Score: 2

      lost the sources :( went back to download a new copy of it and found that all of a sudden it cost $2000. That was actually my problem.

      --


      Werd.
  54. Is it just me... by db · · Score: 1

    I didnt read through all the comments here, so this has probably been said already...

    But wasn't DjVu developed a GOOD while back by AT&T or Bell Labs or something? I could have sworn I took a look at this 2 or 3 years ago.

    --
    Dave Brooks (db@amorphous.org)
    http://www.amorphous.org

  55. Re:Actually quite an old product by Suydam · · Score: 1
    actually I was using the linux command line tool as a batch processor. So while, yes, the documents were one page each....the problem is the lack of the command line tool.

    It was really just a proof of concept anyway....so it's not costing me enough bandwidth to matter having switched away from DjVu.

    --


    Werd.
  56. DjVu, MRC, TIFF-FX, JBIG2 and more acronyms by wjr · · Score: 1
    The imaging model used by DjVu to get such good compression is to divide the image into (typically) three planes: a background plane (colour, compressed using wavelets, usually at a low resolution), a foreground plane (colour, compressed using wavelets, also at low resolution) and a selector plane. Imagine an ad page in a magazine: there's coloured text printed across a photographic image. The photograph is stored in the background, the text *colours* are stored in the foreground as blobs of colour (e.g., a blue word would have a large blue blob in the foreground), and the text *shapes* are stored in the selector. The decoder decompresses all three, and then draws the foreground colour where the selector is '1' and the background colour where the selector is '0'.

    This scheme has the advantage that it keeps high-frequency content (the edges of the text) out of the background, enabling it to compress better (wavelets and JPEG don't handle high frequencies well: they either smear them, add ringing, or require a lot of bits). Another advantage is that there are specialised algorithms for compressing binary (1 and 0) images containing text. The one used by DjVu is called 'SPM' (Soft Pattern Matching). It works (roughly) by breaking the text image up into isolated characters, then deciding which characters look similar to each other. On most pages, there will be a lot of repeated characters. You can then get improved compression by storing only one instance of each unique character shape. SPM does something fancier than that but that's the basic idea.

    This three-layer model is known as MRC (Mixed Raster Content). It's used outside DjVu - most notably in a file format called TIFF-FX. This is an extension of TIFF intended for Internet Fax (including colour fax). TIFF-FX is, if I remember right, RFC2301. There is an extension to TIFF-FX in the works to add support for JBIG2. What's JBIG2? It's a format standardised by ISO for doing bilevel image compression, using the same concept of improving text compression by identifying repeated shapes. (Actually, there's a lot more in JBIG2 but I'll leave that aside). The concepts used in DjVu's SPM were incorporated into the JBIG2 design (AT&T participated in the design of JBIG2), so JBIG2 can do anything SPM can do. JBIG2 was approved by ISO and ITU this past spring.

    Another file format that includes a lot of the same concepts as DjVu (MRC, repeated shape compression) is ScanSoft's XIFF - it's yet another TIFF extension, and is used in ScanSoft's Pagis line of products. In fact, much of the stuff in TIFF-FX is a standardised version of features that appeared in XIFF. MRC is also going to be in one of the later parts of JPEG-2000.

    I've been involved in the work on XIFF and TIFF-FX and I was the editor of the JBIG2 standard. So what do I think of DjVu? One problem is that AT&T abandoned the standards process - they felt it was moving too slowly. This means that they opted for a proprietary solution over an open standard one. Yes, the standards process is slow, but when it works it produces things like JPEG and PNG (a W3C standard) - formats that everyone can use, and where you have a choice of encoders, a choice of decoders, and few worries about interoperability. Another problem is its reliance on arithmetic coding - while that gets you good compression, it can have speed problems. JBIG2 offers the choice of arithmetic compression (for applications where size is the most important issue) and Huffman-based compression, for applications where speed is the most important issue. I've personally seen a JBIG2 decoder decompress at over 1 gigapixel per second...

  57. The facts. by sabre · · Score: 1
    If you want to learn more about what DjVu actually IS then don't bother reading the marketoid page. Look here. Specifically, on the what is DjVu? page, they say the following:

    The commercialization of DjVu is handled by Seattle-based LizardTech Inc. in partnership with AT&T Labs. DjVu is an open standard. The file format specification, as well as an open source implementations of the decoder (and part of the encoder) are available.
    Among other claims, they say "Black-and-white pages at 300 DPI typically occupy 5 to 30KB when compressed. "

    I have long given up on slashdot reporting the whole truth and nothing but the truth...

    -Chris

  58. Uh... this was on slashdot before by sean@thingsihate.org · · Score: 1

    This was on slashdot like two years ago, when it was AT&T pushing it instead of this LizardTech place that apparently picked it up.

    --

    One of the many things I hate. thingsihate.org
  59. CmdrTaco needs more sleep by Hasdi+Hashim · · Score: 3

    He posted this two years ago: DjVu plug-in available on Linux/Irix/Solaris/Mac. First post by sengan:
    New Image Compression Algorithm claims 1000:1 ratio

    Hasdi :-)

  60. Re:Actually quite an old product by DamageBoy · · Score: 1

    More over that, the first demo compressor / netscape plugin that AT&T released was for Linux... And this was two years ago... :)

  61. Re:DjVu has Linux support - including SOURCE CODE! by stu_coates · · Score: 1

    Not only is there a Linux x86 binary available, the source code is also online here.

  62. Re:Why would I use this? by Chatterton · · Score: 1
    LITTLE FLAME: So much bandwidth available... hum, it take me near 20 seconds to load the slashdot main page ans I don't talk about an article full of responses... So much bandwidth... But if you have too much money, you can send me some of it then i can have an OC45 at home... ;-)

    It seems that web designer omit the fact that there is always slow computers and slow connections. My work computer can't play flash animations (too slow), my home computer can't load big pages (56k winmodem). If a better compression could exist, it could leverage my poor home connection and if this compression is fast it can leverage my work computer. In the two case i am a winner. :-)

    I agree with you that this reply seems to be a flame, it's nearly one because i am in anger to those who thinks that everybody have a T1 at home...

  63. Re:Smaller then PDF? by f5426 · · Score: 1

    > Who makes pdf's that way?

    you have a printed document, with color. Now, you want to archive/distribute it. What are you going to do ?

    When going the PDF way, you blindly scan the color document and don't do any extra job (for instance, take all those re-editions of Sierra/Lucas Art old adventure games. The original manula is included as a huge PDF file).

    So their stuff makes half-sense to me. But, clearly, there is a lot of hype on their site...

    Cheers,

    --fred

    --

    1 reply beneath your current threshold.

  64. Yet another graphic format from hell. by Ino · · Score: 1

    IMO - they can all go and stick it up their hole where their head is.
    Regarding their "Convert your paper assets into the fastest, smallest, clearest digital format" there was something like: "any paperless office will wind up consuming more paper than a simple - paper-based office."
    Conclusion? DIE AT&T - DIE! and take your bleeding proprietary format with you - it's allright DejaVu already! :)

    --

  65. DjVu - Actually from AT&T by a_librarian · · Score: 1
    DjVu was actually developed by AT&T Labs - Research. http://djvu.research.att.com/ well over a year ago. They've released it to various other companies to comercialize it. Since the folks at AT&T Labs are (not surprisingly) big Unix users there is support for the various Unix flavors, including Linux. You can get more technical information than you want by going to their website. (one note: Also not surprisingly, most of their technical documents about DjVu are in DjVu format, so you'll need to get the plugin, either from AT&T or LizardTech.)

    Disclaimer: I used to work at AT&T Labs as an outside contractor in their library.

  66. Whatever! LizardTech has the market cornered! by (27-Shaner) · · Score: 1
    Check out their other product MrSID ... It is great for browsing massive (50 gig+) images on the web.

    It is heavily used in Remote Sensing, online mapping, and Web based-Geographic Information Systems.

    It is very slick!

    In general, with 'free' software vs 'paid' software, I think the corporate world has the advantage of advertising their product. But this corp produces high-quality, useful products!

    --
    ----- "Computers will never catch on, why don't you go outside and play!"
  67. Professional Image Processing Interests by rufus+t+firefly · · Score: 1
    LizardTech's main attraction is the MrSid software (which, interestingly enough *does* have a penguin flavor available for it). This software holds interest for imaging professionals (like photographers) for archiving photos, since even JPEGs get pretty space hungry when coming from high grade digital cameras.

    I'm really interested to know where JPEG 2000 support is for the GIMP and the rest of the free software world. Last I read, GIMP wasn't planning on supporting JPEG 2000 because of licensing issues, which would be a real shame, since wavelet compression is an awesome concept.

    ---

    --
    "He may look like an idiot, and talk like an idiot, but don't let that fool you. He really is an idiot." - Duck Soup
  68. Re:Smaller than PDF? by mkendall · · Score: 1

    It all depends how you make your PDFs. The bulk of the size of a typical PDF is in the images, and the images may be embedded within the PDF in various formats and with various levels of compression. A comparison without detailing what format was used is meaningless.

    For example, if you scan a typical one page black and white text document at 300dpi and save the result as a TIFF with CCITT Group 4 compression it will be 20-50kB. If you make a PDF by printing to Postscript then using the PDF generation tool within the GhostScript Viewer you will end up with a PDF that is several MB. That's because embedded within the PDF will be an uncompressed bitmap. If you make the PDF using the c42pdf utility your PDF will be the same size as the TIFF, 20-50kB. That's because embedded within the PDF will be the compressed bitmap.

    If you use the Adobe Acrobat tools to produce your PDFs, you will find lots of options for controlling how embedded images are compressed, what the resolution is and what formats are used. You can select different options for B&W, greyscale and colour images.

    I suspect the DjVu comparison is not comparing like with like. And in any case, as has been noted, anyone who generates PDFs of documents like annual reports with a scanner is missing the point.

  69. Re:Smaller then PDF? by wjr · · Score: 1

    Even for black and white images, PDF is the wrong way to go when you're dealing with scanned input. You either use their OCR which (like all OCR) introduces error, or you store the scanned image. The compression they use for the scanned image is quite poor. Modern compression methods like JBIG2 and the SPM compression used in DjVu can reduce a scanned document to a tiny fraction of its original size - 100:1 is not uncommon.

  70. Re:There's nothing here that TIFF6 can't do by wjr · · Score: 1
    The bilevel compression is done with SPM (Soft Pattern Matching - see my comment below). It uses a JBIG-like context model to do pixel coding, and uses an arithmetic coder called the Z-Prime coder, which is mathematically related to IBM's Q-coder.

    Group 3 and Group 4 fax do really badly on halftoned images; JBIG-like compressors can do quite a bit better. JBIG2 includes a special mode to compress halftoned images.

    TIFF6 doesn't support the MRC model used in DjVu - you have to go to TIFF-FX to get that.

  71. And read this too by Enahs · · Score: 1
    --
    Stating on Slashdot that I like cheese since 1997.
  72. Re:Fat chance by Technician · · Score: 1

    I think you are right about it not being open. Notice they have two doors for visitors? One is for commercial use! I smell PDF revenue stream. Give away free viewers and sell the encoders.

    --
    The truth shall set you free!
  73. Re:Actually quite an old product by Suydam · · Score: 4
    On that note, I've been using DjVu's open-source encoder for several years to encode text documents. It's compression ratios are incredible and the plugin is also free and easy to install.

    The big problem I have with this article is that DJVu isn't a "new image format". It doesn't even display things inline (like GIF, PNG and JPEG). It is however an excellent alternative to PDF if size of file is your main concern.

    The extensive references to "speed" when compared to GIFs and PDFs could be one of two things. They could be talking about Download speed (my personal experiences show DjVu files to be about 10 times smaller than GIFs and even more when compared to PDFs. Or, they might be speaking of encoding speed which DjVu seems to excel in

    Here is a problem however: the command line encoder used to be free for non-commercial use. I was using DjVu for encoding swim-team documents for a small non-scholarship collegiate swim team. Certainly this counts as non-commercial. HOwever, the new version from Lizard Tech would cost me $2,000 USD to run. That is absurd by comparison. So I'm abandoning DjVu since I can no longer afford the encoder.

    Incidentally, if you want to see how it worked for me, I used it on nearly every swim meet results page for a few years. Here is an example, just click on the links next to the word "Splits" in each event: http://www.k-swimming.org/cgi-bin/swimming/results /meet_view.pl?8

    --


    Werd.
  74. Re:Why would I use this? by Flavio · · Score: 1

    I see what you mean.

    I'm on 256kbps DSL right now, but even if I were in a modem, things wouldn't be that problematic. I only see an advantage to DSL when downloading large movies and game demos. Of course getting slashdot in 3-4 seconds versus 10 seconds is comfortable but IMHO it doesn't warrant for better image compression. I'm used to browsing with 3-4 netscape windows open and downloading at the same time and load times certainly don't bother me when I'm with a 33.6 modem.

    While I don't assume everyone has a T1 at home, I usually assume everyone has a half-decent machine. Thinking otherwise reminds me of Andy Tanenbaum arguing with Linus Torvalds and reasoning that it was a bad idea to make Linux run only on 386 machines, thus breaking compatibility with older x86 processors.

    Flavio

  75. Okay, so it's another wavelet-based compression. by jcr · · Score: 1

    So is JPEG 2000. So, who needs this?

    Now, if it were a fractal compressor, then it might actually matter.

    -jcr

    --
    The only title of honor that a tyrant can grant is "Enemy of the State."
  76. What about it? by Trinition · · Score: 2
    So, we get a new format? Not that easy. What about the spotty support for PNG? What about the various competing vector image formats? What about the more aggressive wavelet based image compression?

    First you have to have a good format, then it has to be accessible and affordable, then it has to be accepted. For the life of me, I can't figure out why PNG hasn't replaced GIF.

    Actually, though, that's part of the final hurdle -- a Catch-22. No one will adopt it until web pages and browsers support it. Web pages won't support it until browsers do and browsers have no reason to until web page creators demand it.

  77. Re:Okay, so it's another wavelet-based compression by PigleT · · Score: 2

    And why exactly is "fractal compression" so much better than wavelet-based?


    ~Tim
    --
    .|` Clouds cross the black moonlight,

    --
    ~Tim
    --
    .|` Clouds cross the black moonlight,
    Rushing on down to the circle of the turn
  78. There is Linux support by snookums · · Score: 4

    They have a browser plugin for Linux/x86/glibc2 available for download here

    Yes, I know that link is broken because /. put a space in it -- you'll have to get rid of it yourself
    --
    Be careful. People in masks cannot be trusted.
  79. Actually quite an old product by Nailer · · Score: 5

    DjVu has been around for 2 years, and isn't anything new. In fact, it wasn't actually designed by Lizardtech - it was developed as an Open Source technology in the Olivetti and Oracle Researtch labs in Camridge, UK, and was sold when US telco AT&T purcahsed the labs.

    Hence the Open Source products generally only seem to be there to satisfy existing licensing requirements from prior to Lizardtech's purchase. It's doubtful Lizardtech tend to encorage that aspect of the technology, and they're only promoting the closed source stuff.

    However, the compression is indeed very real and the cross platform nature makes it quite useful for archiving stuff that won't be modified frequently in the future - remeber, that text ain't vectorized, it's just another layered image, AFAICT.

    1. Re:Actually quite an old product by Enahs · · Score: 1
      Well, other than the fact that you got the compression format for the background totally wrong, and the fact that it's really pathetic that you bothered to look this up, I'd say, well, who cares. Thanks for posting redundant info.

      Why don't you take a look at this? Apparently they use some sort of wavelet compression on the background. =)

      --
      Stating on Slashdot that I like cheese since 1997.
    2. Re:Actually quite an old product by markvchain · · Score: 1

      It was developed at AT&T, but not at the former Olivetti/Oracle lab. The developers are Yann LeCun and his team at AT&T Labs in New Jersey. For technical details on the algorithms, see the papers available from Yann's bibliography page

    3. Re:Actually quite an old product by dkh2 · · Score: 5
      True. If you want to see DjVu in action, go get the plugin at djvu.com and visit one of my projects here at CWRU. http://www.cwru.edu/UL/DigiLib/Hours/homepage.html

      Picture this: Start with a 15th century Flemish "Book of Hours", hand illuminated on vellum (goat skin). Scan it at 600dpi 24bit for archival purposes. Reduce your tiffs to 300dpi and you still have 1.06 GB of image data (not very downloadable). Using the DjVu compressor we achieved 205:1 compression so the final product totals 5.44MB. By separating the pages so they only download when called for the initial download is a mere 45.06KB (including all of the HTML and other images on the page) with an average download of subsequent pages only 21.34KB.

      DjVu was developed by AT&T Research. It was then purchased by LizardTech last year.

      Code commentary is like sex.
      If it's good, it's VERY good.

      --
      My office has been taken over by iPod people.
    4. Re:Actually quite an old product by jovlinger · · Score: 2

      frankly, who cares what the individual formats are? Really. That is completly besides the point.

      The important thing is that they try to make a semantic interpretation of the input image and apply differing approaches depending on the content. My above post answered the question above as to why they are comparing themselves to pdf; they focus on compressing exactly one kind of information, so they shouldn't be compared to standard image compression.

      So I guess I don't agree about that being redundant. whereas the fact that they use wavelets... well, that's nice, but hardly germane.

  80. Why would I use this? by Flavio · · Score: 1

    IMHO there's very little need for highly compressed images on the web right now. With so much bandwidth available and the notion that images aren't always mandatory, what difference does 100kB of data make to a web page?

    Of course the concept's interesting, and if it's really innovative it could be used in conjunction with other techniques to improve video compression, but considering the format's not open [(don't even get me started on this, because we're considering the web here)] and perhaps not all that fantastic, why bother?

    Flavio

    1. Re:Why would I use this? by f5426 · · Score: 2

      > IMHO there's very little need for highly compressed images on the web right now.

      If you download a whole book with text + pictures, then it would make a difference.

      Today, scans of archives avalaible on the web are often ugly 1bits-per-pixels low-res scan embedded in PDFs. And that suck badly / is almost unreadable.

      So, DjVu _is_ something interesting for reading non-copyrighted papers, research work, FOI publications, or public domains archives.

      > but considering the format's not open

      As many others have already pointed, the format is open.

      Cheers,

      --fred

      --

      1 reply beneath your current threshold.

  81. This is a new PDF competitor by fluor2 · · Score: 1

    This is not an image format, but a new PDF format.

  82. Re:Forget about it... by Enahs · · Score: 1

    How is a GPL reference lib proprietary?

    --
    Stating on Slashdot that I like cheese since 1997.
  83. Old technology by elendril · · Score: 1

    It's an old technology developed at AT&T a few years ago. See : What is DjVu.
    YMMV but it seems to work quite well.

  84. neat little toy by junk · · Score: 1

    I was able to get a look at this at some Press thing at Comex. What they were actually showing off was their MrSID program, which is what's needed to view the images. Aside from thr fact that they were pulling from a server in Seattle off of a T1, it was still pretty impressive as to what they could do with their images. You could "zoom in" on small sections of images and check them out with amazing clarity. While impressive, the need for such things isn't all that broad. I discussed possible uses for their new format and they told me they were targeting e-store types of places. Say you wanna buy a blanket from a store online, but you're not too sure about the quality, since you can't get an upclose of the blanket...just zoom in. I thought it was fairly impressive.

  85. Jpeg 2000 by sumengen · · Score: 1
  86. Twenty times faster to *download* by marnanel · · Score: 1
    >especially because they say it's "20 times
    >faster then gifs" -- who measures compression
    >in terms of speed?

    Well, what LizardTech's site says is:

    >Download a 50-page color catalog in DjVu format
    >in the time it takes to download a single page
    >of that same catalog in PDF format.

    So it looks like when they say "faster", they mean "smaller" (and thus faster to download).

    BTW, I so much hope that if this goes anywhere, the format will be made open. (Fat chance :( ) Someone made a comparison on the BetaNews story with the problems with GIF; but that's not comparing like with like; we have a new format which requires a plug-in. It's more like pdf, only worse, because the viewers won't be as easily available.

    --
    GROGGS: alive and well and living in
  87. Won't go anywhere by laborit · · Score: 2

    GIF = "jif"
    JPEG = "JAY-peg"
    PNG = "ping"
    DjVu = "duh-ju-voo"

    So you have the choice between drooling on yourself or saying "deja vu," which has more syllables, tone changes, and stops than its nearest competitors. Plus the danger of the illiterate calling it "day-JAH-voo."

    -----
    Go ahead, blame me... I voted for Nader!

    --

    -----
    Go ahead, blame me... I voted for Nader!
    1. Re:Won't go anywhere by matthead · · Score: 1

      From their web page: ... DjVu (pronounced "déjà vu") ...

      Looks like the illiterate are right for once.


      -Matthead

      --

      -Matthead
  88. almost cornered by Jafa · · Score: 2

    There's another competitor to MrSID by Iterated Systems called MediaBin. We were helping Iterated out from an actual field use (we're a prepress/graphic arts shop) but they were more interested in the science and not implementation. Looks like MediaBin is their offering.

    Lately we've been much more interested in MrSID and have been using it a little bit, but are hoping to include it in our home brewed media management system.

    Jason

  89. Re:The Ability to Search by dmarney · · Score: 1

    DjVu has both image layers and a data layer. The high-contrast b&w layer is OCR'd and the recognized text is stored in the data layer. The image can thus be indexed by any search engine.

  90. There's nothing here that TIFF6 can't do by fatphil · · Score: 2

    It appears that they've simply filtered the documents into 2 layers.
    One for bilevel compression, say using CCITT group 4 compression (Faxes use group 3, group 4 is better ratio, but bigger disaster if you drop a bit), or maybe JBIG (IBM have a patent on the statistical model of the Q boxes, not on the actual compression algorithm, so all you need are your own Q boxes). Both of the above compare the current line with the previous line, and barf horribly when given dithered bilevel images, such as newspaper photos, or noisy scans.
    The second layer is the full colour one, and can be implimented as a JPEG.
    Tiff 6 supports all of the above.
    The 3GB Tiff document they talk about on their site was probably 3GB using the "uncompressed" setting. Apples and Oranges, as they say.

    The world doesn't need any new de-facto "standards" when there are perfectly good present non-proprietory standards which can do the same.

    FP

    --
    Also FatPhil on SoylentNews, id 863
  91. This is in use... by VaultX · · Score: 1

    At the Cobb County (Georgia) Court Website, I can't remember the address..but they compress all their court documents using this method, it doesn't seem too bad.

    --
    - nick
  92. Re:Smaller then PDF? by dkh2 · · Score: 2
    Actually, PDF documents generated directly to PDF and not scanned first are fairly small. This is a format for scanned documents that knocks the socks off of PDF. On average a DjVu file from a scanned document will be 1/5 the size of its PDF equivalent. I have actually achieved compression down to 1/15 the size of the similar PDF.

    Code commentary is like sex.
    If it's good, it's VERY good.

    --
    My office has been taken over by iPod people.
  93. The Ability to Search by sverrehu · · Score: 1

    The web page talks about the 90% of documents in the world that are still on paper only. Now I really hope that everybody understand that using DjVu alone to webify these documents will be of little value. How do you expect to find those documents? Will Google be able to index images?

    Scanned documents on the web will have to be included in web pages that contain reasonable keywords and extracts from the texts. Scanning, compressing and commenting may actually be more work than keying in the entire document, so most people will probably just skip the commenting.

    But heck, nobody seem to remember that the web was about content, not layout, so I'm sure DjVu will be warmly welcomed.

    1. Re:The Ability to Search by dardem · · Score: 1

      "Will Google be able to index images?"

      No, but IBM's Garlic might....


      --

      "Ceilean Súil an ní ná feiceann..."
  94. Oh, I didn't notice... by The+Queen · · Score: 1

    I watched that last night too but didn't know those were FIFs... I was actually fascinated that they could discern anything in those close-up pixelated shots!

    Here's to the Ever-Expanding Universe!


    "I'm not a bitch, I just play one on /."

    --

    The House Between - Original Sci-Fi Series
  95. New Format by Sternn · · Score: 1

    After installing the plugin and checking out the samples, I saw that what most people are saying is true, it is a PDF clone, not an image compressor format. If you look at some of the sample files, you will also see they are MUCH bigger than standard PDF files, the majority of them range from 120,000 to 190,000 per page. It appears to initally load faster than Acrobat, but I think that's because it's more of a streamlined plugin rather than a whole system. The images themselvces however seem to be much bigger than any PDF I have ever encountered. Anyone else go through their sample directory?

    --
    -Sternn
    1. Re:New Format by njacques · · Score: 1

      I have used DjVu for documents and found that 200DPI TIFF Group 4 image are around 90KB, after conversion to DjVu the image is around 10KB and has no visible loss of quality.

  96. Some tech other than lizard's by compwiz3688 · · Score: 1

    A little late in posting, but anybody browsing "old" news might want to know that LuraTech also has wavelet compression.
    ---

  97. well, ok... by tewwetruggur · · Score: 1
    the question I have is: Why?

    do we really need another image format? do we even want another image format. you can claim all the nice enhancements / speed improvements / compression improvements you want - its still yet another damn file format that is not built into what we've already got.

    sometimes it seems that people just like to try to invent something for the sake of inventing it, not on whether or not it fills some sort of real need.

    --
    Hi! This is the Sig, blatantly attached to the end of this comment.
  98. Browsers by anpe · · Score: 2
    so don't expect it to get a lot of support on any major Web sites
    Images are displayed by browsers so they have to implement this image format in IE and Netscape before it can be used by Web sites. And if you consider how slow was implemented support for PNG which was free, you can expect support to djvu to be much slower.
  99. what DjVu may mean ? by Atreide · · Score: 1

    "DjVu" makes me think of french "Deja Vu" which means "already seen"

    --
    The world belongs to those who get up early. - I'm far from being the king of Earth then :-(
    1. Re:what DjVu may mean ? by billybob2001 · · Score: 2
      Wasn't that already posted?

      Wait - I get it ;,)

  100. It wil not replace a the "standards" we have now by funkman · · Score: 3
    Here is the press release. Their market is not for general sites like slash, yahoo, etc. This is a business oriented product for storing digital assets, so they may easily be cataloged and transformed into a format a user may see. From their press release, some example apps:

    Corporate digital asset collections

    Real estate sites

    Online catalogs/retail companies

    Auction sites

    Libraries

    Medical sites

    Geospatial imagery/government agencies

    Corporations are storing everything digitally now: pictures, instructions, etc and need are searching for a way to manage all of this. This product is attempt to fill that void.

    In a nutshell, this will be a specialized format that we will see for businesses that need to pass digital assets to the user.

  101. Smaller then PDF? by plastik55 · · Score: 2
    They seem to be focusing on print documents (LizardTech mentions using it to present a catalog online, for example.) I have a hard time believing their claims of being smaller than PDF for that purpose--their figures for improved compression appear to be derived from pdf's made by scanning a print document in full color at high resolution. Who makes pdf's that way? It might be useful for archival purposes, but anyone who's distributing catalogs and non-archival documents online is going to make pdf's the correct way (ie, ps2pdf or some other way of making a pdf that isn't just a big bitmap.) anywho, PDF is a vector format, and when used correctly, will be smaller than anything DjVu can accomplish, Their "comparison" is an astonishing case of apples and oranges.

    --

    I have a positive modifier on Troll. When I mod someone Troll their karma should go UP!

  102. Not a web image format by Nailer · · Score: 3

    Its more a document archiving format than a new web image format [although it happens to be viewable over the web] - as the article states although its non vectorized, it uses layered bitmaps to create more efficiently encodable data chunks.

    And, actually, there is Linux support, and source code available. Just Lizardtech aren't going out of the way to tell anybody about it - see my above post :-)

    1. Re:Not a web image format by Coriolis · · Score: 1
      Just Lizardtech aren't going out of the way to tell anybody about it

      Um. It took me one click to get to a page where I could download the Linux SDK, and from that page one more click to get to the page from which I could download the source code. The only other thing they could have done is plaster "LINUX SOURCE AVAILABLE" all over the homepage 8-)

      On a side note, would it have CmdrTaco to do some research before posting?

      --
      Rgasuya aata! : I have been coding Perl and cannot tell where my fingers are now!
  103. Hey Taco by clink · · Score: 1

    Why bother posting the article if it's such a non-event?

  104. Old news... by bradipo · · Score: 1

    This graphic format has been around for at least 3 years now... How can they tout it as new?

  105. Image format? No. Document format. Yes. by prototype · · Score: 2
    The format they discuss on the LizardTech page is for both an image and a document format. The DjVu format is really just a new document format for viewing, you guessed it, documents (which includes images). This isn't a new image format that will replace GIF, PNG or JPG. It's just another way to put together documents (as if we don't have enough formats out there for this). They also have an image format that is more compact than other ones, but I doubt it will become a standard of any kind.

    Now the claims on the document format bother me. First, they compare it against PDF which we all know is large and bloated to begin with. Sure, if I took any document and separated out the text and formatting and images, compressed it down I'd probably have a "new" revolutionary document format. Doesn't this just sound like HTML? I've written server side scripts and client applets that will compress HTML the same way, and I think my results would be about the same as this (and perhaps faster?). You'll still need their plug-in to view their documents. They also say that a 2.5gb TIFF is compressed down to 3mb. Wow. I can do that now if I convert the TIFF to JPEG with little loss of quality. I really don't see what the big deal is about.

    I wish the /. reporters would do a little research before they go posting messages that send the readers into a frenzy of clicking and sending off emails to friends about the next wave sweeping the internet.

    liB

  106. Not news, check FIASCO! by hadessPPC · · Score: 1

    This is not news, it's already been on Slashdot I guess. This codec is from AT&T's labs. For a really new image codec, check Ulli's website: http://ulli.linuxave.net/fiasco/. It would be nice to add FIASCO! support to the GIMP and gdk-pixbuf for example...

  107. No linux support? by bradipo · · Score: 1

    If there is no linux support, how come I have a plugin for it on my slackware box? It's called nsdejavu.so (maybe its not the same, but it sure looks like it to me).

  108. Some facts off the top of my head... by gary.flake · · Score: 4
    As far as I can tell CT's post and the article have anumber of things wrong. I've known some of the people involved with DjVu for a couple of years, so let me list a couple of facts in no particular order:
    1. DjVu was originally developed at AT&T by a group that has traditionally worked in machine learning. LizardTech purchased the technology from AT&T.
    2. This format is specialized for scanned documents.
    3. The technology is very different from just about everything else because it seperates background and foreground planes. The background is compressed with wavelets, and the foreground probably uses a form of clustering on characters shapes (in a typeface and language independent manner). As a result of the latter, you get a form of OCR almost for free. You can also do text search.
    4. Everything can be viewed at 300dpi directly in your browser and in realtime (you normally only view at 100dpi but you can zoom in).
    5. The linux viewer plugin and compressor has been available for years.

    The main attraction of DjVu is that your scanned documents are tiny (typically less than 50KB) which makes it feasible for putting them on the web. Just about every other format results in files too big for easy distribution on the web. Interestingly, you can convert a *.ps.gz file into a DjVu file, and see a dramatic improvement in file size while preserving almost all of the detail. I am not talking about simple pages here, by very complex ones with a mixture or real images / artwork, and text.

    Apologies for any mistakes, but I think that I got most of it right.

    -- GWF

    1. Re:Some facts off the top of my head... by dmarney · · Score: 3

      I have been using DjVu for more than a year now, and have tested it extensively against other image compression technologies. If you've got scanned images to display on the web, especially if they are in color, then DjVu is far and away the best technology on the market. Here is just a short list of the things that I like about DjVu: 1. I can encode a 20MB color scan into a 100-200K file, and still get a great-looking image on the screen. I can create astonishingly tiny B&W images (how about a full page of text scanned at 300 DPI rendered in 17K?) Nothing else comes close. 2. I can create separate image and data layers. DjVu produces a color background layer, color foreground layer, high-contrast B&W layer, and a data layer. This is essential for doing OCR, where we really need to have that B&W image to get to the text. 3. I can encode the DjVu image to automatically upsample to match the greater resolution of my printer vs. the screen, in the same file. 4. I can create a multi-page document either as a single file, or as a linked list of indivdual pages, with a file for each page. No more horrid PDF byte-serving! (Please pardon us as the Author weeps for joy.) 5. I can construct a URL that will drop a user into the middle of a document (page 50 of 100, for example), and not lose the context of the other pages. 6. I can use the EMBED tag to provide automatic installation of the free DjVu viewer, and I get to specify which image comes up once the software is installed. There are no sign-up forms, no harvesting of my customer's email addresses, and no taking the user out of my visual space. 7. The viewer zooms and pans on the fly. You can zoom in to 100%, and pan through the image simply by clicking and dragging the mouse. This is the only way this should be done. 8. All of the encoding and decoding tools are completely free for eval purposes. The decoder is free for all purposes, and most of the source is open. 9. Once I create a DjVu file, I can convert it to other file formats such as JPEG. Try doing that with PDF! 10. IT LOOKS BETTER -- A LOT BETTER -- THAN ANYTHING ELSE.

  109. We tested it, I invite you all to do the same by ubi · · Score: 1

    We tested the new image format some months ago (in the sense that we tested its usage with the plugin) and I must admit that decompression is quite fast and compression ratio is quite noteworthy.
    I seem to remember that the compression process is a bit slow but I'n really sure about it.
    Do not ask me wether the plugin is strongly optimized or its the format in itself which is smart, I'm just talking about results.

  110. I sat through a dog & pony show on this 2 yrs ago- by ralphc · · Score: 1

    Put on by AT&T. Got a "compressed" T-shirt for my trouble (shrink-wrapped, about the size of 3 CD jewel cases stacked).
    Anyway, things not mentioned elsewhere. The main "cool" things that the exhibitors mentioned was that DjVu would take an image that had graphics and text on it, distinguish between the two and on the same image compress the graphics part with wavelets and the text part with a fax-like compression whose name I don't remember.
    The text compression worked by taking all the characters and deciding which ones were similar, store a small image for the character and just remember the placement of that character. For example "hi there" would store h, i, t, e, and r once then slap the images down in the correct places when decompressing. The nice part about this is that it works just as well on Chinese characters or other pictograph languages as it does on Western languages. Those of you wanting to scan and store Asian languages should give this a look.
    As for PDF, if you're generating the PDF file yourself you're not going to do any better with DjVu since it just treats the PDF as a big image. I got the 1040 PDF off of the IRS site and ran a DjVu compressor on it and it came out about the same size since PDF's just have strings of characters for text.

  111. DjVu Does What PDF Can't by dmarney · · Score: 1

    Here are just a few of the things that DjVu can do that PDF cannot:

    1. Encode a 20MB color scan into a 100-200K file, and still produce a good-looking image.
    2. Encode a 300 dpi B&W scan into 17K, and still be able to read the image.
    3. Create separate image and data layers.
    4. Create a multi-page document either as a single file, or as a linked list of indivdual pages, with a file for each page.
    5. Construct a URL that will drop a user into the middle of a document (page 50 of 100, for example), without losing the context of the other pages.
    6. Provide automatic installation of the free DjVu viewer without sign-up forms that harvest my customer's email addresses
    7. Zoom and pan on the fly.
    8. Convert to other file formats such as JPEG.

    For my money, DjVu is a superior file format that has been nicely tuned for the web. PDF was optimized for print pre-press, and it shows.

  112. I saw this working at Seybold by karzan · · Score: 3

    LizardTech had a booth at Seybold this year, and let me tell you this technology is very, very impressive. They demonstrated extremely high-res files at various zoom points--and explained that the files were very small. This thing also worked lightning fast. Except for a just-visible delay, zooming happens almost instantly. Another thing: I'm fairly sure it was doing raster, not vector. In any case, it was obvious it went way beyond PNG.

  113. Wow, a new image format. by commandant · · Score: 1

    Remember when PNG was supposed to be The Next Big Thing (TM)? PNG was billed as the greatest image format for the Internet, since it was compact, high-quality, supported interlacing, and (I think) animation.

    When was the last time you saw a PNG image? I've never seen them, except when I've made a few. The fact is, the image formats that are ingrained into people's sites and minds are GIF and JPEG. Nowadays, even digital cameras store images in JPEG. I remember when they used to have proprietary formats, like KDC on the Kodak DC40. That thing was $1000 when we got one, now you can get better cameras that store JPEGs for a couple hundred at most. Amazing.

    Just as well, about DjVu. What kind of stupid name is that for a file format? How do you even pronounce it? I also fail to see the advantage of compressing foreground and background separately, but hey, I'm no imaging expert.

    New image format? I'll believe it when I see it. Everywhere.

    I do not belong in the spam.redirect.de domain.

  114. I'd charge you too by xcedrinod · · Score: 1

    If I invented some crazy cool new image compression format... I'd charge admission too. Isn't that sorta what the wavelet compression plugin publishers are doing? Getting your personal info in lieu of money, i guess... $.02

  115. Yeah, but... by Black+Parrot · · Score: 3

    > Apparently, it differentiates between forground and background components of an image, and compresses each appropriately.

    Yeah, but can it detect the difference between nude and naked?

    --
    Sheesh, evil *and* a jerk. -- Jade
  116. Faster download, not faster compression by elendril · · Score: 1

    Part of the betanews article is misleading : DjVu does not compress 'faster' than PDF/GIF/JPG but more efficiently, which enable faster transmission of compressed documents.
    Note to the editor : please read the article more carefully next time. Thanks.