Slashdot Mirror


KOffice To Use Open Office File Format

InodoroPereyra writes "This article at The Dot indicates that the KOffice developers decided to switch to the Open Office file format (OASIS) for their next major release. Excellent news both for KOffice, which will benefit from OpenOffice's excellent filters, and for the GNU/Linux Desktop users in general, who will benefit from a unified file format standard between these office suites."

4 of 48 comments

  1. That other office suite by __past__ · · Score: 5, Interesting

    Let's see how long it takes for that other office suite vendor to see the light. After all, they are an OASIS member themselves...

  2. Additional XML benefits by neglige · · Score: 4, Interesting

    Using an XML-based (and documented!) file format has additional advantages. First and foremost, the documents can easily be used by other applications, e.g. a full-text indexer. Generating metadata has never been easier ;)

    Or apply a stylesheet to the document and adapt it for, say, mobile devices (my favourite topic, I must admit). XML->HTML, XML->WML, XML->cHTML ... no problem. It's even possible to extract an abstract, collect hyperlinks from the document and present them separately, leave out the graphics (or convert them)...

    Is this possible with .doc? I'd guess so. As easy as with XML? Don't think so.
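
To make the indexing and link-collecting idea concrete, here is a minimal Python sketch. It assumes the document is a zip package with a `content.xml` member and that paragraphs and links use the `text:p` and `text:a` elements with the namespace URIs from the published OpenDocument specification; the sample document is hand-written and hypothetical, and the function names are made up for illustration.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# Namespace URIs as published in the OpenDocument specification
# (assumed here; older OpenOffice.org files used different URIs).
NS = {
    "office": "urn:oasis:names:tc:opendocument:xmlns:office:1.0",
    "text": "urn:oasis:names:tc:opendocument:xmlns:text:1.0",
    "xlink": "http://www.w3.org/1999/xlink",
}

# Hypothetical, hand-written sample; real files carry far more markup.
SAMPLE_CONTENT = (
    '<office:document-content'
    ' xmlns:office="urn:oasis:names:tc:opendocument:xmlns:office:1.0"'
    ' xmlns:text="urn:oasis:names:tc:opendocument:xmlns:text:1.0"'
    ' xmlns:xlink="http://www.w3.org/1999/xlink">'
    '<office:body><office:text>'
    '<text:p>KOffice adopts the OASIS format.</text:p>'
    '<text:p>See <text:a xlink:href="http://example.org/">this page</text:a>'
    ' for details.</text:p>'
    '</office:text></office:body>'
    '</office:document-content>'
)


def make_sample_package() -> bytes:
    """Build an in-memory zip that mimics the layout of an office document."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("content.xml", SAMPLE_CONTENT)
    return buf.getvalue()


def extract_text_and_links(package: bytes):
    """Return (paragraph texts, hyperlink targets) found in content.xml."""
    with zipfile.ZipFile(io.BytesIO(package)) as zf:
        root = ET.fromstring(zf.read("content.xml"))
    # itertext() flattens nested inline elements such as links into plain text.
    paragraphs = ["".join(p.itertext()) for p in root.iterfind(".//text:p", NS)]
    href = "{%s}href" % NS["xlink"]
    links = [a.get(href) for a in root.iterfind(".//text:a", NS)]
    return paragraphs, links


if __name__ == "__main__":
    texts, urls = extract_text_and_links(make_sample_package())
    print(texts)
    print(urls)
```

Only the standard library is needed, which is rather the point: a full-text indexer can get at the content without knowing anything about the application that wrote the file.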

    --
    My cats ate my karma. They also wrote this comment.
    1. Re:Additional XML benefits by swillden · · Score: 3, Interesting

      "The xml files would be a lot bigger than a binary format, but the zipping process manages to get it down to about the same size again most of the time."

      Incorrect. Go try it on a few documents. In practice, I see that OOo Writer documents (without images) are less than half the size of their Word counterparts, and OOo is not (yet) very careful about the XML it spits out, tending to save lots of style and other information that isn't even used in the document.

      The zipping process makes the files a lot *smaller* than you normally get out of a binary file format. Why? Rather simple, really. In most binary file formats (e.g. Word), the formatting information is fairly compact, but the content isn't compressed in any way. Given that English text has about one bit per character of entropy and given that (hopefully!) there's much more content than formatting, there's a lot of room for compression to do its work. In the case of embedded images, it really doesn't matter what format you use, they don't compress, but the XML doesn't add a significant amount of overhead to them, either.
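
A rough way to try this yourself, sketched in Python. It uses `zlib` (the same DEFLATE algorithm that zip uses) on a handful of made-up sentences, once as plain text and once wrapped in paragraph tags. The tag names and the sample text are illustrative assumptions, not taken from a real document, and actual numbers will vary with the content.

```python
import zlib

# Made-up sample text; any ordinary prose will do.
SENTENCES = [
    "The developers decided to switch to a documented file format.",
    "Other applications can read the content without special filters.",
    "A full-text indexer only needs a zip library and an XML parser.",
    "Stylesheets can turn the same document into pages for small screens.",
    "Embedded images are already compressed and gain little from zipping.",
    "Formatting information is usually much smaller than the text itself.",
    "Long-lived archives benefit from formats that anyone can implement.",
    "Users of both suites can exchange files without conversion losses.",
]


def wrap_as_xml(sentences):
    """Wrap each sentence in a repetitive paragraph element (illustrative markup)."""
    body = "".join(
        '<text:p text:style-name="Standard">%s</text:p>' % s for s in sentences
    )
    return "<office:text>" + body + "</office:text>"


def sizes(sentences):
    """Return byte counts for plain and XML-wrapped text, raw and deflated."""
    plain = "\n".join(sentences).encode("utf-8")
    xml = wrap_as_xml(sentences).encode("utf-8")
    return {
        "plain": len(plain),
        "xml": len(xml),
        "plain_deflated": len(zlib.compress(plain, 9)),
        "xml_deflated": len(zlib.compress(xml, 9)),
    }


if __name__ == "__main__":
    for name, size in sizes(SENTENCES).items():
        print(name, size)
```

Comparing `xml - plain` with `xml_deflated - plain_deflated` shows how much of the markup overhead survives compression: the repeated tags cost very little once deflated, while the prose itself is where most of the savings come from.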

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  3. Electronic Publishing by SgtChaireBourne · · Score: 5, Interesting

    Yes, let's hope this will be a new trend. The last round of open standards (e.g. TCP/IP, HTTP + HTML) brought a lot of good, especially HTML. I'm curious to see where this step will lead.

    I suspect that it is also a big step closer to electronic documents with a long shelf life. This may lead towards electronic publishing where well-formed and, possibly, valid documents become the norm. Even if the structures are rudimentary, this still will help portability and retrieval.

    Right now, [X]HTML and PDF are only part way there. PDF is useful for rapid dissemination, but can more or less be thought of as a compact form of paper. Most HTML documents are neither well-formed nor valid and often too dependent on transient constellations of technologies. So, a format like this will let organizations choose tools suited for their specific needs and tasks.
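
For anyone unsure what "well-formed" buys you in practice, a minimal Python sketch: a strict XML parser either accepts the markup or rejects it outright, so a tool can tell at once whether a document is usable. The helper name `is_well_formed` is made up for illustration. Note that this checks well-formedness only; checking validity against a DTD or schema needs a validating parser, which this sketch does not attempt.

```python
import xml.etree.ElementTree as ET


def is_well_formed(markup: str) -> bool:
    """Return True if the markup parses as XML, False otherwise."""
    try:
        ET.fromstring(markup)
    except ET.ParseError:
        return False
    return True


if __name__ == "__main__":
    # XHTML-style markup: every element is closed.
    print(is_well_formed("<p>Hello<br/>world</p>"))
    # Typical tag-soup HTML: <br> is never closed, so the parse fails.
    print(is_well_formed("<p>Hello<br>world</p>"))
```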

    --
    Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.