Slashdot Mirror


The Future Is Open: The OpenDocument Format

Daniel Carrera writes "I've written an article for Groklaw describing the OpenDocument format: 'I asked Daniel Carrera, an OpenOffice.org volunteer, if he'd please explain the OpenDocument format. How does a format get chosen? And is OpenDocument on the list of acceptable formats for governments like the State of Massachusetts? We are all concerned about proprietary formats and standards, and more and more governments are adopting policies requiring open standards, it's a very important subject.' It's currently being considered by the EU Commission as a candidate for an official format."

39 of 210 comments (clear)

  1. It has always baffled me... by krudler · · Score: 5, Insightful

    Why people never even consider that something else exists other than MS Office. It's not just a philosophical argument, everyone I know has ran into problems with a .doc from a different version that doesn't open. It is hard for some people to do work at home, then bring it to work/school and use it! If it's a .doc, it should work in every version of work. The same goes for all the other formats.

    Krudler

    1. Re:It has always baffled me... by elid · · Score: 2, Insightful

      Well, I think that depends. You can't expect every version of MS Word to support newer features (although the ability to read the rest of the document should be unaffected). However, never versions of the software should always be capable of opening older documents.

    2. Re:It has always baffled me... by pdiaz · · Score: 3, Insightful

      Because in these days most of the people that use computers are not computer experts (not that this is a bad thing though)

      If they have a problem with microsoft word, they don't usually blame the program. For them there is no distinction between the software that runs on their computers and the computer itself. They blame the computer, because they don't know better.

      --
      Make It Secret . Free JavaScript implementation of AES for your browser
    3. Re:It has always baffled me... by yotto · · Score: 3, Insightful

      Software?

      To most people Microsoft is synonymous with computers period.

    4. Re:It has always baffled me... by fm6 · · Score: 4, Interesting
      It's not just a philosophical argument...
      Indeed, the philosophical argument is of no interest to anybody except a few geeks. But there are a lot of practical reasons to want alternatives to MS Office. Not just the reasons you mention, but issues of cost, and of problems caused by overdependence on a single notoriously flaky company.

      But if you're baffled by people's adherence to MS Office, then you've never used this kind of software in a real-world environment. Being able to pass a file around without interopeability problems is crucial. Given the messy kind of data most people have to deal with, the only way to do this is to standardize on a specific set of tools from a specific vendor. In the past, you had real competition between Microsoft, IBM/Lotus, WordPerfect, and others. It was inevitable that one company would win the desktop application wars, though I wish it wasn't the same company that also won the desktop OS wars.

      If you're going to end this monopoly, you're going to have to overcome the same social and economic forces that drove Lotus and WordPerfect into niche status. There's more to doing that than simply coming up with a technicallly supperior or more open product.

    5. Re:It has always baffled me... by TheKidWho · · Score: 2, Interesting

      Ive recently started using Apple's iWork suite, in particular Pages, and I love it much more then MS Office!

    6. Re:It has always baffled me... by doj8 · · Score: 3, Informative

      Actually, I used to write document conversion programs many years ago (back when dedicated word processors were still common). There were a couple of document formats which did support forward compatibility. It's been 25 years, but I believe IBM was one. The other escapes me (maybe Aquarius?). (I guess you can count SGML as supporting forward compatibility too.)

      Basically, an older program could read any version of the document format. When it encountered elements it did not recognize, it retained them, but ignored them for rendering purposes. So, when the document was saved, the ignored elements were saved with it. A newer version of the word processor could then use them, even after the older version had edited the document.

      You are mixing programs and document formats. The two do not have to have the same behavior.

      --
      -- Dan Jenkins, Rastech Inc.
    7. Re:It has always baffled me... by yagu · · Score: 3, Interesting

      ..., But if you're baffled by people's adherence to MS Office, then you've never used this kind of software in a real-world environment. Being able to pass a file around without interopeability problems is crucial.

      I am STILL baffled... I have attended meetings where I worked where people literally were not able to print or view agendas, etc. ahead of the meeting because of the incompatibilities among the microsoft applications! Were it not so counter-productive to the work at hand, it would have been funny. (And this was/is an almost every-meeting event.)

    8. Re:It has always baffled me... by 808140 · · Score: 4, Insightful

      You know, I'm not sure that's the case as much anymore... or really that it ever has been, actually.

      The reason is simple: Apple. Now, Apple has taken a second (third?) seat to MS-based PCs for a long time and I think they probably will continue to do so for the forseeable future (ie, I am not an Apple zealot). But Apple remains a name-brand that exists in the public, non-geek consciousness. While their current success is due almost entirely to their iPod, in most people's minds, they remain a computer company.

      I believe that part of the reason that alternative browsers like Firefox are beginning to gain ground is because of MS's discontinued support of IE on the Mac. Despite the fact that not many people use Macs, many of the people that do are not geeks, and those "not-geeks" were forced to consider the browser question in a more realistic way when MS discontinued Mac support. Up until then, they likely considered (as most people do) that IE was the internet.

      Now they know better, and as you've probably noticed on Slashdot, Mac-types are a loud bunch -- even the non-geeky ones. They use Firefox or Safari and they make a big fuss about it. They're convinced of a conspiratorial anti-Macintosh agenda on the part of, well, pretty much everyone and they complain loudly when things don't work well on their macs. Nowadays, this includes websites.

      My point in all of this is that MS has been the big bully in the industry for a long time. Apple, Sun, IBM -- all would be exactly like MS if their roles were reversed (IBM in fact was, at one time) -- but as it stands, all would like nothing more than to see MS toppled.

      Individually, each of these companies represents a feeble marketshare. Together, it still isn't much, but it's enough, I think. They have the users required and the lobbying power, too, to really make a difference. IBM and Sun have always had the problem of being companies only IT people really know much about, due to their lack of penetration on the desktop. Apple, on the other hand, is widely seen as a desktop system normal people actually use, and so Apple being on board hopefully will make more non-industry folks aware of what's going on. Unfortunately, these three companies haven't been keen on cooperating on things like formats precisely because of the lack of open standards -- none of them wants to allow a competitor to dictate the structure of any format.

      Each of them produces its own office suite; each of these is MS Office's bitch. By making sure that their office suites all interoperate 100% with an open format, and by lobbying governments (especially non-American governments) with arguments about (American) vendor lock-in, I believe they can make in-roads into ODF adoption.

      If governments use it, large companies and contractors will be forced to use it as well, even if infrequently. They will quickly find MS Office's inability to save into these formats annoying (which will not force them to switch to another office suite, but which will cause them to lobby MS to support the format).

      Big companies = big clients = big money. Add this to the fact that any law requiring a government to adopt an open format that MS Office doesn't support will make the use of MS Office illegal in a de facto sort of way, because of its non-compliance.

      If (and that's a big if) all of this happens, if the laws pass, and IBM/Apple/Sun manage to cooperate for a change, I expect that MS Office will include support for a usable subset of ODF. What they will not do -- what they will never do -- is make it the default format. Further, they will likely ensure that some features of their doc format cannot be saved in ODF, allowing them to pop-up the little box that warns the user that "some formatting information may be lost, proceed?"

      This will make little difference to governments legally required to avoid doc, but this will be enough to prevent widespread adoption in the private sphere.

  2. Talking to yourself again? by Anonymous Coward · · Score: 2, Funny

    Daniel Carrera writes ... 'I asked Daniel Carrera, ...'

    Hmm... /me scratches head

    1. Re:Talking to yourself again? by Zaiff+Urgulbunger · · Score: 2, Funny

      You know, deep down, we are all Daniel Carrera on the inside! O_o

  3. Re:Death to PDF..?? by sewagemaster · · Score: 2, Informative

    on windows, stay away from any adobe readers above version 4. i have no problems with version 4 on my pentium 2.

    on linux, acrobat's *ok*, but gpdf and xpdf are pretty decent and very fast. the new version of kpdf in kde 3.4 is going to be great as well.

  4. To speed up the loading time... by elid · · Score: 2, Informative
  5. From a user's perspective by bigberk · · Score: 3, Informative

    For a similar discussion, but from the perspective of an OpenOffice.org user, check out this article (even though it's really talking about OO.org, there is a section where it goes into the advantages of open formats for data interchange and longevity/archival). The XML format discussed there is I believe the same as OpenDocument

  6. Re:It's nice to be optimistic but... by game+kid · · Score: 3, Insightful
    Years down the road, when everyone is still using Microsoft formats, I'll be sure to remember the prophetic vision of this article.

    Agreed. What MS Office will always have (and Lotus/Apple/et al. before that had) is ad money. They can sell to the schools and offices that move formats in the first place.

    Over the years, people at home bought/received WordPerfect/MS Word/etc 'cause they needed them to use the formats they used at school and work. What OpenOffice.org needs, I think, is an even larger word of mouth (or mouf?) campaign. People have to know it exists, that it can be used with the suites still used at work, and, of course, that it's a free download and legal to copy.

    --
    You can hold down the "B" button for continuous firing.
  7. Actually.. by Rezonant · · Score: 3, Interesting

    After all complaints about the slowness of Adobe Reader 6 they have sped up version 7 A LOT. It starts almost instantaneously and even performance within the program is much better.

  8. Not true. Move on. by Saeed+al-Sahaf · · Score: 4, Insightful
    Why people never even consider that something else exists other than MS Office

    First, Word Perfect is still King in law offices and certain other niche areas. But two words: "Market Saturation". If you need to communicate with the majority of people and business out there, if you're not sending .doc you might as well just send a random string of characters, so it's a matter of if you want to do business or not.

    everyone I know has ran into problems with a .doc from a different version that doesn't open

    Also, most people don't have problems opening Word docs that are not the latest version, this is simply an anecdote perpetuated by people that don't like Microsoft. Right now, I have Office 97 (which I actually have owned since about that time) at home, and have never had any problems opening brand spanking new Word docs.

    I support open document formats because it promotes competition in the areas of application user experience that count like usability. I would very much like to see OpenOffice mature to a point where most people including large companies would feel safe transitioning. But repeating these discounted "stories" of version incompatibility help no one.

    --
    "Who are in control, they are not in control of anything - they don't even control themselves!" - Glen Beck
    1. Re:Not true. Move on. by gardyloo · · Score: 5, Insightful

      most people don't have problems opening Word docs that are not the latest version, this is simply an anecdote perpetuated by people that don't like Microsoft.

      Great. But the point is that no one, if the program were committed to being more compatible with past versions, should have problems. I have problems opening Word docs in several versions, whether they were created on older versions or on the newest ones. And many people I know do, too.
      I don't care if 70% of people who use Office haven't had compatibility problems. I DO care that at least half of the people I work with do or have had problems with it. When you say "discounted 'stories'", I take some offense, because those stories should NOT be discounted, and they aren't apocryphal -- many are true!
      There are rarely problems with postcript files or .pdfs, and they look much better. There are NEVER any problems with .rtfs, or with plain .txt documents, and even though these don't have the bells and whistles of many Word formats, they're always readable, and always editable.
      There's a higher standard than Word, and there has been for a long, long time.

      I don't hate Microsoft, but their compatibility issues are ridiculous.

    2. Re:Not true. Move on. by civilizedINTENSITY · · Score: 3, Informative

      The biggest problem our campus printing shop has is incompatibilies of versions of MS Word due to different computer labs running different versions (97 thru 2000 Pro). Mostly this seems to relate to embedded graphics and the formating of text around said emedded object. This isn't anecdote perpetuated by people who don't like Microsoft, this is historical fact related to using the campus printing shop, acknowledged by them as well as students.

      Some classes required bound reports (Software Engineering did...), and your only hope is PDF. Crappy formatting isn't an option in a "professional" report.

    3. Re:Not true. Move on. by oliverthered · · Score: 2, Informative

      The Help compiler that came with visuial basic 3 wouldn't work with RTF documents created using word 97.

      I had to write something to reprocess the RTF documents and remove the crap that word 97 had put in there.

      Oh, try opening a text document written on a Mac or Linux on windows, sometimes it forgets that crlf isn't the only form of line termination in the world.

      And for the count, I've had lots of problems with word files of different versions, but word files with different fonts are even more annoying.

      And I wish that I could save spellings with a document, but that's a differnt story.

      --
      thank God the internet isn't a human right.
  9. Wishful thinking by JAFSlashdotter · · Score: 3, Interesting
    While I honestly hope the OpenDocument format catches on and wins out in the end, I really think it's not going to make a major impact until Microsoft Office & Works save in OpenDocument format by default.

    I find that in my experience, most MS Word users have no clue what different file formats are, why they'd care to change, or even that they CAN choose a different type in the "Save As..." dialog. The only time it ever becomes an issue is if the version of Word / Excel / Powerpoint that they're using at work is significantly newer than the one they have at home . If they don't let that completely stop them (maybe "Clippy" shows them how), they learn to choose "Microsoft Excel 97" from the list if they want to take work home. That's the only time they are likely to differ from the default. And when they do that, they get warned what a bad idea it is, because features or formatting may not be available.

    No, I doubt the future is open, unless Microsoft makes open the default.

    --
    We apologize for the preceding message. All those responsible have been sacked.
    1. Re:Wishful thinking by wanderingstan · · Score: 2, Interesting

      You're right that users choosing "Save as OpenDocument" won't be the driving force behind a shift. But I think the shift will happen. The pressure will come from integration with other systems, especially the internet varieties. Right now the MS Office suite covers most of what runs a business, but already strange new tools and data formats are becoming important even for small businesses: content/site management, integration with search tools, blogs, RSS, RDF, automatic translation systems, Wikis, collaberative document generation, and yet-to-be created tools.

      Office can't handle these things. But the people who use and develop these new beasts will certainly find ways to integrate with a standard document format...and long before Microsoft gets around to it. Image being able to use one word-processing tool to create blog entries and update wiki pages, with content integrated to your sites CMS and published on RSS, with RDF automatically extracted, and FOAF used for distribution, and on and on... This can (and will) happen with an open document format... ...As long as it's standardized, and has a modicum of support. And this, I gather, is what OpenDocument is all about.

      [That said, it'll still be an uphill battle. IMHO OpenOffice really sucks compared to MS Office, I just wish I had viable other options.]

      -stan
  10. Re:.txt by Anonymous Coward · · Score: 2, Informative

    Probably you are confused between .txt and .tex. You can use just notepad or wordpad for the first. Only the second one refers usually to LaTeX. But even for that second one you can try, for example, the nice (and free) LyX (there is a standalone version for windows) and there are much other for windows (of course for the free operating systems there are much many).

  11. Re:OpenOffice by Xtifr · · Score: 3, Insightful

    No, I've never looked at an OOo document in notepad; that would require installing a system that runs notepad, finding a copy of notepad, and installing it. That's an awful lot of work to go through to get a crappy plain-text editor that's nowhere near as good as the ones I already have installed. :)

    I have looked at OOo documents in Emacs - many times - and it all looks pretty straightforward to me. With a bit of practice, I bet I could write OOo documents in Emacs. I'd hardly call that a nightmare.

    I'd think that you were just unaware that OOo files are zipped, except that your second sentence implies that you did find text in the document, which seems like it would have been hard if you hadn't unzipped it. So I have to assume that you're just ignorant of XML. No, it's not a "freaking nightmare", it's a simple, pretty straightforward format.

    And, just to complete the trifecta of you being wrong, I'd like to point out that .doc files DO have text you can decipher. I routinely use the UNIX strings command to extract the readable text from .doc files. Before OOo/Abi/etc. had usable MSFT filters, that was the only way I could read .doc files. And it actually works fairly well, if you're only interested in the content, and unconcerned about the format. Which is why I still often use strings to read .doc files. (The strings command loads a lot faster than OOo or even Abiword.)

  12. It seems the Irish Government has copped on by Milton+Waddams · · Score: 5, Informative

    They release their documents in OpenOffice, PDF and .doc format.

  13. Re:OpenOffice by doj8 · · Score: 2, Informative

    It does depend on the version of Word which created the .DOC file, the method of saving (fast save vs. normal) and how heavily edited the document.

    I just searched several hundred .DOC files for human-readable strings. Twelve documents had any significant human-readable text in them, barring phrases such as "Word.Document.8", "Dan Jenkins", etc. - document metadata, in other words.

    Some others had phrases or occaisional paragraphs which were human-readable, the rest was not.

    Years ago I wrote document conversion software (back when dedicated word processors were still common). One of the last conversions I was involved in was Microsoft Word. Their format was extremely complicated compared to the majority of word processors of the time. The format was also not linear. You had to follow binary pointers from section to section of the document. So, the contents of paragraph 23 might preceed paragraph 5. Also, deleted material could well be retained in the document.

    --
    -- Dan Jenkins, Rastech Inc.
  14. Re:Actually True... by Saeed+al-Sahaf · · Score: 3, Insightful
    Just because you've never seen something doesn't mean that it doesn't exist.

    I said MOST people. Also, try PDF for resumes, they get there just the way you want them to, no one can change them (without difficulty)...

    --
    "Who are in control, they are not in control of anything - they don't even control themselves!" - Glen Beck
  15. Re:OpenOffice by frostman · · Score: 2, Insightful

    You might want to check out Antiword.

    I've been using that lately on .doc files and it works great - though I have to admit I haven't tried it on anything very complex yet.

    --

    This Like That - fun with words!

  16. Massachusetts is a Commonwealth by ptimmons · · Score: 2, Informative

    Semantics or no, Massachusetts is a one of four commonwealths. It should not be referred to as the "State of Massachusetts".

    1. Re:Massachusetts is a Commonwealth by iggymanz · · Score: 2, Informative

      "Commonwealth" sounds vaguely socialistic and pinko; Maybe Bush should liberate it.

    2. Re:Massachusetts is a Commonwealth by RmanB17499 · · Score: 3, Informative

      You're correct and it's not socialistic or communistic or anything else.
      Commonwealths "States" of this country are Pennsylvania, Kentucky, Massachusetts, and Virginia. VA doesn't have a Secretary of State -- they have a Secretary of Commonwealth.

      Curious why some states are commonwealths? Read the FAQ -- Why is VA a Commonwealth?
      Starts with: There is no such entity as the "State" of Virginia. While generally categorized as a state, Virginia has been the "Commonwealth" since independence from Great Britain. Virginia is first of four states that are Commonwealths, to include our daughter Commonwealth of Kentucky, which was formed from Virginia in 1792.
      Finally, Puerto Rico is technically a commonwealth, but that's under a different situation, since it is not a State of the Union. It's just the name and form of its local government.

      Each State of the Union is guaranteed sovereignty and a repulican form of government: thus they really are like 50 mini-countries.
      However, they each agreed by compact upon admission or ratification of the Constitution for the original 13-states that they would have a Government of the United States of America (USG) to operate in certain areas and that this government would also be a sovereign, too.

      How each state wants to operate in its sovereign form is up to the people as long as its republic in nature. If New Jersey would like to call itself the "Free and Independent Peoples Democratic Place of the Principality of New Jersey, formerly known as the State of New Jersey." That's up to itself.

      However, from the point of view of the USG each state is just like any other state. That's why we just call them the fifty states. From the national point of view: All states are equal in that they have two senators, elect the president via the electoral college through whatever selection process the state would like to select, and can't be destroyed by Congress.

  17. OpenDoc... there's a 1993 flashback by MarkRebuck · · Score: 2, Interesting

    Did anyone else flashback to the FORMER technology known as OpenDoc [wikipedia] after reading the title of the article?

    Talk about a bad flashback... [shudder]

  18. Prediction Time! by thehunger · · Score: 2, Interesting
    My predictions:
    1. EU chooses OpenDocument for standard
    2. Micro$oft includes support in next version of M$ Office
    3. then adds its own 'extensions' to OpenDocument format
    4. People discover that using anything but M$ Office is a 'hassle' since other products dont support 'extensions'

    Ok so it might not happen exactly like this but I bet they will try to do something similar!

  19. Re:OpenOffice by Geoffreyerffoeg · · Score: 3, Insightful

    No, I've never looked at an OOo document in notepad; that would require installing a system that runs notepad, finding a copy of notepad, and installing it. That's an awful lot of work to go through to get a crappy plain-text editor that's nowhere near as good as the ones I already have installed. :)

    Notepad has become a generic term much like Kleenex, Xerox, Coke, etc. I really don't feel like explaining what "vim" is, what "vi" is, and how the two differ, every time I want to say I opened something in a text editor. If you ask someone for a Kleenex, do they say, "No, but I have a Puffs Plus (or whatever); would you like that instead, or shall I go buy a box of Kleenex?"

  20. Not a state by robogymnast · · Score: 2, Informative

    Hate to be nit-picking, but it is the Commonwealth of Massachusetts. Don't ask me explain how that works or what the difference is, but IIRC there are 3 others.

    --
    unzip ; strip ; touch ; grep ; find ; finger ; mount ; fsck ; more ; yes ; fsck ; umount ; sleep
  21. The OpenDocument Format by demon_2k · · Score: 5, Interesting

    Obvious point : This could help solve most of the compatibility problems between different Office suits. Your work document may one day open in your frineds OpenOffice word processor and look 100% thesome as in you MS Office.

    The problem : Digital Rights Management. Ms might have or might open their XML document format. Other suits might open their format.

    However, can a application be an owner of a license? You could have a DRM'ed document created using Ms Word that is in an "open format" but, only Ms Work is licensed to open it or you are only allowed to open it in Ms Word. Anything else is considered a hack and you could me prosecuted under DMCA.

  22. How about the National File Format? by beetle496 · · Score: 2, Informative

    The U.S. Government has been pursing an XML based National File Format (NFF) for some time. This has currently morphed to the National Instructional Materials Accessibility Standard (NIMAS 1.0), a subset of ANSi/NISO Z39.86 (DAISY 3).

    --
    I paid the going retail price for a Windows screen reader and got a free Unix computer!
  23. we need several open standards by macjohn · · Score: 2, Interesting

    I think there need to be several open file format standards:
    * one for plain text (straightforward, but standardize the /n/c/r)
    * one for rich text (above plus bold, italic, underline, color)
    * one for mixed documents (basically html - mix rtf and graphics)
    * one for rigid formatting (pdf)
    * one for complex documents - including collaboration markup

    Forgetting authoring interface, each is an extension of the one below it. Rich text is still only text. Mixed adds graphics and tables, but no rigid layout control. PDF adds exact duplication or all fonts and layout. The complex document should take the mixed format and add collaboration tools, embedded objects, and stuff like that.

    These 5 formats would give you a right solution for just about any document interchange problem. In fact, the first 3 could be collapsed into one, if they were universally recognized.

    If we could come up with these as published standards, then it would make great sense for governments and corporations to start requiring interchanged documents to be in one of the standard formats.

    Absent published, supported open formats, Microsoft wins.

    --
    --Hi. I'm in Portland and it's raining. This appears to be a permanent condition.
  24. Re:.txt by paretooptimum · · Score: 2, Funny

    I completely agree. OpenOffice is soooooo... in need of a Firefox to its Mozilla suite. Can someone please fork this puppy and dump the bloat.