Slashdot Mirror


ODF Threat to Microsoft in US Governments Grows

Tookis writes "Another setback for Microsoft has cropped up in the space of document formats in government organizations. The state of California has introduced a bill to make open document format (ODF) a mandatory requirement in the software used by state agencies. Similar legislation in Texas and Minnesota has added further to the pressure on Microsoft, which is pushing its own proprietary Office Open XML (OOXML) document format in the recently released Office 2007. The bill doesn't specify ODF by name, but instead requires the use of an open XML-based format."

4 of 269 comments (clear)

  1. Re:Define Open by zappepcs · · Score: 5, Informative

    FTFA:
    "The new bill, introduced by Californian Democrat Mark Leno, does not name ODF specifically but has stipulated that by 2008 agencies must be equipped to store and exchange documents in an open, XML-based format. Although the name of Microsoft's Office Open XML suggests that it would match the requirement, it is in fact a proprietary format that would fail the open standards test."

    It appears that there are more tests than the blurb indicates as to what 'standard' would be accepted. To me, it sounds like the bill is not trying to eliminate any possible software, simply to ensure that all of the apps can play nice together. That is common sense to me as far as business decisions go.

  2. Re:Define Open by jeevesbond · · Score: 5, Informative

    the specs for these formats are going to be so complicated that nobody will be able to open the file in a text editor and just read through it.

    I have untarred several documents from the ODF family and found them easy to understand. I would suggest you do the same as the software to create these files is Free. If you can't be arsed to do that, then stop writing inane commentary. :)

    The specification for ODF is available online. Since that is the case, please attempt to read it before spouting-off about it being unreadable. It is 722 pages long, I've had a brief look at it and it seems very readable (better than that: it looks implementable!)

    In my opinion Microsoft's format is neither XML, or open. It's binary, patentable cruft in an XML wrapper. So it's best not to describe it as an 'XML Format' at all. The specification for this is reportedly 6,000 pages long. This is also available online.

    The advantages of XML file formats are:

    • Increased Robustness
    • Document Archiving
    • Version Interoperability
    • Documented and Transparent File Content
    • Standards Based
    • Easy Import and Export of Other File Formats
    • Search Engines / Knowledge Management Systems

    All of these were copied from the OpenOffice Web Site, explanation of the items in that list can be found there.

    --
    I'm going to transform myself into a mighty hawk. Either that or I'll just go and work at Dixons, haven't decided yet.
  3. Re:X(HT)ML+CSS? by nitsuj · · Score: 5, Informative

    Read:
    http://old.opendocumentfellowship.org/Articles/Int roductionToTheFormatInternals
    http://old.opendocumentfellowship.org/Articles/For matODFVsMSXML

    And let me know if you still think the ODF is merely a 'memory dump in angle brackets'. Maybe they could have reused a good chunk of CSS, but that would also require another type of basic parser in implementations. I imagine you've heard of expat, but can you name a standard CSS parser library? I can't, and once upon a time, I had CVS checkin privs on mozilla. Looks simple enough, but ask a web developer if they've ever heard of any major browser having CSS parser bugs.

    And it looks like ODF's style definitions could maybe be generously described as CSS in XML, too. Regardless, I think you could make a pretty compelling argument that the layout needs that have historically driven CSS are a little different than a word processor's needs.

    Back when I worked on Abiword, the native format was very similar to XHTML/CSS. Some arbitrary element renamings -- I believe our equivalent to the span tag was a single letter. The XML->XHTML conversion could probably have been handled by a simple sed script.

    For styling, we reused as much CSS as possible. I learned about a lot of nifty stuff in CSS3 back then. I hope I get to use some of that stuff in browsers some day. But we were well on our way to the first draft of a hypothetical CSS3 Wordprocessor Module, too.

    The OOXML format does strike me as a brain dead C struct to XML encoder, however. And I know the doc format pretty well, having written some non-trivial bits of wvware and the Abiword importer based on it. We actually once got a post on the mailing list from someone looking for technical details on the doc format, and they had been forwarded to us by someone on the Word team at Microsoft. They had their time-tested, battle-worn libraries, but we apparently understood the actual bytes better than anyone still in Redmond willing to help a customer.

    But we all knew that the eventual Microsoft XML format was going to be silly. Actually, it's better than I expected. I had considered the occasional base64 encoded binary data structure wrapped in data tag to be a very real possibility.

    In my mind, the most astonishing thing is that they just arbitrarily reimplemented -- and generally very badly -- dozens of standards, including many ISO ones. I believe they have several novel timestamp definitions, in addition to ISO's.

    I'm pretty shocked anyone is even pretending OOXML is being seriously considered as a standard. I think some people in Redmond had an April Fools' joke get out of hand. If this gets standardized, I expect the next anti-trust case is going to reveal internal Microsoft emails with text such as "holy shit, ISO just accepted our format!"

    PS: I don't even read slashdot that often anymore, and I very rarely post. The few times I do, I generally don't even bother to login. But it would seem that several years of random hobbyist open-source contributions have made me quite likely one of the top few dozen or so domain experts on the planet regarding your specific post. I thought that was kind of amusing myself. I don't know if anyone actually cares, but my name is Justin Bradford, and I imagine google retains sufficient evidence of what I claim.

  4. Re:Define Open by k8to · · Score: 5, Informative

    Dammit people, read the damn bill, it's quite short. It has a four part test for formats to be adopted.

    1. Interoperable among diverse internal and external platforms and applications
    2. Fully published and available royalty-free
    3. Implemnted by multiple vendors
    4. Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard

    It's not perfectly worded (what are internal and external?), and it's not a perfect list, but it's a quite reasonable starting place and it doesn't allow any of the hand-wringing excuses I'm seeing in these comments. This open document stuff has been being debated in the public sector for some years now. Politicians may be many things, but they're not incapable of reading.

    I've written my California Assemblyperson, you can too.

    --
    -josh