Slashdot Mirror


California Joins Open Document Bandwagon

Andy Updegrove writes "A legislator in California has decided that it's time for California to get on the open formats bandwagon. If all of the bills filed in the last few weeks pass, California, Texas, and Minnesota will all require, in near-identical language, that 'all documents, including, but not limited to, text, spreadsheets, and presentations, produced by any state agency shall be created, exchanged, and preserved in an open extensible markup language-based, XML-based file format.' What type of formats will qualify? Again, the language is very uniform (the following is from the California statute): 'When deciding how to implement this section, the department in its evaluation of open, XML-based file formats shall consider all of the following features: (1) Interoperable among diverse internal and external platforms and applications; (2) Fully published and available royalty-free; (3) Implemented by multiple vendors; (4) Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard.'"

44 of 188 comments (clear)

  1. Minnesota also by SEWilco · · Score: 4, Informative
    1. Re:Minnesota also by bigtomrodney · · Score: 5, Insightful
      From TFS

      'all documents, including, but not limited to, text, spreadsheets, and presentations, produced by any state agency shall be created, exchanged, and preserved in an open extensible markup language-based, XML-based file format.' The one thing that stands out to me is specifying that it is XML-based. I have no problem with that, the two competing formats are indeed XML based but shouldn't the mandate of the state simply to specify the Open and Cross-Platform aspects?
      What happens for instance if tomorrow all of us wonderful Slashdot readers co-developed a magical format that not only was open and cross platform but inexplicably worked with all currently available office suites without modification... ...but it wasn't XML. Does that disqualify it? I would rather see the politics of this issue left with the politicians, the tech issues left with the techies. I've seen other combinations go horribly wrong DMCA
      --
      I never get used to these constant resurrections
    2. Re:Minnesota also by foniksonik · · Score: 4, Interesting

      If it is an open XML based format then doing a conversion to whatever new format arises should be trivial (maybe not fast, but fairly easy with XSLT). SO better to put it into XML now and worry about what better format may arise later.

      This is good news... why be negative about it?

      --
      A fool throws a stone into a well and a thousand sages can not remove it.
    3. Re:Minnesota also by daeg · · Score: 3, Insightful

      If, and when, such a format comes into play, and has a large enough subset of tools available for it, then the laws can be revised. With all documents already in XML, converting to the new format should be nearly painless, and more likely, both formats could be used.

      The tech needs to be spelled out clearly in the law, otherwise vendors like Microsoft will be able to say their format qualifies and lobby until enough tech-clueless legislators agree to it.

  2. What does XML have to do with it? by beavis88 · · Score: 3, Insightful

    As long as the format meets criteria 1-4, I don't see why it's necessary to specify that it must be XML-based. Keep it simple, and all that...

    1. Re:What does XML have to do with it? by gEvil+(beta) · · Score: 4, Insightful

      There's only one reason, and that's because the higher-ups think that EVERYTHING should be XML based. Of course, they have no idea what this actually means. They just know that it needs to have XML in it, because that's what the other guys are doing.

      This reminds me of my boss, who keeps saying that we need to publish things in XML, but can't give me any reason why we should. Then again, two years ago I kept on hearing about how our company needed a blog, again with no justification as to how it would help us. Thankfully, that passed. Eventually, the XML thing will, too. Of course, this isn't meant to belittle the things out there that actually can benefit from utilizing an XML format.

      --
      This guy's the limit!
  3. Wtf? Why "XML-based", specifically? by Lord+Bitman · · Score: 4, Insightful

    Why not just require the format to be in ANY published standard format? "XML" by itself is meaningless, "extensible" is a loaded term (and a very bad idea when trying to write a way to keep things compatible). Why do lawmakers always have to over-specify things until the purpose of the law is lost?

    --
    -- 'The' Lord and Master Bitman On High, Master Of All
    1. Re:Wtf? Why "XML-based", specifically? by beavis88 · · Score: 3, Insightful

      Why do lawmakers always have to over-specify things until the purpose of the law is lost?

      Job security. If they wrote clear, concise, and sharply targeted bills, we wouldn't need to keep electing a fresh crop to fix the mess left by the last one.

  4. Text in XML? by The+Monster · · Score: 2, Insightful

    ...text, spreadsheets, and presentations, produced by any state agency shall be created, exchanged, and preserved in an open extensible markup language-based, XML-based file format.
    Why would text need to be in an XML-based format, when it can be in a... text format? If you have a text document that doesn't require any formatting, just make it text!
    --

    [100% ISO 646 Compliant]
    SVM, ERGO MONSTRO.

    1. Re:Text in XML? by hey! · · Score: 4, Interesting

      Well, obviously it doesn't need to be xml, but XML does have one nice self-documentation property that plain text lacks: the character encoding.

      If you've looked at project gutenberg texts, you can see why this is a problem. Not a huge problem, but a problem. When a source text has a non-ascii character in it, they have to put some sequence of ascii characters which will suggest what the glyph is supposed to be. This doesn't really preserve the information in the source document, nor does it make the document easy to read.

      So, you could have a trivial text XML format that has only one defined tag. It's still useful:
      <xml version="1.0"? encoding="us-ascii">
      <text>
      This is my text. It has no wacky glyphs so ASCII is fine.
      </text>

      vs.

      <?xml version="1.0"? encoding="utf8">
      <text>
      This is my text. It has wacky glyphs therefôre ascii sücks for it!
      </text>

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
  5. <xml>we should do this too</xml> by wwwillem · · Score: 4, Funny


    <user="wwwillem">
    <subject>we should do this too</subject>
    <content>
        What is good for government documents is also good for Slashdot posts. :-)
    </content>
    </xml>

    --
    Browsers shouldn't have a back button!! It's all about going forward...
  6. What type of formats will qualify? by tttonyyy · · Score: 4, Funny

    Format is irrelevant - since these documents will contain legal-speak, they'll be unreadable anyway. ;)

    --
    biopowered.co.uk - catalytically cracking triglycerides for home automotive use since 2008. Just say no to big oil!
  7. Re:Dominoes by RichMan · · Score: 2, Funny

    > The dominoes are beginning to fall.

    Are they being hit by flying chairs, perchance?

  8. XML panacea by Gothmolly · · Score: 4, Funny

    N00b: Hey we have this data representation problem, we'll use XML!
    Greybeard: Son, now you have two problems.

    --
    I want to delete my account but Slashdot doesn't allow it.
    1. Re:XML panacea by ratboy666 · · Score: 2, Informative

      From a greybeard

      XML Problem Checklist:

      - Ensure the library functions are flexible enough for the application, and do not consume too many resources. This can be an issue for smaller systems, and may be an issue for larger ones as well.

      - Ensure that the XML implementation is usable on a security basis. Should not have buffer overruns - parsing problems etc.

      - Ensure that the XML works with ANOTHER XML implementation. Adds testing (its not just "someone elses problem").

      - Application data structures are influenced by XML (eg. XML does not represent bitfields/bitsets directly). Mappings must be reasonable.

      - XML should be 'vetted so that the format doesn't look like it was laid out by 'noobs. Stuff like the Microsoft specification for XML WORD shouldn't be generated.

      - Because the only reason that XML formats are being used is for interoperability, a separate set of interoperability documents must be maintained. At some point, someone will ask the question "You use XML, I want to stick the data into ... That should be easy (because of XML). Help me."?

      --
      Just another "Cubible(sic) Joe" 2 17 3061
  9. Why is the victim silent? by 140Mandak262Jamuna · · Score: 2, Insightful
    Well why corporate America is so silent on this issue? After all almost all the revenue of MSFT comes from corporations. If the palying field becomes level and multiple vendors compete to serve them while being fully inter-operatble, it will be the corporate America that will benefit most. Technically they are the victim of the monopolistic deeds of MSFT. Still they remain silent, and the Govenment, after protecting the citizens from their own stupidity (seat belts, airbags, spacing between crib railings) now comes to rescue corporate America?

    If Government intervention is what it takes to force a level playing field, I will accept it. But still I would prefer it if market forces create a level playing field instead of government mandates.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
  10. Re:You must not have got the memo by maxume · · Score: 3, Funny

    I can write a text to text converter in about *0 lines* of perl.

    --
    Nerd rage is the funniest rage.
  11. Criteria n3 by DrYak · · Score: 4, Funny

    As long as the format meets criteria 1-4


    In other news, Microsoft is quickly subsidizing 3 small companies to write quick and meaningless stupid plug-ins using OOXML as input, just to pretend that their format is "Implemented by multiple vendors" and on "diverse (...) platforms" (ie.: Windows 98, Windows ME, Windows 2000, Windows XP *and* Windows Vista)...

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
    1. Re:Criteria n3 by Coryoth · · Score: 2, Interesting

      In other news, Microsoft is quickly subsidizing 3 small companies to write quick and meaningless stupid plug-ins using OOXML as input, just to pretend that their format is "Implemented by multiple vendors" and on "diverse (...) platforms" (ie.: Windows 98, Windows ME, Windows 2000, Windows XP *and* Windows Vista)... I think MS may actually have already squeezed through this particular hole. The reality is that other vendors are going to have to be able to read and write OOXML at a basic level for compatability. They'll never fully implement the standard that MS has written because they can't possibly implement the behaviour of the <SpaceLikeWord95> tag and all the others like it, but they'll have something. If you demand a "full" implementation from multiple vendors you're just digging yourself a different hole: nothing will qualify. Honestly, try reading and writing .odf files in a few different products (OpenOffice.org, KOffice, AbiWord, Writely, etc.) and you'll find the formatting gets noticeably messed around. Fully implementing ODF might be more feasible than fully implementing OOXML, but that doesn't mean it will actually happen. It seems MS has managed an effective pre-emptive strike against such legislation - they've created a "standard" that is just open enough to qualify, but sufficiently obscured that no one else can actually claim full compatability - the kicker being that full compatability is unlikely with any sufficiently complicated standard (as standard for office documents are bound to be) so you can't specify that as a criterion.
    2. Re:Criteria n3 by kennygraham · · Score: 2, Insightful
      That will meet 1-3, but

      (4) Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard.

      Microsoft still isn't an open industry organization, they're one company. I think #4 is the most important part.

    3. Re:Criteria n3 by man_of_mr_e · · Score: 2, Insightful

      I think you make a disingenuous argument. By definition, no office suite can fully implement any other office suites interoperability unless suite b is a complete superset of suite a's features, regardless of document type.

      By way of example, let's take something like KOffice. It seems unlikely that KOffice is a complete superset of OpenOffice, therefore even if both OOo and KOffice implement ODF, KOffice can never be completely interoperable with OOo (at least OOo -> Koffice). Further, if KOffice implements any features that OOo doesn't have, then the same is true in reverse.

      This argument is a red herring. No, no office suite can implement every feature of OOXML, because OOXML is, by definition, a direct map of Office functionality, but even if Office used ODF, those same features would have to be represented in ODF somehow, and those same suites would still not be able to implement them.

  12. Called my rep by inverselimit · · Score: 3, Informative

    Just called my CA Assembly rep to ask them to support the bill. Look yours up here.
    It may not be perfect, but is a move in the right direction.

    1. Re:Called my rep by inverselimit · · Score: 2, Informative

      There is also an online form for comment here.

  13. Re:You must not have got the memo by Anonymous Coward · · Score: 3, Funny

    I can write a text to text converter in about *0 lines* of perl.


    Which truly is the perfect amount of Perl to ever write.
  14. Does MS's format qualify? by danpsmith · · Score: 2, Insightful

    MS made a format that fits the very definition of what they said will be required in this bill. Is this bill just going to lead to government organizations upgrading to the new Office? Technically, all of these things apply even if the implementation of the "standard" will later be forked by MS with their extend and extinguish model. In short, does this really mean truly open formats will get a boost? Or that MS's new format will seem like the solution to a problem they have practically invented?

    --
    Judges and senates have been bought for gold; Esteem and love were never to be sold.
  15. Woohoo, ODF soon to be here by guruevi · · Score: 2, Interesting

    I think the only document format that would qualify is ODF (by OASIS). It's the only well known document format, based on XML and extensible, open and implemented by different vendors and office suites.

    --
    Custom electronics and digital signage for your business: www.evcircuits.com
  16. American legislation 101 by bornbitter · · Score: 2, Interesting

    "Why do lawmakers always have to over-specify things until the purpose of the law is lost?"

              This is over-simplified, but here goes... American laws are made in sub-committees of committees of the legislative body. The committees are packed with 'specialized' delegates, i.e. someone with a political stake or in the pocket of a special interest group, (like Microsoft, OSDL, or Green Peace). ...of course that is the federal process, and the states vary in their organization, but it is mostly the same. It all depends on how the states have drawn-up the rules for their specific legislature.
              Keeping that in mind, every law has to 'pass' through the upper committee after the sub-committee, before passing in the full-legislative body. The extra wordiness is to satisfy the other 'specialized' delegates' demands.
              To put it simply; They HAVE to make it ridiculously wordy or it will never become a law. There is just too much money involved. This means that all Microsoft, or anyone else, has to do is 'buy' an influential delegate in the sub-committee, or the chair of the committee in order to kill this bill before it is even voted on in the full legislature.

    --
    "Our Constitution was made only for a moral and religious people. It is wholly inadequate to govern any other" -John Ada
  17. You do need a blog (mildly OT) by Mateo_LeFou · · Score: 2, Insightful

    Your company needs a blog, but (and this is critical) it won't work if it's part of your corporate strategy of appearing-to-look-really-hip. It works if one of your employees creates it on her own initiative, and the strategists leaves her alone.

    --
    My turnips listen for the soft cry of your love
  18. Microsoft's open XML format: by hoggoth · · Score: 5, Insightful

    Just specifying XML doesn't mean much, really:

    <document>
    Description of MS Open Format
    <![CDATA[
    37642364 78346478 23465789 34657834 65783465 78934653 47895634 78563478 65347856
    56347825 63478256 34786578 34567893 45678934 65783456 78465783 46578346 57834567
    34895723 48957348 90578934 75890347 58934758 93475892 ... more binary crap...
    ]]>
    </document>

    --
    - For the complete works of Shakespeare: cat /dev/random (may take some time)
    1. Re:Microsoft's open XML format: by Coryoth · · Score: 2, Insightful
      I think the kicker for Microsoft will be the

      (1) Interoperable among diverse internal and external platforms and applications; and

      (3) Implemented by multiple vendors; clauses, though it will of course depend on exactly how those are interpreted. It is unlikely that anyone other than Microsoft will fully implement OOXML, so "multiple vendors" rules it out under a strict interpretation. If people want to get around it, however, you can go with a loose interpretation and point to all the vendors who will, out of necessity, provide a basic implementation of OOXML for the sake of compatability and importing documents. Likewise with the interoperable among platforms - with no MS Office for anything other than Windows and Mac, OOXML doesn't qualify as interoperable among diverse platforms. On the other hand you can go back to a loose interpretation and claim that OpenOffice.org "has an implementation of OOXML" (because they will have to have something for compatability's sake) and hence OOXML works on a diverse range of platforms (everything that OpenOffice.org runs on).
    2. Re:Microsoft's open XML format: by skubeedooo · · Score: 2, Informative

      Just specifying XML doesn't mean much, really

      Which is why the bill doesn't just specify XML. FTFA,

      ...the department in its evaluation of open, XML-based file formats shall consider all of the following features:
      1. Interoperable among diverse internal and external platforms and applications.
      2. Fully published and available royalty-free.
      3. Implemented by multiple vendors.
      4. Controlled by an open industry organization with a well-defined inclusive process for evolution of the standard.
  19. Texas House and Senate bills by ptbarnett · · Score: 2, Informative
    HB 1794
    SB 446

    So far, each bill has been filed and referred to the appropriate committee. However, the legislative session just started in January and things don't usually start happening until after the filing deadline on 2007-03-09.

  20. Here's my problem: by superbus1929 · · Score: 5, Insightful
    There's one company missing from the Open Document party here, and without it playing along, I don't see this lasting long.

    I am, of course, talking about Microsoft. They refuse to accept the Open standard.

    Until that happens, there will be problems. Yes, you could have .odt documents sent internally, but what if someone has to send a document to someone outside the company? Microsoft Office does not recognize .odt, and if you think that you can train someone to remember to send .doc files to outside users, and keep internal documents to .odt, then I have a bridge to sell you.

    --
    Let's stop dilly-dallying and just change "-1: Overrated" to "-1: Disagree" or "-1: Doesn't Subscribe to Groupthink".
    1. Re:Here's my problem: by mrchaotica · · Score: 2, Insightful

      Yes, you could have .odt documents sent internally, but what if someone has to send a document to someone outside the company?

      We're talking about government offices here, not companies. The answer to your question in this case is (or at least, should be) "we compel the outside person to get an ODT-compatible program by government fiat."

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

  21. For one thing... by brunes69 · · Score: 4, Insightful

    XML means it is readable by humans. You don't even NEED any kind of a program to get the text.

  22. Is Arnie still governor? by Bazman · · Score: 2, Funny

    What next? Hasta la vista, Vista?

  23. Re:More importantly, what *documents* qualify? by Eccles · · Score: 2, Informative

    I believe the IFC (Industry Foundation Classes) format, intended for cross-app CAD communication, is XML-based, and major CAD developers are supporting or working on support for it.

    Death to DWG/DXF.

    --
    Ooh, a sarcasm detector. Oh, that's a real useful invention.
  24. Two Possible Outcomes by BCMcI · · Score: 2, Interesting

    If California passes this resolution I can see two outcomes. 1 The state recognizes that ODF has to be used and scraps Office and loads OpenOffice or StarOffice. Big win for the citizens of California big loss for Microsoft. 2 The state recognizes that ODF has to be used and because older versions of Office won't work with ODF they purchase Vista and Office 2007 for all state agencies. Huge loss for the citizens of California huge win for Microsoft. Guess which is more likely?

  25. In Completely Unrelated News... by antirelic · · Score: 2, Interesting

    3 states who's yearly budget is under review are looking for ways to drive down existing IT costs by threatening to pass legislation that will get them huge discounts on Operating System and Office Software .........

    --
    20th century Marxism is not progress...
  26. Reality vs. Concepts- by ShrapnelFace · · Score: 2

    The absolute unfortunate truth in this case is that it will not matter what requirements the state of California sets forth, because in the end it serves not the people but the income of the government.

    You see, in California, we have this precedence of hiring under-motivate, under-educated, people into roles to fulfill status-quo on the premise of serving equality.

    This results in a rule that I call "Factor 4" where by you can take the initial cost of any related project, service or resource requisition, and thereby multiply it by factor of 4 in order to obtain the actual cost to the government.

    Sadly, Factor 4, is a direct result of the mediocrity that has taken up residence within all of our government agencies. I cannot imagine a bigger nightmare than this one that I just read about. Half of the institutions within the government are filled with people that have no idea what that means, and lack the education to understand it.

    With this being true, we open the door to committees, educational round-tables to determine educational requirements, requisitions for training, then post-comittees to evaluate if the needs were met, then another comittee to determine if the proper mixture of minority members were upheld, then further we'll add layers of evaluation to insure that all submissions qualified with the sole purpose of perpetuating a verification process that checks itself sometimes 3 times over- with absolutely no guarantee that said process is: accurate, predictable, or effective.

    All this does is allow state governments the ability to ask for additional funding, which they will earmark with non-related items, and then fund other programs with the initial request.

    Translation- the greater good for which said items are presented will be moderately served.

    Outcome: Ho hum and whatever. Can't we think of better things to do with my tax money than fuddling around with this area of business? I say they throw out this status quo requirement and start paying people what they are worth so that we can get some really bright minds into our state governments.

  27. No brainer by electrosoccertux · · Score: 4, Funny

    I don't get what all the hoo-haw is and why we need courts or lobbying for any of this. I find it very difficult to write anything when my term paper or [insert your document here] isn't open. Sounds like a bunch of people just need to learn how to double-click.

  28. Isn't quite the same situation by DrYak · · Score: 2, Interesting

    It seems unlikely that KOffice is a complete superset of OpenOffice, therefore even if both OOo and KOffice implement ODF, KOffice can never be completely interoperable with OOo (at least OOo -> Koffice). Further, if KOffice implements any features that OOo doesn't have, then the same is true in reverse.


    Although the starting situation is mortly similar with OOXML, it isn't quite exactly the same.
    Yes, there may be some different way to interpret the standards, and maybe two different implementation produce slighlty different results (Spreadsheet formules, for exemple aren't standarized yet). The difference is that this standard is controlled by a whole comitee (OASIS), in which several software maker are represented, include FLOSS, and it's in their interest to have the best interoperability as possible.
    Thus there's a high probability that, faced with such a situation, the detail of the implementation will be specified in next OpenDocument revision, so that the other software can do a better job in opening their interoperation. In fact, latest versions of AbiWord seem to be much more close to the original OpenOffice.org document. Or maybe they'll even create a new options that allows to tweak the parameters of the function in a documented way ( where 2.67 imitates best the behaviour of OOo and 3.14 is KOffice's default and 3.00 is what every new application is supposed to assume in case of missing param, according to documentation)
    And if some other product develops more functionality than ODF is capable of encoding, there's a high probability than an extension will be written and published with the next ODF revision.

    In fact, ODF isn't as much direct memory dump of OpenOffice.org as SXW was. ODF has been further processed by OASIS. Whereas OOXML (for now) is still a direct memory dump.
    s the sole and unique maintainer of the OOXML specification, it's not in their interest to maintain pixel-perfect conversion for competitor (they need competitor to be bale to interoperate with document formats - to shut the people complaining up - but they need to be the only product that can promise 100% pixel-perfect imports).
    Regarding with difference of working, the whole documentation is bloated with definition of options like "" for several thousand pages, which aren't explicitly documented at all (it's only written that they will be deprecated and that implementation aren't required to react to them). Only MS-Office will ever be able to open them by definition (Abiword may be able to reverse engeneer them, but it'll take more work than asking OASIS for a better documentation).
    You can bet that, if Microsoft adds some new functionality, they'll be the only one to support them as a paid-for extension... probably called 'Visual OOXML#'. You can be sure that they'll out-"Embrace, Extend, Extinguish" their own ECMA approved standard.
    Or at least to produce as badly written as possible documentation.

    What will make the difference between OOXML and ODF is microsoft willingness to cooperate (or lack of), and OASIS comitee collective need to collaborate between members.
    The only potential way to save OOXML is to put it into control of a groups, in which there's at least one FLOSS represented (say, WordView), which will have to grant full right to use and promise not to patent-sue independent implementations, and will force Microsoft to use OOXML instead of some "MS OOXML.net" extensions.
    Which they'll never agree to.
    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
  29. illegal now: PNG, JPEG, PDF, OGG... by r00t · · Score: 2, Informative

    All video formats are now illegal. (or is there now an XML video abomination?)
    All audio formats are now illegal.
    Probably all image formats are now illegal.

    Whee.... this'll be entertaining.