OpenDocument Voted In By ISO

Posted by ryuzaki0 on Wednesday May 3, 2006 @03:04AM from the well-isn't-that-special dept.

cduffy writes "OpenDocument has been voted in as ISO/IEC 26300, with no dissenting votes and a small number of abstentions. There are still several formalities to take place before final issuance. Now the question: Will OpenXML get the same treatment, despite its technical weaknesses? There's also coverage on Groklaw."

Comparison by 2.7182 · 2006-05-03 03:10 · Score: 2, Interesting

If you look at the history of standards, such as those done at NIST, people usually try to choose the best thing, but it is hard to foresee what is best. A good example is the set of standards for quantifying vibrations in static structures, such as bridges. Looked good in 1948, turned out bad (Tacoma Narrows Bridge).
1. Re:Comparison by ThePhilips · 2006-05-03 03:56 · Score: 2, Interesting
  
  The result is that standardized computer algorithms and formats are rarely incorrect. However, they do become obsolete in relatively short periods of time due to increases in computing power and informational storage/transmission requirements.
  In engineering, building blocks are developed probably once per decade. How old is the concept of building houses of bricks? Of wood? Etc. You can't do much with physics, which describes the laws of the world we see around us.
  In software engineering, the earth gets reinvented completely more or less every decade. Every new generation of computers allows newer, improved algorithms and new application fields to be sucked in. And every time, people find that the algorithms can be improved even more. 200 years ago, simple automation of money counting was unimaginable. Try to consider what happened in those two centuries, and how the process evolved, if now an amount of money has *no* physical equivalent: it's just a number our bank stores along with the rest of the account information. Numbers can evolve, though they exist only in our imagination. You can hardly expect a brick or a lump of iron to evolve in any similar way.
  Standards, if they want to remain useful, have to evolve. IMHO a standard has to include a way to add improvements and a way to move the improvements under the standard's umbrella. E.g. HTML is tag based. There is a definition of a tag along with its properties. Improvement to HTML can be done in two ways: a new property of an existing tag or a completely new tag. And with the next revision, schemas can be updated to include the improvement. It worked that way with HTML's evolution from ver 1.0 to 4.0 to XHTML 1.0. Make HTML an international standard requiring strict compliance and a 6-month approval period for every new feature, and you would find that HTML would never have evolved as far as it did under the rule of W3C.
  ODF inherited from XML an easy way to add improvements. If the ISO workgroup isn't made up of complete [CENSORED] - and luckily for us it isn't - standardization would not stand in the way of improvements.
  What remains for ODF to be a healthy standard is competing implementations. KOffice is limited to KDE, which doesn't run under Windows. Working with OOo every day, I wish it had never been ported to Windows in the first place. I hope Corel delivers on its promise and adds to the competition. Having only OOo as an option under Windows at the moment hardly helps ODF adoption.
  
  --
  All hope abandon ye who enter here.
Re:Hopefully not? by jrumney · 2006-05-03 03:34 · Score: 2, Interesting

Adding to that is the fact that attributes and nodes are two different things that are, in general, treated the same (and the functionality can be achieved without attributes by making an "attribute" node and putting all the attributes under it).
That is perhaps the biggest mistake developers make when they design their XML schema (or DTD), and leads to ...
I hate XML. It's not easy for humans to read as a wire protocol.
If you keep the things that are supposed to be human readable as the text within nodes, and move the rest (formatting instructions etc) into attributes, your XML will be much more readable after some simple processing to remove the nodes. Using attributes for all those small name-value pairs that XML documents are full of also reduces the size and makes parsing more efficient.
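
To illustrate the split described above, here is a minimal sketch using Python's standard `xml.etree.ElementTree`. The element and attribute names (`para`, `em`, `style`, `align`, `weight`) are made up for the example and do not come from any real schema. Readable text sits in element content, formatting hints sit in attributes, and dropping the markup leaves plain prose.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: human-readable text is element content,
# formatting instructions are attributes.
SAMPLE = (
    '<para style="body" align="left">'
    'Hello <em weight="bold">world</em>, welcome.'
    '</para>'
)


def readable_text(xml_source: str) -> str:
    """Return only the text content, ignoring tags and attributes."""
    root = ET.fromstring(xml_source)
    # itertext() walks the tree in document order and yields text only,
    # so attribute values never appear in the result.
    return "".join(root.itertext())


print(readable_text(SAMPLE))  # prints: Hello world, welcome.
```

Had the formatting been stored as child nodes with their own text instead, that text would have leaked into the output and needed extra filtering.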
Re:Hopefully not? by AKAImBatman · 2006-05-03 03:38 · Score: 2, Interesting

I hate XML. We should be using something like JSON or YAML.

JSON and YAML are more focused formats intended for lightweight transmissions and compatibility with existing computer languages, and tend to complement XML rather than supplant it.

XML is designed as a "catch-all" format that is capable of storing any form of data. That makes it extremely powerful, yet sometimes quite unwieldy.

Each format has its tradeoffs, and as a result it is hard to say that one is "better" than the other. For example, XML's verbosity allows for parsing errors to be much more easily identified and repaired while simultaneously preventing accidental errors from going unnoticed. In YAML and JSON it is much easier to place unintended characters or data structures without the parser noticing. Neither one (to my knowledge) has the ability to check the structure of the transmission like XML DTDs and Schemas do.
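
A rough sketch of the error-detection point, using only Python's standard library. The sample documents are invented for illustration. In the XML case a closing tag slips one position and the parser rejects the document because the names no longer match; in the JSON case a closing brace slips one position and the result is still valid JSON, just with a different structure.

```python
import json
import xml.etree.ElementTree as ET


def xml_is_well_formed(source: str) -> bool:
    """Return True if the parser accepts the document, False otherwise."""
    try:
        ET.fromstring(source)
        return True
    except ET.ParseError:
        return False


# Intended structure: <c> is a sibling of <a>.
GOOD_XML = "<doc><a><b>1</b></a><c>2</c></doc>"
# The </a> tag was typed too late: named closing tags expose the mistake.
BAD_XML = "<doc><a><b>1</b><c>2</a></c></doc>"

# Intended structure: "c" is a sibling of "a".
INTENDED_JSON = '{"a": {"b": 1}, "c": 2}'
# The closing brace was typed too late: still valid JSON, different meaning.
SLIPPED_JSON = '{"a": {"b": 1, "c": 2}}'

print(xml_is_well_formed(GOOD_XML), xml_is_well_formed(BAD_XML))
print(json.loads(INTENDED_JSON))
print(json.loads(SLIPPED_JSON))
```

This only shows well-formedness checking. Validating against a DTD or schema is a separate step that `ElementTree` does not perform.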

However, DOM and XSLT are both awesome ideas - especially for parsing documents.

You've just given two reasons for the existence of XML. Both concepts are extensions of the XML concept, and are not necessarily applicable to other data-exchange formats. (At least not without massive changes.)

XML was designed with the DOM in mind so that any type of flat or hierarchical data could easily be loaded and stored programmatically. This cuts down on the number of programs that attempt to construct an interchange document manually. This rigid structure thus makes way for the programmatic transformation of such documents, à la XSLT.
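
As a small illustration of building a document through the DOM rather than by string concatenation, here is a sketch with Python's standard `xml.dom.minidom`. The `note` element, its `author` attribute, and the `build_note` helper are hypothetical names chosen for the example. The point is that the DOM serializer handles escaping, so characters such as `&` and `<` survive a round trip.

```python
from xml.dom.minidom import Document, parseString


def build_note(author: str, body: str) -> str:
    """Build a one-element document through DOM calls and serialize it."""
    doc = Document()
    note = doc.createElement("note")           # hypothetical element name
    note.setAttribute("author", author)        # serializer escapes the value
    note.appendChild(doc.createTextNode(body)) # serializer escapes the text
    doc.appendChild(note)
    return note.toxml()


xml_text = build_note("R&D", "x < y")
print(xml_text)

# Round trip: parsing the output gives back the original strings.
parsed = parseString(xml_text).documentElement
print(parsed.getAttribute("author"), "|", parsed.firstChild.data)
```

A hand-written `"<note author=\"" + author + "\">"` would have produced a broken document for the same inputs.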

--
Javascript + Nintendo DSi = DSiCade
Good news by spectrumCoder · 2006-05-03 03:45 · Score: 3, Interesting

If Microsoft implements OpenDocument (or anything like it) in Office 2007 it will make a lot of people very happy.

A blank Word document takes up eleven kilobytes, and a one page document takes up about forty. If this becomes the de facto standard for documents rather than the Word document format, then document file sizes will shrink significantly, and a lot of bandwidth and disk space on office networks will be saved as a result.
1. Re:Good news by tomstdenis · 2006-05-03 03:49 · Score: 1, Interesting
  
  I'm writing a book in Word [yeah I know, shudder] and my 57 page 2nd chapter is about 340KB on disk. It sports 10 figures, lots of styles (from normal paragraphs, to emphasis to source code etc...)
  
  Maybe you put high res graphics and are using tracked changes?
  
  Tom
  
  --
  Someday, I'll have a real sig.
Re:Hopefully not... by DrXym · 2006-05-03 03:49 · Score: 3, Interesting

Nicer from a human point of view means fewer bugs down the line. I just spent a week trying to get a .wsdl to parse through Axis AND .NET's wsdl.exe. Any format that is less opaque, less verbose and more understandable gets my vote.
Formulas? by Makzu · 2006-05-03 03:49 · Score: 2, Interesting

I just hope that OpenDocument gets its formula standards in order. I've read in a few places that there is very little documentation in the standard proper about how formulas (for spreadsheets) should be stored and used, which could in time cause some compatibility problems. That being said, I'm glad that it was approved by the ISO... maybe in a few years I'll not have to worry about converting from one office format to another ad absurdum.
Same thing? by Anonymous Coward · 2006-05-03 04:01 · Score: 2, Interesting

So did the ODF folks finally decide how to store formulas? Currently every single spreadsheet that supports ODF (not that there are many) stores them however it wishes, with no defined standard.
Re:What technical weaknesses in OpenXML? by alanQuatermain · 2006-05-03 06:59 · Score: 2, Interesting

I believe that some information that will help explain this is to be found here. It's best to read that article for yourself, but I'll provide a little abstraction of some of the details myself, although this isn't really my area of expertise:

The main point revolves around the fact that MS' OpenXML uses a non-mixed content model, while OpenDocument uses a mixed content model. This means that OpenDocument can have tags interspersed with regular text, or tags within text delimited by other tags, etc. However, OpenXML cannot do this: all text must reside within a tag, and only text or tags can reside within other tags. The article gives a textual example of this. To the computer, the MS one is probably closer to the internal representation of the data: object-oriented programmers will probably recognise the structure as an object encoding its member variables. However, it pretty much removes the benefit of using XML in the first place: source readability. If you look at HTML, it's fairly easy to change a couple words around, and make a few italic, or bold. But in the OpenXML format, that becomes a more laborious task.
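
For readers without the article to hand, the difference can be sketched in a few lines of Python. Both snippets below are simplified, made-up markup (no namespaces, invented element names apart from the `rPr` idea mentioned in this thread), not the real ODF or OpenXML schemas. The helper reports whether any element holds both text and child elements, which is what "mixed content" means.

```python
import xml.etree.ElementTree as ET

# Mixed content: text and tags are interleaved inside <p>.
MIXED = '<p>This is <span style="bold">bold</span> text.</p>'

# Non-mixed content: every piece of text is wrapped in its own leaf tag.
NON_MIXED = (
    "<p>"
    "<r><t>This is </t></r>"
    "<r><rPr><b/></rPr><t>bold</t></r>"
    "<r><t> text.</t></r>"
    "</p>"
)


def has_mixed_content(root: ET.Element) -> bool:
    """True if some element contains both child elements and real text."""
    for elem in root.iter():
        children = list(elem)
        if not children:
            continue  # leaf elements may hold text freely
        own_text = (elem.text or "").strip()
        tail_text = any((child.tail or "").strip() for child in children)
        if own_text or tail_text:
            return True
    return False


for name, source in (("mixed", MIXED), ("non-mixed", NON_MIXED)):
    root = ET.fromstring(source)
    print(name, has_mixed_content(root), repr("".join(root.itertext())))
```

Both documents carry the same sentence; only the first can be edited comfortably by hand.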

The article goes on to make arguments which back up the basic premise given here. You can also see from the examples how the tags differ in type. They give examples in OpenXML, ODF, and XHTML. Just looking at the tags in the OpenXML source doesn't give you any real idea what they're doing-- I mean, what does <w:rPr> mean? However, the tags used in ODF are longer and easier to read and understand for a human.

Of course, you could say that human-readability isn't an issue, and that's a fairly valid argument. However, if human-readability isn't an issue, why use XML? Why not do what Office was doing before, and write memory out to disk, or basically serialize the document object tree in binary? It'll be smaller and easier for the computer. The whole point of using XML is to make the data easily understandable to humans, to the point where we can make numerous (albeit potentially quite small) changes without needing a program to interpret the data for us. Or where it's possible for us to write an app that understands the data, which pretty much requires that we personally understand it. As it stands, just about any XML data format is quite self-explanatory in itself, which is why we have XML.

Maybe that doesn't answer everyone's questions, but I hope it proves at least a decent starting point.

-Q
Re:mathML sucks. by zippthorne · 2006-05-03 12:02 · Score: 2, Interesting

Yeah I've used that. That's how I know that even simple formulas take up a huge amount of markup with mathML. Completely unnecessary markup. They should've just used the "mathematica" format as the format since it's much more concise. Make the tag something like,
<equation img="sparea.png" eq="4*pi*r^2" lang="mathematica"> area o' sphere </equation>
the mathml equivalent?
<math xmlns='http://www.w3.org/1998/Math/MathML'> <mrow> <mn>4</mn> <mo>⁢</mo> <mi>pi</mi> <mo>⁢</mo> <msup> <mi>r</mi> <mn>2</mn> </msup> </mrow> </math>
All that text to display FOUR glyphs.
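
The overhead is easy to measure with a short Python sketch. It assumes the operator between the terms in the MathML above is the invisible-times character (U+2062), written here as a numeric character reference. The `visible_tokens` helper is a name made up for this example.

```python
import xml.etree.ElementTree as ET

# Same MathML as above; the operator is assumed to be U+2062.
MATHML = (
    "<math xmlns='http://www.w3.org/1998/Math/MathML'> <mrow> "
    "<mn>4</mn> <mo>&#x2062;</mo> <mi>pi</mi> <mo>&#x2062;</mo> "
    "<msup> <mi>r</mi> <mn>2</mn> </msup> </mrow> </math>"
)
COMPACT = "4*pi*r^2"
INVISIBLE_TIMES = "\u2062"


def visible_tokens(source: str) -> list:
    """Collect the text of leaf elements, skipping invisible operators."""
    root = ET.fromstring(source)
    leaves = [elem.text or "" for elem in root.iter() if len(elem) == 0]
    return [text for text in leaves if text and text != INVISIBLE_TIMES]


tokens = visible_tokens(MATHML)
print(tokens)  # prints: ['4', 'pi', 'r', '2']
print(len(MATHML), "characters of MathML vs", len(COMPACT), "for", COMPACT)
```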

--
Can you be Even More Awesome?!