OpenDocument Voted In By ISO
cduffy writes "OpenDocument has been voted in as ISO/IEC 26300, with no dissenting votes and a small number of abstentions. There are still several formalities to take place before final issuance. Now the question: Will OpenXML get the same treatment, despite its technical weaknesses? There's also coverage on Groklaw."
Wasn't this the one that Microsoft was going to sabatoge? What happenned to that?
Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
Although ODF is a bit nicer standard from a human point of view, and builds on existing standards, I hope OpenXML isn't accepted simply because having two standards doing the exact same thing is nonsense. They're much more similar than they are different at many levels.
ECMA are welcome to OpenXML, I don't think ISO should accept it.
"Elmo knows where you live!" - The Simpsons
If you look at the history of standards, such as done at NIST, usually people try to choose the best thing, but it is hard to forsee what is the best. A good example are the standards associated with how to quantify vibrations in static structures, such as bridges. Looked good in 1948, turned out bad (Tacoma bridge).
I hate XML. It's not easy for humans to read as a wire protocol. It's not easier for computers to read than binaries. So I don't see the point, since that's what it's used for almost exclusively. Adding to that is the fact that attributes and nodes are two different things that are, in general treated the same (and the functionality can be achieved without attributes by making an "attribute" node and putting all the attributes under it).
We should be using something like JSON or YAML.
However, DOM and XSLT are both awesome ideas - especially for parsing documents.
Maybe this will lead them to adopt an XML-equivalent technology that is easier to read and parse.
Mod me down and I will become more powerful than you can possibly imagine!
What specific technical weaknesses are in OpenXML? Sincerely I'm interested (and need to make some decisions for my company on this), but I don't want religious crap, give me the real technical differences.
"There's no doubt that this broad vote of support will serve as a springboard for adoption and use of ODF around the world..."
mmm, we shall see..
Interoperability.
I agree there's much overhead having to translate between text and binary data, but the point is that XML isn't used for exclusively processing. It's for INFORMATION INTERCHANGE.
OpenDocument is an xml format, but it's an OPEN format, completely documented and with no loose ends. Furthermore, it's very similar to HTML, so the algorithms to process it are similar, too.
On the other hand, Microsoft's "Open"XML... eew.
The parties involved I believe will be in the knowledge that this standard ie free for all to implement. Kudos to ODF.
Experts from Boeing, bring them on.
Experts from the Society of Biblical Literature?? Wtf?? What the hell
have they got to do with a computer data formatting standard??
Or did they just require some people who had experience of a
large project at its Genesis.
OpenDocument is a good format, both from a user standpoint and a technical one. How you feel about OpenXML is another matter entirely, but it's not the one that just got voted in as an ISO standard.
Specifically based on this sentence: In software engineering, the rules of the system are predefined and well understood.
1) Many projects are hacked together without good requirements. There are even methodologies which minimize requirments, see RAD or Agile Development. In some cases the users often are unsure of the requirements and they are discovered by accident, if at all.
2) Most requirements have a strong temporal association. Meaning what is a requirements in May may not be a requirment in August due to changes in the economy, the technology being used the regulatory environments (state, national, local or international regulations, treaties or standards) or if a company is purchased or merges with another company. So if a software project lasts 18 months, I will bet the requiremnts will have shifted in that time span.
3) Software cannot be touched, tasted, felt, smelt or seen. Just the results, such as a report, a graphic or a GUI. This makes it *much* harder to understand, esp. by the majority of the population who do not have either the training or the aptitude for this type of abstract reasoning. And we rely in a large part on these folks to help with requirements!
Your statement in the last sentence sort of overlaps point 2 of this comment, BTW.
Just my $.02
putting the 'B' in LGBTQ+
If Microsoft implements OpenDocument (or anything like it) in Office 2007 it will make a lot of people very happy.
A blank Word document takes up eleven kilobytes, and a one page document takes up about forty. If this becomes the de facto standard for documents rather than the Word document format, then document file sizes will shrink significantly, and a lot of bandwidth and disk space on office networks will be saved as a result.
"The movement toward OpenDocument in the free world, warms the open cockles of my heart. (Emphasis mine)
I sure hope the chambers of your heart aren't open, you might want to visit the doctor if so.
But if the cockles you're referring to are the bivalve mollusc kind, they are always open -- cockles don't shut. However, they are hermaphroditic and they can jump. Which still presents a problem for your cardiac health.
Seriously, though, formal recognition of this standard removes one of the obstacles to widespread implementation of non-MS office software. The bigger hurdle, of course, is retraining & support expenses (for businesses) and factory (or pre-purchase, anyway) installation of the software (for home users).
This doesn't change the fact that MS formats are the de facto standards in use, but it may help unify the communities that use non-MS formats, leading to a larger install base.
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
I joined the OpenDocument Format Alliance (http://www.odfalliance.org/) recently - partly to keep connected with what they are doing and partly to support a good cause.
:-) I always think of systems as data with software added as needed. Seriously, get the data (structures, schema, persistence, etc.) right and the rest is easier. Who would want to build systems around Office formats?
I understand how entrenched Microsoft Office is in many organizations but hopefully common sense will prevail - want permanent free access to your data? Then use ODF.
Although I am a 'programming language junky' (I am happily coding away in Ruby and Common Lisp this morning on a new long term AI engagement
I just hope that OpenDocument gets its formula standards in order. I've read in a few places that there is very little documentation in the standard proper about how formulas (for spreadsheets) should be stored and used, which could in time cause some compatibility problems. That being said, I'm glad that it was approved by the ISO... maybe in a few years I'll not have to worry about converting from one office format to another ad absurdum.
It's not just nicer from a "human point of view"; it's simply more appropriate, technically, for the data it contains. Microsoft's half-hearted attempt at an XML format is like storing cars on a keyring with more keyrings attached: you can put all the parts on there, if you try hard enough, but it just makes no sense, for man OR machine.
But I would wager that microsoft WILL impliment ODF, then they will tag on some extra data that they call DRM and then all of a sudden, you can still be locked out of your documents for not paying microsoft extortion.... eerrrrr... subscription fees!
Cliff Claven
K.E.G. Party Chairman
Founding Leader of: Koncerned for Egalitarin Governance
So did ODF folks finally decide how to store formulas? Currently every single spreadsheet that supports ODF (not that there are many) stores those as they wish with no defined standard.
ODF is 5 to 100 slower that Microsoft Office. http://blogs.zdnet.com/Ou/?p=196
MathML is the worst way to store formulas ever. Anything that takes 5k of text to specify int(from 0 to infinity, exp(-lambda*x**2)dx) correctly is simply stupid. It means hand coding mathML just isn't a viable option for more than a couple very simple equations. We should agree on something similar to a C, Fortran, Matlab, or other programming language notation as the standard way to store equations in the file. The added benefit of potentially being able actually execute at least some of the functions is just icing on the cake.
On a related, but somewhat less relevant note is that I can't find any inexpensive programs that allow the generation of mathML easily. There are a few out there that generate mathML at all, but they seem to concentrate on the typesetting aspect of mathML* and on having an obtuse interface. Why isn't there a easy-to-find, cheap or free (beer or speech), mathML editor that is as easy to use as the equation interface in LyX? (and yes i've tried export-html options in LyX, and attempted to manually convert with commandline utilities but my latex2html functions all seem to be completely braindead.)
*iirc, there is a way to use mathML to store calculable functions, but I have yet to see this implemented, and it takes even MORE text to store the equations.
I think the lack of available editors, and tex converters, especially considering the potential academic utility of mathML is pretty good evidence that it is a poor standard: it hasn't generated enough interest for someone to scratch the itch and write a decent converter/generator/editor.
Can you be Even More Awesome?!
OpenXML is patent ridden and in a way that is problematic at best, compared to OpenDocument. ODF is also patent ridden, but unlike MS' offering, the patents have free licensing for conformant implementations and conformant means to the official stated spec, with the possibility of extensions becoming part thereof- unlike MS' offering which requires you to meet MS' shifting definition of what is/isn't compliant (i.e. it's not explicitly stated...) and you don't get to add improvements unless MS embraces and extends them themselves (i.e. if you've got extensions and MS doesn't approve of them, you're NOT at all compliant and can be sued for patent infringement...).
Technically, they're the same. This is the reason why people can't understand why MS is insistent on NOT supporting ODF as a format and trying to push OpenXML- unless they've got some ulterior motive. Now, they've little valid excuse for it.
I am not merely a "consumer" or a "taxpayer". I am a Citizen of the State of Texas
Here is the draft of ECMA standard, enjoy its 2000+ pages of detailed info.
My other Beowulf cluster is... er...
When comparisons between formats remark upon mixed content models compared to non-mixed asking "which would you rather transform" expecting the answer of "mixed" you know a lot of people throwing opinions around on this issue have never actually worked transforming XML.
If you're wanting a human readable document format you have XHTML. Use it and enjoy. If you're producing an interchange format for word processing applications I'll take unambiguous and explicit over ambiguous and implicit even if that is at the expense of human readability.
The MS model uses a manifest to resolve link references, the ODF uses absolute references... this is criticised by Groklaw on the basis of human readability. Not maintainablity, application use, refactoring or normalisation of data.
There are valid problems that can be cited for both formats (I wish for instance MS had stuck with XLink), but this is quickly resolving into another round of MS bad, anything else good. It's emotive and is in most cases prejudged before technical merits are weighed.
I guess I just resent being asked whether I'd prefer to transform a mixed content model by somebody I know has never done so.
Just simple forced harmonic oscilation. Exactly like a sream of air [or a bow] vibrating a violin string.
One of my professers [at Brooklyn Poly] analyized the failure for his doctoral dissertation. [that would have been about 1940 or so]
He found that the Bronx - Whitestone bridge had the same failure modes, and was responsible for designing some remedial alterations to the structure.
Caution: Do not stare into laser with remaining eye.
Is "sabatoge" something Microsoft-specific or did you mean sabotage?
Finally, someone that uses objective analysis rather than religious doctrine regarding this issue.
-- "I never gave these stories much credence." - HAL 9000
If that was true, software would be more reliable and robust than the Forth Bridge. Yet it is not, for any non-trivial control program (or operating system).
Why? Engineers understad the beams of steel at least as well as the software engineer understands the computer, if not moreso. The civil engineer understands the force of wind often better than the software engineer understands the force of data entry error. And most importantly, the civil engineer understands the use requirements better than the software engineer.
LedgerSMB: Open source Accounting/ERP
A simpler explanation: Before the meeting started, they cleared the room of all chairs.
;-)
The article says that ODF *just now* got approved as ISO standard and asks "Now the question: Will OpenXML get the same treatment, despite its technical weaknesses?". Your answer is basically that OpenXML should not be an ISO standard because its not already an ISO standard, while ODF is a standard. That says nothing about the inherent weakness of OpenXML
Now the examples given at Groklaw show the real technical weakness of OpenXML/MSXML. It is basically a byzantine nightmare of complexity to(humanly) read and therefore hard to parse. It also is encumbered by patent issues.
I once took a try at using the MS Office 2003 XML capabilities (such as they are) via VBA, but enventially gave up and took a different route, partially because the "XML" output by Office was a LSD trip to write a handler for. Instead I just inserted plain text via the Office Document model. So, yes, MS Office XML sucks, but for valid reasons other than not already being an ISO standard.
You think that's bad? I have a template that all of our meeting minutes gets typed into; this is a text-only (no embedded graphics) Word document. It has a bunch of tables and some formatting in it, but no tracked changes or anything. It's a very simple template.
... 275 KB.
.... that's a lot of wasted disk space thanks to an inexplicably bloated template document.
It weighs in at (drumroll, please)
This is for two pages, mostly consisting of blank space. I am absolutely at a loss as to why it's so huge; but it's a standard template that we have to use, long predating me, and I'm not going to rebuild it. Every week we generate two or three of these, which depending on how much actual content gets put into one, ends up being around 300 KB or so. Multiply by 50 weeks, multiply by six or seven years
I'm sure other people can come up with even more egregious examples, but that's mine. I've definitely seen Word docs that have ballooned to a MB or more from a few hundred KB just from turning track changes on and changing a line or two (what the heck does it store in there? Is a diff not good enough or something?). But probably nobody outside MS will be able to tell what the problem is, since the format is a black box.
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
Patents e.g. http://gauss.ffii.org/PatentView/EP1376387
This needs to get discussed first.
No, the biggest hurdle is making superior office software! From my limited experience, AbiWord is a decent document writing tool, and I have only a few gripes with OpenOffice.org Writer (namely rotating and flipping graphs), so beating MS Word is not a problem IMO. Excel, on the other hand, is the best spreadsheet software that I have ever used. It seems to contain more convenient functions, it has a nice sliding-bar feature to adjust values of a cell, and it handles large datasets ALOT faster than OpenOffice.org Spreadsheet! Once I had to make a graph of ~3000 datapoints (X & Y), and it took forever to highlight the entire row, and after I plotted the graph, the entire package slowed to a crawl. I then used Excel to plot the same points, and it highlighted and plotted the graph very quickly. Yes, OpenOffice has a shitty codebase that was developed by an obscure German company; however, this means we, the open source crowd, have even more work to do. Don't rest yet boys, 'cuz the party ain't over.
---- "XML is like violence. If it doesn't fix the problem, you aren't using enough."
On the contrary, KOffice will run on Windows very soon. Kdelibs are being ported to Qt4 as we speak, and almost runs under Windows already. The same is happening with KOffice, and I think we will see a proof of concept of KOffice running on Windows before summer this year.
If it takes you along time to highlight your data set, why don't you use named data ranges?
Re: work to be done -- of course. But the vast majority of most people I know who use Excel regularly only use it for keeping lists (or small databases) and sometimes minor functions.
Ideally, if you need to develop graphs and charts from your data, you should be using Openoffice Chart -- though it also needs some work (see the early March slashdot review/tutorial of Chart 2.0). This is a large point of interoperability, after all -- not only switch between different brands of the same application type, but also to use different applications with the same data.
At the office, I often use Crystal Reports to generate the reports I need from my Excel spreadsheets -- much quicker than generating reports in Excel using VLOOKUP or PRODUCT or the other tools out there. And heaven forbid I need to write yet another VB script to get the functionality I want, especially when I'll want that functionality across different files. The point is that it's important to select the proper tools for the job at hand -- and IMO, a spreadsheet is not a chart/graph generator, nor should it be.
I digress a bit, but there is a reason that MSOffice is so bloated -- they've tried to make each piece do more than it should. For those who never use charts in Excel, why should they have to install that functionality?
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
You are comparing the 'effeciency' of fruits to the effeciency of fruit squeezers? ODF is the file format. MSOffice is an application. The linked article compares the ODF reading/writing capabilities in one implementation (OpenOffice) to that of Microsoft using its binary memory/OLE dump. You're not looking all to bright by doing that.
The linked article, btw, starts to spreads unfounded idiot opinion at the paragraph that explains the file format specifications are "a non-issue" to the author and goes further with lying: "But this is argument is fundamentally flawed because the existing Microsoft Office binary formats are effectively the de facto standard and are effectively open to anyone."
If the meaning of the word "open" is twisted to "can be implemented with lots of guesswork" and "compatiblity can be broken by a whim of Microsoft", then, yes, the author is right. But as it reads, the author here just slightly invented a "fact" to bring his point around.
I REALLY like the idea of the OpenDocument format and can't wait to see it get implemented in various Word Processors.
... or is there another suggestion? I've used Pages a bit but my main concern is that I want a program that will easily read the files 10+ years down the road.
I am a university student. As I starting my 4th year next year the research is going to be more 'intense' and I'm going to have to write longer papers. Stuff that I would like to present in a job interview to show my academic credentials, etc. And also be able to read it in 5 years' time.
I don't want to use plain text, HTML, or Tex/Latex (I've tried and its way too complex) and no thanks for Scribus suggestions.
I've been writing everything in MS Word for the last 6+ years. I don't care about my earlier work too much. But I don't want to be totally locked in. I'd use OpenOffice but the Mac version is too slow.
What word processor on Mac or Windows will use the OpenDoc format? To get nicely formatted (font, layout, etc) documents am I pretty much stuck to Word
HTML was adequate back in the days of dialup when people actually used HR tags rather than styling borders on the bottoms of DIV tags and so forth, but really the HTML that people use nowadays in "real" web site design is actually very elegant. It has become less of just a markup language and more of a layout language, with the block layout model used by CSS. While this is true, the old HTML code will still work and you can learn all the new stuff having a cursory knowledge of old-school HTML.
In this way, HTML is just fine, because it's been easy to expand it and adapt it to changing requirements and capabilities of users and servers. I think that ODF, being in the spirit of that same standard (XML), as it's been commonly used with respect to documents, is definitely the winner here. As new capabilities or requirements come about, the standard as described now is flexible enough that it can be extended easily without breaking a lot of applications in the process. What has resulted is a number of different "standards" that are all accurate, but one is definitely identifiable as being more recently defined than the last as opposed to equally-accurate and equally-relevant standards being developed and specified in parallel.
Weren't we agreeing that we should start using lyx (or latex) so we can stop the formatting hell in WYSIWYG text processors?
Seriously, why not extend XHTML to support missing features?
l a
Why do we need another OpenDocument, OpenXML syntax?
Like Groklaw pointed why do we need another XML syntax? when people know XHTML/CSS already?
MS Office can already support HTML for Word and Excel, very nicely.
It would be much easier to make Microsoft accept a new version of XHTML,
then to adopt something awkward like OpenDocument.
The only thing missing are better CSS export for custom types, individual page settings, printer setup, page margin, small caps support, font kerning, MathML and embedded SVG support.
I guess this is too simple:
<html><body>
<page width="8.5in" height="11in"
margin="1in,1in,1in,1in">blablabla
<footer pos=".5in"><center>Page 1</center></page>
<page width="11in" height="8.5in"
margin="1in,1in,1in,1in"><header pos=".5in"><center>Confidential</center>
blablab
<footer pos=".5in"><center>Page 2</center></page>
</page>
</body>
</html>