Microsoft to Open up Office Formats
Been on TV writes to tell us that Microsoft is expected to announce on Tuesday the opening of their Office file formats, according to Financial Times. From the article: "Microsoft will submit its Office file formats to Ecma International, the standards body, which will develop the documentation and make it available to the industry. The move is being supported by a number of organizations including Apple Computer, Barclays Capital, BP, Intel and Toshiba."
So.. Will they really open everything, or just wrap their proprietary implementation inside XML and therefore claim their format is "open"?
I hope they really open up the format. Otherwise it'd be as bad as RIAA promoting DRM "for freedom". Sigh.
It seems odd that it will take 18 months to develop documentation for the file formats. Sure, the formats must be complex, but it seems like maybe this documentation organization might not be a truly independent standards body.
Ecma's wiki and site seems to be pretty much confirm that they're composed of manufacturer members. I wouldn't consider them the equivalent of ANSI or UL. 18 months of work by a collusive industry is more throwing those governments a bone than actually getting the work done right.
I guess there should be some applause for getting the ball rolling. Uphill?
My first question, and likely that of many others, was: "Why are they doing this?"
Well, according to TFA, it's because of the European Commission has been urging companies to open up their document formats, and Microsoft feared the EC would stop using Microsoft's formats for the creation of public documents, and urge national governments to do the same.
So, thumbs up for the EU on this one!
Please correct me if I got my facts wrong.
[T]he format is free to use. In his next post, Brian points out that the license is perpetual; that is, it cannot be changed once granted.
We've seen other such licenses that have turned out to be very misleading. For example, if I use the license and write software that implements the specs, can I legally sell my software? The fact that I have a license doesn't mean that I can pass the license on to others in my products.
There are lots of potential legal gotchas in a lot of "free" licenses. Given MS's history, a bit of paranoia is in order here. We need people suggesting ways that they can turn around and sue us into bandruptcy if we use their specs. Then we need assurances that they won't sue us in any of those ways. And we need lawyers looking at the assurances and telling us whether they're legally meaningful, or whether we might get sued anyway.
After all, consider the RIAA. Who'd have ever thought that someone could be sued for buying a recording, sticking it into their own sound equipment and playing the music? But that's happening these days. We've even just had a story of recordings that intentionally damage our playing equipment when we attempt to play the music.
Paranoia here is quite appropriate.
Those who do study history are doomed to stand helplessly by while everyone else repeats it.
They are fully and openly specifying how to write all of the Office formats. While this is good, it does nothing for the other important half which is reading. They clearly don't want all applications to perfectly files generated by some software. This tatic seems to guarentees that at least one product will "clean" as well as special Office formats: Office itself.
I suppose people can take the information on how to write a valid "clean" Office format to make better format translators but we are still hosed for various random files that will be generated and only readible in sanctioned applications.
Adobe has been doing it for years with the PDF format, and most people are ok with it.
Keeping control over the evolution of the format but having the specs fully open so as to allow completely compatible products is a good thing, I myself would appreciate it if MS did that.
"The way we can tell it's C# instead of Haskell is because it's nine lines instead of two." -- wadler
Well, on the admittedly limited chance that you're not trolling, and that instead you'll actually consider a reasoned response with an open mind, I'll try one more time. First, yes, we "idiots" do know exactly what we're talking about, so I honestly hope you'll bother to read and possibly even learn something.
The fine example you gave is a trivially simple and quite static format, similar to an image. It is far from complex and dynamic enough to describe any useful arbitrary document. If you'd actually re-read the post to which you replied, you'd find a much more relevant example, that of HTML. HTML can't be described as a basic C-style structure like your example, but a formal grammar such as BNF (or a DTD for XHTML) could be used. However, you can very easily omit many optional flags/features when describing how to write a valid document in any such format. As noted, I might only tell you only about the head, title, and body tags, and perhaps the h1-h7 tags as well.
Is it possible for me to neglect to tell you about all the other formatting tags (like b and i and friends) and even "forget" to mention the whole hyperlink concept with the "a" tag? Sure it is. Can you write a valid document? Sure you can. Now, can you really read all possible documents, including those that use the tags I so conveniently neglected to describe? No.
Let's even use your own example, with a modification:
long version 0x0100
long number of strings 0x0002
long string length
string
long string length
string
long number of options 0x0001
int option_num
int option_length
byte [] option_data
EOF
Here you see that I've told you how and where to add multiple options. However, I've not told you what options are valid. I might only tell you about some of the options and not others. You can always still write a document given that format, but you can't read all documents unless you've been told all the possible valid options.
So, really I hope this hasn't been a waste of time, and that you can see that Microsoft can choose to give out any arbitrary amount of detail for how to write a proper and valid document, without giving sufficient tools with which to parse all possible valid documents.
They're opening their file formats because they still has a trump card. Or has anyone forgotten about this?
A quick patch or two to Microsoft Office (now one of their biggest or the biggest ca$h cow - 1/3 of their profits?) and MS Office suddenly reads|writes XML format only. They aren't about castrating themselves voluntarily. They still have shareholders to keep happy, but more importantly, they want to be the trendsetters, no matter what.
How does this impact Open Office? Open Office can then read the XML Format because it's declared in the patent. But what O^2 won't be able to do is write the MS Office XML Format [except to violate the patent]. This means: no interoperability and any business which wants to migrate away from a closed system (MS Office) to Open Office can do so only as a one-way trip, burning the bridge behind them. And the company can't communicate both directions, so that forces a move en masse.