XML Support In Office 2003 Isn't For Everyone
0x0d0a writes "Unfortunately, it seems that Microsoft's recent campaign to promote Office 2003 based on its XML support may be a bit misleading. Only the Enterprise and Professional releases will have this support -- not Standard. Microsoft will still be leveraging file format compatibility for at least another Office release."
Right, like we couldn't have seen this coming from a looong way off.
.DOC files completely - thanks to .PDF, it's been mostly successful.
I've given up on Office completely. I even try to reject
"Compatability" is still a bitches game.
; -- the corruption of government starts with its secrets. a truly free people keep no secrets. --
This is one reason I use openoffice (openoffice.org at home as it supports most word versions flawlessly, without promting me to "insert office cd 2" to install the feature.
We're only gonna die from our own arrogance, that's why we might as well take our time...
We have to remember that this is Microsoft we are talking about here. Any time they say "we are going to switch to an open format", there's always a catch to it.
Is Microsoft ever going to switch to an open format? No, why would they? They will only lose money. As for the people complaining about competition, why should a company with 90 - 95% of the desktop Office suite market care?
People with little or no knowledge about what Microsoft has done in the past might think that Microsoft is taking a great step forward. But remember, this isn't going to be complete XML, it is "Microsoft XML"
All this about Microsoft doing a great thing by switching to an "Open XML base" is all hype, nothing more.
Read my journal here.
Text in Office 2003 files stored in XML format might be viewable in other desktop programs, but all document formatting would be lost
Actually, this is entirely the point of XML. XML is not Yet Another Word Processor Format. It's intended to store "content" as opposed to "presentation", leaving "presentation" up to the app, much as was the original intent of HTML. Rather than an evil Microsoft plot, they are in fact conforming to the spec when they produce such a file.
The semi-trailer truck sized hole in the notion is, of course, that "presentation" isn't really entirely separable from "content", especially in a modern document. All that graphic-artist stuff like layout and font choice and formatting actually affects the value and usefulness of the document. That's why we put it in in the first place. And that's why everyone always whines when Word strips out all the "presentation" they've spent all that effort putting into the document and just leaving them with the raw XML "content" -- a bunch of text.
The flaw here is in the attempt to erect too high of a wall between presentation and content, not in Word.
By the time you get fine-grained enough control over the presentation to create documents that actually look the way you want, the "content" usually becomes illegible. Alternatively, you have only coarse control over the presentation, in which case the content most often looks like crap. This problem is easily seen in any number of web pages that feel obliged to include some little rant at the top about bloated HTML and how they concentrate on "pure content", which usually means a sea of unreadable and undiffentiated Times Roman.
The flip side is if you actually do break up the content enough to get control over the presentation. The last time sometimes tried to create a human-readable ASCII-text format for documents, they wound up with Postscript. A typical document actually looks something like:
[556 0 24 -19 541 703 ]
AddEuroGlyph
} if
F
F4S53 Ji
688 1320 M ( )S
F2S53 Ji
800 1518 M (802.3z Gigabit Eth)[42 42 42 21 42 36 21 60 23 41 37 42 23 23 21 51 23 0]xS
1431 1518 M (ernet local)[37 28 41 37 23 21 23 42 37 37 0]xS
1781 1518 M (-)S
1809 1518 M (side interface)[32 23 42 37 21 23 41 23 37 29 27 37 37 0]xS
2255 1518 M ( )S
F3S53 Ji
650 1620 M S
F4S53 Ji
688 1620 M ( )S
F2S53 Ji
800 1620 M (Supports f)[46 41 42 42 42 28 23 32 21 0]xS
1145 1620 M (ull Gigabit line rate)[41 23 23 21 60 24 41 37 42 23 23 21 23 24 41 37 21 28 37 23 0]xS
1795 1620 M ( )S
F3S53 Ji
650 1722 M S
F4S53 Ji
688 1722 M ( )S
F2S53 Ji
800 1722 M (Operates in either media convert)[60 42 37 28 37 23 37 32 21 23 41 21 37 23 24 41 37 28 22 63 37 42 23 37 21 37 42 42 42 37 28 0]xS
1888 1722 M (er)[37 0]xS
1953 1722 M ( or line)[21 42 28 21 23 23 41 0]xS
2189 1722 M (-)S
2216 1722 M (card )[37 37 28 42 0]xS
800 1817 M (mode)[63 42 42 0]xS
Here's a hint. The "content" is clearly delimited by parentheses (instead of, oh, "") Easily readable by humans, right? A cinch to import into other applications, right? Guess what: a real XML word processing document that kept the presentation information isn't going to be any more readable. You're not just going to whip out vi and fix it up any more than you can do that to your Postscript documents now.
XML is not magic application pixie dust that makes all features transparently interoperable when you sprinkle it on.
Microsoft will still be leveraging file format compatibility for at least another Office release.
Here we go again. "If Microsoft would just use an open format like XML then anyone could read the documents with any program and the world would be a better place."
XML is a format for creating data formats. It is not a data format. The fact that a particular format is XML compliant says nothing for its readability, it simply means that it can be parsed into a document tree by an XML parser. That doesn't mean that anybody can determine what the tree represents, only that it can be created. My favorite analogy: "If Microsoft would just start using 8-bit bytes, then anybody could read their file formats."
Microsoft has made it clear that the dollar value of secret file formats isn't lost on them. They will continue to use secret file formats, even if they're XML-based, until someone makes them stop. At the same time, they'll be able to harvest the stupidity of PHB's who will claim that Microsoft file formats are open because they're XML. It's surprising how many people on Slashdot foolishly believe the same.
Michael
Do you have ESP?
Even if XML was supported in all versions of Office, would that mean that Office would suddenly have an open file format? I don't think so. It's perfectly possible for me to write anything in XML in a way that you will not be able to read it.
Which is normal. XML is a way to describe data. If you have the DocType Definition (DTD) of an XML file, the only thing you know is whether that XML file is structured correctly, and how you would create another XML file that would look like the same thing for an XML parser. Nothing more.
In the long run, XML is nothing more than a standard you can use to base other standards on. XML can be put in the same row as ASCII, bytes, the file concept, or even SGML: it's a standard intended for the creation of other standards.
Nothing more, nothing less
Therefore, I think the argument that Microsoft Office will 'support XML' is just a marketing joke. It won't do anything out of the ordinary...
Treat XML like a database. It has rules of operation, but what you contain and how you describe the data are completely arbitrary.
...
Anyone who has used XML knows perfectly well that it's entirely possible to describe the complete dataset for content, layout, and presentation, within an XML document, in a form which can be easily parsed by humans and software alike. Completely. Using open standards, even.
Consequently, it's also possible to wrap it all up in 'parseable', yet 'unhandleable-unless-you're-on-the-inside' data blobs which mean nothing to no-one, yet still use 'XML' as a wrapper.
It's a liability of having such an open design, and Microsoft are exploiting this fact, in the context of *CLEAR* market-division tactics.
*They* created the artificial 'Professional/Enterprise/Standard' labels. Not the Users.
MS' use of XML here is perverted. It serves no purpose other than to give MS an opportunity to blag press release points about how their software uses 'the latest open standards' to people who have *NO CLUE* what they're talking about
; -- the corruption of government starts with its secrets. a truly free people keep no secrets. --