XML Support In Office 2003 Isn't For Everyone
0x0d0a writes "Unfortunately, it seems that Microsoft's recent campaign to promote Office 2003 based on its XML support may be a bit misleading. Only the Enterprise and Professional releases will have this support -- not Standard. Microsoft will still be leveraging file format compatibility for at least another Office release."
but XML support in OpenOffice is.
------------
This is guarenteed to not be the first post.
Right, like we couldn't have seen this coming from a looong way off.
.DOC files completely - thanks to .PDF, it's been mostly successful.
I've given up on Office completely. I even try to reject
"Compatability" is still a bitches game.
; -- the corruption of government starts with its secrets. a truly free people keep no secrets. --
The sun rose this morning; sunset predicted for later today!
This is one reason I use openoffice (openoffice.org at home as it supports most word versions flawlessly, without promting me to "insert office cd 2" to install the feature.
We're only gonna die from our own arrogance, that's why we might as well take our time...
Microsoft announced that only the Enterprise and Professional versions of Office 2003 would support the feature of saving files to industry-standard media such as IDE and SCSI hard disks. The Standard version of Office 2003 will allow the user to save document files only to Microsoft Zippo (TM), a new proprietary USB-based external removable media device. "We believe this is an innovative way to provide extra value to our customers." said Microsoft spokesman Hugh Jass.
"And this is my boy, Sherman. Speak, Sherman." "Hello." "Good boy."
We have to remember that this is Microsoft we are talking about here. Any time they say "we are going to switch to an open format", there's always a catch to it.
Is Microsoft ever going to switch to an open format? No, why would they? They will only lose money. As for the people complaining about competition, why should a company with 90 - 95% of the desktop Office suite market care?
People with little or no knowledge about what Microsoft has done in the past might think that Microsoft is taking a great step forward. But remember, this isn't going to be complete XML, it is "Microsoft XML"
All this about Microsoft doing a great thing by switching to an "Open XML base" is all hype, nothing more.
Read my journal here.
Swimming with a very big shark is always guaranteed to be interesting, not necessarily good or bad. This is just throwing a few drops of blood in the water to spice things up.
Text in Office 2003 files stored in XML format might be viewable in other desktop programs, but all document formatting would be lost
Actually, this is entirely the point of XML. XML is not Yet Another Word Processor Format. It's intended to store "content" as opposed to "presentation", leaving "presentation" up to the app, much as was the original intent of HTML. Rather than an evil Microsoft plot, they are in fact conforming to the spec when they produce such a file.
The semi-trailer truck sized hole in the notion is, of course, that "presentation" isn't really entirely separable from "content", especially in a modern document. All that graphic-artist stuff like layout and font choice and formatting actually affects the value and usefulness of the document. That's why we put it in in the first place. And that's why everyone always whines when Word strips out all the "presentation" they've spent all that effort putting into the document and just leaving them with the raw XML "content" -- a bunch of text.
The flaw here is in the attempt to erect too high of a wall between presentation and content, not in Word.
By the time you get fine-grained enough control over the presentation to create documents that actually look the way you want, the "content" usually becomes illegible. Alternatively, you have only coarse control over the presentation, in which case the content most often looks like crap. This problem is easily seen in any number of web pages that feel obliged to include some little rant at the top about bloated HTML and how they concentrate on "pure content", which usually means a sea of unreadable and undiffentiated Times Roman.
The flip side is if you actually do break up the content enough to get control over the presentation. The last time sometimes tried to create a human-readable ASCII-text format for documents, they wound up with Postscript. A typical document actually looks something like:
[556 0 24 -19 541 703 ]
AddEuroGlyph
} if
F
F4S53 Ji
688 1320 M ( )S
F2S53 Ji
800 1518 M (802.3z Gigabit Eth)[42 42 42 21 42 36 21 60 23 41 37 42 23 23 21 51 23 0]xS
1431 1518 M (ernet local)[37 28 41 37 23 21 23 42 37 37 0]xS
1781 1518 M (-)S
1809 1518 M (side interface)[32 23 42 37 21 23 41 23 37 29 27 37 37 0]xS
2255 1518 M ( )S
F3S53 Ji
650 1620 M S
F4S53 Ji
688 1620 M ( )S
F2S53 Ji
800 1620 M (Supports f)[46 41 42 42 42 28 23 32 21 0]xS
1145 1620 M (ull Gigabit line rate)[41 23 23 21 60 24 41 37 42 23 23 21 23 24 41 37 21 28 37 23 0]xS
1795 1620 M ( )S
F3S53 Ji
650 1722 M S
F4S53 Ji
688 1722 M ( )S
F2S53 Ji
800 1722 M (Operates in either media convert)[60 42 37 28 37 23 37 32 21 23 41 21 37 23 24 41 37 28 22 63 37 42 23 37 21 37 42 42 42 37 28 0]xS
1888 1722 M (er)[37 0]xS
1953 1722 M ( or line)[21 42 28 21 23 23 41 0]xS
2189 1722 M (-)S
2216 1722 M (card )[37 37 28 42 0]xS
800 1817 M (mode)[63 42 42 0]xS
Here's a hint. The "content" is clearly delimited by parentheses (instead of, oh, "") Easily readable by humans, right? A cinch to import into other applications, right? Guess what: a real XML word processing document that kept the presentation information isn't going to be any more readable. You're not just going to whip out vi and fix it up any more than you can do that to your Postscript documents now.
XML is not magic application pixie dust that makes all features transparently interoperable when you sprinkle it on.
Microsoft will still be leveraging file format compatibility for at least another Office release.
Here we go again. "If Microsoft would just use an open format like XML then anyone could read the documents with any program and the world would be a better place."
XML is a format for creating data formats. It is not a data format. The fact that a particular format is XML compliant says nothing for its readability, it simply means that it can be parsed into a document tree by an XML parser. That doesn't mean that anybody can determine what the tree represents, only that it can be created. My favorite analogy: "If Microsoft would just start using 8-bit bytes, then anybody could read their file formats."
Microsoft has made it clear that the dollar value of secret file formats isn't lost on them. They will continue to use secret file formats, even if they're XML-based, until someone makes them stop. At the same time, they'll be able to harvest the stupidity of PHB's who will claim that Microsoft file formats are open because they're XML. It's surprising how many people on Slashdot foolishly believe the same.
Michael
Do you have ESP?
I m not counting on MS Office Suite to provide me with a XML editor. Here are some alternatives:
DocSoft's W2XML Version 2
Authentic by Altova
i4i Tagless Editor
XMLWriter by Wattle Software
Opensource Extensible XML Modeling Application
If you know of any other GUI based XML modeling/editing apps, please feel free to add them to this list.
Consensus is good, but informed dictatorship is better
This is as good-a-time as any to migrate away from Microsoft Office. Open Office 1.1 is about to come out and it looks brilliant!! (the beta is currently available at http://www.openoffice.org/ ) It supports open standards (eg. XML), Microsoft Documents (word/excel/powerpoint) and exports to PDF (both text and graphics) at the press of a button! It also manages to count page numbers correctly when printing (* cough - word, cough *).
:)
On the other hand, my wife prefers Word and I prefer Open Office. The only time she likes open office is when she asks me to convert a document from one word format to another - because word won't do it at all, or word converts it very badly.
Also, I save several hundred dollars every few years
AC
lemme see ...
...
... see Bill, see Bill emulate.
... ;)
there's MS Java, then there's the other version
there's MS HTML, then there's the other version
there's MS VC++, then there's the other version
there's MS OS's then there's the other OS
same ol same ol
Nope, nuthin new here folks, move along
Words to men, as air to birds.
Even if XML was supported in all versions of Office, would that mean that Office would suddenly have an open file format? I don't think so. It's perfectly possible for me to write anything in XML in a way that you will not be able to read it.
Which is normal. XML is a way to describe data. If you have the DocType Definition (DTD) of an XML file, the only thing you know is whether that XML file is structured correctly, and how you would create another XML file that would look like the same thing for an XML parser. Nothing more.
In the long run, XML is nothing more than a standard you can use to base other standards on. XML can be put in the same row as ASCII, bytes, the file concept, or even SGML: it's a standard intended for the creation of other standards.
Nothing more, nothing less
Therefore, I think the argument that Microsoft Office will 'support XML' is just a marketing joke. It won't do anything out of the ordinary...
"I triple guarantee you, there are no American soldiers in Baghdad."
Mohsen Khalil, Iraqi Ambassador to the Arab League:
"Iraq will not be defeated. Iraq has now already achieved victory - apart from some technicalities."
Jean Paoli, Microsoft's XML architect:
"I'm out of the business of creating formats. Our focus on Office is on data exchange. There is no more difference between documents and data."
Treat XML like a database. It has rules of operation, but what you contain and how you describe the data are completely arbitrary.
...
Anyone who has used XML knows perfectly well that it's entirely possible to describe the complete dataset for content, layout, and presentation, within an XML document, in a form which can be easily parsed by humans and software alike. Completely. Using open standards, even.
Consequently, it's also possible to wrap it all up in 'parseable', yet 'unhandleable-unless-you're-on-the-inside' data blobs which mean nothing to no-one, yet still use 'XML' as a wrapper.
It's a liability of having such an open design, and Microsoft are exploiting this fact, in the context of *CLEAR* market-division tactics.
*They* created the artificial 'Professional/Enterprise/Standard' labels. Not the Users.
MS' use of XML here is perverted. It serves no purpose other than to give MS an opportunity to blag press release points about how their software uses 'the latest open standards' to people who have *NO CLUE* what they're talking about
; -- the corruption of government starts with its secrets. a truly free people keep no secrets. --
XML is a text-based system for data storage and retrieval, intended to be *self documenting*. In other words, the details on what fonts are used, what settings The User has set for individual parts of the documents, the parameters for those setting, etc. ARE ALL SUPPOSED TO BE STORED IN READABLE FORMAT WITHIN XML TAGS, CONFORMING TO A KNOWN, PUBLISHED DOCUMENT DESCRIBING THE CONTENT.
No it's not. XML is not supposed to store information such as 'font' and other presentational features. This is the job of the XSL stylesheets or CSS etc. XML is designed to store data in a structured way. So for instance you may have a <chapter> tag, but what font to use for chapter tags is only supposed to be specified in the XSLT. If I did an XML export of my word document, I would expect (hope for) an XML document, and either an XSLT stylesheet transforming the XML to HTML, or an XSL:FO stylesheet so that I can turn the XML into a pdf or postscript file. However, the stylesheets would be the 'icing on the cake'. The essential item is the XML formatted data, not the presentational information.