Slashdot Mirror


Dark Corners of the OpenXML Standard

Standard Disclaimer writes "Most here on Slashdot know that Microsoft released its OpenXML specification to counter ODF and to help preserve its market position, but most people probably aren't aware of all the interesting legacy code the OpenXML specification has brought to light. This article by Rob Weir details many of the crazy legacy features in the dark corners of OpenXML. As it concludes after analyzing specification requirements like suppressTopSpacingWP, 'so not only must an interoperable OOXML implementation first acquire and reverse-engineer a 14-year old version of Microsoft Word, it must also do the same thing with a 16-year old version of WordPerfect.'"

250 comments

  1. It's not a true standard... by __aaclcg7560 · · Score: 3, Funny

    Until it supports WordStar documents.

    1. Re:It's not a true standard... by Anonymous Coward · · Score: 0

      Damn straight... I loved Wordstar and I still fire up version 3.1 now and then ;-) The editor Joe does a pretty good job though. Ahhh yes! Those were the days ;-) I never like Word and I still don't. I'm getting warmed up to Open Office Writer after several years of use though. Even so... after using VI for the last 10+ years for text... nothing even comes close. VI and Wordstar... when it was all about efficiency :-D

  2. Re:MIcrosoft sucks. by Anonymous Coward · · Score: 0, Insightful

    The crazy amount of backwards compatibility is what allowed Microsoft to rise to the position it holds today...

  3. Length by jcnnghm · · Score: 3, Funny

    I don't know why anyone would complain, the spec is only 6,000 pages long.

    --
    You don't make the poor richer by making the rich poorer. - Winston Churchill
    1. Re:Length by MillionthMonkey · · Score: 1

      Well, if you were to start implementing that spec at the rate of one page per hour, you'd be done in just 6000 hours.

    2. Re:Length by Heir+Of+The+Mess · · Score: 1

      The company boss would say why not just give 8 pages each to 750 developers and by the end of the day we should have a fully working product.

      While this is rediculous, I'm sure the spec could be broken up into specs for a few different modules. Afterall if Microsoft wrote the spec, and has implemented the spec, then how difficult could it be?

      I once spent 18 months writing a 3000 page spec, and it only took a team of 5 another year to implement it. Of course since then whenever someone asks me if I would like to write a spec everything suddenly goes dark and people find me shaking and sweating in a fetal position in a corner somewhere. Writing a big spec is not fun.

      --
      Australian running a company that does C# / C++ / Java / SQL / Python / Mathematica
    3. Re:Length by chthon · · Score: 2, Funny

      Yes, but modular programming is anti-thetical to Microsoft's way of doing things.

    4. Re:Length by redcane · · Score: 2, Insightful

      I think they may have implemented it, and then made a spec to take into account their horrible implementation.

    5. Re:Length by Anonymous Coward · · Score: 0

      Yes. I think they just did an XML dump of the existing binary DOC-objects. Then they just had to document each XML element

    6. Re:Length by 99BottlesOfBeerInMyF · · Score: 2, Insightful

      Afterall if Microsoft wrote the spec, and has implemented the spec, then how difficult could it be?

      Did you read the article. Some of the spec is things like "do what MS Word 5.1.4 did with line spaces." How exactly is anyone other than MS supposed to implement that? By reverse engineering a whole slew of old products that are not even available on the market anymore?

      I once spent 18 months writing a 3000 page spec, and it only took a team of 5 another year to implement it.

      That's fine but this spec isn't even a spec in the proper sense. It references specific closed implementations by MS and other vendors. Since those other implementations are not themselves specs, neither is this one.

    7. Re:Length by computational+super · · Score: 1
      The company boss would say why not just give 8 pages each to 750 developers

      You're giving him too much credit. He would save give 9 pages each to 600 developers.

      --
      Proud neuron in the Slashdot hivemind since 2002.
    8. Re:Length by Heir+Of+The+Mess · · Score: 1
      Did you read the article. Some of the spec is things like "do what MS Word 5.1.4 did with line spaces." How exactly is anyone other than MS supposed to implement that? By reverse engineering a whole slew of old products that are not even available on the market anymore?

      I read the article, and I think the author lacked imagination afterall why reinvent the wheel. I assume Microsoft has already done all the hard multi-product reverse engineering work in Word 2007, which you can get a time limited evaluation copy of for free. Additionally Word 2007 has plugins to output in PDF and XPS. So all you need to do is write a script that outputs an OpenXML wordprocessor document with all the things in it you want to test, load it into Word 2007, and then save it out as a PDF file. Then you will have an input file, and an output that has all the typesetting information in it that you require. Given a few iterations and tweaking of that I would think giving one smart person 6 months to come up with a best attempt should yield a reasonably functioning module. Of course how you then port this to a non-typesetting wordprocesssor file format is another issue, which is probably why MS went down the path that they did.

      --
      Australian running a company that does C# / C++ / Java / SQL / Python / Mathematica
  4. The author is exactly right. by JoshJ · · Score: 3, Insightful

    This is why the Microsoft Office XML (let's not kid ourself, this is far from "open") format should not become an ISO standard.

    1. Re:The author is exactly right. by Zaiff+Urgulbunger · · Score: 2, Interesting

      Totally agree. I wonder how it managed to get approved by ECMA? IIRC only IBM didn't agree to its approval; all other parties (whoever they are) agreed. I don't understand what they felt was good about this "standard" especially given that ODF had already been approved.

    2. Re:The author is exactly right. by Helldesk+Hound · · Score: 2, Funny

      I always thought the ECMA was something to be purchased. ;o)

    3. Re:The author is exactly right. by _|()|\| · · Score: 4, Insightful

      Prior to reading this article, I was ambivalent about Office XML. The push to standardize Office's "DNA sequence" seemed disingenuous, but at least the format was described in detail. Now I see that the table-sagging 6,000 pages is just the tip of the iceberg: this "standard" effectively includes, by reference, the source code for every prior version of Office, to which only Microsoft has access.

    4. Re:The author is exactly right. by Anonymous Coward · · Score: 0
      This is why the Microsoft Office XML (let's not kid ourself, this is far from "open") format should not become an ISO standard.
      Or depending on your opinion of the ISO, it should. ;-) Remember OSI?
    5. Re:The author is exactly right. by mwvdlee · · Score: 2, Informative

      Nobody takes ECMA seriously anyway.

      You probably know that JavaScript has been standardized as "EcmaScript" by ECMA; everybody just ignores that standard.

      --
      Slashdot social media options: AIM, ICQ, Yahoo, Jabber and Mobile Text. Why no MySpace?
    6. Re:The author is exactly right. by maxume · · Score: 1

      I would suggest examining javascript 1.5 as it is implemented in mozilla.

      --
      Nerd rage is the funniest rage.
  5. The power of legacy systems... by Anonymous Coward · · Score: 5, Insightful

    The power of legacy systems is at once both Microsoft's greatest strength and greatest weakness. Nobody in OSS is going to have the patience to rebuild the same level of backwards compatibility needed to displace them but the code must be an absolute tarpit of accumulated cruft and security holes that's incredibly difficult for them to keep going.

    1. Re:The power of legacy systems... by blincoln · · Score: 1

      Nobody in OSS is going to have the patience to rebuild the same level of backwards compatibility

      I'm sure there are plenty of people that would do it if they had access to the dev docs that Microsoft works from.

      The hitch here is that *not* having them means tons and tons of reverse engineering, and that's only after tracking down every release of every version of every MS Office ever. Reverse engineering can be fun, but I have a hard time imagining that figuring out character spacing in the Mandarin version of Office 5 would fall into that category.

      --
      "...always new atoms but always doing the same dance, remembering what the dance was yesterday." -Richard Feynman
    2. Re:The power of legacy systems... by clodney · · Score: 2, Interesting

      I suspect that Microsoft has near zero dev docs to work from, at least when it comes to emulating Word 95 or WP 5.1. In the early days MS was known as a very freewheeling culture, and internal docs were very rare.

      Even more to the point, the guidance sections essentially say "the existing implementations are buggy and we can't actually describe the precise behavior for arbitrary input". In the case of previous versions of Word they probably just have copies of the old modules that they feed the data to. For WordPerfect emulation I bet someone just did a black box reverse engineering that *mostly* works. While it may be documented, the rules are going to seem completely arbitrary.

      I am often faced with this kind of situation at work. The requirements calls for significant enhancement to a feature, and it seems clear that the best way to implement it is to write the code from scratch. But the feature still has to maintain backwards compatibility. So you start studying the code, and find that it has subtle bugs that you now have to recreate in a new implementation.

    3. Re:The power of legacy systems... by Bert64 · · Score: 1

      Their internal documentation seems to be rather poor, witness the laughable ability of ms publisher to open and save word documents... It garbles them badly, and makes a much worse job of it than openoffice does.
      It seems that they just keep the original office codebase going and carefully add to it trying not to break anything.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
  6. 14 year-old Word and 16-year old WP by AiY · · Score: 2, Funny

    Sweet! I actually have copies of those somewhere. The reverse engineering process will begin immediately. Now where did I put my 286....

    --
    "You need a license to buy a gun, but they'll sell anyone a stamp." - Red Green
    1. Re:14 year-old Word and 16-year old WP by Hymer · · Score: 1

      You are welcome to borrow one of my AT-compatible PC's... I even got a Toshiba portable 286 (the one with the cool red plasma display).

  7. Basically by DrYak · · Score: 5, Insightful

    ODF is the former SXW format that was taken and transformed into a standard by a committee comprising several Office software makers. It's suppose to describe the normal features that anyone should expect from any Word processing application, be it OpenOffice.org, KWord, AbiWord, Corel Word Perfect, etc. all this in a perfectly neutral way. It was designed with a function in mind (storing word processing documents in an open and interoperable way). Its benefits are comparable to the standardisation of HTML.

    OpenXML is Microsoft trying to translate its proprietary DOC file inside a XML container (because it's a big buzzword) and propose it as a standart to ECMA (because everyone is speaking about ODF being an ISO standard). It describes not only what is to be expected from a word processor, but also all MS-Word specific microsoftism. It was designed with a specific software in mind (and partly derives from the internal functionning of MS-Word). It's only a small improvement over the previous MS XML format (which had a lot of informations hidden in a binary blob).

    The good thing for Microsoft, is that they can pretend this limitation is "Not-a-bug-but-a-feature", and brag around that there are a lot of stuffs that MS-Word couldn't store inside an ODF and only OpenXML can carry.

    Microsoft's plan :
    1. Embrace
    2. Extend <- They are here
    3. Extinguish

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
    1. Re:Basically by megabyte405 · · Score: 3, Insightful

      ODF is a nice idea in theory, but really, it's a similar situation (OpenOffice.Org internal dataformat jammed into a standard, so designed with OO.o in mind by necessity) just with more OSS-positive karma associated. There's nothing wrong with saving in a file format that matches your internal representation, in fact, it's a darn good idea (see .ABW for AbiWord, .DOC for Word, .WPD for WordPerfect I would also wager is the same idea). However, interoperability seems to work best when taken from the ground up - when working with another application's data structure of any complexity, you simply can't do a lossless roundtrip without losing before you've started.

      There is, however, a format that can do this sort of thing. Yes, I'm talking about the dark horse of the "file format wars," the non-glamorous workaholic format that even WordPad and TextEdit.app can read with ease: RTF. It may not get press attention, but it's actually a fairly well-documented standard, has been working as an interchange format for years, and yet is designed with enough expandability that it's still useful with the kinds of documents produced today. It's a true de-facto standard.

      This may not be an exciting idea, but for those who really want interoperability, RTF is the way to go with today's software. Not to say that import/export of ODF, Word, WPD, etc. isn't important (AbiWord, a project that I contribute to but do not purport to speak for, has very good to great support for those formats and many others), just that an unnecessary dichotomy is drawn between OpenXML and ODF with regard to their design goals - both are repurposed native formats for a single application.

      --
      I recognize people by their sigs. Is that a bad thing?
    2. Re:Basically by blincoln · · Score: 3, Insightful

      There's nothing wrong with saving in a file format that matches your internal representation, in fact, it's a darn good idea (see .ABW for AbiWord, .DOC for Word, .WPD for WordPerfect I would also wager is the same idea).

      I would argue that when it's taken to the extreme of Office prior to 2007, it *is* a bad thing. AFAIK, the old Word format is more or less a (very) partial RAM dump (which is why you can often find all sorts of interesting stuff in Word files that the authors think they've deleted). That makes for faster dev times, but because the load and save functions don't really "understand" the content of the file, IMO the developers made things a lot harder for themselves in the big picture. I imagine reproducing issues in testing is a particular nightmare.

      --
      "...always new atoms but always doing the same dance, remembering what the dance was yesterday." -Richard Feynman
    3. Re:Basically by Nicopa · · Score: 4, Interesting

      No. ODF has several real, factual, benefits. It might have been originated in a single product but... it reuses existing standard technologies (SVG, CSS...). It has properly designed XML tags that act as "markup", in OpenDocument xml tags act as container for chunks of data. ODF tries to separate content from style.

      And about your RTF suggestion... can I draw diagrams with RTF? Can I have a ToC? Can I do complex styling? Can I have a "galery" of styles? Can I include images? No. RTF is not a solution.

    4. Re:Basically by megabyte405 · · Score: 2, Interesting

      Actually, I think for most of the things you suggest, you can do them - I know AbiWord supports them at least. (images, complex styles, TOC) RTF's really not the old dog it seems to be - keep in mind that for copy/paste of any sort of rich text to work in any sensible manner on Windows, one _must_ support RTF well.

      --
      I recognize people by their sigs. Is that a bad thing?
    5. Re:Basically by megabyte405 · · Score: 1

      Oh, I'm sure testing is a nightmare, and it can't be good for performance to be going from a binary memory dump to a binary memory dump probably encoded somehow and shoehorned into XML so you can use those three letters. (Apologies for the lack of solid knowledge - for legal reasons I'd rather not know too much about the intricacies of Microsoft OpenXML.) I was reffering fmore to the fact that .doc is reasonable for use by Word, though it certainly is a pain to load and no good for interchange even between versions.

      --
      I recognize people by their sigs. Is that a bad thing?
    6. Re:Basically by iluvcapra · · Score: 3, Informative

      After having written some tools on OS X that do stuff with RTF:

      RTF is well documented and you can make an RTF document on all manner of platforms (I've done it in Ruby and Cocoa), but many platforms have extended RTF in their own way in order to support special features. OS X has added a few special methods to RTF files to support Mac OS X typography, and I've noticed that different versions of Word handle document attributes (like headers and page numbers) in different ways.

      RTF is great if you want to make up something quick that is ONLY formatted text, but readers have all manner of different ways of interpreting the exact appearance of tables, page layouts and margins, and there doesn't seem to be any manageable common mechanism for including images or other documents, something Word and OO.org excel(pun) at. Even HTML seems to be better at this.

      I use RTF output in a few little in-house tools I have, so people can get the text+attributes they create and open them in a text editor of their choice for touching-up and delivery. When my tools have to create something that is supposed to be finished, they make PDFs.

      RTF is great for interoperability, but I never expect an RTF file to contain a "finished product," unless the recipient expects quality on par with a Selectric. It is merely a relatively-open serialization format for strings with attributes.

      --
      Don't blame me, I voted for Baltar.
    7. Re:Basically by I'm+Don+Giovanni · · Score: 1

      RTF can do all of the things that you mention. But not all apps support RTF as well as others. I think Word has the most complete impl of RTF.

      --
      -- "I never gave these stories much credence." - HAL 9000
    8. Re:Basically by Anonymous Coward · · Score: 0

      Just a minute... You're suggesting RTF as an alternative to ODF? Are you mad?

      The specification for RTF basically goes something like this:

      "An RTF file is defined as whatever garbage the latest version of Word spits out when you save as an RTF file"

      RTF is basically just another encoding of a Word document, just like OpenXML, using it's own custom weird tagging system rather than XML. Each version of Word will produce something completely different, incompatible with other software (including Wordpad) and other versions of Word, and with a random subset of Word's features not working for no apparent reason.

      There is a common subset of RTF that works most of the time. It is documented, but there's no real standard, and it's very basic. If you want nothing more than changing basic font styles, then it's fine. If you want, for example, tables, proper styles, tables of contents, pages, and so on, it's useless. As a word processor file format, it's somewhat less useful than HTML.

      ODF, on the other hand, has a full, complete specification, with rigorously defined behaviour. It was derived from StarOffice / OpenOffice 1.x's SX? formats, but they are significantly different. While it does make some assumptions about how the document should be proessed (even HTML or plain text documents do that), they're nowhere near the kind of assumptions that Microsoft's formats make.

    9. Re:Basically by telso · · Score: 1
      Microsoft's plan :
      1. Embrace
      2. Extend <- They are here
      3. Extinguish
      It's like there's something missing in this list...a next step. Microsoft...eliminating competition...securing market domination...step 4...what comes next again?
    10. Re:Basically by AuMatar · · Score: 2, Insightful

      No, saving the internal representation as a file is an utterly fucktarded idea. You see, an internal representation is made to make the most sense for your implementation of various features. It changes frequently, sometimes with every patch. It has performance hacks, redundancy, etc. A file format is supposed to be a representation of the data in an easy to parse format, so it can be loaded by applications.

      So what happens when you use the internal representation as the file format? Well, you have a file format with lots of hacks, wierdnesses, lack of clarity. Redundant information, which leads to unreadable files if they don't match. Nearly impossible to parse since now you don't have 1 file format, you have N file formats 1 for every version of the data structure. Oh, and future versions of your software *will* be unable to open old versions without major bugs. Thats a consequence of using 8 or 9 different formats.

      From a technical perspective, using the internal structure as the file format is the stupidest thing you can do. There is *no* good technical reason to ever do so.

      --
      I still have more fans than freaks. WTF is wrong with you people?
    11. Re:Basically by urbanradar · · Score: 1

      Not to forget:

      4. Profit!!!

    12. Re:Basically by fitten · · Score: 1

      One of the arguments I continually hear against OpenDoc is that it assumes (and pretty much defines itself as being) only suitable for the creation of new documents (and, of course, using those documents as you go forward). So, once it is in place, new documents will adhere to it. Unfortunately, there are a large number of older documents that people still use that need to be a) actually used, and b) translated to the new standard. The arguments I hear say that OpenXML better supports backwards compatibility or something.

      I haven't looked at either standard very much (or at all) but it seems like a fairly significant argument, if true.

    13. Re:Basically by megabyte405 · · Score: 2, Interesting

      Well, AbiWord serializes its internal data structure into XML, so it's not an exact dump - it lets us do things like have backward-compatible additions such as LaTeX and MathML equations and include an image preview of the equation as a fallback, for instance. There are things you can do to make your internal format more lucid, and binary->text is one of those things: I can fix almost anything that can go wrong with an AbiWord doc (usually only happens in dev releases, but sometimes strange things happen) with Notepad.

      (And I would say that as long as it's well-documented and in a useful manner, if you're just using it for internal/non-archival data storage and need a lot of speed, using the internal structure would make sense.)

      --
      I recognize people by their sigs. Is that a bad thing?
    14. Re:Basically by dominator · · Score: 2, Informative
      Can I draw diagrams with RTF? Can I have a ToC? Can I do complex styling? Can I have a "galery" of styles? Can I include images? No. RTF is not a solution.


      Actually, you can. RTF can express most (if not all) of what the Microsoft Word format can. Let me answer your objections using excerpts from the RTF 1.8 specification:

      The \tc control word introduces a table of contents entry, which can be used to build the actual table of contents.

      The \stylesheet control word introduces the style sheet group, which contains definitions and descriptions of the various styles used in the document.

      An RTF file can include pictures created with other applications. These pictures can be in hexadecimal (the default) or binary format. Pictures are destinations and begin with the \pict control word.

      \dgmt creates diagrams. \pict of subtype \*metafile supports vector drawings, in case that's what you meant by "diagrams".

      Check out http://www.microsoft.com/downloads/thankyou.aspx?f amilyId=ac57de32-17f0-4b46-9e4e-467ef9bc5540&displ ayLang=en sometime. Don't be afraid by the EXE download - it's just an auto-extracting ZIP file containing a single DOC. The guy you replied to was an AbiWord developer, as am I.
    15. Re:Basically by Anonymous Coward · · Score: 0

      There's nothing wrong with saving in a file format that matches your internal representation, in fact, it's a darn good idea It's an awful idea. Imagine trying to explain to users why they cannot use standards formats like GIF,JPG,PNG etc.
      Data is *far* more important than the application! The data is owned by the user, it needs to be open and free, not locked into anything.
      When all data can be kept in open formats, then true open computing can happen. Compete on features and benefits, not lock-in.
    16. Re:Basically by Anonymous Coward · · Score: 0

      Two huge problems with RTF:
      1) It's purely a de facto standard, and MS is holding all the strings. Maybe it could be open-standardized but it would still be controlled in every practical sense by the mass of existing implementations (i.e. MS). Why go to all the work to refine and standardize yet another format?
      2) ODF XML separates content from formatting; RTF does not. There are lots of things you can do with a document beside printing it, and ODF is designed for that. RTF is not. There are several good, open source libraries for reading, writing and transforming any XML document, including ODF. How many such libraries are there for RTF?

      RTF is NOT the way to go.

    17. Re:Basically by Bert64 · · Score: 1

      With either format, you will need to convert your old files so there is no direct compatibility, the old file will be converted into a representation of itself stored in the new format.
      The difference is, MS's format includes specific tags telling it to emulate the buggy behaviour of older programs, for instance:
            mwSmallCaps (Emulate Word 5.x for the Macintosh Small Caps Formatting)
      This basically says, that word 5.x for mac had buggy handling of small caps that caused them to be even smaller than they should normally appear, and so when opening such a file recent versions of word will emulate that behaviour, and store this setting in the document so it's retained when you open it again.
      ODF on the other hand, does not have piles of kludgy tags like this, because the format is not the place for it. Instead, whatever program is responsible for opening the file, when it sees a word 5.x for mac file can simply reduce the font size of the small caps to match what word 5.x would have done and save the file as ODF like that.
      That way, you don't bloat the format with stupid things like that, you just use the existing and standard functions for defining font sizes.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
  8. Re:MIcrosoft sucks. by theLOUDroom · · Score: 2, Insightful

    The crazy amount of backwards compatibility is what allowed Microsoft to rise to the position it holds today...

    Or maybe it was their illegal business tactics?

    It would be pretty easy for me to run a successful business too if I could break federal law with impunity.

    --
    Life is too short to proofread.
  9. Re:MIcrosoft sucks. by Brandybuck · · Score: 2, Insightful

    Things that are illegal for a monopoly are perfectly legit for a non-monopoly. It's a crazy law, but that's how it works. Microsoft broke no federal laws to *gain* their monopoly.

    --
    Don't blame me, I didn't vote for either of them!
  10. Solution? by Nicopa · · Score: 0

    Is it the only solution to all this to attack key Microsoft executives while they are sleeping?

    1. Re:Solution? by Bob54321 · · Score: 1

      Well there are predicting World War 8. I have no idea what footnotes will look like then so I will be unable to implement footnoteLayoutLikeWW8...

      --
      :(){ :|:& };:
    2. Re:Solution? by Anonymous Coward · · Score: 0

      Vampires don't sleep.

  11. The site seems to be slow... by junglee_iitk · · Score: 5, Informative
    You want to hire a new programmer and you have the perfect candidate in mind, your old college roommate, Guillaume Portes. Unfortunately you can't just go out and offer him the job. That would get you in trouble with your corporate HR policies which require that you first create a job description, advertise the position, interview and rate candidates and choose the most qualified person. So much paperwork! But you really want Guillaume and only Guillaume.

    So what can you do?

    The solution is simple. Create a job description that is written specifically to your friend's background and skills. The more specific and longer you make the job description, the fewer candidates will be eligible. Ideally you would write a job description that no one else in the world except Guillaume could possibly match. Don't describe the job requirements. Describe the person you want. That's the trick.

    So you end up with something like this:

    * 5 years experience with Java, J2EE and web development, PHP, XSLT
    * Fluency in French and Corsican
    * Experience with the Llama farming industry
    * Mole on left shoulder
    * Sister named Bridgette

    Although this technique may be familiar, in practice it is usually not taken this extreme. Corporate policies, employment law and common sense usually prevent one from making entirely irrational hiring decisions or discriminating against other applicants for things unrelated to the legitimate requirements of the job.

    But evidently in the realm of standards there are no practical limits to the application of the above technique. It is quite possible to write a standard that allows only a single implementation. By focusing entirely on the capabilities of a single application and documenting it in infuriatingly useless detail, you can easily create a "Standard of One".

    Of course, this begs the question of what is essential and what is not. This really needs to be determined by domain analysis, requirements gathering and consensus building. Let's just say that anyone who says that a single existing implementation is all one needs to look at is missing the point. The art of specification is to generalize and simplify. Generalizing allows you to do more with less, meeting more needs with few constraints.

    Let's take a simplified example. You are writing a specification for a file format for a very simple drawing program, ShapeMaster 2007. It can draw circles and squares, and they can have solid or dashed lines. That's all it does. Let's consider two different ways of specifying a file format for ShapeMaster.

    In the first case, we'll simply dump out what ShapeMaster does in the most literal way possible. Since it allows only two possible shapes and only two possible line styles, and we're not considering any other use, the file format will look like this:

    <document>
    <shape iscircle="true" isdotted="false"/>
    <shape iscircle="false" isdotted="true"/>
    </document>

    Although this format is very specific and very accurate, it lacks generality, extensibility and flexibility. Although it may be useful for ShapeMaster 2007, it will hardly be useful for anyone else, unless they merely want to create data for ShapeMaster 2007. It is not a portable, cross-application, open format. It is a narrowly-defined, single application format. It may be in XML. It may be reviewed by a standards committee. But it is by its nature, closed and inflexible.

    How could this have been done in a way which works for ShapeMaster 2007 but also is more flexible, extensible and considerate of the needs of different applications? One possibility is to generalize and simplify:

    <document>
    <shape type="circle" lineStyle="solid"/>
    <shape type="square" lineStyle="dotted"/>
    </document>

    1. Re:The site seems to be slow... by junglee_iitk · · Score: 1

      It's opening now :(

    2. Re:The site seems to be slow... by IvanTheQuiteNasty · · Score: 1

      Guillaume Portes! Très bien! Je veux vous donner mes points de moderation, mais ca ne marche pas. Je n'ai pas des points. Et j'ai déjà repondu.

    3. Re:The site seems to be slow... by grimJester · · Score: 1

      Well, at least it's compatible with application x if it says in the standard an application has to render something like application x without specifying how...

      I don't think the job description is a very apt comparison. Perhaps if it said "Must drink beer like my old college roommate, Guillaume Portes" without saying how he drank beer. To be a bit more exact, all your examples are of the same Fallacy of Definition in that they use an example as the sole defining trait. One of the examples on Wikipedia is "Chutzpah is killing your parents and then throwing yourself on the mercy of the court because you are an orphan.". Should ISO accept that as the definition of Chutzpah?

  12. zOMG by dexomn · · Score: 1

    Does this mean the big E won't work on my windows 97 anymore?

  13. Backwards compatibility by Bob54321 · · Score: 3, Interesting

    I thought most people considered themselves lucky if there documents could open in successive versions of Office. Why would anyone want to implement support for really old versions if Microsoft does not do it themselves?

    --
    :(){ :|:& };:
    1. Re:Backwards compatibility by kfg · · Score: 2, Informative

      Why would anyone want to implement support for really old versions if Microsoft does not do it themselves?

      Nobody would. That's the point of it.

      KFG

    2. Re:Backwards compatibility by greenrom · · Score: 1

      What if you have an old document that you continue to maintain and update? I can think of a lot of documents like this. They've since been converted to a more recent Word format, but I bet the Word file format contains tags like these to preserve compatibility. The average user won't know or care about these nuances. They'll just see that many of the documents that used to look fine in Office 2007 aren't formatted right in the new version. Think how much time it would take you to go through a 500 page standards document and correct formatting problems or fix broken footnotes. Now think about the number of documents still in use today that may have originally been created with one of these older but popular word processors. Given the number of man-hours that would be required to hand edit all these documents, building backwards compatibility into the application makes a lot of sense.

    3. Re:Backwards compatibility by julesh · · Score: 1

      I thought most people considered themselves lucky if there documents could open in successive versions of Office.

      Not that I've come across. I've used every version of Winword from 1.0 up to 2002, and each one has been able to open documents from the previous versions with a substantially higher degree of success than I get today opening them with OO.o.

    4. Re:Backwards compatibility by 99BottlesOfBeerInMyF · · Score: 1

      I've used every version of Winword from 1.0 up to 2002, and each one has been able to open documents from the previous versions with a substantially higher degree of success than I get today opening them with OO.o.

      I used to manage a lot of .doc and .rtf files created by different versions of Word, including on both the Mac and PC. We had a whole slew of .doc files that the current version of Office for Mac and PC could not open, but which OpenOffice managed. It has been a standard tool for that purpose in my arsenal for years now, and I'm not the only one. I twas even mentioned in a lecture at a conference I attended about archive file formats. Problems with Word files also seem to get worse as the number of images and the file sizes increase. We actually had a procedure in place whereby we would save a file, not close that file, transfer the file to another machine, open it on the other machine, and then quit Word and archive the file. This is because Word often silently corrupted files on save and it was the only way to be sure we'd be able to open that file again. Note, this was Office 2000 era. I've luckily not had to use Word for large files for a long time and the last one I opened, I used TextEdit.app.

  14. Posters sucks, then blows, then sucks some more. by Anonymous Coward · · Score: 1, Insightful

    "It would be pretty easy for me to run a successful business too if I could break federal law with impunity."

    What's insightful about pretending that federal laws are the only ones a company has to deal with? And when you go international...

  15. Size by Kadin2048 · · Score: 4, Funny

    I don't know why anyone would complain, the spec is only 6,000 pages long.

    And the best part is, these are the pages it uses... (I mean, why else do those specs cost so much?)

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
    1. Re:Size by SuluSulu · · Score: 4, Funny

      6,000 pages is nothing! Try reading The Wheel of Time series.

    2. Re:Size by Squigley · · Score: 2, Funny

      Ow! I got a paper cut, and now I need a prosthetic arm!

    3. Re:Size by DoctorPepper · · Score: 1

      Heh. I tried. I finally quit after reading the seventh book.

      At least my wife was smarter than me, she gave it up after number three!

      --

      No matter where you go... there you are.
    4. Re:Size by just_another_sean · · Score: 1

      What? It's finished finally?

      --
      Creationist Textbook Stickers Declared Unconstitutional by CowboyNeal
    5. Re:Size by Briareos · · Score: 1
      6,000 pages is nothing! Try reading The Wheel of Time series.

      Bah. 9685 pages is nothing! Try reading Perry Rhodan - that's somewhere around 120 to 130 thousand pages and counting...

      (And yes, I've read most of it - took me only 12 years or so...)

      np: Markus Guentner - Hotel Shanghai (1981)
      --

      "I'm not anti-anything, I'm anti-everything, it fits better." - Sole

  16. Don't forget the page counts... by Anonymous Coward · · Score: 5, Interesting


    ODF spec page count: 722.

    OpenXML spec page count: 6000 !!

  17. Re:MIcrosoft sucks. by Aadain2001 · · Score: 4, Insightful

    But they broke plenty of laws to keep their monopoly :) And while their actions during their rise to the top may not have been illegal, they could easily be called 'strong-armed'.

    --
    Space for rent, inquire within
  18. Re:MIcrosoft sucks. by NineNine · · Score: 0, Flamebait

    If you get to the point where you build up a company that can even consider garnering the term "monopoly", then get back to us. Until then, you have no idea what you're talking about, especially when quoting arbitrary and esoteric "federal laws". Call me nuts, but if you ever got to that point, you'd might even get a crazy idea in your head that those "federal laws" that you are so damned proud of, are about as fair and just as our drug laws. At that point, maybe, just maybe, you may come to thinking that you you earned what you got, and the government has no right to tell you how to run your business that you started in your teens, and proceeded to build to make it one of the most successful companies in the history of capitalism.

    Until you get to that point, I suggest that you those "federal laws" out your ass, Mr. Ashcroft.

  19. I can't see this being too big of a problem by All_One_Mind · · Score: 1
    I understand that these tags will be needed when converting legacy documents, but how many people are going meet all the following conditions to even be effected by this:

    A) Desire to convert an old Word 5/95/WordPerfect 6 document to OOXML.
    B) Have the original document actually use one of the undocumented legacy features
    C) After converting the file actually experience a problem in formatting

    First of all, the number of people who fall into category A is going to be small to begin with, same with category B. Although Microsoft is not providing the documentation like they should be, category C would be up to Corel, Sun, and other producers of future OOXML compatible word processors to implement. They're going to implement OOXML, so they're going to be encountering these issues as they program anyway. I trust they can figure out a way to display "full-width East Asian characters" and other such issues that are not fully documented in the standard.

    1. Re:I can't see this being too big of a problem by aeschenkarnos · · Score: 1

      If all else fails, just scan a printout as an image, OCR it to make it searchable, check the OCR output to make sur3 no 0CR=typo5 get introduced, and save it as the image with the OCR output as comment text.

    2. Re:I can't see this being too big of a problem by All_One_Mind · · Score: 1

      Ha ha. Seriously. Why even export to this format in the first place? If you really want to port a document from Word, why not just open it in OpenOffice or WordPerfect and resave it? In fact, both OpenOffice and WordPerfect support old *.doc formats, so they should already be equipped to deal with old legacy Word format issues anyway, right? And like you said, push comes to shove, scan/OCR and you're all done!

    3. Re:I can't see this being too big of a problem by gaijin99 · · Score: 2, Insightful

      You're missing the point. By defining their "standard" in this manner they can now say "Application X doesn't implement OOXML", naturally by "implement OOXML" they mean "fully implement OOXML" so that if even the most obscure and bizarre tag is not supported that's that. At that point they can either demand that application X not claim on its packaging that it supports their "standard", they might have one of those cute little "OOXML Compatiable" seals and refuse to let anyone who doesn't fully support the "standard" use it, or simply use it as marketing tool.

      If you want to think in a more paranoid manner, one could also speculate that MS might cause its future versions of Word to use one or more of the supposedly depreciated tags regularly (or, nastier thought, at random) so that any competing product that attempts to open an OOXML document produced (or even saved) in MS Word will not properly render the document. Joe User will assume that since it renders properly in MS Word, but not in application X, and it is an open standard after all, it just proves that MS Word is better.

      MS does what is best for MS, not what is best for its customers and certainly not what is best for documents in general.

      --
      "Mission Accomplished" -- George W. Bush May 1, 2003
    4. Re:I can't see this being too big of a problem by rohan972 · · Score: 2, Insightful

      I understand that these tags will be needed when converting legacy documents, but how many people are going meet all the following conditions to even be effected by this:

      If it gets adopted as a standard (ISO or similar, not defacto standard) then everyone. The point is not whether people need the features, the point is that MS is trying to get this accepted as a standard. It still can only be implemented by MS, and therefore should not be accepted as a standard. If a government body had as part of a software procurement requirement "Complies with ISO XXXX (MS OpenXML)" Then by default only MS office could fill that requirement. As opposed to ODF which can be supported by any company that chooses to to so.

      MS can have any features it likes in it's file format, that's not the issue. How well it works or if people need it are also not the issues. The issue is that for it to be considered a standard, it should be able to be implemented by anyone and that the format as currently documented is practically impossible to implement for anyone but MS. Therefore it should not be considered a standard by bodies such as ISO.

    5. Re:I can't see this being too big of a problem by Ucklak · · Score: 1

      I say batch every old document to PDF then you don't have to worry about Microsoft anymore.

      What an assinine piece of crap OOXML is.

      --
      if you steal from one source, that is plagiarism, if you steal from many, well, that's just research.
    6. Re:I can't see this being too big of a problem by All_One_Mind · · Score: 1
      I see everyone's point with this, I really do, but as long as companies support these tags the spec provided they would be in full compliance, would they not be? Here's a quote from the article/spec:

      2.15.3.6 autoSpaceLikeWord95 (Emulate Word 95 Full-Width Character Spacing)

      This element specifies that applications shall emulate the behavior of a previously existing word processing application (Microsoft Word 95) when determining the spacing between full-width East Asian characters in a document's content.

      [Guidance: To faithfully replicate this behavior, applications must imitate the behavior of that application, which involves many possible behaviors and cannot be faithfully placed into narrative for this Office Open XML Standard. If applications wish to match this behavior, they must utilize and duplicate the output of those applications. It is recommended that applications not intentionally replicate this behavior as it was deprecated due to issues with its output, and is maintained only for compatibility with existing documents from that application. end guidance]

    7. Re:I can't see this being too big of a problem by All_One_Mind · · Score: 1

      Damnit, I wasn't even close to done typing that and I hit submit when I meant to hit preview. Ha ha. Now I'm really going to get flamed. Anyway, what I was going to say was that they simply need to "imitate" the feature, which OpenOffice/WordPerfect already do with their legacy Word doc support, making the point of this article moot. Now, don't get me wrong, this standard sucks, it's a bunch of floof, but so is the basis for this article.

    8. Re:I can't see this being too big of a problem by Askmum · · Score: 5, Insightful

      You seem to be missing the point.

      You do not need these features to begin with in a new format that is inherently incompatible with an old format. You don't want to say "now I'm going to do WP style linespacing and my linespacing is 1".
      If you want to convert a WP document to an XML document, the conversion program should know that the linespacing in WP is 0.9 times the linspacing in XML document (or what it really may be)and will then use linespacing=0.9 in the XML document. This is not a task of the new wordprocessor or its specification.

      By adding this so-called "backward compatibility" to your specification, you make the spec overly difficult and in fact you make the conversion program in the new application when this is absolutely not necessary.
      And on top of that, you require that the programmer who uses this spec should have knowledge of all these old versions and is able to program them without error. And as the application will grow because of these unnecessary features, the number of bugs will also rise. So this is not a blueprint for a good application, this is a blueprint for a very buggy implementation of a wordprocessor.

    9. Re:I can't see this being too big of a problem by jlarocco · · Score: 1
      Anyway, what I was going to say was that they simply need to "imitate" the feature, which OpenOffice/WordPerfect already do with their legacy Word doc support, making the point of this article moot. Now, don't get me wrong, this standard sucks, it's a bunch of floof, but so is the basis for this article.

      No, they can't "imitate" the feature. They have to do EXACTLY what Office does, otherwise they're not compliant. Did you even read that guidance block you quoted? "If applications wish to match this behavior, they must utilize and duplicate the output of those applications." It entirely defeats the point of even having a standard. They should've just saved 5999 pages, and simply wrote "Mimic Microsoft Office".

      Sure, they say it should be avoided, but that's not going to stop them from continually pointing out that nobody else is compliant. And do I need to point out how silly it is that a brand new draft standard already has deprecated features?

      [Guidance: To faithfully replicate this behavior, applications must imitate the behavior of that application, which involves many possible behaviors and cannot be faithfully placed into narrative for this Office Open XML Standard. If applications wish to match this behavior, they must utilize and duplicate the output of those applications. It is recommended that applications not intentionally replicate this behavior as it was deprecated due to issues with its output, and is maintained only for compatibility with existing documents from that application. end guidance]

      That guidance block perfectly sums up why this standard is a waste of fucking time.

    10. Re:I can't see this being too big of a problem by tdelaney · · Score: 1

      The number of *people* who need to convert old documents may well be small, but the number of *documents* that need to be converted/imported is *huge*. And you can guarantee that if one of their documents hits an undocumented legacy feature, all their documents will.

    11. Re:I can't see this being too big of a problem by All_One_Mind · · Score: 1
      Ha ha. This seems to be the thread where everyone tells me I'm missing the point.

      Quite honestly I understand this. I read the whole article, the format is a POS, everyone's life will be ruined. Yada yada yada. However, I believe you're missing a point here, in that some documents require consistent formatting, whether it's military, legal, scholarly, specs, drafts, you name it. By "updating" these pages by not supporting the old format you will breaking the layout of some documents that need a consistent layout. Is this going to affect a lot of documents? Of course not, but it is still a consideration, and I am playing devil's advocate after all.

      Second, for the third time in this thread I'd like to mention that both WordPerfect and OpenOffice already have legacy support for old implementations of Word documents. Oh, and it likes the DOC format specs are published in full up to Word 97.

      Now look, I understand this format is Microsoft's best interests and not the end user, but the entire time all I've been trying to say is that the fundamental points of this article are over hyped, they're going to effect a small number of users, and that WordPerfect and OpenOffice already have the programming to deal with legacy Word format problems. That's it. That's my point. My point is not that OpenXML is good, it sucks.

    12. Re:I can't see this being too big of a problem by All_One_Mind · · Score: 1

      Ha ha. Did you even read the piece you quoted? First you say: "No, they can't 'imitate' the feature" and then you quote directly from Microsoft where the spec says: "To faithfully replicate this behavior, applications must imitate the behavior of that application." Come on. Obviously Microsoft doesn't expect the general public to have the source code to old versions of Office. And for the millionth time, there are several applications that can already handle and correctly format legacy .doc files including WordPerfect and OpenOffice, so the programming work is largely done to imitate those features. And again for the upteenth time, I'm not endorsing this as a standard, I'm with everyone else who opposes this. However, the examples in the article do not appear to be the huge "standard breaking" ace card that every witch hunter here seems to be making it out to be. I hate Microsoft as much as the next guy, but the article doesn't present anything newsworthy, just FUD.

    13. Re:I can't see this being too big of a problem by vidarh · · Score: 1
      You've missed the point - he's presented examples, not an exhaustive list. He even linked to another article detailing more brain damage.

      Also, again, you can get the precise behavior in a generic way. The post you replied to explained how. It's even ok to include information about _why_ certain information is like that, so that tools that have special knowledge about the format can do full roundtrips more easily. The problem comes when the format itself is burdening ALL implementors with replicating undocumented behaviour in order to reproduce a document faithfully.

      In fact, if you care about faithfully reproducing documents, then including a generic description of the various attributes is essential, or different implementations WILL fail to accurately reproduce them.

    14. Re:I can't see this being too big of a problem by Anonymous Coward · · Score: 0

      That's his entire point. The standard says you must imitate Word (Or WordPerfect) but an implementation can not correctly imitate Word (Or WordPerfect) because Microsoft don't tell you what you have to do in order to correctly imitate those products.

      And for the millionth time, there are several applications that can already handle and correctly format legacy .doc files including WordPerfect and OpenOffice..

      At what point did those applications gain 100% perfect Word DOC import? Do they correctly implement Word 95 full-width character spacing if they encounter a document that uses it? What about a document that was created on Word 5 for Mac that uses small caps? Can OpenOffice.org handle those perfectly? No? What a shame: guess they can't fully implement the OOXML "standard".

    15. Re:I can't see this being too big of a problem by pstorry · · Score: 1

      When converting?

      Well, yes, that will be the most common issue. But OOXML is supposed to open up documents for easy, cheap creation and manipulation by third parties. It's Microsoft's response to the ODF claim that now, you don't need the application which generated the data file.

      Why is this an advantage? Well, imagine that you have a webserver serving financial quotes, and want to generate those quotes on the fly using the very latest data. Now you can output those quotes as documents quickly and easily, because you have the spec, right?

      (Pedantically, yes you might choose to use PDF because it's "non-modifiable". I have a copy of WordPerfect X3 that allows roundtrip editing of PDFs. I have PDF cracking software that removes passwords/locks. And I have a copy of Acrobat. I'll modify your PDF if I want to. If you want to ensure your quote remains secure, then either don't give them the file or keep a copy of it for comparison later... Now back to the example.)

      With ODF, outputting the file is easy. Same with OOXML. They're about equal for new documents.

      What if your app needed instead to read old documents for some reason? Perhaps it's looking for links into your CRM system, using hints like the text before "yours sincerely" or for text which is right-formatted at the top of a page?

      So to make this easier, you bulk convert your old data to OOXML. And then set your agent loose on the data.

      Now what does useWord97LineBreakRules mean to you? Formatting is important in this context. It's how you're trying to recognise data. What the heck is a Word 97 Line Break Rule? Apparently, it refers to Far Eastern language blocks. Does it explain the weird formatting in all your RTL formatted paragraphs? Is this why the program works on documents from your Western offices but not ones from the Eastern offices?

      A specification's job is to tell you not only that something exists, but also what it's for and what is does. Too often, Microsoft wave the "it's old, don't use it" wand in their OOXML spec, and whilst that's fine for new documents it's an appalling disservice to the documents you already have.

      Of course, this is to Microsoft's advantage anyway. Having read about the spec, I plan to try and read it. But I think that I'll stick to OLE automation when it comes to manipulating Office documents, because the OLE/COM interfaces are better documented and better supported. And they have the advantage of letting Word figure out what the heck useWord97LineBreakRules means, rather than them having to do it.

      This is probably what Microsoft want. Every application which reads and writes OOXML files natively and isn't part of Office is, to them, a potential lost Office sale. Why go out of their way to support that? OOXML sets the bar for using their XML suitably high, encouraging developers to use the existing OLE/COM interface rather than use the XML.

      (And in my job I maintain a small app which uses OLE/COM to scrape data from an Excel spreadsheet that the user provides, so I'm fairly sure that OLE/COM would be easier than using the XML. Why did I use OLE/COM? Because the users tend to prepare the data in Excel anyway, so it cuts down on the steps required. I'd rather pull it from CSV or XML if I had the choice, but Excel is more convenient for the users on the whole, and prevents me from having to explain what File -> Save As does over and over again. And most importantly, because someone else wrote the core of the app, and I inherited it like that later. I've just been tweaking it as necessary. *grins*)

      To go back to my example - who would want to scan their old documents and put them into a CRM system as possible related items? Well, I bet most businesses would be open to the idea. Until they see the cost. I'd also bet that it would be cheaper to accomplish with ODF documents than it would OOXML documents, because of the kinds of issues I've mentioned. In that regard, OOXML will actually hold back the IT industry - it will prevent cool things from being

    16. Re:I can't see this being too big of a problem by cduffy · · Score: 2, Insightful
      By "updating" these pages by not supporting the old format you will breaking the layout of some documents that need a consistent layout.

      You're missing his point: When converting the file to OOXML, one can and should add generic tags indicating the specific (broken) behavior which should emulated (such as "scale small caps by this percentage point") rather than just specifying a generic "Do What I Mean" marker without any useful guidance on how rendering of documents containing this marker should be implemented.

      As long as tags indicate for all the relevant changes (like scaling small caps), the document will then look the same even without the DWIM markers.
    17. Re:I can't see this being too big of a problem by jlarocco · · Score: 1
      And again for the upteenth time, I'm not endorsing this as a standard, I'm with everyone else who opposes this. However, the examples in the article do not appear to be the huge "standard breaking" ace card that every witch hunter here seems to be making it out to be.

      If that's the case, you don't seem to understand the purpose of an ISO standard.

      An ISO standard is supposed to be the definitive guide to a particular thing. They (usually) go into painstaking detail and explicitly describe exactly what each feature does, and specifically state any undefined or implementation specific behavior. See the C standard, C++ standard, Ada standard, POSIX, or any other ISO standard to see a few examples. They're not supposed to have vague descriptions like "Mimic the behavior of this 12 year old, hard to find program, whose behavior we can't describe."

      How am I supposed to write code to use Office XML, when the standard can't even come up with an English description of what that code is supposed to do? It mocks the entire idea of having a standard.

    18. Re:I can't see this being too big of a problem by clodney · · Score: 1

      But given all the complaints I see hear on /. about how Word does such a crappy job of opening older files, how is MS going to claim that they are fully compliant either?

      If it comes to a pissing contest, which is going to seem worse: MS can't read old MS files faithfully, or random vendor XYZ can't read old MS files faithfully.

      Or is Word's ability to read old documents perhaps not as awful as is usually claimed here?

    19. Re:I can't see this being too big of a problem by Anonymous Coward · · Score: 0

      Oh don't worry about that, Microsoft wrote themselves a handy get-out clause that allows them to claim compliance without having to implement the full standard. This would be amazing, if we weren't talking about a company that is also trying to redefine 1900 as a leap year just to work around a bug in Excel that they could fix in a couple of lines of code.

      Anyway, do you think it will matter when Microsoft start their marketing campaign to tell everyone that "FooOffice doesn't implement the complete OOXML standard. Buy Microsoft Office if you want to be sure that you can work with other people!"

    20. Re:I can't see this being too big of a problem by Hymer · · Score: 1

      must comment on your sig... sorry... Running Windows Vista with a BSD kernel... err... I meant BSOD kernel
      You should consider upgrading to the new RSOD kernel, unless you are running the new UI on the old kernel...

    21. Re:I can't see this being too big of a problem by Anonymous Coward · · Score: 0

      > Now look, I understand this format is Microsoft's best interests and not the end user, but the entire time all I've been trying to say is that the fundamental points of this article are over hyped, they're going to effect a small number of users, and that WordPerfect and OpenOffice already have the programming to deal with legacy Word format problems. That's it. That's my point. My point is not that OpenXML is good, it sucks.

      The problem is that documents converted to OpenXML will only be readable by Word, whereas if they'd been converted to ODF, anyone would be able to read them (and they'd be free of ancient legacy weirdness).

      So if Microsoft gets their way, we'll be stuck with this craptastic standard instead of a good one.

  20. Death would be too easy. by Kadin2048 · · Score: 4, Funny

    I think we need to do some sort of "Trading Places"-esque scheme, where all the Microsoft board members go to sleep one night as usual, but wake up the next morning working in Bangalore at an outsourced call center for OEM tech support.

    At the same time we'll let the tech support drones have their way with the Microsoft campus, which I suspect will involve setting it on fire.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
  21. Re:MIcrosoft sucks. by DavidShor · · Score: 0, Troll

    Take a economics course before you shovel anarchic crap down our throats.

  22. My favorite quote by IvyKing · · Score: 3, Insightful
    From TFA


    This is not a specification; this is a DNA sequence.


    Outrageously funny and to the point.

  23. Bah! by mrchaotica · · Score: 2, Funny

    The strings command supports all legacy document formats! What more could you possibly need? Besides, formatting is overrated anyway...

    --

    "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    1. Re:Bah! by Solra+Bizna · · Score: 1

      PDF?

      -:sigma.SB

      --
      WARN
      THERE IS ANOTHER SYSTEM
    2. Re:Bah! by MrMr · · Score: 1
  24. FUD; you guys are scared to death by Anonymous Coward · · Score: 0

    you fools thought that ODF would be the end of MS Office because, while MS destroys you in functionality, you'd get everyone to forgoe functionality for an open format. And you'd also immediately render billions of existing MSO documents obsolete if you could get govt to mandate ODF exclusively. And the bonus is that such govt mandate would render any and all features not supported by ODF (i.e. not supported by OO.o) irrelevant. But MS opened their own format, thus leveling the playing field so that you must again compete on features (rather than merely having an OAISIS format that was rubberstamped by ISO (please, you don't even have spreadsheet formulas spec'ed, and yet ISO ratified it? What do you call that, if not "rubberstamping"?). your plan failed, so now you resort to FUD.

    Facts:
    OOXML is being implemented by:
    MS Office
    Corel Wordperefect Office
    Apple iWork 2007
    OO.o (via Novel)

    And is being implemented on multiple OSes: Windows, Linux, Mac.
    OOXML is just as open as ODF, does more than ODF, is faster to load than ODF, and has smaller file size than ODF. You lost, both on technical merits and in the marketplace.

    It must sting like a bitch that your grand strategy to marginalize MS Office blew up in your faces!!

    1. Re:FUD; you guys are scared to death by Anonymous Coward · · Score: 0

      The goal is not to marginalize MS Office, it is to interoperate with it. MS took down the office specs in 1999. That wasn't very nice, was it? You had to buy a license to interoperate with it.

      OOXML may be open, but it is still controlled by a single entity: Microsoft. ODF is controlled by a group of entities. I think its better to be controlled by a group because then one entity can arbitrarily change the spec to screw with the other entities.

      OOXML may do more than ODF... but is that a good thing? OOXML weighs in at 6000 pages and apparently contains many legacy features. ODF weighs in at 800 pages.

      How did you arrive at the conlusion that ODF files are larger than OOXML? Is there something in the spec that makes ODF larger? It seems to me that file size will vary vendor by vendor. Also the file loading times will vary vendor by vendor.

      What does it mean to win in the marketplace? Are you saying that OOXML is supported by more vendors? How many of the vendors that implement OOXML will also implement ODF? OO saves as ODF. Isn't there a plugin to make MS Office itself save as ODF? It seems to me every major word processor will be able to at least load ODF if not save. Also even though these vendors implement OOXML how complete is the implementation?

      You also conveniently left out the fact that ODF is likely easier to implement than OOXML. It's a smaller spec and reuses existing technologies like SVG and MathML.

      We can debate all you want about whether ODF was "rubberstamped"... but at the end of the day it has been stamped. Is is a standard as of May 6, 2006. OOXML still has not been approved, and there is some doubt as to whether it will be.

      Your conclusion that ODF loses on technical merits is dubious at best. I'm sorry, but you come off like a MS fanboi.

    2. Re:FUD; you guys are scared to death by Anonymous Coward · · Score: 0

      Dear troll,
      you are sadly mistaken.

      you don't even have spreadsheet formulas spec'ed
        how did you come to this conclusion?
        Please read:
        http://wiki.oasis-open.org/office/About_OpenFormul a
        http://www.openmalaysiablog.com/2006/11/the_formul a_iss.html

      OOXML is just as open as ODF,
        RTFA

      does more than ODF,
        Maybe. But, this is not necessarily a good thing. Actually, reading the article, is probably a bad thing. What are you referring to?

      is faster to load than ODF, and has smaller file size than ODF.
        is it? There are many who come to opposite conclusions. And they do have numbers to back it up:
        http://www.robweir.com/blog/2006/10/why-is-ooxml-s low.html

      You lost, both on technical merits and in the marketplace.
        Don't think so. As we can see above, OOXML pretty much looks like crap to me.

    3. Re:FUD; you guys are scared to death by cching · · Score: 1

      is faster to load than ODF, and has smaller file size than ODF That's because what they don't implement in the file format has to be implemented in code. Indeed:

      lineWrapLikeWord6 means that's all you have to specify in the document, but then you have more, specific code *in the application* to handle just that. And they they aren't even going to tell you *how to write that code*. They just say "go load up word 1.0 and figure it out for yourself."

      Some people just can't think past what's in front of their eyes.
  25. Re:MIcrosoft sucks. by Al+Dimond · · Score: 2, Insightful

    No, actually, I think you'd find it takes the skill of many people, good timing, and luck to be successful in business, even if you could break very many laws. Creating and sustaining a business for many years is hard. Not very many businesses make it.

  26. Re:MIcrosoft sucks. by WalterGR · · Score: 1

    The crazy amount of backwards compatibility is what allowed Microsoft to rise to the position it holds today...

    Or maybe it was their illegal business tactics?

    Or maybe I, too, can post unprovable, untestable anti-Microsoft conjecture to slashdot and get modded up?

  27. Re:Suck it up by oohshiny · · Score: 2

    Are you kidding? This is not a format specification. And it reflects badly on Microsoft and the engineers that authored this document: either they are too stupid to know that this is not a specification, or they are taking everybody else to be fools.

  28. Re:MOD DOWN - TROLL by strider44 · · Score: 0, Offtopic

    I wonder how many people were like me and actually checked to see if the GP had that in it? Troll credit for making me look twice!

  29. wow, subtle... by Anonymous Coward · · Score: 0

    It's a crazy law, but that's how it works. Microsoft broke no federal laws to *gain* their monopoly.

    But they almost certainly broke many fair business practices laws.

    But you're quite subtle indeed... those are state laws. You aren't by any chance working for Microsoft?

    1. Re:wow, subtle... by poopdeville · · Score: 2, Informative

      Call me a shill if you want, but I've seen NineNine post in a lot of different threads not related to MS. I think he's acting in good faith.

      --
      After all, I am strangely colored.
    2. Re:wow, subtle... by Anonymous Coward · · Score: 0

      this AC still disagrees with him.
      More, his post is flamebait with very little added. (that's the real shame)

    3. Re:wow, subtle... by Zontar+The+Mindless · · Score: 1

      I've known NineNine for ten years, met him him through a different board long before he ever showed up on /. - I'm pretty sure he doesn't work for Microsoft, and I think he honestly believes most of the crap he posts.

      --
      Il n'y a pas de Planet B.
    4. Re:wow, subtle... by I'm+Don+Giovanni · · Score: 1

      Anyone notice the increasing number of anti-MS slashdotters accusing slashdotters that don't despise MS as "shills" so that they don't have to deal with the argument that those accused "shills" put forward?

      Almost like the accusers ran out of arguments so they pull the "shill" card in an attempt to magically discredit any poster that goes against the "MS is evil" groupthink.

      --
      -- "I never gave these stories much credence." - HAL 9000
  30. No bragging rights there. by Erris · · Score: 1, Troll

    OpenXML is Microsoft trying to translate its proprietary DOC file inside a XML container .... The good thing for Microsoft, is that they can pretend this limitation is "Not-a-bug-but-a-feature", and brag around that there are a lot of stuffs that MS-Word couldn't store inside an ODF and only OpenXML can carry.

    Pretend is the operative word. Translation is supposed to happen when you import the crufty old crap. M$ may have an advantage there, but you won't find that ability in the 6000 pages of their spec. The only place you will really find 20 year binary history of M$ Word is in M$'s source code, which itself must be contradictory and crazy because it's not really compatible with itself and has never been a portable typesetting system. Basically, the extra material in M$XML is noise and stuff they will never use. That and stuff you use that's not described is exactly what you want to make sure no one else can implement it.

    The real bragging rights go to those who manage to open those nasty old documents without all of the bloat. KWord will happily import most Word Perfect documents and save them as ODF. Open Office as good a job as Word does with the crazy formats. M$XML is yet another waste of time from DoS central.

    --
    DMCA, Hollings, Palladium. What might have sounded like paranoia is now common sense.
    1. Re:No bragging rights there. by medlefsen · · Score: 2, Funny

      When you hold down shift and slowly extend your finger towards the 4 key are you seriously thinking to yourself, "Ha, take that Microsoft!" Cause if you are you need a new hobby.

    2. Re:No bragging rights there. by Anonymous Coward · · Score: 0
      twitter, please read this carefully. Following this advice will make Slashdot a better place for everyone, including yourself.

      • As a representative of the Linux community, participate in mailing list and newsgroup discussions in a professional manner. Refrain from name-calling and use of vulgar language. Consider yourself a member of a virtual corporation with Mr. Torvalds as your Chief Executive Officer. Your words will either enhance or degrade the image the reader has of the Linux community.
      • Avoid hyperbole and unsubstantiated claims at all costs. It's unprofessional and will result in unproductive discussions.
      • A thoughtful, well-reasoned response to a posting will not only provide insight for your readers, but will also increase their respect for your knowledge and abilities.
      • Always remember that if you insult or are disrespectful to someone, their negative experience may be shared with many others. If you do offend someone, please try to make amends.
      • Focus on what Linux has to offer. There is no need to bash the competition. Linux is a good, solid product that stands on its own.
      • Respect the use of other operating systems. While Linux is a wonderful platform, it does not meet everyone's needs.
      • Refer to another product by its proper name. There's nothing to be gained by attempting to ridicule a company or its products by using "creative spelling". If we expect respect for Linux, we must respect other products.
      • Give credit where credit is due. Linux is just the kernel. Without the efforts of people involved with the GNU project , MIT, Berkeley and others too numerous to mention, the Linux kernel would not be very useful to most people.
      • Don't insist that Linux is the only answer for a particular application. Just as the Linux community cherishes the freedom that Linux provides them, Linux only solutions would deprive others of their freedom.
      • There will be cases where Linux is not the answer. Be the first to recognize this and offer another solution.

      From http://www.ibiblio.org/pub/linux/docs/HOWTO/Advoca cy

  31. Re:Suck it up by Al+Dimond · · Score: 1

    Because there's an opportunity for the format to not be ugly, so that the engineers can get as much done with less work and spend the rest of the time doing something that's really useful instead of duplicating their futile efforts. Or they might just kick off early and sip margaritas on the beach for all I care.

  32. M$ DNA by Erris · · Score: 1
    This is not a specification; this is a DNA sequence.

    It's appropriate to note that the 6000 pages will only fit the DNA of a few pathogens:

    "Measured as Manhattan telephone books, each containing about 1,000 pages of 10-point type," said Simpson, "the genome of the bacterium E. coli is about a third of a book. Baker's yeast, which is my specialty, is a full book. The human genome will occupy two hundred books."

    Other parts of the article about genetic disorders, witches and demonic possesion are also appropriate when talking about M$.

    --
    DMCA, Hollings, Palladium. What might have sounded like paranoia is now common sense.
    1. Re:M$ DNA by Anonymous Coward · · Score: 0
      twitter, please read this carefully. Following this advice will make Slashdot a better place for everyone, including yourself.

      • As a representative of the Linux community, participate in mailing list and newsgroup discussions in a professional manner. Refrain from name-calling and use of vulgar language. Consider yourself a member of a virtual corporation with Mr. Torvalds as your Chief Executive Officer. Your words will either enhance or degrade the image the reader has of the Linux community.
      • Avoid hyperbole and unsubstantiated claims at all costs. It's unprofessional and will result in unproductive discussions.
      • A thoughtful, well-reasoned response to a posting will not only provide insight for your readers, but will also increase their respect for your knowledge and abilities.
      • Always remember that if you insult or are disrespectful to someone, their negative experience may be shared with many others. If you do offend someone, please try to make amends.
      • Focus on what Linux has to offer. There is no need to bash the competition. Linux is a good, solid product that stands on its own.
      • Respect the use of other operating systems. While Linux is a wonderful platform, it does not meet everyone's needs.
      • Refer to another product by its proper name. There's nothing to be gained by attempting to ridicule a company or its products by using "creative spelling". If we expect respect for Linux, we must respect other products.
      • Give credit where credit is due. Linux is just the kernel. Without the efforts of people involved with the GNU project , MIT, Berkeley and others too numerous to mention, the Linux kernel would not be very useful to most people.
      • Don't insist that Linux is the only answer for a particular application. Just as the Linux community cherishes the freedom that Linux provides them, Linux only solutions would deprive others of their freedom.
      • There will be cases where Linux is not the answer. Be the first to recognize this and offer another solution.

      From http://www.ibiblio.org/pub/linux/docs/HOWTO/Advoca cy

  33. Not really a problem if ... by AlanS2002 · · Score: 1

    companies make it their policy to only purchase software which uses truly open standards to store their data.

    --
    Not all conservatives are stupid,
    but it is true that most stupid people are conservative.
    - Hume
  34. Where I'm from, reverse-engineering... by PurifyYourMind · · Score: 3, Funny

    ...14- and 16-year-olds is illegal.

    1. Re:Where I'm from, reverse-engineering... by kfg · · Score: 1

      ...14- and 16-year-olds is illegal.

      Repeat after me:

      "O Canada!
      Our home and native land!
      True patriot love in all thy sons command
      With glowing hearts we see thee rise"

      When correctly viewed, everything is lewd.

      KFG

  35. Forbidden partial implementation? by tepples · · Score: 4, Interesting

    OOXML is just as open as ODF

    The behavior of years-old proprietary word processing software is included by reference into OOXML. How is any spec that includes by reference the behavior of proprietary software exactly "open"? True, implementors could produce a partial implementation of the spec that degrades away the legacy baggage (more or less) gracefully, but some standards' patent licensors forbid implementors to publish a partial implementation. I don't know if this applies to OOXML's license.

    1. Re:Forbidden partial implementation? by bigpat · · Score: 1

      but some standards' patent licensors forbid implementors to publish a partial implementation. I don't know if this applies to OOXML's license. My understanding is that Microsoft was going to enforce full implementations of the standard, except on themselves of course, so it really makes OOXML untenable as common format. What you get is an open standard that everyone but Microsoft has to follow exactly and implement fully, but only Microsoft could follow it and implement it fully. And then even if someone could manage to implement MS's XML fully and perfectly, then Microsoft could just pull a bait and switch would break compatibility.

      Actually, I am wondering about this so called open standard actually being a way to legally create compatibility problems. I am not sure what the legal framework under which OpenOffice currently opens and writes Word Docs and other Microsoft compatible formats. But it seems that if OOXML becomes the default format, then the only way to use the format would be under a Microsoft license, but if that license requires a full implentation, then Microsoft could probably find something that wasn't implemented the same as MS Office to sue over or else it could break its own compatibility with the format and then sue any company that tried to deviate from the ECMA standard, even though Microsoft can deviate from the standard with impunity and immunity.

    2. Re:Forbidden partial implementation? by TheNetAvenger · · Score: 0

      Are you actually reading what you typed?

      The behavior of years-old proprietary word processing software is included by reference into OOXML. How is any spec that includes by reference the behavior of proprietary software exactly "open"? True, implementors could produce a partial implementation of the spec that degrades away the legacy baggage (more or less) gracefully, but some standards' patent licensors forbid implementors to publish a partial implementation. I don't know if this applies to OOXML's license.


      And so you are saying OpenXML isn't open because it supports legacy documents, yet if someone were to use only ODF, they ALSO would not have any legacy support for older specifications?

      How can you logically argue this based on an assumption of what is REQUIRED, when there is nothing required for usage of the format.

      If someone doesn't want to support legacy documents, don't implement the older legacy portions of the specification, then it can be just as broken and open as ODF with regard to legacy features.

      Microsoft is NOT in any way MANDATING what features of OpenXML a company uses, they could use the specifications for bold and regular type and nothing else in their document format if they want, and it would at least be readable by other OpenXML capable wordprocessors. And they would at least be able to open the text from other OpenXML wordprocessors and display the regular and bold text from them if that is all their product IS DESIGNED TO DO AND SUPPORT.

      Does this make sense yet?

    3. Re:Forbidden partial implementation? by Anonymous Coward · · Score: 0

      WTF do you mean ?

      Would you like the ANSI-C spec to say something like:

      "Code using multiple increments on the same lvalue between sequence points should behave like specific-vendor-cc v1.0.7"

      Well, this is what the OOXML says. And that is not open. Because it means you need to have the specification, source code and test suites of specific-vendor-cc version 1.0.7 to be able to implement a fully compliant ANSI-C compiler.

      > "Microsoft is NOT in any way MANDATING what features of OpenXML a company uses, they could use the specifications for bold and regular type and nothing else in their document format if they want, and it would at least be readable by other OpenXML capable wordprocessors."

      This is not what a standard is about. You are only saying that companies can produce OOXML documents by using a subset of OOXML. Sure. But the issue is not competing with Microsoft in PRODUCING OOXML documents, but CONSUMING OOXML documents. And what the spec says is "only microsoft can CONSUME conforming OOXML documents". Hence everybody will have to use MSWord, and the monompoly is now mandated by the standard. This is what the article is about. Read-read it.

    4. Re:Forbidden partial implementation? by dkf · · Score: 1
      How is any spec that includes by reference the behavior of proprietary software exactly "open"?
      Easy. It's open... to abuse.
      --
      "Little does he know, but there is no 'I' in 'Idiot'!"
    5. Re:Forbidden partial implementation? by TheNetAvenger · · Score: 1

      Ok, you like the person that wrote the article are missing the key point of the specifications the article specifically talks about.

      Go back and find 'If applications wish to match this behavior' from the specification. This IS the key point the article clearly seems to overlook.

      There is NO reason for any application to PROVIDE Word95 support if they do not want to do so.

      Sure the OpenXML has ALL the guts of how WORD works and stores its files, but if MS ommited this from the specification everyone here would be SCREAMING that MS didn't disclose portions of the OpenXML format that WORD USES.

      By providing this information, even though MOST companies will NEVER use it or NEED to use it, MS has provided people with the tools if they do choose to support legacy Word 'concepts', they can do so, just as Word 2007 does.

      This DOES NOT EVEN MEAN Word 2007 uses these formats in saving a 'native' document, it just means that they are there for OpenXML add-on Readers for previous versions of Word that MS is/will be releasing for Word95-Word2003.

      Again, how this relates to the MAIN standardize portions of OpenXML is NOT RELEVANT, these are ONLY details of how OLDER WORD APPLICATIONS will handle OpenXML, but MS has fully disclosed even those portions knowing that the need for anyone to implement them would probably never happen.

      Further example, look at how few Word Processors fully handle Asian Characters in the first place, so they would NEVER even need to touch the Word95 Asian Character specification that isn't even used. Besides the fact that Word97-Word2007 won't even use these specifications when SAVING an OpenXML document, so why would anyone be so stupid to assume that MS wants all companies using OpenXML to use this specification?

      The entire specification has two major parts, the standard that is OpenXML that is used by Word2007, and ALL the legacy features of previous versions of Word that can also be used if a vendor wants to code for it. Basically the first part is the specification, and the second part is FULL DISCLOSURE of how Word Internally handles features going back to all previous versions of Word, something that is good information, but NOT SOMETHING THAT IS IMPORTANT OR REQUIRED BY ANYONE.

      For once MS has provided full disclosure of how Word works and has worked, and people are complaining still because they provided TOO much information about how Word USE TO WORK.

      GEESH!!!!

  36. Isn't ODF an updated OpenDoc? by msobkow · · Score: 1

    I thought ODF was an updated version of the venerable OpenDoc standard pioneered by IBM, Apple, and others. Doesn't it mean "Open Doc Format"?

    If so, it was a defacto industry standard long, long, long before OpenOffice existed.

    --
    I do not fail; I succeed at finding out what does not work.
    1. Re:Isn't ODF an updated OpenDoc? by itlurksbeneath · · Score: 2, Informative

      No. See OpenDocument and OpenDoc. Two different things. Sort of...

      --
      Have you ever considered piracy? You'd make a wonderful Dread Pirate Roberts.
    2. Re:Isn't ODF an updated OpenDoc? by imroy · · Score: 4, Informative

      No. Everyone shortens the expansion of ODF to "Open Doc Format". As you note, there was an "OpenDoc" long before OpenOffice.org and the OpenDocument Format, but they have nothing to do with one another. ODF is based on the StarOffice format that has been around a while, that much is true. Just a sad case of mixing up names...

  37. I can. by mrchaotica · · Score: 1

    What you don't seem to realize is that your list of people "effected" by this is irrelevant. There's exactly one category that matters:

    1. People who will not even consider alternatives to MS Office by the fact that only Microsoft could possibly ever claim "full compliance" with the "standard."
    --

    "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

  38. Re:MIcrosoft sucks. by kjart · · Score: 1

    But they broke plenty of laws to keep their monopoly :) And while their actions during their rise to the top may not have been illegal, they could easily be called 'strong-armed'.

    What they got in trouble for was actually using their monopoly to get into other markets - i.e. bundling IE with the OS meant that they used their OS monopoly to get into the browser market. There was also doing things like offering Office at a discount if vendors bought Windows. This is the kind of thing that everybody does (think of all the apps that Apple includes) and doesn't become a problem until you have a monopoly.

    In terms of 'strong-armed' tactics, I don't think the Simpsons were being literal with Bill Gates breaking pencils and messing the place up when 'buying him out' :)

  39. Disadvantages of ISO by BillGatesLoveChild · · Score: 5, Interesting

    Once it is ratified as an ISO Standard, the standard is locked up and anyone that does want to a copy has to buy it from ISO. These are copyrighted. They're not cheap; thousands of dollars. Out of the reach of the average hobbyist, and not listed anywhere on the Internet. That 6,000 page draft will vanish into the mists of time.

    Larger Companies can afford this, but garage companies and hobbyists definitely can't. So what's the chance of an open source or even small upstart challenging Microsoft's Documentonopoly? Zero.

    Want another example? ISO country codes. The country codes (e.g. .us, .jp) are actually ISO, and ISO ended up backing off on a demand for royalties for this(!) But if you want state codes (e.g. California, Kantou), well, forget it unless you want to buy them off ISO. http://www.alvestrand.no/pipermail/ietf-languages/ 2003-September/001472.html

    ISO aren't the only ones guility of doing this. IEEE do it as well. Want the latest simulation standard? Then get out your checkbook: http://standards.ieee.org/catalog/olis/compsim.htm l

    ISO and the IEEE are enemies of openness. Microsoft is taking a page out of their gamebook.

    ISO or IEEE certification is a *bad* thing.

    1. Re:Disadvantages of ISO by EvanED · · Score: 2, Informative

      Once it is ratified as an ISO Standard, the standard is locked up and anyone that does want to a copy has to buy it from ISO. These are copyrighted. They're not cheap; thousands of dollars. Out of the reach of the average hobbyist, and not listed anywhere on the Internet. That 6,000 page draft will vanish into the mists of time.

      You mean like the C++ standard (ISO:14882) which can be downloaded as a PDF for $32 or purchased hardcopy for something like $300, and for which there are multiple sources for drafts?

    2. Re:Disadvantages of ISO by BillGatesLoveChild · · Score: 2, Informative

      They wouldn't get too far gauging you for a C++ manual. Here are some examples of what I am saying: ISO/IEC TR 9126 "Software engineering -- Product quality " US$153 each volume * 4 volumes = US$612 IEEE 1278 US$151 each volume * 6 volumes = US$906 Problem is when you are told your software has to comply with one of these, these are the only shops in town. They prohibit copying or sharing the information. Anyone who wants to meet the standard has to send I$O or I money, and there are many, many of these standards. A typical tender might list twenty.

    3. Re:Disadvantages of ISO by Anonymous Coward · · Score: 0
      That 6,000 page draft will vanish into the mists of time.


      Not unless it's mirrored. There's also archive.org.

      Why don't you throw up a page with the files?
    4. Re:Disadvantages of ISO by Anonymous Coward · · Score: 0

      As far as ODF (ISO 26300) is concerned, you can get a free (as in beer) copy of the standard at Oasis

  40. MS areslow learners by WebCowboy · · Score: 3, Interesting
    ...but they do learn....slowly...eventually.

    Their "open" XML format for office docs is a prime example of this.

    I think Steve Jobs was the one who first said "Microsoft just doesn't get it". Microsoft was probably the very first third-party software developer for the Mac and this was Jobs' reaction to Microsoft's first Mac applications (I think a port of Multiplan--which was re-incarnated into Excel IIRC, and MSBasic). They really WERE "tasteless", ugly and took almost no advantage of the revolutionary GUI interface--their DOSness really showed through--I think in the case of Multiplan the mouse could be used only to jump the cursor to a certain cell and that was it--the rest was all like in DOS.

    MS Windows is another example--Microsoft didn't "get it" well enough until the third major release. Now MS is SLOWLY "getting it" with the beneficial characteristics of XML standards. Microsoft's early XML efforts are like Windows 1.0--there is some very rudmentary understanding of the mechanics but not the philosophy of XML, and I wonder if this is why SOAP ended up NOT so simple (given Microsofties were involved in its creation and seemed to be trying to make it a DCOM-in-XML-but-dumber thing). Microsoft's "Version1" XML might look like this:

    <Soap:Envelope>
    <Soap:Body>
    <wsWriteLegacyData>
      <encodedBinaryData>
    SDFgkdfkljSDFJLDFSJKLkjdfbks df jklsdfklj;hk/jkjnb.kndf
    jk.sdfjkldfsddfsdfkkjsdfh kvbkjnkjkjksdfkjsdfkeuieru903
    oijooeoefvkmefmklef lmkseflkvfeklmlmermklemleflmdvldflk
    </encodedBina ryData>
    </wsWriteLegacyData>
    </Soap:Body>
    </Soa p:Envelope>
    "See? We're using XML and SOAP! We're hip! We're cooool! You can't say we don't play by the rules now!"

    Of course, this is an obtuse, opaque and obsfucated way to use XML andtotally NOT in the spirit of interoperability and openness. I won't even go into the nifty XML tools MS has made...nifty to use but they've done a lot to obliterate the S out of SOAP in their crazy output.

    OOXML (Opaque and Obsfucated XML) standard is "version 2.0"--they're doing their best to eliminate ambiguity but now we've gone over to hyper-specificity, and the standard is being shared a bit better...problem is that they don't fully describe the interpretation of the standard elements so as to keep its advantage. All they've done is taken every formatting option and mapped it to an XML element--it is monolithic and completely non-extensible. But hey, at least its publicly available and doesn't involve weirdness like encoded-binary-blobs.

    In a few years MS will reach version 3.0 of "getting" XML...
  41. Re:MIcrosoft sucks. by edwdig · · Score: 3, Insightful

    Microsoft broke no laws getting DOS onto every PC. They happened to be in the right place at the right time, and the market fell onto them. But from there, Microsoft bended and broke the law every chance they got to ensure that there never was any competition.

    Also don't forget that although MS's purchase of DOS was perfectly legal, it was ethically horrible. They arrived at a handshake agreement to license the code from Seattle Computer Company. While the MS paperwork was being finalized by the lawyers, SCC then made arrangements to finance other business ventures using the MS money. MS then presented them a contract to buy the code rather than license it, and told SCC to take it or leave it. As SCC had already committed to the other deals, they had no choice but to take MS's offer. Sure, no one held a gun to the head of the SCC executives forcing them to take the deal, however, they didn't have any other reasonable alternatives. MS's behavior was legal, but certainly not ethical.

  42. Unfair by BCoates · · Score: 0

    This spec sounds like a bloated monster, but the criticism the FA is making is entirely unfair. If OOXML is going to be a useful one-size-fits-all document format, it'll need to be a superset of all existing things word processors can do, even the weird old bits that don't make much sense. There's two ways to do this: Either spec out the broken behavior into the already-bloated specification, or add a flag that says "old broken spacing" and let implementors decide how faithfully to represent it.

    If they take the first option, then writing a tool that converted to and from OOXML would be a nightmare, you'd have to work out all those broken options into something that looked right, even if the end application supported it natively, since the converter app would be the last chance to attempt this obscure conversion. Making the old format->OOXML->old format loop actually end with a document that rendered anything like the starting document would be pretty much impossible.

    The way they did it, a converter app that reads in those standards can just set the appropriate flag, and let the downstream renderer deal with it. If the user actually needs these crazy old features they can go get a patch to their wordprocessor to support it; or they can find a special-purpose converter that modifys the document to not need the flag anymore; or they can convert the doc back to the original obsolete format and open it in the ancient app itself. If the document had already been mangled by a half-baked conversion/export tool, the user couldn't have done any of these.

    Tools that don't care about legacy support are unaffected by this; they can just pick the closest modern option to whatever the legacy flag calls for on input, and not output documents that use them.

    1. Re:Unfair by DavidTC · · Score: 4, Insightful

      Tools that don't care about legacy support are unaffected by this; they can just pick the closest modern option to whatever the legacy flag calls for on input, and not output documents that use them.

      And thus tools, legally, are not OOXML, and won't qualify for purchasing by companies that specify OOXML. Which is the entire point.

      There's a difference between 'We need to make sure that old documents can be converted correctly.', and 'We will literally convert old documents into a new representation that contains all their weirdness, and we won't explain how to implement said weirdness in the standard.'.

      What Microsoft has produced is not even a standard. Standards must specify everything, or reference other standards that specify everything. They can't reference applications.

      If Microsoft wants to keep secret how to turn Office 95 documents into OOXML, fine. Producing a standard doesn't mean you have to explain how to convert things into that standard.

      It does, however, mean you have to explain exactly what should happen if mwSmallCaps is true, to the pixel. You can't just pawn it off on the unexplained hypothetical behavior of some other application.

      --
      If corporations are people, aren't stockholders guilty of slavery?
    2. Re:Unfair by Askmum · · Score: 3, Insightful

      So they did it wrong.

      You need to let a conversion program worry about converting Word 2006 documents to XML documents. You need to let the maker of Word 2006 worry about making this conversion program. This can be in the form of a "save as XML" option, but also an external program.
      You can not say "oh, this is an old feature, let's put it in the spec and let's let the programmer that uses this spec worry about it because we can't be bothered to convert it or don't know how to convert it".
      Sorry, but XML should be clear to everyone and if you include an option, you should document the behaviour of this option

      But even so, you do not want the specification of a new document format have all the quirks of all the old formats. That is just silly. That is saying that a car should have a 6V battery system too because old cars have 6V battery systems and you might come across an acessory that uses 6V.

    3. Re:Unfair by SanityInAnarchy · · Score: 1
      Let's see -- first off, it would help if Microsoft had actually spelled out how this is supposed to work, rather than requiring us to have a copy of every word processor since WordPerfect and carefully reverse-engineer its behavior in every case.

      Second, you're right, old format -> OOXML -> old format... yeah, that would kind of suck. However, I don't get why you would ever want such a loop to work. Old format -> OOXML -> some other format -- now that makes sense. Think of it this way -- you can burn your mp3s as music CDs, then rip those CDs into mp3s, but you're going to lose something.

      But keep in mind: If OOXML -> old format is to work, it must be able to cover anything OOXML might do, including hacks to make the old format look right.

      And more to the point, have you actually looked at these? Some of them seem entirely too simple: Emulate WordPerfect 5.x line spacing. Really, how hard would it be to simply have a generic <p spacing="0.8"> (just an example, I don't know how this is done in practice or exactly what the difference is) in your conversion of WordPerfect to OOXML? And, going the other way, couldn't you just perform a similar calculation to determine line spacing?

      I mean, you're right, it takes a bit of effort. Yeah, your conversion tool takes a little more effort to write than it otherwise would. I think that's an acceptable loss, considering it saves a massive amount of effort from every fucking word processor to implement this standard, EVER.

      As for some of this other stuff, some of it seems inevitable, if you want the document to look perfect. And if you accept that, then yes, you need these tags -- but would it've killed Microsoft, in their 6000 word "spec", to spell out what an app's actually supposed to do? Specs are supposed to be the basis for implementations -- they should not have to refer you to another implementation -- and it certainly doesn't help Microsoft's case of "open"-ness when it would cost me a significant amount of money to obtain all the word processors they refer to.

      I, for one, don't accept that. The vast majority of these tags seem to be referring to specific weird behavior, like character spacing being slightly off, when they could've simply given us an option to specify how much space between each character. That is, whether it's for "backwards compatibility" or not, you should be able to specify... oh, let's say:

      <shape>
        <point x="0" y="0"/>
        <point x="1" y="0"/>
        <point x="0" y="3"/>
      </shape>
      Rather than... say:

      <triangle type="CheeseWedge"/>
      Now, it's true, the second one takes more space. That's why we use compression techniques -- both actual zip compression and programmatic compression. For instance, we could do something like this:

      <shapedef id="CheeseWedge">
        <point x="0" y="0"/>
          <!-- you get the idea... -->
      </shape>
        <!-- now, somewhere in the document: -->
      <shape id="CheeseWedge" />
      Worst case, you can use XML entities.

      Now, you're right, half-baked conversion tools will screw you over -- but so will half-baked software. And really, after any conversion like this, doesn't it make sense to go back and check that the document was converted properly before deleting the original file?

      The point of a new standard is not to support tons of legacy documents and software. It's to make sure we're all on the same page moving forward, and Microsoft's 6000 page monstrosity doesn't help much with that, especially when it's not even complete till you buy and reverse engineer 16 years worth of word processors.
      --
      Don't thank God, thank a doctor!
    4. Re:Unfair by Anonymous Coward · · Score: 0
      If they take the first option, then writing a tool that converted to and from OOXML would be a nightmare

      Poor converter writers... Who will be they? Oh, yes, Microsoft Office developers! And guess what: they already have the implementation for the broken features.
      if the end application supported it natively

      There's only one application like this: Word.
      So, to recap your argument: ooxml is fine because writting doc->ooxml converters is easy (even if one just needs one converter, that provided by Microsoft), even if this means that EVERY end application must take the burden of emulating the broken bits of Word. And saying that a format that forces vendors to take care of their own broken stuff is better is unfair. Have I missed something?

    5. Re:Unfair by Todd+Knarr · · Score: 1

      What Microsoft has produced is not even a standard. Standards must specify everything, or reference other standards that specify everything. They can't reference applications.

      I think that's exactly where MS has issues. And I don't think it's anything they deliberately try to do. It's been speculated that not even the Word developers have a spec for the Word .doc format. All they've got is a class that was given to them by someone else that can read and write .doc files, and a definition of the internal data structures that class produces and accepts. They confuse the specification with the application because to them the application is in fact the only specification they have. This also explains why they keep going off about source code when everybody else is talking about documenting file formats, because inside MS the only documentation on the file formats is the source code that handles them.

    6. Re:Unfair by DavidTC · · Score: 1

      If MS doesn't know what's going on in the .doc format, they had a perfectly logical way to do everything:

      They could have written a standard that included generic workarounds, like 'offset this data one pixel upward' or 'put this footnote here, instead of over there where it logically belongs', to cope with some random behavior of some version of Word that no one knows about.

      Then their converter could just add that in when converting, and other people who've already reverse-engineered .doc readers could too, or just not bother, and everyone could display the stuff properly once converted.

      Instead, now we have all producers of products that read OOXML have to reverse engineer all versions of Word and even WordPerfect. If I'm writing a new word processor that doesn't read Word 95 files, why the hell do I need to know how Word 95 formatted something to implement OOXML?

      Someone writing a converter obviously will need to know that, and put in generic layout workarounds that fix specific problems, but the whole damn point of open standards is that we start using them so every word processing application doesn't have to be able to know eight million formats, and Microsoft managed to make an 'open standard' that requires exactly that.

      --
      If corporations are people, aren't stockholders guilty of slavery?
    7. Re:Unfair by spitzak · · Score: 1

      No, MSWord should pop up a scary-sounding warning like "SOME FORMATTING MAY BE LOST" and save the file the best it can.

      Rest assurred that if they implemented ODF it would pop up the most scary warnings possible, saying the user's very life may be threatened if they save this way, even if nothing would be lost. They should apply the same rules to their own code, and stop this bullshit.

    8. Re:Unfair by Simetrical · · Score: 1

      Standards must specify things "to the pixel"? Gee, it would be nice if CSS did that.

      --
      MediaWiki developer, Total War Center sysadmin
    9. Re:Unfair by DavidTC · · Score: 1

      CSS does, barring a few places that were unclear and can be intepeted in multiple ways, which have mostly been cleared up now.

      Just because it's not 100% clear sometimes, and just because no one implements it correctly, doesn't mean it's not attempting define everything to the pixel.

      Which OOXML is not even vaguely trying to do.

      It's like if CSS defined things like:

      netscape-2.0-border-color-behavior: true|false;
      If true, color the borders as Netscape 2.0 did.

      --
      If corporations are people, aren't stockholders guilty of slavery?
  43. Re:MIcrosoft sucks. by TrekkieGod · · Score: 5, Insightful

    If you get to the point where you build up a company that can even consider garnering the term "monopoly", then get back to us...At that point, maybe, just maybe, you may come to thinking that you you earned what you got, and the government has no right to tell you how to run your business...

    Yeah. Because the person best suited to decide what a company should or should not be allowed to do are the people who own the company. Of course you're going to want to be completely unrestricted to mow down your competitors using whatever advantages you have if you are in a position to do so. What you're missing is that no one should be allowed to use unfair practices to do it. Some people think we should idolize the free market as some sort of religion. We don't like free market economy because it was given to us by the gods. We like it because it tends to result in better products and lower prices. That ceases to be true when you have a monopoly in the mix.

    That being said, I'm not really informed about any Microsoft specifics, so I'm not going to argue in favor or against any "federal laws" as it applies to them (or failed to apply to them). However, suggesting that only people who have built a company that holds a monopoly should be able to decide what is fair regulation isn't rational. It may even be that the current federal laws regarding monopolies may be unfair and in need of reform, but the fact remains that the existence of a set of laws to regulate businesses is necessary.

    --

    Warning: Opinions known to be heavily biased.

  44. Re:Suck it up by DavidTC · · Score: 0, Redundant

    Saying 'If X, then imitate the behavior of some other application in a way we're not specifying exactly', is not a specification, you idiot.

    --
    If corporations are people, aren't stockholders guilty of slavery?
  45. Re:MIcrosoft sucks. by tyme · · Score: 3, Insightful
    Brandybuck wrote:
    Things that are illegal for a monopoly are perfectly legit for a non-monopoly. It's a crazy law, but that's how it works. Microsoft broke no federal laws to *gain* their monopoly.


    Unfortunately, you are wrong on almost all counts:
    1. Section 1 of the Sherman Act (Restraints of Trade) applies to everyone, not just to monopolists. If Microsoft engaged in any restrains of trade, even before they were a monopoly (which doesn't have to mean 100% market share, as so many people seem to believe, it only means that the company must have the power to set prices in the market*), they would have violated Sherman 1.
    2. Section 2 of the Serman Act (Monopolization) makes it a crime to monopolize or attempt to monopolize. Hence, you can be held in violation of Sherman 2 for doing something that is otherwise perfectly legal, so long as you were trying to maintain or obtain a monopoly. Since (1) Microsoft is obviously a monopoly and (2) they have taken steps both to attain their monopoly position and to maintain it once they had it, they are clearly in violation of Sherman 2.

    The real problem, however, with the Sherman Act is that, in general, it can only be prosecuted by the Federal Trade Commission, and that is under the direct control of the executive power. Ever since the Regan administration, there has been little or no desire on the part of the FTC to persue anti-trust litigation.

    * Courts have generally used the rule that anyone with more that 70% market share obviously has monopoly power, and anyone with less than 20% obviously lacks it, but that between 20% and 70% requires and examination of facts and circumstances before declaring that someone has monopoly power.

    --
    just a ghost in the machine.
  46. Application responsible for converting? by Knutsi · · Score: 1

    This was a worrying, but good, article. I'm sure MS is a bit in a thight spot as well, if they really desire backwards compatibility (which is what they survive on in a way). But it would make more sence to make supporting legacy documents more optional.

    When I save a Word 2007 document to the old .doc format, it warns me that "minor loss of fidelty" may happen. Similarly, when opening a document, supporting waybackthen formats could be optional/plug-based, and the app rather warning that "minor loss of fidelity" may happen since the document was converted from an old source.

    Forcing every new app ever written to this standard to support diffuse behaviours from the good ol'days is just ridiculus. Besides, most of it appears to apply just to apperance.

  47. Documents outlive applications by Geof · · Score: 4, Insightful

    There's nothing wrong with saving in a file format that matches your internal representation, in fact, it's a darn good idea (see .ABW for AbiWord, .DOC for Word, .WPD for WordPerfect I would also wager is the same idea).

    Documents are worth far more than software, and they outlive the applications used to create them. See the comment to the original article - reading documents after 5, 20, 30, 100 years or more is not optional. You can pay the price of developing an independent format now, or you can pay the price of reverse engineering over and over again every time you change your internal representation.

    Repeated implementation limits future change and innovation. It's expensive: it likely costs more even for Microsoft. But they can afford it; their competitors may not be able to. Plus, Microsoft already has their first implementation.

    interoperability seems to work best when taken from the ground up - when working with another application's data structure of any complexity, you simply can't do a lossless roundtrip without losing before you've started.

    Perhaps so. But compare that cost to the cost I've just outlined. It is in the best interest of users and software developers (maybe even of Microsoft) to bite the bullet now, do the conversion once, and develop a clean format for the future.

    Maybe you have in mind an argument you're not making, but I don't see any sufficient basis for your broad contention that using a file format based on an internal representation is a "darn good idea". In specific cases, yes (e.g. where the cost of development time or effort are the most important factors). In general, I very much doubt it. That successful applications in the past have taken that approach is weak evidence. They were developed when the up-front cost of development in a time of rapid innovation, the loss of customer lock-in, and a lack of open-format competition where good business reasons for making such a choice - even if it was inferior technically, increased cost in the long term, and was bad for consumers. In today's climate of slower innovation, competition from open formats, and customers who are running into their own long-term interests, the situation is different.

    Which is not to say Microsoft's apparent attempt to set the rules of the game and throw sand in the gears of change is not in their interests, or that it will be unsuccessful.

    1. Re:Documents outlive applications by LizardKing · · Score: 2, Informative

      Documents are worth far more than software, and they outlive the applications used to create them. See the comment to the original article - reading documents after 5, 20, 30, 100 years or more is not optional.

      Which is why medical, legal and military records are often not held in word processor formats. For instance, the military records I have dealt with (NATO mostly) are held in SGML, conforming to carefully designed MIL DTD's that preserve structure rather than presentation. These files can be translated faithfully into HTML, PDF and so on without losing there meaning.

      Sadly, as MS Word has become all too pervasive, more and more documents are stuck in a format where you cannot reliably extract data or convert it. For instance, with the processing system for scientific journal articles I worked on a table of contents can't be generated from Word documents, as headings are usually just inline styling that isn't even consistent within a single document. Years ago, the journal publishers could insist on properly structured data (LaTeX for instance), but nowadays scientists are too lazy - all they care about is the cheap thrill of seeing their article in print, sod anyone who wants to index or cross reference the data for future use.

    2. Re:Documents outlive applications by dominator · · Score: 1
      Maybe you have in mind an argument you're not making, but I don't see any sufficient basis for your broad contention that using a file format based on an internal representation is a "darn good idea".


      He's not saying that the file format has to be an unreadable dump of the Word Processor's in-memory representation. Merely, the file format has to coincide with the application's view of the world. *Everything* that the application can do is (un)serializable to that format. It's like speaking in your native tongue. You're good at it.

      When you start using someone else's file format, your worldview is a bit changed. Maybe it supports fewer formatting options for lists. Or one of your idioms (say, a floating textbox frame) isn't directly translatable to their file format, so you have to "fake it" or otherwise find a compromise fall-back solution. For instance, I can conduct a reasonable conversation in (say) German, but English is my native language, and sometimes things will get lost, mis-interpreted, or dumbed-down in the translation.

      Avoiding dataloss is perhaps the best reason for having a native file format, at least for existing applications. Inter-operability is a darn good reason for supporting other people's file formats, especially "open" ones like ODF. Just don't think of it as your native tongue. Your application wasn't designed with that in mind.

      Dom Lachowicz, AbiWord developer
  48. Re:MIcrosoft sucks. by albalbo · · Score: 0, Offtopic

    Both "effect" and "affect" can be used as both verb and noun.

    Understanding their proper usage is a lot harder than saying "effect is a doing word!".

    --
    "Elmo knows where you live!" - The Simpsons
  49. Re:Suck it up by animaal · · Score: 5, Funny

    What's the deal with you people? I have seen engineers take apart the most difficult situations. You have the format in your hands. It's ugly and crappy, go figure. Just get it done and stop bitching. Why is everyone so lazy? Jeff, is that you? Haven't seen you much since you became a project manager. Congrats on getting the MBA!
  50. Re:MIcrosoft sucks. by ArsenneLupin · · Score: 3, Insightful

    If you get to the point where you build up a company that can even consider garnering the term "monopoly", then get back to us. Until then, you have no idea what you're talking about, especially when quoting arbitrary and esoteric "federal laws". Call me nuts, but if you ever got to that point, you'd might even get a crazy idea in your head that those "federal laws" that you are so damned proud of, are about as fair and just as our drug laws. At that point, maybe, just maybe, you may come to thinking that you you earned what you got, and the government has no right to tell you how to run your business that you started in your teens, and proceeded to build to make it one of the most successful companies in the history of capitalism.

    Until you get to that point, I suggest that you those "federal laws" out your ass, Mr. Ashcroft. I agree 100% with you. However, for fairness' sake, we should then abolish all those unjust business-hampering federal laws, including copyright and patent law.

    Oh, and also those so-called "computer misuse" laws. Indeed, if I want to set up a consultancy where I propose to convert customers ASP scripts to PHP I should be allowed to demo to my prospective customers in great graphical detail why ASP is so insecure, even if I don't yet have an existing business relationship. Why should I tolerate that the government tells me how I may and may not recruit new customers?

    Anything less would be one-sided and unfair.

  51. Even more lapses of judgment... by shadowmatter · · Score: 1

    You can view all the atrocities of OpenXML that he's blogged about here. Highlights include dumping bitmasks into XML as hexadecimal on a byte-by-byte basis, and an XML element for specifying whether the dates in the workbook start in 1904.

    I'm can't believe this became a ratified standard.

    "Let him who has understanding calculate the number of the beast, for the number is that of a standard; and its number is three hundred and seventy-six." Common-freaking-sense 13:16-18

    - shadowmatter

    1. Re:Even more lapses of judgment... by kahei · · Score: 1


      IMHO those are more serious problems. They're enough to make it be what I'd call a Long Ugly Hastily Written Standard, which somehow doesn't really surprise me.

      The thing the original article is freaking out about -- legacy compatibility flags -- isn't really an issue. The standard has to include the features offered by existing wps. Sometimes those features are undocumented, obscure, and almost totally forgotten. What do you do? Find the last remaining copy of the code, figure out exactly how WP4 buggily laid out small caps, express that all in English, and put it in the standard?? No. You put in a WP4SmallCapsCompatibility flag and if any vendor has customer that really really care about WP4 small caps, there's now a way to express in a document that WP4's behaviour should be emulated and the one vendor that cares can honor that flag.

      It's called 'common sense'. Now, for a standard that REALLY machine guns itself in the foot, try SVG. Or for that matter CSS -- how the W3C gets away with those I don't know. For that matter, even the .NET standard is more annoying in that it starts off 'first, do everything exactly like a Windows PE file, just to make things a tiny bit easier for the windows loader, even though .NET assemblies are basically nothing like PE files'.

      Yeah, ok, rambling again.

      --
      Whence? Hence. Whither? Thither.
  52. the real hitch - it never was clear by Erris · · Score: 4, Interesting

    The hitch here is that *not* having them means tons and tons of reverse engineering, and that's only after tracking down every release of every version of every MS Office ever.

    The real hitch, as the article hints, is that the releases are contradictory. For instance, the Mac version of small caps is different from others. This is part of the reason Word is so bloated and does not preserve printing type setting from one machine to the next.

    Ten years ago, a state agency I was working for was forced to move from Word Perfect to Word. Hundreds, if not thousands, of documents were painstakingly converted from one format to the other. The typesetting, which they had never had a problem with previously, was easily broken by moves from one machine to the other or by changing printers. That is the kind of thing that no program can account for - it was broken from then and can not be created correctly today. It's also probably the reason for all of the nebulous "guidance" sections that don't tell you anything other than to look at, and presumably measure, old printed examples. Not even M$ knows what it was really doing in the field. As I saw at the time, no two were alike.

    Of course, the time to get things right is not in your XML it's when you import the document. The author tells us this in so many words. The XML should be general enough to encompass any kind of typesetting. It is the importing program's task to figure out what the old format wanted things to look like. As the author points out, the spec does not do anything other create something impossible to follow. It's not going to magically make things look right no matter how hard they wish it would.

    --
    DMCA, Hollings, Palladium. What might have sounded like paranoia is now common sense.
    1. Re:the real hitch - it never was clear by Anonymous Coward · · Score: 0
      twitter, please read this carefully. Following this advice will make Slashdot a better place for everyone, including yourself.

      • As a representative of the Linux community, participate in mailing list and newsgroup discussions in a professional manner. Refrain from name-calling and use of vulgar language. Consider yourself a member of a virtual corporation with Mr. Torvalds as your Chief Executive Officer. Your words will either enhance or degrade the image the reader has of the Linux community.
      • Avoid hyperbole and unsubstantiated claims at all costs. It's unprofessional and will result in unproductive discussions.
      • A thoughtful, well-reasoned response to a posting will not only provide insight for your readers, but will also increase their respect for your knowledge and abilities.
      • Always remember that if you insult or are disrespectful to someone, their negative experience may be shared with many others. If you do offend someone, please try to make amends.
      • Focus on what Linux has to offer. There is no need to bash the competition. Linux is a good, solid product that stands on its own.
      • Respect the use of other operating systems. While Linux is a wonderful platform, it does not meet everyone's needs.
      • Refer to another product by its proper name. There's nothing to be gained by attempting to ridicule a company or its products by using "creative spelling". If we expect respect for Linux, we must respect other products.
      • Give credit where credit is due. Linux is just the kernel. Without the efforts of people involved with the GNU project , MIT, Berkeley and others too numerous to mention, the Linux kernel would not be very useful to most people.
      • Don't insist that Linux is the only answer for a particular application. Just as the Linux community cherishes the freedom that Linux provides them, Linux only solutions would deprive others of their freedom.
      • There will be cases where Linux is not the answer. Be the first to recognize this and offer another solution.

      From http://www.ibiblio.org/pub/linux/docs/HOWTO/Advoca cy

    2. Re:the real hitch - it never was clear by stg · · Score: 2, Informative

      I had a fun problem with a version of Word (for Windows 2.0, I think) many years back. Some friends came by to print a paper for a CS class, and the files they brought were made with the Brazilian Portuguese version of Word.

      I had the English version of Word. When I tried to print, I discovered (after a lot of pages, of course) that I had to fix the formatting because some of the formatting was translated... And not even logical stuff like accents - page breaks, footnotes, etc.

    3. Re:the real hitch - it never was clear by blincoln · · Score: 1

      The real hitch, as the article hints, is that the releases are contradictory. For instance, the Mac version of small caps is different from others.

      That's not that big of a deal. I've been reverse engineering some graphics and sound formats from my favourite videogame series (Legacy of Kain), and there are a lot of contradictions between different games in the series, or even the same game on different platforms.

      Sure, you end up with some semi-ugly code, but at long as you can confidently determine *which* version of the rules to use, it works. In this case, it's the figuring out the rules part that I think would suck. Decoding parts of games I like is fun. Figuring out pixel spacing for obscure obsolete versions of Word isn't.

      The XML should be general enough to encompass any kind of typesetting. It is the importing program's task to figure out what the old format wanted things to look like.

      No argument there. The "spec" MS has published is stupid. But I do think if they published an accurate one, there are people willing to build giant switch statements into OO.o or whatever that would follow the published rules for interpreting all the different quirks of the MS file format. Not for the XML files, but to import .doc or whatever and convert it to a sane format.

      --
      "...always new atoms but always doing the same dance, remembering what the dance was yesterday." -Richard Feynman
  53. OpenXML to help with .doc handling? by Anonymous Coward · · Score: 0

    Has anyone investigated whether it would be possible to improve existing doc/xls support in OOo etc by using the details exposed in the OpenXML spec?

  54. Re:Suck it up by mwvdlee · · Score: 1

    Ever heard of the saying "good programmers are lazy programmers"?

    Yes, we could all duplicate the significant effort of reverse engineering the missing parts of the standard (you still have a working copy of WP5 around somewhere?). Or we could just save everybody a lot of time and money in the future by making a one-time small investment of fixing the standard now.

    --
    Slashdot social media options: AIM, ICQ, Yahoo, Jabber and Mobile Text. Why no MySpace?
  55. Blind leading the blind by frisket · · Score: 2, Interesting
    It's instructive to observe the panic-ridden frenzy with which Microsoft have approached the business of using XML as a file format. The marketing influence is all too plain to see, with the result that they feel an inner compulsion to preserve the appearance of the document at all costs, sacrificing all logic and common-sense to do it.

    OOo did the same, but with greater elegance and less haste because they were ahead of the field. Corel screwed it up with WordPerfect by keeping their stylesheet format proprietary so that transfer between WP document code and XML was made as hard as possible (a Class A blunder, given that their XML editor is actually quite good). AbiWord makes a good job of saving DocBook XML, but it's not trying to pretend it's reimportable; it screws up LaTeX formidably, though, by trying to pretend that it absolutely has to preserve line-length and font-size, which is evidence of the same neurotic attitude as Microsoft.

    The problem in all cases is not that the assorted authors and coders don't understand XML (although some of them clearly failed that test too), but that they don't understand documents. This is particularly true at Microsoft, where leaders such as Jean Paoli have been proselytizing XML for years. They still think a document is a jumble of letters; they have no idea of structure, and the DOM is simply laughable as a non-model of a document. Microsoft's particular problem with XML is that they came to it too late, and viewed it as a way of storing data, not text...indeed to this day many XML users, trained with Microsoft blinkers on, are unaware that XML can be used for normal text documents.

    With this level of ignorance surrounding Microsoft, it's hardly unexpected that they should blunder so badly.

  56. Re:Suck it up by Anonymous Coward · · Score: 0

    ROTFL!!!

  57. Who's Scared? by Erris · · Score: 3, Informative

    ... immediately render billions of existing MSO documents obsolete if you could get govt to mandate ODF exclusively. And the bonus is that such govt mandate would render any and all features not supported by ODF (i.e. not supported by OO.o) irrelevant.

    Eh? Isn't that why M$ made this supposedly "open" format? Because governments were tired of paying through the nose for secret formats that broke between versions? The purpose of an archive is to read it later. Governments and companies have already moved to pdf for archives. They are going to move their working documents to reasonable formats next.

    But MS opened their own format, thus leveling the playing field so that you must again compete on features ...

    You must not have read the 6000 page spec, which includes lots of sections like this:

    Guidance: To faithfully replicate this behavior, applications must imitate the behavior of that application, which involves many possible behaviors and cannot be faithfully placed into narrative for this Office Open XML Standard. If applications wish to match this behavior, they must utilize and duplicate the output of those applications. It is recommended that applications not intentionally replicate this behavior as it was deprecated due to issues with its output, and is maintained only for compatibility with existing documents from that application. end guidance

    That's neither open, nor a standard.

    Microsoft is hoping people believe what you say, but everyone knows better. Shit like OOXML this only proves that they have not changed. It's just another, more elaborate and more expensive lie. Even the name, by using "OO" is intentionally confusing. The New Office is everything the old Office was and always will be. Vista and Office 2007 are non starters.

    --
    DMCA, Hollings, Palladium. What might have sounded like paranoia is now common sense.
    1. Re:Who's Scared? by Anonymous Coward · · Score: 0
      twitter, please read this carefully. Following this advice will make Slashdot a better place for everyone, including yourself.

      • As a representative of the Linux community, participate in mailing list and newsgroup discussions in a professional manner. Refrain from name-calling and use of vulgar language. Consider yourself a member of a virtual corporation with Mr. Torvalds as your Chief Executive Officer. Your words will either enhance or degrade the image the reader has of the Linux community.
      • Avoid hyperbole and unsubstantiated claims at all costs. It's unprofessional and will result in unproductive discussions.
      • A thoughtful, well-reasoned response to a posting will not only provide insight for your readers, but will also increase their respect for your knowledge and abilities.
      • Always remember that if you insult or are disrespectful to someone, their negative experience may be shared with many others. If you do offend someone, please try to make amends.
      • Focus on what Linux has to offer. There is no need to bash the competition. Linux is a good, solid product that stands on its own.
      • Respect the use of other operating systems. While Linux is a wonderful platform, it does not meet everyone's needs.
      • Refer to another product by its proper name. There's nothing to be gained by attempting to ridicule a company or its products by using "creative spelling". If we expect respect for Linux, we must respect other products.
      • Give credit where credit is due. Linux is just the kernel. Without the efforts of people involved with the GNU project , MIT, Berkeley and others too numerous to mention, the Linux kernel would not be very useful to most people.
      • Don't insist that Linux is the only answer for a particular application. Just as the Linux community cherishes the freedom that Linux provides them, Linux only solutions would deprive others of their freedom.
      • There will be cases where Linux is not the answer. Be the first to recognize this and offer another solution.

      From http://www.ibiblio.org/pub/linux/docs/HOWTO/Advoca cy

    2. Re:Who's Scared? by __aaavgi4732 · · Score: 1

      I think you're missing the point of this specification. Neither OOXML or ODF are presentation specifications, they are content specifications. Do either explain how to rasterize a specified font? Do either explain the algorithm that dithers an embedded image? They do not. Why? Because those are details of presentation, not content.

      I am not traditionally an MS-backer but if in the same situation, I would probably do exactly what they've apparently done. Their only obligation is to explain the semantics of the tag. They are under no obligation to explain how every pixel is rendered to the screen/printer/plotter -- if they were, forget 6000 pages, it would probably be less verbose to just release the source code for MSWord. God help you trying to grok that God-forsaken mess...

      And finally, even if MS were to go through the trouble of explaining the precise side-effects of every one of these tags, you'd still have to write the code that reproduces it. In the end, you'd very likely end up doing what you'd have to do otherwise -- run the legacy version of Word (or the current MSWord with that compatibility flag enabled) and see what it does. Frankly, it's easier.

  58. Do backward compatibility in the converter by rjungbeck · · Score: 2, Insightful

    Where is the problem in doing the conversion (for the legacy features) in the converter, so that the new format is free from this bloat? OK, its harder to write the converter (which has to implement this old behaviors), but its Microsoft who wants to have the backward compatibility. So it only needs to be done once.

    1. Re:Do backward compatibility in the converter by spitzak · · Score: 1

      The problem with that solution is that it will allow software other than MSWord to read the saved documents correctly by making the spec simple enough to implement. Though not in the ISO submitted documentation, "make sure it is impossible for other software to read this correctly" was one of the primary design criteria, so the solution you propose was eliminated immediately.

  59. purism as the enemy of progress by gjuk · · Score: 1

    As often, purism is the enemy of progress here. Whilst it'd be great to be able to render, faithfully, every detail of any legacy document - it's an unnecessary and unrealistic constraint. One day, Microsoft themselves will choose to drop support for WPx or WW8 etc. They will. Really, they will. For owners of documents whose only record is held in proprietary formats - that will happen one day. Might as well happen with the adoption of a standard which prevents it happening again. Let's face it - PC's no longer ship with 5.25 inch floppies. Try opening an EBCDIC WordStar document in anything now; or a Tasword III document. Legal documents usually specify in the preamble that the layout is purely for ease of reading and of no legal significance. Even old photocopies and faxes are usually mashed in some way. To be honest - the inability of many earlier versions of Word to render correctly on different printers (even making it dificult to use A4 in the UK when Word insists on US Letter) was much more of an issue for many of us than the lack of WP support. At the end of the day, if the document retains the right characters and numbers in the right order, it meets the real needs of users. Let's make it easier to open up the document market by being realistic; not close it down through artificial maintenance of unnecessary standards.

  60. Then get the customer to supply it by Anonymous Coward · · Score: 0

    Either

    a) They have it so why not let you see what you're supposed to be working to - doesn't cost them anything

    or

    b) They don't have it and so how will they know that you've followed it or not?

    1. Re:Then get the customer to supply it by Anonymous Coward · · Score: 1, Interesting
      They have it so why not let you see what you're supposed to be working to - doesn't cost them anything

      Because they won't be inconvenienced by 10 competing vendors fighting over who gets to study their copy of the standards. Instead, they'll just pick a vendor who has their own copy.

    2. Re:Then get the customer to supply it by BillGatesLoveChild · · Score: 1

      > Because they won't be inconvenienced by 10 competing vendors fighting over who gets to study their copy of the standards. Instead, they'll just pick a vendor who has their own copy. That's right. And that vendor is usually a multinational! ;-)

  61. Re:MIcrosoft sucks. by infofc · · Score: 2, Insightful

    My mind must be failing. I seem to recall that a free market is based on fairly competing businesses, hence no monopoly can be tolerated. We allow monopolies to form and exist as long as competitors have a chance to emerge.

  62. Re:MIcrosoft sucks. by hachete · · Score: 4, Insightful

    Yes, they got into trouble for bundling but it misses the point every time. The secret sauce that Microsoft uses is to strong-arm the OEMs into bundling windows with PCs, espeicially for consumers. I'm also thinking that the Windows Tax is levied even if you buy Linux on a Dell. This is the lynch-pin of Microsoft domination, without it all their other strategies whither on the vine. Without bundling of windows with new pcs, the bundling of IE (and all the other sofware), the resistance against inter-operability, the mysterious file formats etc wither on the vine. I've been disappointed that *none of the investigations I've read about have gone after the OEM-Microsoft link. Break that, and you'll have a free-market again.

    I think the Office XML format style is a play straight out of IBM's hand-book: make the standard complex and incomprehensible, and the little players - that's you - will find it hard to compete. In a way, that's a good sign: Microsoft is now lumbering into middle-age, hoist on their own evermore complex petard.

    The other thing about middle-age is that every little technological step away from their established base-line is treated as a revolution. In reality, it's no such thing, just a small stepping stone to shouting "pesky kids. Get off my lawn." Or maybe they've reached that stage already.

    --
    Patriotism is a virtue of the vicious
  63. binary blob RAS? by ifknot · · Score: 1

    isn't that like saying binary binary large object?

    bit like pin number - personal identification number number

    sorry just bugs me

    http://en.wikipedia.org/wiki/RAS_syndrome

    --
    we are all cosmic nuclear waste
  64. Seems fair enough by istartedi · · Score: 1

    If you were faced with output from a 15 year old program, what would you do? 15 years? In software, that's an eternity. These tags are essentially saying "here is where this old crap used to be". How many people are actually using these programs? Maintaining documents in the old format? I defy any of you out there in Linux-land to say you wouldn't take the same approach under the same set of circumstances. Actually, Linux people would probably just say "it may not open old documents properly, but that's OK because you have the source". Really not much better.

    --
    For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
    1. Re:Seems fair enough by rohan972 · · Score: 1

      How many people are actually using these programs? Maintaining documents in the old format? I defy any of you out there in Linux-land to say you wouldn't take the same approach under the same set of circumstances.

      It's ok to take this approach writing an application. It's not ok in writing a standard. The difference is that no-one other than MS can now write an application that is "OpenXML" compatible, which makes a mockery of having it as a standard. Your application can have any features you want, but a standard (by definition) must be able to be implemented by others.

      Actually, Linux people would probably just say "it may not open old documents properly, but that's OK because you have the source".

      Here is the focus of your misunderstanding. You are obviously refering to an application, as file format specifications do not open documents. The article is refering to a file format specification published by MS, making the point that it is not adequate to be used as a standard. It is saying nothing about the quality/adequacy of MS software products.

    2. Re:Seems fair enough by spitzak · · Score: 1

      No, I would pop up a warning that says "SOME FORMATTING MAY BE LOST", just like Microsoft does with EVERY other format it can save to (whether or not anything will be lost). Now maybe, having access to the source code, I would try to be nice and CONVERT (look up the word "convert" if it is foreign to you) to the new format, if it was possible. But that is not important. I CERTAINLY would not put a marker in the program that says "hey, YOU convert for me, as I don't want to!".

      What they are doing is indefensible, so don't even try.

    3. Re:Seems fair enough by John+Hasler · · Score: 1

      > How many people are actually using these programs?

      Doesn't matter. If your appilcation does not support these tags it does not conform to the standard and Microsoft will say so, loudly.

      --
      Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
  65. Re:MIcrosoft sucks. by theLOUDroom · · Score: 1

    Or maybe I, too, can post unprovable, untestable anti-Microsoft conjecture to slashdot and get modded up?

    Mircosoft has been convicted TWICE and has yet to face significant penalties. This is a statement of FACT.

    --
    Life is too short to proofread.
  66. So what is WP small caps? by Anonymous Coward · · Score: 1, Insightful

    If it isn't specified, how can the format be a specification?

  67. user expectation by r00t · · Score: 1

    To normal non-nerd users and most nerds as well, a document is a jumble of letters.

    I highlight text. I click the "B" button to make my text bold. I don't screw with styles.

    Sorry to burst your bubble, dear Holy Priest Of The Most Highest XML.

    1. Re:user expectation by kryten_nl · · Score: 1

      I highlight text. I click the "B" button to make my text bold. See, that's why your letters to the editor about the "FRIKKIN YUGE ALIEN-GOVERMANT CONSPIRACY" never get printed in the local paper.....
      --
      For the perfect anti-Unix, write an OS that thinks it knows what you're doing better than you do and let it be wrong.
    2. Re:user expectation by NickFortune · · Score: 1
      To normal non-nerd users and most nerds as well, a document is a jumble of letters.
      ... and you know, that comment makes perfect sense, if only you assume that this is nothing more than a "open-office-is-better-than-ms-word" dicksize war.

      Trouble is, the debate is about open formats not about applications. So it doesn't make the least little bit of difference what the user sees a document as being, because this isn't about interfaces, it's about file formats. And outside of Microsoft if not within, we tend to expect developers working on office suites to have a slightly more sophisticated conception of a document than "a jumble of letters".

      Sorry to burst your bubble, dear Holy Priest Of The Most Highest XML.
      I think you'll find you missed. Feel free to try again.
      --
      Don't let THEM immanentize the Eschaton!
  68. Re:MIcrosoft sucks. by CAPSLOCK2000 · · Score: 1

    Let's compare it to an all time favorite: Guns.

    You don't have to break a law to get a gun. But as soon as you start using it you'd better be very carefull. Except for a few very well defined cases (sport, hunting, self-defense), using your gun is illegal.

    Even in the cases named above, using your gun in the wrong way will send you to jail.

  69. Re:MIcrosoft sucks. by Mo+Bedda · · Score: 1

    Things that are illegal for a monopoly are perfectly legit for a non-monopoly. It's a crazy law, but that's how it works. Microsoft broke no federal laws to *gain* their monopoly.

    I don't know when their monopoly officially started, but they stole technology from STAC pretty early ('91-'92). As we all know, IP law applies to just fine to non-monopolies, and Microsoft certainly broke those laws. Last time I checked, patent law was federal law. So in short, history says you are incorrect.

  70. Re:Suck it up by tehcyder · · Score: 1
    Ever heard of the saying "good programmers are lazy programmers"?
    Does this mean that lazy programmers are good programmers?

    If so, I must be a fucking programming whizz, as I can't ever be bothered to do more than a quick "Hello World" before I claim full knowledge of that language on my CV, then go and get drunk on meths for a week.

    --
    To have a right to do a thing is not at all the same as to be right in doing it
  71. Guillaume Portes = Bill Gates by tendays · · Score: 2, Interesting

    I don't know how many of you noticed: The fictional name "Guillaume Portes" is actually a literal translation of "Bill Gates" in French ...

    1. Re:Guillaume Portes = Bill Gates by Thumper_SVX · · Score: 1

      Brilliant. I didn't even notice this (and I spent several months working in Paris). The irony is the name isn't fictional... a friend of mine has exactly that name and he's going to get such a load of crap from me about this when I see him later.

      Irony is... he's an avid Mac user. :)

  72. OOXML's Origin Is Not The Problem by NickFortune · · Score: 3, Interesting
    ODF is a nice idea in theory, but really, it's a similar situation (OpenOffice.Org internal dataformat jammed into a standard, so designed with OO.o in mind by necessity)
    The ODF format must necessarily describe the structure and layout of an office document. There's no need for it to reflect the internal data structures of any specific application, except to the extent that they too describe office documents.

    OOXML includes data elements that should be part of internal import routines rather than being enshrined in the document format, and it includes elements that are not specified except by reference to applications for which no public specs exist. This is the problem, not the fact that OOXML is derived from MS Office file formats.

    RTF. It may not get press attention, but it's actually a fairly well-documented standard, has been working as an interchange format for years, and yet is designed with enough expandability that it's still useful with the kinds of documents produced today. It's a true de-facto standard.
    Well, I was a big fan of RTF at one time. But a few years back I found that documents with any kind of formatting more complex than paragraph+justification+font just wasn't working between MS Office and back. I don't know if this was because the format couldn't cope, or because of faulty implementations. In either case, it led me to give up on RTF.

    In any event, to be a replacement, RTF would need to work for spreadsheets and presentations at a minimum - something I don't think there's a lot of support for in the current RTF specification. We'd also lose the benefits of an XML based format, which given the amount of work on the seamless integration of XML documents into databases, web services and other data management applications means losing a lot of functionality.

    for those who really want interoperability, RTF is the way to go with today's software
    Interoperability is only part of the problem. We also want a spec that can be fully and freely implemented by anyone, which isn't under the control of any single vendor.We want a format to which we can entrust documents, knowing that in twenty years time there will be an application capable of reading them.

    an unnecessary dichotomy is drawn between OpenXML and ODF with regard to their design goals - both are repurposed native formats for a single application.
    I don't know what you mean by native in this case, but the repurposing of OOXML isn't the problem. It's one of size and obfuscation, and as TFA points out specification by reference to closed formats and the behaviour of extinct proprietary software. These are non trivial problems with OOXML which are not (to the best of knowledge) found in ODF.

    There's nothing wrong with ODF. Re-creating it based on the non-XML RTF would be a waste of time and effort.

    --
    Don't let THEM immanentize the Eschaton!
  73. Re:MIcrosoft sucks. by 99BottlesOfBeerInMyF · · Score: 2, Insightful

    Things that are illegal for a monopoly are perfectly legit for a non-monopoly. It's a crazy law, but that's how it works.

    I think your logic is more than a little broken. Monopolies have a great deal of power that other's don't have. They can undermine capitalism in a market and destroy innovation in entire industries. They can spread causing that damage to other markets. Think of it like this, people piloting airplanes aren't allowed to drink or step outside for a cigar, while those behaviors are perfectly legal for people who aren't piloting planes. Isn't that crazy?

  74. Re:MIcrosoft sucks. by Gr8Apes · · Score: 2, Interesting

    They did. It was "resolved" by being disallowed. There is no per machine "MS tax" anymore. That fell victim in teh IBM suit, I believe. (So many suits, so long ago, so many beers.... ahh - that explains it!)

    --
    The cesspool just got a check and balance.
  75. Re:MIcrosoft sucks. by 99BottlesOfBeerInMyF · · Score: 1

    You forgot those crazy laws that grant human rights to corporations in the first place.

  76. Re:Suck it up by mwvdlee · · Score: 1

    I'm pretty sure you're being sarcastic, but just to be safe:

    If all dogs are animals, are all animals dogs?

    --
    Slashdot social media options: AIM, ICQ, Yahoo, Jabber and Mobile Text. Why no MySpace?
  77. mac excel by Anonymous Coward · · Score: 0

    XML element for specifying whether the dates in the workbook start in 1904.

    I know that one. I mucked around in the excel 97 format for a while. Its for mac excel, not sure what the version was. I don't know why it was 1904, instead of 1900 like the Windows version. The windows version counts Feb 29, 1900 as a real day (to maintian compatibility with Lotus 123 who apparently screwed it up), so maybe they purposely set the date forward with the mac version to avoid the issue.

    1. Re:mac excel by stefanb · · Score: 1
      XML element for specifying whether the dates in the workbook start in 1904.
      I know that one. I mucked around in the excel 97 format for a while. Its for mac excel, not sure what the version was

      Up until Mac OS 9, dates were handled as days from January 1st, 1904.

  78. Open to what by Anonymous Coward · · Score: 0

    The great misconception is that OpenXML is actually Open. I great ploy by MS. The fact is if Microsoft wanted a open standard they would have supported ODF. But they dont, they want to be able to control it and be able to re-define it at will without any outside consensus. Microsoft's OpenXML license is incompatible with any open source license and does not allow everyone equal access like ODF.

    The use of the term Open is misleading, OpenXML is just another propriatry format of Microsoft. So what is Open about OpenXML?

  79. That's me! by Anonymous Coward · · Score: 0

    > [Job requirements like]
    > * 5 years experience with Java, J2EE and web development, PHP, XSLT
    > * Fluency in French and Corsican
    > * Experience with the Llama farming industry
    > * Mole on left shoulder
    > * Sister named Bridgette

    That's me! Except that:
    * I drink Java for 10 years, experiemented the drugs PHP, XSLT, J2EE, and collect spiders for 20 years so I'm very good at web development.
    * am fluent in elivish and Klingon (close enough)
    * my experience tells me that something called Llama farming industry exists
    * I was a mole for my company and I constantly looked over people's left shoulder
    * Went to Sister Bridgette Grade School

    So expect for these minor variations, I'm perfect for the job.

    Where can I sign up?

  80. Looks the same--who cares! by Anonymous Coward · · Score: 2, Interesting
    'so not only must an interoperable OOXML implementation first acquire and reverse-engineer a 14-year old version of Microsoft Word, it must also do the same thing with a 16-year old version of WordPerfect.'"
    Someone needs to tell every developer of word processing and page layout software on the planet to abandon the 'must look the same' obsession described by the above. Why worry about making content in application B look like content in Application A? I create books out of Word files submitted by several people. The last thing I want is all the inconsisent formatting from each of them to control a book's look.

    Named styles is the answer. If a paragraph is body text, call it that. If it's an inset quote, call it a quote. If a term is in italics, label it as italicized style not Times Italic 12 point. But don't get all hung up in the distinctions between Times Roman and Times New Roman. The purpose of XML is to define what something is. Not what someone thought it ought to look like on Tuesday three weeks ago.

    Ditto transfers between applications. Why is there so much effort devoted to importing every little odd quirk of Word into InDesign as if the quirk mattered. Bring in the text tagged with what it is and let InDesign determine what it looks like. InDesign is far more powerful and predictable than Word anyway.

    It is, of course, the Microsoft's advantage for everyone to define RTF and now OpenXML as the "standard" and obsess over the sort of things described above. But there's no sane reason for this obsession to exist. If you want a text to always look the same, use PDF. If you want a document to look good, make it look good in the application you're using. Don't try to make that application retain the 'sorta-looks-ok" feel of another application. That's too much work for too little result. It's why all too many ordinary users shrug their hands and buy Word rather than hassle with import quirkiness.

    1. Re:Looks the same--who cares! by hxnwix · · Score: 1

      "It's why all too many ordinary users shrug their hands and buy Word rather than hassle with import quirkiness."

      I may be misreading your post, but this last bit suggests that users do care about formatting. Why else would the import quirkiness bother them? Consistent formatting, fonts and precise layout do matter, and using a static document format is clearly not the answer.

      LaTeX is!

  81. Typical FUD - To support OOXML 'fully'... by Assmasher · · Score: 1

    ...you would indeed need to support everything that is covered by the specification, duh, right? Well, how likely is anyone to even want to support OOXML fully except for Microsoft?

    If I want to write a plugin for an open source text editor so that people can exchange word 2000 and later documents with my own editor I would certainly concern myself with supporting the aspects of OOXML which denote Word 95 emulation.

    --
    Loading...
    1. Re:Typical FUD - To support OOXML 'fully'... by 99BottlesOfBeerInMyF · · Score: 1

      If I want to write a plugin for an open source text editor so that people can exchange word 2000 and later documents with my own editor I would certainly concern myself with supporting the aspects of OOXML which denote Word 95 emulation.

      Ahh, but assume the market standardizes on OOXML for word processing, which is what MS is trying to get governments to buy into. So you've written a text editor that supports that format and conversion from some old MS Word formats. The next version of MS word that comes out then relies upon part of this spec that is part of Word95. Suddenly you can no longer interoperate with standards compliant word processors, including Word because you did not implement this aspect of the spec. You're out of compliance with government contracts, not MS, since they are following the specification and you're not. Since Word still has 90% of the market, no one can just switch to something else easily.

      Thus, MS locks out your word processor and other competitors and maintains their lock-in on the file format, until people realize that this specification was not really open and did not really bring the advantages of an open format, despite MS claiming it does. And having heard all this before, governments and companies are even less likely to try again and assume the new standard is actually, really a standard and not another trojan horse.

      This has been MS's strategy numerous times. Customers want the advantages of open code and open standards and MS promises them that if they stay on board with them then they will deliver open standards and open code, or something just as good. And because most people don't understand the mechanism by which open standards and open code delivers the advantages they want, they assume that what MS is offering will provide that. Look at MS's "shared source" initiative. It lets customers look at the code just like open source, but prevents those users from being able to contribute real changes, branch, make workable fixes, avoid lock-in to one vendor for improvements, or get a pile of free code from other vendors with similar issues. It is open source with all the real advantages stripped away and people buy into it because you can see the code. It won them a number of sales from places that had heard of the advantages of open source code, but did not truly understand anything about how it worked.

      OpenXML is the exact same ploy. It is an pseudo-open pseudo-standard designed by MS to provide the minimum number of advantages to consumers and the maximum advantages to MS, while still seeming to be sort of like OpenDocument. It exists only to steal sales from decision makers in companies and organizations that have heard of OpenDocument or who have been advised by consultants or internal experts to move to OpenDocument, but don't understand why and can be sweet talked by MS salespeople. Don't buy into it. Only a real open standard will bring the benefits of an open standard and OpenXML is NOT open or a standard.

    2. Re:Typical FUD - To support OOXML 'fully'... by rohan972 · · Score: 1

      Well, how likely is anyone to even want to support OOXML fully except for Microsoft?

      Well, it depends on if any governments and corporations require conformance to the standard for their software purchases. If it is a requirement of selling software, I guess a few people might be a bit interested in being able to claim full compliance.

    3. Re:Typical FUD - To support OOXML 'fully'... by Assmasher · · Score: 1

      First, this is NOT why Microsoft has produced the format. It has produced the format so that it can claim that it uses an 'open' standard so that it doesn't find itself forced into support for ODF.

      Second, you go the paranoid route (which is ok, M$ has done some crazy sh** in its time) and presume that Microsoft would actually make a new version of word's file format dependent upon a construct already marked 'deprecated' from word 95? Come on, you and I can both come up with crazy paranoia ideas that are better than that one ;). Are you a hollywood writer? ;)

      --
      Loading...
    4. Re:Typical FUD - To support OOXML 'fully'... by 99BottlesOfBeerInMyF · · Score: 1

      First, this is NOT why Microsoft has produced the format. It has produced the format so that it can claim that it uses an 'open' standard so that it doesn't find itself forced into support for ODF.

      Which is pretty much what I said.

      ...presume that Microsoft would actually make a new version of word's file format dependent upon a construct already marked 'deprecated' from word 95? Come on, you and I can both come up with crazy paranoia ideas that are better than that one ;)

      This is just one of many possibilities. Last I heard they still had both patent encumbrance, and a license that only allowed the current version of the spec to be used, theoretically meaning they could legally stop any other word processor that implemented OpenXML v1, v2, and v3, from providing backwards compatibility with v1 and v2 (when and if such versions are released). More likely, they will try to glom on some proprietary encryption as part of a "secure document" scheme and then sue the DMCA to make interoperability illegal. They have lots of options. The point being, this is not actually an open standard and they have many ways to prevent users from getting the same benefits OpenDocument will provide.

  82. Re:MIcrosoft sucks. by Overly+Critical+Guy · · Score: 1
    What they got in trouble for was actually using their monopoly to get into other markets - i.e. bundling IE with the OS meant that they used their OS monopoly to get into the browser market.

    That was just one of the things they were doing. The big thing was threatening OEMs with higher Windows license fees or even license revocation if they shipped competing products on their computers. Without Windows, these companies would die, so they were forced to help Microsoft keep its products front and center in spite of superior competitors. One could argue Microsoft actually held back computing by several years because better alternatives weren't allowed to compete in the marketplace, forever altering the course of computing history for the worse.
    --
    "Sufferin' succotash."
  83. No, it's eternal... by Svartalf · · Score: 1

    The Wheel of Time turns and Jordan cranks out another book in the series. Like the conflict in the Wheel of Time, it'll never end...

    --
    I am not merely a "consumer" or a "taxpayer". I am a Citizen of the State of Texas
  84. Re:MIcrosoft sucks. by Overly+Critical+Guy · · Score: 1

    Right, and you also can't criticize movies until you've made your own Lord of the Rings trilogy, you bastards!

    Meanwhile, Microsoft FUCKED over OEMs by forcing them to drop competing products over threats of raising Windows licensing fees or outright revocation. Microsoft succeeded in keeping competitors from ever competing on the market, holding back computing by several years.

    --
    "Sufferin' succotash."
  85. She got as far as the third? by Svartalf · · Score: 1

    I had to quit at trying to read the second. The first book in the series was okay- decent enough to rate me being
    interested in seeing where he was going to take the arc in the second book. He kind of lost and bored me in the
    second book- so I gave up on it.

    --
    I am not merely a "consumer" or a "taxpayer". I am a Citizen of the State of Texas
    1. Re:She got as far as the third? by h4rm0ny · · Score: 2, Insightful


      I read the back cover. Looked derivative. Put it back.

      --

      Aide-toi, le Ciel t'aidera - Jeanne D'Arc.
    2. Re:She got as far as the third? by 644bd346996 · · Score: 1

      Wheel of Time does have a lot of similarities to Dune, on the surface. But everybody I know who has read both agrees that the stories they tell are fundamentally different, and that Dune is more interesting.

  86. Re:Suck it up by Anonymous Coward · · Score: 0

    I think you guys are taking a very narrow view of what a specification is because were talking about MS.

  87. Re:MIcrosoft sucks. by I'm+Don+Giovanni · · Score: 1

    Sorry, but slashdot cult hero Judge Jackson determined in his infallible Findings of Fact that MS didn't violate antitrust laws to gain a monopoly.

    Also, since Apple controls over 70% of the (legal) online music biz, and have made agreements with certain content providers to *exclusively* provide certain content only through iTMS, is not Apple restraining trade, and thus in violation of antitrust law? Even if there's some techincality that makes Apple's dealings legal (of course, we don't know if they're legal or not; any judge can rule anything), the spirit is the same. The point being, MS is no more evil than Apple or anyone else. Other companies in the same position as MS would have done the same or worse (Apple almost certainly would have been much more draconian).

    --
    -- "I never gave these stories much credence." - HAL 9000
  88. MS is lazy! by Bob-taro · · Score: 1

    I doubt the spec was written specifically to make rendering the xml difficult, but rather to make creating it easy. They probably don't know how to explain the behavior of these old word processors because they were buggy and inconsistent. So rather than have to figure out how small Word 5.0 would render small caps in a given situation, they can just tack on a "do it like Word 5.0" attribute. Much simpler!

    --
    Prov 9:8 Do not rebuke mockers or they will hate you; rebuke the wise and they will love you.
  89. Re:MIcrosoft sucks. by I'm+Don+Giovanni · · Score: 2, Insightful

    "I think your logic is more than a little broken. Monopolies have a great deal of power that other's don't have. They can undermine capitalism in a market and destroy innovation in entire industries. They can spread causing that damage to other markets. Think of it like this, people piloting airplanes aren't allowed to drink or step outside for a cigar, while those behaviors are perfectly legal for people who aren't piloting planes. Isn't that crazy?"

    A pilot knows that he's drinking at the time that he's doing it, and knows that it's against the law to do so while flying.

    But a company doesn't know that it has a monopoly until some judge declares so. So while a company is engaging in normal business activity, some judge years later can rule that the company had a monopoly years ago, and rule that those normal business activies were therefore illegal. So, in order for a company to be sure to not run afoul of antitrust law, the company has to second guess every thing it does on the off-chance that at some point in the future, a judge *might* rule that the company had a monopoly at some point in the past. Well, you cannot run a company that way. It's best to engage in normal business practice, and if some judge rules in the future that it was illegal because he declares that you had a monopoly at the time, then deal with it at that point. And doing that would not be "evil". Second guessing whether you can engage in normal business practice or not in order to avoid what a judge might say in the future is not prudent.

    Taking MS, specifically, at what point, what day and date, did they knowingly acheive monopoly status in the "desktop OSes for intel CPU" market? IBM was selling and heavily advertising OS/2 throughout the 90's. So when should MS have thought to itself, "OK, now I have a monopoly, so I'll no longer offer OEM discounts"? Even when OS/2 faltered, MS subsidized Apple, and many said that part of the motivation was to ensure that MS did NOT have a monopoly (everyone (certainly Mac advocates) assumed that Mac OS and Windows were competitors; MS didn't imagine that a judge would rule that Mac OS isn't even in the same market). So it would seem that MS never thought they had a monopoly, and even took steps to keep it that way.

    Take Apple or Google, for other examples. Is it really so unimaginable that a judge could rule in the future that Apple or Google have monopolies *today* in mp3 players or online music (in the case of Apple) or web search advertising (in the case of Google)? In which case the same judge could rule that things Apple and Google are doing today are illegal? In such a case, would you demonize Apple or Google as "evil"? Should Apple and Google curtail their normal business activity because a judge in the future *might* rule this way? Do you see what I'm getting at?

    BTW, this is why antitrust law is so screwed up. IMO, you should be able to engage in normal business activity until a judge officially rules you have a monopoly. Once that happens, then you can alter your business activities accordingly. But you should not be punished for things you did before you were officially declared to enjoy monopoly status in a particular market, nor should you be demonized for it. This is much cleaner since everyone would know upfront what standard they're being judged against. No second guessing what would be normal business practice, no subsidizing competitors to make sure they stay in business so that you don't get a monopoly, etc.

    --
    -- "I never gave these stories much credence." - HAL 9000
  90. Re:MIcrosoft sucks. by jav1231 · · Score: 1

    No the OEM agreements were NOT disallowed. That, to me, was such an easy thing to do to curb their monopoly that didn't happen. Goto Dell and order an OS-less PC. I'll wait...

  91. Also found in the spec by snowwrestler · · Score: 1

    2.15.3.78 providePonyUponWish (Emulate Word 5.x for the Macintosh Pony Provision)

                    This element specifies that applications shall emulate the behavior of a previously existing word processing application (WordPerfect 5.x) when providing a pony upon a user wishing it, using the userWish element (2.3.1.82). This emulation typically results in a pony which is reduced from its normal size and suffers from chronic diarrhea.

                    [Guidance: To faithfully replicate this behavior, applications must imitate the behavior of that application, which involves many possible behaviors and cannot be faithfully placed into narrative for this Office Open XML Standard. If applications wish to match this behavior, they must utilize and duplicate the output of those applications. It is recommended that applications not intentionally replicate this behavior as it was deprecated due to issues with its output, and is maintained only for compatibility with existing documents from that application. end guidance]

    --
    Build a man a fire, he's warm for one night. Set him on fire, and he's warm for the rest of his life.
  92. FUD upon FUD by I'm+Don+Giovanni · · Score: 1

    Wow, two FUD posts in a row.

    First, the GP says, "some standards' patent licensors forbid implementors to publish a partial implementation. I don't know if this applies to OOXML's license." A classic FUD tactic is to speculate about something "bad" then say, "I don't know if it applies here." Yeah, you don't know if it applies in this case, but that didn't stop you from putting forth the "Fear" and "Uncertainty" that it might. Classic.

    Then, the parent post uses pharses like, "My understanding is...", "It seems that if ...", ect. Again, phrases used to intentionally spread FUD without making explicit assertions. Classic.

    BTW, there is NO "Full implementation requirement" for OpenXML. None.

    --
    -- "I never gave these stories much credence." - HAL 9000
    1. Re:FUD upon FUD by bigpat · · Score: 1

      Wow, two FUD posts in a row.

      First, the GP says, "some standards' patent licensors forbid implementors to publish a partial implementation. I don't know if this applies to OOXML's license." A classic FUD tactic is to speculate about something "bad" then say, "I don't know if it applies here." Yeah, you don't know if it applies in this case, but that didn't stop you from putting forth the "Fear" and "Uncertainty" that it might. Classic. Sorry I guess you would be an expert on FUD tactics, hope you are enjoying your new laptop. Or are you just getting paid cash by Microsoft?

      Then, the parent post uses pharses like, "My understanding is...", "It seems that if ...", ect. Again, phrases used to intentionally spread FUD without making explicit assertions. Classic.

      BTW, there is NO "Full implementation requirement" for OpenXML. None. I am making a clear assertion that can be verified. From what I have read about the OOXML license, Microsoft can and will enforce its patents and trademarks against any developer that deviates from the OOXML standard and still calls it OOXML. No you don't have to fully implement it in terms of every XML element, but you can't deviate from the standard without violating the license. That would seem fine if Microsoft was a third party that wasn't making a competing product and could be trusted to enforce this requirement on itself, but it has an financial incentive not to do so and a long track record of "embrace and extend".

      So, no it does not have to be a full implementation in terms of implementing every last XML tag, but say if Microsoft introduces a new series of tags that are outside the standard it is fine for them since they are not implementing OOXML under a license, but if any licensor of OOXML tries to follow their lead, then Microsoft could sue them for violating the terms of the license. Microsoft looks like they are going ahead regardless of the standardization, but this certainly has a potential to get messy.

      Microsoft would get a great deal of negative press for a move to copy its own deviation from the standard and it would jeopardize its use in government, but it is a possible scenario under this flawed standard. More likely they will just extend the format in awkward ways while still technically following the standard, that way they can maintain technical compliance with ECMA OOXML and just jam in incompatibilities under the allowed way of doing extensions.

    2. Re:FUD upon FUD by I'm+Don+Giovanni · · Score: 1
      Sorry I guess you would be an expert on FUD tactics, hope you are enjoying your new laptop. Or are you just getting paid cash by Microsoft?

      I'm not paid by MS, in cash, laptops, or anything else. I guess you're one of those that pulls the "astroturf" card when you've been beaten in debate.

      But seeing as you're carrying IBM's water on FUD'ing OpenXML, I'll assume you're on IBM's tit. And your patron, IBM, is the one that invented FUD, BTW.
      --
      -- "I never gave these stories much credence." - HAL 9000
    3. Re:FUD upon FUD by tepples · · Score: 1

      So, no it does not have to be a full implementation in terms of implementing every last XML tag

      If this is the case, that's nice. But can you cite that this is the case?

      but say if Microsoft introduces a new series of tags that are outside the standard it is fine for them since they are not implementing OOXML under a license, but if any licensor of OOXML tries to follow their lead, then Microsoft could sue them for violating the terms of the license.

      XML supports namespaces. Stick OOXML in the o: namespace, stick XHTML in the h: namespace, XLink in L:, SVG in g:, MathML in m:, and whatever you want in any other namespaces, and you're within the spirit of XML.

    4. Re:FUD upon FUD by bigpat · · Score: 1

      You questioned my integrity first, asshole.

  93. Yo by jav1231 · · Score: 1

    I say keep it real, Yo! ASCII FOREVER!

  94. Changing file formats and Archiving by Alien54 · · Score: 1

    What we have here is the counterpart to a similar problem in image formats and other media.

    The problem is how to create a specification for documents that will not obsolete after five or ten years. You see the issue brilliantly with the old NASA video and data tapes that no longer have appropriate reader equipment to access the data, meaning that the information is potentially lost to our posterity.

    Now we have it regular documents, with the very real possibility that in the event of a technology disaster, the formats used with not be accessible. Data Encryption to protect copyright holders will prevent easy access to data vital for a rebuild.

    On a tangent, so far the best storage solution I can come up with is the HD-Rosetta Stone project which only requires an optical microscope.

    But this does not address the issue of obsolete file formats. There is the chance that that the most durable and accessible will be the most simple. Not Frontpage HTML or some similar monstrousity. In this case, simple IS good.

    --
    "It is a greater offense to steal men's labor, than their clothes"
  95. Re:MIcrosoft sucks. by yakovlev · · Score: 1

    Well, typically the mechanism is that monopolies are allowed to continue to exist when they compete fairly. Sometimes economies of scale produce natural monopolies, where the most efficient form of the market is a single, large, monopoly producer. Others are not able to compete simply due to the nature of the market. Operating system software MAY even be an industry like this. The problem is that Microsoft forced their competitors out of the market with unfair business practices, and unfair business practices are what should not be tolerated.

  96. Re:MIcrosoft sucks. by 99BottlesOfBeerInMyF · · Score: 1

    A pilot knows that he's drinking at the time that he's doing it, and knows that it's against the law to do so while flying.

    And so you think MS, after having been convicted of it multiple, and with more lawyers than god on retainer don't know when they're illegally leveraging their monopoly. For $10K a year they can e-mail me and I'll let them know. It isn't rocket science. It is naive to think MS is ignorant. They know exactly what they're doing and have built their business model upon the assumption that it will be more profitable to break the law and pay fines and settlements and bribes to politicians, than it is is to not break the law in the first place. So far, they've been completely right.

    So while a company is engaging in normal business activity, some judge years later can rule that the company had a monopoly years ago, and rule that those normal business activies were therefore illegal.

    In the US and EU 70% of a market is the point at which the courts first look. If you have 70% of a market, you just ask your lawyer. It is pretty obvious by your influence on the market if you have a monopoly. Further, the courts basically did nothing to MS the first time they were convicted. So what is their excuse for all the new and continuing violations since then? Did they forget they were a monopoly?

    's best to engage in normal business practice, and if some judge rules in the future that it was illegal because he declares that you had a monopoly at the time, then deal with it at that point. And doing that would not be "evil".

    Hey guess what, monopolies are rarely punished for antitrust violations except for practices after they have been found by the courts to be a monopoly. I have no problem with a company saying, "hey we didn't know" and then changing their business practices the first time. After that however, they have no excuse. MS has a monopoly and since they were declared to have one they have not stopped bundling and tying to other markets. That is knowingly breaking the law for profit. I have zero sympathy.

    IBM was selling and heavily advertising OS/2 throughout the 90's.

    And their market share among OEMs was what again? Oh, less than 1%. Nice try, but no dice.

    MS subsidized Apple, and many said that part of the motivation was to ensure that MS did NOT have a monopoly

    Umm, MS lost a patent violation lawsuit. They bought some shares of Apple as part of the settlement, but the money was less than 1% of the liquid assets Apple had on hand. If you consider that "bailing them out" you're on crack.

    ...everyone (certainly Mac advocates) assumed that Mac OS and Windows were competitors...

    If the economists, lawyers, and financial advisors working for MS thought that I hope they were all fired. You're only a competitor if you sell to the same people. MS sold to computer vendors. Apple sold to retailers and individuals who were buying computers. Anyone who thinks Sony and Ford are in the same market because Sony sells car radios to car manufacturers, needs to go back to school.

    MS didn't imagine that a judge would rule that Mac OS isn't even in the same market

    Are you kidding? They were IBM partners and the talk of the 80s was IBM's antitrust issues. Your belief in MS's ignorance is a big stretch, and irrelevant.

    Take Apple or Google, for other examples. Is it really so unimaginable that a judge could rule in the future that Apple or Google have monopolies *today* in mp3 players or online music (in the case of Apple) or web search advertising (in the case of Google)?

    In the case of Apple, no they are being investigated right now, as I'm sure Apple expected they would be. Depending upon quite how much influence they have in that market, they may well be forbidden from tying iTunes, ITMS, and iPods. They play by the same rules. Google has way to small of a chunk of the search market to be declared a monopoly, something like 40

  97. Using internals as a format is bad, but by dallaylaen · · Score: 1

    From a technical perspective, using the internal structure as the file format is the stupidest thing you can do. There is *no* good technical reason to ever do so.

    Well, there is one reason. Speed.

    Imagine you have an application which needs storing several megs of structured data routinely. You can

    (a) write a specialized format parser (it's hard, and there may be bugs);
    (b) use standard (e.g. XML) parsers (it would be dog slow in most cases);
    (c) write the damned objects/structs to disk.

    The (c) is the way to go when you need to get the product out of the door, yesterday. Which seems to have been the case for MS.

    Yes, this will get you in trouble, quickly. But the alternative is having trouble now.

    --
    WYSIWIG, but what you see might not be what you need
  98. Re:MIcrosoft sucks. by x_MeRLiN_x · · Score: 1

    That was his point..

  99. No. by DrYak · · Score: 1
    isn't that like saying binary binary large object?


    No. Binary blob in lowercase. (Not the initialism)
    It's more like a binary blob-that-ate-everyone.
    Or a binary poring.
    Or a binary jelly.
    Or pick your own duscusting sticky goo-like from your favorite RPG.
    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
  100. Formats... by DrYak · · Score: 1
    There's nothing wrong with saving in a file format that matches your internal representation, in fact, it's a darn good idea...
    ...for a temporary file. ( .TMP ) or the internal storage used on the servers of Google-Office.

    Not for saving a file that should be stored on a disk for later retrival (at a future time when the specific version of the software maybe won't exist anymore) or that should be sent in an e-mail (to someone who may use a different software).

    Also Microsoft choose the worst way to implement such internals and retro-compatibility.
    Different versions of word interpret layout parameters in different way.

    The correct way to implement should be either :
    - convert the parameters into the new standart (for exemple : if older implementation didn't count border in width, add them to the width while saving).
    - extend the standard so one may specify which behaviour to follow in a generic way (like giving the scale or the units used if two different version doesn't interpret "points" of the same size : create a new parameter that helps specifiying the size of a "point" in cm, so that any software could do the math and convert to what it needs)

    What microsoft did instead :
    - They extended the standart with stupid paramater like " isCalculatedUsingFormulaFromWord9x="{yes|no}" " which doesn't add anything useful to a standart except support for specific legacy software from microsoft.
    - Worse they even only say that those parameters are deprecated and shouldn't be used in new documents and failed to specify what those actually mean in the standard. (There for Word 2007 is the only software that could correctly open XML documents using legacy parameters)

    In short, where they should have put " spacingIs="{int}cm" " they put " spacingIsLikeWordForDos="{bool}".
    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
  101. Ultima Pule! by Anonymous Coward · · Score: 0

    OMGzorz!!! How teh 3vil M$ iz!!!

    How evil of them, wanting to maintain backwards compatibility with old documents!

    And not only that, but they evilly slipped in Word Perfect, so it would extend backward compatibility of another product. Isn't it perfectly obvious how evil they are being? It's all part of their plan to... ummm... uh... spread their evil!!!

    Lunix Zealots of these internets, unite!!

  102. Re:MIcrosoft sucks. by Anonymous Coward · · Score: 0

    > Things that are illegal for a monopoly are perfectly legit for a non-monopoly. It's a crazy law, but that's how it works. Microsoft broke no federal laws to *gain* their monopoly.

    I don't know of them doing anything criminal, but I DO know that they ran plenty of folks out of business using tactics they had to pay millions upon millions in civil legal damages for, and I don't mean just monopoly abuse, I also mean the things they did to get there. It's not done until Lotus won't run, etc.

  103. Re:MIcrosoft sucks. by Brandybuck · · Score: 1

    Microsoft isn't the only one with OEM bundle deals. Before the "monopoly", I remember being in a store and seeing a choice between a i386 with Windows and a i386 with DRDOS/GeoWorks.

    My point still stands. Acts that are illegal for a monopoly can be perfectly legal, ethical and moral for a non-monopoly. It was perfectly legitmate for non-monopoly Microsoft to bundle Windows with OEM systems.

    Microsoft's unforgivable sin was being successful. It's a lesson all aspiring companies should take to heart.

    --
    Don't blame me, I didn't vote for either of them!
  104. Re:MIcrosoft sucks. by hachete · · Score: 1

    Oh, MS is a monopoly all right. It's written there in Judge Jackson's statement; something that's *never been overturned. Ever tried to buy a bare PC? Seems you can't. It's Windoze or Nodoze. So, pre-loaded it is and not a choice in sight.

    As in "too successful" ... when have Americans started liking monopolies? I thought that was an anathema to the free-market. But, hey, it's your market. I just wish the Imperialists wouldn't thrust their idea of a "free market" onto the rest of us.

    I take the opposing view. The markets are moral, and we have regulators to make sure the markets behave themselves within limits. Microsoft is an object lesson for regulators, o'wise they'll find some shit monopoly clogging up the pathways of business, demanding tax from all and sundry, holding back the reins of what America wants to be. *AA and Microsoft, they're made from the same cloth.

    --
    Patriotism is a virtue of the vicious
  105. Re:MIcrosoft sucks. by Gr8Apes · · Score: 1

    I'll disagree, try any of their servers. Their consumer desktop machines suck eggs anyways.

    --
    The cesspool just got a check and balance.
  106. "Dark Corners"? by chris-chittleborough · · Score: 1

    "Dark Corners"? Those corners aren't just dark, they're full of grues. And they're not just corners, they're entire dungeons. All dark, and full of enormous, very hungry grues ...

  107. Re:MIcrosoft sucks. by Haeleth · · Score: 1
    Goto Dell and order an OS-less PC. I'll wait...
    It's quite hard to find OS-less Dells, I'll grant you, but it was easy enough to find plenty of Dells that come with Red Hat Enterprise Linux rather than Windows, which makes it rather unlikely that Microsoft has forced them to buy a Windows license for every PC they sell.

    (The Linux machines are slightly more expensive than an equivalent with Windows preinstalled, but that's probably more because they get paid to dump a whole pile of shovelware on Windows PCs than because Microsoft is forcing them to make Linux an undesirable option. Also, Red Hat is hardly known for being a cheap option.)
  108. Re:MIcrosoft sucks. by Anonymous Coward · · Score: 0
    Microsoft bended and broke the law every chance they got to ensure that there never was any competition.
    Oh? So what was that highly-successful DR-DOS product that remained a popular alternative throughout the 80s? And why did so many people pay good money for OS/2, DESQview, etc, all of which, if not mass-market successes, were certainly far from the utter failures that the anti-Microsoft crowd like to pretend?

    (In case you were about to bring up the old chestnut about Windows 3.0 not working on DR-DOS, please don't bother: that was only true of early beta versions of Windows 3.0, probably for perfectly valid reasons, and every public release worked just fine on DR-DOS.)

    The reason Microsoft ended up with a monopoly on PC operating systems wasn't because it broke any laws. It's because Windows 95 blew away the competition - OS/2 had concentrated too hard on being compatible with earlier versions of Windows to have any killer apps of its own, while the various other alternatives were still just fairly basic shells on top of DOS. I'll repeat that: Microsoft gained its current monopoly by having the best product on the market, period. The only serious alternative would have been MacOS, and Apple never unbundled that from their hardware for long enough to have any mass-market impact. So everyone switched to Windows 95, which while deeply flawed was the best thing they could get, and Windows 95's applications were so far ahead of anything the competition could run that nobody ever caught up.

    Today, viable competition exists in the form of OS X (market share growing rapidly now that Apple hardware is no longer extortionately priced) and GNU/Linux (now that WINE is mature enough to provide adequate Windows compatibility), and this is reflected in the fact that mainstream PC manufacturers such as Dell now offer GNU/Linux as an alternative to Windows. (Which they couldn't do if Microsoft was breaking the law by making the kind of lockout deals that conspiracy theorists claim they make...)

    Note that Microsoft's total domination of the market exactly matches the period during which Microsoft's products offered the best price/performance deal. Before that was the case, competition flourished; now that viable alternatives have begun to appear, Microsoft's monopoly is beginning to falter. There is no need to make up implausible conspiracy theories or to spread anti-Microsoft FUD about illegal tactics. Microsoft simply had the best mousetrap, and now that other people have made even better mousetraps, lo! the world beats a path to other doors. It's just the way capitalism is supposed to work.

    (Would OS X have been half as good as it is if Microsoft hadn't had a monopoly? Looking at how unutterably crap MacOS 9 was on a technical level, I think not. That's competition for you, and the consumer has certainly benefited.)

    As SCC had already committed to the other deals, they had no choice but to take MS's offer. Sure, no one held a gun to the head of the SCC executives forcing them to take the deal, however, they didn't have any other reasonable alternatives. MS's behavior was legal, but certainly not ethical.
    That rather depends on whether Microsoft was aware that SCC had already spent the money before the deal was finalised. If Microsoft knew that, then there is a very tenuous case to be made that they acted unethically; if, as seems rather more likely, Microsoft did not have inside information on the stupidity of the SCC executives, then they were behaving perfectly ethically, and the SCC executives have only themselves to blame for being incompetent enough to paint themselves into a corner by committing to a deal before the terms were finalised.
  109. Social construction of technology by Geof · · Score: 1

    Avoiding dataloss is perhaps the best reason for having a native file format, at least for existing applications.

    I think this is a good point, particularly relevant both for existing applications and for areas of rapid innovation where it's not yet clear what should be in the app or the format. I consider myself corrected (and am somewhat embarrassed to have overlooked this in my previous post). I am not convinced it is the best case for word processing though, so I'm curious if you have any specific insights there.

    Social constructivist theories of technological development describe the stabilization of the technology, at which point differing interpretations - often corresponding to different technical designs - converge on a single understanding of the technology. Word processing in particular is an established class of application for which consensus has been reached regarding its role and feature set[1]. According to this reasoning, as innovation slows, the benefits of custom formats would evaporate while the merits of standardization increase. I would therefore expect gravitation towards using open formats by default, with custom formats reserved for niche uses in need of application-specific features.

    Significant innovation, I suspect, would tend to be understood not in terms of "word processing", but would be differentiated into new classes of application, which at a later date might (or might not) get folded back into word processing proper. HTML and blog post authoring strike me as two current examples of this.

    None of what I say, however, is a result of direct experience in consumer application development, so your experience may point to other outcomes.

    [1] Modulo a few exceptions; I'm sure this represents the vast majority of people - far more than Joel's 80% (though I think Joel overrates convergence - witness the iPod, game machines, portable DVD players, and the proliferation of other single-task technologies).

  110. Re:MIcrosoft sucks. by zero_offset · · Score: 1

    "Ever tried to buy a bare PC? Seems you can't."

    Bullshit.

    A story about HP machines in France is not support for your sweeping comment.

    Just a few months back I helped my mother-in-law pick out a no-OS PC and the list of choices I showed here included machines from such major retailers as Wal-Mart, CompUSA, and Circuit City. Christ, the cretins of Wal-Mart will even sell you a Linspire PC with Linux pre-installed if that's your thing.

    I'm no fan of monopolies, and Microsoft has pissed me off quite a bit in the past (though I'm no foaming-at-the-mouth MS-hater, either), but the point is, your statement is dead wrong in the US at least, and I would suspect outside the US as well.

    --

    Slashdot quality declines as the number of hot grits posts decreases. - Provolt's Law, Apr-09-2005

  111. Re:MIcrosoft sucks. by FiberOPtic · · Score: 1

    "You forgot those crazy laws that grant human rights to corporations in the first place."

    And the laws that create corporations in the 1st place.