Slashdot Mirror


Docvert 3.0 Lessens Reliance On Microsoft Office

An anonymous reader writes "After 10 months of development Docvert 3.0 was released today. This open source web service converts DOC files to Oasis OpenDocument 1.0, and then to HTML, RSS, or any XML format. Try the ODF demo or download the source and install it on your own box. Version 3.0 comes with an MS Word Plugin, FTP/WebDAV upload, and an in-browser document editor."

108 comments

  1. It promises to be an interesting battle by 0racle · · Score: 3, Insightful

    Ya, I'm on the edge of my seat. It will get adopted as a standard or it won't. Office will use it either way and anyone wanting to interoperate with Office will have to try to implement it as well.

    --
    "I use a Mac because I'm just better than you are."
    1. Re:It promises to be an interesting battle by SuperStretchy · · Score: 1

      Phew I hear you man... I'm staying up every night until I hear more! Its like that one time when the PS3 came out...

    2. Re:It promises to be an interesting battle by Zaiff+Urgulbunger · · Score: 5, Insightful

      All true, but if it does get adopted as a standard, then MS can use this to ensure the continued use of MS Office by government agencies around the globe. If it doesn't get adopted, MS will be under pressure to provide a supported, native, OOD format.

    3. Re:It promises to be an interesting battle by CastrTroy · · Score: 1, Interesting

      What are the implications if it does get adopted as a standard? Can anybody implement for free? Can MS get fined for saying they support the standard when in fact their software actually does not (ala, Java, CSS, HTML, Kerberos, and others). If we could just get MS to follow some standard and actually implement it as the standard as written, then I think we could get long way to interoperability with MS word. If it's an open standard, and MS can't just go ahead and change it whenever, and they have to actually follow it, then what does it matter who made up the standard?

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    4. Re:It promises to be an interesting battle by Divebus · · Score: 1

      It only figures... Microsoft expecting a compendium of proprietary digital glop to be officially embraced by all. Why even submit this for standardization? De-facto "standards" have worked so well for them [ -and so badly for everyone else].

      Application Rejected. Thanks for Playing. Please Try Again.

      --

      Most of the stuff on /. won't survive first contact with facts.
    5. Re:It promises to be an interesting battle by truthsearch · · Score: 4, Insightful

      In my opinion there are two reasons Microsoft is trying to create their own standard: PR and government contracts. The PR aspect is obvious. The US government is Microsoft's largest customer (by far) and also the most likely to demand open document standards. Other governments will likely do the same long before corporations demand it. So Microsoft needs to have their own standard which they implement first in order to get the contracts.

      They don't have to implement it correctly. They can claim support for a standard for years without actually following it (e.g. CSS, Kerberos, etc.) and still get the contracts. They were actually involved in creating some CSS standards and still didn't follow them.

      It's all about the money. Get the big contracts and nothing else matters.

    6. Re:It promises to be an interesting battle by Zaiff+Urgulbunger · · Score: 2, Insightful

      I don't *know* the answers but I believe:

      Can anybody implement for free?
      I think so! But, you'll need to get a copy of the standard first, and I believe ISO normally charge.. rather more than I'd like for that.

      Can MS get fined for saying they support the standard when in fact their software actually does not...
      I doubt it, but if a test case can be produced to prove the fault, they'll maybe/probably/hopefully/perhaps fix it. Depending on whos asking for a fix!

      You're right that *a standard* is far better than no standard at all. But the only reason MS have done this now is because they've been forced into it due to various governments demanding open standards for documents, and thus, by getting this adopted as a standard, they get to keep business! They are not interested in other people using the standard to write competing software, and as such, I expect they'll move the goal posts as soon as anyone gets close.

    7. Re:It promises to be an interesting battle by mrchaotica · · Score: 3, Informative
      Can anybody implement for free?

      No, because bits of it are patented (especially the "legacy compatibility" parts that basically just say "emulate old versions of Office").

      Can MS get fined for saying they support the standard when in fact their software actually does not (ala, Java, CSS, HTML, Kerberos, and others).

      In this case it won't matter, because the OOXML "standard" is effectively defined as "whatever MS Office does." In other words, MS basically documented Office's behavior down to the smallest detail, and submitted it to ECMA and now ISO.

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    8. Re:It promises to be an interesting battle by FireFury03 · · Score: 3, Informative

      In other words, MS basically documented Office's behavior down to the smallest detail

      They didn't even do that. A lot of the document states that when you encounter certain tags you will emulate a Office bug, but never specifies the details of that bug because that is "beyond the scope of the document". So even if you have the standards document, you can't fully implement the standard without getting all the old versions of Office and reverse engineering their behavior.

  2. I originally read OOXML ... by alispguru · · Score: 5, Funny

    as:

    Object
    Oriented
    X
    M
    L

    and whimpered at the thought...

    --

    To a Lisp hacker, XML is S-expressions in drag.
    1. Re:I originally read OOXML ... by Anonymous Coward · · Score: 0

      If the thought of object-oriented XML makes you whimper, check out JSON - the fat-free alternative. It's object-oriented, with most (though not all) of the benefits of XML, lightweight, and simple.

    2. Re:I originally read OOXML ... by Anonymous Coward · · Score: 0

      Object Oriented XML... that'd be SOAP.

    3. Re:I originally read OOXML ... by parvenu74 · · Score: 1

      That is exactly what I thought at first as well!

      But let's assume that this OOXML thing get through the approval process... with an open standard anyone can make import/export functionality for MS Office documents in non-MS applications. From iWork to KOffice to OpenOffice and whatever else is out there, will there be any need to have MS Office in order to read, edit, and forward on "MS Office documents?" To me, it seems like MS is creating a way for everyone else to erode their market share.

    4. Re:I originally read OOXML ... by xenocide2 · · Score: 1
      --
      I Browse at +4 Flamebait

      Open Source Sysadmin

    5. Re:I originally read OOXML ... by rucs_hack · · Score: 1

      Me too, not a nice moment.

    6. Re:I originally read OOXML ... by Knuckles · · Score: 3, Insightful

      That is exactly what I thought at first as well!

      And i wonder how you could. Even just reading the the /. blurb makes it clear that the "standard" as proposed is non-implementable.

      --
      "When I first heard Daydream Nation it quite frankly scared the living shit out of me." -- Matthew Stearns
    7. Re:I originally read OOXML ... by MoxFulder · · Score: 2, Interesting
      But let's assume that this OOXML thing get through the approval process... with an open standard anyone can make import/export functionality for MS Office documents in non-MS applications. From iWork to KOffice to OpenOffice and whatever else is out there, will there be any need to have MS Office in order to read, edit, and forward on "MS Office documents?" To me, it seems like MS is creating a way for everyone else to erode their market share.


      Indeed... that would be nice. Try reading the article to find out all the obstacles Microsoft has thrown down to actually prevent this interoperability from ever happening :-)
    8. Re:I originally read OOXML ... by Anonymous Coward · · Score: 0

      "with an open standard anyone can make import/export functionality for MS Office documents in non-MS applications..."

      Yes, but too bad that is not the case for OOXML. Their "standard" calls for backward compatibility with old MS Word documents but does not tell anybody how to implement this proprietary feature.

    9. Re:I originally read OOXML ... by backwardMechanic · · Score: 1

      I read it and thought how childish MS can be. I've no idea what the OO stands for, but I can think of a related open software package it might be designed to frustrate...

    10. Re:I originally read OOXML ... by j1mc · · Score: 1

      I originally read it as "open office xml," which is probably what microsoft wants me to think . . .

    11. Re:I originally read OOXML ... by ozbird · · Score: 1

      Can't OpenOffice.org do them for trademark infringement? "Office Open XML" is both confusing and misleading.

    12. Re:I originally read OOXML ... by Millenniumman · · Score: 1

      Um, that would that be a hard sell coming from them. The reason OpenOffice.org has that stupid .org at the end is because OpenOffice is held as a trademark by someone else. If "OpenOffice.org" does not infringe "OpenOffice", then "Office, Open XML" does not infringe "OpenOffice.org".

      --
      Stupidity is like nuclear power, it can be used for good or evil. And you don't want to get any on you.
  3. Please recommend compliance validation tools by statusbar · · Score: 3, Interesting

    One of the things that bugs me are these 'enormous specifications' that are inconsistent. What we need is not just a document, but the tools necessary to verify a generated file. Not just for valid XML, but for all the little microsofty-bits hidden inside.

    --jeffk++

    --
    ipv6 is my vpn
    1. Re:Please recommend compliance validation tools by Zaiff+Urgulbunger · · Score: 4, Insightful

      Microsoft isn't doing this for you silly! The whole intent is likely that it is *hard* for anyone to implement.

    2. Re:Please recommend compliance validation tools by xenocide2 · · Score: 2, Insightful

      On the contrary, when taking the article at face value, I think that the whole intent is to make it *easy* for Microsoft to implement the first time, because it's already done. This is of course backwards from how a community standard should work, it should be an effort that is repeatable. Instead we have whatever crap their contractors turned in, with apparent flaws turned into requirements. I doubt even Microsoft could write a second compatible handler for this document format. Well, perhaps instead of "on the contrary" I should say "to clarify"; the standard appears to be designed so that any implementation but the original is near impossible.

      It sounds like pretty much like business as usual for MSFT, although describing in 6,000 pages how hard it would be to create an interoperating product is new. Their format is the standard, even the flaws that they didn't fix before release.

      --
      I Browse at +4 Flamebait

      Open Source Sysadmin

    3. Re:Please recommend compliance validation tools by TheRaven64 · · Score: 2, Insightful
      I am beginning to think that a requirement for anything being a standard should be two independent implementations (i.e. no shared code). I would even like to require that at least one be under a license no more restrictive than the 3-clause BSD license (GrokLaw FUD aside), and ideally MIT-licensed or Public Domain.

      ODF has already been supported by several implementations, and some of these threw up some OpenOffice-isms; if the support had been finished before the standard had been finalised then this would have allowed them to be fixed.

      --
      I am TheRaven on Soylent News
    4. Re:Please recommend compliance validation tools by bigpat · · Score: 1

      So you have been talking to the folks at the IETF have you?

  4. Describing exceptions doesn't make a standard. by splutty · · Score: 4, Insightful

    Despite what Microsoft thinks and how they're been acting in the past with all their 'standards'; Describing all the exceptions doesn't make something a standard. Describing them in the context of a non-standardized environment, makes it even less so.

    Although I'm quite sure that Microsoft really doesn't give a and will push this through as 'their' standard that everyone else will have to adhere to to be able to do anything with Mickyshaft generated content anyway.

    Whether ISO approves of this or not is inconsequential, the only thing that matters is that M$ can now say: Look, we proposed a standard, it's not our fault 'they' think it's not good enough.

    --
    Coz eternity my friend, is a long *ing time.
    1. Re:Describing exceptions doesn't make a standard. by Gazzonyx · · Score: 2

      Whether ISO approves of this or not is inconsequential, the only thing that matters is that M$ can now say: Look, we proposed a standard, it's not our fault 'they' think it's not good enough.

      My response: I proposed a rational solution for the tech department that I control - it's not my fault that we decided to go to another vendor when you no longer support Office2K. Google gives its love and regards. As does OpenOffice, MySQL, and Linux. Sincerely,

      The guy who makes the tech decisions

      --

      If I mod you up, it doesn't necessarily mean I agree with what you've said, sorry.

    2. Re:Describing exceptions doesn't make a standard. by Anonymous Coward · · Score: 4, Insightful

      Whether ISO approves of this or not is inconsequential, the only thing that matters is that M$ can now say: Look, we proposed a standard, it's not our fault 'they' think it's not good enough.
      It matters to governments, who are coming under increasing pressure to rationalize their MS Office upgrade cycle (and why they're not getting out, via standards)

      But yeah it doesn't matter much to the private sector / industry.
    3. Re:Describing exceptions doesn't make a standard. by truthsearch · · Score: 1

      Not true. As the AC above states, it does matter to governments who are finally starting to demand open standards for documents. The US government is Microsoft's largest customer. Microsoft would lose them if they finally decided to only use applications that have documented and standard file formats.

    4. Re:Describing exceptions doesn't make a standard. by suggsjc · · Score: 2, Insightful

      It was mentioned before, but since the US gov't is their biggest customer. Until you are the "guy who makes the tech decisions" for them, M$ won't care about your response.

      Idealism will only get you so far, especially when it squares off against practicality.

      --
      When I have a kid, I want to put him in one of those strollers for twins and then run around the mall looking frantic.
    5. Re:Describing exceptions doesn't make a standard. by Gazzonyx · · Score: 1
      It is with deep regret that I acknowledge that validity of this assertion. That being said, I'll not give in simply because I have picked a fight I cannot win - call it hubris, idealism, stupidity, or whatever you'd like.

      I'm not necessarily anti-M$ (ducks), however, unless they tone down their schoolyard-bully tactics, they won't see another dime of my milk money.

      --

      If I mod you up, it doesn't necessarily mean I agree with what you've said, sorry.

    6. Re:Describing exceptions doesn't make a standard. by suggsjc · · Score: 1

      I agree. Grass-roots movements can and do make differences on a large scale if given enough time and support. So, it doesn't mean that we should all just give up our goals and follow the crowd.

      There is only 24 hours in a day (roughly) so there are only so many battles we can fight. Pick the ones that matter most to you and do what you can. But I think the most important aspect is to acknowledge that we will all have different priorities. Just because your way makes most sense to you and the products/services you choose may be better than the "big guys" that doesn't mean that it will be the right choice for everyone especially when it isn't the most practical choice.

      I'm typing this on a dell laptop running xp (because my wife uses it too). My dell computer at work runs xp (most of our software tools are windows only). But the servers I personally maintain I hand built and run linux. I like the toolsets of *nix boxes much more for administration and the desktop variants of *nix are getting close to "prime time" as well. But it isn't worth the hassle for me to teach my wife how to "use linux" (even though it wouldn't be THAT difficult...mostly perception) or to try to find alternative software for work. So, I do what I can where I can, and thats as much effort as I'm willing to spend. To each their own.

      --
      When I have a kid, I want to put him in one of those strollers for twins and then run around the mall looking frantic.
  5. Deja Vu Docvert by ei4anb · · Score: 5, Interesting
    Way back before the web I worked in a Unix shop that was a development lab for a big multinational. Head office kept sending us e-mail with large MS Word attachments. We got tierd of having to go down to the library, where we kept the only PeeCee in the department, just to see what was in the attachment.

    I solved the issue by writing a program that ran on a Windows PC (an old one that had been discarded and was gathering dust in the closet) that received SMTP mail, detached the Word attachment, started up Microsoft's Word Viewer to read the attachment, then "printed" it to a file in PDF format and finaly SMTP mailed it back to the sender.

    From then on all we had to do was forward the email to the robot and wait for a readable version to bounce back. As I used Microsoft's own Word Viewer there were no problems whenever a new version of Word came out, I just downloaded the latest viewer :-)

    1. Re:Deja Vu Docvert by CastrTroy · · Score: 1

      That's a pretty smart Idea. It's a pretty good way to protect against Word Macro viruses too. I'm pretty sure the viewer doesn't support Macros, and even if it does, only the computer processing the documents would get infected, and that machine could be reimaged every night if it was a real problem.

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    2. Re:Deja Vu Docvert by cnettel · · Score: 1

      It should be noted that the recent Office exploits have been buffer overruns and similar effects, not directly related to macro handling. As the rendering mechanism in the Word Viewer is quite identical, it's not invulnerable. (So for an automated machine stuffed away to do only this, reimaging can be quite needed if no other checks are performed.)

    3. Re:Deja Vu Docvert by Anonymous Coward · · Score: 0

      It would be nice to have the viewer running inside WINE ... then you wouldn't need a separate server. Of course, the main problem would be to instruct the viewer to generate pdf. Hmm ... maybe a nice project. Any takers?

    4. Re:Deja Vu Docvert by TheRaven64 · · Score: 1
      There have been vulnerabilities that affect word viewer (I believe one is still in the wild). This could, however, be prevented from becoming a vulnerability by running the viewer in a VM that was restored to a snapshot state immediately after running. Ideally the document would be loaded from a disk image that would be generated before starting the VM and then printed to a generic PostScript printer over IPP on the host machine, which would then convert the PostScript to PDF. No connections other than to the host machine on the IPP port would be allowed.

      If the document did contain a virus then all it would be able to do is trash the hard drive on the VM, which would then be restored as soon as it had run.

      --
      I am TheRaven on Soylent News
    5. Re:Deja Vu Docvert by kripkenstein · · Score: 1

      That's a nice solution. Any chance you might release it under an open-source license?

    6. Re:Deja Vu Docvert by endrue · · Score: 1

      If you had an old PC laying around, why didn't you just use that one to read the Word docs?

      - Andrew

      --
      I meta-moderate because I care.
    7. Re:Deja Vu Docvert by multipartmixed · · Score: 2, Interesting

      UNIX shop... before the web... only one PC... rendering to -- PDF?!

      I think I smell a tiny fib.

      I would have believed you if you had told me that you used Windows' (native since 3.1) Apple Laserwriter printer driver set up to print to a file, then mailed the resulting (PostScript) file to yourself to print or view with GhostScript/gv.

      Well, except that I didn't think the Word viewer was released until either '95 (or as late as '97?), and it was released because MS broke the Word 6.0 (Office 4.3) document format, which was in fairly widespread use at the time (putting Wordperfect 5.1 out of business).

      --

      Do daemons dream of electric sleep()?
    8. Re:Deja Vu Docvert by sootman · · Score: 1

      Maybe because having dozens of users get up and leave their desks to read documents isn't really effective in a business setting?

      --
      Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
    9. Re:Deja Vu Docvert by Anonymous Coward · · Score: 0

      PDF 1.0 came out in 1993. Same year as the WWW came into being. Microsoft's Word Viewer didn't come out until Office 97.

    10. Re:Deja Vu Docvert by wonkobeeblebrox · · Score: 1
      From then on all we had to do was forward the email to the robot and wait for a readable version to bounce back. As I used Microsoft's own Word Viewer there were no problems whenever a new version of Word came out, I just downloaded the latest viewer :-)

      That seems like the hard way to solve the problem...

      ...you can do largely the same thing with the unix "strings" command. You'll lose formatting, but you're interested mostly in content, right?

      Watch my example:
      computer:~/Documents wonkobeeblebrox$ ls Op*
      Operation Restore Hope Election Night 2006
      Operation Restore Hope.pdf
      computer:~/Documents wonkobeeblebrox$ file "Operation Restore Hope Election Night 2006"
      Operation Restore Hope Election Night 2006: Microsoft Office Document
      computer:~/Documents wonkobeeblebrox$ strings "Operation Restore Hope Election Night 2006" | more
      jbjbd
      Operation Restore Hope*
      Join me for a night of celebration as we
      watch our favorite shows make us laugh and then
      watch our favorite candidates make us cheer!
      Tuesday, November 7, 7PM
      (soft drinks and snacks will be provided)
      * Alternate names considered:
      Operation Dumbo-Drop
      Operation Its-About-Time!
      Operation Investigation
      Operation We-Actually-Won-This-Time!

      ...etc

      you lose some of the formatting, but it is dead simple and gets the contextual points across, I think...

  6. Open XML is a transliteration by suv4x4 · · Score: 1

    It's to be expected as Open XML is a straight transliteration of the DOC "dumps" to XML format.

    I wonder how it ended this way: not enough time to properly develop and implement a more proper standard, or by design.

    I feel it's both.

    1. Re:Open XML is a transliteration by TheRaven64 · · Score: 5, Insightful
      The design requirement of Microsoft's XML format was (obviously) that it be possible to convert existing Word documents to it without any loss. In order to do this, there must be a one-to-one mapping between the .DOC semantics and the OpenXML semantics.

      The second design requirement was that the spec be developed and released quickly, before ODF had time to gain much traction. Between these two objectives, it's hardly surprising that it ended up the way it did...

      --
      I am TheRaven on Soylent News
    2. Re:Open XML is a transliteration by cnettel · · Score: 2, Interesting

      Well, the XML notation for Office 2003 was even more so. They broke that one now, and some changes are to the better. The requirement to be able to represent just about anything that was possible in the previous versions, faithfully, is still a great contaminant, as you say.

    3. Re:Open XML is a transliteration by Hal_Porter · · Score: 1

      Any format that allows you roundtrip from .doc to that format and back without altering formatting has to be like this, right? It has to support all the features the .doc format does.

      That's the reason for all the "render like WordPerfect 5.x" options that people have complained about, because they have to allow people to convert to the XML format and then convert back without reducing the document to an unreadable mess.

      I remember reading some interview with the Office program manager where he said rountripping to HTML was a big feature, long before they thought of making their XML format an open standard.

      http://xml.coverpages.org/microsoftHTML971215.html

      But obviously the HTML or XML that you emit that can be roundtripped back to .doc will be fairly unreadable, since you need to encode all the stuff that .doc supports somehow. Like suppressTopSpacingWP in fact.

      The alternative is to not support roundtripping and then wait for slashdot headlines like "Users find that the new Office XML format mangles their documents". More to the point, Office is dead if it does that, the advantage it has is that it's a defacto standard, used for the vast majority of editable documents in big companies. Anything which dilutes that is dangerous. And I doubt somehow that Microsoft really care if anyone else recognizes their standard, they just need to be able to claim that they have documented it to make it seem open.

      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    4. Re:Open XML is a transliteration by Maxo-Texas · · Score: 1

      You know that's an interesting take.

      Microsoft is so huge now (and i work for a huge corporation as well) that it may be easier/more natural to just let the business mangle things in a natural way than to get a purposeful plan going to do this kind of thing.

      The project planning and meeting time alone would be bad plus you wouldn't want a documentation trail showing you intended to lock in to word or it might come back to bite you later.

      --
      She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
    5. Re:Open XML is a transliteration by 99BottlesOfBeerInMyF · · Score: 4, Insightful

      That's the reason for all the "render like WordPerfect 5.x" options that people have complained about, because they have to allow people to convert to the XML format and then convert back without reducing the document to an unreadable mess.

      There is no reason I know of why the XML format cannot support all the features of Word and round trip, without relying on nasty hacks like this, it just takes more work. The problem with "Open"XML that I've seen is the concentrate entirely on supporting only the features of .doc files and their interactions with other programs to the exclusion of anything else. Rather than "render like WP 5.x" you need to define how WP 5.x renders that feature, then incorporate it into your conversion script in a way that makes sense in general for documents.

      The whole format is built upon the assumption that only MS and Word will be using it and it is not designed to abstract word processing documents in general, but to kowtow to the eccentricities of Word.

      The alternative is to not support roundtripping and then wait for slashdot headlines like "Users find that the new Office XML format mangles their documents".

      No, the alternative is to do it right and build hacks like the ones you mention into the import and export routines, rather than embedding them, without any definition, into the format.

    6. Re:Open XML is a transliteration by kalpaha · · Score: 1

      Yes. And for the reasons you list, no one should adopt this "standard", which hopefully it will never be declared.

    7. Re:Open XML is a transliteration by spitzak · · Score: 1

      must be possible to convert existing Word documents to it without any loss. In order to do this, there must be a one-to-one mapping between the .DOC semantics and the OpenXML semantics.

      Uh, no, that is wrong. A many-to-one mapping would work for that requirement. If doc has many ways of representing the same information, turning them all into the same way would work just fine and would be lossless.

      A requirement that it lossly convert *both* ways, however, would require this. Otherwise converting to XML and back would probably map the many ways of representing the same data to a single one and would be lossy. However I am willing to bet that OOXML does not satisfy that criteria. So this excuse is totally bogus.

    8. Re:Open XML is a transliteration by complete+loony · · Score: 1

      In Word documents and other OLE formats, the object responsible for rendering the content is also responsible for loading and saving its own state into the document format. In the DOC format this is mostly a binary dump of the objects state. For OOXML it seems they have done some kind of serialisation based on their internal variable names.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    9. Re:Open XML is a transliteration by Hal_Porter · · Score: 1

      The whole format is built upon the assumption that only MS and Word will be using it and it is not designed to abstract word processing documents in general, but to kowtow to the eccentricities of Word.

      Well it's a Microsoft format, and the only word processor they have is Word. What do you expect?

      Rather than "render like WP 5.x" you need to define how WP 5.x renders that feature, then incorporate it into your conversion script in a way that makes sense in general for documents.

      Microsoft spent lots of time and money taking over the market from WP, and they have no incentive making it easy for someone to take over the market from Word.

      And the logic inside Word just needs the "render like WP 5.x" hint, so support for this feature comes free if you already have Word. The fact that the OpenOffice people need to spend ages fiddling around with WP 5.x to implement this feature is a plus from Microsoft's point of view

      Whereas doing it your way means that Word needs a rewrite but OpenOffice has a spec they can implement. It's not surprising thay Microsoft chose to do it their way rather than your way. And it doesn't work - see below.

      No, the alternative is to do it right and build hacks like the ones you mention into the import and export routines, rather than embedding them, without any definition, into the format.

      "Right" is subjective here. You define right as something which OpenOffice can implement easily. Microsoft, define it as something which Word can implement easily.

      But I still think you don't understand my deeper point that roundtripping with .DOC means the new format needs to be able to handle all the quirks of .DOC. The face of the matter is that .DOC files have a "render like WP 5.x" attribute, so if you want to be able to roundtrip, that must exist in the XML format. You could add a bunch of finer grained attributes that describe how WP 5.x renders and add them to the standard but as soon as you needed to convert back to DOC you'd need to recongnize them and turn them into a .DOC style "Render as WP 5.x" attribute. But doing that is hard, especially if the user might remove some of the attributes when they edit the file. So in practice you pretty much need to have the .DOC atrribute.

      Consider the "render like WP 5.x" to be a high level attribute. It's not easy to convert low level things back to high level ones. Imagine going from assembler back to C for example. More to the point imagine going from C to assembler, editing the assembler and then trying to go back to C. Doesn't seem so "right" now, does it?

      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    10. Re:Open XML is a transliteration by 99BottlesOfBeerInMyF · · Score: 1

      Microsoft spent lots of time and money taking over the market from WP, and they have no incentive making it easy for someone to take over the market from Word.

      The point of having an open standard is the fact that anyone can do this. Once again, MS has promised to create something that is just like and almost as good as, what customers want, but which really is nothing of the sort. The advantages of open standards are that they allow any vendor to provide a solution, and in fact, multiple vendors to provide different solutions for different uses of the format. OpenXML, is may have "open" in the name, but if it is castrated and obfuscated to prevent that, then it is simply trying to mislead people into thinking they are getting the benefits of open standards when they are not. Purchasers don't want open standards or open source, they want the advantages those bring, and MS is great at confusing the issue and giving people as few of those advantages in as limited of a way as possible, while claiming the opposite.

      It is the same strategy as their "shared source" initiative. Ask a manager what open source is and they'll tell you that you can see the code. Ask them the advantages it brings and they get a little fuzzy. Open source means anyone can work on something and you can take bids from multiple developers. It stops lock in and it means you can get free improvements from other contributors. Shared source, means you can see the code if you sign a license that prevents you from getting any of those benefits. It is almost useless except as a way to mislead decision makers that have been advised what to do, but don't understand the mechanism or the difference between MS's version of things and what everyone else is talking about when they say "open source" or "open standard."

      But I still think you don't understand my deeper point that roundtripping with .DOC means the new format needs to be able to handle all the quirks of .DOC. The face of the matter is that .DOC files have a "render like WP 5.x" attribute, so if you want to be able to roundtrip, that must exist in the XML format. You could add a bunch of finer grained attributes that describe how WP 5.x renders and add them to the standard but as soon as you needed to convert back to DOC you'd need to recongnize them and turn them into a .DOC style "Render as WP 5.x" attribute. But doing that is hard, especially if the user might remove some of the attributes when they edit the file.

      Yes, I understand and I think it should be handled by the conversion process. If a person modifies a document and re-saves it they can't expect it not to change. Whether those finer grained attributes are implemented in Word using modern positioning attributes or with the "act like WP" version should not matter or be visible to the end user. You don't have to interpret the low level attributes back into a high level attribute as you claim, provided you account for all the functionality of the low level attributes in Word.

  7. And the sad thing is... by Durkheim · · Score: 5, Insightful

    ...Some people think its fine that way. A friend of mine, quite pro-ms, told me that all those little strange things in the specification where normal to have backwards compatibility, and that reading the specification was a waste of time. Instead, he directed me towards a preview of Ms office 2007. Because for him, as for many more, what's important is the final product, the cuteness of the buttons, the way it works and displays its own format. Why bother using a free program that displays word documents badly, when Office is already perfect huh? I feel so misunderstood sometimes. What makes me sad is that they don't see the use of a clear straight-to-the-point format. Maybe only geeks can be horrified by this one.

    1. Re:And the sad thing is... by westlake · · Score: 3, Insightful
      I feel so misunderstood sometimes. What makes me sad is that they don't see the use of a clear straight-to-the-point format. Maybe only geeks can be horrified by this one.

      The user cares only for the document he sees in print or on screen. The internal structure of the file interests him not at all.

    2. Re:And the sad thing is... by TheRaven64 · · Score: 2, Insightful

      Give him a few years. Like Free Software, the need for Open Standards is only really apparent to people who haven't yet been bitten by their lack. I still have a load of ClarisWorks 1.0 documents from years ago that I can't open; even if I could find my copy of ClarisWorks and the disk hadn't been corrupted, I don't even own a disk drive that could read it anymore.

      --
      I am TheRaven on Soylent News
    3. Re:And the sad thing is... by CastrTroy · · Score: 1

      Doesn't he feel bad that he has to pay hundreds of dollars for a new version of MS Word every couple of years. I mean, that's a lot of money to be spending on a tool that hasn't changed much in the last 10 years (at least in terms of the functionality that most people use). Could you imagine if you had to pay $100 for a hammer, and that even if you didn't use the hammer that much, you still had to buy a new one every 3 years?

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    4. Re:And the sad thing is... by hawaiian717 · · Score: 1

      Well, you wouldn't have to buy a new hammer, just like you can keep using Word 97 if you want. It would be more like, if after a few years, they came out with new nails and you had to buy the new hammer to use the new nails. As long as you only need to use the old nails you're fine with the old hammer.

      --
      End of Line.
    5. Re:And the sad thing is... by DragonWriter · · Score: 1
      The user cares only for the document he sees in print or on screen. The internal structure of the file interests him not at all.


      Actually, many users care about their ability to use the document, not just the appearance. Now, in the short-run, that means the applications that support it now; in the longer-run, though, the underlying format and its adaptability and limitations are a real-but-hidden issue that effects utility, though most users are not able to evaluate it directly.

    6. Re:And the sad thing is... by CastrTroy · · Score: 1

      But what happens when you can't buy any old nails because they don't sell them anymore? And they start selling wood that isn't compatible with the old nails, so even if you have old nails you can't use them, because it doesn't work with the new wood.

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    7. Re:And the sad thing is... by mrchaotica · · Score: 1

      Do ClarisWorks files store text as text (i.e., without mangling it)? If so, just use strings to convert the files to plaintext, delete all the garbage (which used to be formatting) and be done with it.

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    8. Re:And the sad thing is... by hawaiian717 · · Score: 1

      Definitely valid problems, and why the analogy mostly works. In the case of Word, there actually is one advantage: You can make your own documents (nails) with Word 97 (the old hammer) as long as you want; the problem comes in when you try to use documents from other people (new nails). Most people can't make their own nails.

      --
      End of Line.
    9. Re:And the sad thing is... by TheRaven64 · · Score: 1

      For a lot of the documents, images (diagrams in particular) are quite significant components. Thus, extracting the text would not benefit me much (it might only be a few hundred words in a lot of cases).

      --
      I am TheRaven on Soylent News
  8. Divy it up? by plopez · · Score: 3, Insightful

    I wonder if you could get 60 people to review 100 pages each (or divide up chapters or sections in some logical manner). That may be feasible in 1 month. At least the glaring problems would be flagged. I have no idea how to organize this however.....

    --
    putting the 'B' in LGBTQ+
    1. Re:Divy it up? by Dragged+Down+by+the · · Score: 3, Funny

      Why, you could use Office 2007's Auto-Collaborate-Review feature, of course!

    2. Re:Divy it up? by plopez · · Score: 1

      Ooooo.. like using the Death Star to destroy the Imperial fleet!

      Cool!

      --
      putting the 'B' in LGBTQ+
  9. Objection! by Kamineko · · Score: 1
    Submitting contradictions?


    Wait a minute, I know this! This is just Phoenix Wright!

  10. For all the students out there by Catbeller · · Score: 2, Funny

    This is why we oldsters hate Microsoft. 25 YEARS of this.

    1. Re:For all the students out there by Anonymous Coward · · Score: 0

      And, if I humbly add to why I hate them, is that they have a bizarre way of obfuscating the "worldview" by introducing, puuuushing, then mothballing, which some call innovative, technologies, but which I call as questionable and not thought through; throw it at the wall and let's see if anything sticks methodology. Then we are left to understand and support the obfuscation, forever.

  11. I'm shocked! by Anonymous Coward · · Score: 0, Troll

    That Andy Updegrove, who runs a law firm that works for IBM (which has a massive vested interest in making sure that there is one and only one XML word processing format (ODF)) would submit a story to /. pointing out his own article that is critical of Microsoft's XML word processing format.

    Amazing. Who would have thought of something like that.

    1. Re:I'm shocked! by moranar · · Score: 2, Insightful

      I'm shocked too, that someone using ad-hominem attacks would resort to anonymous posting. Amazing. This must be Slashdot.

      The fact that Updegrove might have a vested interest in ODF succeeding doesn't detract from the OOXML proposed standard being a crock of shit.

      --
      "I think it would be a good idea!"
      Gandhi, about Internet Security
  12. Who needs OOXML... by Chris+Mattern · · Score: 2, Funny

    ...when you can have oo-mox?

    Chris Mattern

  13. Open Source community debugs MS code by RichMan · · Score: 3, Funny

    So it looks like the Open Source community is now debugging Microsofts Document format. I am sure Microsoft does not itself know what is going on in here half the time and much of this document was generated by code scrappers looking for structures and interfaces.

    Congrats to the world community but they should really submit a bill to Microsoft.

  14. Yeah, that's a Microsoft product alright by Master+of+Transhuman · · Score: 3, Funny

    "additional Microsoft technology that must be emulated (but is not covered by the Microsoft patent pledge); elements that can't be implemented without Microsoft technical assistance; dependencies on Windows itself; mandatory bugs; and more. And then there's also the fact that OOXML heavily overlaps ODF -- a platform-independent, already-adopted ISO/IEC."

    Pretty much like everything they do.

    Wait - where are the virus APIs? Did they leave those out?

    Naah...

    Gotta be there somewhere. Keep looking.

    --
    Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
    1. Re:Yeah, that's a Microsoft product alright by Anonymous Coward · · Score: 0
      "additional Microsoft technology that must be emulated (but is not covered by the Microsoft patent pledge); elements that can't be implemented without Microsoft technical assistance; dependencies on Windows itself; mandatory bugs; and more. And then there's also the fact that OOXML heavily overlaps ODF -- a platform-independent, already-adopted ISO/IEC."

      Pretty much like everything they do.

      Wait - where are the virus APIs? Did they leave those out?

      Naah...

      Gotta be there somewhere. Keep looking.


      You answered your own question...
  15. Here's a contradiction by Anonymous Coward · · Score: 0

    Does the OO mean "Object Orientated" or "Open Office"? Microsoft always arrive at the poolside late and attempt to muddy the water and ruin it for everyone else. I propose Microsoft remove the ambiguity by renaming it OMXML, "Obnoxious Monopolist XML" which is still awful because the fact it uses XML is irrelevant. How would Microsoft feel if the open source community began prefixing all their work with "MS"? That might help us get more F/OSS past management, since they may be under the impression we are talking about Microsoft software. Hell it works for Microsoft!

    Anyway, the contradiction is that Microsoft want to be seen to be involved, they participate in community standards for all the wrong reasons. They submit technically unsound proposals and generally waste everyones time. How is the naming of their proposed "standard" not intended to cause confusion and slow the adoption of ODF and OpenOffice.org?

  16. What, like... by Mythrix · · Score: 1



    (Content of .doc file)

  17. Re:What, like... (Oops, forgot, no xml tags.) by Mythrix · · Score: 5, Funny


    <microsoft_word_document>
    (Content of .doc file)
    </microsoft_word_document>

  18. Now We'll Now... by segedunum · · Score: 2, Interesting

    ...whether ISO has simply become a dumping ground for people simply wanting to market their stuff as standards (ECMA), or a real standards body.

    As it is, there is not a snowball in hell's chance that OpenXML can become an ISO standard. It is simply a dump of the existing awful doc format into a nice incomprehensible 6000 page document, and it doesn't even use existing ISO standards. There's even a set group of banners and bullet points defined in there which can by no stretch of the imagination be called international.

    I know Microsoft has managed to butter the ECMA up as their usual standards dumping ground, but I simply cannot see how they can get past the shortcomings in that article. To do so would be a huge amount of work (and Office 2007 is already using this format) and it would threaten their Office monopoly - which is what this obfuscation was about in the first place.

    1. Re:Now We'll Now... by I'm+Don+Giovanni · · Score: 0, Troll

      ...whether ISO has simply become a dumping ground for people simply wanting to market their stuff as standards (ECMA), or a real standards body.

      You mean like how ISO rubberstamped the half-spec for ODF that OASIS submitted? You don't even have spreadsheet formulas spec'ed for crying out loud! OO.o is the "reference" implementation. Whenever anyone implements ODF and runs into a wall because the spec isn't fully spec'ed, they say, "Just do whatever OO.o does". Some spec. It's a spec based on and written for OO.o, indeed it's derived from OO.o's previous XML format. And OO.o writes lots of stuff in its documents that are NOT in the spec (spreadsheet formulas being the most well known example).

      ISO rubberstamped ODF with no revisions, no critiques, or any opposition (despite MS being one of the committee members for the ISO rubberstamping; MS raised no objections, quite apart from IBM crying like a baby and getting outvoted 20 to 1 at the ECMA ratification of OOXML).

      And the ECMA process for OOXML was far more rigorous than ISO's rubberstamping of ODF. The ECMA process took over a year, with various revisions. And the parties involved included governments, tech companies (e.g. Apple and Novell), and businesses. The issues that IBM is whining about were raised during that process (Novell themselves (which, I know you guys despise now) raised many of the same issues), and they were dealt with. There were some issues for which doing the so-called "right" thing would have been counter productive as those cases are exremely rare (much more rare than creating spreadsheets with formulas, which OASIS didn't bother to spec for ODF). Others were dealt with the "right" way.

      Pragmatism has its place. THat's something you guys can't seem to get thru your thick skulls. And your double-standards and hypocrisy are beyond belief. "Don't complain about the spec in your brother's eye while ignoring the log in your own."

      And you guys are now running around trying to convince governments to mandate exclusive use of ODF, even when the ODF spec doesn't cover everything that OO.o does, let alone MS Office. And one of the reasons for the latter, is that by mandating exclusive use of a format that doesn't cover MS Office's features, you automatically make those features irrelevant since they couldn't be used, therefore evening the playing field wrt features, not by OO.o catching up, but by government fiat! You guys are too much!

      --
      -- "I never gave these stories much credence." - HAL 9000
    2. Re:Now We'll Now... by codemachine · · Score: 1

      You mean like how ISO rubberstamped the half-spec for ODF that OASIS submitted? You don't even have spreadsheet formulas spec'ed for crying out loud! OO.o is the "reference" implementation. Whenever anyone implements ODF and runs into a wall because the spec isn't fully spec'ed, they say, "Just do whatever OO.o does". Some spec. It's a spec based on and written for OO.o, indeed it's derived from OO.o's previous XML format. And OO.o writes lots of stuff in its documents that are NOT in the spec (spreadsheet formulas being the most well known example). No doubt the ODF standard needs some work, but at least the reference implementation is available with source code. The OOXML spec references implementation details in old versions of MS Word and even Corel Wordperfect, but does not document how they work. At least with ODF you can go and look at the code implementing these edge cases, whereas with OOXML, the products being referenced are closed source and likely covered by various patents that wouldn't allow reimplementation anyhow.

      Plus OOXML just reinvents the wheel everywhere it has a chance (well actually not quite, it just used the old .doc way of doing things in an XML-ized way), whereas ODF actually uses existing a XML standards in many places. ODF, while maybe not feature complete, is an actual attempt at an XML based document standard. OOXML is an ugly reimplementation of .doc whose sole purpose is to obtain "standard" status for government bids, as well as to confuse the general public (even the name is meant to confuse it with ODF - Office Open XML??? Like nobody around the ODF format uses the initials OO and the words Open Office...)

      In the end it may help competing office suites import MS Office documents better, but only versions saved in the new format, which currently excludes about 99% of all existing .doc files. Whereas MS gets to still be the only one with full compliance and backwards compatibility (though they aren't 100% at it either).

      It has already been proven how easy it would've been for MS to just support ODF (others have written the plugins). If ODF was missing features is MS's opinion, why didn't they contribute to the standard instead of reinventing the wheel?
    3. Re:Now We'll Now... by yo_tuco · · Score: 1

      "Whereas MS gets to still be the only one with full compliance and backwards compatibility (though they aren't 100% at it either)."

      And to pour salt on the wound, I hear Microsoft's promise not to sue only applies to those who can fully implement their "Open Standard". And since it appears that only Microsoft can do that, what good it that promise? More smoke and mirrors from Microsoft?

    4. Re:Now We'll Now... by segedunum · · Score: 1
      You mean like how ISO rubberstamped the half-spec for ODF that OASIS submitted? You don't even have spreadsheet formulas spec'ed for crying out loud!
      Simply doesn't matter. That's not the point. The point is that OOXML is trying to be an ISO standard when it doesn't even respect other ISO, or even accepted, standards - a lot outside of the IT industry. I'm not talking about how complete you might happen to think ODF is.

      And the ECMA process for OOXML was far more rigorous than ISO's rubberstamping of ODF.
      I've seen no evidence for that whatsoever. I wouldn't call a panic 6000 page dump of an existing closed binary format into XML rigorous in any way.

      The simple fact is, if it depends on Windows, depends on features in Windows and Office and depends on the quirks of Windows rather than even other ISO standards then it is a poor pathetic attempt at getting something accepted as a show standard in order to quell any interest in ODF.

      That's what OOXML is for, and that is all it's for. I know it. You know it. Microsoft knows it.
    5. Re:Now We'll Now... by complete+loony · · Score: 1

      When Microsoft says "Reverse engineer whatever it was that Word 97 did" or worse one of the obsolete DOS word processors that OOXML has special flags for, you think this is better than "Do whatever OO.o does, oh and here's the code"?

      Sure the ODF spec could do with some work. But if you want to implement it you *do* have access to everything you need to understand it.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    6. Re:Now We'll Now... by I'm+Don+Giovanni · · Score: 1

      "And to pour salt on the wound, I hear Microsoft's promise not to sue only applies to those who can fully implement their "Open Standard". "

      You heard wrong. That was more of IBM's FUD. When are you going to understand that 99.99% of IBM's statements regarding OOXML are to be disregarded as pure FUD (FUD is an invention of IBM). You do NOT have to fully implement the standard.

      --
      -- "I never gave these stories much credence." - HAL 9000
    7. Re:Now We'll Now... by segedunum · · Score: 1
      You heard wrong. That was more of IBM's FUD. When are you going to understand that 99.99% of IBM's statements regarding OOXML are to be disregarded as pure FUD (FUD is an invention of IBM). You do NOT have to fully implement the standard.
      Where is that stated?
  19. Is OOXML Truly Open Source by Anonymous Coward · · Score: 1, Informative

    The biggest question to ask is not whether or not Microsoft provides access to the XML, nor whether Microsoft provides access to a schema, the question to ask is, "Is OOXML Truly Open Source?"

    The biggest issue I have with the OOXML "standard" (and I use the word quite losely) is there are BLOB's (binary large objects) in the OOXML file created by Microsoft. In this BLOB is all the byte code used in the Macros, etc for the file in question (i.e. an Excel file). Since Microsoft has not provided proper instructions (whether it be a schema, or source code) to read the the BLOB containing this information, and how to intrupret this information, I doubt this will ever pass as a true ISO standard, nor be truly accepted as open source (not to mention marcos are still programmed using the Microsoft defined, and patented, VBA rather than using an open source standard such as JavaScript).

  20. More info @ groklaw by mario64 · · Score: 5, Informative

    Check out the article on Groklaw Searching for Openness in Microsoft's OOXML and Finding Contradictions for further comments. The article also has links to a couple of wiki pages with further comments.

  21. ISO maybe, but never an IETF standard by Pascal+Sartoretti · · Score: 2, Insightful
    Maybe they'll get an ISO standard, but I have the feeling that an IETF standard would be out of question. Look at the requirement for being just a "Draft standard" (see here):

    A specification from which at least two independent and interoperable implementations from different code bases have been developed, and for which sufficient successful operational experience has been obtained, may be elevated to the "Draft Standard" level.

    Outside Office 2007, who would ever implement this "standard"?
    1. Re:ISO maybe, but never an IETF standard by SwashbucklingCowboy · · Score: 1
      Outside Office 2007, who would ever implement this "standard"?
      Corel, in their WordPerfect Office product for one, at least that's what their announcement claims.
  22. Surprised? by UnknowingFool · · Score: 1

    If it's true, is anybody really surprised? This is MS after all.

    There is a another analysis on groklaw.

    --
    Well, there's spam egg sausage and spam, that's not got much spam in it.
  23. hope not by suezz · · Score: 1

    here's too it not getting adopted and I hope they kick it back out on the proprietary horse it rode in on.

  24. Hey guys, I've an idea by matt+me · · Score: 2, Funny

    > there's also the fact that OOXML heavily overlaps ODF -- a platform-independent, already-adopted ISO/IEC.
    Couldn't the Microsoft people use the existing standard instead? That way everyone would be able to communicate. Someone should call to let them know about it.

  25. The OOXML acronym by WebCowboy · · Score: 1

    Does the OO mean "Object Orientated" or "Open Office"?

    It means neither. OOXML is shorthand for Opaque and Obfuscated eXception-based Markup Language. However, Marketing rejected the longhand name for the format because it didn't test well in developer focus groups. However, marketing found the shorthand OOXML appealing because psychologists have said the roundness of the O's induces a sense of calmness. BillG liked it because legislators could make an (incorrect) association between OOXML and OpenOffice.org (often abbreviated OO.o), and he hopes the confusion could lead to the inadvertent acceptance of MS' pseudo-open file format in government.

    So OOXML stays but it officially stands for about as much as DVD does, which is nothing (or whatever you want it to stand for--it is all about "personal freedom" after all, so OOXML stands for what means the most to you ;-).

  26. ODF converter ready to go since May 2006 by Anonymous Coward · · Score: 0

    So, we've had an ODF "production ready" converter ready since May 2006. http://www.groklaw.net/article.php?story=200605040 15438308

    Where is it? Why won't Gary Edwards of the OpenDocument Foundation say anything?

    Hey people, how about some promoting of ODF?

  27. You must be new here by deimios666 · · Score: 0

    Oh go ahead and call Microsoft if you wish. But I bet they implemented their own phone standard which is incompatible with ours. Embrace, Extend, Exterminate. 'nuff said.

    --
    I think, therefore you are.
  28. MS is not worried about the format's success! by PostPhil · · Score: 1

    They already know that everyone is locked-in to their proprietary Office formats. This XML "standard" is not created as a real product that Microsoft hopes to promote. That would be a conflict of interest. Instead, they want to make sure that it is a gigantic convoluted spec that no one can implement. It's designed as a distraction. They want the spec to leave you with a feeling of disgust for open XML formats because:
    1. You'll go back to your works-good-enough-for-me Office formats you've already been using (e.g. Word Doc).
    2. If you're the typical uneducated business person, you'll get confused between OOXML and ODF and falsely believe that ODF is that bloated mess of a spec you believe you heard about from your fresh-out-of-high-school IT guy. Well he knows about computers, so that ODF (OOXML? Open Office XML? XML? Open Document?) thing must be a bad idea.

    Not many people know much about Open Office, even many supposed "techs" in many businesses (at least in the U.S.). Microsoft wants to take advantage of their greater mind-share to control public opinion through their usual tactics of FUD and confusion. They want to make sure that the reputation among developers of XML as being a bloated exchange medium will work in their favor by amplifying that perception thereby killing off ODF and any chance of the industry adopting a common format.

  29. OOXML vs. ODF Deathmatch! by dtabraha · · Score: 1

    Wikipedia has a good doc outlining the difference between OOXML and ODF:

    http://en.wikipedia.org/wiki/Comparison_of_OpenDoc ument_and_Microsoft_XML_formats

    It may not be an ISO standard, but it's a heck of a lot better than the completely proprietary older formats.

    How about a good "atta boy" for Microsoft at least? :)

    1. Re:OOXML vs. ODF Deathmatch! by Anonymous Coward · · Score: 0

      "It may not be an ISO standard"

      But that is the subject here. MS is submitting their "Open Document" to be an ISO standard. An ISO standard that relies on proprietary features from a single software vendor! It's a contradiction. "Atta-boy", Microsoft, try again when you understand what "Open" means.

  30. Re:What, like... (Oops, forgot, no xml tags.) by dascandy · · Score: 1

    Doc files don't parse as UTF-8. Try application/octet-stream?

  31. How is OOXML better than older formats? by walterbyrd · · Score: 1

    Or do you mean better for msft?

    The old formats are known. OpenOffice and AbiWord can read the old formats - as can older versions of ms-office. With this OOXML cr@p it's right back to square one.

  32. Oasis OpenDocument by VGPowerlord · · Score: 1

    Why is this still being called Oasis OpenDocument? Have you already forgotten that it's an ISO standard?

    --
    GLaDOS for President 2016! "Well here we are again. It's always such a pleasure." -- GLaDOS, 2011
  33. I think you mean by Keeper+Of+Keys · · Score: 1


    <microsoft_word_document>
    <[!CDATA[
    (Content of .doc file)
    ]]>
    </microsoft_word_document>

    Got to make sure it's valid XML.

    1. Re:I think you mean by dkf · · Score: 1

      That doesn't work; you can't put arbitrary binary data in an XML file because some characters are forbidden (e.g. U+0000). The standard XML way of dealing with this is to base-64 encode the binary data. On the plus side, this does mean that you don't need to fiddle around with CDATA stuff.
      <?xml version="1.0" encoding="UTF-8">
      <microsoft_word_document encoding="base-64">

      [blob of nastiness, err, word document, suitably bloated by the encoding scheme]
      </microsoft_word_document>

      --
      "Little does he know, but there is no 'I' in 'Idiot'!"
    2. Re:I think you mean by Keeper+Of+Keys · · Score: 1

      You'd better include the CDATA block as well, since presumably this blob won't look like a single text node to an XML parser - it will contain things that look like XML tags.