Slashdot Mirror


Is the New Microsoft Office Really Open?

joesklein asks: "From CNET, there is an article about the new Microsoft Office 11. In summary 'Microsoft says it's opening its Office desktop software by adding support for XML--a move that should help companies free up access to shared information. But there's a catch: It has yet to disclose the underlying XML dialect.' Could this be grounds for another anti-trust suit against Microsoft?"

485 comments

  1. sure it is! by Anonymous Coward · · Score: 5, Funny

    it supports .DOC, the de facto standard for documents. What's this XML you're talking about?

  2. Here, I'll answer this simply. by Anonymous Coward · · Score: 0, Informative

    But there's a catch: It has yet to disclose the underlying XML dialect.' Could this be grounds for another anti-trust suit against Microsoft?"

    No.

  3. That's still to be seen... by Eric+Damron · · Score: 2, Interesting

    "In summary 'Microsoft says it's opening its Office desktop software by adding support for XML--a move that should help companies free up access to shared information."

    Are we talking about true standard XML is Microsoft going to "embrace and extend" it?

    --
    The race isn't always to the swift... but that's the way to bet!
    1. Re:That's still to be seen... by C.+Mattix · · Score: 2, Insightful

      Aren't you supposed to "extend" it....
      eXtensible Markup Language...

      Just my $.02

    2. Re:That's still to be seen... by Jondor · · Score: 1

      Besides, who cares.. as long as they control the dtd they can make everybodies life as miserable as before..

      --
      Nobody expects the spanish inquisition!
    3. Re:That's still to be seen... by JebusIsLord · · Score: 3, Informative

      No because the dtd and/or namespace will have to be referenced in plain text in the xml document. so, even if they use absurdly complex element names, they have to use a valid dtd or namespace uri which can be easily referenced, or it just ain't xml at all. Also you aren't allowed to put binary data in an xml document, but even if they did reference their dtd by memory address for instance, its an easy task to just read that address. In conclusion they would have to break xml pretty hard-core in order to make their doc types proprietary. Besides, then what would be the point of going xml in the first place?

      --
      Jeremy
    4. Re:That's still to be seen... by ftobin · · Score: 4, Insightful

      Besides, then what would be the point of going xml in the first place?

      The same point that most technical decisions are based on. Buzzword compliance.

    5. Re:That's still to be seen... by Anonymous Coward · · Score: 2, Interesting

      MIME-encoded binary data, on the other hand, is perfectly happy in a ForeignData XML tag...and MS already ships a product that does exactly that.

    6. Re:That's still to be seen... by Anonymous Coward · · Score: 0

      > Besides, then what would be the point of going xml in the first place?

      You see, you misread what MS is trying to say. It's not xml, it's MS-XML. Like DOS is MS-DOS.. I hope you never make that mistake again.

    7. Re:That's still to be seen... by EnVisiCrypt · · Score: 3, Informative

      The hell you can't put binary data in an XML document. As long as it's base64 encoded you can put anything in there.

      --


      *everything* is Orwellian to cats.
    8. Re:That's still to be seen... by Eryq · · Score: 3, Insightful

      First, you don't have to reference a DTD to produce valid XML. SAX/DOM parsers will work just fine on a document without a DTD.

      Second, you can have "binary" data in an XML document. Just base64 encode it.

      Third: the point of going to XML if you're just going to produce a mess? Simple. You get to claim openness. Most PHBs probably don't know the difference between turly structured, stable, "open" XML, and syntactically-correct but semantically-useless XML.

      --
      I'm a bloodsucking fiend! Look at my outfit!
    9. Re:That's still to be seen... by mccalli · · Score: 2
      First, you don't have to reference a DTD to produce valid XML. SAX/DOM parsers will work just fine on a document without a DTD.

      You certainly do have to reference either a DTD or a schema. I'm aware that most parser implementations will operate on documents without them, but that doesn't make the original documents valid.

      Cheers,
      Ian

    10. Re:That's still to be seen... by 9jack9 · · Score: 5, Insightful
      But they can make it so massively complex that it is very difficult to implement interoperability with foreign tools, but that it is somehow much easier to implement with MS-centric tools.

      The registry in Windows NT/2000/XP is sort of like that. It makes a lot more sense from a Microsoft-centric viewpoint than it does from a non-Microsoft-centric viewpoint. Now that it's been around so long, there are lots of ways to get at registry data (for instance, using Perl modules), but when the registry was new the only way to do it was through the Microsoft API, but until many people went through the pain of encapsulating the MS API, the pain of accessing the registry from a non-MS-centric toolset was high.

      So maybe the XML format will be like that. If you're Linux-centric, for instance, the threshold of pain for accessing Word XML docs will be fairly high, but if you're Microsoft-centric, with all of their tools, code-snippets, documents, etc., then it won't be nearly as painful.

      This way MS gets to claim interoperability, make Word data easily accessible to MS-centric solutions, but put a damper on non-MS-centric solutions.

    11. Re:That's still to be seen... by Fnkmaster · · Score: 2

      You do know that the namespace URI is just that - it's a unique identifier for a namespace, NOT a URL that you can dereference to find anything. The topic of schema URIs (i.e. targetNamespaces) has been debated a million times and it's 100% possible to have a document that validates against a schema or DTD that is identified with a URI at which the DTD or schema CANNOT be downloaded, and which can be stored locally on a machine for validation purposes (the parser uses an internal map to correlate the URI with the schema/DTD).

    12. Re:That's still to be seen... by MrResistor · · Score: 5, Insightful

      No because the dtd and/or namespace will have to be referenced in plain text in the xml document. so, even if they use absurdly complex element names, they have to use a valid dtd or namespace uri which can be easily referenced

      I think an analogy to Frontpage is appropriate here. Sure, it produces HTML, but the result just doesn't look right unless it's viewed in IE. Maybe the dtd is referenced, but encrypted or otherwise proprietary. Maybe MSXMLVIEWER (whatever it may be called) doesn't need the reference to be in plain text.

      There are any number of things MS could do to ensure that the document just doesn't look right in other viewers. Since formatting is the whole point of XML, people will use MSXMLVIEWER and whatever it reads will be the de facto XML standard, just like whatever IE renders is the de facto HTML standard.

      or it just ain't xml at all.

      While technically correct, the point is sadly irrelevant. As long as MS is effectively a monopoly XML will be whatever they say it is, for the majority of people.

      Also you aren't allowed to put binary data in an xml document

      Not true. It's recomended that you don't put binary in an XML document, but nothing prevents you from doing so. This is exactly what will give MS the ability to hijack the standard.

      In conclusion they would have to break xml pretty hard-core in order to make their doc types proprietary.

      Only in spirit, I'm afraid, but that will likely be enough.

      Besides, then what would be the point of going xml in the first place?

      To make documents searchable. This is an ability which is extremely valuable to anyone who has a large amount of information they need to access. The upshot is that the actual content will likely be plain text, though important markups may not be. Sadly, format is more important than content for a lot of people.

      Of course, most people won't use the XML format at all, since it won't be the default.

      --
      Under capitalism man exploits man. Under communism it's the other way around.
    13. Re:That's still to be seen... by 4of12 · · Score: 2

      Heh, I've been thinking the same thing all along...

      <displayhintobject>
      982a2eba7a88a04d7b1132042d3f649b5fcd
      f8136ebcd3d700008f6fe2698df90feecfbe387c1551
      </displayhintobject>
      --
      "Provided by the management for your protection."
    14. Re:That's still to be seen... by Anonymous Coward · · Score: 1, Informative

      the XML specification talks about "well-formed XML" and "valid XML", where the former means valid in all the usual senses of the word, and the latter means "can be validated by a program".

    15. Re:That's still to be seen... by JebusIsLord · · Score: 2

      Actually he's right, you can have valid xml without a dtd or schema, its just completely open and impossible to validate, which means I find it most unlikely MS would go that way because usually dtd-less documents are extremely simple.

      --
      Jeremy
    16. Re:That's still to be seen... by JebusIsLord · · Score: 2

      sorry I mistyped. You can have binary DATA but you cant for instance have binary-encrypted elements (tags, attributes etc.). So the document must remain parsable by a text viewer. They can't for instance put xmlns:0x16f53ea4 or something when referencing the namespace.

      --
      Jeremy
    17. Re:That's still to be seen... by mccalli · · Score: 2
      Actually he's right, you can have valid xml without a dtd or schema

      No - you can have well-formed XML. You can't have valid XML.

      Cheers,
      Ian

    18. Re:That's still to be seen... by Ranger96 · · Score: 1

      Just to be nit-picky: You can't have valid xml without a dtd or schema. Valid XML, by definition, requires one or the other. You can have "well-formed" xml without a dtd or schema.

      --
      What has been will be again, what has been done will be done again; there is nothing new under the sun.-Ecclesiastes 1:9
    19. Re:That's still to be seen... by CondeZer0 · · Score: 5, Insightful

      How does this misinformed crap get moderated up?

      As some others have pointed out:

      1) You don't need a DTD or Schema to have XML
      2) The url used in a namespace declaration doesn't need to correspond to a real document
      3) Even in case the document used a DTD or Schema, that DTD or Scheme where available, and the document actually validated against it, you still don't know what the hell the tags mean, the DTD or Scheme are just syntactical(and grammatical?) rules, and don't tell you how to interpret the tags or attributes.
      4) You can always include binary data in an XML document(ie., base64 encoded)
      5) The point of using XML is Buzzword compliance and *perceived* openness

      There are more reasons why XML not necessarily = openness. But this ones are more than enough.

      XML means nothing, it's just a way to define languages, is like an charset, just because I have a document that is ASCII doesn't mean that I understand what is written on it if I don't know the meaning of the words that are on it(eg., just because you know the name of each letter doesn't mean that you know the meaning of "lkasdertunxsjd", right?)

      Even if a language is in XML, you still need to *document it* to be able to *understand* it.

      Sorry if I was a bit rough, but I'm sick of people that assume that because something is in XML it's automatically open. That is one of the biggest myths the XML buzz-wagon is based on, and is spreaded by people
      that don't really understand what XML is.

      Please, before you post to /. make sure you know what you are talking about.

      Best wishes

      \\Uriel

      --
      "When in doubt, use brute force." Ken Thompson
    20. Re:That's still to be seen... by Anonymous Coward · · Score: 1, Funny

      "We make XML the way it SHOULD have been made in the first place"

    21. Re:That's still to be seen... by Anonymous Coward · · Score: 0

      one will go from this RTF:

      {\rtf1\ansi\ansicpg1252\deff0\deflang1033{\fontt bl {\f0\fswiss\fcharset0 Arial;}}
      \viewkind4\uc1\pard\f0\fs20 Hello, World!\par
      }

      to this?

      Hello, World! ...or will it be even worse than that, with GUIDs, MIME-encoded stuff, etc., all over the place?

    22. Re:That's still to be seen... by Anonymous Coward · · Score: 0

      one will go from this RTF:

      {\rtf1\ansi\ansicpg1252\deff0\deflang1033{ \fonttbl{\f0\fswiss\fcharset0 Arial;}}
      \viewkind4\uc1\pard\f0\fs20 Hello, World!\par
      }

      to this wild guess XML format?
      <rtf1>
      <ansi codepage="1252">
      <def lang="1033">
      <font id="0" charset="Arial">
      <pard>Hello, World!</pard>
      </font>
      </def>
      </ansi>
      </rtf1>

      ...or will it be even worse than that, with GUIDs, MIME-encoded stuff, etc., all over the place?

    23. Re:That's still to be seen... by butane_bob2003 · · Score: 1

      Its going to read/write in MS-XML, so the limitations imposed by the MS-XML parser will apply. Which is to say that the documents could contain anything that can be parsed by the MS-XML parser. Which is and always has been up to microsoft.

      --


      TallGreen CMS hosting
    24. Re:That's still to be seen... by ZlOrB · · Score: 1

      Since when is Microsoft conforming any standard ? As usuall, they will use slightly different mechanism than the standard and change it every version, so it will be hard for anyone else to follow.

    25. Re:That's still to be seen... by jonadab · · Score: 2

      > I think an analogy to Frontpage is appropriate here. Sure,
      > it produces HTML

      No, it doesn't. It produces something that looks vaguely similar
      to HTML, perhaps, but HTML it is not. You look at a FrontPage
      document's source closely, and you see a mishmash of deprecated
      HTML3 markup, newer markup that didn't exist in HTML3 but was
      introduced later, plus the occasional attribute that never
      existed in _any_ version of HTML, thrown in for good measure.

      It is only because of the long-standing practice of browsers since
      Mosaic (possibly before) to ignore any tag or attribute they don't
      understand that a FrontPage document will display at all in any
      browser. (This is fun to try sometime: make up a tag, completely
      out of thin air, and use it in a webpage, and see how various
      browsers handle the page.)

      <voice id="Linus" rate="slow">I pronounce Linux as Linux</voice>
      Any browser will display the quote as if the voice tags weren't
      there at all -- does that make it HTML?

      --
      Cut that out, or I will ship you to Norilsk in a box.
    26. Re:That's still to be seen... by NoMoreNicksLeft · · Score: 2

      Actually, it would be to break XML, as they have tried to do with other "standards".

    27. Re:That's still to be seen... by jonadab · · Score: 2

      I've worked up an even better demonstration

      --
      Cut that out, or I will ship you to Norilsk in a box.
    28. Re:That's still to be seen... by jonadab · · Score: 2

      > You don't need a DTD or Schema to have XML

      You can have wellformed XML without them, but there must be a
      DTD or Scheme in order to have _valid_ XML.

      > The url used in a namespace declaration doesn't need to
      > correspond to a real document

      Or, more to the point, the document at that URL can be an inside
      joke from the movie Ghostbusters, rather than having any actual
      declarations. (Those of you who think I am kidding on this point
      have never tried to access the document that the XUL namespace
      declaration points to.) This, however, is not really important.

      > Even in case the document used a DTD or Schema
      To be valid it has to... anyway, even if it doesn't, there
      is one implicit.

      > that DTD or Scheme were available
      The availability of the DTD or Schema[1] is really not important.
      It would be easy enough to write a program that analyses documents
      that are known to be valid and keeps track of which tags contain
      data, and which ones contain PCDATA, and which other tags they
      have nested in them. Analyse enough documents, and you have a
      subset of the original DTD that's good enough for creating
      documents that are guaranteed to be compatible and can use all
      the features used by the documents you analysed.

      > you still don't know what the hell the tags mean

      Of all the points you made, this is the important one. XML is
      by its very nature a very flexible standard. It's not like HTML
      where a formal standard specifies that <p> is a paragraph and
      that it is a block-level element with certain amounts of white
      space top and bottom and so on and so forth. The tags and
      attributes an the format can be interpreted in whatever way
      the application sees fit.

      In practice, that means another word-processing app can with
      relative ease use the same format in such a way that tools for
      searching and indexing will work on documents created by both apps,
      and it means that if you open a Word document in whatever other app
      that uses that format you can make minor changes (such as wording
      changes) and save it, and when Word opens it again it won't be
      munged (assuming the other app does things in a sane manner that
      preserves whatever markup it doesn't understand). But it does NOT
      mean that the doc will necessarily look the same in the other app
      as it does in Word.

      [1] And when did "schema" become singular, anyhow?

      --
      Cut that out, or I will ship you to Norilsk in a box.
    29. Re:That's still to be seen... by RetiredMidn · · Score: 1
      But they can make it so massively complex that it is very difficult to implement interoperability with foreign tools, but that it is somehow much easier to implement with MS-centric tools.

      Exactly right; and they don't have to make Excel or Word massively complex -- they already are.

      XML is no more than an open-format serialization of the internal model maintained by application. Opening the format via XML makes it easier to parse the serialization, but it doesn't help reconstruct a valid model in another program, nor guarantee that you can modify the model (or create a new one), or write a syntactically correct model that is also semantically correct.

      This is not really a consequence of Microsoft's obfuscation; it is really a limitation of XML (or, more properly, a problem that XML does not pretend to solve).

      OTOH, I think Microsoft is over-representing how XML support "opens" their applications, and/or tacitly allowing the trade press to breathlessly over-represent it for them.

    30. Re:That's still to be seen... by mkweise · · Score: 1

      And when did "schema" become singular, anyhow?

      Always has been, or at least since the time of Plato (around 2300 years ago.)

      The plural of schema is schemata, akin to stigma / stigmata. Both are originally ancient Greek, not Latin as one might think.

      --
      Gentlemen! You can't fight in here, this is the War Room!
    31. Re:That's still to be seen... by Anonymous Coward · · Score: 0

      Ok, so decode this:

      <krumme>det er ikke saa svaert</krumme>

    32. Re:That's still to be seen... by benhaha · · Score: 1

      I bet you didn't know you can still format XML tags with CSS regardless of whether they are valid HTML tags?

      How to Write a CSS Style Sheet for Browsing XML

      --
      NO ID: BEING FREE MEANS NOT HAVING TO PROVE IT
    33. Re:That's still to be seen... by more+fool+you · · Score: 1

      i am someone who is completely ignorant about legalese, i guess it comes down to how many lawyers are protecting your software against theirs. once it becomes plain text, it kinda invalidates the reverse engineering argument, no? at least from my POV

    34. Re:That's still to be seen... by Sesticulus · · Score: 1

      What version of Frontpage do you use? I use Frontpage for my site, I test with and it looks fine on IE, Mozilla, Netscape (haven't tried anything lower than 4), and Opera. I'll admit that Word generates some crap, but Frontpage is fine.

    35. Re:That's still to be seen... by Anonymous Coward · · Score: 0
      Oh yes, I can see it now...


      <?xml version="1.0" ?>

      <document type="word">
      FE54FD28
      3B2F5C35
      9E34FF00
      00FF638D .. ..
      FF34FD00
      </document>
    36. Re:That's still to be seen... by jonadab · · Score: 2

      > Both are originally ancient Greek, not Latin as one might think.

      But alpha is a plural suffix in Greek, too... neuter nom/acc...

      > The plural of schema is schemata

      Oh, duh, I see it now; it's third declension, and the a isn't a
      suffix at all; the root ends in t, which drops off in the nominative
      singular where there's no ending. Why didn't I see that before?

      I learned something today.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    37. Re:That's still to be seen... by jonadab · · Score: 2

      > I bet you didn't know you can still format XML tags with CSS

      Yes, I did know; that's why in the demo I wrote this:
      > (Presumably, this is so the rendering engine for HTML and XHTML
      > can share a lot of information with the one for general XML.)

      But in theory, if we were being strictly specification-complaint,
      that would only work in XML. The demo is served as text/html and
      does not have an xml version declaration (one of those funny things
      with the question marks beginning and end before the doctype (which
      also isn't there in the demo)). So it ought to be treated as HTML
      (or SGML), not XML. In theory.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    38. Re:That's still to be seen... by MrResistor · · Score: 2

      That is exactly the point of my analogy. Thank you for noticing it.

      I predict that MS will do similar things with XML.

      --
      Under capitalism man exploits man. Under communism it's the other way around.
    39. Re:That's still to be seen... by MrResistor · · Score: 2

      You must have a very simple site layout, then.

      --
      Under capitalism man exploits man. Under communism it's the other way around.
  4. LOL by Boss,+Pointy+Haired · · Score: 4, Funny

    Well if the way Microsoft Word saves out as HTML is anything to go by, then concise it most definitely will not be.

    1. Re:LOL by Anonymous Coward · · Score: 1, Insightful


      <DATA>
      asdfafs%65356FG653$5#@$%6Asdtkasdt@ %@#$%@#$%245
      .
      .
      .
      .
      </DATA>
      </XML>

    2. Re:LOL by Anonymous Coward · · Score: 5, Funny



      <head>
      <META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
      charset=3Dus-ascii">

      <meta name=3DGenerator content=3D"Microsoft Word 10 (filtered)">

      <style>
      <!-- /* Font Definitions */
      @font-face
      {font-family:Tahoma;
      panose-1:2 11 6 4 3 5 4 4 2 4;} /* Style Definitions */
      p.MsoNormal, li.MsoNormal, div.MsoNormal
      {margin:0in;
      margin-bottom:.0001pt;
      font-size:12.0pt;
      font-family:"Times New Roman";}
      a:link, span.MsoHyperlink
      {color:blue;
      text-decoration:underline;}
      a:visited, span.MsoHyperlinkFollowed
      {color:purple;
      text-decoration:underline;}
      span.emailstyle17
      {font-family:Arial;
      color:windowtext;}
      span.emailstyle18
      {font-family:Arial;
      color:navy;}
      span.EmailStyle19
      {font-family:Arial;
      color:navy;}
      @page Section1
      {size:8.5in 11.0in;
      margin:1.0in 1.25in 1.0in 1.25in;}
      div.Section1
      {page:Section1;}
      -->
      </style>

      </head>

      <body lang=3DEN-US link=3Dblue vlink=3Dpurple>

      <div class=3DSection1>

      <p class=3DMsoNormal><font size=3D2 color=3Dnavy face=3DArial><span =
      style=3D'font-size:
      10.0pt;font-family:Arial;c olor:navy'>

      I agree.

      </span></font></p>

    3. Re:LOL by commodoresloat · · Score: 5, Interesting
      Or anything close to "standard." The best we can hope for is code that is recognized as valid, and I wouldn't hold my breath for that either. I've seen HTML like the following come out of Word:

      <B><A HREF="http://whatever.org"> Link </B></A>.

      I'm not kidding, either. Seems like an easy thing to avoid in an HTML generator. Validator routinely reports hundreds of coding errors in simple short documents generated by Word. Ugh. What really sucks is when you're working on a web page for someone and cleaning out all the crap that Word generates, then at the last minute they send you the same document with some minor errors corrected.... and all the same major errors generated by Word. Fun.

    4. Re:LOL by Anonymous Coward · · Score: 0

      That should be modded "Paranoid" or "Stupid". "Insightful" is the last thing it is...

    5. Re:LOL by Wolfier · · Score: 2

      How about

      SGFoYSwgaWYgeW91IHJlYWxseSBhcmUgdHJ5aW5nIHRvIGRl Y2 9kZSB0aGlzLCB5b3UgaGF2ZSB0b28gbXVjaCB0aW1lIG9uIHlv dXIgaGFuZHMh

    6. Re:LOL by Wolfier · · Score: 5, Funny
      <?xml version="1.0" encoding="base-64?>
      <!doctype MS_WORD
      <!ELEMENT WORD_DATA>
      ]>
      <WORD_DATA>SGFoYSwgaWYgeW91IHJlYWx seSBhcmUgdHJ5aW5nIHRvIGRlY29kZSB0aGlzLCB5b3UgaGF2Z SB0b28gbXVjaCB0aW1lIG9uIHlvdXIgaGFuZHMh<WORD_DATA>
      </xml>
    7. Re:LOL by JebusIsLord · · Score: 2

      wow you only used 2 elements and no dtd, and its STILL not well formed. congratulations.

      <?xml version="1.0">
      <!DOCTYPE doc PUBLIC "http://www.microsoft.com/xml">
      <doc xmlns="www.microsoft.com/xml" "xml:lang="en">
      <data>
      asdfafs%65356FG653$5#@$%6Asdtkasdt@%@#$%@#$%245
      </data>
      </doc>

      Would be more in line for all you paranoids out there.

      --
      Jeremy
    8. Re:LOL by cmeans · · Score: 1, Offtopic
      I hope this was typed in, rather than pasted in.

      Should the last WORD_DATA be /WORD_DATA, and what's that ";" doing before the /xml tag...looks bad to me.

    9. Re:LOL by loconet · · Score: 4, Informative

      I know exactly what you mean. Word spits out complete garbage when it converts .doc => .html . Microsoft attempted to address this issue by releasing an HTML filter plugin that you can install and cleans up the html word spits out. It does clean up the html but it's still kinda messy.

      --
      [alk]
    10. Re:LOL by jtra · · Score: 1

      Actually text in parent post WORD_DATA does make sense. It is really encoded in base64 (you need to skip spaces)

      see session in interactive ruby:
      irb(main):004:0> 'SGFoYSwgaWYgeW91IHJlYWxseSBhcmUgdHJ5aW5nIHRvIGRlY 29kZSB0aGlzLCB5b3UgaGF2ZSB0b28gbXVjaCB0aW1lIG9uIHl vdXIgaGFuZHMh'.unpack 'm'
      ["Haha, if you really are trying to decode this, you have too much time on your hands!"]

      No, it did took me 1minute.

      --
      -- Wanna textmode user interface for ruby? http://freshmeat.net/projects/jttui/
    11. Re:LOL by Mike+Schiraldi · · Score: 3, Informative

      Dude: mmencode -u

    12. Re:LOL by Wolfier · · Score: 3, Funny

      damn, what happened? I was just trying to type random characters

    13. Re:LOL by Kragg · · Score: 2

      Whew, thanks. I nearly panicked when my SAX parser threw an ELEMENT_DECLARATION_MISSING exception as I parsed the parent.

      --
      If you can't see this, click here to enable sigs.
    14. Re:LOL by realdpk · · Score: 2

      yeah me too. phew! i was like, trippin. OMG OMG, my SAX parser!

      XML 4 EVER

    15. Re:LOL by smyle · · Score: 2

      The demoroniser is your friend.

      --

      Sleep is just a poor substitute for caffeine, anyway. -Bob Lehmann

  5. Reverse Engineer by timothy_m_smith · · Score: 2

    At least with XML it will not be very long until many software companies and project reverse engineer the XML. I suppose they could put some weird binary or encrypted data in the files, but that would defeat the purpose of XML.

    1. Re:Reverse Engineer by Mandi+Walls · · Score: 2
      maybe they'll do the opposite of the .doc format as it is now: encrypt the actual data of the document but let the xml tags hang out in text.

      running "strings" on a .doc xml file would dump just the tags.

      that would be funny.

      --mandi

    2. Re:Reverse Engineer by Phroggy · · Score: 5, Insightful

      I suppose they could put some weird binary or encrypted data in the files, but that would defeat the purpose of XML.

      The purpose of XML is to have buzzword compliance, and this doesn't defeat that.

      (Of course that's not the purpose most other people use XML for, but we're talking about Microsoft.)

      --
      $x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
      $x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
    3. Re:Reverse Engineer by Anonymous Coward · · Score: 1, Interesting
      At least with XML it will not be very long until many software companies and project reverse engineer the XML. I suppose they could put some weird binary or encrypted data in the files, but that would defeat the purpose of XML.

      I fully expect that this is what Microsoft intends to do. Just because a document is in XML, doesn't mean it has to be readable or writable XML. XML, ultimately, is just a term, and a loose format. It is not a guaruntee that you'll actually be able to interpret and re-write it using notepad or vim.

      I have a suspicion that the reason Microsoft set up its next version of Office to be XML compliant is that a few of their customers became interested in Staroffice's idea to do the same thing.

      Looks like a case of 'Deliver buzzword to shut them up' to me.

      As for whether it's grounds for an anti-trust suit?

      Sure, why not.

      Will it matter?

      Not in 2009, when we might expect the trial to end.

    4. Re:Reverse Engineer by Anonymous Coward · · Score: 3, Interesting

      No, of course MS wouldn't put the data in weird binary or encrypted format in their XML output formats ... like they did with Visio 2002's XML output (http://groups.google.com/groups?hl=en&lr=&ie=UTF- 8&oe=UTF-8&threadm=OiH2rn9nCHA.1808%40TK2MSFTNGP10 &rnum=3&prev=/groups%3Fq%3Dxml%2Bvisio%2Bmime%26hl %3Den%26lr%3D%26ie%3DUTF-8%26oe%3DUTF-8%26scoring% 3Dd) where they put all the really important stuff needed for interoperability in ForeignData elements.

      HINT: if you see MS use the phrase "full fidelity" when they talk about their new Office's XML output then you can be sure they're not giving you the data interoperability/portability you thought XML output was going to give you.

  6. Another Trial?? by Jaysyn · · Score: 1

    Maybe they'll get it right next time....

    Jaysyn

    --
    There is a war going on for your mind.
  7. Defaults by Snoe · · Score: 5, Insightful

    RTF has been in office for years and it is an open, portable standard readable on many platforms and with many programs. The problem is that Microsoft chooses to retain their obfuscated binary format as the default save type for documents.

    If the XML files office produce are not made the default save types or if the XML merely encapsulates large portions of binary code, it will not matter one lick that office can save these xml documents because the majority of people will be stuck on the default, unreadable formats.

    1. Re:Defaults by C.+Mattix · · Score: 3, Insightful

      Exactly. And as the maker of a software product it is thier perrogative as to what the default value is. I would hate to have the government telling me what the default values for things should be. If the user's don't use open standard type, yet they are given the oppurtunity to, then it is no longer the software manufacturer's fault.

    2. Re:Defaults by Planesdragon · · Score: 3, Insightful

      RTF has been in office for years and it is an open, portable standard readable on many platforms and with many programs.

      Obviously you haven't tried it. RTF has gotten more complaints from users than raw word Docs does!

      Replace "RTF" with "HTML" and you've got a winner, though.

      The problem is that Microsoft chooses to retain their obfuscated binary format as the default save type for documents.

      It's not "obfuscated" so much as it's "optimized." The whole idea seems to be for Word to save as quickly as possible--which the doc file is best at for Word for some reason, probably becuase it's derived from how the program structures documents, and not how some document spec says documents should be handled.

      If the XML files office produce are not made the default save types or if the XML merely encapsulates large portions of binary code, it will not matter one lick that office can save these xml documents because the majority of people will be stuck on the default, unreadable formats.

      1: It's HIGHLY unlikely that MS's XML implementation will be unnecessary binary code. They have a doc-to-HTML converter allready, and the XML converter will probably just be an update of that.

      2: You CAN change the default Office save format to RTF, HTML, old_doc_version, or just about any random 'save as' converter you have! (The only major feature I saw missing was the MHTML format.)

    3. Re:Defaults by Anonymous Coward · · Score: 1

      The problem with RTF isn't that it is not open, it is just not as useful as XML.

      With XML you get tools like XSLT, SAX and DOM for free. Who cares if it is their own, even pataented (if that is possible) schema. I'll use an XSLT to convert to DocBoook if I need to. Or an XSLT to SQL to stuff info from Excel into mysql. I really don't see how they can do it wrong and still call it XML. If it is well formed and doesn't embed proprietary binaries in it, it is a HUGE step forward.

    4. Re:Defaults by EisPick · · Score: 5, Insightful

      It's not "obfuscated" so much as it's "optimized." The whole idea seems to be for Word to save as quickly as possible--which the doc file is best at for Word for some reason, probably becuase it's derived from how the program structures documents, and not how some document spec says documents should be handled.

      In an era of 2+ GHz computers with 7200+ rpm hard drives, it seems odd that Microsoft would be unable to write an application than can quickly save and open text files that, on average, run well under 50 kilobytes.

    5. Re:Defaults by Anonymous Coward · · Score: 0

      Your sig has Gamer before Christian ;)
      Now that is when you know someone is truely hardcore.

    6. Re:Defaults by dubious9 · · Score: 2

      Yes, make a table and some list in RTF and then open it up in a text editor. RTF is as verbose as it possibly could be.

      Also, microsoft doesn't say exactly how it interprets (i.e. whether this tag has to be before that tag, whether you can say just border instead of border-top, bottom, left right,) so I wouldn't exactly call it an open standard. RTF viewers/writer are very hard to implement.

      --
      Why, o why must the sky fall when I've learned to fly?
    7. Re:Defaults by killmenow · · Score: 1
      I would hate to have the government telling me what the default values for things should be.
      Yeah, well you're not a monopoly convicted of anti-trust violations are you?

      Microsoft is a criminal organization. It's proven in court. Why shouldn't the government be allowed to tell them what to do? IMO, Microsoft lost the right to bitch about the government "regulating" them when they knowingly and willfully broke the law.
    8. Re:Defaults by MadAhab · · Score: 5, Insightful
      You are goddamned fucking lucky that the government tells you what the default values for things should be. That's what the government is there for, mostly; to tell you that the default value for a building is to have a fire exit and that it may not be locked. And without standards, there is no interchangeability of parts. And without that, every consumer and customer gets assraped by manipulative vendors. And since you can never tell precisely how this battery differs from that battery, you just have shit exploding battery acid all over the place.

      But if you really think they have no right doing these things, go live in a 3rd world country; they generallly have the government telling you less about what to do. Except once in a while when they kill your familiy. You could be armed of course. You know what a totally armed society with a weak government looks like? Afghanistan.

      That being said, it's hard to see what business the government has engineering document formats. They could, on the other hand, specify disclosure of formats as a remedy in an anti-trust case, but they generally fall into one of two categories which precludes this: stupid or bought.

      --
      Expanding a vast wasteland since 1996.
    9. Re:Defaults by Anonymous Coward · · Score: 0

      RTF
      RTF what? RTF Article, RTF Manual, help me out here! :)

    10. Re:Defaults by Strange+Ranger · · Score: 2

      Corporations use custom installs all the time to change the default Save Type. A common example was to have everybody's default save type revert to Word 95 (.doc) because only half of the company was up and running on 97.

      So, why don't more companies make RTF or now XML the default save type? They're already doing custom network installs anyway. If a majority of Fortune 500 companies did this it wouldn't matter what Jane & Joe Home User had as their default. They'll be used to what they see at work.

      One might imagine there are many readers here who have some influence over their IT department. Shouldn't be that hard to just say No to default .doc?

      --

      Operator, give me the number for 911!
    11. Re:Defaults by Yi+Ding · · Score: 2, Insightful

      RTF has been in office for years and it is an open, portable standard readable on many platforms and with many programs. The problem is that Microsoft chooses to retain their obfuscated binary format as the default save type for documents.

      Even though RTF is and open standard, many programs which claim compatibility are still not 100% compatible, and can screw up things like embedded images. I supposed Microsoft's implementation of XML will be similar. It will be open, but the more complicated documents would still be displayed differently by non-Microsoft products. It would also force everyone to switch to Microsoft XML, or at least be compatible with it, retaining the dominance of Office.

    12. Re:Defaults by sparkz · · Score: 2

      It is not a particularly open standard - for example, pagenumbers in headers / footers (a pretty common thing to use) is not even mentioned in the specification - the only way to work out how to do it, is to do it in Word, save as RTF, and work out what it does.

      I know - I've had to do it! Even for a relatively simple document, the RTF Spec is not much use - you just have to do it in Word, and replicate that in your own code.

      Oh, and if Word decides it doesn't like the document, it doesn't return an error message, an ill-formatted RTF file is guaranteed to kill Word, and very likely to kill Windows.

      --
      Author, Shell Scripting : Expert Re
    13. Re:Defaults by tshak · · Score: 3, Insightful

      Most businesses do not build game machines.

      In an era of practicallity most offices are still running on 500mhz boxes with 128MB of RAM and 5400rpm HD's.

      --

      There is no longer anything that can be done with computers that is nontrivial and clearly legal. -- Paul Phillips
    14. Re:Defaults by interiot · · Score: 2
      • Most businesses do not build game machines.
      Hear hear. I work at a Fortune-100 company (well, it was last year anyway), and my current machine is sloooow and has very little memory. I've managed to make it resonably peppy by replacing Outlook with a remote Mutt (HUGE improvement, if only for the 30mb ram savings), and making it just be a dumb terminal for remote Solaris boxes. The only things I run locally are TeraTerm, VNC, Winamp, and Phoenix. Now if Phoenix wouldn't be such a hog, I'd be happy.
    15. Re:Defaults by dillon_rinker · · Score: 5, Informative

      Yup. Government standards are why you can buy screws and nuts from different manufacturers and have them work together. They are why you can buy "orange juice" at the grocery store and know that it's not "juice" wrung out of a pile of autumn leaves (hey, it's juice, it's orange, what more do you want?). Government standards are why you can fill fly in an airplane and know it won't crash.

      Sure, all these needs could be fulfilled by voluntary industry standards, if it weren't for those pesky human beings, fallible and greedy creatures that they are.

    16. Re:Defaults by Anonymous Coward · · Score: 0

      Optimized? What the fuck are you smoking, pal. The format used for Word, and Office in general, is neither optimized for speed nor memory... nor robustness for that matter. It's designed to be as difficult to decode as possible, without sabotaging the ability for Microsoft's own apps to open it -- and it's a fine line, as anyone who has ever tried to open a Word document, only to find the memory dump that passes as a saved document bomb the fucking machine when Word skips a beat.

    17. Re:Defaults by Anonymous Coward · · Score: 0

      A clue to how Word works with other file types is that it converts everything to RTF, even between Word versions, when importing/exporting word documents.

      The Word document format is an OLE Compound Document (or whatever it's called now). You need to pick the pieces out of that.

    18. Re:Defaults by g4dget · · Score: 2
      RTF has been in office for years and it is an open, portable standard readable on many platforms and with many programs.

      I would dispute that RTF is "portable" or "standard". However, whatever it is, it simply does not seem to preserve appearance and markup sufficiently well to be used as an interchange format. Perhaps it could in theory, but in practice, it doesn't seem to.

    19. Re:Defaults by Malcontent · · Score: 2

      Who said anything about the govt? Unless of course you mean that the govt should not provide courts so that people can sue each other or that there should be no laws so that people can't be tried.

      --

      War is necrophilia.

    20. Re:Defaults by Galvatron · · Score: 2

      But many home users do. It's not like people only run Office in the office, we can also see how well this software performs on hour home machines.

      --
      "The question of whether a computer can think is no more interesting than that of whether a submarine can swim" -EWD
    21. Re:Defaults by siphoncolder · · Score: 1
      Did you forget that Word does more than text?

      Think:

      Inserted Pictures.
      Office connectivity with other Office apps.
      Mail Merging.
      Document Templates.
      VBA & Add-Ins. (blah blah virii - the functionality is a godsend for data processing and feature functionality)

      If you're using plain text, use Notepad (whoa, Microsoft made that one, right? Opens 50K text files pretty damn fast if you ask me). Word doesn't fit into your rant.

      --
      i'm amazed that i survived - an airbag saved my life.
    22. Re:Defaults by Anonymous Coward · · Score: 0


      RTF
      RTF what? RTF Article, RTF Manual, help me out here! :)


      RTF, noob!

    23. Re:Defaults by Planesdragon · · Score: 1

      Even though RTF is and open standard, many programs which claim compatibility are still not 100% compatible, and can screw up things like embedded images. I supposed Microsoft's implementation of XML will be similar. It will be open, but the more complicated documents would still be displayed differently by non-Microsoft products. It would also force everyone to switch to Microsoft XML, or at least be compatible with it, retaining the dominance of Office.

      Actually, they're MUCH more likely to simply use the model they used for the office HTML system.

      Use what's there when it works for replicating .doc, and extend it where it doesn't.

    24. Re:Defaults by tshak · · Score: 1

      My mom is still running on a 450 Cellery with 96MB. Many people at home don't have the money to upgrade every 3 years.

      --

      There is no longer anything that can be done with computers that is nontrivial and clearly legal. -- Paul Phillips
    25. Re:Defaults by tshak · · Score: 2

      Run Opera instead of Phoenix. It's extremely lightweight especially considering how many features it has.

      --

      There is no longer anything that can be done with computers that is nontrivial and clearly legal. -- Paul Phillips
    26. Re:Defaults by Galvatron · · Score: 1

      Are you deliberately being stupid here? SOME people do upgrade frequently, and SOME of those people run Office. Office is still slow on new, expensive machines (not really slow, but slow enough that its obvious the MS Office file format is not designed the way it is because of speed). The first guy you replied to was not saying that all machines are fast. All he was saying is that MS Office still takes a second or two to open files, even on very fast machines.

      --
      "The question of whether a computer can think is no more interesting than that of whether a submarine can swim" -EWD
    27. Re:Defaults by donutello · · Score: 4, Insightful

      Government standards are why you can buy screws and nuts from different manufacturers and have them work together.

      Nonsense. Screw and nut sizes have been standardized without government involvement.

      --
      Mmmm.. Donuts
    28. Re:Defaults by donutello · · Score: 3, Insightful

      Amazing how many points you got wrong.

      You are goddamned fucking lucky that the government tells you what the default values for things should be. That's what the government is there for, mostly; to tell you that the default value for a building is to have a fire exit and that it may not be locked.

      That's a safety standard. The government does not tell you what color the walls should be, however. It doesn't tell you whether you should use carpet or hardwood on the floors.

      But if you really think they have no right doing these things, go live in a 3rd world country; they generallly have the government telling you less about what to do. Except once in a while when they kill your familiy. You could be armed of course. You know what a totally armed society with a weak government looks like? Afghanistan.

      Assuming you're talking about Afghanistan before the US bombed the hell out of it, you are wrong again. The government in Afghanistan told you exactly what you could or could not do. It told you what you could wear and how much. It told you how long to keep your beard. It told you whether you could study or not (if you were a woman). It told you what you could study. It told you who you could sleep with.

      --
      Mmmm.. Donuts
    29. Re:Defaults by bubbha · · Score: 1

      I've posted about this before. I work in the Phama area. We collect data in clinical trials for new drugs. Some of the data must be kept for decades (till the patients die.) It must able to be processed 30 or so years from now. Same with our documents. Storing this data using published open file formats is a requirement. Microsoft characterises these situations as discriminatory. It's not. It's a system requirement. If they want to get into this kind of business, they have to build software that meets the requirements.

      --
      I want to be alone with the sandwich
    30. Re:Defaults by Anonymous Coward · · Score: 0

      You hate the government to tell you what to do
      regarding open standards? Yes, it is why the
      convicted criminal at Microsoft has grown to such levels that
      nobody can tame. YOU, and your right-wing poems,
      are directly responsible for the insanity
      that you praise. (And the moderators have
      marked you "insightfull". Baffoons!! )

    31. Re:Defaults by Kragg · · Score: 2

      It told you who you could sleep with.
      Mmmm.. Donuts


      Dude, you're sick.

      --
      If you can't see this, click here to enable sigs.
    32. Re:Defaults by Anonymous Coward · · Score: 0
      Sir,

      Please go eat a cock.


      Thank you.
    33. Re:Defaults by Kashif+Shaikh · · Score: 2

      Government standards are why you can fill fly in an airplane and know it won't crash. ...and have the wonderful assurance that hundreds of seagulls and other birds were literally used to test the wing propellors.

      I should know, since my father worked for GE back in the days when they were in Aviation building bird-resistant propellors and missile shells. Though, I don't know if they still do that stuff. But my father found the bird-testing sickening as do I.

    34. Re:Defaults by Kashif+Shaikh · · Score: 2

      Microsoft would be unable to write an application than can quickly save and open text files that, on average, run well under 50 kilobytes.

      You haven't ever stored pictures in word files, have you? Just having a couple of big pictures makes the size of doc file grow around 5 to 10 megs.

    35. Re:Defaults by kalidasa · · Score: 2

      In an era of 2+ GHz computers with 7200+ rpm hard drives, it seems odd that Microsoft would be unable to write an application than can quickly save and open text files that, on average, run well under 50 kilobytes.

      Problem is, that's 50 kB for a one-page memo inviting both colleagues in your department for lunch.

    36. Re:Defaults by Zordak · · Score: 2
      My mom is still running on a 450 Cellery with 96MB. Many people at home don't have the money to upgrade every 3 years.
      Ha, I've got you beat. My mother-in-law is running on a 75 MHz first generation pentium Compaq POS with a whopping 24 megs of RAM. About 4 months ago, I got in trouble with my wife for building us a system to replace our old 350 MHz box with 64M RAM and a 4G HD. Forget about having the money to upgrade every three years. When I built the new system, we tried to give the old one to the in-laws (I got frustrated when they bought a new HP printer, and it took like an hour to load the drivers on the old 75), and after one day, they told me to come back and hook up their old box again. It seems they didn't like not having their Compaq address book (some silly 2-bit app that came pre-installed), and couldn't be productive transitioning from MS Works to Word 2000. I tried to tell them that I could export the old address book and they'd have all the features plus many more if they would take the trouble to larn the new apps, but their mindset was that it was hard enough to learn how to use the stupid thing the first time around, and they didn't want to have to do it again. Figure out how to overcome that problem, and you've really got something.
      --

      Today's Sesame Street was brought to you by the number e.
    37. Re:Defaults by fejikso · · Score: 1

      It depends how you save it... There's something called "PASTE SPECIAL" which let's you specify the format (JPG, PNG, etc) in which the image will be stored in the document.

    38. Re:Defaults by gvonk · · Score: 2



      Uh, OK.

      Swap the hard drives.

      Problem solved.

      --


      El Karma: excelente(principalmente la suma de moderación hecha a los comentarios de los usuarios)
    39. Re:Defaults by g4dget · · Score: 2

      And any design that mixes up the images with the text and thereby risks writing 5-10Mbytes every time you save a document is seriously broken. There are better ways of keeping images and text together than OLE structured storage.

    40. Re:Defaults by cam_macleod · · Score: 1

      "Most businesses" ? Man, I wish I worked there. I do training and support for people still using P166's with 64MB RAM, 14" monitors, etc. It's sad. Actually, not sad -- nightmarish.

    41. Re:Defaults by ratamacue · · Score: 1
      Sure, all these needs could be fulfilled by voluntary industry standards, if it weren't for those pesky human beings, fallible and greedy creatures that they are.

      There is a glaring hole in this argument. Government is an organization comprised of unique, thinking individuals who are primarily motivated by self interest, just like any organization in the private sector. Human beings in business are pesky, fallible and greedy, you say? Well, human beings in government are absolutely no different. What makes you think you can trust the individuals in government any more than the individuals in Wal-Mart? Especially considering that government has the unique authority to conduct their business by force?

    42. Re:Defaults by Anonymous Coward · · Score: 0

      Exactly!
      If M$ knows that, why is windows so bloated up?
      And have you ever wonderd why an internet explorer install file is 20 MB?

      Anyhow good thing word shaves off those miliseconds so i can manage to run windows......

    43. Re:Defaults by Anonymous Coward · · Score: 0

      I think you hit the nail on the head... No one should blame Microsoft for putting out crap software that doesn't conform to a "standard". It's up to the customer to dictate what they need and only use what satisfies that need.

    44. Re:Defaults by dublin · · Score: 2

      RTF has been in office for years and it is an open, portable standard readable on many platforms and with many programs.

      Obviously you haven't tried it. RTF has gotten more complaints from users than raw word Docs does!

      Replace "RTF" with "HTML" and you've got a winner, though.


      OK, Let's see you put a page break in that HTML document... Seriously, an extended HTML could make a very nice document format, some of the better ones, like the one used by HTMLDOC actually *do* let you put in line breaks and such. I've started using HTMLDOC to generate lots of my documentation now, because it does a pretty good job of retaining the gist of the formatting and produces very nice PDFs from the same web pages I have to generate anyway. This product has really improved lately. In fact, the only thing wrong with HTMLDOC, IMO, is that it uses the GPL rather than a truly free license.

      Now if only the Netscape/Mozilla team would add support for the HTMLDOC extended tags in Composer, and make HTMLDOC a standard output filter option (which would dramtically improve their ability to print web pages, anyhow...) we'd really have something.

      --
      "The future's good and the present is nothing to sneeze at." - Roblimo's last ./ post
    45. Re:Defaults by Planesdragon · · Score: 2
      OK, Let's see you put a page break in that HTML document...
      <br clear=all style='mso-special-character:line-break;
      page-bre ak-before:always'>
      MS extended HTML to accomodate page breaks and other features that Office supports but most standard web page editors don't. And the fun part is that, aside from the bloated document, they don't really impeded the HTML rendering.

      I remember HTMLDOC. It looked interesting to start off with, but it's missing a certain something to make it a worthy standard.

      I'd rather Mozilla support MHTML first--or even CHM, or even just the Moz-help system! (If it does and you know it, feel free to correct me with a link...)

      This product has really improved lately. In fact, the only thing wrong with HTMLDOC, IMO, is that it uses the GPL rather than a truly free license.

      If you don't like it, don't use it. Feel free to write you own, or buy Acrobat.

      The GPL is fine and dandy for standard, public-commons systems that run by themselves without amalgration with any other software. It only imposes on a justifiable freedom if it's used on a standard module, library, or format.
  8. Embrace and Extend by Anonymous Coward · · Score: 0

    Is it open? Yes and no. They meet some of the standards and then add some of their own. Embrace and extend: the Microsoft strategy for market domination since day 1.

    1. Re:Embrace and Extend by Rick+the+Red · · Score: 5, Funny

      The difference between Microsoft and their competitors is that MS is willing to take a long-term view:

      1) Establish a monopoly on office productivity software
      2) Profit!
      3) See income drop once everyone has Office. Market saturation!
      4) Less Profit :-(
      5) Release new Office with new file formats; use monopoly to get it pre-loaded on all new PCs.
      6) Eventually everyone else upgrades Office in order to read new file formats they're getting from their co-workers.
      7) Profit!
      8) Release new OS with filesystem that looks like a database.
      9) Release YAO (Yet Another Office) [see 5 & 6] that only works with new database/filesystem in new OS.
      10) Now, not only do the masses have to upgrade Office to read co-workers files, they have to upgrade Windows as well.
      11) Profit!!!!!

      --
      If all this should have a reason, we would be the last to know.
    2. Re:Embrace and Extend by Anonymous Coward · · Score: 0

      I wish I could find a computer with office preloaded on it... All the one's I've found you still have to pay for it.

      NOTE: I still only use Office 97, as does my work/parents/most people i know.

    3. Re:Embrace and Extend by i8msft · · Score: 1

      Dude, I think you nailed it.

      Combine your comments with the earlier post about the amount of legacy corporate data in .doc memos, files, proposals, letters, etc. and it's becoming more obvious that the XML move isn't that big a deal from M$FT's perspective.

      Cheers

      From someone who is so comfortable with my choice of religion I don't feel the need to put in my sig...

      --
      Keep It Simple
  9. Can you copyright/patent a schema ? by aron_wallaker · · Score: 5, Insightful

    The big question (to me) is whether Microsoft can put a legal encumbrance on the XML schema they use for a new file format. Could you publish a schema but have it so wrapped in legalese that (for example) open source projects could not be allowed to use it ?

    1. Re:Can you copyright/patent a schema ? by Mysticalfruit · · Score: 2

      That's exactly what I'm thinking they'll do. They'll be a bid disclaimer in the XML that says "These Schemas are for use the intellectual property of microsoft. Use of any program not licensed by microsoft to interpret the data stored within these schemeas is a breach of copyright..." or some other type legalese...

      --
      Yes Francis, the world has gone crazy.
    2. Re:Can you copyright/patent a schema ? by davmct · · Score: 2, Insightful

      I don't think MS is so worried about people making their own OpenSource software to interpret the XML as it will most likely not be as efficient as MS software.
      as far as content is concerned, anybody could write their own xml parser, what MS knows is going to sell more copies of Word et al. is the fact that it has a strong support for embedding ActiveX objects. So, the next time you want to embed a Rational Rose UML diagram in your word document, you'll most likely find that other software packages aren't going to interpret how this is stored in xml as well as the MS Office suite could.

    3. Re:Can you copyright/patent a schema ? by Anonymous Coward · · Score: 0

      Either one, actually. A schema is an intellectual work, and copyright adheres to it automatically here in the U.S. There are also existing patents which are of broad enough extent to cover any XML implementation of an idea; see ContentGuard's patent on the notion of rights expression languages
      (http://www.contentguard.com/patents.as p) for a
      relevant modern example.

    4. Re:Can you copyright/patent a schema ? by anonymous+loser · · Score: 2
      No, because reverse-engineering for interopability is specifically allowed by the DMCA:

      `(f) REVERSE ENGINEERING- (1) Notwithstanding the provisions of subsection (a)(1)(A), a person who has lawfully obtained the right to use a copy of a computer program may circumvent a technological measure that effectively controls access to a particular portion of that program for the sole purpose of identifying and analyzing those elements of the program that are necessary to achieve interoperability of an independently created computer program with other programs, and that have not previously been readily available to the person engaging in the circumvention, to the extent any such acts of identification and analysis do not constitute infringement under this title.
    5. Re:Can you copyright/patent a schema ? by roemcke · · Score: 1
      Sorry, but an XML-document is not a computer program :(

      The same thing was said about dvd-movies in the DeCSS case

    6. Re:Can you copyright/patent a schema ? by gazbo · · Score: 1
      What? But he just talked about the DMCA, including an extract with some bits highlighted, and used it to explain how MS will be evil, and OSS will be hindered.

      How could you possibly disagree with such slashdot wisdom?

    7. Re:Can you copyright/patent a schema ? by anonymous+loser · · Score: 2

      WTF are you talking about? The question was whether the XML be copyright such that OSS projects are not allowed to use it. I showed the part from the DMCA that specifically says you are allowed to reverse engineer code in order to achieve interopability. It has nothing to do with MS being evil, it has to do with how the law is worded.

      BTW I personally would consider XML a computer program in this case (it is a compuer language describing/implementing a particular function...isn't that pretty much the definition of a program?), but I guess that's up to a judge.

  10. XML... sharp?!? by wikthemighty · · Score: 2, Interesting

    Once again MS will embrace a standard, only to warp it enough that you get stuck using their version anyway...

    --
    "There are people who do not love their fellow human being, and I _hate_ people like that!" - Tom Lehrer
  11. Lovely... by 9Numbernine9 · · Score: 1

    I'm going to be the pessimist (realist?) and say we can probably expect an "embrace and extend" maneuver from Microsoft. Allow Office 11 to use XML + some "Microsoft Office extensions" and .NET support, and then watch as businesses use it over standard XML...

    --
    Illegitimi non Carborundum.
    1. Re:Lovely... by ceejayoz · · Score: 5, Funny

      Oh no! Heaven forbid someone extend the eXtensible Markup Language!

  12. "XML dialect"?!? by TrevorB · · Score: 4, Interesting

    "XML dialect"?

    It's called a schema.

    Talk about embrace and extend. Sounds like this will be more "XML-like" than real XML... :)

    1. Re:"XML dialect"?!? by ProtonMotiveForce · · Score: 1

      Wow, you sure like to be wrong. "XML dialect", while vernacular, is certainly a reasonable thing to say in the general sense.

      Who died and made you incorrect corrector of common terms of speach?

    2. Re:"XML dialect"?!? by Samrobb · · Score: 1

      I think "dialect" is an entirely appropriate term... while "schema" may be the technical term, using "dialect" brings to mind the idea of a California Valley Girl trying to converse with someone from northern Ireland. The point is that even while both are speaking English, they're very different kinds of English, and communication is not neccesarily as simple as you would think...

      --
      "Great men are not always wise: neither do the aged understand judgement." Job 32:9
    3. Re:"XML dialect"?!? by Frobnicator · · Score: 5, Funny
      Who died and made you incorrect corrector of common terms of speach?
      ahem. speech

      :-)

      frob.

      --
      //TODO: Think of witty sig statement
    4. Re:"XML dialect"?!? by Anonymous Coward · · Score: 0
      It's called a schema.
      Unless it's a DTD.

      Dialect is entirely appropriate as an all encompassing term, unlike your suggestion. It's not even clear if Microsoft are using W3 Schema or their own propritary method, which really shoot the fuck out of your post.

    5. Re:"XML dialect"?!? by Anonymous Coward · · Score: 0

      Except for the fact that the person from Northern Ireland may not be speaking english, but gaelic.

    6. Re:"XML dialect"?!? by kaphka · · Score: 2
      "XML dialect"?

      It's called a schema.
      No. A schema is a set of rules that defines which constructs are allowed and which aren't. A dialect is what you get when you implement a schema.

      Think about the word language and the word grammar. Many people are perfectly good at speaking the English language, even though they know very little about English grammar. (Quick, is English a head-final, head-medial, or head-initial language? You don't know? Yet you managed to read that sentence just fine.)

      It's a subtle distinction, but it's real. If you happen to know what language and grammar mean in the technical sense, then it should be even clearer.

      Of course, I don't know what a CNET reporter's alleged misuse of the word "dialect" could possibly tell us about Microsoft's plans for world domination, but I assume that part of your post was just a troll.
      --

      MSK

    7. Re:"XML dialect"?!? by ipjohnson · · Score: 1

      Actually Nothern Ireland is under british rule and english the national language. So chances are they are speaking english, whether you can understand him/her is a whole other matter :)

    8. Re:"XML dialect"?!? by Spruce+Moose · · Score: 1

      Gag me with a spoon!

  13. My Guess..... by jamesdood · · Score: 2, Interesting

    Would be that it will be "open" to other Microsoft technologies. This has been their method of operation in the past. As long as you only have a Microsoft environment everything works well with each other..

    --
    *narf!*
  14. Duh? by natefaerber · · Score: 0, Troll

    At what point did anyone NOT see this coming?

    --
    -- My HARDWARE, My CHOICE.
  15. You've seen Microsoft generated HTML by Anonymous Coward · · Score: 1, Interesting

    Now, just think of the many ways they can screw around with XML. I bet IE and other MS readers will only do real basic XML validation, too. Then we'll have people telling other XML readers that their tools should be as understanding to bad XML as MS products.

    1. Re:You've seen Microsoft generated HTML by JebusIsLord · · Score: 2

      IE does a pretty decent job of parsing xml already actually. Its perfectly strict. 6 Does make some errors though that Moz gets right.

      --
      Jeremy
  16. "Could this be grounds for another lawsuit?" WTF? by Wakko+Warner · · Score: 5, Funny

    Yes, mister Hairtrigger, we should sue Microsoft simply because they won't release trade secrets. We will surely win.

    - A.P.

    --
    "Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"
  17. This illistrates the shortcoming of XML by Anonymous Coward · · Score: 4, Insightful

    I've always said the XML Emperor has no clothes: all XML is is a meta-framework for markup languages. No more, no less. And pointless if schemas are never disclosed.

    1. Re:This illistrates the shortcoming of XML by halftrack · · Score: 2

      I've always said the XML Emperor has no clothes: all XML is is a meta-framework for markup languages. No more, no less. And pointless if schemas are never disclosed.

      I think that's the way many programmers think (I know I do) it's just a way to avoid yet-another-file-parser for every project. And some - Norwegian SGML guy with a name comes to mind - is not a true, open format because the programmers can use schemas, you still need to know the schema. But then again, is it possible to create a open format which supports everything per default, is human readable and - to the extreme - does not require knowledge about the language. How are aliens going to crack the ASCII code in a binary radio stream from earth? Is there some formulae that makes it easier to decypher than hyeroglyphs?

      --
      Look a monkey!
    2. Re:This illistrates the shortcoming of XML by rob_from_ca · · Score: 2

      I mostly agree, but it can't be totally useless if you can define a Turing machine with it...:-)

      http://www.unidex.com/turing/tmml.htm

    3. Re:This illistrates the shortcoming of XML by Matts · · Score: 2

      Nonesense. As the author of one of the available OpenOffice to HTML (and DocBook) converters out there, I can honestly say we did most of the work without the Schema in front of us (especially since that Schema is a 400+ page pdf). We just used plain old reverse engineering principles most of the time. Works damn well, and XML makes it infinitely simpler than a binary format.

      --

      Matt. Want XML + Apache + Stylesheets? Get AxKit.
    4. Re:This illistrates the shortcoming of XML by g4dget · · Score: 2
      XML has some clothes. While you may not be able to understand the content of arbitrary XML documents, you can understand their structure. That enables a lot of things that would not be possible with formats like Word's native format or even other markup languages.

      For example, being able to understand the structure of XML documents makes reverse engineering much easier. It also lets you embed one XML document inside another and deals with the resulting namespace issues correctly. And there are many other things that XML helps with--it's not sufficient for a universal format, but it takes care of the nitty-gritty that, if not taken care of, can break portability.

    5. Re:This illistrates the shortcoming of XML by Dirtside · · Score: 2

      Why is it a shortcoming? XML was designed to be a meta-framework for markup languages. That's all it's designed to do, and that's what it does. It's not a shortcoming if something does what it's designed to do. :)

      --
      "Destroy science and religion. Science would re-emerge exactly the same; but not religion." - Penn Jillette, paraphrased
  18. NO! by halo8 · · Score: 2, Insightful

    Could this be grounds for another anti-trust suit against Microsoft?

    No it is not...

    The Bush administration made it clear on the first day they wanted this to go away. As long as Billy isnt taking your 401K im sure no one is going to bother him for a while..

    How many Millions were spent on this farce? and for what? a verbal reprind from the judge? think about it.. all that money could have gone into tanks and bombs to bomb other countries and free us all from "terror"

    --
    The More Knowledge you have the Luckier you Get- J.R. Ewing
    1. Re:NO! by WasterDave · · Score: 3, Insightful

      Y'know, before posting I thought I'd check to see if anyone else had put what I was going to put. Tadaa, problem solved.

      After years of work, hundreds of thousands of lawyer man-hours, what do we have to show for it? "Expose your API's unless they are to do with security, and don't be bad again". Honestly, this should have been a bitch slapping of biblical proportions. Not only should the company have been broken up, but a tier 1 deity should have rained down the wrath of the ancients in order to make it happen.

      Another anti-trust suit? I don't think anyone's going to be going down *that* road in a hurry.

      Dave

      --
      I write a blog now, you should be afraid.
    2. Re:NO! by schon · · Score: 2, Insightful

      all that money could have gone into tanks and bombs to bomb other countries and free us all from "terror"

      OK, so is this a good thing or a bad thing?

    3. Re:NO! by halo8 · · Score: 2

      What do YOU think it is?

      who are YOU going to be voting for?

      eather wich way.. im a Canadian in "soviet Canuckistan" so i dont really care... i just use words like "we" and "us" to SOUND like an american so i can karma whore...

      saying "WTF cares im a Canadian how dose this effect me?" didnt help my karma any.

      --
      The More Knowledge you have the Luckier you Get- J.R. Ewing
  19. Why not by haplo21112 · · Score: 1, Flamebait

    Lets all pile on the one company that drove home the standards for the PC industry for years, and made sure that there would be one common platform that everyone could write to, and achived critical mass so there wouldn't be conflicting standards anymore....
    For those who are clueless its supposed to be funny...paraphrasing MS themselves...before you troll me, at my comments in the past about good old M$....

    --
    Power Corrupts,Absolute Power Corrupts Absolutely, leaving one person(group)in charge is absolutely corrupt.
    1. Re:Why not by mao+che+minh · · Score: 2

      This may all be true, but Microsoft never achieved the technological prowess and glory (IMHO) that Sun Microsystems enjoys with their achievement of making the computer the network.

  20. what does it matter by greechneb · · Score: 5, Insightful

    No matter what microsoft does, all they will get is a slap on the wrist. Microsoft will just point to staroffice and openoffice and say, hey, there's compitition, its not a monopoly.

    Big deal if they don't open it up anyway (I don't really expect them to), staroffice/openoffice will crack it to a certain extent anyway. For most people's file conversions, its not that much of a difference to convert documents. Doesn't always look pretty, but it works fairly well.

    Wake me up when something Microsoft does is suprising...

    1. Re:what does it matter by moreati · · Score: 1


      I don't think it was ever claimed that MS Office is a monopoly. Windows on the other hand is. I can't see an antitrust angle to bastarized file formats/protocols, other than opening them as a resolution.
      </bite>

    2. Re:what does it matter by Anonymous Coward · · Score: 0

      Speaking of Office applications...

      My wife purchased Office 2000 a few years back. However, at work we still use Office 97. Believe it or not, I have had to use Open Office to convert "correctly" between MS Office 97 and MS Office 2000 (and vice versa) on several occasions.

      Enjoy your "lock in" to the monopoly. I'm outta there. Open Office does what I need and I save US $400 every 24 months!!

      AC

  21. InfoWorld articles by andynms · · Score: 5, Informative

    There are a couple of good articles on this at InfoWorld. Try here and here.
    Good quote:
    THE GOOD NEWS is that Office 11 supports XML Schema. The bad news is that XML Schema has been described even by XML experts as "confusing," "impenetrable," "fuzzy," and "as user-friendly as a stick in the eye."

    1. Re:InfoWorld articles by ifreakshow · · Score: 1

      <i>The bad news is that XML Schema has been described even by XML experts as "confusing," "impenetrable," "fuzzy," and "as user-friendly as a stick in the eye." </i>

      The way I see it only programmers would mess with the schema so it doesn't have to be super user friendly. I personally wouldn't want someone messing with the Schema I use to publish my web pages.

    2. Re:InfoWorld articles by Anonymous Coward · · Score: 0

      That more or less describes every single XML document or schema I've ever seen. XML itself is as user friendly as a stick in the eye. So naturally anything implemented using will be also.

    3. Re:InfoWorld articles by murdocj · · Score: 1

      I'm not an XML "expert", but I've used XML schema quite a bit the last few months and it seems much easier to read and far more powerful than DTDs. XML Schema has got enough bells and whistles that it's possible to create really hard to read schemas, but it certainly isn't "impenetrable".

    4. Re:InfoWorld articles by ruriruri · · Score: 1
      XML Schema is actually one of the most user-friendly formats, ever. Well, not user-friendly so much as developer-friendly, which is the more important thing. It's much less confusing than DTD format, and far far easier to decipher than any binary format. I'm not sure why an XML "expert" would consider it confusing. Consider:
      <xsd:element name="SlashDotPost">
      __<xsd:complexType>
      ____<xs d:sequence>
      ______<xsd:element name="Subject" type="xsd:string"/>
      ______<xsd:element name="Date" type="xsd:date"/>
      ______<xsd:element name="UserID" type="xsd:long"/>
      ______<xsd:element name="Body" type="xsd:string"/>
      ____</xsd:sequence>
      __</xsd: complexType>
      </xsd:element>
      (Please forgive the formatting.) It's verbose, but hardly any different than a C struct definition. Of course this is just the basics, but it all follows fairly logically.
    5. Re:InfoWorld articles by frisket · · Score: 5, Informative
      I was at the launch presentation of Office-11 by Jean Paoli at XML 2003 in Baltimore MD last week, and I'm also a late sign to MS's extended beta list for the product (now closed).

      To clear up some points people have commented on (based on a very preliminary inspection plus a lot of discussion at the conference):

      1. The default save format is still .doc (ie you have to go the extra click to save in XML format)
      2. If you pick to click it, the default XML format is MS's own office-document vocabulary, which retains all the formatting, held in attributes. Hairy but processable, and they will be shipping their schema for it so people can reprocess it externally. But this format will (of course) only represent the appearance, not any structure.
      3. It will also let you specify your own schema (or an industry standard one) and let you supply a binding of named styles to your element types, so you can edit using what look like styles but actually get represented in the saved file as XML markup. There is some debate as to whether this constitutes "being an XML editor" or just "being a wordprocessor that saves data in XML" (my money is on the latter).
      4. It will not support DTDs, so you're stuck with W3C Schemas whether you like them or not*
      5. The discussion over a [more?] suitable schema/DTD for handling office documents (wordprocessing, spreadsheet, presentation) continues at the OASIS TC on Open Office XML Formats **
      With Office-11, Microsoft has nearly caught up with Corel's WordPerfect, (which has had a fully-fledged SGML and XML editor built-in for years) and XMetaL (which Corel took over from SoftQuad earlier this year). MS still has a long way to go to match industrial-strength applications like ArborText's EPIC or even Emacs with psgml-mode et al , but Office-11 will be a solution for the masses who believe the Word interface to be more desirable, or the Microsoft licensing régime to be more attractive, or the software to be more stable.

      * [Bias note] I think W3C schemas were a big mistake; provision for data content typing and validation, namespaces, and extended grouping could have been achieved by extending DTD syntax; and wimpy programmers who moan about having two syntaxes to handle should get a life - it's not a big deal, the code is free and has been in use for 15 years :-)

      ** Sun has donated the OpenOffice (aka StarOffice) XML file formats to the public domain. It's worth remembering that {Star|Open}Office has been saving in XML as its native format for some time now, and has a lot more experience at this than MS.

    6. Re:InfoWorld articles by Anonymous Coward · · Score: 0

      It's quite understandable from a developer's point of view. Users don't need to worry about schema, so it doesn't matter to them.

    7. Re:InfoWorld articles by butane_bob2003 · · Score: 1

      What would Office users care that Office 11 'supports' XML Schema? And notice they didnt say W3C Schema. Which could mean anything. And what do they mean by 'supports'? I have written an application that supports W3C Schema to the degree that it is useful to the app (its Schema driven, so it better). The only usage of XML Schema I could see applicable to Office 11 is for validation of Office created documents. Valid Office 11 documents can't be created outside of Office because Microsoft hasn't made the schema for Office 11 available (assumption). Office 11 isnt meant for Schema modelling or mapping or any other purpose related to Schemas, so who cares. Not me. Schema is confusing to novices, but for those of us who make their living implementing apps based on specifications like the W3C schema or XML, its no problem.

      --


      TallGreen CMS hosting
  22. here's for the best by codeonezero · · Score: 1

    So long as there are no proprietary additions to XML from Microsoft and as long as the "open" XML format MS will use is truly open for all of us to see, and not an XML tag with binary data in the middle at every possible point, as some other have suggested before.

    I will give props to MS if they will work with other companies in a fair way to do this.

    --

    ....
    int main (void) { ... }

  23. IN SOVIET RUSSIA by SpinyNorman · · Score: 0, Offtopic

    Your anus opens YOU.

  24. well, of course by Planesdragon · · Score: 5, Interesting

    Could this be grounds for another anti-trust suit against Microsoft?

    Of course it could. But so could any bit of news about MS on /. in the past twenty years, from EULA alterations to Palladium.

    But "could" and "is" are differnent things. I suspect MS will decide that closing XML will render it useless, and make it at least as open and useable as their MS-HTML files.

    So, at the worst, we'll have a new "save as" option that's bit sloppy--but since MS won't have to extend XML to get their office functionality, they probably won't do it just to spite a few OSS coders who'll figure it out in a year anyway.

    1. Re:well, of course by Dirtside · · Score: 2
      Of course it could. But so could any bit of news about MS on /. in the past twenty years, from EULA alterations to Palladium.
      Twenty years! Damn, was /. running on a WWIV BBS back in the day, or something?
      what am I, a contradiction?
      No, just unnecessarily credulous. :)
      --
      "Destroy science and religion. Science would re-emerge exactly the same; but not religion." - Penn Jillette, paraphrased
  25. XML-COM by wowbagger · · Score: 2

    I will bet all they will do is create an XML schema for the COM serialize function, since that is pretty much all any Microsoft application does when you select File->Save - it just calls the COM serialize function with the output pointed at the disk.

    So, you will have a file that is nominally XML, but is nothing but memory dump of the COM object.

    Technically, XML. Actually, COM.

    1. Re:XML-COM by The+Bungi · · Score: 1

      So? The compound document format and APIs are well documented. You can write a 10 line C++ program that pulls the XML from it. You can do it today with all the Office file formats except Outlook (which uses the undocumented .PST format). In fact, I've written utilities that do this very same thing to catalog documents.

    2. Re:XML-COM by wowbagger · · Score: 2

      Then why are MS-WordXP documents not compatible with MS-Word 2000?

      The bad thing about COM streams is that if you change the methods of the object, you render the data incompatible with previous versions.

      If you represent a paragraph as <p>, then you needn't worry if you redefine WordDoc::BeginParagraph.

    3. Re:XML-COM by The+Bungi · · Score: 1
      Then why are MS-WordXP documents not compatible with MS-Word 2000?

      I haven't seen an Office XP document, but that's not the point in any case. I don't see how one can expect the previous version of an application to open a file created with a newer version. Of course, you can always save in O2K format if you want.

      The bad thing about COM streams is that if you change the methods of the object, you render the data incompatible with previous versions.

      You don't really know much about COM, do you? First off, the core stuff stored in a compound document by Word (for example) in the WordDocument stream is not a serialized COM object. If you want to see for yourself, try to open the stream and see if it has an embedded GUID. The format doesn't work that way.

      Second, Microsoft can hardly break their own code by "changing the methods of an object" because you can access the data in those compound documents with a few, well documented interfaces that have absolutely nothing to do with Office. Of course, once you get the data you need to know how to format it, which is the real problem.

    4. Re:XML-COM by wowbagger · · Score: 2
      I haven't seen an Office XP document, but that's not the point in any case.


      That is exactly the point - Microsoft has repeatedly and for no good reason introduced incompatiblities on every upgrade of Word - a standardized XML schema would prevent that. Since incompatiblities are how Microsoft forces everybody to upgrade, it is unlikely they would change.

      I don't see how one can expect the previous version of an application to open a file created with a newer version.

      You mean like how every version of Wordperfect can open any WordPerfect document, as long as you don't use features added in the newer version? It is child's play for a competent software engineer to design a format that describes a document in chunks, and to specify that a conforming processor will skip over the chunks it does not understand. For example, the Amiga Information File Format described and audio file in chunks, and the first words of each chunk were the chunk type and length. Perhaps you are familiar with it - Microsoft stole the format for WAV files (although they DID reorder the words from big endian to little endian).

      Second, Microsoft can hardly break their own code by "changing the methods of an object" because you can access the data in those compound documents with a few, well documented interfaces that have absolutely nothing to do with Office.


      Of course, those methods exist only under Windows, and only if the appropriate DLL is present. Have you ever looked at the very files you claim to be an expert on, without a Microsoft supplied DLL between you and the data?

      Of course, once you get the data you need to know how to format it, which is the real problem.


      And that is the real nub - once you have recovered the data, how do you format it - information that is curiously undocumented - and that is my point. The odds that Microsoft will of their own choosing define an XML format that allows everybody to see how to interpet and format the data are approximately the same as Sarah Brady sending a sympathy card to Charlton Heston.

    5. Re:XML-COM by The+Bungi · · Score: 1
      That is exactly the point - Microsoft has repeatedly and for no good reason introduced incompatiblities on every upgrade of Word - a standardized XML schema would prevent that. Since incompatiblities are how Microsoft forces everybody to upgrade, it is unlikely they would change.

      I hate to break it to you, but barring some formatting weirdness, Word 97 can still open Word XP documents just fine.

      You mean like how every version of Wordperfect can open any WordPerfect document, as long as you don't use features added in the newer version?

      See above.

      Of course, those methods exist only under Windows, and only if the appropriate DLL is present. Have you ever looked at the very files you claim to be an expert on, without a Microsoft supplied DLL between you and the data?

      I missed the part where I (or anyone else) said that Microsoft was going to provide an implementation of the compound document for Linux or BSD. In any case, first, neither you or I have looked at the files so it's kinda dumb to fight over whether or not they will have that format. For all I know they've dropped it. Second, the compound document layout hasn't changed much since OLE2 was released back in the WFW311 time, so I'm guessing it'd be child's play to write a library that will read from them.

      And, BTW, I don't claim to be an expert OLE compound documents and structured storage - I am.

      And that is the real nub - once you have recovered the data, how do you format it - information that is curiously undocumented - and that is my point

      Nope. Your point was that "the file would be a COM stream" and that it would be impossible to get the XML out of it if that was the case. Read your original post.

  26. Sure, it's XML, but... by phong3d · · Score: 2, Redundant


    <document>
    <content>
    kdjf348o0jOIJ*$)J@#ijfO34ijf9o84j2193
    )#_@#)UJfnwmejh082u-(U@)*#u08ur@)#RU@
    f934J#EJELKJF%GHWI#UJ(@*#)!)@#@)#(@IF
    fijsjhF*(WU(*@U#IOJWEFJW)*OEURWIOJO:W
    </content>
    </document>

  27. Sure it's Open! by Halo- · · Score: 2

    "Open? Sure it's open! Just click here... and *poof* your document is open. What's that? You mean you want to open it with something other than M$ Office? Oh, well in that case maybe not..."

  28. Why? by citking · · Score: 1
    I just want to know why M$ has such a problem with industry standards. Is it money? Is it so that documents created with their products can only be opened by their products? If so, why does Gates still do this? He's far richer than anyone else.

    As mentioned in a previous port, M$ did their own thing with converting Word docs into HTML. I tend to believe that this is going to happen again, this time with XML. I hate to see technology with two parallels: The one that standards boards such as IEEE create, and those that M$ reinvents just to earn a few more bucks. It's disgusting.

    --
    "This food is problematic."
    1. Re:Why? by Anonymous Coward · · Score: 0

      I just want to know why M$ has such a problem with industry standards. Is it money? Is it so that documents created with their products can only be opened by their products? If so, why does Gates still do this? He's far richer than anyone else.

      And has given more to charity than anyone else. That selfish prick!

      It's disgusting.

      I agree, they should stay closed and keep their trade secrets to themselves.

      Damned if they do, damned if they don't. Might as well stick with what they do best eh?

      Is an Adobe .pdf open to other programs? Is a Quicken .qdf file open to other programs? Is a Goldmine database open to other programs? Why does the majority of all software now have install KEYS? Even most games! Whats with all this protective bullshit dammit!!!

      OPEN EVERYTHING! DESTROY MICROSOFT! EVERYONE IS EQUAL! NOBODY OWNS ANYTHING! NO MORE RICH! NO MORE POOR! WE ARE ALL EQUAL!!!!!!!! COMMUNISM RULES!

  29. Are you kidding me?!? by unterderbrucke · · Score: 1

    "'Microsoft says it's opening its Office desktop software by adding support for XML--a move that should help companies free up access to shared information. But there's a catch: It has yet to disclose the underlying XML dialect.' Could this be grounds for another anti-trust suit against Microsoft?"

    1. They have the document spec closed
    2. No legal repercussions!
    3. They make it more open. (of their own free will)
    4. Antitrust suit!
    5. ???
    6. Laughed out of court!

  30. Excellent. by llamalicious · · Score: 2

    That great, wonderful even. Hopefully it's not Microsoft just using XML as a springboard for saying the equivalent of, "see, we're a good dog, and we're using open standards now," to cloud the judgement of any non-technical committee/court/public speaker that may attempt to point out their obvious monopoly.
    Meanwhile, myself, the company I work at, and the fire department I volunteer at will continue on with Office 97, happy as clams. Well, some Office 2000 too.

    Is there anything else of value they're going to bring to the table with Office 11? More speed, smaller disk footprint, free beer?

    1. Re:Excellent. by 40000 · · Score: 1

      MS Office 4.3 (from 1994) is still good enough if you want to do word-processing (without HTML).

  31. Microsoft XML != XML by Grip3n · · Score: 4, Insightful

    But there's a catch: It has yet to disclose the underlying XML dialect

    Remember, you can also save a Word document as an HTML file, however the HTML is so digusting, so non-standard that the only things that could possibly read it are more Microsoft products. The same, I would presume, will be happening to their XML feature.

    Additionally, its not too far fetched that Microsoft would make their own DTD (Document Type Definition).

    --
    To make a pun demonstrates the highest understanding of a language
    1. Re:Microsoft XML != XML by Anonymous Coward · · Score: 0

      Yes, but that doesn't mean that someone else couldn't write code to read it if they wanted to. And of course they will have their own DTDs. That's what XML is for, creating your own document types and definining them. Maybe they won't release the DTDs for others to use to inspect or validate their XML generation with... Maybe that's what you meant to say.

    2. Re:Microsoft XML != XML by Planesdragon · · Score: 2

      Remember, you can also save a Word document as an HTML file, however the HTML is so digusting, so non-standard that the only things that could possibly read it are more Microsoft products. The same, I would presume, will be happening to their XML feature.

      Do you have Word 97, Word 2000, or Word 2002/XP?

      97 had abyssmal HTML. Thankfully, I don't have to even touch it anymore.

      2000 and 2002 have, as far as I can tell, nearly identical HTML schemas. And, excluding the proprietary office tags ( and and the like), it's rather standard--if cumbersome--HTML.

      If you have Word 2000, you can even get an HTML filter that'll strip the custom HTML and CSS from the file, leaving an HTML file that really couldn't get much cleaner.

  32. Re:Yeah by Neuronerd · · Score: 0, Offtopic

    Mod this down please
    No uncommented links to porn pictures on slashdot please. Its also offtopic

    --
    Googlefight "Slashdot Troll" against "BSD is dying" 303:229. BSD thus cant die.
  33. Hello DMCA! by Wee · · Score: 2, Redundant
    At least with XML it will not be very long until many software companies and project reverse engineer the XML.

    And these other apps can cut into Office revenue. Which is as good a cease-and-desist argument as any.

    I suppose they could put some weird binary or encrypted data in the files, but that would defeat the purpose of XML.

    It defeats nothing if every app speaks the same binary/encrypted language. It prevents other apps from conversing with Office stuff, and that's probably seen as a good thing for MS.

    Anyone who thinks MS is using XML as their file format for the purpose of being "open" or playing well with others had better find another daydream. They're doing it because it helps them in some way, not because it'll help others. And there's actually nothing wrong with that. They're in business to protect shareholder value, after all.

    -B

    --

    Ash and Hickory, straight-grained and true, make excellent bludgeons, dandy for the cudgeling of vegetarians.

    1. Re:Hello DMCA! by ILikeRed · · Score: 2
      And there's actually nothing wrong with that. They're in business to protect shareholder value, after all.
      This statement presupposes that it is OK to cheat, and that cheating is in the best interests of shareholders. But then, personally, I would not invest in a company that I know has a culture of cheating, or that lacks ethics. Also, I find it sad that a company's lack of ethics is defended as helping shareholders, when the same company seems to care very little for it's shareholders. If it did care for it's shareholders, I believe it would pay dividends.
      --
      I have come to a conclusion that one useless man is a shame, two is a law firm, and three or more is a congress -J Adams
    2. Re:Hello DMCA! by Anonvmous+Coward · · Score: 2

      " They're doing it because it helps them in some way, not because it'll help others. And there's actually nothing wrong with that. They're in business to protect shareholder value, after all. "

      I think you're sort of on the right track. You have to remember that MS is branching out to other platforms like Pocket PC. Text is very easy to get around and is quite mobile. (Hence HTML's popularity...)

      I agree there's nothing wrong with what they're doing. I have no doubt that people'll have to sift their way through it to make sense of it, so what? If it's really that important, it'll happen.

      From the article: " It has yet to disclose the underlying XML dialect. Could this be grounds for another anti-trust suit against Microsoft?"

      Um no. At best, that comment was menat to stir up the trolls. Everybody acts like Microsoft owes them everything. All I can say is, grow up. MS is in the business of making money and it will always be like that. They're not required to explain their dialect. Nobody is. You wouldn't be saying that if Sun did that with Star Office.

    3. Re:Hello DMCA! by Wee · · Score: 2
      This statement presupposes that it is OK to cheat, and that cheating is in the best interests of shareholders.

      As long as a public company's main focus is to preserve shareholder value, then you will always have ethical problems. CEOs can get sued for not protecting investments. That can cause them to cut corners. I'd bet most large sharholders (the kind with lawyers and such) care primarily about money. What they don't know won't hurt them. Sure, some folks won't buy RJ Reynolds and Philip Morris and whatever, but lots of people do.

      For the record, if MS wants to keep their file formats obfuscated, then more power to them. It's their right to do so. Is it nasty? Yeah. Do I like it? No. Do I understand why they do it? Yes. They have a responsibility to their shareholders. The only way they know to fulfill that responsibility is by engaging in shady business practices (viz. "embrace, extend, extinguish").

      -B

      --

      Ash and Hickory, straight-grained and true, make excellent bludgeons, dandy for the cudgeling of vegetarians.

  34. Only on Slashdot... by Mitchell+Mebane · · Score: 1, Offtopic

    ...do links to Goatse get modded up as "Informative".

    --

    The roots of education are bitter, but the fruit is sweet.
    --Aristotle
    1. Re:Only on Slashdot... by Jucius+Maximus · · Score: 1
      "...do links to Goatse get modded up as "Informative"."

      ...will your post get modded as funny ;-)

  35. Could this be grounds for another anti-trust suit by tmark · · Score: 2

    How - and why - should it be ? AFAIK, MS never disclosed their e.g. Word or Excel binary formats, so why should they be exposed if they fail to disclose, or even obfuscate, an XML schema ?

  36. Why? by maggard · · Score: 2
    Why would this be grounds for a suit?

    Insofar as I understand MS isn't under any court order to open their file formats, just not to continue with specific unethical tactics on others (wristslap.) So if MS claims they're using XML in Office v.11 (hey, didn't they claim that about Office v.10 too...) big whoop-de-doo, it's really their decision.

    Actually it's remarkable MS is even going for XML at all. MS's own internal formats are a terrible mess, the code that produces it apparently such a tangle MS has terrible trouble keeping on top of it, now trying to put this all into a new format has got to be a monster. Doing all of this while keeping all of the MS'isms and editing features and not breaking every other part (both theirs & third-party) that uses these services & components has got to be daunting.

    Yeah, it'll likely end up being idiosyncratic and quirky full of all the bugs MS is famous for but hell, a semi-legible format has gotta be better then the stuff MS pumps out now. Of course this whole "beta" process we're in right now has been pretty conclusively demonstrated to be a marketing sham with the significant decisions all made and the feature-set frozen long ago.

    --
    I don't read ACs: If a post isn't worth so much as a nom de plume to its author then I wont bother either.
  37. Inside information... by davidstrauss · · Score: 2
    Microsoft Word's new XML format is as follows:

    <xml><worddoc>
    klj49ja90235%@#U42LKJDS9@#&@#$%(@# $90u89oj456@#%#@*#()$*$@%(F5f65F6@#%(&@#%&$#(*%*lk jdsflkjsdh
    </worddoc></xml>

    Technically, it is standard XML.

    1. Re:Inside information... by ceejayoz · · Score: 2
      Actually, that's not standard XML...
      <?xml version="1.0"?>

      <worddoc>
      klj49ja90235%@#U42LKJDS9@#&amp;
      @#$%(@#$90u89oj456@#%#@*#()$*$@
      %(F5f65F6@#%(&amp;@#%&amp;$#(*%
      *lkjdsflkjsdh
      </worddoc>
      That is :-)
    2. Re:Inside information... by davidstrauss · · Score: 2
      Actually, well-formed XML just involves having all tags opened and closed for a perfect hiearchial structure. The
      <?xml version="1.0"?>
      is optional. Refer to XML Prolog Type Declaration before correcting someone. Even if you try to say mine's not "valid", neither is yours.
    3. Re:Inside information... by ceejayoz · · Score: 2

      However, any ampersand character must be written & in XML - otherwise it'll be treated a an entity and there can be problems.

      So, perhaps you should be checking the XML specs yourself before making witty corrections, eh?

    4. Re:Inside information... by MattCohn.com · · Score: 1

      You, sir, are compleatly parinoid.

      Microsoft is trying to do something good, and you can do nothing but bash them. Try and look past your infintile views, huh?

      Another recent post on this discussion.

  38. Re:Yeah by Anonymous Coward · · Score: 0

    Oh come on, the link wasn't even hidden. If you don't know about that site address by now, you shouldn't be on the internet.

  39. Points to remember... by MosesJones · · Score: 5, Insightful


    1) XML, SOAP and all these new technologies were pioneered by Microsoft

    2) They killed all the standards they didn't pioneer (CORBA anyone ?).

    3) There is NOTHING in the XML spec that _requires_ people to open up their schema definitions. Its purely a structure definition in the same way as Microsoft's old Word documents were stored, its just that now the markers are in Text format and any standard XML parser will be able to read the file.

    4) Open Office can already read word documents even though they aren't in XML.

    5) So can Word Perfect.

    6) Using XML doesn't stop you embedding binary into the document, often people do this to store data (images for instance), thus an OLE reference might still be binary.

    7) Pure XML and XSLT are great ways to use up all the power on your processor. Binary has previously been used here because its inefficient, if MS had opened the format up everyone would just complain that its too inefficient and its quicker to save using an older format. So MS are either trying to burn cycles or are customising the XML or their application for speed, is that wrong ? Would it be wrong if KDE did it ?

    8) People won't switch to or from Word because of XML, Open Office and other tools will be able to read the Word files because other tools (Google for instance) need the format and MS can see real business need to allow them to see it.

    9) XML is a meta-language as such anything can be written. Hell they could have a bitch of an external format and then a simple parser that makes it useful, but not tell anyone about the simple parser so everyone elses documents take years to load.

    10) XML is the buzzword of today, OLE to be replaced by SOAP as the buzzword for Office next ?

    Get off the high horse guys, whether its binary or XML is irrelevant, making something XML doesn't make it open. Thats like saying that everything you do makes sense, but just because people don't understand the Mayan Calendar and Ancient Greek they complain.

    MS will always use Mayan and Ancient Greek, and we _can_ understand them, its just easier for them as its their native language and calendar.

    --
    An Eye for an Eye will make the whole world blind - Gandhi
    1. Re:Points to remember... by NullProg · · Score: 4, Insightful

      1) XML, SOAP and all these new technologies were pioneered by Microsoft


      XML came out of "SGML for the Web" team sponsored by the W3C. I think this was back in 97/98.

      Enjoy,

      --
      It's just the normal noises in here.
    2. Re:Points to remember... by illtud · · Score: 1

      3) There is NOTHING in the XML spec that _requires_ people to open up their schema definitions. Its purely a structure definition in the same way as Microsoft's old Word documents were stored, its just that now the markers are in Text format and any standard XML parser will be able to read the file.

      Oh come on, XML 101 - if your document is based on a schema, then it has the uri for the schema right there in the document - else you couldn't validate it. ie - your app has to be able to read the schema. Now maybe they're going to release the documents with a private schema, but that would lose the document about 75% of the point of XML.

      Oh, wait, maybe that's the point...

    3. Re:Points to remember... by ryanvm · · Score: 4, Funny

      Get off the high horse guys, whether its binary or XML is irrelevant, making something XML doesn't make it open.

      You keep using that phrase, I do not think it means what you think it means.

    4. Re:Points to remember... by Danse · · Score: 2

      Being able to validate it is pretty much worthless if you don't also know how to interpret it. That's the key to the whole thing. Including a schema doesn't fix the problem. It needs documentation on just what the hell all of it means.

      --
      It's not enough to bash in heads, you've got to bash in minds. - Captain Hammer
    5. Re:Points to remember... by CableModemSniper · · Score: 1

      You keep using that phrase, I do not think it means what you think it means.

      please remind me what that is from, thanks.

      --
      Why not fork?
    6. Re:Points to remember... by yomahz · · Score: 2

      1) XML, SOAP and all these new technologies were pioneered by Microsoft

      Really?

      --
      "A mind is a terrible thing to taste."
    7. Re:Points to remember... by Iridar · · Score: 1

      Princess Bride. Inigo says it to Vizzini (all spelled wrong, I'm sure) after the fourth of fifth time V. says "Incontheivable" (sic).

      --


      Information doesn't want to be anything

      .
    8. Re:Points to remember... by CableModemSniper · · Score: 1

      Thanks! I was almost positive it was princess bride but couldn't remember the context.

      --
      Why not fork?
    9. Re:Points to remember... by cryms0n · · Score: 1

      It is from Office Space.

      Cheers.

    10. Re:Points to remember... by KidSock · · Score: 2

      MS didn't pioneer XML, saying Open Office can read word documents is technological hair splitting, writing binary memory snapshots to disk is not inefficient, and I don't understand 9) but I'm not trying to make a point, I just don't think your message should be labeled 'Informative'.

    11. Re:Points to remember... by Kashif+Shaikh · · Score: 2

      Get off the high horse guys, whether its binary or XML is irrelevant, making something XML doesn't make it open.

      I believe people -- and this is my opinion -- think XML is more "open" because its a tangible format. i.e. you can open it in notepad.exe and see some logical structure(but you can't intrepret it). Where as all you see from a doc file is bunch of binary gibberish.

    12. Re:Points to remember... by Anonymous Coward · · Score: 0

      ...of which MS was a part of the working group. The W3 has many people from many different interests groups and companies that help to invent, define and solidify their recomendations.

    13. Re:Points to remember... by Anonymous Coward · · Score: 0

      but that would lose the document about 75% of the point of XML

      Umm, no. For Microsoft, at least 75% of the point of XML is for marketing XML.

    14. Re:Points to remember... by more · · Score: 1
      I agree that reading the document is not a big deal. The real interoperability issue is interpretation and there the largest problem is in layout algorithms. It may be impossible to reconstruct or reverse engineer a bit-perfect layout algorithm so that any document written by a Word would look the same elsewhere. Bit perfect rendering in interoperability is difficult enough even without secret rendering algorithms in one end.



      Note, that a single roundation error of 0.499999999999 instead of 0.500000000001 may lead to different pagination or image layout and consequently render the whole document differently. For seamless interoperability, bit perfect computation is required, not just approximations (or "improved" algorithms).

      --

      -- Imperial units must die --

    15. Re:Points to remember... by Anonymous Coward · · Score: 0

      Office space? Ha ha

    16. Re:Points to remember... by NullProg · · Score: 1

      Granted! Now where is the group list. I cannot find a complete list on the WC3. I can find a partial list (individuals of which MS is not represented). Maybe this is the whole list! I am giving you the benefit of a doubt on my part.

      AC show your name and be recognized. Where is your proof?

      Enjoy,

      --
      It's just the normal noises in here.
  40. Grounds for another anti-trust suit? NO! by DangerTenor · · Score: 1

    WTF!?! Was the existing .XLS, .DOC, .PPT format the basis of the first anti-trust suit? NO! You Microsoft-bashers just can't get it straight, can you?

    This moment of frustration brought to you by the holiday season.

    --
    Check out our infosecurity industry blog: http://securitymusings.com/
  41. Mod parent down by Neuronerd · · Score: 1

    for obvious reasons

    --
    Googlefight "Slashdot Troll" against "BSD is dying" 303:229. BSD thus cant die.
    1. Re:Mod parent down by Anonymous Coward · · Score: 0

      Is there an echo in here? I could've sworn you already said that. Do you keep clicking on the link and then writing this?

    2. Re:Mod parent down by peterpi · · Score: 2
      "There are doorways I haven't opened... and windows I've yet to look through..."

      Well, I guess that's one less now!

    3. Re:Mod parent down by Happy+Monkey · · Score: 1

      for obvious reasons
      There are doorways I haven't opened... and windows I've yet to look through...


      Interesting comment/sig juxtaposition...

      --
      __
      Do ya feel happy-go-lucky, punk?
  42. ms relies on office formats by reitoei1971 · · Score: 1

    Will this be anything like Ms's wonderful HTML? with so much bloat you can barely even see the content? maybe encrypted XML? One of MS biggest advantages is Office, not just because it is a decent program, but because "everyone" has to use it since "everyone" else sends stuff in .doc format and there isnt really a good way to open them (Openoffice, etc. usually cant handle complex formatting). I really doubt MS would make their formats more open and readable by competion's products

    1. Re:ms relies on office formats by Christianfreak · · Score: 2

      aahhhrrg! Every one says this "OpenOffice can't handle complex formatting of word docs"

      What complex formating? I've been using OO instead of Word for a long time now. I've converted tables, footnotes, tabstops, embedded images, bulleted lists, graphs and combinations of it all in the same document. I've never had a problem with formatting. More often I have problems with Word 97 or RTF documents opening in Office XP. Screws it up everytime. So please tell me what OO can't do I'm dying to know.

    2. Re:ms relies on office formats by reitoei1971 · · Score: 1

      I use OO and it works 95%+ of the time but I have run into office XP documents with tables and figures which dont convert quite right.(weird fills, tables overlapping, unreadable text in tables, captions in odd places) Ive also had some problems with excel formulas. Office 2K docs always seem to work though. Someday you too will find the Office document that doesnt look quite right in OO.

  43. Of course by nuggz · · Score: 2

    That is probaly what will happen.
    Technical compliance, while completely avoiding the spirit of the standard.

    Of course if I was MS, that is what I would do too.

  44. yeah by ScubaS · · Score: 1

    "Could this be grounds for another anti-trust suit against Microsoft?"

    Only as long as people continue to use the definition of monopoly, as stated in the dictionary:

    1. COMMERCE noun A company you do not like because of a product they made; who has complete control over their product's market supply.

  45. "open" by Anonymous Coward · · Score: 0

    It's not more open than .doc because XML is mainly a buzzword.

  46. Mod parent as Windows Supporting Troll!!! by Anonymous Coward · · Score: 0

    asswipe

  47. Open? by Grip3n · · Score: 4, Informative

    I'd say the title of this article (Is the New Microsoft Office Really Open?) is extrmely misleading. Microsoft isn't even trying to be open, they're just adding support for another opensource language. A true open program would have its source code available. What this article is about has nothing to do with that. Microsoft Office is closed. Period.

    --
    To make a pun demonstrates the highest understanding of a language
    1. Re:Open? by jpmorgan · · Score: 2

      XML isn't an 'open source' language. It has nothing to do with open source/free software. It's just a document metaformat based on SGML.

    2. Re:Open? by scm · · Score: 2, Insightful

      "Open" used to imply something different before "Open Source" because popular. It meant that file formats, APIs, ABIs, etc. were well documented. Many Unix venders used to call their OSs "open" not because they gave away the source, but because everything was documented and accessible to third parties.

    3. Re:Open? by leandrod · · Score: 2
      > "Open" used to imply something different before "Open Source" because popular.

      Yes.

      > It meant that file formats, APIs, ABIs, etc. were well documented.

      No! It meant that there was conformance to open standards, that is, standards estabilished by open organisations that congregated users and vendors from all over the world.

      > Many Unix venders used to call their OSs "open" not because they gave away the source, but because everything was documented and accessible to third parties.

      No, they were open because they conformed, and still conform, to POSIX, OSI and other relevant open standards.

      --
      Leandro Guimarães Faria Corcete DUTRA
      DA, DBA, SysAdmin, Data Modeller
      GNU Project, Debian GNU/Lin
  48. Adoption of standard no guarantee of interop... by Sigh+Phi · · Score: 5, Insightful

    Microsoft (and Netscape) essentially tried the same thing with HTML. Sure, we're using HTML, but to actually view our HTML, you have to use our browser.

    Adoption of a "standard" is no guarantee of interoperability. Understanding the conceptual underpinnings of the standard is just as important. The question is, when Microsoft says they are using XML as a document format, are they doing it because they believe in the principles underlying it, or solely for the cynical "this is what is selling now" aspect?

    The body of HTML out there is an paresable, babble of a mess, largely because the two dominant browser makers did not respect many of the underlying notions of markup and hypertext to begin with. The state of the art progressed, but not in the way a lot of people wanted it to go.

    This could bode poorly if the meme survives somehow that the Office format is now equivalent to XML. When it "doesn't work," who knows where the blame will fall?

    1. Re:Adoption of standard no guarantee of interop... by Sigh+Phi · · Score: 1
      > The body of HTML out there is an paresable, babble of a mess,

      Make that "unparseable babble of a mess..."

    2. Re:Adoption of standard no guarantee of interop... by 9jack9 · · Score: 1
      The question is, when Microsoft says they are using XML as a document format, are they doing it because they believe in the principles underlying it, or solely for the cynical "this is what is selling now" aspect?

      There's another possibility. Perhaps they're doing it so that the documents are interoperable, but the interoperable they care about is with other Microsoft products. I could easily see that the goal is interoperability with IIS, Sharepoint, Exchange, as well as the new stuff that's scheduled to come out next year. It's a lot like using the IE engine as a display mechanism within the OS -- it's a pretty good solution as long as you're only building stuff with MS components, it's when you want a choice then it's a problem.

  49. the new XML .doc file header looks like: by SirSlud · · Score: 2
    --
    "Old man yells at systemd"
    1. Re:the new XML .doc file header looks like: by Anonymous Coward · · Score: 0

      resistance is="futile"/resitance> No, that's not a typo, the opening and closing tags are intentionally different.

  50. XML is now targeted by Salubri · · Score: 1
    (This one should replace the one that is empty. Gotta love it when your friend accidentally clicks submit before you post.)

    "Microsoft would lose a lot of money," Rischel said. "Right now, Microsoft can set the price of Office products based on knowing their large clients don't have an alternative." Open formats "would create a market for other products" and competitive pricing.
    The issue here that really comes out to play is that this is a customizable language. There are some groups trying to put together a standard, but as far as I can tell if they invent their own standard they're within their right.

    Can't help but state the obvious here... XML is now targeted. It is in Java's position, only as far as I know without any licensing issues. Oasis might help to standardize for anyone in the open-source community, but M$ is clearly making sure they get everything else before others do.

    It's like the article says, Office is thier cash cow. They're not going to let some open source movement mess with it. Not when they can make more money.

    --
    ----- I want my LART.
  51. Open standards by Anonymous Coward · · Score: 0

    If I see a single or tag in any document I make, I will go down to wherever they make office and personally shove those CDs up their asses.

  52. lets see which dialect of XML will they use? by myowntrueself · · Score: 2

    How about Microsoft Visual XML++?
    If it doesn't exist now it will...
    or something sufficiently based on XML
    that it can have XML in the name,
    but sufficiently different to XML that
    its incompatible with XML from other vendors and developers will need to learn a whole new way of working with XML.

    Just a wild guess.

    --
    In the free world the media isn't government run; the government is media run.
  53. XML can be as cryptic as binary by Jelloman · · Score: 5, Insightful
    All the hype about XML seems to skip over the fact that XML is never guaranteed to be any less cryptic than binary data formats. For example:
    <?xml version='1.0' ?>
    <wordDoc>
    <base64 value='kjkjKJ+kyRgMhiuI9KqU/hjkj'/>
    <base64 value='OlRg8LKp8UI883Jjk+krNhjkj'/>
    <base64 value='pRhjjhO9asdJiQ99kjkjU8j=='/>
    </wordDoc>
    XML was designed to be machine-readable, not human-readable, much less human-understandable, or easily-reverse-engineerable.

    The Office file formats will be open if M$ decides to:
    • Document them, and
    • Not change them with every update.
    I doubt they will do either of those things.

    1. Re:XML can be as cryptic as binary by Anonymous Coward · · Score: 0

      Eh?!

      "XML was designed to be machine-readable, not human-readable"

      Quite the opposite. Machines have been able to read complex binary formats for years. XML is supposed to be human readable. That is why tags are written in human terms and not just secret codes.

    2. Re:XML can be as cryptic as binary by haggar · · Score: 2

      You are quite right. But not only this: I will explicitly say that even if they publish the DTD, they can still have a format that is NOT represented correctly by any other office suite, but their's. That's because having the DTD does not help you in the representation of the content. And yet, you could still contain representation information in the XML document, but that content would be not documented. So, yeah, you have the DTD, you can validate the XML document, and still you have no f*cking clue how to represent it: how should this thing look like, how does it print?

      --
      Sigged!
    3. Re:XML can be as cryptic as binary by Dirtside · · Score: 2
      XML was designed to be machine-readable, not human-readable, much less human-understandable, or easily-reverse-engineerable.
      False. Point 6 of the W3's list of goals for XML is that it should be "human-legible and reasonably clear".
      --
      "Destroy science and religion. Science would re-emerge exactly the same; but not religion." - Penn Jillette, paraphrased
  54. right by Anonymous Coward · · Score: 0

    rubbi$h

  55. reverse engineering by k3v0 · · Score: 1

    in my non-coding mind it seems that it would be easier to reverse engineer something not entirely based on proprietary code.

  56. Slashdot... Jump to Conclusions by SaturnSS · · Score: 0, Troll

    It's a "Jump to Conclusions Mat"! You see, you have this mat, with different CONCLUSIONS written on it that you could JUMP TO!

    Here's a novel idea... why don't we wait until they RELEASE the specs? Then complain about it?

    --
    85% of Americans think this signature sucks
  57. mmmm ok by Anonymous Coward · · Score: 0

    you're still a polesmoker

    1. Re:mmmm ok by Anonymous Coward · · Score: 0

      My mistake, I thought your mom was a woman....

  58. IN SOVIET RUSSIA... by Anonymous Coward · · Score: 0, Troll

    XML formats YOU.

  59. Details on Microsoft's new XML format by MillionthMonkey · · Score: 3, Funny

    <officeDocument>
    <base-64>
    R0lGODlhSwA3AIQAAA8NDZqHf5I/NldIQszKyEw qJ3ZoYcykoHBIOIZrYLannDEpJv39+244KW5Z
    T6uZj+dkb8i vo7xARPN0gZNRRSUeHOrY2MJSUfyVmIt4bvuwtLq4tl9UT9nX1 Ug5NAAAACH+Dk1h
    ZGUgd2l0aCBHSU1QACH5BAEKAB8ALAAAA ABLADcAQAX+4CeOZGmeptUpzxNkz0ZYaG3fuGg9DoVQ
    jqBj4 BkYiZ7kYrmoOJ/OQSdHrVkiFEFju0U4Ao9IREEWj8muVybj8Cy hFc+DVj11OA4FodOxaDQY
    GBMQFz4IBgkZCQZDRBwJkC0ZbgM clm5wFQMbU3UiBh4KnTcGBRUFTUwLAwYGMGgPGRweTqtCA021
    A3p0VR0eDjOeIg9uwBlgZCwvjJZIuU8GMr3DOgkDDzN0FhsBo BVMBaiqb3FJSqpw5AseBehMwI8K
    Mn3VNx0Bs5hQ6/1y9gAJI ODSBYGXBAESgonVykEzI27KwXEwCmCJHVkESNhIoaNBA2ASrsn QClGr
    RUf+8HAgkmlBBmHVAsiBedGChQNaxOlktySJq1csMhy p5OAStArSKuawsJLmjWLvesJo4c1hM3Qe
    8LDUxCugBwNOPW1 w1RChSJKMcGWq9fXBPAJ7qOFQEQCA3bt48+rduzfT3Qp89zKJE 2ADJ4soOvBA
    cq7xOVYK5CK2yCMBgiNJiAxeC+7JAgWTqxAI4 MCyQQeniw4QUjoBa8z74HggEBrHA4JcEMAwI4aq
    QmZDlKzdJ Dm0hYwSBGhpgMCswpFrklFd03DWUSdylNqzEKBBBhmAIECYcKE QEGSxEiWw1CoAwljG
    ira7rmkawDsO6PXyA4h88i06ifMYMq/ EQgRRQaz+lEoUh1VjwCb12EBAAwKOw8QAa7zXwjIcLLGa
    arR 4ltQwD1TwQIQ2KABNLj05p4AYyyiiT1YrhVgBB3qg6MtXtNWhg gJJcODKQjC2kIZrSECUijQ9
    2gNKWD5OIiR6L7LggiIGdJiJB 99BKdoCYBU3wgpvESDDaE18tUha50jEGTiVqBVHDHFRQUCaNdJ 3
    CipwBPZmLQu448RdqQQKzxIc0FnRBgMAFpifnv1ZS2dNPMo XOJXaFYN2tSn2oGOZgbRBbaTqoAAo
    74Cq6ialAsRdI8+suCA /bywQQKueRJABBUgOBVFE6gw3Kq4ocBNBAIac1spKR3jBgUEzA pvJrcT+
    ltAdcwZl60UQkERHUhDN5MmZS9WScBxuAzW30IYKM APJQxG5yWC5I9yWW7Zg8AaLkWmg1Ya8oYg5
    2QEJdLRcFyDxm y9vByyjBhuxPeFBAJwixp1yGCvHHDIhvbCmQwhRZ8nImqkzYm0 boBZABBdsVJ5H
    pYWEpRF4kATDJEFqRt9MAvtiAALZwPVHIIM U4gXHLyTkmkMkgTGJLVnKCQWEiBHAin46DF20AAO1
    E+A5IQc QlCMj6zOrBw1WY8GDOV70hwbk5RSgOD1xEJ1bBtL8Gp9OcJC2J wR4kECdNUyoE4WGeviT
    2GhAnFWCW9VyMuC2elnCAzphAk8ir rj1nAP+uXw4xFEu/W0njz2LsIEpnWG6RGlTmaEAG9YVoZKN
    C +RhOSkLbJC66iFKxA4MD0NHuzFxBl+YjjtysPtF3jxoSxFTvXg lJKodEYScuv+eGI/V3BEohgtV
    eaUi8rkjHDhMumrE8xJmwI4 DVDbc7yLBZYIj/DYEUIEoFiGASRyyrvOhhDPYsI+DQlGxmphgE mRh
    BCI+pqA/eehAGciR9z4QgMpJ5keNioM+jAANYDSkZJLyj DikNrG38KFna+OSHsz0NHLsxHVLyEs/
    NkOO9fUwM6ygk5h+Y ZcnFNFRAMDUoCyVlze5oYmRihSYFLWNsSCRiViEIg4XkEUdZio UMggBAQA7
    </base64>
    </OfficeDocument>

    1. Re:Details on Microsoft's new XML format by fgb · · Score: 1

      All that just to get the Borg Gates icon! I feel cheated!

    2. Re:Details on Microsoft's new XML format by fredrikj · · Score: 1

      Hmm. I find it interesting that that passed the lameness filter :)

    3. Re:Details on Microsoft's new XML format by MillionthMonkey · · Score: 2

      Actually I was surprised by that too. Although I'm sure the XML format that Microsoft is using wouldn't get by.

    4. Re:Details on Microsoft's new XML format by Dog+and+Pony · · Score: 2

      Is that the specification or an example?

    5. Re:Details on Microsoft's new XML format by MillionthMonkey · · Score: 2

      It was an example. And a joke example at that.
      Microsoft would never specify the base "64" in plaintext. :)

    6. Re:Details on Microsoft's new XML format by Dog+and+Pony · · Score: 2

      Well, it was a joke question at that, too. Microsoft would never publish a specification in plaintext either. :)

    7. Re:Details on Microsoft's new XML format by Dave2+Wickham · · Score: 1

      Well... if you put it into binary it's the "Borg Gates" icon, so I'd doubt it ;).

  60. MSXML format by rirugrat · · Score: 1

    [?msxml version="1.0"]
    -[office11]
    -[drm]
    [backwardcompatible]No[/backwardcompatible]
    [linux]Evil[/linux]
    [profit]$$$[/profit]
    [globaldominationmode]On[/globaldominationmode]
    [/drm]
    [/office11]

    1. Re:MSXML format by Vermithrax · · Score: 1

      I think there's an error there
      surely you've added an extra [/globaldominationmode]

  61. Boo Hoo Hoo by VividU · · Score: 2, Insightful

    The problem is that Microsoft chooses to retain their obfuscated binary format as the default save type for documents.

    Comments like this give me the creepies. As a software developer, the last thing I want is some entity telling me what my default format should be.

    It's also indicitive of the elitist attitudes of many Linuxites. In effect, the poster is saying that users will never have the capability to inform themselves and make a choice as to how they want to use their computers.

    1. Re:Boo Hoo Hoo by killmenow · · Score: 1
      As a software developer, the last thing I want is some entity telling me what my default format should be.
      Ugh...How many times must I say this: then don't break the law. When you do, the government gets to tell you a lot of what you can and can't do anymore.

      Microsoft did break the law. They were found guilty. Even though the current administration rolled over and played dead on the issue, they certainly have the right to force default formats on Microsoft if it were to be found as a fitting remedy to anti-trust violations (you know THOSE LAWS THEY BROKE).
    2. Re:Boo Hoo Hoo by robinthecandystore · · Score: 1

      > Comments like this give me the creepies. As a software developer, the last thing I want is some entity telling me what my default format should be.

      Really? I don't believe that this should be up to the software developer. The information that goes into office documents belongs to the company that writes those documents not the developer of the application, and it is only fair that they be allowed to access that information from any application that they want. XML document formats would allow this. By sticking with proprietary file formats you are hurting your customer in an attempt to lock them into your file formats.

      Choice people! That's what its all about, I should be able to use any office suite to edit my office documents in the same manner that I can use any text editor to edit my text files.

      Just my $.02

    3. Re:Boo Hoo Hoo by yelligsc · · Score: 1

      Agreed, its not any sort of governments concern what the default datatype should be.

      That being said, I do think some additonal ability to customize the operation of the program would be nice.

      I try to save things as .RTF whenever I remember, especially when I will be sending the data out to some company that might not use word. However, it is MUCH quicker to save as a .DOC because its the default. As a user I would like to be able to configure what the dialog boxes default to for file types.

      Im not sure exactly how to respond to your comment about linux users attitude. But, in my opinion, the used should not be prevented from informing themselves and making a choice as to how they want to use their computers.

      Scott.

    4. Re:Boo Hoo Hoo by robinthecandystore · · Score: 1

      Actually, you can set all the defaults for office using Group Policy objects in Active directory. There are additional add-ons in the office resource kit which allow you to change office settings for all your users. You can have a policy that makes Word XP always save as rtf or word 97 or even html (ugh word html). Its pretty easy. see here for more info.

    5. Re:Boo Hoo Hoo by Anonymous Coward · · Score: 0

      They were not found guilty breaking the law WRT the MS Office formats.

      Retard Logic:
      MS twists the arms of PC OEMs ==> Open formats for Word Files!!!

    6. Re:Boo Hoo Hoo by WasterDave · · Score: 2

      As a software developer, the last thing I want is some entity telling me what my default format should be.

      I used to agree with you, wholeheartedly. After all, it's a shitload easier to cook up your own spec and code to that. Or, more likely, just code. Many Linux apps only get away with this attitude because their files are primarily plaintext, and therefore a complete absence of formality is generally OK.

      But I've spent the last nine months going into battle, daily, with a video compression standard that's the hugest bastard in the whole world to work with. It's patent encumbered, it's not trendy, it doesn't have a pretty GUI, it won't enamour me to the OSS community and the whole experience has nearly killed me. So why bother?

      Metcalfe's law: The utility (usefulness, approximately) of a network is proportional to the square of the number of nodes on a network. When I've finished building my stuff based on this bastard standard, it'll be compatible with the umpteen million other devices that also use it. That's one whole shitload of value proposition, right there.

      Non standard format => Use to communicate with the other eight people who use the product.
      Standard format => Use to communicate with the other two million people who use the standard.

      See? Step 1, use standards. Step 2, ? Step 3, Profit!
      Dave

      --
      I write a blog now, you should be afraid.
    7. Re:Boo Hoo Hoo by yelligsc · · Score: 1

      Thanks! I never would have found that otherwise.

      Mod parent up!

  62. This is very simple by mao+che+minh · · Score: 4, Interesting

    If they really wanted to join the open market and truly compete, then they would just open the .doc format. This is nothing more then a pitiful pandering to open source advocates or those businesses that are interested in OSS. Any person with a shred of common sense and a basic knowledge of technology developments over the past 5 years can plainly see how pointless this is.

    1. Re:This is very simple by Anonymous Coward · · Score: 0

      What kind of retarded are you? Can you mean anything meaningful by "open the .doc format"? Do you mean, *document* the .doc format? That's about the stupidest concept I can think of happening. I don't think MS even knows how it works. Also, people have significantly reverse-engineered it already, more on this below. Also, by documenting .doc, MS would be stuck "supporting" it.

      How is it not a million times better to move to a more sane document framework (XML vs. binary dump).

      Here's what MS gets by going to XML:

      - Office formats become de facto readable, WITHOUT MS having to document (read support) the format.
      - Relatedly, XML is easier to "extend" from version to version, including backwards AND forwards compatibility, than a binary format. This makes MS's life easier - I don't think they like to be tied to the inane .doc format either!
      - That's important because, it allows the rest of the intdustry to make tools based on the format, without having to worry (much) about later incompatibilities.
      - Dumb people on slashdot feel that it's all about Open/Star, which are two irrelevant apps. What it's actually about is, getting a whole catalog of third-party apps built to work with Office-generated data.
      - Even if the format becomes totally transparent, which it may (probably not with Office 11, but this is only step 1 - think Office 15), MS is still in the driver's seat. With 3% market share, MS is counting on Open/Star never being able to Embrace & Extend the schemata. As long as MS controls the formats, it's actually the huge boon to have other people developing tools and apps - lock-in you see.
      - By doing this, MS is saying that it no longer needs the opaque formats to maintain control. Which I think means they think they can compete purely on features. That has to be good.
      - MS needs their docs to be easily searchable. By like Google, but also just within Windows.
      - MS needs it to be easier to integrate their apps internally. Even within MS, no one wants to touch the .doc format - what other MS apps can do anything with it?
      - .NET is all about integration through XML, as is Yukon (the next SQL) which will also be the basis for a new filesystem for Longhorn (the next Windows).
      - MS gets to finally shun .doc. Do you think they like it either?
      - It's good technology. It's what XML is for. You misunderestimate MS if you think they're not good technologists. They're also good at business, but they're also good technologists.

      How can MS do anything BUT move to XML? It makes sense in all the ways. Screw .doc.

  63. A 20 year old solution: by Anonymous Coward · · Score: 0

    From: Richard Stallman (RMS@MIT-OZ@mit-eddie.UUCP)
    Subject: new UNIX implementation
    This is the only article in this thread
    View: Original Format
    Newsgroups: net.unix-wizards, net.usoft
    Date: 1983-09-27 10:35:59 PST

    Free Unix!

    Starting this Thanksgiving I am going to write a complete Unix-compatible software system called GNU (for Gnu's Not Unix), and give it away free to everyone who can use it. Contributions of time, money, programs and equipment are greatly needed.

    To begin with, GNU will be a kernel plus all the utilities needed to write and run C programs: editor, shell, C compiler, linker, assembler, and a few other things. After this we will add a text formatter, a YACC, an Empire game, a spreadsheet, and hundreds of other things. We hope to supply, eventually, everything useful that normally comes with a Unix system, and anything else useful, including on-line and hardcopy documentation.

    GNU will be able to run Unix programs, but will not be identical to Unix. We will make all improvements that are convenient, based on our experience with other operating systems. In particular, we plan to have longer filenames, file version numbers, a crashproof file system, filename completion perhaps, terminal-independent
    display support, and eventually a Lisp-based window system through which several Lisp programs and ordinary Unix programs can share a screen.
    Both C and Lisp will be available as system programming languages. We will have network software based on MIT's chaosnet protocol, far superior to UUCP. We may also have something compatible with UUCP.

    Who Am I?

    I am Richard Stallman, inventor of the original much-imitated EMACS editor, now at the Artificial Intelligence Lab at MIT. I have worked extensively on compilers, editors, debuggers, command interpreters, the Incompatible Timesharing System and the Lisp Machine operating system. I pioneered terminal-independent display support in ITS. In addition I have implemented one crashproof file system and two window systems for Lisp machines.

    Why I Must Write GNU

    I consider that the golden rule requires that if I like a program I must share it with other people who like it. I cannot in good conscience sign a nondisclosure agreement or a software license agreement.

    So that I can continue to use computers without violating my principles, I have decided to put together a sufficient body of free software so that I will be able to get along without any software that is not free.

    How You Can Contribute

    I am asking computer manufacturers for donations of machines and money. I'm asking individuals for donations of programs and work.

    One computer manufacturer has already offered to provide a machine. But we could use more. One consequence you can expect if you donate machines is that GNU will run on them at an early date. The machine had better be able to operate in a residential area, and not require sophisticated cooling or power.

    Individual programmers can contribute by writing a compatible duplicate of some Unix utility and giving it to me. For most projects, such part-time distributed work would be very hard to coordinate; the independently-written parts would not work together. But for the particular task of replacing Unix, this problem is absent. Most interface specifications are fixed by Unix compatibility. If each contribution works with the rest of Unix, it will probably work with the rest of GNU.

    If I get donations of money, I may be able to hire a few people full or part time. The salary won't be high, but I'm looking for people for whom knowing they are helping humanity is as important as money. I view this as a way of enabling dedicated people to devote their full energies to working on GNU by sparing them the need to make a living in another way.

    For more information, contact me.
    Arpanet mail:
    RMS@MIT-MC.ARPA

    Usenet: ...!mit-eddie!RMS@OZ ...!mit-vax!RMS@OZ

    US Snail:
    Richard Stallman
    166 Prospect St
    Cambridge, MA 02139

    1. Re:A 20 year old solution: by Anonymous Coward · · Score: 1, Funny

      I am going to reply to this email and offer my help. This seems like a great project to work on. Is .UUCP a new TLD?

    2. Re:A 20 year old solution: by Anonymous Coward · · Score: 0

      Please note posting date of 1983. UUCP stands for Unix to Unix copy, which is some special high-tech super magic that allows you to copy a file from one unix computer to another, somehow using a wire connecting them instead of a smelly bearded guy with a 12inch floppy. It's the wave of the future, and we hope to eliminate all smelly bearded guys from the computer world soon.

    3. Re:A 20 year old solution: by glenstar · · Score: 2
      To begin with, GNU will be a kernel

      Oooops! Missed the the boat by a little bit there, eh Richard?

    4. Re:A 20 year old solution: by glenstar · · Score: 2
      GNU will run on them at an early date.

      Uh oh... *double* oops.

    5. Re:A 20 year old solution: by Puu · · Score: 1

      a smelly bearded guy with a 12inch floppy

      Now that does sound like out of an old 80's pr0n flick.

  64. Open but Secure by mugnyte · · Score: 5, Interesting

    Something in my gut tells me that beyond all the extraneous tags, attributes and data types, the XML is going to have a hash code built into it.

    Edit this file outside of MS Office (invalidating the hash code) and suffer the consequences: MS treats it as "untrusted" input and rips out only the text content, no formatting.

    The hash will be a giant number created through a secure portion of the Intel-ish hardware calls. Keys hidden where? That'll be interesting to see who posts 'em first. Perhaps on a .NET server at MS hosting? Nah, this cripples offline Office. Keyless hash?
    Curious Curious.

    mug

    1. Re:Open but Secure by el_chicano · · Score: 2
      Edit this file outside of MS Office (invalidating the hash code) and suffer the consequences: MS treats it as "untrusted" input and rips out only the text content, no formatting.
      So my word processor and spreadsheet will refuse to let ME, the user, do what I want with MY documents? I myself would refuse to run those apps.

      Is this really a big problem? Unless MS cripples the export function you can always Save->As a RTF or CSV file. You can then parse the file and then format the data using the XML schema of your choice. Or am I missing something?
      --
      A man who wants nothing is invincible
    2. Re:Open but Secure by ScubaS · · Score: 1

      lol. who would use a program that pretended to be a word processor? Yeah, you can't type up a document with Word anymore!

  65. Come on now by Anonymous Coward · · Score: 0
    Microsoft......open source? Surely you jest!

    That would be like Slashdot actually investigating articles before posting them.

    Guys, April is months away......

  66. XML-Dev thread on WordML by watchful.babbler · · Score: 4, Informative
    There was a fairly recent thread on this issue over at the XML-Dev list (see here). The upshot, according to W3C XMLWG member (and occasional Microsoft foe) Tim Bray, is that Word is capable of saving documents in a WordML format that is parsable even without a DTD:
    I didn't see anything that I couldn't pick apart straightforwardly with Perl, and if someone asked me to write a script to pull all the paragraphs out of a Word doc that contain the word "foo" in bold, well you could do that. Which seems pretty important to me.
    So, from a technical perspective, there isn't much to worry about right now. From a legal perspective, no, there's no grounds for another antitrust suit, any more than there's grounds for suing Quark for not disclosing their file format.
    --
    "Freedom is kind of a hobby with me, and I have disposable income that I'll spend to find out how to get people more."
  67. Are you paying attention? It's Microsoft. by burgburgburg · · Score: 4, Insightful
    Of course it isn't open. It's a silly question. Open is EVIL. Actually open would eliminate advantages. People would be able to create their own tools to interact with documents, instead of with MS tools. Where's the money in that?

    Dancing MonkeyBoy doesn't hop across a stage for his health. He "loves this company" because it makes money as only a monopoly can.

    Silly rabbit. Open is for kids.

    1. Re:Are you paying attention? It's Microsoft. by gmack · · Score: 5, Insightful

      That right there is one of the things that makes working with windows a pain.

      On any Unix or Unix clone you can just run standard tools or write your own.

      Unfortunatly with everything in a proprietary format you then end up having to build scripting languages into everything making all of your data files potential entry points for malicious code.

      The move to XML has the potential to eliminate that sort of brain damage once and for all provided they actually open their file formats.

      I hope they do it.. but given their past I'm not holding my breath given that the options are long term financial security for MS or Security for their customers and the risk of losing market share in the future.

    2. Re:Are you paying attention? It's Microsoft. by BasicOp · · Score: 0, Flamebait

      People would be able to create their own tools to interact with documents, instead of with MS tools. Where's the money in that? News flash: Hundreds of thousands of developers worldwide already developer their own tools to interact with MS documents. Some if not most serious developers have made a lot of money off writing programs for Windows/Office. Open your eyes and you will see that Microsoft makes business a lot of money. MS is a big help to the economy in that perspective.

    3. Re:Are you paying attention? It's Microsoft. by Sivar · · Score: 3, Insightful

      News flash: Hundreds of thousands of developers worldwide already developer their own tools to interact with MS documents. Some if not most serious developers have made a lot of money off writing programs for Windows/Office. Open your eyes and you will see that Microsoft makes business a lot of money. MS is a big help to the economy in that perspective.

      Really? Excellent! Please point me to the specification for the MS Office format, so I can write a cross-platform tool to open their files.

      --
      Computer Science is no more about computers than astronomy is about telescopes. --E. W. Dijkstra
    4. Re:Are you paying attention? It's Microsoft. by Axe · · Score: 3, Interesting
      ..so I can write a cross-platform tool to open their files.

      ifstream("MyOfficeFile.doc", ios::in);

      Crossplatform enough for you?

      Oh, you mean edit the files? I remember writing VBA code that did that just fine.. Good documentation how to do that - much easier then working with a crazy-ass XML schema?

      So what exactly are you asking for?

      --
      <^>_<(ô ô)>_<^>
    5. Re:Are you paying attention? It's Microsoft. by Sivar · · Score: 4, Funny

      ..so I can write a cross-platform tool to open their files.

      ifstream("MyOfficeFile.doc", ios::in);
      Crossplatform enough for you?


      As funny as it is useful. I can read the most thoroughly encrypted files that way, too. It's good to have a Windows programmer around...

      Oh, you mean edit the files? I remember writing VBA code that did that just fine.. Good documentation how to do that - much easier then working with a crazy-ass XML schema?

      It seems that between your first sentence and your second, you forgot the "cross-platform" part. Of course, if you're a VB programmer I can't blame you--you were probably born that way.
      (I'm just kidding, no personal insult intended)

      --
      Computer Science is no more about computers than astronomy is about telescopes. --E. W. Dijkstra
    6. Re:Are you paying attention? It's Microsoft. by Anonymous Coward · · Score: 0

      VBA is the bestest! it's so portable! and it's really good for viruses!

      i think you should take a good hard look at what's best for the company you work for and recommend that someone who actually knows something replace you with a very small shell script

  68. GURU SAYS: by Anonymous Coward · · Score: 0

    Never trust a multimillion company.

  69. Re:"Could this be grounds for another lawsuit?" WT by commodoresloat · · Score: 2

    I realize it's a joke, but it seems to me that mucking with an open standard and then closing it in order to extend their monopoly might just be a reasonable cause of action. XML is not a "trade secret," and making their version incompatible with the rest of the world's in order to force the world to adopt MS products is not "innovation." Reminds me of what they did with Kerberos a couple years ago. This may or may not be worth a lawsuit, but it would certainly be anticompetitive of them.

  70. FUD alert by The+Bungi · · Score: 2, Insightful
    Rischel said. "Right now, Microsoft can set the price of Office products based on knowing their large clients don't have an alternative." Open formats "would create a market for other products" and competitive pricing

    Nope. Microsoft can set the price of Office because the applications fullfill the needs of its customers. The fact that the file format is propietary has little if nothing to do with it.

    The last time I saw StarOffice running on Windows, I damn nearly puked. It's written in something that looks like Java/AWT, the apps take bloody ages to load, opening a document takes even more bloody ages, the UI looks childish and the printing sucks. And I didn't really spend much time with it.

    OTOH, the Office apps load damn near instantaneously on even a PII 450, opening even ~50MB documents with hundreds of embedded images never takes more than a few seconds, the GUI is consistent and tight, and the things just work.

    Sun (and everyone else) has a problem if it thinks that it can compete with Office on Windows with that stuff, and unless they provide an alternative to VBA, they'll never even make a dent. There are hundreds of thousands of people who write full-fledged bussiness applications using VBA and aggregating Office functionality, and that's not something that a company will just throw away because the file formats are now compatible. w00t.

    If anything, opening the formats up will increase the popularity of office suites in Linux, because people won't have to dual boot or whatever to a) be productive; and b) read the stuff that the rest of the world produces.

    1. Re:FUD alert by cranos · · Score: 2

      I hate to disappoint you but VBA is one of the biggest loads of crap I have ever seen. People write full blown apps in this shit because PHBs don't give them the fundage necessary to use proper development tools. Trust me on this one, I had to provide support once for a company whos two biggest apps were written using Word Macros and MS Access 2.

      Seriously if someone in the company I work for said that they wanted a complete App written under VBA, I would have to tell them to think again. At the moment we're stuck with bloody excel macros for a lot of what we do and boy do they bite.

      As for functionality, I think you'll find that Open Office offers all the functionality that your average user is going to need.

      Have you looked at Open or Star Office running on anything but Windows? It runs fine on my Linux box and Im running a PII 450.

    2. Re:FUD alert by The+Bungi · · Score: 1
      Trust me on this one, I had to provide support once for a company whos two biggest apps were written using Word Macros and MS Access 2.

      That's not VBA. VBA was introduced with Office 97.

      At the moment we're stuck with bloody excel macros for a lot of what we do and boy do they bite.

      Macros != VBA. I've seen some pretty impressive stuff written in VBA, and yes, I did see a lot of crap back in the macro days, especially in Access. But things have changed.

      As for functionality, I think you'll find that Open Office offers all the functionality that your average user is going to need.

      Probably true, and Office does suffer from feature bloat. But that wasn't my point.

      Have you looked at Open or Star Office running on anything but Windows? It runs fine on my Linux box and Im running a PII 450.

      No, but I was talking about Windows =)

    3. Re:FUD alert by cranos · · Score: 2

      Alright Macros might != VBA but I still say that using what is essentially a hack of a hack of a weak language to run business applications on is not a good thing.

      I realise that you were talking about Windows, but in regards to Open and Star Office, one of the major benefits they provide is the ability not to get locked into the whole Dual Boot nightmare. By allowing people to access Word Docs, Excel SpreadSheets and so on, on other platforms, it opens up the arena in terms of competition on the desktop. We've already replaced a lot of MS Office licenses around here with Open Office and will be replacing more once the lack of Scripting Support is remedied.

      Remember Competition Good, unlawfully Maintained Monopoly bad.

  71. Forget it by TerryAtWork · · Score: 1, Redundant

    MS Software will NEVER be open. MS is all about closed proprietary systems people have to pay money to use.

    Sooner or later Bill will grabe the Internet. I don't know how he's going to do it, but if there's anyone smart enough to do it it's him.

    --
    It's Christmas everyday with BitTorrent.
  72. XML IS Office 11? Pah by TrippyZ · · Score: 2, Interesting

    Does everyone remember how Office 10 was promoted as the BIG XML release? And now Hailstorm has disappeared too.

  73. open by Anonymous Coward · · Score: 0

    Ha...its open if you pay microsoft a nominal fee of 1 billion trillion dollars

  74. dialect? Maybe is XML by Anonymous Coward · · Score: 0

    But there's a catch: It has yet to disclose the underlying XML dialect

    Sure they did. Its XML, just like you stated in your post. Mistery solved. Oh, did you mean the schema (if so, thats not a dialect)? Wait, I know. Maybe its not done yet so they are waiting to publish.. Na, cant be that simple... Or could you just be paranoid...

  75. Re:yeah by Anonymous Coward · · Score: 0

    As I never tire of pointing out, the "science" of economics in the United States can be summed up in three sentences:

    1. Charge more than your competitors - price gouging.
    2. Charge the same as your competitors - price fixing.
    3. Charge less than your competitors - price dumping.

  76. Here's the format by saider · · Score: 1, Funny
    "Hello World" in Office 11 XML looks like this...
    <XML>
    <WORDDOC>
    #%@F %@gfsd $^HW$@N@$B^^B$@YV$%(afgasU$agT()B*agagH^(U
    VMQ#%) %G%)GaBfgasfghEPQ)_T$I^))(
    V$<@MO%)^
    VM@%))HK)^^_KH$^_
    </WORDDOC>
    </ XM L >
    --


    Remember, You are unique...just like everyone else.
  77. RTF? Ha! by MamasGun · · Score: 1

    Microsoft's embraced-and-extended RTF cannot be read by AppleWorks, aka ClarisWorks. When MS Word encounters RTF from AppleWorks, it doesn't parse it right and you are left with gobbledygook. MS and standards are an inherent contradiction.

    --
    "But you've already got a DVD. It lasts forever....In the digital world, we don't need back-ups..."
    -- Jack Valenti
  78. Yes it could be grounds. by GOD_ALMIGHTY · · Score: 4, Informative

    This is a monopoly. They have been found in violation of Anti-Trust laws and held up on appeal. The government has a legitimate reason to tell them how to conduct their business and every right to do so.

    Simply because the Anti-Trust trial focused on the OS rather than Office software, does not mean that the government has no reason to impose restrictions to keep MS from shifting their monopoly power. MS's monopoly has been under government scrutiny for almost 10 years, but we still get a bunch of posts on here about how the government shouldn't be able to tell 'a company' what to do. Either the trolls are really busy or you guys decided to skip Economics 101 for Libratarian Fanaticism 101.

    In order to maintain a capitalist system, we must have competition. Without healthy competition, we don't have capitalism. The government has an obligation to step into an otherwise free market to ensure that competition stays healthy. There is no magical 'Free Market Fairy' that is going to come along and restore health to the industry.

    So yes, depending on the result of the States' AG cases and the DOJ's settlement, MS could very much be liable for making their document formats some sort of completely bastardized XML. If you want to know the probability, then you should go read the settlements, and the grievences in the new filings against MS.

    --
    Arrogance is Confidence which lacks integrity. -- me
    1. Re:Yes it could be grounds. by poot_rootbeer · · Score: 2


      YANAL, STFU.

      IANAL either, but my understanding is that when a company is found guilty of monopolistic business practices, the remedies must specifically address those practices. Until Microsoft is found guilty of abusing its market status in the realm of application software, the government has no authority to tell MS how to run its Office division.

      I don't trust the business world to police itself enough to propose a true laissez-faire system, but neither do I think it's a good idea to give government unlimited power to meddle in business affairs.

      Besides which, there's nothing illegal about having a closed document format, even if it's encapsulated within an open structure like XML.

    2. Re:Yes it could be grounds. by Anonymous Coward · · Score: 0

      The monopoly charge was trumped up. Monopoly of what? No "desktop computers". Because they aren't. No Apple, Sun, IBM, all those machines were thrown out of the definition ebcuase if they hadn;t been there was no "monopoly". So MS was found to have a monopoly on i86 IBM Clone PCs, whoop dee do. Even someone as simple as yourself should be able to see that. MS has triumphed on 99% of everything since because all the judges see that there is in fact NO CASE. There really never was. Suing for inclusion of IE into Windows for free? Well Sue ALL of Open source for being free then! You are obviously giving away something of value in order to gain market share and destroy a competitor. THAT IS ILLEGAL, even when you don't have a "monopoly".

      I find that the O.S. and Linux crowd in general to have far LESS integrity than MS has ever shown. You and your post are further proof of that.

    3. Re:Yes it could be grounds. by Malcontent · · Score: 2

      "I don't trust the business world to police itself enough to propose a true laissez-faire system, but neither do I think it's a good idea to give government unlimited power to meddle in business affairs."

      But the govt does not have unlimited power. There has to be a trial and endless appeals first. Besides MS had 50 billion to spend on this trial which is much less then the budget the justice dept allocated for it.

      --

      War is necrophilia.

    4. Re:Yes it could be grounds. by runderwo · · Score: 2
      Why are AC's invariably clueless idiots?
      So MS was found to have a monopoly on i86 IBM Clone PCs, whoop dee do.
      Having a monopoly is not illegal. Using anti-competitive practices and product tying when you are a monopoly is illegal.
      MS has triumphed on 99% of everything since because all the judges see that there is in fact NO CASE.
      No, in fact, Jackson threw the book at them, and they got off on a technicality (he blabbed to the press before the case was over).
      Suing for inclusion of IE into Windows for free? Well Sue ALL of Open source for being free then!
      This statement is so idiotic, I don't know where to begin. First of all, Internet Explorer was originally a separate product. Microsoft then tied it to Windows when they saw that Netscape was remaining dominant in the browser market.

      Second, "Open Source" is not a monopoly. If, for instance, Red Hat somehow gained monopoly status in the next few years, then they would be subject to the same rules that MS was subject to, in that they can't take separate products and bundle them with the monopoly product.

      You are obviously giving away something of value in order to gain market share and destroy a competitor. THAT IS ILLEGAL, even when you don't have a "monopoly".
      Are you kidding? Businesses give stuff away for free all the time, and it's not illegal in the slightest. Unless you are a monopoly, in which case different rules apply; the reason is to prevent horizontal expansion, and to prevent the monopoly from erecting barriers to entry in the monopolized market.

      A monopoly, by definition, has no need to gain market share; and so anti-competitive acts and other things that were fair business when the weren't a monopoly are no longer fair, because they have no reason to use them except to maintain their monopoly status. And that's the whole point of antitrust law, so that monopolies are not indefinitely maintained.

      I find that the O.S. and Linux crowd in general to have far LESS integrity than MS has ever shown. You and your post are further proof of that.
      You have proven yourself to be such an idiot that I doubt anybody could care less what you think.

      Yes, IHBT, whatever.

    5. Re:Yes it could be grounds. by mrobinso · · Score: 1

      > The government has a legitimate reason to tell
      > them how to conduct their business and every
      > right to do so.

      Ah, no.
      The government has a DUTY to direct them on how NOT to conduct their business. After all, this is 21st century North America, not 1939 Europe.

      Mike

      Ok ok, I'll be good. Gimme back my karma.

      --
      -- Karma whore? You betcha. --
    6. Re:Yes it could be grounds. by Arandir · · Score: 2

      There is no magical 'Free Market Fairy' that is going to come along and restore health to the industry.

      You're right, in the face of monopoly, only the ultimate monopoly can save us. After all, Linux didn't appear out of thin air, it was the government that created it! Linus Torvalds was sitting at home whining about DOS before the all powerful state stepped in and wrote an OS for him...

      --
      A Government Is a Body of People, Usually Notably Ungoverned
    7. Re:Yes it could be grounds. by angle_slam · · Score: 1
      First of all, Internet Explorer was originally a separate product. Microsoft then tied it to Windows when they saw that Netscape was remaining dominant in the browser market.

      Or maybe MS thought that browsers were so useful that users would want a browser with their OS? Obviously the makers of OS X think that users want a browsr, because they include one with their OS, as does Red Hat and a large number of other Linux distributers. Linux distributors decide that users want a browser with their OS and that's a good thing. But when MS decides users want a browser with their OS, that's a bad thing.

    8. Re:Yes it could be grounds. by runderwo · · Score: 2
      Or maybe MS thought that browsers were so useful that users would want a browser with their OS?
      You can bundle a product without tying it. Contrast the following scenarios for me, please:
      • distributing a copy of Internet Explorer for free, or free download, with each copy of Windows sold
      • integrating Internet Explorer into the very guts of Windows so that it would be very difficult if not impossible to remove, and present onerous licensing terms to OEMs that prevent them from shipping any other browser on Windows systems

      Guess which one's legal? Guess which one MS did?

  79. Draw you Own Conclusions by Alien54 · · Score: 5, Funny
    well, tongue in cheek

    the Love Caculator demonstrates that

    Draw your own conclusions. cute little widget.

    --
    "It is a greater offense to steal men's labor, than their clothes"
    1. Re:Draw you Own Conclusions by mt_nixnut · · Score: 1
      Awesome work!

      Have you ever considered starting your own consulting firm?

      I am sure you could do better with your Love calculator than most of the big pundits and prognosticators I have seen printing things lately ;)

      In fact if you simply strip off the lovecalculator.com disclaimer and say it was produced by some (bogus) study no one would be the wiser, since in this great land (USA).
      Said in media == true.
      Even though most will deny it.

      Thanks for the laugh!

    2. Re:Draw you Own Conclusions by epcraig · · Score: 1

      I'm not gonna comment on the 84% Love Calculator gave "Richard Stallman loves Microsoft".

      --
      Ed Craig "Who cares what you think?" George W. Bush, 4th of July 2001
    3. Re:Draw you Own Conclusions by Anonymous Coward · · Score: 0

      you almost forgot...

      bill gates loves linux

    4. Re:Draw you Own Conclusions by Anonymous Coward · · Score: 2, Funny

      Or the 47% "penis loves vagina." Its description isn't terribly promising for the human race!

    5. Re:Draw you Own Conclusions by Anonymous Coward · · Score: 0

      my foot loves your ass

      95 %

      Dr. Love thinks that a relationship between my foot and your ass has a very good chance of being successful, but this doesn't mean that you don't have to work on the relationship. Remember that every relationship needs spending time together, talking with each other etc.

    6. Re:Draw you Own Conclusions by Anonymous Coward · · Score: 0

      These are the results of the calculations by Dr. Love:
      Bill Gates Free Software Foundation

      95 %

      Dr. Love thinks that a relationship between Bill Gates and Free Software Foundation has a very good chance of being successful, but this doesn't mean that you don't have to work on the relationship. Remember that every relationship needs spending time together, talking with each other etc.

      Ummmm... no.

    7. Re:Draw you Own Conclusions by EschewObfuscation · · Score: 2
      On a hunch I also discovered:
      • Microsoft Loves Virus 97%, and vice versa
      --

      (email addr is at acm, not mca)
      We are Number One. All others are Number Two, or lower.
      --The Sphinx
    8. Re:Draw you Own Conclusions by Anonymous Coward · · Score: 0

      You missed one that is kinda central to the thread (sorry no link):

      Microsoft loves Open Source - 18%

    9. Re:Draw you Own Conclusions by dalamcd · · Score: 1
      Silly, it works on names.

      The human race is assured.

      dalamcd

      --
      moer liek CELtroid prime!!@1!
    10. Re:Draw you Own Conclusions by Alien54 · · Score: 2
      These are the results of the calculations by Dr. Love:
      Bill Gates Free Software Foundation 95 %

      Well, this is based on numerology somehow ... The usual industrial size grain of salt applies.

      But maybe it means that Bill Gates is desperately fighting against his inner geek, who would really love Free Software, etc.

      --
      "It is a greater offense to steal men's labor, than their clothes"
    11. Re:Draw you Own Conclusions by mkweise · · Score: 1

      After running a few queries (results below,) my conclusion is that the "Love Calculator" suffers from some bad rounding or something. It acutally manages to produce less accurate results than a random number generator.

      Dr. Love thinks that a relationship between IBM and Microsoft has a reasonable chance of working out, but on the other hand, it might not. Your relationship may suffer good and bad times. If things might not be working out as you would like them to, do not hesitate to talk about it with the person involved. Spend time together, talk with each other.

      The chance of a relationship working out between Slashdot and Beowulf Clusters is not very big

      Dr. Love thinks that a relationship between Windows and a rabid monkey has a very good chance of being successful, but this doesn't mean that you don't have to work on the relationship. Remember that every relationship needs spending time together, talking with each other etc. (76%)

      Also, for some reason the following query bugs out completely:
      http://www.lovecalculator.com/love.php?name1=Micro soft+Office&name2=a+retarded+sewer+rat

      --
      Gentlemen! You can't fight in here, this is the War Room!
    12. Re:Draw you Own Conclusions by Alien54 · · Score: 2
      I'm not gonna comment on the 84% Love Calculator gave "Richard Stallman loves Microsoft".

      What you want is Richard Stallman loves Steve Ballmer 26%

      --
      "It is a greater offense to steal men's labor, than their clothes"
    13. Re:Draw you Own Conclusions by rjch · · Score: 1
      The human race is assured
      Don't be so sure... We could be heading for serious trouble...
    14. Re:Draw you Own Conclusions by Anonymous Coward · · Score: 0

      obviously you are very learned to be sure!

    15. Re:Draw you Own Conclusions by Anonymous Coward · · Score: 0

      Here's what I got...

      Microsoft + freedom: 43%

      "The chance of a relationship working out between microsoft and freedom is not very big, but a relationship is very well possible, if the two of you really want it to, and are prepared to make some sacrifices for it. You'll have to spend a lot of quality time together. You must be aware of the fact that this relationship might not work out at all, no matter how much time you invest in it."

      Say no more...

    16. Re:Draw you Own Conclusions by sharkey · · Score: 2

      Bill Gates Loves Janet Reno 92%, giving them a good chance of having a good relationship.

      On the other hand, Bill Gates Loves Spuds Mackenzie 63%. Maybe Bill is a closet necrophile who's obsessed with small dogs?

      --

      --
      "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
    17. Re:Draw you Own Conclusions by Anonymous Coward · · Score: 0
    18. Re:Draw you Own Conclusions by Alien54 · · Score: 2
      Also, for some reason the following query bugs out completely: http://www.lovecalculator.com/love.php?name1=Micro soft+Office&name2=a+retarded+sewer+rat

      The limit is three words per field [shrug]

      remember, this is not my widget [smile]

      --
      "It is a greater offense to steal men's labor, than their clothes"
    19. Re:Draw you Own Conclusions by mkweise · · Score: 1

      Nope, that can't be it because: William Jefferson Clinton III loves Al Gore 17 %

      Do you know what it actually does? When I first saw it, I thought it probably used google searches to see how frequently both strings are used in each other's vicinity with (a) "good" verbs and (b) "bad" verbs inbetween, the report a/(a+b) as a percentage. I guess it's really just a lame function based on the sums of the ASCII values of each string, modulo some arbitrary number.

      Well, at least there are some things it gets right:
      Cartman loves Cheesy Poofs 91 %
      Al Gore loves the internet 94 %

      --
      Gentlemen! You can't fight in here, this is the War Room!
    20. Re:Draw you Own Conclusions by alexburke · · Score: 2

      Likewise, from corporate and personal perspectives, the Love Calculator is [right on the money|pretty accurate|crazy|WTF?].

    21. Re:Draw you Own Conclusions by nicodaemos · · Score: 2

      Dr. Love thinks that a relationship between bill gates and osama bin laden has a very good chance of being successful

      In fact he gives it an 89% chance.

      Makes a lot of sense to me. They're both doing their part to kill corporate America. Bin Laden with bombs and Bill G with his monopoly. One has to wonder who is winning between them.

  80. Microsoft opening? Naw... Waiting for Palladium. by Anonymous Coward · · Score: 2, Interesting

    I seriously doubt that Microsoft is opening anything that they previously held private. This just isn't Microsoft's way. They've previously held .DOC, .XLS, etc private and obscured them to the point that 3rd party programs have a difficult time accurately opening them. This has worked fairly well for them, but it is also a thorn in Microsoft's side, as each new version of Office needs to hold compatible to all that legacy stuff, plus the new formats.

    What if they could scrap all that and have an easily read document format? They could tighten integration with IIS -> Office and web pages generated from saved documents, spreadsheets, etc. An XML file format can do it. This would be something MS would like to do.

    The problem is XML could be readable by anyone. Or at least it CURRENTLY could. But, what if, MS had a technology to transparently encrypt/decrypt files on the save/read? And, what if the keys to those files were then stored in a protected memory vault that only trusted apps could get to? A trusted nub could ensure that the apps weren't tampered with... You can see where this is going.

    As I understand it, with Palladium, MS could declare that the next Word format is PlainText, but documents still wouldn't be able to be opened by 3rd party software, as they aren't trusted by MS to hold the keys to decrypt the data files.

    It's a win/win for Microsoft. They get to dump legacy code and create something simpler, while gaining greater control over how people use their own files. It's a win/lose for the consumer, though. They'll get new functionality if they stay all Microsoft, but will be locked into an all/nothing choice of whether they choose the MS route, or not.

    THAT, to me, sounds like a typical MS business plan.

  81. Re:"Could this be grounds for another lawsuit?" WT by BigBir3d · · Score: 1

    I assume that Microsoft will argue that the 'standard' is 'open' to interpretation... ;-)

  82. Even grep replacing doesn't help by burgburgburg · · Score: 5, Informative
    Word HTML output was always atrocious. It failed everywhere from correct tag order (as is shown above), not properly quote parameters (sometimes it uses ", sometimes it uses ', sometimes nothing). Multiple tags, all with different styles one after another (actual example below)
    <b style='mso-bidi-font-weight:normal'><i style='mso-bidi-font-style:normal'><span
    style='f ont-size:12.0pt;mso-bidi-font-size:10.0pt;font-fam ily:Arial;mso-fareast-font-family:
    "Times New Roman";mso-bidi-font-family:"Times New Roman";color:black;
    mso-ansi-language:EN-US;mso-f areast-language:EN-US;mso-bidi-language:AR-SA'><br
    clear=all style='page-break-before:right;mso-break-type:sect ion-break'>
    </span></i></b>

    Even with grep replace tools, cleaning up this crap takes hours.

    1. Re:Even grep replacing doesn't help by HiThere · · Score: 2

      Did you insert those spaces, or did MSWord, or did SlashDot?

      At first I thought that it was impossibly bad, and then I remembered that slashdot filter.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    2. Re:Even grep replacing doesn't help by sgarrity · · Score: 2, Informative

      I use this Word HTML cleaner web service. Works well. Drop a penny in the paypal bucket if you like it.

    3. Re:Even grep replacing doesn't help by kazad · · Score: 2, Informative

      Dreamweaver has a "clean up word html" option. But then again, another proprietary solution.

    4. Re:Even grep replacing doesn't help by Anonymous Coward · · Score: 0

      There is always HTML Tidy. That page also has links to programs (some binary, some source) that use the HTML Tidy program in very constructive ways. Years ago, I used it to replace Word HTML with simple HTML and got elegant style sheets (i.e. no more infinite font declarations, just one or two at the top).

  83. Hello World in Word 11 XML: by Anonymous Coward · · Score: 0



    Hello World.

    1. Re:Hello World in Word 11 XML: by Puu · · Score: 1

      No, rather it is:

      Hello Word.

  84. XML is an open standard, so all XML docs are open? by nate.sammons · · Score: 1

    I think the MS logic goes something like this:

    1) Convert MS file formats to XML.

    2) Tell everyone you're using XML, and since XML is an "open standard" then clearly all XML documents are open, right?

    3) Never, ever, ever tell anyone what should go in the XML docs.

    4) Change the "standard" often enough that 3rd party companies are never able to read the files.

    I think if you replace "XML" with "binary" then you have the current MS plan. Just because they are using XML doesn't mean they are open. In this case, XML is just as open as binary files, since 1s and 0s are an open standard, right?

    right.

    -nate

  85. Well... here is a TRUE example by twistedemotions · · Score: 1

    This is ripped from another board where a beta tester posted it up.

    The dog ran up the hillThe dog ran up the hillThe dog ran up the hillDogRanUpTheHill

  86. Re:"Could this be grounds for another lawsuit?" WT by Wakko+Warner · · Score: 3, Insightful

    XML, as a language spec, is most certainly open. It's what you do with the spec that makes it closed. C is also an open spec, but if I write a program in C, I'm by no means obligated to give everyone the source code to it (despite what some people here insist is the "right thing to do" in all cases.)

    - A.P.

    --
    "Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"
  87. Could new .XML doc format be LESS open than .DOC? by NetShadow · · Score: 2, Insightful

    One thing that nobody seems to have considered yet is the possibility that, not only might this new XML Word Document format not be "open" as currently being assumed and touted, but it might be less open than the binary junk that Word spits out now.

    It seems from the context of the quotes in the article, Microsoft is very much concerned about how interoperable Word documents are now that they have been reverse-engineered and implemented from scratch in OpenOffice / StarOffice, WordPerfect, etc. .DOC is too open (meaning well-understood with a large base of source code to process it). They have stated as much in the article. MS Office is now becoming "just another Office Suite, same as the rest." They want Word to be "less of a commodity".

    Here's my theory:

    Besides value-added features, such as the internet calandar and workgroup features that have been dropped, the best way to achieve this differentiation would be to engineer an incompatible default format (an obfuscated XML DTD or binary encoding format) for new Word documents, leverage their massive installed base of desktop users, and fire up the good-ole FUD-o-matic 9000...

    Boom! Office 11 Ships, creating new, incompatible format with new, incompatible documents floating around the LAN, marginalizing the use of Open Source / "fringe" Office software.

    MS FUD: "But Open Source / Free Software Word Processors just don't work properly with the cutting-edge features of Office 11!". "They don't have the new whiz-bang features like 'Enhanced' XML, which Office depends on."

    No, Mr. Hacker, you can't use Open Office. The company policy is for everyone to use Microsoft Word, because we want everyone to be able to read everyone's documents. By the time the OSS hackers completely reverse engineer the file format, the damage will have been done. And the few glitches in compatibility in engineering compatibility into OSS Office Software will be more fuel for the FUD fire, emphasising how buggy open source software is, and Microsoft is the best choice for 100% correct display and authoring of Word Documents for your MS Office-Run Business.

    And until Office 11 ships and they're ready to roll with this new spin, they can take advantage of the hype regarding XML and how wonderful their new file-format will be, see, this Open Office package isn't so special! We can do you one better! XML is designed to be Open, see?

    Then, in reality, the new document format will be more closed to us, because we don't know how to read it. Trust me, they won't make it easy. They gain too much by closing up the new format and throwing away the key, profiting from the time it takes to pick and chisel away at the locks.

    --
    NetShadow
  88. Closed file formats are worse than closed apps by Anonymous Coward · · Score: 2, Insightful

    Business and personal users are starting to wake up to the fact that storing valuable, durable information and knowledge in proprietary file formats is not a good idea. Internet formats and communication standards illustrate the power of widely-adopted technical standards well. Business documents, technical documents, personal records, photographs, music, movies -- anything that may be of value and interest in the unforeseeable future must be stored in an open format to retain that value.

    I think this is a more compelling "pitch" for open source that the usual line of "if you can't get the source you can't fix the bugs".

  89. Doh! Ah here is the code above was the output by twistedemotions · · Score: 2, Interesting

    <?xml version="1.0" encoding="UTF-8" standalone="no"?>
    <?mso-application progid="Word.Document"?>
    <w:wordDocument xmlns:w="http://schemas.microsoft.com/office/word/ 2002/8/wordml" xmlns:v="urn:schemas-microsoft-com:vml" xmlns:w10="urn:schemas-microsoft-com:office:word" xmlns:SL="http://schemas.microsoft.com/schemaLibra ry/2002/8/core" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instan ce" xmlns:aml="http://schemas.microsoft.com/aml/2001/c ore" xmlns:wx="http://schemas.microsoft.com/office/word /2002/8/auxHint" xmlns:o="urn:schemas-microsoft-com:office:office" xml:space="preserve"><w:docInfo><w:tit le w:val="The dog ran up the hill"/><w:author w:val="Peter James Templeton"/><w:template w:val="Normal.dot"/><w:lastAuthor w:val="Peter James Templeton"/><w:revision w:val="1"/><w:appName w:val="Microsoft Word 11.0"/><w:totalEdit w:val="1"/><w:created w:val="2002-12-19T21:50:00Z"/><w:lastSaved w:val="2002-12-19T21:51:00Z"/><w:pages w:val="1" wx:estimate="true"/><w:words w:val="13" wx:estimate="true"/><w:characters w:val="80" wx:estimate="true"/><w:lines w:val="1" wx:estimate="true"/><w:paras w:val="1" wx:estimate="true"/><w:charactersWithSpaces w:val="92" wx:estimate="true"/><w:version w:val="11.4523"/></w:docInfo><w:docPr><w:vie w w:val="normal"/><w:zoom w:percent="175"/><w:doNotEmbedSystemFonts/><w:proo fState w:spelling="clean" w:grammar="clean"/><w:documentProtection/><w:defau ltTabStop w:val="720"/><w:defaultFonts w:ascii="Times New Roman" w:fareast="Times New Roman" w:h-ansi="Times New Roman" w:cs="Times New Roman"/><w:summaryLength w:val="0"/><w:punctuationKerning/><w:characterSpac ingControl w:val="DontCompress"/><w:optimizeForBrowser/><w:va lidateAgainstSchema/><w:saveInvalidXML w:val="off"/><w:compat><w:breakWrappedTables/><w:s napToGridInCell/><w:wrapTextWithPunct/><w:useAsian BreakRules/></w:compat></w:docPr><w:fonts><w:fo nt w:name="Wingdings"><w:panose-1 w:val="05000000000000000000"/><w:charset w:val="2"/><w:family w:val="Auto"/><w:pitch w:val="variable"/><w:sig w:usb-0="00000000" w:usb-1="10000000" w:usb-2="00000000" w:usb-3="00000000" w:csb-0="80000000" w:csb-1="00000000"/></w:font></w:fonts><w:lists><w :listDef w:listDefId="0"><w:lsid w:val="47EF5BD8"/><w:plt w:val="HybridMultilevel"/><w:tmpl w:val="7EE46F94"/><w:lvl w:ilvl="0" w:tplc="04090001"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="h"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="720"/></w:tabs><w:ind w:left="720" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Symbol" w:h-ansi="Symbol" w:hint="default"/></w:rPr></w:lvl><w:l vl w:ilvl="1" w:tplc="04090003" w:tentative="on"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="o"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="1440"/></w:tabs><w:ind w:left="1440" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Courier New" w:h-ansi="Courier New" w:cs="Courier New" w:hint="default"/></w:rPr></w:lvl><w:l vl w:ilvl="2" w:tplc="04090005" w:tentative="on"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="X"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="2160"/></w:tabs><w:ind w:left="2160" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Wingdings" w:h-ansi="Wingdings" w:hint="default"/></w:rPr></w:lvl><w:l vl w:ilvl="3" w:tplc="04090001" w:tentative="on"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="h"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="2880"/></w:tabs><w:ind w:left="2880" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Symbol" w:h-ansi="Symbol" w:hint="default"/></w:rPr></w:lvl><w:l vl w:ilvl="4" w:tplc="04090003" w:tentative="on"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="o"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="3600"/></w:tabs><w:ind w:left="3600" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Courier New" w:h-ansi="Courier New" w:cs="Courier New" w:hint="default"/></w:rPr></w:lvl><w:l vl w:ilvl="5" w:tplc="04090005" w:tentative="on"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="X"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="4320"/></w:tabs><w:ind w:left="4320" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Wingdings" w:h-ansi="Wingdings" w:hint="default"/></w:rPr></w:lvl><w:l vl w:ilvl="6" w:tplc="04090001" w:tentative="on"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="h"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="5040"/></w:tabs><w:ind w:left="5040" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Symbol" w:h-ansi="Symbol" w:hint="default"/></w:rPr></w:lvl><w:l vl w:ilvl="7" w:tplc="04090003" w:tentative="on"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="o"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="5760"/></w:tabs><w:ind w:left="5760" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Courier New" w:h-ansi="Courier New" w:cs="Courier New" w:hint="default"/></w:rPr></w:lvl><w:l vl w:ilvl="8" w:tplc="04090005" w:tentative="on"><w:start w:val="1"/><w:nfc w:val="23"/><w:lvlText w:val="X"/><w:lvlJc w:val="left"/><w:pPr><w:tabs><w:tab w:val="list" w:pos="6480"/></w:tabs><w:ind w:left="6480" w:hanging="360"/></w:pPr><w:rPr><w:rFonts w:ascii="Wingdings" w:h-ansi="Wingdings" w:hint="default"/></w:rPr></w:lvl></w:listDef><w:l ist w:ilfo="1"><w:ilst w:val="0"/></w:list></w:lists><w:styles><w:version OfBuiltInStylenames w:val="3"/><w:style w:type="paragraph" w:default="on" w:styleId="Normal"><w:name w:val="Normal"/><w:rPr><wx:font wx:val="Times New Roman"/><w:sz w:val="24"/><w:sz-cs w:val="24"/><w:lang w:val="EN-US" w:fareast="EN-US" w:bidi="AR-SA"/></w:rPr></w:style><w:styl e w:type="character" w:default="on" w:styleId="DefaultParagraphFont"><w:name w:val="Default Paragraph Font"/><w:semiHidden/></w:style><w:sty le w:type="table" w:default="on" w:styleId="TableNormal"><w:name w:val="Normal Table"/><wx:uiName wx:val="Table Normal"/><w:semiHidden/><w:rPr><wx:fon t wx:val="Times New Roman"/></w:rPr><w:tblPr><w:tblI nd w:w="0" w:type="dxa"/><w:tblCellMar><w:top w:w="0" w:type="dxa"/><w:left w:w="108" w:type="dxa"/><w:bottom w:w="0" w:type="dxa"/><w:right w:w="108" w:type="dxa"/></w:tblCellMar></w:tblPr></w:style>< w:style w:type="list" w:default="on" w:styleId="NoList"><w:name w:val="No List"/><w:semiHidden/></w:style></w:styles><w:body ><wx:sect><w:p><w:r><w:t>T he dog ran up the hill</w:t></w:r></w:p><w:p><w:pPr><w:rPr><w:b/></w :rPr></w:pPr><w:r><w:rPr><w:b/></w:rPr><w:t>The dog ran up the hill</w:t></w:r></w:p><w:p><w:pPr><w:rPr><w:i/></w :rPr></w:pPr><w:r><w:rPr><w:i/></w:rPr><w:t>The dog ran up the hill</w:t></w:r></w:p><w:p><w:pPr><w:listPr><w:ilv l w:val="0"/><w:ilfo w:val="1"/><wx:t wx:val="P"/><wx:font wx:val="Symbol"/></w:listPr><w:rPr><w:i/></w:rPr>< /w:pPr><w:r><w:rPr><w:i/></w:rPr><w:t>Dog</w:t></w :r></w:p><w:p><w:pPr><w:listPr><w:ilvl w:val="0"/><w:ilfo w:val="1"/><wx:t wx:val="P"/><wx:font wx:val="Symbol"/></w:listPr><w:rPr><w:i/></w:rPr>< /w:pPr><w:r><w:rPr><w:i/></w:rPr><w:t>Ran</w:t></w :r></w:p><w:p><w:pPr><w:listPr><w:ilvl w:val="0"/><w:ilfo w:val="1"/><wx:t wx:val="P"/><wx:font wx:val="Symbol"/></w:listPr><w:rPr><w:i/></w:rPr>< /w:pPr><w:r><w:rPr><w:i/></w:rPr><w:t>Up</w:t></w: r></w:p><w:p><w:pPr><w:listPr><w:il vl w:val="0"/><w:ilfo w:val="1"/><wx:t wx:val="P"/><wx:font wx:val="Symbol"/></w:listPr><w:rPr><w:i/></w:rPr>< /w:pPr><w:r><w:rPr><w:i/></w:rPr><w:t>The</w:t></w :r></w:p><w:p><w:pPr><w:listPr><w:ilvl w:val="0"/><w:ilfo w:val="1"/><wx:t wx:val="P"/><wx:font wx:val="Symbol"/></w:listPr><w:rPr><w:i/></w:rPr>< /w:pPr><w:r><w:rPr><w:i/></w:rPr><w:t>Hill</w:t></ w:r></w:p><w:p><w:pPr><w:rPr><w:i/></w:rPr></w:pPr ></w:p><w:sectPr><w:footnotePr><w:p os w:val="page-bottom"/></w:footnotePr><w:endnotePr>< w:pos w:val="doc-end"/><w:numFmt w:val="lower-roman"/></w:endnotePr><w:typ e w:val="next-page"/><w:pgSz w:w="12240" w:h="15840" w:orient="portrait"/><w:pgMar w:top="1440" w:right="1800" w:bottom="1440" w:left="1800" w:header="720" w:footer="720" w:gutter="0"/><w:noEndnote w:val="off"/><w:docGrid w:line-pitch="360"/></w:sectPr></wx:sect></w:body> </w:wordDocument>

    1. Re:Doh! Ah here is the code above was the output by gazbo · · Score: 1
      I don't know whether you're using this code as an example for or against MS, but to me it looks perfectly reasonable at a glance. Remember that this is not a toy app, people want this to be able to represent exactly what is on screen, as well as the behind the scenes options.

      By printing it as a monolithic block of text, of course it is illegible, however if you were to look at it through a pretty-printer I'm sure it'd look fine - and remember it's not supposed to be parsed by a human. I intended to save this and view it through IE to let it format it for me, however it is corrupt (possibly due to slashdot adding spaces, possibly due to the original code being invalid).

      A link to where you got this from would be appreciated so I could check it out, and indeed check it's genuine. The reason I would be surprised if it is invalid was that the format was given the blessing by a beta-tester who sat on the panel who designed XML (I forget the details, it was on /. a while back)

    2. Re:Doh! Ah here is the code above was the output by JebusIsLord · · Score: 2

      So lets look at this... the namespaces are all published in clear text as urls, no binary data is apparent, and no dtd or schema is even used. It is also well-formed using the W3C's own validator. In short, I see no problems at all, especially when saved as an xml doc and viewed in IE where it formats it all pretty.

      --
      Jeremy
    3. Re:Doh! Ah here is the code above was the output by Anonymous Coward · · Score: 0
      the namespaces are all published in clear text as urls, no binary data is apparent, and no dtd or schema is even used. It is also well-formed using the W3C's own validator.

      What.The.Fuck???1

      This is an office document we're talking about here, not a webpage. You need to catch up on your XML knowledge. Not everything is supposed to be set up for validation. When they say Microsoft needs to make XML the de facto format, they don't have the "Save for Web.." in mind.

      As for formatting, I'm pretty sure /. formatted it by Plain Old Text settings. Had he used the {ecode} tag, it would have shown up perfectly.
    4. Re:Doh! Ah here is the code above was the output by JebusIsLord · · Score: 2

      I know it doesn't NEED to have urls as namespace declarations, but it does which is nice - especially if those eventually actually point to real documents (i assume they dont right now). By validation I don't mean xhtml validation obviously i mean is is simply well-formed xml.
      Oh and the original poster was trying to make a show of how hard the thing is to read, and my point regarding formatting was just that he dumped it all as a block there, which can be easily sorted out if for instance you viewed it as an xml document in IE. Relax dude, I know its not html :)

      --
      Jeremy
  90. XML dialect - Say it with me. by Wolfier · · Score: 2

    Urx Earm Alloa diaolig!!

  91. Hello World in Office 11 XML (2nd try) by Anonymous Coward · · Score: 0



    Hello World.

  92. XML..... by Tsali · · Score: 2

    Did they mention Extensible Markup Language in the article or could it be one of these???

    - eXtra Money Language
    - eXtremely Microsoft Language
    - eXtra MuddLed.
    - eXtraneous Markup Language
    - eXtrapolated Modded Licensing
    - XBox Machine Language
    - XDocs Monopoly Language

    Can someone clear this up? I don't have to time to tinker with the whole "reading articles" concept.

    --
    This space for rent.
  93. IN SOVIET RUSSIA... by Anonymous Coward · · Score: 0

    Microsoft Office opens YOU!

  94. MOD THIS GUY UP by nate.sammons · · Score: 1


    this is an excellent point.

  95. exactly by ink · · Score: 3, Informative

    I wish I had some mod points for you; that's exactly what Microsoft means when they say that their documents are saved using XML. They include Win32 class-ID objects all over the place.

    --
    The wheel is turning, but the hamster is dead.
  96. Word no longer a monopoly by rjnagle · · Score: 1

    Given that by default, Corel WP, not MS Word comes installed on home machines by HP, Gateway and even Dell, it's hard to say that MS has a monopoly on Office software anymore.

    HP and Gateway switched a few months ago, and last time I checked a week or two ago, I saw that the same is true with Dell.com also.

    Now, this is partly irrelevant. Office is an overwhelming majority among PC users/owners now, and especially dominant in the business world. Still, it is encouraging that the major OEM's are at least emboldened to offer non Microsoft productivity software for new systems.

    rj

    --
    Robert Nagle, Idiotprogrammer, Houston
  97. Just because they're putting "XML support" in it.. by JohnnyBigodes · · Score: 1

    ... it doesn't mean that .DOC isn't the default document format. When the Average Joe is confronted with the "Save file" dialog box, if he sees "Microsoft Word Document format" in the box, then .DOC it goes. If he sees something else, it goes anyway.

    As long as the binary .DOC format is the default format, nothing's going to change an inch. Plus, the fact that they state that XML is now supported for document types, doesn't mean that it will even write documents to XML. Either way, I think nothing changes.

    Not to be a zealot, but this is Microsoft we're talking about. If this was going to move they office desktop monopoly even a fingernail back, do you'd think they would make this move anyway?

  98. New lawsuit? by drdanny_orig · · Score: 1
    Could this be grounds for another anti-trust suit against Microsoft?"
    I certainly hope so: nothing would make me happier :)
    --
    .nosig
  99. Um... by RomSteady · · Score: 2
    Let me see...Office 11 is in beta. Microsoft often makes tweaks of file formats and internal structures for their products up to the last minute. The beta is only in use at a limited number of sites, and is merely a tech beta, not even a feature beta. Documentation for products isn't even ready for final tech review until 16 weeks prior to the product being done. It's very possible that the documentation that they're asking for is either 1) not done, or 2) done, but in an internal spec document that is subject to change.

    I'd say wait and see what happens at release. Anything developed off of assumptions made based on the current state of the product will most likely be broken at release anyway. If it isn't released at ship time, then worry. Until then, it's kind of pointless to ask for the stuff.

    --
    RomSteady - I came, I saw, I tested. GamerTag: RomSteady / http://www.romsteady.net
  100. I can't believe it... by fudgefactor7 · · Score: 2

    From the snippet: "But there's a catch: It has yet to disclose the underlying XML dialect.'"

    Just because the XML dialect isn't readily available people are already assuming MS will not make it open? Got news for ya, Office 11 is still in beta, that means things may still change. And as you all know, MS publishes an absolute shitload once they set their mind to it.

    So, chill out a little, will ya? Wait for it, then bitch when it doesn't appear. It's almost like you guys are new at complaining, or something.

    1. Re:I can't believe it... by Anonymous Coward · · Score: 0

      No, they're WELL practiced at complaining. Complaining on /. is such a fine art that they start to do it months before a product even exists. And in the COMPLTE absence of facts! That takes great skill to overcome the very last rational brain cell so be this paranoid. Give them credit!

  101. In other news by quintessent · · Score: 2

    Bill gates paused in a grocery store line to let someone in front of him. We're not sure what he was up to, but we noticed he didn't let any other customers in front of him. This could be a deliberate attempt at gaining another monopoly in yet another critical area, and we're pretty sure it has to do with cash register printers and XML. Could this be the Achiles heel that brings down the giant in the courts? Citizens arm and unite!

  102. /. inserted the spaces but look at the rest! by burgburgburg · · Score: 2

    Just take a look at the rest of it, without the spaces. Or copy to your editor and remove them yourself. It's still ridiculous, atrocious and pathetic.

    1. Re:/. inserted the spaces but look at the rest! by HiThere · · Score: 2

      I assume you mean that slashdot inserted the spaces.

      (Yes, I agree that it's quite bad. But there's a significant difference between quite bad and impossibly bad.)

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
  103. Re:"Could this be grounds for another lawsuit?" WT by Danse · · Score: 2

    That's the problem. Microsoft is not obligated to release the info on their format. As a convicted monopolist, they should be. This is yet another example of just how poorly our judicial system handles this kind of case. By the time you can prove something and get through all the appeals, it doesn't fucking matter anymore!

    --
    It's not enough to bash in heads, you've got to bash in minds. - Captain Hammer
  104. Why should they have to open up anything? by Anonymous Coward · · Score: 0

    I'm not trolling at all, I'm asking a legit question. Why should Microsoft have to open up anything, file formats including? Since most people here are Capitalists and agree with most of Capitalism and "free markets", isn't it contradictary to Capitalism for MS to have to do this? Opening up anything will only help their competition. Yes you can then say if they're such a good and innovative company it won't hurt anything, but that is simply not true.
    I would like some proof of their "EVIL" business practices. MS got their monopoly because no one else really wanted the business. Did Sun, SGI, or DEC really produce any consumer oriented applications and computer systems? Their was DR-DOS and Wordperfect, but where is the proof that MS forced them out of buisness through "EVIL" deeds and not because people liked MS's applications better?

    If I was in the business of making some application and everyone bought it, why the hell would I want to share any of my application with competitiors? This is capitalism and has anyone here really thought of taking this "openness" to other worlds of business. How about automotive manufactures start opening up their new car designs.

    While I'm all for opening up everything and sharing of information, it really goes against Capitalism. You can't open up eveything and have business. Fuck selling "support" of your product. Your product is probably very shitty if you can survive off of support. I'm ignoring "consulting".

  105. Re: That's still to be seen by Anonymous Coward · · Score: 0

    Are we talking about true standard XML is Microsoft going to "embrace and extend" it?

    Your forgot the "subvert and control".

    Its "embrace, extend, subvert and control."

  106. has the RTF spec been kept up to date? by jlusk4 · · Score: 2

    Has the rtf spec been kept up to date as Word doc formats have changed?

    I had the feeling the existing spec was old and outdated.

    1. Re:has the RTF spec been kept up to date? by MadFarmAnimalz · · Score: 2

      I don't know if the spec has been kept up to date or any such thing, but it does occur to me that it is not a real alternative to begin with; you lose formatting features, most notably to me is the lack of footnotes and end notes, along with a million other things.

      Yes, these limitations could be overcome by changing the way you structure your document, but it is just easier to go back to saving in .doc.

      This is what I personally expect will sink MS Office's XML feature: the loss of formatting features.

      --
      Blearf. Blearf, I say.
  107. Why Not Wait Till Word 11 Ships? by Flamesplash · · Score: 2

    As far as I can tell Word 11 hasn't shipped, so why is it so bad that they haven't given info about an aspect of a currently unavailable product? It's like worrying a date will dump you and then yelling at them with out actually knowing.

    "How dare you dump me"
    "huh? what are you talking about."

    Paitence Is.

    The software maker says it plans to disclose additional information on Office 11's XML schemas, possibly when the update ships next spring.

    Sounds to me like they plan on telling people when the functionality is actually usable. While it may not be the "ideal" timeline for some I see know problem with it. You get the functionality you get the outline of the XML.

    Maybe I missed something in the article, maybe Word 11 has been out for a while already, if I have I apologize.

    --
    "Not knowing when the dawn will come, I open every door." - Emily Dickinson
  108. It will be easier by PineHall · · Score: 2

    If Microsoft keeps its schemas proprietary, looking at the XML code will make it easier to figure out one's own schema than the way it is now figuring out the binary Word format. But people likely will still want to save in the default Word format, instead of XML. Hmm, using the XML output may make it easier to decode the Word format. That would be nice. Microsoft is not going to give away its advantage, but I think they are confident of their market share to let things become just a little easier for the other word processors.

  109. The XML buzzword by zzyrc · · Score: 1

    Now one thing that wonders me again and again is that people really think XML will make all data exchangeable. Face it, XML is nothing more then a container, as is .AVI or even tar archives.

    The interesting question is, what is inside the container? Well documented, or even standardized data or just one huge CDATA chunk?

    XML can be useful if you want to create software that needs to exchange data with someone else because you don't have much discussion about encapsulation, support under most operating systems and there is a big chance that even the data transfer just uses port 80. Beyond that XML is just a buzzword - but I must admit, a long living one.

  110. Answer by Anonymous Coward · · Score: 0

    Yes

  111. maybe they havent released the specification..... by InnovATIONS · · Score: 2
    Because it isn't FINISHED? After all we are speaking about Office 11, a product that is not itself released either. I am at least willing to reserve judgement until I see what the thing really is. I don't expect to be really surprizedl but I could be wrong.

    And bear in mind that XMLDocs are not likely to be simple because word processing documents are not simple. People grouse about Word HTML docs but most of that complexity was necessary to create a HTML document that accually looked like the original word document. XML docs are unlikey to be all that concise because users are going to be unwilling to sacrifice layout and formatting features just in order to have the resulting document be pretty looking XML

    You could create a word processor that simplified and structured its features toward creating nicely structured HTML but then it would be FrontPage and not Word.

  112. It would be the first time they documented anythin by RockyJSquirel · · Score: 2

    Not only was RTF never fully documented, but different versions of Word had incompatable RTF readers.

    If you examine an RTF file you'll notice all kinds of redundant codes that are put in to cope with incompatable MSWord versions.

    Fully design, fully document a protocol, Microsoft?
    I just spit out my drink.

    Rocky J. Squirrel

  113. Sorry by twistedemotions · · Score: 1

    It was in nice pretty format... but damn slashdot kept throwing up:

    Your comment violated the "postercomment" compression filter. Try less whitespace and/or less repetition.

    So I had to put up the abomination that you see. Yes... Slashdot added spaces as it usually does, I'm not sure how to prevent it from doing that.

    You have no e-mail address so I can't send it to you, and I'd rather avoid posting a public link and slashdotting my friends servers. So you'll have to take my word for it.

    All of the other "example" postings are trolls. This is legit. Well for MS Word anyways. I don't know for the other office components (Powerpoint, Excel, etc).

    I'm aware that it will ugly as heck... which is fine since it will be parsed by the machine.

    1. Re:Sorry by gazbo · · Score: 1
      happy_sacks at hotmail dot com

      Go on, send us a link and if it's nice and simple I'll stick up a mirror that can be posted. And yes, it did have an authentic feel about it, but then again a lot of people have too much time on their hands, so I was being cautious.

  114. Re:Could new .XML doc format be LESS open than .DO by AnyoneEB · · Score: 2, Informative

    Someone will end up with a leaked alpha or beta copy of Office 11 and start working on the file format. If they will be able to figure it out fast enough is the question. It's possible, but if it's not done completely enough by Office 11's release what you describe will happen. Someone else said that Microsoft won't change .doc anymore partially because Google supports returning .doc's in search results... of course that just requires stripping all formating, which would probably be pretty easy.

    --
    Centralization breaks the internet.
  115. Dreamweaver has "Word HTML Cleanup" feature by gregger · · Score: 1

    Works pretty well too. I guess it's bad enough that they had to make a special menu item for it.

    This might explain why Microsoft is going to buy Macromedia.

    TTFN

  116. What I Expect by ChristopherLord · · Score: 2, Insightful

    What I am hoping/expecting for in this new format is something like XSL:FO plus binary sections for ActiveX controls, etc.

    For the 5 or so posters saying this will be something like:

    <data>
    ASdfksjdfFjfjAAASADFfddfds==
    </data >
    I highly doubt it. They are on record in several places as saying they want these new files to be indexable and parsable with standard tools, and base64 encoded blocks I am sorry to say, are not indexable. But of course Embedable objects will probably be forced to manifest this way.

    Regarding the claims that this will be like their horrid HTML implementation, I think it is clear you've not done much work with XML. Either a document is valid or it is not. If its not valid, most parsers will simply reject the file (unlike HTML, which just deals with the problems). If a document is valid, there should be no tool that doesn't properly load and parse it into the DOM, unless it is somehow broken!

    The question for me is how well they implement content-presentation seperation. Will there be a 'Word 11 XSL file' with the actual content of the file seperated nicely into tags like

    <SectionHeader>Resume</SectionHeader>
    or will the style and content be mashed together like so:
    <font size="50pt">Resume</font>
    This is the question I want answered more than anything, and I can't wait to see which way they go with it. If everything is seperated nicely, we may just have an excellent source for user-produced well-formed xml documents which can be integrated into XML-based content management systems with PDF-based presentation and HTML previews, etc.
  117. Open mouth by Anonymous Coward · · Score: 0

    Insert foot

  118. Re:yeah by Anonymous Coward · · Score: 0

    #1 isn't illegal unless is in need of an emergency #2 and 3 are relative to competition. But I never heard of price dumping, since thats how you compete. Product dumping would be charging way too little, like giving out the xbox for free. which i think they are in violation of.

  119. speculation by wwwgregcom · · Score: 1

    Any reply to this will just be speculation, simply because we don't know. We don't know what Microsoft is going to do. They may make it completly open and standard or they might not, but as of now, we don't know. Why is slashdot encouraging this kind of worthless speculation?

    --
    What signature defines me as a person?
    1. Re:speculation by Anonymous Coward · · Score: 0

      Well, I'm just going to mediate here. By most people's standards on here, theres alot of rhetoric flames being sent back and forth, usually cause they don't know what they are talking about, or they just want to make spread their beliefs like the christian church. Everyone thinks they are right, and it will always be that way. Because linux fans want people to use OSS and contribute, they attack MS supporters. MS supporters attack OSS for lack of support. It's a fight that can't be won, which is why its like asking someone "What's the best religion for me?" "God, Which religion gets into heaven?"

    2. Re:speculation by Anonymous Coward · · Score: 0

      personally, the reason why I prefer windows over oss is because its easier to use, takes less time to get things done. its not that i can't use linux, cause i used to be up into it. but now im basically living off MS beta software, since the free aspect was something that spoiled me. the culture is basically the same, in a way that I contribute ideas but I really dislike unix coding, though I am rather interesting in C#.net. it's all preference. so fuck you if you want to dictate what I like.

  120. Power vs. Power by pyrrho · · Score: 1

    I think it's a more practical question. The principle of what you say is clear, and I agree. But this isn't about principles. This is about playing fair, which involved the known rules, published as the law of the land. If an industry can't work within itself to achieve a minimum amount of interoperation -- allowing the government and other flexibility in the tools they use and total ownership of thier own data -- then the government will come in an impose defaults and a lot more. If they don't have a principle handy, they will invent one. Simple matter of pragmatics and power.

    Interoperation is good for the consumer, one of which is the government which should not risk losing control of the data it encapsulates with software tools. The Government knows this. The companies know this. Ignore at own risk.

    Do I support pragmatism over principles? Not really, but I don't see how the world could actually function any other way, principles are too brittle, unprovable, and in the end, often apparently arbitrary.

    --

    -pyrrho

  121. XML Horror Stories by Anonymous Coward · · Score: 0

    I'm sitting at my programming job now (reading Slashdot, I'm a slacker). XML was added last minute to our 1.0 release and some classes were serialized to XML by serializing MFC objects into a CArchive, then ascii-hex encoding the resulting binary into a string and dumping it into an XML attribute. How's that for human readability? I wouldn't be surprised if MS does the same.

  122. Re:They Have Too by moderators_are_w*nke · · Score: 1

    Of course they're gonna put object IDs in there. How else is OLE going to work? One of the strengths of Windows is you can embed virtually any application in any other using OLE. Word has been doing this since the Windows 3.1 days, so they're not going to stop doing it now. And they're not gonna break backwards compatibility just so their XML looks nice.

    --
    "XML is like violence. If it doesn't solve your problem, use more." - Anonymous Coward
  123. XML is as XML does by roffe · · Score: 2

    I think I'll just point to something I wrote a long time ago, at the time Microsoft first announced XML support but before the US Courts gave Microsoft unlimited license to do as they damn well please.

    --
    -- Rolf Lindgren, cand.psychol
  124. it has to be the default by g4dget · · Score: 2
    You are (presumably) not a convicted monopolist, so you can do whatever you want to when it comes to file formats. But Microsoft is a convicted monopolist, and it is proper for the government to tell them what format to save their files in, in particular when their choice of format is one of the principal ways by which they are able to maintain their monopoly.

    As for making it the default, if it isn't the default, it won't work. Not only do most users not understand how to save in other file formats, if it isn't the default, it probably will be too buggy to be used. None of the non-Microsoft formats in Word, PowerPoint, or Excel are really usable for day-to-day use because they lose formatting or worse.

  125. MIRROR: Original XML by gazbo · · Score: 2, Informative
    I've mirrored the actual xml file that has not been mutilated by slashcode policies.

    Look here using a browser that will display the raw xml nicely formatted - IE works fine, supposedly Mozilla does too but I can't seem to get it to work; it parses the file and just displays the text.

    Shame this is all so hidden away in the story.

    1. Re:MIRROR: Original XML by Reziac · · Score: 2

      Well, that's weird... NS3 thinks XML saved from OfficeXP locally is fine and displays it correctly, but thought this sample page was plaintext. Must be something different in the tags somewhere.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
  126. xsl by bigbadbuccidaddy · · Score: 1

    They are claiming the documents will be viewable in any browser -- which leads me to believe they will be translating them via xsl into something more viewable than the xml they're in. It seems to me that even if they never disclose the xml schemas, that this hypothetical xsl file would be very instrumental in reverse engineering them.

  127. Re:Could new .XML doc format be LESS open than .DO by Anonymous Coward · · Score: 0

    It's this kind of rampant negative paranoia that keeps Linux and Open source from succeeding. You yourself could have written an office program using the same amount of energy you used writing the completely uninformative and unjustified rant.

  128. Linux? by The+Pi-Guy · · Score: 3, Funny

    Dr. Love thinks that a relationship between microsoft and linux has a reasonable chance of working out, but on the other hand, it might not. Your relationship may suffer good and bad times. If things might not be working out as you would like them to, do not hesitate to talk about it with the person involved. Spend time together, talk with each other.

    D'oh!

    1. Re:Linux? by Economist · · Score: 1
      talk with each other.


      That's why it isn't working, Windows doesn't talk back...
  129. Yes, we should. by g4dget · · Score: 2

    The rules are indeed different for convicted monopolists, or even companies that dominate a market. It's OK for you to release a broken version of XML for your office suite, it's not necessarily OK for Microsoft.

  130. EXCEL SAMPLE by gazbo · · Score: 1
    Sample from Excel (Largish file ~300K)

    This is especially easy to understand, presumably as it doesn't have to worry about the average user wanting text to look exactly as it did on his screen.

    Same as before, some browsers should format the xml source nicely. If not, this source is neatly indented so 'View Source' and you'll still get it pretty-printed.

    Oh, thanks to twistedemotions for sending me this stuff.

    1. Re:EXCEL SAMPLE by zeugma-amp · · Score: 3, Interesting

      This displays really well as source in Phoenix .5. There is a blurb at the top that says "This XML file does not appear to have any style information associated with it. The document tree is shown below." ... Then it displays it as prettily formatted (though fairly useless) code.

      I'd like to see a clean HTML version of the same. It might make it somewhat easier to understand more or less what it is doing

      --
      This is an ex-parrot!
  131. Re:They Have Too by Malcontent · · Score: 2

    Then why even bother with XML? Why pretend that it's some kind of an open format? Why not just stick with the proprietary format they have now?

    --

    War is necrophilia.

  132. Office-xml example... by Anonymous Coward · · Score: 0



    Totally flexible & open, supports just about any format...

  133. It's XML, get over it. by Ankh · · Score: 5, Informative

    Wow, what a lot of false information. Maybe this will help a little. Disclaimer: I am XML Activity Lead at W3C, so I have a bias.

    The new Visio is using SVG.

    The new Word lets you use any XML vocabulary you like. How obfuscated it is is *entirely* up to you.

    It's not using base64 to put binary propietary data into XML documents. It's using plain XML.

    It's well-formed, and Word appears not to make up thousands of elements. The person in charge of this project is actually clueful, and was in the W3C XML Working Group (1996-1998 by the way).

    The tools all use XSLT extensively.

    It wouldn't surprise me if you could get Word to read and write the OpenOffice format just fine. There's a restriction that you can't re-order content in Word right now, I think.

    People claiming to have "insider info" and then posting blatant falsehoosd, or claiming you can put binary data directly in XML, aren't helping here. Even if you get high from hating Microsoft, the open source community and Free software world need to understand that the goalposts have moved a little.

    The extent of corporate assets tied up in memos, reportsand other documents is very large, massively higher than the collective value of relational databases.

    Yes, it looks as if Microsoft has suddenly discovered XML just as they suddenly discovered the Web. In fact, they were involved heavily in XML from the start, were among the first to ship commercial support for XML, and have been working on XML in Office 11 for a long time.

    --
    Liam Quin

    --
    Live barefoot!
    free engravings/woodcuts
    1. Re:It's XML, get over it. by mkweise · · Score: 2, Insightful

      If the were going to use XML as the native document format, I'd be impressed. But adding it as an export format that most users probably won't even notice unless they actively look for it? That's not exactly what I call embracing the standard.

      --
      Gentlemen! You can't fight in here, this is the War Room!
    2. Re:It's XML, get over it. by Sesticulus · · Score: 1

      I can certainly understand them leaving the default the old DOC format. I work in a company with close to a half million employees. Deployment of new software is never quick. You might have a mixed batch of Word users for years. If all the new installs produced documents the old installs couldn't read, they simply would never upgrade.

    3. Re:It's XML, get over it. by mkweise · · Score: 1

      If all the new installs produced documents the old installs couldn't read, they simply would never upgrade.

      You mean in the same way people never upgraded to Word for Windows because Word for DOS can't read the new file format?

      --
      Gentlemen! You can't fight in here, this is the War Room!
    4. Re:It's XML, get over it. by Sesticulus · · Score: 1

      Okay, maybe never is strong word, but the Word for DOS, Word for Windows comparison is not really the same thing. In those days all we sold were mainframe products, not like today where everyone has a PC or two and all the documentation is generated and reviewed using word. The Word 2.0 to Word 95 is probably a better comparison. At the time I worked for company with only about 1100 people, it took about 6 months for that to get deployed and it was nightmare. Nobody with 95 could remember to save in the old format, you were always having to ask folks to resave and send documents. That is why I think keeping the DOC format as the default is perfectly reasonable.

    5. Re:It's XML, get over it. by mkweise · · Score: 1

      It's easy to see how this inconvenienced your company as well as many other paying customers...but how was it not in Microsoft's interest?

      I've always assumed that they intentionally changed the file format with every release specifically for the purpose of preventing previous versions from reading it. They *want* previous versions of their products to cease being useful once they have a new version to sell.

      --
      Gentlemen! You can't fight in here, this is the War Room!
    6. Re:It's XML, get over it. by Sesticulus · · Score: 1

      Maybe that was the original strategy, but they haven't changed the format since Word 6, and that was in their best interest. They got so much flak from that 2.0 to 6.0 format change that they haven't changed it since then. I can still install word 6 and open an Word XP document. MS can't win in this one, somebody is going to unhappy. If they make the default format DOC, you have people like the original poster complaining that they are only paying lip service to the XML format. If they make the XML the default, they irritate the big corporate customers who don't want to replace every copy of Word at the same time. Since the big corporate customers probably spend a lot more money than the original poster, I expect DOC will stay the default and the XML format will be popular in the web crowd, but heh, I could wrong.

    7. Re:It's XML, get over it. by mkweise · · Score: 1

      I remember at one point, they did release a patch for the old version of Word to allow opening the new document format. I don't remember the version designations involved, although I do remember that some valid new-format documents would cause it to crash. As stated earlier, where interoperability is concerned Microsoft is not in the habit of giving one inch more ground than demanded by their large customers.

      expect DOC will stay the default and the XML format will be popular in the web crowd, but heh, I could wrong

      Since the web is just a fad that will pass as quickly as Al Gore's popularity, I guess it doesn't really matter. (Makes you wonder what Al Gore will invent next, huh?) But as long as XML is a popular buzzword, why not hijack the acronym to stand for eXcellent Microsoft Language?

      --
      Gentlemen! You can't fight in here, this is the War Room!
  134. Humpty Dumpty by jefu · · Score: 2

    A short quote:

    'When I use a word,' Humpty Dumpty said, in a rather scornful tone,' it means just what I choose it to mean, neither more nor less.'

    'The question is,' said Alice, 'whether you can make words mean so many different things.'

    'The question is,' said Humpty Dumpty, 'which is to be master - that's all.'

  135. I'm so fucking tired of this FUD by NineNine · · Score: 2

    All I've been reading about on Slashdot is that "the *only* reason that our company is still using Windows is because Office file formats are proprietary. We're tied to Office and Windows." Now, at least at this stage, this is the BEST possible fucking news, and everybody is still bitching. Nothing is more open than XML. That's all we know right now. Office data may be in completely open, standard XML. There's no telling what it'll look like, but there's no possible better news to hear than the Office formats may be wide open.

    Yet, everybody's still bitching. I have a feeling that what it is is that all you l33t *nix gurus are finally gonna have to put your money where your big fucking mouths are when the format is open, and you're gonna have to actually move to OSS/StarOffice, etc., and you're still looking for reasons not to.

  136. Re:They Have Too by moderators_are_w*nke · · Score: 1, Interesting

    You'll have to ask Microsoft why they hav a suddern desire to switch everything to XML, I have no idea. The current proprietry format is actually pretty clever, it has to be when you consider what can go in there (just about anything).

    I'm guessing that Microsoft are viewing this as start of a transition to XML. As I understand the current format, if you copy and paste a CorelDraw drawing into Word at the moment, its Corel, not Microsoft who decide what goes into that section of the Word document. In this case, there is no way for Microsoft to XMLise this part of the document. I've no idea how they're gonna do it unless the dump binary (Base64 encoded?) into the XML. They also need to store the object ID so the Word knows which legacy app this chunk of binary came from.

    --
    "XML is like violence. If it doesn't solve your problem, use more." - Anonymous Coward
  137. Don't get confused. by twitter · · Score: 4, Interesting
    You are goddamned fucking lucky that the government tells you what the default values for things should be. That's what the government is there for, mostly; to tell you that the default value for a building is to have a fire exit and that it may not be locked.

    Most rational specifications are for performance. The method should not matter as much as the end result. Fire codes are an extreem example, but even there the specification is flexible. The local government does not tell people how to build buildings, only that there needs to be so many exits per so many people and floor space. They don't nail you down to real specifics. Most rational specs are such as mil-specs for acryilic - it must be able to sit in the South Florida sun for one year without delaminating. How you make the thing does not matter, so long as it does what it should.

    By these rational and objective standards M$ junk generally fails. If you say that a Word doc should be legible and keep it's formatting for a number of years, Word fails. The same thing can be said of all other M$ junk - it's designed to break and therfore government should reject it's use anywhere records are kept. That's all public work. That's hardly engineering the document, it's simply stating the thing should work as advertised.

    All normal standards, from ASCII to WWWC are formed by professional agreement. Governments intervention is not needed. Disruptive vendors are generally seen through.

    --

    Friends don't help friends install M$ junk.

    1. Re:Don't get confused. by Anonymous Coward · · Score: 0

      All normal standards, from ASCII to WWWC are formed by professional agreement. Governments intervention is not needed.

      Yeah, maybe that's why Unix (POSIX) failed as a standard.

  138. Re: XML is like HTML think ActiveX by Anonymous Coward · · Score: 0

    It would be quite easy to make the M$ document xml format propriatry. Make all default generated documents have linked in components like some ActiveX HTML pages. You might be able to read the base parts of the document but that won't make it very userful without M$.

  139. No DocBook support by Anonymous Coward · · Score: 0

    its not too far fetched that Microsoft would make their own DTD
    Nope. They already said they aren't supporting DTDs, only schema. That means you won't be using Word to author DocBook documents anytime soon. The DocBook toolchains are the standard for most open-source publishing, and so far there is no DocBook schema, only DTD.

  140. System-wide scripting support by yerricde · · Score: 1

    Unfortunatly with everything in a proprietary format you then end up having to build scripting languages into everything making all of your data files potential entry points for malicious code.

    Either that, or build features into applications that expose parsers to the existing scripting languages on the machine. On Windows, this would usually be Jscript, VBScript, or any other WSH language interacting with COM objects; on Mac OS 7-9, AppleScript and Frontier can talk to any Mac program supporting high-level events.

    --
    Will I retire or break 10K?
  141. MS is evil monopolistic empire by Anonymous Coward · · Score: 0

    MS is pure monopolistics evil empire.

    MS uses all kind of cloak and skins to diguise and fool the vendors for the pass 15 years.
    MS only goal is to run the whole information age.

  142. <word>TVqQ93JSF0ds92jJs</word> by yerricde · · Score: 2

    you cant for instance have binary-encrypted elements

    Oh yes you can: just put a doctype, then <word xmlns="http://xmlns.microsoft.net/office/11/word"> , then a block of MIME encoded data, then </word>. If not, what in the XML specification prohibits this?

    --
    Will I retire or break 10K?
  143. XML Spy by metoc · · Score: 1

    The nice thing about XML is that it is fairly easy to derive the DTD or XSD from samples of the XML. XMLSpy will do it, and even MS Visual Studio will generate an XSD from a sample XML file.

    Granted it doesn't help with embedded binary data 0x567f or tell you what GARF=5 means. But there is nothing to stop you from saving the text of your choice (with the attributes of your choice), and see what comes out. I believe in crypto-analysis this is called a plain text attack?

  144. 1337ness by 1g$man · · Score: 3, Funny

    I guess the cool thing now is to put the tagline "Could this be grounds for another anti-trust suit against Microsoft?" on every Microsoft story, even when the context has absolutely nothing to do with anti-trust.

    Huh.

  145. Re:They Have Too by Anonymous Coward · · Score: 0
    Then why even bother with XML? Why pretend that it's some kind of an open format? Why not just stick with the proprietary format they have now?

    INFIDEL! XML is newer, faster, smarter - XML is better.

    Allow me to introduce you to my supervisor.

  146. Now what we gonna do? by bubbha · · Score: 1

    Pr0n-boy is on to us. Nothing left to do but zip-up and go find girlfriends.

    --
    I want to be alone with the sandwich
  147. Is this article a surpise to anyone though by nucrash · · Score: 1

    I read through the article yesterday. Damn that sucks, MS has set another standard that shouldn't exist in the first place. What the hell is new. The only difference between this and Java is that someone owned Java. Now Microsoft gets a chance to butcher without remorse or circumstance. The only thing that seems to piss me off is that all of us geeks are too busy bitching over then standards of how MS can do it, rather than concentrating on how to bring the software titan down. If everyone here seems to know so much about programming and XML, join up with OpenOffice.org and get starting on making the superior office product. Meanwhile, low-life pirates like me should do our best to spread the word of open software and distribute the goods to the public. Now quit your bitching and let's show how much the world doesn't need to rely on Bill G and the bald jumping jackass Steve-O B.

    --
    Place something witty here
  148. Hold the phone here... by bubbha · · Score: 1

    Simply because the Anti-Trust trial focused on the OS rather than Office software, does not mean that the government has no reason to impose restrictions to keep MS from shifting their monopoly power. ...

    I thought Microsoft said the Office WAS part of the operating system...

    --
    I want to be alone with the sandwich
  149. Ahem.... by bubbha · · Score: 1

    * Not change them with every update.

    ...don't you mean improve them with every update?

    --
    I want to be alone with the sandwich
  150. Re:You don't know what you are talking about by Anonymous Coward · · Score: 0

    am sorry but you clearly don't understand the technologies that you are talking. XML is not a format or a tag set or anything analogous to HTML.

    >Since formatting is the whole point of XML
    XML is a method for marking up data. XHTML is a display tag set for XML markup. XML has nothing intrinsically to do with formatting.

    >>or it just ain't xml at all.
    >While technically correct, the point is sadly irrelevant. As long as MS is effectively a
    >monopoly XML will be whatever they say it is, for the majority of people.
    MS might be a monopoly in the desktop area but that is mostly irrelevant, the major advantage of XML usage is in business data management. This is an area Microsoft wants to move into so they will have to comply with the standards or there data won't work with existing systems.

    >>Also you aren't allowed to put binary data in an xml document
    >Not true. It's recomended that you don't put binary in an XML document, but nothing
    >prevents you from doing so. This is exactly what will give MS the ability to hijack the
    >standard.
    An XML document must be completely composed of character data. If it contains anything else it is not an XML file. While it is possible to encapsulate character encoded binary data in an XML file Microsoft would not do this because it is bulky, and not a very good way to make there documents proprietary.

    >>In conclusion they would have to break XML pretty hard-core in order to make their
    >>doc types proprietary.
    >Only in spirit, I'm afraid, but that will likely be enough.
    You are both wrong. ActiveX style linking to components would allow them to make the format proprietary very easily. If you combine this with dynamically loaded components using .NET the possibilities are limitless.

    >>Besides, then what would be the point of going XML in the first place?
    >To make documents searchable. This is an ability which is extremely valuable to
    >anyone who has a large amount of information they need to access. The upshot is that
    >the actual content will likely be plain text, though important markups may not be.
    >Sadly, format is more important than content for a lot of people.
    If MS uses the above-mentioned approach they can also use the format to leverage their server technology. Only MS servers will have access to the components that are needed to understand the content of the documents.

  151. Re:They Have Too by Sivar · · Score: 2

    Microsoft is switching to XML because it will become the standard data exchange format of all things .NET (other than source code, obviously), and because it is faster and simpler to parse.

    After the format wars between Office and WordPerfect--the wars to make each incompatible with the other, I have heard the Office format described as:
    "...is not just a data format. It is an entire world philosophy in and of itself. It is more complex than a space shuttle, more confusing than trying to complete the Fourier analytic proof of quadratic reciprocity."
    I've seen Office 2000 corrupt two of its own documents twice in the last two months. This may be why.

    --
    Computer Science is no more about computers than astronomy is about telescopes. --E. W. Dijkstra
  152. And this is good why? by thogard · · Score: 1

    XML is just a more verbose version of what is already in the RTF. Its got the same parsing problems and XML just makes bigger files. If you want to see why XML is bad, research Knuth's comments about parsing in TeX. A complete parsing solution will diverges in two directions, one require an infinite amount of memory, the other an infinite amount of time but I don't expect modern computer "experts" to even know who Knuth even is.

  153. Word dumps RAM by crovira · · Score: 2

    Word files are RAM dumps. The memory is allocated, uh, oddly and chunks are scattered all over and over and over (because parts have been re-indexed but not yet over-written or garbage collected.)

    If you don't know the scheme, you haven't got much of chance of re constituting the document. Even if you DO know the scheme, it still bites. In fact that's why versions of Word files are incompatible. Not even M$ can do that properly. (Actually its because they'd need to have redundant implementations of code to perform the same functions from the different versions. Its easier to turn that incompatibility into a marketing lever.)

    The streaming I/O performance is actually quite poor compared to that of WordPerfect. And they lock up the files so you have to use DDE or OLE to get at the actual text stream.

    --
    MSBPodcast.com The opinions expressed here are my own. If you don't like 'em... Think up your own stuff.
    1. Re:Word dumps RAM by Planesdragon · · Score: 2

      Word files are RAM dumps. The memory is allocated, uh, oddly and chunks are scattered all over and over and over (because parts have been re-indexed but not yet over-written or garbage collected.)

      Can you provide a source? Not that I doubt you, I just want to see the original for myself so I'm absolutely sure that you're correct.

      It seems to me that a RAM dump would be faster to load and save than a text stream--and easier to implement version changes in, to boot. So I don't think that its something they did to be malicious...

  154. Open as in chest wound... by mkweise · · Score: 3, Funny

    ...not as in can of worms.

    In other words they're involuntarily providing the bare minimum of interoperability that the marketplace demands. News for nerds to yawn at.

    --
    Gentlemen! You can't fight in here, this is the War Room!
  155. Re:"Could this be grounds for another lawsuit?" WT by Anonymous Coward · · Score: 0

    Microsoft was convicted of monopolizing the PC operating system market.

    MS Office is not an Operating System (no more than emacs is, anyway)

    You might as well say that, because MS is a convicted monopolist, they should buy everyone ice cream.

  156. NotNeccesirily (XML != Open) by AShocka · · Score: 1

    I'm not an XML expert, but my understanding is that MS or anyone can write proprietary code into the CDATA section of any XML document and therefore only their tools can accurately parse the document.

    Of the XML code I have seen generated by MS applications, it's a mess, and lacks any adherence to well structured content, it's spaghetti xml. Same with the style sheets generated and the associated classes.

    If this was put in the hands of the programmers who gave us .doc to .htm or those responsible for the code generation in FrontPage, what would you expect the results to be like?

    There is also the huge debate (if my memory serves me right) that happened regarding the first W3C XSL recommendation where MS fought for a less strict implementation (because they saw their documents could not comply with a strict implementation).

    Also, didn't Microsoft Corporation Selects SoftQuad XMetaL To Create XML Content ?

  157. HTML Tidy by Pseudonymus+Bosch · · Score: 2

    You don't seem to know HTML Tidy, one of its capabilities is cleaning Word's pseudoHTML.

    --
    __
    Men with no respect for life must never be allowed to control the ultimate instruments of death.
    GW Bu
  158. billg's dream by bubbha · · Score: 1

    8) Release new OS with filesystem that looks like a database.

    So by 2010 we will have 30 million VB "developers" who do not know what a file is.

    --
    I want to be alone with the sandwich
    1. Re:billg's dream by ThePeeWeeMan · · Score: 1

      Oh, just fyi, but there *are* VB developers who know what files are. =P

  159. Re:Could new .XML doc format be LESS open than .DO by IntlHarvester · · Score: 1

    It most likely will not be less "open" than DOC.

    However, as Office becomes more and more of a client-server^W^Wweb-services product, the file itself will become less and less accessible.

    With a little setup magic, the usual shitty workflow of mailing each other DOC files could easily be be replaced by mailing each other URLs to a centralized content/licence mgmt server.

    --
    Business. Numbers. Money. People. Computer World.
  160. Re:Yeah by Anonymous Coward · · Score: 0

    Who are you, the hall monitor?

  161. Re:They Have Too by Malcontent · · Score: 2

    " Microsoft is switching to XML because it will become the standard data exchange format of all things .NET (other than source code, obviously), and because it is faster and simpler to parse."

    This makes no sense on two levels.

    One is that you are presuming that the .NET platform (or whatever the fuck it is today) is incapable of exchanging binary formats. In fact it is probably more efficient to send .doc files back and forth instead of streaming them to text and back.

    Two is that the MS-XML that office will be using will not be interchangable with any body elses parser. If you are going to embed binary data into the XML document then you are going to have the incompatible documents.

    --

    War is necrophilia.

  162. Re:They Have Too by Malcontent · · Score: 2

    "You'll have to ask Microsoft why they hav a suddern desire to switch everything to XML, I have no idea."

    Mmm very interesting. Either they are stupid or evil.

    Do you really think they can force everybody else to stream their version of XML into office files?

    --

    War is necrophilia.

  163. Re:"Could this be grounds for another lawsuit?" WT by Danse · · Score: 2

    But Your Honor, it makes no difference that I was convicted of raping that other woman! This is a completely different woman! I've never been convicted of raping this particular woman! Don't you see that you should give me the benefit of the doubt?

    In case anyone still doesn't understand, what I'm saying is that a company convicted of monopolizing one market should not simply be reprimanded for that one market while being allowed to monopolize another area. The judicial system is incredibly inadequate when it comes to dealing with problems like Microsoft. By the time anything gets done, it simply doesn't matter anymore and MS has found some other way to monopolize the market. Then the whole thing starts over again. We've been playing that stupid game with MS for nearly 10 years now. It's ridiculous that they continue to get away with it and have never gotten more than a slap on the wrist.

    --
    It's not enough to bash in heads, you've got to bash in minds. - Captain Hammer
  164. Re:"Could this be grounds for another lawsuit?" WT by Anonymous Coward · · Score: 0

    This is a completely different woman! I've never been convicted of raping this particular woman! Don't you see that you should give me the benefit of the doubt?

    I dearly hope you don't live in the US, because that's exactly how the legal system is supposed to work here.

  165. Soviet sad man is saying: by Anonymous Coward · · Score: 0

    i am sad that concise it most definitely will not be :*(.

  166. Forget new features. How about something stable? by vandan · · Score: 2

    We use Access 2002 as a front-end to our SQL Server / MySQL databases. Access 2002 is the most unstable product we have ever had from anyone, apart from maybe Windows 3.11. It regularly crashes and damages databases with dialog boxes saying "Microsoft appologises for the inconvenience. Would you like to send a bug report?". And once the mdb file gets more than about 10MB (forms and code - no data) things very really strange. Forms get corrupted and dropped. Saving changes to anything takes 5+ minutes, and often results in a crash. It really is a pile of shit. If only there were a reasonable open-source alternative that didn't require learning some obscure language like Object Pascal (for God's sake, what were they thinking).
    No upgrading for us anyway. We'll put up with this and save our money for faster machines.

  167. same as usual by jdkane · · Score: 2
    Could this be grounds for another anti-trust suit against Microsoft?"

    No. Because in XML you are allowed to define your own application of it. Hopefully I as a developer could also create my own XML application (cryptic or not) without getting in legal troubles. Otherwise I might as well start learning a trade if the computer world is really that much of a mess.

    The move could also hamper data exchange with competing desktop productivity software that recognizes XML, such as Corel's WordPerfect or Sun Microsystems' StarOffice, say analysts and competitors.

    Just because somebody else is first to the game doesn't mean the last guy has to follow. Microsoft has always created their own standard. They will do it again. That should be of no surprise to anybody. And MS Word won't change much as a result because it is currently proprietary and most likely will continue to be.

    However I can definitely see that if Microsoft uses common XML standards that are compatible with other office suites then the underdogs might get a chance. So should we blame Microsoft if they don't do this? Microsoft is not open source, they are about the money. They have no reason to support standards and compatibility if it will hurt their bottom line. On the other hand, they might shoot themselves in the foot with such a strategy because people may not like it. Of course history hasn't taught us this lesson even though we would like to see it learned from an open source standpoint.

  168. Re:XML is not Java by Anonymous Coward · · Score: 0

    XML is not a language, so it is not in the same position as Java. In actual fact Microsoft has made no effort to customize XML. You will find that Microsoft XML will be passed by any standard conformant XML parser. You are not really understanding what XML is. XML defines a way of representing data in a character serialized form, what most people think of as XML - the file - and a logical structure that can be accessed through an interface, think Document Object Model (DOM), or Simple API for XML (SAX). These things work well which is why Microsoft is using them. Microsoft has to work out what information they need to put represent, and make/use and existing document type to represent their files using XML. Currently there are a number of document types.

  169. Re:"Could this be grounds for another lawsuit?" WT by Anonymous Coward · · Score: 0

    I live in the US, and while you're supposed to be innocent until proven guilty, a person's past criminal record can be admitted to show that they are indeed capable of committing whatever crime they happen to be accused of, and that they have done it before. That alone is not going to get a conviction, but combined with good evidence could help to ensure a conviction, and probably an increased sentence as well. Microsoft is guilty of monopolization. We know they are quite capable and very willing to monopolize a market given any opportunity to do so. At the first sign that they may be doing so again, the DoJ should have them under a damn microscope.

  170. Re:"Could this be grounds for another lawsuit?" WT by commodoresloat · · Score: 2

    Also this is not about their past criminal record on unrelated crimes; they were abusing their monopoly power in both the OS market and the Office market at the same time, and their monopoly in one area aided and abetted their monopoly in the other. These are separate crimes only by legal fiction.

  171. Comment removed by account_deleted · · Score: 2

    Comment removed based on user account deletion

  172. Re:HTML Tidy and a slight rant by commodoresloat · · Score: 2
    I have used several tools like this including tidy. There's one called demoronizer, and there's one that used to be a bbedit plugin (I can't seem to find it now); these things all help but as someone else noted, Word's output of HTML is so screwy there are very few scripts that will really fix these things consistently. It's almost as if they went out of their way to confound all clean-up tools. How hard can it be to automatically output HTML that isn't a complete and utter nightmare? Isn't that one of the reasons we have standards in the first place? For some reason I keep thinking optimistically that MS will fix this in the next release of Word. It's like the OS X bug where dragging an icon to the left side of the screen and letting it go makes everything jump go to the right (which still happens in 10.2.3). It's stupid, annoying as hell, even inexcusable, and it's probably frightfully simple to fix, yet it's ignored in every update. Of course, the OS X bug is a minor annoyance compared to the absolute drain on productivity provided by the MS bug.

    (And before anyone tells me "if you don't like it don't use it" -- I don't use it. I mean, Word is great for writing academic papers and all (I don't know any other office-type product that works well for people who write with a lot of footnotes, and no I don't have time to learn LaTeX, as cool as I might think that would be) but I would never think of using word to output HTML. But the problem is if you are getting documents from other people who only use Word, no matter what.

    I tell you what, the killer app, at least for the average desktop user, would be a streamlined version of Word that only did what a word processor should do, and that automatically (and preferably seamlessly) sent other tasks to more well-designed applications for those purposes. I mean, I understand why relatively clueless people use Word for HTML, but why the hell do they try to use it for desktop publishing, for image manipulation, even for freakin' web browsing? The program shouldn't encourage such behavior by providing bad implementations of these tasks; instead it should send the task to a program that knows what it's doing.

    The craziest thing about this is that MS is in a unique position to deliver exactly such an app -- they have Word already recognized as the absolute standard, they have their own desktop publisher, image manipulation tools, web design tools, and web browser. If they were willing to let go of the bloatware and open up and standardize their formats, this project would be a no-brainer. Since they won't do it, somebody else should. Apple is too committed to Word to do this (I don't think AppleWorks is taken seriously by anyone, though I could be wrong), so there really is the possibility projects like openoffice or koffice being able to deliver something like this.

  173. Comment removed by account_deleted · · Score: 2

    Comment removed based on user account deletion

  174. Think DXF by Bernal+KC · · Score: 1
    Having an open, plain text format -- much less well written XML -- makes for a whole lot of openness even if it is not the default, most efficient, or widely used format. The DXF "open" format in AutoCAD is an example of what I'm talking about. It allowed end users a world of options for reusing, archiving, and exchanging data -- even though most data is stored in the "closed" DWG form. An open format is an easily opened door.

    Of course, this goes (or will go) well beyond DXF since it promises to integrate all manner of Office documents.

    The real challenge may be to convince authors to write structured documents. Think how few docs now use any sort of templates or employ style sheets.

    1. Re:Think DXF by leandrod · · Score: 2
      > Think how few docs now use any sort of templates or employ style sheets.

      Agreed about your point, but I want to point out a reason I consider to be partial explanation for the fact.

      Templates and stylesheets in MS Office are difficult to use, do not work at all for complex stuff, and break from one version to another.

      When I used MS Word for DOS and OS/2, from versions 3 to 5.5, we had stylesheets and templates as separate things. Templates were just documents set aside as documents. Stylesheets were separate files that contained only the style definitions and formatting. You could easily apply different stylesheets to any document, thus getting the desired output.

      When MS Word for Windows, in its version 6 if memory does not fail me, merged templates and stylesheets, chaos ensued. I could not convert my old documents properly. I failed to reproduce the efficiency of the old work flow. I had been educating fellow users on the benefits of structuring and separating formatting onto stylesheets much before I heard of LaTe or SGML, but now even myself could not make it work. Even when I could structure complex documents, they would break in other systems. Never again I could separate content and formatting, and apply different stylesheets to the same document.

      I have heard about Microsoft systems that they are a matter of luck. That some people (whom I never met) have bulletproof systems (I doubt) and some others have just bad luck. Even if it was true, which I doubt, it would still be a comment on the sad state of things that so much depends on sheer luck. As it is, the better explanation I find about these so different perceptions is that some people had knew Unix, DOS and mainframe systems (like I did), and so they find MS-W32 to be worthless; much more people have been reared on DOS and MS-W16, and so find MS-W32 to be the greatest thing on Earth.

      --
      Leandro Guimarães Faria Corcete DUTRA
      DA, DBA, SysAdmin, Data Modeller
      GNU Project, Debian GNU/Lin
  175. Not very open by Anonymous Coward · · Score: 0

    "But Office's XML support will allow larger companies to extract and use data from documents more efficiently, according to Microsoft."

    Larger companies with enough money to license their XML schema.

  176. Re:They Have Too by Malcontent · · Score: 2

    I seriously doubt it. There is just too much pressure from openoffice which is free and has a completely open XML file format. Sure some CIOs will stick with MS office but little by little that monoply will fade. It will start at the small business level because they can least afford office. As those businesses grow they will continue to use openoffice just out of momentum if nothing else. Also there will be tremendous amount of foreign countries which can not afford msoffice.

    Unfortumately for MS their twin monopolies are being threatened by free competitors which are pretty damned good. Given a choice between pretty-good-and-free and better-but-expensive most rational people will will choose the former.

    --

    War is necrophilia.

  177. XML != open. XML only makes *syntax* clear by divec · · Score: 3, Insightful

    Just because a file format is XML, it does not mean it's open. Even if it's "real" XML and not a wrapped binary dump (Vvjfio1@1/515...). All XML does for you is to make the *syntax* of the file format clear, not the underlying meaning. Analogously, in German, every noun begins with a capital letter, and root verb forms generally end with "-en"; this tells you a bit about the phrase "Mit grossem Bedauern haben wir vom Ableben Ihres Gatten erfahren", but it's certainly not enough to understand it.


    Even an XML schema is not enough - that just tells you which elements can appear where and what they can contain. That's like knowing that a normal German sentence has the main verb in the second position in the sentence. This still doesn't tell you the meaning of the above sentence, though you can see that "haben" is the verb and "Mit grossem Bedauern" is the first part of the sentence.


    For an XML language to be open, you need a full description of what each possible construct in that language means.

    --

    perl -e 'fork||print for split//,"hahahaha"'

  178. I think you mean... by Dog+and+Pony · · Score: 2

    "Open? Sure it's open! Just click here... and *poof* your document is open. What's that? You mean you want to open it with something other than M$ Office? Why on earth would you want to do that?"

  179. Typical slashdot ms-hating BS by Assaf+Lavie · · Score: 1

    MS is taking a small step in the right direction and all you have to say is "sue them!!!". Predictable and pathetic. Even if they don't (which is uncertain) disclose the so called "underlying XML dialect", XML is still much better and open then their old .doc (persistent storage) format, plus they would probably publish software packages for data manipulation. So it's not totally GPL, so what?! Grow up.

    1. Re:Typical slashdot ms-hating BS by Anonymous Coward · · Score: 0

      Hi! I'm "Assaf Lavie", and I'm militantly uninformed! I feel good saying stupid, stupid things on Slashdot - it makes me feel FUZZY!

      GIMMIE MY WINDOWS XP I WANT GAMES OMG OMG!

      Fucker.

  180. Matter Of Air Superiority by Puu · · Score: 2, Informative

    The testing is sickening. But it's us or them, really.

  181. Word document keeps old fragments by Puu · · Score: 1

    AFAIK, Word documents keep some old, already "over-written" stuff (for Undo functionality, maybe).

    I saw a story (in a dead tree mag, no link). A company sent an offer to several companies, using just one Word document as a template to finish up slightly different deals to the companies. Then a geek at one of those companies unearthed all the versions from the offer they received. Somewhat embarrassing to the sender.

    I don't know whether the story is true or myth; or whether this is "possible" or "routine" out there ;-)

  182. Re:TVqQ93JSF0ds92jJs by JebusIsLord · · Score: 2

    No no no, thats not my point. I agree with you there. My point is the ELEMENT still has to contain binary tags and attributes, but the data INSIDE the element can be binary. This might sound like a silly/pointless thing to say, but the fact is, an xml file containing nothing but base-64 binary data is STILL parsable by anyone's text viewer, just the DATA isn't (yes i know that is the important part). This is at least a little easier to read than a pure binary file, because the binary blocks have to have some ascii metadata attributed to them.

    --
    Jeremy
  183. Re:TVqQ93JSF0ds92jJs by Anonymous Coward · · Score: 0

    nice to see your backtracking when you got completely fucked over.

    flushed where it belongs

  184. Re:HTML Tidy and a slight rant by jbert · · Score: 2

    mean, I understand why relatively clueless people use Word for HTML, but why the hell do they try to use it for desktop publishing, for image manipulation, even for freakin' web browsing?

    Not just MS users...
    *cough* emacs *cough*

    OK. (So this *could* be taken as a troll, but hey - try and see if in a jovial, festive, spirit :-)

  185. !open by more+fool+you · · Score: 1

    i don't see where ms profit from closing the format here. obviously it would benefit them as well as everyone to open the "standard". i can't see how integrating excel+word etc into every-day web services produced by anyone with a .NET || J2EE || whatever server would hurt. there's plenty of stuff around now that provides a hint of what it could be like.

  186. Old News by tacocat · · Score: 1

    Years ago there were legal rumblings about the copyright/patent protection of XML formats as it pertained to other areas of the internet.

    Microsoft was in the middle of this fight and if you were paying any attention then you would have realized that just because someone goes XML means nothing with respect to Open Source. Especially when you consider DMCA and the USPTO.

    Any company can create any XML format and go two-faced on the deal. Yes we are Open Source. If you attempt to understand my XML format it's a violation of the DMCA and my IP rights and you will go to jail!

    I am not the least surprised by this. XML has distinct advantages that many people would voluntarily choose to use. Including MicroSoft. However, XML can still be "protected".

  187. Hello World by sharkey · · Score: 2


    xmlns:w="urn:schemas-microsoft-com:office:word"
    xmlns="http://www.w3.org/TR/REC-html40">

    <head>
    <meta http-equiv=Content-Type content="text/html; charset=windows-1252">
    <meta name=ProgId content=Word.Document>
    <meta name=Generator content="Microsoft Word 9">
    <meta name=Originator content="Microsoft Word 9">
    <link rel=File-List href="./Hello%20World_files/filelist.xml">
    <title >Hello World</title>
    <!--[if gte mso 9]><xml>
    <o:DocumentProperties>
    <o:Author>Seth Ramsey</o:Author>
    <o:LastAuthor>Seth Ramsey</o:LastAuthor>
    <o:Revision>1</o:Revision>
    <o:TotalTime>1</o:TotalTime>
    <o:Created>2002-12-20T13:09:00Z</o:Created&g t;
    <o:LastSaved>2002-12-20T13:10:00Z</o:LastSaved>
    <o:Pages>1</o:Pages>
    <o:Company>Arlington/Roe &amp; Co., Inc.</o:Company>
    <o:Lines>1</o:Lines>
    <o:Paragraphs>1</o:Paragraphs>
    <o:Version>9.4402</o:Version>
    </o:DocumentProperties>
    </xml><![endif]-->
    <styl e>
    <!-- /* Style Definitions */
    p.MsoNormal, li.MsoNormal, div.MsoNormal
    {mso-style-parent:"";
    margin:0in;
    margin-bottom:.0001pt;
    mso-pagination:widow-orphan;
    font-size:12.0pt;
    font-family:"Times New Roman";
    mso-fareast-font-family:"Times New Roman";}
    @page Section1
    {size:8.5in 11.0in;
    margin:1.0in 1.25in 1.0in 1.25in;
    mso-header-margin:.5in;
    mso-footer-margin:.5in;
    mso-paper-source:0;}
    div.Section1
    {page:Section1;}
    -->
    </style>
    </head>

    <body lang=EN-US style='tab-interval:.5in'>

    <div class=Section1>

    <p class=MsoNormal>Hello World.</p>

    </div>

    </body>

    </html>

    --

    --
    "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
  188. How to convert DOC to XML? by IXI · · Score: 1

    just caesar 9 and then swap chars on even and odd positions.

    --
    He saw some dirty arabs and fired. Too bad it was just some friendly kurds, BBC reporters and his fellow cowboys.
    1. Re:How to convert DOC to XML? by IXI · · Score: 1
      Implementation:
      #! /bin/sh
      #
      # Copyright (C) 2002 IXI
      #
      # This program is free software; you can
      # redistribute it and/or modify it under
      # the terms of the GNU General Public License
      # as published by the Free Software Foundation;
      # either version 2 of the License, or (at your
      # option) any later version.
      #
      caesar 9|sed -e '/\(.\)\(.\)/s//\2\1/g'
      or
      #! /bin/sed
      #
      # Copyright (C) 2002 IXI
      #
      # This program is free software; you can
      # redistribute it and/or modify it under
      # the terms of the GNU General Public License
      # as published by the Free Software Foundation;
      # either version 2 of the License, or (at your
      # option) any later version.
      #
      y/ABCDEFGHIJKLMNOPQRSTUVWXYZ/JKLMNOPQ RSTUVWXYZABCDEFGHI/
      y/abcdefghijklmnopqrstuvwxyz/ jklmnopqrstuvwxyzabcdefghi/
      s/\(.\)\(.\)/\2\1/g
      --
      He saw some dirty arabs and fired. Too bad it was just some friendly kurds, BBC reporters and his fellow cowboys.
  189. Yes and No by burgburgburg · · Score: 2
    Yes, I wrote that Slashdot inserted the spaces (using the commonly accepted /. as a substitute).

    I never said it was impossibly bad. I have grepped out the stupid ridiculous errors. It just took far more time then it should have because the code was just so atrocious.

    I often found it easier to save Word files as raw text and write the HTML around them instead of having Word do it. It saved time. That's a sad, pathetic statement.

  190. Re:just run standard tools or write your own. by apankrat · · Score: 1

    On any Unix or Unix clone you can just run standard tools or write your own.

    I would really really like to see you writing a viable substitute for MS Word. Let me explain.

    Most of the people forget that the proprietory file format is not the most important of things that made Microsoft a monopoly.

    Their products were simply better and more appealing to users (either visually or financially) than WordPerfect, QuattroPro and others. Even if the file format was open, it would not have changed a thing, because one would've needed to compete against MS development resources, which would be a challenging task.

    The same remain true now. PDF and RTF are the open formats. So why aren't there a flourishing competition to the Acrobat and such ?

    Unfortunatly with everything in a proprietary format you then end up having to build scripting languages into everything making all of your data files potential entry points for malicious code.

    This is absurd. I dont know who is the mysterious 'you' in the quote, but the need for the scripting is not driven by the proprietorness of the format.

    --
    3.243F6A8885A308D313
  191. Wait - MS pioneered SOAP?? by Anonymous Coward · · Score: 0

    I guess I should have known; given that he never showers, SOAP certainly wasn't invented by RMS....

  192. They've patented the core by Frodo420024 · · Score: 1

    If anyone makes a compatible word processor, they'll be sued. M$ in a nutshell...

    --
    I'm in a Unix state of mind.
  193. Do they still Not Get It? by billstewart · · Score: 2

    " But this format will (of course) only represent the appearance, not any structure." WHAT!?!?! Do they still not bloody get the bloody concept, or are they deliberately trying to make interoperability unusable? They did this in earlier versions of Office with their save-as-html modes, which did stupid things like saving a "Header Type 2" as "14-point-boldface-text" or whatever your current style was rather than saving it as an HTML "H2", but at that point it could be attributed to stupidity and/or incompetence, since some people think for some reason that HTML is an appearance description language rather than an specific implementation instance of a content description metalanguage, which is a bit too abstract for some people. But XML is much more explicit about being a content description metalanguage, and if you've got enough of a clue about it to output your material as XML, you've got to get that much of the concept. I'd attribute this one to malice.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks