Slashdot Mirror


Microsoft Patents XML Word Processing Documents

theodp writes "Embrace. Extend. Patent. On Tuesday, Microsoft was granted US Patent No. 7,571,169 for its 'invention' of the Word-processing document stored in a single XML file that may be manipulated by applications that understand XML. Presumably developers are protected by Microsoft's 'covenant not to sue,' so the biggest question raised by this patent is: How in the world was it granted in light of the 40-year history of document markup languages? Next thing you know, the USPTO will give Microsoft a patent for Providing Emergency Data in XML format. Oops, too late."

13 of 357 comments

  1. Bad Summary by Grond · · Score: 5, Informative

    As is all too often the case here on Slashdot, the summary has seized upon the title of the patent, which has no legal effect whatsoever, while ignoring the actual patent claims, which are all important.

    If one actually reads the claims, one sees that the main new part of the invention is the 'hint elements' contained in the XML file. The written description expands upon what hint elements mean: "hints are provided within the XML associated files providing applications that understand XML a shortcut to understanding some of the features provided by the word-processor. By using the hints, the applications do not have to know all of the specific details of the internal processing of the word-processor in order to recreate a feature."

    Basically, the invention here is the inclusion of information that lets third-party programs better understand what to do with the format. You can imagine, for example, if HTML included something like this. The del ('strikethrough') tag might be written:

    <del hint="draw line 1px horizontal">

    That code would allow a program that did not natively understand the tag to implement a simple version of it. The idea is to allow new features to be introduced into the format while enabling older versions of the software to use them without updating their code. The necessary code comes with the file.
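
    The fallback idea above can be sketched roughly in Python. Everything here is hypothetical and invented for illustration: the hint attribute's mini-syntax, the KNOWN_TAGS table, and the render function are not defined by HTML or by the patent.

```python
import xml.etree.ElementTree as ET

# Tags this hypothetical renderer understands natively.
KNOWN_TAGS = {"p": "{}", "b": "**{}**"}

def render(elem):
    """Render an element to plain text, using a hint for unknown tags."""
    # Collect this element's own text plus rendered children and their tails.
    inner = elem.text or ""
    for child in elem:
        inner += render(child) + (child.tail or "")
    if elem.tag in KNOWN_TAGS:
        return KNOWN_TAGS[elem.tag].format(inner)
    # Unknown tag: fall back to the hint shipped inside the file, if any.
    hint = elem.get("hint", "")
    if hint.startswith("draw line"):
        return "~~" + inner + "~~"  # crude stand-in for a strikethrough line
    return inner  # no usable hint: show the bare text

doc = ET.fromstring(
    '<p>Price: <del hint="draw line 1px horizontal">$10</del> <b>$8</b></p>'
)
print(render(doc))
```

    The renderer never learns what del means; it only acts on the instruction carried in the file, which is the point of the hint.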

    Now, whether that's still new and nonobvious, I don't know, but it's a significantly more accurate summary of the invention than "Microsoft Patents XML Word Processing Documents."

    1. Re:Bad Summary by Jerry+Coffin · · Score: 5, Informative

      As is all too often the case here on Slashdot, the summary has seized upon the title of the patent, which has no legal effect whatsoever, while ignoring the actual patent claims, which are all important.

      Geeze, there you go ruining everybody's fun, posting facts instead of completely uninformed complaints.

      Next you'll point out that the patent cites no fewer than 77 other patents going back to 1988 as related art, or that it cites 113 other documents, including documentation for file formats of things like AbiWord, StarOffice, Wisdom++, Docbook, WorX, MML, XMill, YAWK, and so on and so forth.

      Were it not for your UID, I'd have to pull out the "you must be new around here" wheeze, since you're in clear violation of /. groupthink guidelines!

      --
      The universe is a figment of its own imagination.
    2. Re:Bad Summary by dshadowwolf · · Score: 3, Informative

      You, apparently, have missed out and not read the actual claims of the patent. This patent covers any XML document which has an XSD definition and has:

      • rendering hints via an element or property of an element
      • a bookmarks element (of which two must be used to be valid)
      • a comments element
      • a 'text' element
      • a 'style' element
      • a 'font' element
      • a formatting element
      • a section element
      • a table element
      • an outline element
      • a proofing element

      And any variation of implementation on the above. It also covers the manipulation of a file meeting that description on any computer—whether or not it has the program that generated the file installed.

      The thing is... this patent can be read as covering HTML5 in its XML embedding and it completely fails the "obviousness test". How does it fail that test? Because it is, simply, plainly obvious to "one skilled in the field". A lot of the above features have been proposed for ODF and are braindead to add to ODF or any other XML format. Additionally XML is used as a format for storing data simply because it is a well defined format and easily manipulated--so easily, in fact, that there is a complete language defined for manipulation and transformation of XML.

      Where it really fails is that it is neither "new" nor "novel". If Netscape had tried to patent the specific version of HTML supported by, say, Navigator 4 there would be as big a backlash. It'd be similar to someone implementing an open spec - say ECMA-262 - and claiming a patent on it as "new" and "novel" because it has a specific set of system interface functions.

      Or maybe you'd like a car analogy... In this case it would be like GM filing a patent on a car because their car has a specific feature set as a standard that a company has not put out as standard options before. I hope you now understand exactly why people are rather pissed about this patent.

  2. This patent does not cover ODF by belmolis · · Score: 4, Informative

    One of the claims in this patent is that everything is stored in a single XML document. That is not true of ODF. An ODF file is the result of zipping up a bunch of files including not only XML files but various other things, such as image files.
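
    A quick way to see the difference is to look at how such a package is laid out. This is only a sketch: it builds a tiny ODF-like zip in memory, with member names that follow the usual ODF layout but placeholder contents, so the result is not a valid document.

```python
import io
import zipfile

# Build a tiny ODF-like package in memory. The member names mirror the usual
# ODF layout, but the contents are placeholders, not a valid document.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as pkg:
    pkg.writestr("mimetype", "application/vnd.oasis.opendocument.text")
    pkg.writestr("content.xml", "<placeholder/>")
    pkg.writestr("styles.xml", "<placeholder/>")
    pkg.writestr("Pictures/logo.png", b"\x89PNG placeholder bytes")

# Reopen the archive and separate the XML parts from everything else.
with zipfile.ZipFile(buf) as pkg:
    members = pkg.namelist()
    xml_parts = [name for name in members if name.endswith(".xml")]

print(members)
print(len(xml_parts), "separate XML parts")
```

    Running the same namelist() call on a real .odt file should show the same pattern: several XML members plus binary ones, rather than one XML file.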

  3. Re:WTF??? by jrumney · · Score: 3, Informative

    Wrong. Each claim stands alone, that is why they always start with a basic all encompassing Claim 1, which probably wouldn't hold up under scrutiny, and refine it in later clauses to cover every special case they can think of. Usually at least some of the claims are mutually exclusive, so to create something that violated all of the claims at once would be impossible.

  4. Re:Won't hold up by MightyMartian · · Score: 4, Informative

    But not before SGML. The whole thing is a pile of shit, a worthless patent predated by at least a quarter century (and probably a bit longer) of markup languages. The US patent system is fucking broken, because if it worked, Microsoft would have been sent packing.

    --
    The world's burning. Moped Jesus spotted on I50. Details at 11.
  5. Their independent claims by dtmos · · Score: 4, Informative

    What matters isn't what the abstract says, it's what the claims, especially the independent claims, say. Here are the two independent claims in this patent, formatted for improved clarity (I hope). They basically say the same thing, except that the first is a "method" claim, claiming a method for doing something (in this case, "creating a document in XML in a computing device that is understandable by many applications"), while the second is an "apparatus" claim, claiming an apparatus (in this case, "a computer-readable storage medium having computer-executable instructions for interacting with a document") that performs a function:

    Claim 1. A method for creating a document in XML ("Extensible Markup Language") in a computing device that is understandable by many applications, comprising:

    accessing a published XSD ("XML Schema Definition") in said computing device, wherein the XSD defines rules relating to the XML file format for documents associated with an application having a rich set of features;

    determining an element to create in an XML file in said computing device, wherein the element is selected from a set of elements, including:
    a style element;
    a hints element that includes information to assist an external application in displaying text of the document;
    a bookmark element; wherein the bookmark element includes an identifier attribute that associates a start bookmark with an end bookmark element wherein two bookmark elements are used in book marking a portion of the document; wherein each of the two bookmark elements include an opening tag and an ending tag;
    a document properties element;
    a text element that contains text of the document; wherein all of the text of the document is stored within text elements such that only the text of the document is contained between start text tags and end text tags; wherein there are no intervening tags between each of the start text tags and each of the corresponding end text tags and wherein each of the start text tags do not include formatting information for the text between each of the start text tags and the end text tags;
    a text run element that includes the formatting information for the text within text elements;
    a font element;
    a formatting element;
    a section element;
    a table element;
    an outline element;
    and a proofing element;

    creating the document including the element in said computing device;

    and storing the document in said computing device.
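
    To make the text, text run, and bookmark language of Claim 1 a bit more concrete, here is a rough sketch in Python. The element and attribute names (run, runProps, text, bookmark, id, type) are made up for illustration and are not taken from the patent or from Microsoft's schema.

```python
import xml.etree.ElementTree as ET

def add_run(parent, content, **formatting):
    """Append a run whose formatting lives on the run, never on the text tag."""
    run = ET.SubElement(parent, "run")
    if formatting:
        ET.SubElement(run, "runProps", formatting)
    text = ET.SubElement(run, "text")  # start tag carries no attributes
    text.text = content                # only character data, no nested tags
    return run

doc = ET.Element("document")
# Two bookmark elements share an id: one marks the start, one the end.
ET.SubElement(doc, "bookmark", {"id": "0", "type": "start"})
add_run(doc, "Hello, ", bold="true")
add_run(doc, "world")
ET.SubElement(doc, "bookmark", {"id": "0", "type": "end"})

# Every text element has no attributes and no child elements.
texts = doc.findall(".//text")
assert all(not t.attrib and len(t) == 0 for t in texts)

# Plain text is recoverable by concatenating the text elements alone.
plain = "".join(t.text for t in texts)
print(plain)

# short_empty_elements=False writes explicit open and close tags.
print(ET.tostring(doc, encoding="unicode", short_empty_elements=False))
```

    Because the text elements hold nothing but characters, a program that ignores every other element can still extract the document's words, which is roughly what the claim's wording about "no intervening tags" describes.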

    Claim 12. A computer-readable storage medium having computer-executable instructions for interacting with a document, comprising:

    interpreting a published XSD (Extensible Markup Language (XML) Schema Definition), wherein the XSD defines rules relating to the XML file format for documents associated with an application having a rich set of features;

    and creating an element in an XML file, wherein the element is selected from a set of elements, including:
    a style element;
    a hints element that is interpreted according to a hints sch

  6. Re:Won't hold up by marcansoft · · Score: 3, Informative

    Samna/Lotus Ami Pro used a text-based markup language for documents. It predated Word for Windows (aka Microsoft Word).

  7. Re:Won't hold up by wampus · · Score: 4, Informative

    You mean XPS?

  8. Re:Won't hold up by marcansoft · · Score: 4, Informative

    How about something like this?
    http://www.xml.com/pub/a/2001/02/07/openoffice.html

    Dated February 07, 2001. States that OpenOffice (its first release as open source) already uses the format and goes on to explain some of the XML used.

  9. Re:two patent offices by PPH · · Score: 3, Informative

    The second patent office already exists. It's called the court system.

    The problem with the funding system is: You (the accused infringer) fund that second one.

    --
    Have gnu, will travel.
  10. Re:Won't hold up by Planesdragon · · Score: 4, Informative

    Heck, I even wrote an XML based text editor back when I was learning Java in 2001 or so.

    Go read the patent. Go!

    The darn thing isn't for a pseudo-WYSIWYG XML editor. It's for a specific bundle of features that let you save your non-XML based word processing file as one single XML file, which includes bookmarks, styles, and "formatting hints" as well.

    Making your word processor save to XHTML, or a randomly selected XML dialect? Obvious. The specific way you do that, and include some conventions for features that XML really wasn't meant to support? Non-obvious, and therefore patentable.

    Also not all that broad.

    And, of course, the real nice thing: this patent only applies if you throw a lot of formatting crap into your XML file as well... and I certainly don't remember anyone dumb enough to do that before Microsoft.

  11. Re:Stop the madness by Tubal-Cain · · Score: 3, Informative

    Red Hat love patents too.

    I'll just point you here.