Slashdot Mirror


Microsoft Ends Era Of Closed File Formats

RzUpAnmsCwrds writes "According to an MSDN Channel 9 interview with an Office file-format developer, the next version of Microsoft Office (Office 12) will default to newly-developed XML file formats in Word, Excel, and PowerPoint. The new formats will apparently include XML files along with other files (images, etc) inside of a Zip file. Microsoft will also be providing extensive documentation of the new format to the public through MSDN. The developer likewise announced that Microsoft would be releasing updates for Office 2000, XP, and 2003 to read and write the new formats when the new version of Office is released. If this interview is correct, it could mean the beginning of the end of Microsoft's proprietary file formats." Coverage at Beta News, Information Week, and the Washington Post.

55 of 651 comments (clear)

  1. Loosing lock-in capability? by haluness · · Score: 4, Interesting

    Would'nt this approach cause MS to loose its lock-in ability based on file format?

    Of course this assumes that lock-in was one of their goals with a propietary format

    1. Re:Loosing lock-in capability? by IntlHarvester · · Score: 4, Insightful

      The future profitibility of MS Office is as a component of network groupware systems. Because if you are primarily using Office in standalone mode, you are just fine with any version of Office released in the last 8 years. So, the "value" has to be in improved collaboration or document management.

      In this respect, Microsoft needs open formats just as much as anyone. Ever try to write a server-based system that reads information from DOC files? Using winword.exe with automation just doesn't really work. XML lets MS use a relatively lightweight parser in a server-based system.

      Oh, and changing the default fileformat will surely spur some upgrades, but from what I've seen the corporate market is generally not in a big hurry to get onto the latest version of Office. I don't foresee a repeat of Office 97.

      --
      Business. Numbers. Money. People. Computer World.
    2. Re:Loosing lock-in capability? by Shalda · · Score: 4, Interesting

      The existing Office document formats are all pretty well documented by 3rd parties. It hasn't even scratched Microsoft's sales. Microsoft's vision on this takes several forms. First, Office is a suite of programs that interoperate. You can embed your spreadsheet in your Word doc or your PowerPoint presentation. Update your spreadsheet and it also updates wherever those numbers also exist. Their second angle is for developers. They want you to use .NET for your in house development. Your user needs to send out a letter? Your program will pull up a document and prefill nearly everything. Thirdly, the now have something they can take to governments and other organizations that are demanding open formats. Finally, no matter how well it's documented, there will be dozens of odd little quirks. But that's ok, so long as you're using the tools that Microsoft provides. Proprietary formats get cracked, quickly and easily. Microsoft wants an end to end lock in.

    3. Re:Loosing lock-in capability? by bckrispi · · Score: 2, Informative

      Well, considering that on Tuesday, they were granted a patent on marshaling XML to and from objects, I'd guess they still have their bases covered. Yeah, the XML is "open", but you can't write an application to convert that XML into an object map without violating their new "intellectual property".

      --
      Xenon, where's my money? -Borno
    4. Re:Loosing lock-in capability? by jacksonj04 · · Score: 2, Informative

      I'm with you on this. Much as I'm not keen on MS, the SharePoint server is absolutely phenominal in terms of actually getting things done in a group. Tie it with a properly configured Exchange Server and a 2003 domain, and you have a rock solid (Yes, solid) platform for group work, communication and management that OSS can't even touch.

      --
      How many people can read hex if only you and dead people can read hex?
    5. Re:Loosing lock-in capability? by Trigun · · Score: 2, Insightful

      you have a rock solid (Yes, solid) platform for group work, communication and management that OSS can't even touch.

      Which, imho, is one of the two problems with the OSS business desktop. Gnome and KDE are great desktops, Linux has a long pedigree of network interoperatiblity, but it is really nothing more than a chain of islands, each doing its own thing. There is not a large network collaboration software for linux that takes care of all of the needs of business users. You can cobble the parts together, but if developers don't control all aspects of the equation, it makes things difficult.

      Microsoft is in a unique position where it can tie in parts of its operating system and application software together for a 'just works' solution. People can cobble together a 'works' solution, and even a 'works better than MS' solution, but there is a lot of issues with setting these solutions up. To date (and I have been looking) there is no single definitive solution for something as simple as network logon, and the preferred solution (Ldap, pam and Kerberos) is not the easiest thing to deploy.

      Even if you were to create a ldap-pam-kerberos network, with a document management system that used the kerberos authentication, e-mail that used kerberos authentication, and a plugin that allowed you to check out and check in documents into the dms for OpenOffice,without using a third party middleware that added twenty extra steps into it, you would need a huge company or dedicated group to do it, do it right, and do it seamlessly.

      Novell's working on it, but because it's new, it isn't mature enough for business to see it as a viable solution. Novell still has its fanboys, and their stuff does warrant it, but they are not seen as a competitive threat to MS.

    6. Re:Loosing lock-in capability? by VGR · · Score: 3, Informative

      Here's two examples of prior art.

      --
      The Internet is full. Go away.
    7. Re:Loosing lock-in capability? by xpl_the_myst · · Score: 2, Informative

      Open Format: These formats use XML and ZIP, and they will be fully documented. Anyone will be able to get the full specs on the formats and there will be a royalty free license for anyone that wants to work with the files.

      From the blog of Brian Jones, Program Manager Microsoft Word here. So they intend to enforce control over who uses them, but not by paying royalties.

      --
      This sig is empty.
    8. Re:Loosing lock-in capability? by dave420 · · Score: 2, Interesting

      on a windows server you can use com objects to open a doc file very quickly. I do it in PHP every once in a while. Locked-in file formats don't even figure in most peoples' thinking. They just want software they're familiar with. Most people don't have software ideology.

  2. Yes and No by Anonymous Coward · · Score: 2, Interesting

    They will use an "open" XML format, but some of the objects embeded in that XML file will be binary (read prorpietary).

    1. Re:Yes and No by aldoman · · Score: 3, Informative

      No they won't.

      Watch the video - the entire file format is completely open.

      He admitted that inside the ZIP they are currently storing the binary copy to make it easier to test and profile against the formats, but when Office 12 is released it'll just be the one XML, completely open format. He also made a point that they are going to have 'thousands' of examples on MSDN, along with very detailed documentation and whitepapers.

      Now whether it's patented or not, I don't know. But this is a _VERY_ big step for Microsoft. It's going to make translating between this and OASIS (which OpenOffice2 and a lot of others are considering/implementing as their default) as simple as an XSLT transformation.

    2. Re:Yes and No by LWATCDR · · Score: 2, Informative

      Last time I checked the XML format stored a serialized form of COM objects that where not documented. Better than nothing but not really all that open.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    3. Re:Yes and No by mwood · · Score: 2, Funny

      They *do* have a choice: they could compete on quality and usefulness. Power is not the only option.

  3. ZIP patent... by Vo0k · · Score: 2, Interesting

    AFAIK ZIP was evil because of some patent issues, and that's why gzip was developed. The patent has supposedly expired in the US, but not necessarily in all other countries (same as with GIF). Any info on that?

    --
    Anagram("United States of America") == "Dine out, taste a Mac, fries"
    1. Re:ZIP patent... by andi75 · · Score: 3, Informative
      To clean up some confusion:

      gzip and zip are completely different things. gzip compresses a stream (and does a much better job than compress, which it has replaced entirely. However, gzip is slowly being replazed by bzip2 nowadays), whereas zip is an archive format that can store individual (usually compressed) files. The huge advantage of zip over compressed tar archives comes from the fact that you have random access, i.e. can extract a single file from a potentially HUGE archive).

      GIF had patent issues with the LZW-Algorithm it used. The patent has expired recently, but the GIF issue is completely unrelated to ZIP (ZIP uses LZ77).

      About the patent issue: There are a dozen or so zip-related patents, but they're all highly specific and shouldn't stop anyone from using zip, or even writing a zip utility. See also Patents on data compression algorithms.

    2. Re:ZIP patent... by evilviper · · Score: 2, Insightful
      The huge advantage of zip over compressed tar archives comes from the fact that you have random access, i.e. can extract a single file from a potentially HUGE archive).

      Actually, I don't find much of an advantage. In my experience, even if you are trying to extract a tiny file from a large archive, it still seeks through the majority of the zip file, and is only slightly faster than uncompressing the entire thing.

      Despite that, tar and gzip could be even better. A little programming and you could modify tar to transparently compress individual files with gzip/bzip2 before adding them to the tar archive. In other words, instead of a "tar.gz" file, you'd have a "gz.tar" file...

      Why this hasn't been done yet, I don't know.
      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  4. Convenient... by daveschroeder · · Score: 3, Interesting

    ...now that they've all but killed off all of the commercial, vendor-supported competition.

    And whatever happened to Office Integrated Rights Management, essentially a DRM for Office documents (New Office locks down documents) that (of course) requires a Windows server to administer, and only works with Microsoft Office? You don't think that they're just going to let that go by the wayside, do you?

    And what about patents?

    Sure, OpenOffice is great, but commercial enterprises will stick with commercial solutions for which there is support. And yes, this could be built for something like OpenOffice (and indeed exists for StarOffice), just as it has been for Red Hat, but I can't see this as anything more than a much belated, empty gesture on Microsoft's part. This sums it up: "Microsoft is doing this as a way to protect its presence on the desktop." Microsoft even dug up Charles Goldfarb, "co-inventor of the concept of markup languages", for its press release to say, "Making XML the default Office file format is, for me, the culmination of a 35-year dream," Charles F. Goldfarb, the inventor of the markup language technology, said in a statement released by Microsoft. Nice touch.

    Also, "Microsoft Ends Era Of Closed File Formats" is a little overreaching, don't you think? They're looking for the biggest lock-in of all with the proprietary Windows Media formats. Microsoft wants to be everywhere there is any kind of media, and it's NOT open. Boy, I can't wait to live in a world where Microsoft controls and meters content and has everyone from the end consumer to cable, satellite, and telecom operators, movie and TV production houses, and everyone in between by the balls, which is exactly what will happen if they get their way. (And submission to SMPTE *hardly* means anything. Standards are standards AFTER they've been vetted by standards bodies, have had the patent searches and pools completed, etc., and have been, you know, actually approved. Not when they've been "submitted for consideration". Further, that gesture is nothing more than an attempt to get pinhead PHB-type managers and executives on board with Microsoft when their technical underlings are pulling for open standards like H.264 - then Microsoft can shoot back to the management, Hey, we're just as open as the MPEG family of standards! Look, we even submitted our codec to SMPTE! It's not our fault they take so long to approve things! Do you really want all that H-dot-whatever-gobbledeygook that your oddball IT guys are talking about? After all, that's what *Apple* uses. You don't want an Apple technology, do you? Go with us; you know Microsoft is the right choice for your 18-million-customer cable service! Utter bullshit. And ignores the fact that all of the codec improvements and tools will NOT be open; the SMPTE submission is nothing more than a thinly veiled attempt to put Windows Media everywhere as well by claiming to be "open" when they're anything but.)

    1. Re:Convenient... by AKAImBatman · · Score: 4, Insightful

      Also, "Microsoft Ends Era Of Closed File Formats" is a little overreaching, don't you think?

      That's exactly what I was thinking. If Microsoft was really opening up Office, why didn't they go for the OASIS Spec? Me thinks that this is an attempt by Microsoft to lead the industry around by the nose, thus solidifying their place as "Industry Leader". And with a proprietary document format, they can make minor, but frustrating, changes every version just to keep the competition on its toes.

    2. Re:Convenient... by telbij · · Score: 3, Insightful

      Well I gotta hand it to you for a what amounts to an absolutely brilliant troll. You had me nodding my head the whole way through, but actually your response is just as hyperbolic as the story title. I really don't care to get into all the details... but one thing you said,

      I can't see this as anything more than a much belated, empty gesture on Microsoft's part.

      is true from the MS perspective, but that doesn't mean nothing good can come of it. Having a documented XML format could do wonders for OpenOffice compatibility, which wouldn't necessarily put a dent in Microsoft's monopoly, but it would make life a lot easier for those of us who don't want to participate in it. I'm not saying it'll pan out, just that there are possible real benefits.

    3. Re:Convenient... by KagatoLNX · · Score: 2, Insightful

      Ummmm, hello people. This is an XML format.

      If MS needs extensions, that's what namespaces are for.

      As long as MS extensions don't change formatting functionality (this is really not rocket science, Word is not an innovator here), they can tack whatever metadata they need into the file format and still have it be portable.

      If you don't believe me, look at what Inkscape has done with SVG. Psodipodi built on it, adding a namespace to provide their needed data. Inkscape did the same on top of that. It produces one file containing three XML namespaces containing with interoperable metadata for two editors--and it's viewable in stock SVG viewers.

      Obviously it's up to Microsoft to "do the right thing" with their metadata, but this certainly levels the playing field so that others can do what they need to with the documents.

      Now the patent on XML-to-object-mappings, that's another story...

      --
      I think Mauve has the most RAM. --PHB (Dilbert Comic)
  5. Published Specs is Not Enough by gowen · · Score: 2, Informative

    Because, let's face it, the only reason this is happening is because MS have lost the battle to outlaw reverse engineering. Now they'll have widely available specs for their file format -- and all you'll have to do is license the 20 or so patents that protect these formats, and you'll be able to make a competing product that can read Excel files.

    Remember, GIF was a completely open format -- but that didn't mean Open Source software got to use them freely.

    --
    Athletic Scholarships to universities make as much sense as academic scholarships to sports teams.
  6. XtraML by OakDragon · · Score: 2, Interesting

    I wonder if they'll do the same thing to XML as what FrontPage did to HTML...

  7. Hopefully the end of .doc, etc incompatibilities by Mikito · · Score: 2, Insightful

    Hopefully this file format change will bring about the end of ever-changing file formats from one version of an app to the next. Who among us doesn't have files saved in an old version of, say, Word, which can no longer be read correctly in a newer version of Word?

    --
    Anakin Simpson: If you're not with me, then you're my enemy--ooh, donuts!
  8. Consider this. by PsychicX · · Score: 5, Interesting

    It's interesting that they're doing this. I've been playing with OOo 2.0 beta lately, both under windows and *nix. I'm an Office user, but a home user, not a power user (I'm not a business dealing in several hundred page docs, I just do my homework). And I basically can't see any particular difference between the two packages. I have Office 2000, and so I'm using it, but I'd probably be perfectly comfortable using only OOo (2.0, I hate 1.1)

    Anyway, my point is that MS is making it clear that they're not threatened by competing packages, and I'm not entirely sure why not. OOo could easily replace Office for many (I hesitate to say most) users, and if we switch to totally open formats, they'll be able to interoperate without any difficulties. I'm not trying to say that OOo is in a position to hurt Office...but I'm curious if it might be. MS doesn't seem to think so, and I'm really, really wondering what makes them so nonchalant.

  9. Re:Patents? by The+Snowman · · Score: 3, Informative

    You fucking troll, since when microsoft's binary document formats are documented, or fast?

    Implementing a file format as binary data or even a simple SGML structure such as RTF means less overhead. Using XML you have to run an XML parser, and the file is more freeform. There are no set data structures, it is just a stream of text. With a binary format you can structure it in such a way that you can read a header in and know exactly where to seek in the file to get the information you need. With XML you are pretty much stuck reading sequentially and figuring things out as you go along. Sure, an XML parser library may make it easier, but behind the scenes it is still parsing that stream and processing each tag one at a time.

    --
    24 beers in a case, 24 hours in a day. Coincidence? I think not!
  10. Catching up, but still missing OpenDocument by SgtChaireBourne · · Score: 2, Insightful
    MS looks like it's goal is to catch up with OpenOffice.org/StarOffice, which have had this kind of XML support for many years. Other, lesser, suites also have zipped XML files, like AbiWord.

    The one thing that these others have in common, that MS Office lacks, is support for the OpenDocument DTD. OpenOffice.org v2 will use OpenDocument as its main format.

    Note that many of the articles linked to by the original post express skepticism about how open MS' XML will actually be. Recall that in the last year, and even in the last weeks, MS has sought patents from the USPTO for XML and XML related functions. And is even now pushing to get legislation in Europe to make those same patents valid in the EU. That smacks more of a PR stunt rather than an actual opening up.

    Furthermore, since the articles don't mention the current leaders in productivity tools with XML-based formats (i.e. OpenOffice.org or StarOffice), that looks all the more like warmed over press release being passed of onto the public as news. What's next? A press release about MS suddenly supporting PDF export like in OOo or StarOffice?

    --
    Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.
  11. 75% file size savings by Bytal · · Score: 4, Informative

    For those who don't want to watch the video, the new format will supposedly offer a %75 improvement in file size. The old, binary format did not use any compression at all. Some of the other features include having the formatting information at the end of the file so that a half transmitted file still contains all the content.

  12. Nice marketing ploy. Too bad it's a scam by doublem · · Score: 2, Insightful

    Access to the MSDN documentation will require a MSDN developer's subscription and a signed NDA. The NDA will of course forbid the use of file format specification in unsecured software. Appropriate copyright, patent and other licensing fees will be required of developers writing commercial software to access the new file format.

    All kidding aside, I think any hope about this is misplaced. There will no doubt be numerous restrictions on the use of the format information.

    There's also the fact that MS has done quite a bit to document their Office Formats in the past. The major issue is that the documentation differs significantly from the implementations.

    In other words, this is a load of marketing, designed to grab a few buzz words so the sales staff can toss around the phrase "Open Format" when necessary.

    --
    "Live Free or Die." Don't like it? Then keep out of the USA
  13. Microsoft begins era of patent encumbered formats by dyfet · · Score: 5, Insightful
    Take look at Office 2003 XML Reference Schema Patent License and reconcile that with the claims and headline of this article.

    In particular; consider "Microsoft may have patents and/or patent applications that are necessary for you to license in order to make, sell, or distribute software programs that read or write files that comply with the Microsoft specifications for the Office Schemas." taken from the same page...

    What changed? How is that an "improvement" exactly?

  14. Re:Heard this before by AKAImBatman · · Score: 4, Interesting

    Actually, that's not a bad point. What's to stop Microsoft from implanting things like VBScriptlets that are required for the document to render? That would put a real cramp in the competitors' ability to read the format, regardless of how "Open" it is.

  15. Lock-in continues via DRM by SgtChaireBourne · · Score: 5, Insightful
    Would'nt this approach cause MS to loose its lock-in ability based on file format?
    No. The lock-in continues via DRM and ties to "Office Servers". MS is really pushing the server based aspects of Office 12, so there will be hooks to the server like crazy. MS is also really pushing the DRM encumberance in Office 12. In all likelihood, the XML files will still have key components encrypted so as to support MS' DRM and as a 'side effect' lock out competitors.

    The interesting thing is that all this server based control and logging of DMR'd functions gives an enormous boost to the type of information available for international and corporate espionage. Through backdoors, security holes or escrow keys it was possible before to get only the documents themselves for the most part. Now it's possible to monitor who's collaborating with who, and see everyone in the distribution chain.

    That much can be guessed even now during the vaporware stages. However, as more technical information becomes available it will be possible to guess whether these same functions can be used for more than monitoring and can actually be used to stifle or suppress dissent or specific individuals or groups.

    --
    Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.
    1. Re:Lock-in continues via DRM by man_of_mr_e · · Score: 4, Informative

      Why do people like you keep reiterating the tired old "without a network" line?

      It hasn't been true since at least NT4 SP6a, when NT4 achieved a C2 rating *WITH* network. Windows 2000 achieved CC both with and without networking.

      The NT4 link is no longer around on MS's site, but there are still some pages out there that reference it:

      Such as this one

      And here is Win2k

  16. Re:Patents? by menkhaura · · Score: 3, Informative

    I agree with you in that binary formats can be faster, and I don't love XML-as-storage-format too much, but the case in point is *microsoft's* binary formats, which are little more than straight memory dumps, and UNDOCUMENTED, and SLOW.

    A well-designed binary format makes much more sense than XML, in this I concur with you, but XML is better than current microsoft's doc formats in that it would be easier to figure out the inner workings of the format, and making struggle for compatibility a much less gory task.

    --
    Stupidity is an equal opportunity striker.
    Fellow slashdotter Bill Dog
  17. Re:Nice marketing ploy. Too bad it's a scam by SolidGround · · Score: 5, Informative

    First of all, the entire MSDN library can easily be accessed online (http://msdn.microsoft.com/library/), second an MSDN subscription doesn't involve any kind of NDA. The only times I've personally come across this was with pre-release stuff and with their limited beta programs and in those cases it's nothing that any other company doesn't do either.

  18. Re:Sheesh by Slashcrap · · Score: 2, Informative

    Microsoft announces that they are going to do something that Slashdot has wanted them to do for ages, and Slashdot proceeds to find faults with it?

    Leaving aside that fact that, like you, Slashdot is not an intelligent entity with needs and desires, this is not what we wanted.

    We want an open, documented and non-patent encumbered format that allows interoperability between Office and other software.

    That is not what this is.

    Another poster has already provided a link that will tell you everything you need to know about the conditions attached to these formats.

    They have patented everything. You will need to licence their patents in order to use these formats in your software.

    That makes them useless. And yet you want us to get down on our knees and praise Microsoft for their generosity? With all due respect, you are either a Microsoft shill or a fucking retard. Not that I am suggesting that those two options are mutually exclusive.

  19. So they claim. by mcc · · Score: 4, Insightful

    Watch the video - the entire file format is completely open.

    Honestly, I am not going to believe it until I see it.

    Microsoft has lied before.

    It's quite possible they don't intend to open their file formats at all, they just intend to make the Washington Post and its readers think they've opened their file formats. In the meantime, if Microsoft actually wanted to "end the era of closed file formats", all they'd have to do is, you know, actually comply with the letter of the antitrust decision currently handed down against them in the E.U. and the spirit of the toothless antitrust "settlement" currently in effect against them in the U.S.. Mysteriously, they haven't.

  20. Re:Nice marketing ploy. Too bad it's a scam by doublem · · Score: 2, Informative

    Office 2003 XML Reference Schema Patent License

    Just TRY to use the Office XML Specification in an Open Source application. Go ahead, I dare you.

    --
    "Live Free or Die." Don't like it? Then keep out of the USA
  21. Re:Patents? by rben · · Score: 5, Insightful

    I use Open Office exclusively and have for the past couple of years. Reading the files in certainly isn't a problem for me. The only files that are slow to load are the master document files, and that's because they link to dozens of other files.

    The XML specification is being expanded (it might already be done) to allow binary formats. There are good reasons, though, why it's best to keep data files in straight XML text format. It eliminates the need to worry about machine architecture. Little endian or big endian, it maks no difference to you. The files are perfectly portable across platforms, which is increasingly important these days. XML files zip very nicely, making them almost as small as a corresponding binary file.

    It is far easier to provide backwards compatability to earlier file formats when you are using XML than if you are using binary file formats. With XML, if it sees a tag it doesn't understand, the parser ignores it. If a binary file format loader sees stuff it doesn't understand, it bails out with an illegal file format error.

    When you move to a new expanded file format with XML, you don't have to write a conversion utility. Since you are merely adding new tags, your program can read any of your old data just fine, then add the appropriate tags and new data. This saves a great deal of trouble for programers.

    Machines are fast and cheap. People are slow and expensive. It is far better to have our computers do a little extra work on loading a text file and eliminate conversion utilities and complicated loading routines that a prone to bugs.

    --

    -All that is gold does not glitter - Tolkien
    www.ra

  22. Re:Nice marketing ploy. Too bad it's a scam by yeremein · · Score: 5, Informative

    Uhm ... try reading the license.

    Looks kinda like a BSD license, don't it?


    Yeah, especially the part that says "You are not licensed to sublicense or transfer your rights."

  23. Re:Microsoft begins era of patent encumbered forma by Anonymous Coward · · Score: 2, Interesting

    Finish reading, numnuts.

    Stripped down to its essence, the license says:

    Microsoft hereby grants you a royalty-free license under Microsoft's Necessary Claims to make, use, sell, offer to sell, import, and otherwise distribute Licensed Implementations solely for the purpose of reading and writing files that comply with the Microsoft specifications for the Office Schemas.

    If you distribute, license or sell a Licensed Implementation, this license is conditioned upon you requiring that the following notice be prominently displayed in all copies and derivative works of your source code and in copies of the documentation and licenses associated with your Licensed Implementation:

    "This product may incorporate intellectual property owned by Microsoft Corporation. The terms and conditions upon which Microsoft is licensing such intellectual property may be found at http://msdn.microsoft.com/library/en-us/odcXMLRef/ html/odcXMLRefLegalNotice.asp."

    By including the above notice in a Licensed Implementation, you will be deemed to have accepted the terms and conditions of this license. You are not licensed to distribute a Licensed Implementation under license terms and conditions that prohibit the terms and conditions of this license.

    You are not licensed to sublicense or transfer your rights.

    Microsoft reserves the right to terminate this license grant if you sue Microsoft or any of Microsoft's affiliates for patent infringement over claims relating to reading or writing of files that comply with the Office Schemas. This license is perpetual subject to this reservation.


    So basically, the license grants you perpetual rights to read or write 'their' schema so long as you don't sue them for patent infringement, and you do display their blurb in your source and docs. You can't sublicense 'their' schema, but no problem, other can get the same license you got, simply by attaching the blurb.

    Looks a lot like the *more* free BSD license!

  24. Re:Loosing lock-in capability? No... by amliebsch · · Score: 2, Informative
    So if you want to be able to use MS file formats, you'll have to pay a hefty patent licensing fee.

    Or not.

    http://www.microsoft.com/Office/xml/faq.mspx

    Q. Who can obtain a license?

    A. The license is not restricted to particular individuals or entities. It is available for customers, governments, academics, hobbyists, and IT companies.

    ...

    Q. How much does the license cost?

    A. The license for the Office 2003 XML Reference Schemas is being made available free of charge.

    --
    If you don't know where you are going, you will wind up somewhere else.
  25. Re:Patents? by Ruphuz · · Score: 2, Insightful
    What information do you have that suggests that the XML format will be bloated?

    Not to be a troll, but Microsoft Word's HTML output gives a good idea of how greatly can they bloat XML.

    --
    My other post is a First.
  26. Re:Patents? by jedidiah · · Score: 2, Insightful

    So you think that a 1Ghz cpu is going to be slowed down because someone's resume or board of directors presentation isn't binary anymore?

    Oh puuuulllleeze.

    Such concerns might be relevant for a C64 or perhaps even a MacPlus. However, for small consumer documents such notions are absurd.

    This isn't exactly someone's corporate data warehouse we're talking about here.

    --
    A Pirate and a Puritan look the same on a balance sheet.
  27. Re:Patents? by Zeinfeld · · Score: 2, Insightful
    Implementing a file format as binary data or even a simple SGML structure such as RTF means less overhead. Using XML you have to run an XML parser, and the file is more freeform. There are no set data structures, it is just a stream of text.

    The term you are looking for here is 'self describing structure'.

    If you have data in the form of LISP S-Expressions you know where structures start and stop. If you have just one document in that format you can pretty much work out the entire file format - or at least the features being used.

    If you have a binary document you have to do a lot more digging and it can take you days to just work out the basic structure.

    This will make Word much more useful, it will be much easier to create documents with other applications and emit them in Word format. So for example if I have a report writer component in my server I could spit out a Word Document rather than HTML which I would use today.

    I can also write filters to automatically convert from Word format to other formats, so I can take HTML source and spit out Word, I can take an XML data structure and emit word.

    So why would I prefer Word format over HTML when I was one of the people who helped write HTML? Well the answer is that virtually all HTML editors are optimized for editing Web pages. I write books and other reports that really don't fit that structure. There is nothing in HTML land that I have seen that provides the power of the Word outline mode and has built in spell checking as you go.

    --
    Looking for an Information Security student project suggestion?
    Try http://dotcrimeManifesto.com/
  28. Patents can be lost by btarval · · Score: 2, Informative
    Yes, submarine Patents are indeed the first thing one suspects Microsoft of doing whenever they appear to open some interface up. With good cause too, given Microsofts' current rampage on Patent Filing.

    However, Microsoft may be walking a tight line here (at least in theory) as they are a convicted Monopolist. I refer you to this quote from Nolo Press' "Patent It Yourself" (p 1/8), on how Patents can be lost:

    "The patent owner engages in certain defined types of illegal conduct, that is, commits antitrust or other violations connected with the patent".

    I'm not suggesting that Microsofts' patents aren't a threat to the Open Source community; nor am I suggesting that the patents be taken lightly in any way. However, I do have to wonder how much of Steve Balmer's chest-thumping about Patents is just FUD, and perhaps Microsoft isn't as strong on this point as they'd like everyone to believe.

    I have no doubt whatsoever that Microsoft would try to pull whatever they can get away with. The question I would like to raise is whether they would actually be successful (or how successful they might actually be); especially given that there are now deep pockets behind Open Source?

    I am not a lawyer (nor ever wish to be one ;) ). But I mention this for two reasons. First, for everyone's general awareness. And second, to solicit some input from those who are more knowledgeable in this area of the law.

    --
    The best way to predict the future is to create it. - Peter Drucker.
  29. Re:Loosing lock-in capability? No... by AstroDrabb · · Score: 3, Informative
    Or just be locked out if your using the most popular open source license?
    Q. Can I distribute a licensed program under an open source software license?

    A. Yes. There are many open source licenses available in the developer community. One useful place to review the various licenses that have been approved by the open source community is at Open Source Initiative. The terms and conditions of these licenses differ in material respects. We believe you can distribute your program under many open source software licenses so long as you include the notices described in the licenses for the Office 2003 XML Reference Schemas. On the other hand, some open source licenses may include specific constraints or restrictions that might preclude development under the Office 2003 XML Reference Schema licenses. You should check with your legal counsel if you have questions about a particular open source software license.
    I guess it is just a coincidence that MS made their office XML patent license incompatible with the GPL?
    --
    If Tyranny and Oppression come to this land,
    it will be in the guise of fighting a foreign enemy. -James Madison
  30. Re:Nice marketing ploy. Too bad it's a scam by amliebsch · · Score: 3, Informative
    Yeah, especially the part that says "You are not licensed to sublicense or transfer your rights."

    From the FAQ:

    Q. Can the licenses for the Office 2003 XML Reference Schemas be used by open source developers?

    A. Yes. Open source developers who wish to participate in a community development project can enter into the agreements and then work in a collaborative fashion on development of a program or programs.

    Q. Can I distribute a program that can read and/or write files that support the Office 2003 XML Reference Schemas in source code form?

    A. Yes. You can distribute your program in source code form. But, note that the patent and copyright provisions in the license for the Office 2003 XML Reference Schemas require you to include a notice of attribution in your program.

    Q. Can I distribute a licensed program under an open source software license?

    A. Yes. There are many open source licenses available in the developer community. One useful place to review the various licenses that have been approved by the open source community is at Open Source Initiative.

    The terms and conditions of these licenses differ in material respects. We believe you can distribute your program under many open source software licenses so long as you include the notices described in the licenses for the Office 2003 XML Reference Schemas. On the other hand, some open source licenses may include specific constraints or restrictions that might preclude development under the Office 2003 XML Reference Schema licenses. You should check with your legal counsel if you have questions about a particular open source software license.

    --
    If you don't know where you are going, you will wind up somewhere else.
  31. Re:Patents? by Master+of+Transhuman · · Score: 2, Interesting


    While VB.Net has its admirers, I think Mono was more a response to the need to interoperate and prevent MS from taking over Net apps development and locking it in to MS languages, more than it was any "admiration".

    --
    Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
  32. Re:Loosing lock-in capability? No... by Kiryat+Malachi · · Score: 2, Informative

    From the website:

    But, note that the patent and copyright provisions in the license for the Office 2003 XML Reference Schemas require you to include a notice of attribution in your program.

    I'm just guessing that the GPL's noted incompatibility with an advertising clause is what breaks compatibility here. MS being MS, they could well have done it intentionally; that said, an advertising clause might also have simply been seen as appropriate. Who knows.

    --

    ---
    Mod me down, you fucking twits. Go ahead. I dare you.
    (I read with sigs off.)
  33. Re:CPUs that run Gzip are not free(beer) by tepples · · Score: 2, Insightful

    XML data structures are all going to make life a whole lot easier now that memory and processing time are such commodities.

    Commodity != free, especially when you are trying to deploy something on a battery-powered device.

  34. Re:Microsoft begins era of patent encumbered forma by cahiha · · Score: 2, Informative

    Clean-room reverse engineering doesn't help if the format is patented, and this format is patented.

  35. Re:Patents? by Baricom · · Score: 2, Interesting

    It's a shame Microsoft won't be using the recently-standardized OpenDocument format, which would mean exisiting products could read it. It also saves the XML file and all included data files (images, etc) within a ZIP file.

    (Disclaimer: They might be using it, but TFA doesn't mention it and it wouldn't fit with their MO.)

  36. Re:Loosing lock-in capability? No... by Kiryat+Malachi · · Score: 2, Insightful

    Considering that even MS sometimes has problems interpreting their own binary formats from way back when, I could see it just being a move on their part to eliminate that problem in the future.

    That said, I don't believe the GPL incompatibility was intentional; I believe it to be a side-effect of what MS thought was an appropriate way to make sure that their patents and patent issues were appropriately labeled in software using their schema.

    --

    ---
    Mod me down, you fucking twits. Go ahead. I dare you.
    (I read with sigs off.)
  37. Re:Patents? by gui_tarzan2000 · · Score: 2, Informative
    "Can you please redirect me to a file that takes a long time to load (after the program is started)"

    Take a look at a high school senior's Publisher & PowerPoint files some time. They are HUGE, especially with lots of pictures (as instructed). It's not unusual to see file sizes 100-250mb on an ordinary PP or Pub file. And boy do they take a long time to open across a network on P4 computers with lots of ram. We just had a graduation PowerPoint file that was in the neighborhood of 700mb and it took a couple of minutes to open from the network (using a 100BaseT connection).

    I've told the teachers to make sure the kids optimize their photos, etc. but it falls on deaf ears...

    --
    Have you hugged your penguin today?
  38. Visual Diff of application files by CarpetShark · · Score: 2, Insightful
    There are good reasons, though, why it's best to keep data files in straight XML text format. It eliminates the need to worry about machine architecture.
    One of my favourite features of XML vs. binary application files is that, if you run diff on two XML files -- say, SVG drawings --, you can actually read how that drawing has been changed, just like with code or any other text file. It gets difficult to visualise on larger changes of course, but that's not unlike code either. I think, if people start noticing that aspect of XML file formats, and using it more, then there might come a time when we develop visual diff for XML etc.