Slashdot Mirror


Public Request For Microsoft To Release Deprecated File Formats

SgtChaireBourne writes "NLnet, a Dutch foundation for an open information society, has publicly called for Microsoft to release its deprecated formats into the public domain. The maker of Office has made large efforts during the last year to move against the OpenDocument Format (ISO/IEC 26300). These efforts have been producing a lot of commentary regarding the amount of data bound up in the Redmond-based company's proprietary specifications. It's a nasty situation to end up with files that cannot be read because the sole vendor with the documentation for the files has withdrawn permission. ODF is the way forward, or a step forward at the least, with new documents. But for the old documents in the legacy formats, they cannot be read without supporting software and that support requires full access to the specifications."

45 of 154 comments (clear)

  1. Inaccurate summary by elrous0 · · Score: 5, Informative
    This call is not just to release OLDER file formats. That's the pretense, but if you read it carefully, you'll see sentences like this in the press release:

    releasing the full blueprints of the many different versions of Microsoft's old Office formats (better known as doc, xls and ppt)

    Last time I checked "many different versions" of doc, xls, and ppt are NOT old, obsolete file formats. They're essentially asking MS to not only open up their old file formats (such as Word 97 and older doc files), they're also asking them to hand over the full specifications on all their EXISTING modern formats--a move that would allow comptetitors to develop Office clones at will.

    This is a thinly disquised shot at MS and closed source formats, not some noble attempt to help out archives. If it wasn't, they would have limited this to older files only and also called on other companies that make other older, proprietary formats (like Corel, Adobe, etc.) to release all their specs too.

    --
    SJW: Someone who has run out of real oppression, and has to fake it.
    1. Re:Inaccurate summary by tonsofpcs · · Score: 4, Interesting

      I think a big part of the issue is that MS has routinely removed full support for their own older file formats from newer apps. I have some Word 2.0 .doc files that the newest program that I can use to read them with all of the advanced settings intact is Word 6.

    2. Re:Inaccurate summary by faloi · · Score: 5, Interesting

      If you really feel that Microsoft stands to lose so much from releasing specs so that people can potentially even work on their most recent formats, then surely you can appreciate that Microsoft has some responsibility to its customers to make sure they can access their data. Most companies would likely be completely happy with a reader or proprietary file converter that would let them open up older documents. You know...like we could essentially always do when a new version of Office rolled around.

      All it would take is for Microsoft to release a fully compatible viewer/converter so that everybody can open the oldest of documents, and companies would likely cease to care.

      --
      "It is a miracle that curiosity survives formal education." -Albert Einstein
    3. Re:Inaccurate summary by nine-times · · Score: 4, Insightful

      Seems to me that with Microsoft trying to push everyone to OOXML, the old "doc" and "xls" formats are the obsolete and depreciated formats, even if MS won't officially say so.

    4. Re:Inaccurate summary by Bert64 · · Score: 4, Insightful

      Those formats are deprecated, the currently selling ms products use the OOXML based formats.

      On the other hand, why should it be explicitly limited to old formats? All data should be in open formats for a huge number of reasons, archiving is just one of them.
      And formats should be opened up while they are new, once they become old the specs often get lost (try opening a really old word document in the current version), often there never were any formal specs beyond "whatever the program outputs".

      Finally as to other formats, yes they should request the release of other proprietary formats, but they are going after the biggest target first as it affects more people... As noble as it would be to get the format specs for Wordworth on the Amiga (a long forgotten app, and its original vendor wont sell me a new copy, give it to me for free, or release the source or any specs, their official line is that my documents are lost), this would only benefit a very small number of people. Also, microsoft disclosing their old formats would set a powerful precedent for others in the industry to follow.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
    5. Re:Inaccurate summary by Gadget_Guy · · Score: 5, Informative

      All it would take is for Microsoft to release a fully compatible viewer/converter so that everybody can open the oldest of documents, and companies would likely cease to care.

      But they have done this for years, and yet everybody still complains.

    6. Re:Inaccurate summary by ketilwaa · · Score: 4, Insightful

      Can you please direct me to Microsoft's Linux versions of those viewers, so I can try them out? Thanks!

    7. Re:Inaccurate summary by CastrTroy · · Score: 3, Interesting

      I think that a major problem with many of the old Microsoft formats is that there is no format. They are basically a big memory dump from MS Word, and there's not really a spec of how to actually interpret the information. If the format followed some logical specification, then the OO.o team would have already figured out how to interpret MS Word files.

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    8. Re:Inaccurate summary by elrous0 · · Score: 4, Insightful

      MS is a for-profit company. It is not their job to serve you, answer to you, provide you with public service, or unzip their flies and hand you all their trade secrets so you can develop a competing product. If you don't like their stuff, don't use it. There are plenty of great alternatives like OpenOffice, Linux, Apple's OS, etc. available. Feel free to tell MS to go to Hell. Feel free to never buy one of their products again. Feel free to encourage your friends to do the same.

      --
      SJW: Someone who has run out of real oppression, and has to fake it.
    9. Re:Inaccurate summary by faloi · · Score: 2, Insightful

      The oldest supported format is Word 6.0. Some businesses may require older versions. That's the crux of the argument, even their supported tools fail to open some of the much older formats. If Microsoft truly had all the old formats supported, as I stated earlier, they'd be able to really say there's no need.

      --
      "It is a miracle that curiosity survives formal education." -Albert Einstein
    10. Re:Inaccurate summary by Gadget_Guy · · Score: 2, Funny

      Can you please direct me to Microsoft's Linux versions of those viewers, so I can try them out? Thanks!

      Here you go.

    11. Re:Inaccurate summary by jamar0303 · · Score: 3, Informative

      It only goes to 97. There ARE versions of Word and Excel before that. I remember using Word 4.0 for Mac in elementary school to write stuff and still have the disks full of stuff I wrote back then. If not for my old SE/30 with Word 4.0 I wouldn't be able to open those documents anymore.

      --
      OSx86 FTW
    12. Re:Inaccurate summary by elrous0 · · Score: 2, Insightful

      Who owns your data, you who have created it with the assistance of a tool or the company you bought the software from?

      If someone really cared about proprietary formats so much, they always had the option of saving their documents (or at least backups) in neutral formats like rtf. AFAIK, virtually every version of Office has supported these kinds of open formats. People don't use them because the VAST majority of users don't give a rat's ass about the propriety vs. open source issue. Is that MS's fault? Is it their responsibility to promote the open-source/open-document movement? Is it their obligation to roll over and give you all their trade secrets so you can develop an open-source competing product and drive them out of business?

      --
      SJW: Someone who has run out of real oppression, and has to fake it.
    13. Re:Inaccurate summary by Ed+Avis · · Score: 2, Interesting

      Have you tried the viewers under Wine? If you pay attention to EULAs, then you are not allowed to use the accompanying fonts on non-Microsoft systems, but you are allowed to use the viewer.

      --
      -- Ed Avis ed@membled.com
    14. Re:Inaccurate summary by nschubach · · Score: 4, Funny

      MS is a for-profit company. It is not their job to serve you, provide you with public service...
      I thought that was the point of business... I guess I have to re-evaluate my view on what running a business is. I mean, if the people don't matter, why would you need the people your providing a service to? Just kill a man and steal his money. Hell, we can start by exterminating everyone on the planet. Then you'd have the most successful business in the world. Total monopoly, no competition, no taxes, no salaries to pay and total domination!
      --
      Every time I start to have faith in humanity, I ruin it by driving to work between 7 and 8 am.
    15. Re:Inaccurate summary by peragrin · · Score: 2, Insightful

      I would but MSFT is an illegal monopoly in the USA, South Korea, and Europe. Yet they still have total control over file formats.

      MSFT is a for profit company which means they should sell products and services people are asking for. People are asking for converters for other OS's. MSFT doesn't even provide converters for OSX an OS which it does support. let alone for other OS's.

      If your business is totally dependant on trade secret file formats then you had better be very careful. As the one day that some one builds something better your screwed.

      --
      i thought once I was found, but it was only a dream.
    16. Re:Inaccurate summary by RyuuzakiTetsuya · · Score: 2, Interesting

      If we're talking consumer products, great! Yes! I have the right not to use them. However, if the document I had been handed wasn't written by my opensource alternative I chose, I'm kind of screwed. Particularly with Microsoft also pretty much in bed with Governments, both the Federal, state and all levels of local municipality, selling them umpteen number of licenses for Office and particularly Word, they do have something of a responsibility to make sure that documents that are created by our own government are readable by the general population. Now, I'm not talking about military secrets here, but if I do a FOIA for some data and I get a word document back, that's really not useful now is it? (Yes, I'll probably get a PDF or actual paper, but it's possible if that's what the document was stored as.)

      --
      Non impediti ratione cogitationus.
    17. Re:Inaccurate summary by QuietLagoon · · Score: 3, Informative
      MS is a for-profit company. It is not their job to serve you, answer to you, provide you with public service, or unzip their flies and hand you all their trade secrets

      Microsoft is a company that has been found guilty of the illegal leveraging of its monopoly. As such, a different set of laws apply to the sharing of Microsoft's intellectual property. We have already seen that Microsoft can be forced to share its protocols with competitors.

    18. Re:Inaccurate summary by howlingmadhowie · · Score: 4, Insightful

      nope. the whole point of a monopoly and proprietary formats is that you're not screwed. before another company stands a chance, it doesn't just have to build something better, it has to build something better, install it on all computers worldwide and convert all existing documents to the new format. otherwise nobody can move to a different piece of software.

      and seeing as the monopoly office suite is made by the same people who make the operating system, it would be trivial for them to not allow a competitor's products to run.

    19. Re:Inaccurate summary by Anonymous Coward · · Score: 2, Insightful

      Why do you feel that Microsoft should start programming for different OSes? If you don't like the fact that your documents can't be opened in Linux, stop using Linux. Or stop creating documents in Word.

    20. Re:Inaccurate summary by Frank+Battaglia · · Score: 2, Insightful

      Of course, US competition/monopoly law is very similar to EU law in the relevant areas; it's just apparent not being applied at present after MS was let off with a slap on the wrist last time. Not really. EU competition law is designed to defend consumers against big corporations. US anti-trust law is designed to preserve a competitive market. That somewhat nuanced difference in goal leads to somewhat different implementations.
    21. Re:Inaccurate summary by dpilot · · Score: 4, Insightful

      Once upon a time, someone told us that the point of a business was to take your money, and have you feel happy about it. In other words, you give the business money, but feel that you have received fair value of goods and/or services in exchange.

      The point of a free market is that if the above conditions are not true, you should be able to do business with someone else, instead.

      Relate this to Microsoft as you will. But keep in mind a few things...
      - There are very few viable (The word "viable" can scope quite a few meanings, here.) competitors to Microsoft in many situations.
      - Many times their real customer is not you, but someone else - a supplier of one sort or another. Your involvement may be many-times indirect.
      - Microsoft has been found guilty of illegal monopoly practices in a court of law.

      --
      The living have better things to do than to continue hating the dead.
  2. Yeah but by Arthur+B. · · Score: 3, Interesting

    If you do that, people will never learn and continue to use closed formats. It's too easy to fall for a closed format for your crucial documents and then go whining when the company stops supporting them. Let people pay the price of their mistake, then, then open document formats will pick up steam.

    --
    \u262D = \u5350
    1. Re:Yeah but by TofuMatt · · Score: 3, Insightful

      Yes, people should definitely pay for their mistake of buying Office in '97 when they could have got OpenOffi-- oh wait, that's right: it wasn't even released in '97. People don't deserve to "pay for their mistakes" because they bought the best software at the time for office/productivity work. They deserve to scope out the alternatives now (iWork and OpenOffice, though OpenOffice still sucks, largely, except for Writer), but they didn't do anything wrong by buying good yet closed software.

      --
      -Matthew Riley "TofuMatt" MacPherson
      I have a website
    2. Re:Yeah but by Jeppe+Salvesen · · Score: 3, Insightful

      Oh come on!

      Once the company has stopped earning money on a format, they should open it up under an appropriate license. (Patents might play a part, in an ideal world they would not but let's play in this world for now). Microsoft does not make any money on Excel97. Why on earth be so mean to their previously paying customers that they refuse to open that obsolete standard?

      --

      Stop the brainwash

  3. Microsoft cant do that by a_n_d_e_r_s · · Score: 5, Insightful

    The reason - they don't have any documents describing the formats.

    Code are descriptions of formats.

    When Microsoft was forced to disclose information about the SMB format to EU anti-trust department they tried to give them the source code - complaining that it cost them too much to describe the format.

    So they are sadly asking for something that dont exists.

    --
    Just saying it like it are.
    1. Re:Microsoft cant do that by adpsimpson · · Score: 5, Interesting

      Considering the code for rendering the older .doc formats is now officially considered 'unsafe' by Microsoft, and has been disabled in Office 2007, perhaps releasing the code itself (or choice chunks of it) would be just as useful?

      Surely if you have a chunk of code for a no longer supported format, which you consider too buggy and unsafe, which is 10 years old and which you've disabled in your latest products, you wouldn't mind letting other people clean it up for free, since it can't be of any commercial value?

      Right?

      --ducks the '-1 flamebait' mod---

      --
      Is crushing a suspect's child's testicles illegal?
      John Yoo: "No, [if] the President thinks he needs to do that."
    2. Re:Microsoft cant do that by dominator · · Score: 3, Informative

      The reason - they don't have any documents describing the formats.


      Except, they do. They've released specs for at least Word97, RTF, and PowerPoint's file formats, the OLE container format, and the Excel chart format. The docs were hosted on MSDN for a few years, even. I'm not saying that these docs are perfect or anything (they're far from it), but they're a decent start. I say this as someone who has used the docs to implement popular F/OSS tools that read and write these formats.

      http://www.wotsit.org/list.asp?fc=10
      http://www.wotsit.org/list.asp?fc=6
    3. Re:Microsoft cant do that by Ilan+Volow · · Score: 2, Funny

      If you're the world's largest and most profitable software company, there's probably some commercial value in not looking like a bunch of idiots who can't plan file formats.

      --
      Ergonomica Auctorita Illico!
    4. Re:Microsoft cant do that by psbrogna · · Score: 2, Funny

      That ship, my friend, has sailed.

  4. Not neccessary by Anonymous Coward · · Score: 2, Insightful

    Technically, it would be sufficient for the sake of old documents to provide a free tool that is able to read those documents, or a tool that would convert them to an open document format. This tool wouldn't need to have its source published.

  5. Opening up OOXML by adpsimpson · · Score: 2, Informative

    The worst proprietary 'hooks' such as 'footnoteLayoutLikeWW8', 'lineWrapLikeWord6' and 'useWord97LineBreakRules', appear now to have been documented - see this link. This in effect means that some of the quirkier behaviour of old versions of MS Office may now have been made public (difficult to say for sure as the ECMA resolution is behind a passworded site).

    Microsoft would make their, and everyone else's, lives a lot easier if they went the whole way and documented the entire depreciated office formats, allowing others to write filters to correctly interpret them. This would also give them a foothold in claiming that the tags above truly do point to an open format, since the behaviours they refer to would be openly documented.

    But let's not hold our breath.

    --
    Is crushing a suspect's child's testicles illegal?
    John Yoo: "No, [if] the President thinks he needs to do that."
  6. No such thing by martinag · · Score: 3, Insightful

    Isn't it quite obvious that there are no specs? The OOXML specs are probably the best they can do when they have to reverse engineer the code into documentation. Don't expect any better than that and furthermore, don't expect them to even try (which they at least have when it comes to the OOXML documentation).

  7. They might not have it... by Pedrito · · Score: 5, Informative

    Microsoft may not have the formats formally specified anywhere...Many, many years ago, shortly before my book was published, Microsoft actually wanted to hire me to write the official documentation for the Segmented Hyper-Graphic (SHG) file format because their own in-house documentation for the format was for an even older, unsupported version.

    I mean, think about it, if you write code to store a document, do you sit down and write the byte-layout of that file? I suppose you could, but it's generally not necessary for the coders. My guess is that MS doesn't even have this stuff lying around. They'd probably have to have someone actually piece it together from the code.

    1. Re:They might not have it... by Lonewolf666 · · Score: 4, Insightful

      I mean, think about it, if you write code to store a document, do you sit down and write the byte-layout of that file? I suppose you could, but it's generally not necessary for the coders. My guess is that MS doesn't even have this stuff lying around. They'd probably have to have someone actually piece it together from the code.

      At the company I work for, we usually do sit down and document the byte-layout of that file. When this was neglected, it has invariably come round and bit us in the ass ;-)
      --
      C - the footgun of programming languages
    2. Re:They might not have it... by roystgnr · · Score: 2, Funny

      I mean, think about it, if you write code to store a document, do you sit down and write the byte-layout of that file?

      Yes, of course. Is there any other industry where this attitude would be accepted?

      "Blueprints? No, we just hammer some wood together until we think it won't fall down, or until we run out of nails."

  8. What specifications? by FeatherBoa · · Score: 4, Insightful

    I think that something people don't get is that there are not and never were comprehensive specifications for these formats. The specification is likely the code and nothing more. The document formats weren't conceived as a du jure standard, they are things that grew over time and evolved. Somewhere at the core you're going to find things like a C structs - from some old and forgotten compiler - being copied verbatim to disk.

    Asking Microsoft for the spec will not mean simply taking an existing doc off the shelf and handing it over. It will mean either handing over the code for the old products that read and write those formats or spending person-years of effort combing through that code, constructing a specification, and then, somehow, testing the spec.

    I wouldn't hold my breath for either.

    1. Re:What specifications? by Jeng · · Score: 2, Funny

      So basically Microsoft could probably greatly improve their file formats if they sat down and attempted to write out documentation for them?

      --
      Don't know something? Look it up. Still don't know? Then ask.
  9. Cost and Mechanics of Certain Free Tools by NetSettler · · Score: 2, Insightful

    it would be sufficient for the sake of old documents to provide a free tool that is able to read those documents, or a tool that would convert them to an open document format. This tool wouldn't need to have its source published.

    As noted in another post about this article, it may be that there is no "format" other than "the code". If so, then the only free tool that is cheap to make is a wrapper around a complete application that just calls only part of that application. If so, making the wrapped tool free means giving away the entire program, not just the file part. In effect, then, this amounts to requesting that old versions be made free. Any difference between your proposal and asking old versions of their editor to just be made "free" (for whatever free you might be meaning) is just words, I suspect--nothing semantic.

    Of course, this comes back to the question of whether there should be software patents at all, and whether software copyrights should have the immensely long durations that they do. Indeed, at some point, probably much shorter than happens now, having old tools be free so they can be recycled for other purposes may not be bad. It might even give vendors a kick in the pants to move faster to make newer tools be different enough that the old tools didn't threaten them. But bypassing a proper change in software copyright and patent law and instead just beating up on certain people who have things one wants does not seem the best approach to me.

    --

    Kent M Pitman
    Philosopher, Technologist, Writer

  10. Its rare by Shados · · Score: 2, Interesting

    It is rare that I agree with Slashdot articles on such things, but on this, even the most pro-Microsoft zealot cannot really disagree... everybody wins if these specs are released... They're no more supported, don't compete with Microsoft's newer formats, and would -heavily- show all the entities investigating Microsoft's monopoly that they can "do the right thing".

    It would also be a superb PR move (even though they don't deserve the publicity for something they should have done on their own long ago): it would reassure clueless CEOs. "See?? We can use closed source software, because once Microsoft doesn't support it, they'll just open it up!!!". It is far from true, but enough would think that way to make it worth it.

    So come on MS, do it.

  11. Conversion shouldn't be that hard by alexhmit01 · · Score: 2, Interesting

    It's not NEARLY as clean as Apple's Applescript solution, but since you can script OLE Components, you should be able to set up a computer to migrate the documents. If they are on a file server, you should be able to set up a machine with whatever is the last version of Office that can read the old files, and have it walk through your document tree, looking for each appropriate document. Then it should be able to load it in Office and save it in a newer format.

    That would get all your documents in the latest (Office 2003 or something), then you adapt the script to run on a machine with Office 2007 and do the same thing. Presto-chango, your documents are up to date and safe.

    Regarding formatting... if you're talking about documents not updated in 5-10 years, you probably don't care that much. You might want the content (I need to go through old hard drives and rescue any high school and college papers I care about, that are now hitting the 10 year old point), but if you haven't used it in years, and then want to use it, you can take the time to reformat. You're preserving because 1% of those documents might be needed in the future, which makes it worthwhile to bring them forward with an automated solution.

  12. Re:release a convertor and support legacy! by NorbrookC · · Score: 4, Insightful

    we want to move forward, to adopt a standard -give some time to deprecated formats by supporting them till some time (a deadline), and provide conversion tools for free.

    Yes, we'd like to have a standard, and one which is readable for a long period of time - which is the point of the whole ODF standard in the first place. The problem with the proprietary formats is that they have every reason to change and a considerable number of reasons to drop support for "deprecated" formats.

    I used to work for a medical transcription unit, and we generated over 250K documents annually. It is a non-trivial exercise to convert those documents from one format to another. That doesn't include the loss of formatting which occurs, and there are instances where the formatting is important. This loss occurs even when moving between versions of the same software - just take a Word 97 document and translate it to 2K and then to 2003, and you'll see it.

    Your idea is feasible if it's a one-time function. That is, there is a standard format which will be used for a considerable length of time, and you need to translate your older documents into that standard. If you're going to have to do it ever two or three years, it's going to be a non-starter.

  13. A Little Overblown by blueZhift · · Score: 2, Insightful

    I'm beginning to think that a lot of the worry over old file formats becoming inaccessible in the future is overblown. With the continuing advances of emulation and virtualization technology, it seems highly unlikely that we'll lose all access to documents in old file formats. Emulation of the proper platform and installation of the appropriate software are all that's needed. The real trouble rests with obsolete physical storage media. I still have 5.25" floppies that I haven't been able to read for many years now, but that's hardly Microsoft's fault! And if there's a market for it, someone will be happy to copy all of your old media onto something more modern.

  14. Re:A Little Overblown-NOT NECESSAIRLY by Nom+du+Keyboard · · Score: 3, Informative

    I'm beginning to think that a lot of the worry over old file formats becoming inaccessible in the future is overblown. With the continuing advances of emulation and virtualization technology, it seems highly unlikely that we'll lose all access to documents in old file formats.

    Cannot agree with you here. Obviously you feel you can continue running Windows 98SE with Office 97 in a virtual partition essentially forever - and in that case, you probably can.

    However, the moment you get to Windows XP and recent versions of Office, you hit the dreaded Product Activation bugaboo. Now you're dependent on MS, Adobe, or whomever to continue supporting activation servers as you migrate old software and operating systems to newer virtual platforms. Also EULA's that prevent using software in virtual environments exist. You may well find that running Office 2003 on Windows XP can't be done, legally at least, on the machine that follows your next one. Then where are you?

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  15. Based on previous success by Jim+Hall · · Score: 3, Funny

    Based on my previous successes in getting Microsoft to release the source code to the deprecated MS-DOS 4.x (i.e. before the MS-DOS 5.0 complete re-write) under a free / open-source license, I'm confident that Microsoft will be happy to release deprecated file formats under a similar license.

    Oh, wait ...