Slashdot Mirror


Help Write An Open Data Format Bill

AdamBa writes "There has been a lot of discussion of open source bills, but I think open data format bills have a much greater chance of actually becoming law. Over at the Open Data Format Initiative site, I have written an article explaining "Why Open Data Format Laws Are Better Than Open Source Laws". I also have a sample Open Data Format bill; I invite comments from slashdot readers, in particular on how the sample bill could be improved."

36 of 208 comments (clear)

  1. Opening up office formats... by gnuadam · · Score: 3, Insightful

    ...might or might not now help linux ... a lot of people I've talked to just don't like openoffice (and I've noticed that big spreadsheets are intolerably slow). There's a bit more work to be done there, besides the standard ms office problems. (That said, it wouldn't hurt a bit). But it would definately help a company like apple.

    Imagine flawless powerpoint support in keynote.

    --
    You say :wq, I say ZZ. Why can't we all just get along?
    1. Re:Opening up office formats... by 73939133 · · Score: 2, Insightful

      I don't like MS Office either, and I resent having to pay for it to access information (often public information), when, in my view, it has no other merit and I don't use it for anything else.

      So, we may each like or not like MS Office or OpenOffice, but the principle is that simple, textual public information should not require the purchase of a package that costs several hundred dollars in licensing fees.

      And if the formats are forced to be open, maybe we will get a lot more applications that can read and write MS Office file formats, applications that we actually like.

    2. Re:Opening up office formats... by sql*kitten · · Score: 2, Insightful

      I don't like MS Office either, and I resent having to pay for it to access information (often public information), when, in my view, it has no other merit and I don't use it for anything else.

      I find it interesting that you blame Microsoft for creating a product which they sell to whoever wants it, and not the people who chose to use the product inappropriately. There are plenty of ways to save a complex document in a portable format (RTF, PDF and HTML are a few examples). MS Word supports all of those (I think you need a plugin to save PDFs but MS Word supports plugins so what's the problem?). The creator of the information chose not to use the available tools, yet you blame the creator of the tools, not the user of the tools.

      This leads me to suspect that you simply want to bash Microsoft, and don't really care about or understand the underlying issue at all.

    3. Re:Opening up office formats... by pmz · · Score: 2, Insightful

      This leads me to suspect that you simply want to bash Microsoft, and don't really care about or understand the underlying issue at all.

      Microsoft creates barriers for users, such as proprietary default file formats, because it is in their interest to do so.

      OpenOffice.org uses open file formats by default, because it is in their interest to do so.

      It is appropriate to blame Microsoft, because their position of conflict of interest cannot be ignored.

  2. Yeah, your bill will be open by Anonymous Coward · · Score: 1, Insightful

    When it hits the legislature where it will quickly be rewritten to allow MS to do whatever it wants.

  3. He's right by Otterley · · Score: 4, Insightful

    The author has the right idea; it would certainly make building compatible third-party implementations of data-processing software (by that I mean nearly all software) much easier than if the file formats were closed, as they are now.

    The trouble is that the distinction between code and data isn't as bright a line as you might think. I don't mean code/data segmenting; if you think about what evals and ELF are, you know what I mean.

    Algorithms (not to mention software in the US) are certainly patentable -- and efficient data storage mechanisms are too. Think about what a gold mine in terms of IP value a hash table would have been if it invented at a commercial organization instead of in academia.

    So, the conundrum we still face is that there are still valid IP claims WRT data structures, because oftentimes, as much thought is put into them as into the processing software that reads and writes them.

    1. Re:He's right by gnuadam · · Score: 5, Insightful

      Is it really good public policy to permanently store records that are meant to be for the public's benefit in a data format that cannot be guaranteed to be readable in the future?

      This is the argument.

      Closed standards are of course allowed by law. Companies and individuals should be allowed to use them. I argue that it is not good public policy, and governments should demand that their software vendors provide them with software that stores its data in a publicly reproducible way.

      --
      You say :wq, I say ZZ. Why can't we all just get along?
    2. Re:He's right by stubear · · Score: 3, Insightful

      "I argue that it is not good public policy, and governments should demand that their software vendors provide them with software that stores its data in a publicly reproducible way."

      They do. It's called text, RTF and HTML. MS Office saves out to all these formats. You can even install Acrobat and save Office documents as PDFs.

  4. Hmmm by alptraum · · Score: 5, Insightful

    On the second bullet you say "computer data owned by the [government] be permanently available to the [government] throughout its useful life."

    Somewhere you may want to define "useful life", I could see this as a possible loophole, this term could mean different time lengths to different people.

  5. Patent It by Anonymous Coward · · Score: 2, Insightful

    Stuff this open data format crap.
    I think data formats are patentable... because some of them are inventive and require real ingenuity. eg: video streaming, etc.

    If you don't award patents, you don't get the great inventions. (cf: Edison)

    from,

    your friendly patent attorney

  6. What goverment? by axxackall · · Score: 3, Insightful

    Goverment? What goverment? We are talking about open data formats for Internet, not for national network of one country, right? If so then the law should be international. Otherwise - it's wasting of time and efforts.

    --

    Less is more !
    1. Re:What goverment? by FCKGW · · Score: 3, Insightful

      If one government does this, then another, then another, eventually all governments would be forced to use open formats. Not to mention that governments can tell people to give them documents in open formats or else. It could spread similarly to how MS .doc format became the de-facto standard. Of course, MSFT and possibly other closed-source vendors would lobby against this and pay off representatives to keep this from happening, so who knows if any of this will work.

      I don't care nearly as much about what software governments use as I do about their interoperability with other systems. A law based on this bill would help erase vendor lock-in and force MSFT to compete on a level playing field, as well as allow me access to public info without having to buy the same $400 program used to create it.

      --
      It's an operating system, not a religion.
  7. Re:Why doesn't the government just ... by jericho4.0 · · Score: 3, Insightful
    The problem with doing this is the problem this bill is trying to solve, namely that goverments don't have the rights to the data formats they use, leaving them dependent on the likes of MS, Orcale, etc.

    --
    "A language that doesn't affect the way you think about programming, is not worth knowing" - Alan Perlis
  8. Re:The color scheme made my eyes hurt by dnoyeb · · Score: 5, Insightful

    I don't think people want open source "laws." We want freedom of choice.

    We do not want to be locked into a particular software package so they can exthort money out of us. Charging for what amounts to an upgrade is just wrong. Charging for and upgrade that makes the software do what you said it would do in the first place is wronger. Charging for security fixes is wrong. I don't like to see my tax dollars wasted. If you are a private company, that I have no stock in, do what ever you want...

    I have never seen Ford charge their customer for a recall...Only in software.

  9. More Legislation? by Acidic_Diarrhea · · Score: 1, Insightful
    I'm always critical of legislating common sense. It's common sense that you should wear a seat belt and so I don't believe there should be a law which dictates that you must wear one. As we add more and more laws to the books, each single law loses values because there are so many other laws.

    With that being said, common sense should dictate to anyone in government who is in charge of purchasing that an open data format is much more inviting than a non-open format. But I can see that just because this should be common sense does not neccescarily mean that it is.

    Here is my only concern: I don't like to have the government's hands tied, one way or another. Sometimes the market does not produce a product to satisfy these conditions or the market will produce an open data format product and a closed data format product where the closed data format product is superior in all ways but the obvious one. Now, I want the government to be able to evaluate their needs (considering how likely it is that they will need backups of the data - for instance if the data is only going to be temporary work product for an internal office and will eventually be converted to HTML) and make an informed decision. But I suppose in many cases it is better to lay down a rule which will work positively in 99% of all situations.

    --
    I hate liberals. If you are a liberal, do not reply.
  10. commitee this and organization that by tomstdenis · · Score: 1, Insightful

    What ever happend to independent and competing ideas that will eventually succumb to the better format?

    Remember LZH, ARC, PAK, ARJ, JAR, ZIP, RAR, ACE, ZOO, ...

    All competing algorithms and eventually ZIP won [or more so deflate] for the most part and BZIP2 to a certain degree...

    If you stiffle independent thought before you get out the door why bother?

    Tom

    --
    Someday, I'll have a real sig.
    1. Re:commitee this and organization that by tomstdenis · · Score: 2, Insightful

      Not really. I used ClarisWorks when I was a kid, then I moved to StarOffice and then OpenOffice.

      I never by choice used MS Office tools :-)

      Tom

      --
      Someday, I'll have a real sig.
  11. you could at least require an open standard by Trepidity · · Score: 3, Insightful

    I would prefer a complete free and open standard with no patent encumberment, but a second-best option would be to have an open and fully documented standard that may or may not be subject to patents, along the lines of the stuff put out by the MPEG consortium (MP3, AAC, etc. are all patented, but are also completely open formats). This way at the very least you have an guarantee that your data won't be permanently inaccessible, because when the patent expires you can reimplement the open specification.

  12. no facts back him up by SuperBanana · · Score: 4, Insightful
    The author has the right idea; it would certainly make building compatible third-party implementations of data-processing software (by that I mean nearly all software) much easier than if the file formats were closed, as they are now.

    Except the author thinks that software companies are just going to lovingly hand over specifications on their file formats. Throughout computing history, software companies have used file formats to lock people into their product, force them to buy upgrades, and use them as leverage for strategic partnerships. Ie- it's a huge cash cow to have proprietary formats.

    The author, in the very first paragraph, dismisses "open source laws", saying they "won't work". Huh? Says who? Then he says his WILL work. You can't just make enormous blanket statements like that without backing them up!

    In his second point, he says "Many open source laws seem designed to force a government to replace Windows/IIS/Office with Linux/Apache/*Office". Where is he getting this crap from? He goes on to dismiss the security benefits of open-source software, the cost savings...he pretty much dismisses every single argument for considering open-source software, with nothing to back up his reasoning. He 'thinks', therefore it is.

    My impression of "open source laws" was that they instruct government agencies to -CONSIDER- open source solutions- it does not FORCE them to use them- yet he makes it out like there's a cop sitting there with a gun to a government IT manager's head, saying "Go ahead punk. Make my day. Install Office. Are you feelin' lucky, punk?" Maybe I misread it, but his idea seems to be to -FORCE- companies to release file formats if their software is used by the government.

    One requires you consider open-source software, leaving commercial software companies plenty of oppertunity to compete if they've got a better solution(remember, open-source is not $0, you still have labor involved, possibly a migration, maybe staff training and hiring, maybe different equipment.) Open source doesn't automatically, if ever, "win" outright simply because it's open-source, yet the author seems to think open-source always will, and hence open-source laws will be bad because there will be a huge inconvenience to the government or the software companies. Again, consider- not force! If the manager thinks commercial software will overall be better, he/she will make that call.

    The other forces software companies to do something that threatens, in a BIG way, how they compete against other companies.

    Now, which has more of a chance of failing?

    1. Re:no facts back him up by rjch · · Score: 5, Insightful
      Except the author thinks that software companies are just going to lovingly hand over specifications on their file formats.
      No, that's not what he said. What he said is that if open data laws were passed, companies would be more likely to open up their data formats in order to retain lucrative government contracts.
      The author, in the very first paragraph, dismisses "open source laws", saying they "won't work". Huh? Says who? Then he says his WILL work. You can't just make enormous blanket statements like that without backing them up!
      Gee... maybe we weren't reading the same article. The great bulk of the article was discussing why software companies would be more likely to open their data formats than their source code.
      Likewise, he is not talking about forcing companies to open their data formats, he's talking about making it more commercially viable. You seem to be picking quotes out from their context and twisting them to your preferred view.
  13. right tools for the right job by MrLint · · Score: 2, Insightful

    Admittedly i read the 'bill' a bit quickly, but there is one problem (which seems to be partially addressed later on) is that all data be in human readable format. Well this is good for string data. However I very much suspect there are going to be number formats that are much better stored in their binary format. Now as I recall from college COBOL's strategy as to have everything stored as string data with (interesting) defining headers. As I also recall this was to help portable between COBOL and other foreign (computer wise) data retrieval systems, like when data was passed around on tapes.

    This is all well and good, but you trade off portability for parsing overhead. Of course CPU time is way cheap as to compared to yesteryear. However you will end up with people's nutjob parsing routines, I mean hell CPUs and calculator software still get shipped making math mistakes. Reading in data in any of the standard int and float storage formats is cheaper and you don't have to worry about joe-bob desk jockey hosing up an import routing and your tax bill being 100x too large.

    Now as most of us who have been thru a proper programming class know there are routines for these kinda things.. but again beware of joe-bob who wants to do it 'better' or whatever. Perhaps along with open data one needs to define a data import stadard?

  14. Open data formats are good, but not sufficient by Anonymous Coward · · Score: 4, Insightful

    Open data formats are a good start, but don't fool yourself into believing they are sufficient. HTML is perfectly open, but how many web sites are there that test against anything but IE? How many rely on the behavior (even the bugs) of IE?

    The non-openness of Microsoft Word's document formats is not the biggest obstacle. There are plenty of office suites now that can read them. The biggest obstacle is that people write their documents expecting them to look exactly as they do in Word. That's not something that open data formats by themselves will solve. Microsoft could move entirely to XML, and that problem would still remain.

  15. Re:Damned by genuine praise by ZhuLien · · Score: 2, Insightful

    1 - say a wordprocessor developer decides to get rid of the concept entirely of files - does that mean they have to disclose how they store the information? ie: perhaps using their own disc format and writing sectors/tracks directly?

    2 - where does the difference in data and program exist? ok, most of the time it is plain and clear, but what if the text of a wordprocessor wasn't stored as text, but rather a program that was interpretted to give the text layout? Also, is a BASIC program tokenised not data for the BASIC interpreter?

    3 - Why should a programmer document how all their data is? They may not even know, it may be purely left to how the language that they used writes their data structures. Some of the data may be specifically not designed for 'users' to exchange? ie: indexes of a database.

    4 - forcing people to disclose how their data is - is just not right. A programmer should have the right not to disclose how their program works.

    Julian

  16. Re:html by sisukapalli1 · · Score: 3, Insightful
    XML works for everyone

    This is like saying ASCII works for everyone. It is more than just the format on which the "real format" is based on. It would be ideal to have a full specification or detailed, non obfuscated documentation.

    S

  17. Re:Damned by genuine praise by MrLint · · Score: 5, Insightful

    1) well then said app wont be running on any current OS I can think of. Reimplementing low level data storage routines is plain stupid, especially for a high level app like a word processor. (i can hear the grumbling 'but that was just an example you are too literal.. etc)

    2) This argument is older than dirt and way too contrived. If a user is inputting records into a database its not code. Besides interpreted basic (as its a structured high level programming language the 'data' is already defined. Besides if the govt. is writing apps in interpreted basic you better ph33r f0r y0ur l1f3!

    3) This is the whole point of the exercise, to write your data on purpose such that you document its output. In addition indexes to a database *could* be made user friendly, but since they are ephemeral anyway and are rebuilt and updated as needed and not generally data that is input by a user for long term data storage its kidna falls outside of the bounds here.

    4) Well this is forcing the govt. to document their data structures that contain data that is not internal to the govt. And since ostensibly we all own the dat that the govt. has we can tell them (ideally) to do whatever we want.

  18. going at it the wrong way? by Dhraakellian · · Score: 2, Insightful

    Instead of making major corporations, such as Microsoft, open their formats, perhaps Word and similar programs should be required to have the ability to read and/or write major open formats, such as StarOffice/OpenOffice formats.

    Yes, I could see arguments along the lines of "but we can't possibly support every format," but in the case of behemoth corporations, it should be doable for at least the major formats. This could also be said to stifle competition, but in Microsoft's case, it has already done enough of this, and this measure could actually restore competition.

    --
    I've read Grocklaw. BoycottNovell, you're no Grocklaw
  19. Re:Damned by genuine praise by Anonymous Coward · · Score: 1, Insightful

    You're too literal -- Within a few years MS Office will be transformed into a server-based content management system. There won't necessarily be any "files", just database content that is rendered dynamically. This nicely eliminates both the DOC file issue and the piracy issue.

  20. source... data... ? by Anonymous Coward · · Score: 1, Insightful

    there is no well defined distinction between source code and data, nor is such a definition possible (think Lisp).

    if closed source is legal, but closed data formats are not, MS will just make their data documents "runnable" (think embedded VBscript macros, etc...) and say "that's not just your average data document; it's a *program* (value added for customer satisfaction!), and it's *compiled* (ie obfuscated) for performance; therefore, we don't have to make it open."

    MSword and its ilk will just become an interpreter for "running" your saved documents (think C# .Net runtime).

  21. Re:Common Sense by RodgerDodger · · Score: 2, Insightful
    The poster's argument wasn't about choice; it was about common sense.

    No, it was about choice. He was saying you should not be forced to do something that is common sense; that's a debate about choice.

    I'm going to ignore most of your post on the grounds that you either have deliberately missed my point, used your own strawman arguments, or are just thick (I'll let you decide which...). However, there's a key point I want to address:

    Yes, sometimes wearing the seatbelt causes you to get hurt more than you would have if you didn't. Those cases are far less likely than the reverse.

    So you're saying the government is allowed to (pre-)determine for me a choice that could affect my living or dying? Do you think they should be able to do this in every case? What if I am going to have surgery where I have a 10% chance that I could die or be severely maimed? Because it's "only a 10% chance" does that mean the government gets to decide for me?

    Actually, in general, yes I do believe that.

    Most modern societies believe that we have a duty of care to other members of our societies. This results in such things as publicly funded hospitals.

    Such facilities are almost always heavily utilised. In such situations, most societies look at ways of reducing the need for these services.

    The seatbelt case is an excellent example. When seatbelts first started being available in cars, some bright spark noticed that there were relatively fewer injuries amongst seatbelt wearers. Statistical studies were done and they showed that, on the whole, wearing a seatbelt meant you were less likely to be hurt. Local governments started passing seatbelt laws. Other studies were done showing a large correlation between seatbelt laws and reduced hospital demand. So everybody picked it up.

    If seatbelts weren't required, hospitals would have even greater demands on them. Hence the law. I feel that governments have the perfect right to curtail behaviour that puts higher demand on government services.

    So, that's how a private choice (the decision to wear or not wear a seatbelt) becomes something worth legislating.

    This is the dumbest argument. Those people who die or are severly hurt by seat belts should not have been legally bound to wear them. Perhaps they would've worn them anyway, perhaps not, but at least it would've been their choice and not the government's.

    First obvious point: It's always their choice. The car doesn't have a sensor in it to not let it start if the seatbelt isn't worn. They can drive without a seatbelt. It's just now, in addition to risking greater injury in the event of an accident, they also risk getting fined.

    Second obvious point: Yes, seatbelts sometimes result in greater harm than they do good. Laws are aimed at the benefit of society as a whole, not individuals. On the whole, society is helped by seatbelt laws.

    To take another example, some people think vaccinations should be a personal choice. Vaccinations occasionally have undesirable side effects. Despite that, high vaccination rates for measles, polio, chickenpox, whooping cough, dyptheria, and other nasty diseases have vastly reduced breakouts and associated fatalities. However, it's also been shown that when vaccination rates drop below a threshold (like, 80%), the breakouts start occuring again. Because of this, if a government wants to make vaccinations mandatory, then I think they have every right to.

    The desire to have governments not intrude on these apparently individual (and often common sense) choices is, at heart, an entirely selfish one. It ignores the fact that these laws are not designed to curtail your rights, such as they are; they are designed to help society as a whole.

    And to tie it back to the topic: a government body mandating that suppliers of software to it use a publicised data format is a good thing. It may be common sense ("Of course I want the ability to access my data in any way I want"), but that doesn't mean it's not something that should be legislated.
    --
    "Software is too expensive to build cheaply"
  22. The purpose of government by Trailer+Trash · · Score: 3, Insightful

    The first approach takes too simplistic a view of the type of software that governments use. Much of the it is customized for specific tasks such as processing drivers' licenses, and the market for providers of such software is presumably small. If software vendors release their software as open source, they may find that cash-strapped governments in other states gladly help themselves to it for free, so the vendor may get only one paying contract instead of fifty. Therefore, it's quite possible that governments won't be able to find companies willing to provide them with open source software, and then what alternative do they have?

    I just covered this recently in an op-ed piece in the business journal. Let me make this simple: governments do not exist to provide business opportunities to software developers. Reread that until it makes sense.

    We (collectively "the people") shouldn't have to pay twice for a piece of software. The open source world needs to come up with solutions to boring problems, like drivers' license registration. If a particular company doesn't want to be a part of that, then so be it.

    Michael

  23. Re:Common Sense by RodgerDodger · · Score: 2, Insightful

    The problem in your argument is that, despite what the ads say, seatbelts don't save lives.

    Yes, that's right. Your chances of death in a car accident (at least, death without getting to a hospital first) are not decreased significantly by wearing a seatbelt. Seatbelts prevent the "being tossed around the car, suffering massive bruising, broken bones, spinal damage and internal bleeding" type of injuries. Most quick deaths from car accidents occur due to broken necks from whiplash (seat belts don't help, may hinder), or foreign object intrusion.

    So how are your feelings of someone risking greater, and more costly, injury, but without raising their chances of cheaply dying very much?

    (Nice pragmatic attitude, BTW... reflects mine a lot)

    --
    "Software is too expensive to build cheaply"
  24. Where the GPL really shines... by El+Cubano · · Score: 4, Insightful

    If software vendors release their software as open source, they may find that cash-strapped governments in other states gladly help themselves to it for free, so the vendor may get only one paying contract instead of fifty.

    IIRC, the GPL specifies that you only need to make the source available to the entities to which you distribute binaries. I.e., if Michigan gets a contract with some company for some software, the company can release the software as GPL and only provide the source to the Michigan state government. It is then in the best interests of the state government to NOT redistribute the software. That way another state, say New York, also buys the software to make sure that the company is still there next year. The best thing is that if company X pisses off the state, they can take the source and hire company Y to maintain it. If I understand the intent of the GPL, this is the kind of choice that is available with free software.

    Compare that to a previous post (I don't recall the exact thread) where the poster pointed out that his organization used a particalur personnel management product. PeopleSoft bought the competitor and disconitnued the product. Security flaws were found, his company had to spend $2M to switch to PeopleSoft. Had that software been GPL, they could have just hired another company to fix/maintain it. Again, in that situation it is in the company's best interest to NOT freely (as in beer) redistribute the program.

    I think the same can be said for many non-commodity software products. I mean, how many people actually need or want (or have the hardware) to run a PeopleSoft level program at home? Better yet, if a company pays, say $250000, for a program and accompanying support, what incentive do they have to turn around and give it away? It seems to me that the GPL is really perfect for situations like this.

  25. Export/import not enough by Anonymous Coward · · Score: 1, Insightful

    Wrong. By making the native format of the application open the format must fully represent the data. With input/export formats it's just too easy for unethical companies to leave out some critical element, making them useless for interoperability. eg. M$Word's useless "XML" format.


  26. Why they write the laws, not us by Quila · · Score: 2, Insightful

    I've seen all the comments on loopholes and inconsistencies in this proposed law. This is why the politicians and corporations normally write the laws.

    Unlike us, they know how to eliminate loopholes and gross internal inconsistencies. Take, respectively, the tax laws and the DMCA for example.

  27. Everytime you make a law... by NightEyez · · Score: 1, Insightful

    .. you lose a freedom. Let the market decide the formats. This law sounds like something Europe would make.

  28. Yes, this is why open format laws are a good idea by smcv · · Score: 2, Insightful

    If you have a law forcing data to be in properly-documented formats, proprietary software companies are free to implement apps that work with that format, without open-sourcing anything.

    They'll just have to compete (with open-source apps, and with other proprietary apps) on features, ease of use, reliability and things like that, that actually help the user, as opposed to competing on "but we're the only people who *really* know how .doc format works".

    (IMO governments should use open formats; private companies shouldn't be required to use open formats, except when they interact with governments, which would hopefully mean they see the sense of going with open formats throughout.)

    (Of course, if someone decides the best way to document their format is to open-source a sample implementation, that'd be nice too...)