Copyright Status of Thermodynamic Properties?
orzetto writes "I work at a research institute, and programming models of physical systems is what I do most of the time. One significant problem when modeling physical processes is finding thermodynamic data. There are some commercial solutions, but these can be quite expensive, and to the best of my knowledge there are no open source efforts in this direction. In my previous job, my company used NIST's Supertrapp, which is not really that expensive, but is written in Fortran, and an old-fashioned dialect at that. As a result, it is a bit difficult to integrate into other projects (praised be f2c), and the programming interface is simply horrible; worse, there are some Fortran-induced limitations such as a maximum of 20 species in a mixture. I was wondering whether it would be legal to buy a copy of such a database (they usually sell with source code, no one can read Fortran anyway); take the data, possibly reformatting it as XML; implement a new programming interface from scratch; and publish the package as free software. Thermodynamic data is not an intellectual creation but a mere measurement, which was most likely done not by the programmers but by scientists funded with our tax money. What are your experiences and opinions on the matter? For the record, I am based in Germany, so the EU database directive applies."
FORTRAN awful? Give me a break.
</sarcasm>
If the NIST program is the product of the work of US Government employees it is in the public domain. I would not be surprised if many of the commercial closed-source programs for the same purpose are based on it. In any case, tabulated data is not protected by US copyright so someone in the US could certainly do as you suggest.
Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
I Recommend Talking To A Lawyer In Your Jurisdiction.
HTH
Let me tell you something: God speaks ForTran, and the guys who translated the bible from ForTran to Hebrew did a really really bad job.
Anything produced by the United States Federal Government (which the National Institute of Standards and Technology certainly qualifies as) is in the public domain.
That's what he meant.
A database is copyrightable. See http://www.bitlaw.com/copyright/database.html
I can't find my copy of Supertrapp at the moment, but as I recall there is some strange wording in the license. It's definitely NOT public domain as asserted by the uninformed.
It's also not tabulated data. It's a collection of equations and empirical constants embedded in what may be the worst code I've ever seen.
It may be easier to track down the original papers and work from those, though that too is difficult as lots of the original work was published in obscure journals.
FWIW I am very comfortable w/ FORTRAN and prefer it for serious numerical work (default choice is C). I'm also quite skilled at interfacing FORTRAN to other languages.
I'm interested in working on such a project and have quite a bit of experience w/ the problem, though only limited experience w/ Supertrapp because it is so bad I tended to avoid using it unless I absolutely had to. Please send me an email so we can discuss more. rhb acm.org
Reg Beardsley
People use this NIST data because it has the NIST name on it, so they don't risk depending on tabulated values from a source that has not been exhaustively verified. If you're rewriting the source code, you should take care to give users a way to check that the data are unaltered with respect to what the NIST servers contain. If you work for a renowned institute, that should be easy: just store the database on your server and sync it with NIST, along with the data sources cited on the NIST website.
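A rough sketch of the "check that the data are unaltered" idea, in Python: publish a SHA-256 digest of the reference file next to the mirrored copy and let users compare. The file contents and the names `sha256_of` and `is_unaltered` are made up for illustration, and it is only an assumption that a trusted reference digest would be available at all.

```python
import hashlib


def sha256_of(data: bytes) -> str:
    """Return the hex SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def is_unaltered(local_copy: bytes, reference_digest: str) -> bool:
    """True if the local copy hashes to the published reference digest."""
    return sha256_of(local_copy) == reference_digest


# Hypothetical stand-in for a data file; this is not real NIST content.
reference_file = b"species,property,value\nplaceholder,placeholder,0\n"

# The digest the mirror would publish alongside the file (assumption).
published_digest = sha256_of(reference_file)

# A copy with one value changed no longer matches the published digest.
tampered = reference_file.replace(b",0\n", b",1\n")

print(is_unaltered(reference_file, published_digest))
print(is_unaltered(tampered, published_digest))
```

A digest only shows that a copy matches the reference; it says nothing about whether the reference values themselves are right.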
As for Fortran programming, it's an optimal language for scientific computing. Modern dialects have some of the power of C (allocatable arrays, long subroutine names, free-format code, modules, interoperability with C), but, as is preferable in scientific computing, the programmer isn't encouraged to tinker with machine-specific stuff. Many existing codes are written in Fortran, e.g. the powerful LAPACK library and many computational chemistry packages, so for many physicists, chemists and engineers Fortran is the only language they know and care about. Moreover, Fortran has in recent years gained parallel-programming functionality thanks to OpenMP (with features equivalent to those available in C/C++).
The EU database law specifically does not protect foreign databases unless the foreign country also creates a database law and establishes mutual protection. The US has no such protection; in fact, it seems no country outside the EU has established reciprocal database protection. It should be possible to do this open source project based on data from the US or from anywhere outside the EU.
The FAQ [nist.gov] claims that the US government has a copyright on the material.
The factual data in that database cannot be protected by copyright, it is not protected as a database in the US, and it is not covered under EU law. The only copyright they could claim is on creative images, creative text or the like, if the database contains any, in which case those particular elements could be protected; or they could perhaps claim a copyright on the creative arrangement and formatting of the data in the database. Both of those issues can be avoided.
What can be done is to use this database, read out the needed factual data elements, and then rewrite them into the database for the open source project. Purely factual text fields such as the name of an element or compound can be copied; just be careful not to copy any images or free-form text fields such as descriptive or explanatory text. Then write the data out in your own arrangement. The best thing to do there is to arrange the data in some strict alphabetical or numerical order, since there is no creativity and no copyrightability in that sort of unique ordering. That means not only storing the records in alphabetical order, but also ordering the data elements within each record alphabetically by field name. It might even be a good idea to rename any fields that are reasonably open to custom naming. There is no need to rename a field like "name" or "address" or "phone number", but a field like "work contact number" could easily be called "work phone".
The best way to go about it would be to create a mostly-empty, but functioning, database before even looking at your intended source material, that way by definition there is no copying of the formatting of the database. Once there is a functioning database design then the factual data elements can be copied from the source to fill the already-designed database.
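For what it's worth, the strict ordering suggested above is nearly a one-liner in most languages. A minimal Python sketch, assuming each record is a flat dictionary with a "name" field; the function name `canonical_records` and the field names are invented for the example, and the two molar masses are just the usual values from standard atomic weights:

```python
def canonical_records(records):
    """Sort records by their 'name' field, and the fields of each record by field name."""
    ordered = sorted(records, key=lambda record: record["name"])
    # Dictionaries keep insertion order, so rebuilding each one from its
    # sorted items fixes the field order alphabetically.
    return [dict(sorted(record.items())) for record in ordered]


# Illustrative records in an arbitrary "source" arrangement.
source = [
    {"name": "water", "molar_mass_g_per_mol": 18.015, "formula": "H2O"},
    {"name": "methane", "formula": "CH4", "molar_mass_g_per_mol": 16.043},
]

for record in canonical_records(source):
    print(record)
```

Whether a mechanical ordering like this is enough to avoid an arrangement claim is the legal question; the code only shows that the ordering itself involves no choices.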
-- You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
I am a federal worker and I oversee some contracts that involve writing Fortran codes for simulating nuclear reactors. That is not quite right. You need to consult the Federal Acquisition Regulation (FAR), Part 27. Specifically, see
27.404-2 Limited rights data and restricted computer software.
and
27.404-3 Copyrighted works.
http://www.acquisition.gov/far/current/html/Subpart%2027_4.html#wp1041836
If you read those sections, take the time to really understand the definitions they use, and read the appropriate appendices, etc., you will find that the legalese seems to indicate that the contractor IS allowed to copyright data generated in performance of the contract (with the government's permission), and that the government maintains a nonexclusive, irrevocable license to use such data for its purposes, but the government does not necessarily maintain an exclusive right to "redistribute" such data.
It is my belief that the law is written this way so as to give potential contractors an incentive to do business with the government. If a company can't build a portfolio of intellectual property, then it has no means of distinguishing itself from the competition. In the long run, the government would not get the best value for its $.
That's a legal question. The answer to that question might seriously complicate your life if you get it wrong. What would possess a person to ask this of Slashdot instead of contacting a lawyer? Better yet, why would a German expect a USA-based Web site to be familiar with the nuances of German (or EU) copyright law? I'm trying to picture a situation where I'd contact a German online forum to ask for legal advice pertaining to American law and I just can't come up with anything.
I suppose next we'll see an Ask Slashdot which says "hi, I'm a diabetic and I forgot how much insulin I am supposed to inject myself with, please advise." And I'll have to scroll down significantly to see a partly-buried comment where someone finally suggests that perhaps he should be asking a doctor...
It is a miracle that curiosity survives formal education. - Einstein
All the formulations for the prediction of water properties are published by the International Association for the Properties of Water and Steam (IAPWS) so that they can be used. It is often important to use exactly the same correlations so that all the thermodynamic data are self-consistent. Therefore the formulations are standardized by an international body. I do not think that the use of these formulae is in any way restricted, because such a restriction would defeat the very purpose: standardization. See http://www.iapws.org/ for the collection of the current formulations. There may be a restriction on a particular implementation (computer program) or on sets of tables ("lookup tables" and interpolation are often used for performance). Not sure. Hope someone starts an extension to implement these kinds of things in Gnumeric. Cheers. Ephraim the horse.
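For anyone wondering what the "lookup tables and interpolation" approach mentioned above looks like, here is a minimal Python sketch of linear interpolation in a one-dimensional table. It is not an IAPWS formulation; the function name `interpolate` is invented, and the table values are synthetic placeholders (y = x squared), not property data.

```python
from bisect import bisect_right


def interpolate(xs, ys, x):
    """Linearly interpolate y at x from a table with strictly increasing xs."""
    if not xs[0] <= x <= xs[-1]:
        raise ValueError("x is outside the tabulated range")
    # Index of the right-hand neighbour, clamped so x == xs[-1] still works.
    i = min(bisect_right(xs, x), len(xs) - 1)
    x0, x1 = xs[i - 1], xs[i]
    y0, y1 = ys[i - 1], ys[i]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


# Synthetic placeholder table (y = x**2), NOT thermodynamic data.
table_x = [0.0, 10.0, 20.0]
table_y = [0.0, 100.0, 400.0]

print(interpolate(table_x, table_y, 5.0))
print(interpolate(table_x, table_y, 15.0))
```

The trade-off is the usual one: a table is fast, but its accuracy depends on the grid spacing, which is one reason two implementations of the same formulation can disagree slightly.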