Copyright Status of Thermodynamic Properties?
orzetto writes "I work at a research institute, and programming models of physical systems is what I do most of the time. One significant problem when modeling physical processes is finding thermodynamic data. There are some commercial solutions, but these can be quite expensive, and to the best of my knowledge there are no open source efforts in this direction. In my previous job, my company used NIST's Supertrapp, which is not really that expensive, but is written in Fortran, and an old-fashioned dialect at that. As a result, it is a bit difficult to integrate into other projects (praised be f2c), and the programming interface is simply horrible; worse, there are some Fortran-induced limitations such as a maximum of 20 species in a mixture. I was wondering whether it would be legal to buy a copy of such a database (they usually sell with source code, no one can read Fortran anyway); take the data, possibly reformatting it as XML; implement a new programming interface from scratch; and publish the package as free software. Thermodynamic data is not an intellectual creation but a mere measurement, which was most likely done not by the programmers but by scientists funded with our tax money. What are your experiences and opinions on the matter? For the record, I am based in Germany, so the EU database directive applies."
FORTRAN awful? Give me a break.
</sarcasm>
If the NIST program is the product of the work of US Government employees it is in the public domain. I would not be surprised if many of the commercial closed-source programs for the same purpose are based on it. In any case, tabulated data is not protected by US copyright so someone in the US could certainly do as you suggest.
Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
I Recommend Talking To A Lawyer In Your Jurisdiction.
HTH
Let me tell you something: God speaks ForTran, and the guys who translated the bible from ForTran to Hebrew did a really really bad job.
The FAQ claims that the US government has a copyright on the material. This could be possible if the material was not directly generated by the NIST itself --- for example, they paid a contractor to generate it and it is considered a "work for hire".
The facts themselves probably can't be considered to be under copyright.
OTOH, I agree with a previous poster that you should consult a lawyer if you want to actually do anything which isn't sheeple-ish with the data.
Anything produced by the United States Federal Government (which the National Institute of Standards and Technology certainly qualifies as), is in the public domain.
That's what he meant.
My blog
I would assume that it would be difficult to sell a commercial solution for scientific purposes unless it is based on already documented and accepted data. Basing your scientific work on calculations made by a commercial solution with homegrown data would make it difficult to openly document your method to other scientists. So why not find the published version of those data instead of lifting them out of software?
But what do I know? I am an engineer, not a scientist.
In my work I do a lot of calculations of water and steam properties, and the available software I know of is strictly using the calculation methods published by IAPWS. So if I wanted to, I could buy the IAPWS documentation and make my own software.
A database is copyrightable. See http://www.bitlaw.com/copyright/database.html
I can't find my copy of Supertrapp at the moment, but as I recall there is some strange wording in the license. It's definitely NOT public domain as asserted by the uninformed.
It's also not tabulated data. It's a collection of equations and empirical constants embedded in what may be the worst code I've ever seen.
It may be easier to track down the original papers and work from those, though that too is difficult as lots of the original work was published in obscure journals.
FWIW I am very comfortable w/ FORTRAN and prefer it for serious numerical work (default choice is C). I'm also quite skilled at interfacing FORTRAN to other languages.
I'm interested in working on such a project and have quite a bit of experience w/ the problem, though only limited experience w/ Supertrapp because it is so bad I tended to avoid using it unless I absolutely had to. Please send me an email so we can discuss more. rhb acm.org
Reg Beardsley
People using this NIST data do it because it has NIST sign on it, so they don't risk being dependent on tabulated values from not exhaustively verified source. If you're rewritting the source code, you should take care to establish means by which users could check that data are unaltered with respect to what NIST servers contain. If you work for renowned institute, that should be easy, just store the database on your server and sync it with NIST, along with sources of data cited at NIST website.
As it comes to Fortran programming, it's optimal language for scientific computing. Modern dialects have some of the power of C (allocatable arrays, long subourtine names, free format code, modules, interoperability with C), but, what is preferable in scientific computing, programmer isn't encouraged to tinker with machine-specific stuff. Many existing codes are written in Fortran, e.g. powerful LAPACK library and many computational chemistry packages, so for many physicists/chemists/engineers Fortran is the only language they know and care of. Moreover, Fortran in recent years has gained parallel-programming functionality thanks to OpenMP (it's provided with features eqivalent to that in C/Cpp).
...they specialize in international copyright law. While a US Citizen may be able to "copy" or "rewrite" code US taxpayers paid for, don't assume other, non-contributing (non tax-paying to the US) foreigners can openly copy and redistribute code that is technically the property of US citizens. Personally, I don't care as your topic is of little interest to me, but unless someone is attorney/lawyer in these copyrights, I wouldn't listen to anyone here.
Architectural Renderings
Empirical models of thermodynamic properties are definitely protected by copyright. There is a high-value market for these models, and different models of the same thermodynamic process will evaluate differently so it is a valuable creative product rather than a mere description of reality. For fields where tiny improvements in efficiency generate big cost savings, you want to use the most accurate model available where "most accurate" will be a function of the use case.
Thermodynamic property models are not measurements of reality, they are mathematical models of a physical process derived from empirical data. They are what you use to predict reality when it is not possible or practical to measure it. Turning the empirical data points into continuous functions is a creative step and the value of the creative step is in minimizing the divergence between the model and reality over as broad a range as possible. There are companies that specialize in producing and selling ultra-accurate thermodynamic property models.
I think that the biggest problem isn't intellectual property, but the people who administer it. I don't think that the demand is particularly great. As such, there isn't a great incentive to release it freely. There are costs to administering such a large DB. Furthermore, nobody wants their name on a database of all the fundamental properties because in that data there are bound to be mistakes.
You are looking at the liability issue for the creator/admin, the supply side. The bigger liability problem is on the engineer, demand side.
Something that is missing from this discussion, is some Chemical Engineer specific knowledge that I can attempt to provide. The whole point of a "steam table" and similar products like discussed here, is there is no accurate formula for vapor pressure at various temps. The simplistic linear equations taught in high school don't work at the extremes, or don't give accurate enough results to design a safe and profitable plant. So, more than a century ago, physicist / chemist / engineers started making lab measurements, and selling graphs and tables of data. The modern version of that product is the expensive computer models discussed in the article, which optimistically try to answer any input conditions with correct and continuous answers based on a mixture of theory, optimism, and some distinct individual laboratory measurements.
Because the data model is used to design multi-million dollar plants, and because the only way to verify the results is very expensive lab work, and is therefore often glossed over, a mistake in the data model could be a multi-million dollar mistake, assuming the losses are purely economic and no human victims. The creator/admin probably was intelligent enough to release under a license that removes all liability for data errors. The end user engineer will not be so lucky.
On one far extreme of the provability / testability spectrum, you've got yet another word processor, where if the screen doesn't match what you typed in, literally a trained gorilla could figure out the word processor is broken, and act accordingly (throw poo at programmer? The more things change, the more they stay the same) Or maybe a crypto hash where a hundred programmers can write it in a hundred languages and all the outputs better match for a given input.
At the other extreme of provability / testability, you've got a Chem-E basically having to take the program output on faith that it's correct. The program says the pressure of supercritical steam at 700 K is 230 atm, I know that is somewhat above critical temp and above critical pressure, so the best I can do is "sounds about right to me"? So specify plant components based on a 230 atm environment (adding appropriate safety factors, etc) Now steam pressure is old stuff, boring, and everyone knows about what to expect, but using really weird stuff under really weird conditions, who knows what crazy output from the data model might slip past, resulting in a disaster?
Despite the dangers, it would be great for education, and cheap experimental research/simulation, even if it would be too legally dangerous to use in formal design work.
"Science flies us to the moon. Religion flies us into buildings." - Victor Stenger
That's a legal question. The answer to that question might seriously complicate your life if you get it wrong. What would possess a person to ask this of Slashdot instead of contacting a lawyer? Better yet, why would a German expect a USA-based Web site to be familiar with the nuances of German (or EU) copyright law? I'm trying to picture a situation where I'd contact a German online forum to ask for legal advice pertaining to American law and I just can't come up with anything.
I suppose next we'll see an Ask Slashdot which says "hi, I'm a diabetic and I forgot how much insulin I am supposed to inject myself with, please advise." And I'll have to scroll down significantly to see a partly-buried comment where someone finally suggests that perhaps he should be asking a doctor...
It is a miracle that curiosity survives formal education. - Einstein
I warned the project manager that sure go ahead and use the data as a basis for programming but not for the production program.
A couple of months later, the competitors lawyers appeared and (cough) out of court (cough) settlement.
Never did find out how much it cost "my" software house...
In the end they had to employ a gaggle of impoverished undergrads to build their own DB.
So, be very very careful. It might be a good idea to *ask* if you can re-use the data - often it's possible for non commercial purposes...
Andy
slashdot knows more than most IP lawyers. You have to remember, lawyering is essentially about either telling you what case law precedent has established (like don't rob banks), or arguing for your side, not about "truth" or "right and wrong". Most lawyers are more the type that will take the money, then figure out how to argue the way you want... they don't do good with "advice" not tied to rulings in court. Ask the right question and you'll pay the lawyers a bunch less money... nobody wants THAT!
In regards to the question, he's looking to pull government funded data out of a program. Considering most countries in Europe allow the state to charge for everything it can and that they have "database aggregation" "copyrights". His plan would probably get him sued.
So now that most of slashdot would agree on that outcome.... what other resources are available to obtain his desired outcome. This is where the slashdot crowd helps because they're in different countries and chances are pretty good somebody will know what government or research office he should talk to... There are still huge chunks of government services that aren't documented anywhere on the internet. Corporations that know who has the info freely available love to keep their sources opaque so the industry has to go thru them to get free information, and in many cases that means "handshake" deals so key offices never have time to get their web pages posted properly. You'd be surprised how many government services are still known only by the posting outside the office door (in the basement just above the beware of leopard sign) or maybe the phone book if you're lucky.
I don't know if you're trolling or just grossly misinformed, but that's not even close to correct. Lyrics are the copyrighted creative works of the person/people who wrote them. "Changing one word" does not allow someone else to then distribute the lyrics legally. That would be considered a "derivative work," the creation of which is a right provided to copyright holders under copyright law.
All the formulation for the prediction of water properties are published by International Association for the Properties of Water and Steam (IAPWS) so that they can be used. It is often important to use exactly the same correlations so that that all the thermodynamic data are self-consistent. Therefore the formulations are standardized by an international body. I do not think that the use of this formulae is in any way restricted because such restriction would defeat the very purpose - standarization. See http://www.iapws.org/ for the collection of the current formulations. There may be a restriction for a particular implementation (computer program) or sets of tables ( "lookup tables" and interpolation are often used for performance). Not sure. Hope someone starts an extension to implement these kind of things in Gnumeric. Cheers. Ephraim the horse.