Open Source And Genetics
UnanimousCoward writes "SFGate has an article about some researchers pushing for the open sourcing of genetic research software. Of course, the pros and cons are debated." It's the age-old debate; what follows the heart of the scientific method more? Peer review, or getting the information out as fast as possible?
Didn't we see this article yesterday right here?
If it gets open sourced can we expect to find a book on our bookshelves titled "Genetic Engineering for Dummies..." and "Learn Genetic Engineering in 21 days"?
--- I used to moderate, then I read the -1 articles and decided having to filter through them was not worth it.
This is just my opinion (YMMV, IANAS, etc), but I would argue for peer review, insofar as it relates to the ability to duplicate results and experiments. My thinking here is that the ability to duplicate results is at the heart of science. If others can't duplicate them, the results aren't accepted.
A grain of salt is provided for those who disagree.
The Independent: Reverend Spooner Arrested in Friar Tuck Incident - ISIHAC, Historical Headlines
Despite all the arguments favouring IP rights et al, I'd say that most of the research has to be open-sourced, so to speak.
There is no point in re-inventing the wheel. It becomes ridiculous when 10 different companies engage in enormous investments independently on 10 different projects, when they could have all done 10 different projects to begin with.
IP has it's own say, no doubt, but over doing it leads to commercialization of science, and defeats it's very purpose. Engaged in silly patent and copyright wars, we have no doubt postponed many a useful invention by at least a few decades, IMHO.
Right now with the boom in genetics this software is top-dollar in some cases. I doubt these emerging companies would be willing to part with their profit margins in the name of open source.
Be more clear in the writeup: the subject of the article is a petition to open-source any code that results from federal grant money from reseachers in a university setting. It is *not* about open sourcing all bioinformatic software, everywhere. In other words, code developed in a private setting, independant of university grant money - who cares? (at least in this case). Alternatively, grad students who strike it rich on code they wrote while on the government dole? Open source *their* work, since it was already paid for by the taxpayers. I don't see anything wrong with this particular distinction.
I don't think genetic code and source code should be treated the same. After all malicous sourcecode only hurts computers and inconvienices users. Genetic code effects us on a much more real level.
I can release code that has a bug in it and fix it pretty quicky. If it causes problems, it only causes problems. Meddling with life is not so trivial and the concequences can be fatal. It is for this reason that I am against open source genetics.
Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.
Why not both? Quick publication with clear marks as unconfirmed or waiting to be replicated would do both. It also would provide a quick source of worthwhile experiments, repeating previous successs and confirming them.
So long and thanks for all the fish . . . !!!
*O*H***W*E*L*L*
Software for research in genetics, biology, fluid dynamics, astronomy and any other subject that requires such colossal amounts of computations should, in my opinion, be open sourced. This way, several things can take place:
Well, you get the picture. All of this becomes incredibly expensive with closed source software. Of course, nobody said it has to be free software. Obtaining the source code could very well require an NDA, if that's what will float the developer's boat.
*O*H***W*E*L*L*
We use Mac's too. For image processing. And one particular Gene-specific application. But that totals maybe 10 machines. We have more Linux desktops. And more Penguins (4) in the data center than Apples(0).
Carl G. Jung
--
"With one breath, with one flow, You will know Synchronicity" -La Policia
If public funds were used to finance the research, then it should be public information. The only exceptions should be driven by security concerns. If private monies are put into the mix, let us thank the donor for their donation.
We are the public and our tax dollars are used to generate this information. We own it.
Give it to us at the bare cost of distribution.
Unlike scientists who keep research secret until it is published in a peer-review journal, some software developers -- who get little credit when their code leads to a genetic breakthrough -- want to share their work as soon as it leaves their keyboards.
It's an old debate in the world of computing -- and a new culture clash in bioinformatics...
It seems like there is some confusion here. Sure there is some time-lag between the research and its publication, but peer-reviewed journals are in fact similar to bugzilla.
The problem being addressed by the petition isn't what is published in peer-reviewed journals, its what isn't being published. Making scientific techniques proprietary not only slows the advance of science, but takes technology which could benefit the public and allows a few people to benefit disproportionate to their contribution.
I once obtained RNA secondary prediction software from Washington University. Before we could obtain it, we had to sign waivers that we would not re-distribute it outside of our own research university. Once we got the software though, it came in source form. So AFAIK the source for academic software developed for life sciences is probably available upon request, you just can't redistribute it at will. As for publications, the alogrithms for predicting the RNA secondary structures were published, but not the code itself. Hope this is informative.
NO CARRIER
What should be open source and free beer free is the specialized software used by many researchers for processing DNA and Protein sequence information. For instance, take the very nice software package Vector NTI. Does many things researchers want. Also costs $5000. Instead of NIH (government science funding agency) money being spent many times (once per funded lab), what they should do is earmark some money to get programmers to write for free usage software like Vector NTI. It would cut down on the redundant purchase of software by many government-funded laboratories. Don't even get me started on how much research money is being spent on Windows, Word, Excel, PowerPoint, and Adobe Acrobat...
I work in genetics, and particularly with the computing aspect of it. My lab has a tight budget, and open source analysis software would be a tremendous help to us.
Of course, we have some OSS tools already, namely EMBOSS and others, but the big hitters in the bio-computing field (VectorNTI, MacVector, DNAstar, etc) run in the $5000-10000 range per license. Right now, they are much more usable than their open sourced cousins. I'd love to see genetics software continue to be developed open source. We could give less taxpayer money to software corporations.
"It's the age-old debate; what follows the heart of the scientific method more? Peer review, or getting the information out as fast as possible?"
The scientific method is both peer review and the fast, free flow of information. Further, both are so intertwined that it is impossible to separate the two or say one is more important than the other. Without the free flow of information, one cannot cannot give a proper peer review of the work. Without good peer review, the information you get is garbage, thus clouding the review and making the knowledge suspect.
The work should be made open source after good review, not before. That way, you build upon that previous work such that the next scientific advancement is upon solid ground.
-When going for broke, go for Ithaca!
bioinformatics.org
They host a large number of these open-source bio software. Really worth a look if you're interested in the topic.
There was an interesting discussion on this topic last year over on bionet.software, with contributions from several authors of significant bioinformatics software (search for 'open source' in the bionet.software group at groups.google.com).There's also a good interview with Ewan Birney (of BioPerl and EnsEMBL fame) about open source bioinformatics here:
http://conferences.oreilly.com/biocon/
Like some of the other contributors to this thread, the argument I find most convincing is that peer-reviewed scientific publications usually (and for very good reasons) require full disclosure of methods - why should software be exempt from this? Similarly valid criticisms were made about _Science_'s decision to publish the Celera human genome paper with only conditional access to the primary data. Biotech companies are of course free to conceal both their data and methodology, but the scientists involved should not then expect the right to contribute to the peer-reviewed literature.
There are also sound economic reasons for an open source approach to significant scientific software: it's ironic that software developed with public funding has in some cases been commercialized by the host institution of the programmers, leading to a situation where other laboratories that receive grants from the same funding body end up paying substantial license fees! (some funding agreements are now savvy enough to include a clause preventing this). In fields like molecular biology, there's also the risk that initially reasonable license fees can rocket when the software proves to be useful. This is pretty much what happened in the case of GCG, a set of sequence manipulation utilities originally developed by a university department, but later acquired by a large biotech company. The increasingly restrictive and expensive license was one of the factors that led to the creation of EMBOSS:
http://www.hgmp.mrc.ac.uk/Software/EMBOSS/
a GPL/LGPL'd alternative that in many respects now surpasses GCG, and runs on a wider range of Unix-like platforms (including Linux, Cygwin and MacOS X).
I received the following email message from the CFO of a company called LabBook, about my Bull Shit Markup Language (BSML) web page.
Appearently, they would prefer that people searching for "BSML" did not turn up my web page. I wonder if they've tried to get the Boston School for Modern Languages to change their name, too?
Now isn't the whole point of properly using XML and namespaces to disambiguate coincidental name clashes like this? If LabBook thinks there's a problem with more than one language named BSML, then they obviously have no understanding of XML, and aren't qualified to be using it to define any kind of a standard.
Maybe LabBook should put some meta-tags on their web pages to decrease their relevence when people are searching for "Bull Shit" or "Modern Language".
-Don
========
From: "Gene Van Slyke" <gene.vanslyke@labbook.com>
To: <don@toad.com>; <dhopkins@maxis.com>
Sent: Monday, November 12, 2001 10:36 AM
Subject: BSML Trademark
Don,
While reviewing the internet for uses of BSML, we noted your use of BSML on http://catalog.com/hopkins/text/bsml.html.
While we find your use humorous, we have registed the BSML name with the United States Patent and Trademark Office and would appreciate you removing the reference to BSML from your website.
Thanks for your cooperation,
Gene Van Slyke
CFO LabBook
========
Here's the page I published years ago at http://catalog.com/hopkins/text/bsml.html:
========
BSML: Bull Shit Markup Language
Bull Shit Markup Language is designed to meet the needs of commerce, advertising, and blatant self promotion on the World Wide Web.
New BSML Markup Tags
CRONKITE Extension
This tag marks authoritative text that the reader should believe without question.
SALE Extension
This tag marks advertisements for products that are on sale. The browser will do everything it can to bring this to the attention of the user.
COLORMAP Extension
This tag allows the html writer complete control over the user's colormap. It supports writing RGB values into the system colormap, plus all the usual crowd pleasers like rotating, flashing, fading and degaussing, as well as changing screen depth and resolution.
BLINK Extension
The blinking text tag has been extended to apply to client side image maps, so image regions as well as individual pixels can now be blinked arbitrarily.
The RAINBOW parameter allow you to specify a sequence of up to 48 colors or image texture maps to apply to the blinking text in sequence.
The FREQ and PHASE parameters allow you to precisely control the frequence and phase of blinking text. Browsers using Apple's QuickBlink technology or MicroSoft's TrueFlicker can support up to 65536 independently blinking items per page.
Java applets can be downloaded into the individual blinkers, to blink text and graphics in arbitrarily programmable patterns.
See the Las Vegas and Times Square home pages for some excellent examples.
Take a look and feel free: http://www.PieMenu.com
Science and Open Source are different animals.
Just because Scientists appear to be independent agents, it does not mean that Science in itself is not cohesive.
Open Source was to some extent initiated by published listings, including source code for the early Unixen. When this was closed, the attempt was to develop "free" copies of the utilities, and then, the ultimate operating system.
The Open Source is now just moving into more advanced applications, replicating the larger commercial things in scope.
In some extent, Web Browsers were the first really open-source application that was a class leader. Netscape made a browser that rolled in the email client and ftp client, but Mosaic was there first.
But what we need is some sort of open-source idea development.
Science, especially as practiced in the West, is more based on a "religion" that has won its place through arguement, rather than cooperation.
In essence, an idea that moves into a spot becomes entrenched, and more difficult to move. So a new idea needs to arrive as more powerful to displace it through arguement.
For this reason, Scientists tend to be more cautions of letting ideas gain a foothold.
In the book "The Cathedral and the Bazaar", Eric S Raymond contrasts the world of acedemia and open source, the former competative and the latter cooperative.
Acedemia places fully developed ideas in competition, and your status in acedemia depends on your publications. The Open Source community is more open to the development of someone else's idea, and overlooks the flaws in the supplied implementation.
So whereas Scientists rejected, say, Velikovsky's ideas as crazy [because the presented package as a whole does not work], the open source people accepted Linus' alpha kernel as an idea to be extended into Linux.
We could have an open source Science that looks into alternate views. An Enthusist's science, not guided by acedemia, so to speak. But we disparage the opportunities as cranks, and build no alternate to acedemia.
de Bono offered Six Hats of thinking, as an alternate. Maybe we should look at this.
OS/2 - because choice is a terrible thing to waste.