Geneticists Push For Databases Over Journals As Main Source of Information (theatlantic.com)
neoritter writes: The issue of reproducibility in journals continues to present problems. This time it is in the world of clinical genetics, where a misleading or incorrect journal article on the effect of a gene variant can affect the decisions made by doctors and patients alike, from heart-monitoring implants to abortions. Poor sampling and low thresholds for evidence have led some clinical geneticists to work towards an open database of genetic information. Scientists and doctors would go to a "one-stop shop for disease genes" to check and share information with each other under the strictest of standards.
it's duck season!
It's not the doctors that I worry about having access to this database.
You are welcome on my lawn.
This is a wonderful idea. I mean, why not push for more studies to actually provide their raw data along with their conclusions? Extend the peer review process of the scientific method to include all of the data they generate, as advances in technology allow for the storage and communication of that information now. What is wrong with that, as a general idea? There is always the worry of security or safety of the data, but that was the same with publishing some things in journals already.
The main person quoted in the article, Heidi Rehm, is 100% right about the need for a central open database of known genetic disease variants. And, just to get the Slashdot crowd interested, she also has a bit of the sexy librarian look going on.
But, as far as I can tell, she really hasn't been able to get much funding allocated for such databases (e.g. ClinVar and ClinGen). A couple of years back, she got a grant for a few million dollars. But in a world where the USA thinks a long-running war in Iraq is so wonderful that it's worth spending trillions on it, a few million is absolute peanuts. And Obama has made some worthless speeches about a "Precision Medicine" initiative but hasn't actually ponied up any real cash.
Personal/clinical genomics today is like personal computers in the 1980s. Personal computers didn't give us self-aware AI, and personal/clinical genome sequencing isn't going to make us live forever (i.e. cure aging). But personal/clinical genome sequencing is one of the biggest revolutions in the history of medicine - right up there with aseptic surgery and antibiotics. Back in the 1980s there were networked computers and limited forms of email, available only in very specialized contexts. But now everyone has a (networked) computer and all kinds of electronic communication that go far beyond email. In the last decade, a relatively small number of people have had their genomes sequenced - and obtained useful clinical information. But that's going to explode. In a decade or two pretty much everyone in the developed world will have their genome sequenced.
I know that there's a lot of anger and cynicism about medical care in countries like the USA. There are some obvious market failures in the form of monopolies that limit the availability and dramatically increase the cost of access to medical doctors and medicines. And the USA has responded by layering on additional bureaucracy in the form of mandatory health insurance.
But there's also hope. A lot of lives are going to be saved and a lot of disability and suffering is going to be prevented by widespread personal/clinical genome sequencing. Let me give just one example. There are certain drugs that are known to be either ineffective for or toxic to people with certain rare genetic variants. As it is, everyone is given the drugs and the doctors hope that they can detect the problem before the patient ends up dead (sometimes they do detect it in time, and sometimes they don't and the patient ends up dead). With personal genome sequencing, people will know ahead of time which drugs to avoid - and won't end up dead from being given the wrong drug (i.e. wrong for their particular genetics).
Any research or study of merit should be put into a database. This helps not only verification and result replication, but also makes searching and cross referencing far more effective. The verbosity required for journal publication is unnecessary, and the formats unusable without re-entering the data for proper formatting and processing.
Other areas that desperately need database coverage are things like copyright / patent / trademark registrations. In fact, copyright should go back to a registered concept (instead of the default copyright system that we have now), and the work must be added to a fully searchable database with all appropriate key fields and variants (e.g. lyrics + score + references + recording for music, etc). Trademarks and patents are currently searchable only because of entities like Google, and not because the government offices in question make them properly accessible, including all pertinent raw data, references, and patent examiner notes that go into the applications.
No databases. Journals are still a must. Why?
First, with databases, it is easy for a company to control who has access to the data, charging astronomical rates. With a journal, I save a copy or have a hard copy, and now, or maybe ten years from now, I can access it for a reference. With a database, the data could still be there, or it could be expired, and who knows how much I would have to pay for access.
Second, the databases can be hacked or modified. Nobody would ever know.
Third, the data can disappear at the DB owner's whim.
All in all... just no.
But if everyone populated and used curated public databases, then there would be no need for the army of PhD and Masters students employed by IPA/IVA to read papers and feed their proprietary knowledge base. What would people do with all the money that's currently spent on propping up the Qiagen army?
Ask me about repetitive DNA
This will fail for academic geneticists. The reason is that there are already vast commercial databases that are far bigger than anything academia could put together or fund, in the form of Helix, recently spun out of Illumina, and the database being built by 23andMe (btw, if you get sequenced by 23andMe, they keep your sequence data for others to reference in the future; that's a call-out to you privacy guys). An open database sounds good, but it's only as good as the data that you put in, and Helix is already WAY ahead. So its usability will be doomed when you can just access a Helix reference for a few hundred bucks.
Once we have full genome sequences for enough species (even different members of the same species with some known differences tagged per specimen) we can create an algorithm to figure out how to write DNA. Synthetic biology will go from copying-and-pasting pseudorandom fragments of code that likely do what we want to things like a seed that will grow into a house or an amoeba that will turn into a leviathan-style spacecraft if you give it an asteroid to eat. Our ability to exploit the universe will multiply exponentially once we have a way to code novel organisms from scratch like we might a computer.
It was only a matter of time before the SJW-ification in higher education and research departments spread from the liberal arts 'research' over toward scientific research.
When scientific journals start publishing articles about how certain chemical compounds are misogynistic, and those articles pass peer review, we will definitely need access to the data in order to reach our own (more sane) conclusions.
The usual checks on falsehood and self-delusion simply break down.
Everyone in the industry of genetic analysis and gene prediction has a hand out for a dollar. The temptations are huge and the self-checking is a joke.
Time for a common database? Sure.
Also time for some civil penalties for those who sell bogus data.