Genome of DNA Pioneer Is Deciphered
unchiujar writes "The New York Times reports that the full genome of James D. Watson, one of the discoverers of the structure of DNA in 1953, has been deciphered, marking what some scientists believe is the gateway to an impending era of personalized genomic medicine. A copy of his genome, recorded on a pair of DVDs, was presented to Dr. Watson on Thursday in a ceremony in Houston by Richard Gibbs, director of the Human Genome Sequencing Center at the Baylor College of Medicine, and by Jonathan Rothberg, founder of the company 454 Life Sciences. 'The first two genome sequences belonging to individuals are now being made available to researchers within a few days of each other. One is Dr. Watson's and the other belongs to J. Craig Venter, who as president of the Celera Corporation started a human genome project in competition with the government. Dr. Venter left Celera after producing only a draft version of a genome, his own, in 2001, which the company did no further work on. He has now brought his genome to completion at his own institute in Rockville, Md., and deposited it last week in GenBank, a public DNA database, he said.'"
Torrent pls?
this is really really cool for obvious reasons. it's also really really scary for equally obvious reasons. if I wasn't so afraid of the potential harm of misusing this power I'd sign my name now to be the third person done.
thats right, I rarely use capitals. deal with it. but don't mistake my laziness for stupidity
For the curious, read a pretty good synopsis of Dr. Watson here: http://en.wikipedia.org/wiki/James_D._Watson#Contr oversy_about_using_King.27s_College_London.27s_res ults, and if you are extremely interested, pick up a copy of "The Double Helix." It is really strange, but even his autobiography makes him sound like a total ass, and includes an apology of sorts in the revised version, which is commendable.
In short, Watson stole a lot of data, and the structure of DNA would have been determined in less than a couple months by the more deserving Linus Pauling, who has conducted himself in a much more dignified fashion. It is really strange how superficial history records events, with the "first" often the most noisy, obnoxious scientist / engineer / artist, and not the industrious, studious type.
Well, perhaps they will find some genes responsible for the "jerk" phenotype... (at work, have to post AC).
However, Dr. Watson was told that he could not use his DNA, as it had been patented by the company and any use of his own DNA without proper permission would lead to serious legal consequences...
Cum catapultae proscriptae erunt tum soli proscripti catapultas habebunt
I've often thought about this (I'm a doctor...)
.75 billion bytes = .75 GB = 750 MB.
By my calculations:
3 billion base pairs in the entire human DNA sequence (give or take). Each base pair can be A, C, T, or G. (look at wikipedia or biology text for details.) Thus, each base pair can be represented by a 2 bit number (00 01 11 or 10).
Thus, 3 x 10^9 base pairs * 2 bits / base pair = 6 x 10^9 bits = 6 billion bits * 1 byte / 8 bits =
A standard DVD holds 4.3 GB, so you could fit almost 6 full humans on a DVD. Of course, this doesn't count compression (which would be astoundingly effective given repetition and patterns in DNA sequences) nor the fact you could just encode the delta as much DNA is conserved. In fact, very little DNA varies between humans, so I'd bet you could quite deterministically encode a human in as little as 100 MB if you had a "standard human DNA sequence" for reference.
Of course, you would need some magical method to reconstruct this DNA and put it into an egg at the right timing, which would likely form an approximation of the identical twin of a person. The technology for this is not here yet. Also, this does not encode any of the proteins / apparatus / mother that is needed to go from DNA in egg to functioning human.
Still, it is interesting to think about!
Slashdotter, ID #101. UIDs are in binary, right?
Just for anyone not following along:
Celera is a bad news company, and news involving them should always set off alarm bells.
They are decent at motivating people, though. Based on their track record and stated intentions they caused a massive movement to decode the human genome as public property after they announced they would compete with the federally funded decoding initiatives for the purpose of patenting the findings and licensing that data to private companies. As John Sulston, who led the British arm of the Human Genome Project put it: 'We were in a position of responsibility... without us, the human genome would be privatized.'
Here's a quote from The New Atlantis:
"Celera's mission was to sequence the human genome better and faster than its government-funded rival. It aimed to sell access to genomic information as well as the tools to interpret it, with an eye to "big pharma" and other biotechnology companies looking for a treasure trove of new drug targets."
Venter, named in the submission, was the CEO of Celera at the time this strategy was developed and was deposed several months after it became clear that the public would beat Celera to the goal.
This is admitedly troll bait, but I feel a burning personal need to inform people about this man's actions whenever I see his name in print.
Regards.
Yes, the man was part of a team that made a huge scientific breakthrough. If someone wants to argue that that makes him a genius, well, I won't start an argument on that front. But there's no doubt that Watson was (and still is) also of poor character.
He and his colleagues knowingly stole vital DNA X-ray diffraction data from Rosalind Franklin and Raymond Gosling without their knowledge and consent (indeed, Franklin had even refused to share it), which tarnishes their acheivements.
More recently, he has called for genetic screenings before birth to weed out "really stupid" people (the bottom 10 percent or so), and he has a nice line in how to deal with homosexuality, too. He believes "that if the gene [for homosexuality] were discovered and a woman decided not to give birth to a child that may have a tendency to become homosexual, she should be able to abort the fetus." Not to put too fine a point on it, but that strikes me as being rather too close to Third Reich thinking for my liking.
He might have performed some fantastic science but, to me, his words preclude him from being considered a great scientist. Certainly they show that he's not a great human being.
"Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg
so you could fit almost 6 full humans on a DVD.
Only six? With lossy compression, you could do significantly better, as long as you don't mind all your offspring being funny-but-similar-looking lactose-intolerant non-deterministic sociopathic freaks.
it's a blue bright blue Saturday hey hey
I'm doing undergrad biochem and we've done this math several times, as has been mentioned here in other threads, 1GB is the ballpark amount of space a single UNCOMPRESSED human genome should take up.
On one hand, this is a marginal underestimate because there are more than 4 DNA nucleobases (quite rare, but they exist and need to be recorded if you're profiling a genome).
However, the genome should be quite happily compressable (think bz2 or some specialized lossless form of compression) due to MANY repeating sequences and the fact that most exons (that you'd normally use 6 bits to describe) can be described using 5 bits by pinpointing their product on an amino-acid table (numbering 20 members most of the time), or even 4 bits if you narrow that table from the 20-most-common to the 15-most-common and use the 16th position to describe less-common sequences using more bits, just to name a few reasons.
Maybe a bit of added data they put in describes things we've learned about the data which wasn't physically present in the original DNA such as "here ends intron, here starts exon, here be boundary" etc.
In short, it should be highly compressible and fit in way under 2 DVD's, so for the life of me I can't figure out what they plugged onto two DVDs. Software to decipher it? Gene database correlating what's in your personal genome to what the genes are known to do? Free BonziBuddy extra content? Bonus "behind the scenes" material?
-