Burn your genes on CD -- for $500,000
An anonymous reader writes "Venter says he plans to offer the service, with the goal of burning individual human's entire DNA sequences onto shiny compact discs.
It will cost about $500,000 per person, says the entrepreneurial scientist who helped decode the human genome. "
From this first post: "Craig Venter, Time Magazine's Person of the Year in 2000 has a new hobby: collecting rich people's DNA. Millionaires are lining up to buy their personal gene maps for the cool price of USD$621,500."
Sources say there's about 3 billion base pairs in the human genome. If we assume a reasonably efficient encoding scheme, we can get 4 base pairs into a normal 8-bit byte without compression. This gives us a total data size of a little over 700 megabytes, uncompressed. Run it through gzip, and you could probably fit it onto one cd, definitely 2.
But then again, I could be wrong.
"Each chemical that forms your DNA" is adenine, guanine, cytosine, or thymine, and we've known the chemical structure of all those for decades.
This website says that we have about 3 billion base pairs, 30 thousand of which are genes (the rest is the mysterious "junk dna"). There are 4 base pairs, therefore each base pair is 2 bits of data. That's about 7.5kb for all the genes, and 715MB for every base pair - which after compression should fit comfortably on a standard CD.
Many links can be found at:
... but one link near the bottom was rather interesting. Thymine in particular is worthy of checking out. AIFF format.
linkage.rockefeller.edu/wli/dna_corr/music.html
I can't say that I tried them all
www.healingmusic.org/SusanA/order.html
As another person who replied to this, I'd like to reiterate that the chemical composition of DNA is known. Composed of four different nucloside triphosphates (GATC) in an dynamically ordered structure.
If I follow your train of thought, than all of genomes that are sequenced are worthless to me and the scientific community because we aren't "the same company who made the CD".
Look here at the National Center for Biotechnology Infortaion's Genomic Database. I'd assume you would receive something similar to this from Venter's group.
Also one can FREELY browse the human genome and look for differences between your genome and those used to construct this draft of the genome.
More speculatively, there may be other things we dont know about yet that get a free ride from mother to child. To be very speculative, certain protein sets might very well influence the exprression of your genome. That is to say different developement.
This is not an unreasonable hypothesis, despite its high degree of speculation. Your and my Genonomes are so similar it is reasonable to suppose our differences arrise in part from HOW the genese are expressed. Expression is regulated by proteins in the cell that contains the DNA. Thus implanting your genome in another cell might not produce the same phenotype individual despite the common DNA.
Some drink at the fountain of knowledge. Others just gargle.
Yes and no. I just tried gzipping chromosome 22 (one of the smallest) - it goes from 35MB to 10MB. The entire genome is about 3.5GB. However, keep in mind that the repetition isn't perfect, because from what I understand repeat motifs are more like regex's than simply the same sequence over and over again. A custom compression scheme could probably do much better than gzip.