Burn your genes on CD -- for $500,000
An anonymous reader writes "Venter says he plans to offer the service, with the goal of burning individual human's entire DNA sequences onto shiny compact discs.
It will cost about $500,000 per person, says the entrepreneurial scientist who helped decode the human genome. "
If you bring your own disc, that'll only come to $499,990.00
what it would sound like. Take the data on the CD, convert it to MP3 or OGG and then have a listen. While most of our "songs" would sound the same, I suppose some filters could be applied to record only the major differences. It might make for some interesting electronica.
This article has been done before!
And it doesn't cost me a penny!
Government of the people, by corporate executives, for corporate profits.
From this first post: "Craig Venter, Time Magazine's Person of the Year in 2000 has a new hobby: collecting rich people's DNA. Millionaires are lining up to buy their personal gene maps for the cool price of USD$621,500."
This is the sort of thing I'd like to see put on a satalite flying through space, for possible future contact with intelligent species. Then they would have a good chance to study other lifeforms, even if we are long gone.
The snow doesn't give a soft white damn whom it touches. -- ee cummings
Gives a whole new meaning to a "bug" in your software...
than trying to find a suitable, willing girl to carry my genes, and probably almost as much fun, too!
Just more proof that, one day, the RIAA will indeed own all of us.
cool, now i can get the DNA sequence of someone i really dont like, and use it as a coaster for my coffee at work.
"gee, if im using your genetic sequence to keep my desk clean, chances are i dont really care for your opinion either, huh?"
How long till the RIAA finds out how this violates the DMCA?
I don't see what the consumer gets for $500K, but I do see what the vendor gets: your DNA and a big chunk of money.
1) What keeps them from exploiting your DNA for their profit? Suppose they discover something profoundly unique about your DNA that has significant medical implication. Who has the rights to that information?
2) How is the information encoded on the CD? Is it proprietary or some kind of de facto standard? (Oh, so you want to use the information? We'll have to read that for you! $100,000 per reading!)
3) CDs last forever right? Thirty years from now I'll be able to use the information on that CD, right? Didn't think so.
Remember, this is the guy who swapped HIS OWN DNA with the "random sample" that was supposed to represent all of humanity. Maybe this DNA-on-a-CD scheme is what he wanted to do all along?
Humpty Dumpty was pushed.
Sources say there's about 3 billion base pairs in the human genome. If we assume a reasonably efficient encoding scheme, we can get 4 base pairs into a normal 8-bit byte without compression. This gives us a total data size of a little over 700 megabytes, uncompressed. Run it through gzip, and you could probably fit it onto one cd, definitely 2.
But then again, I could be wrong.
Why does it cost half a million dollars to get your genes on a cd when you can get 'em put on a t-shirt for 50 bucks?
"Sic Semper Tyrannosaurus Rex."
"Each chemical that forms your DNA" is adenine, guanine, cytosine, or thymine, and we've known the chemical structure of all those for decades.
Chromosome 1:t tagt atcgatcgttagctactggtactgtgatgctgtgatgcgtatcgtatctg tgatgcgtatgctgtgatgctgtgggtggtgtggtgattatatatataaa atattttaaaaaaaaaaaaaaaaaaaaaagtgtctgtatgctgtgagctg tgactggttagtggcgtgcgcccccccccccccccccccgtattgggatt atttattatattatatatattatctctatcgcttctgcgtctgctgtgct gctgtgctctctcttcttcttttttttctctctcccgcggcgatgcatgc ggtcttgatcgttaggcttgtatgcgtggtacgtgatgctgtgtctgagt ctggtggatggtctggtctgatgcgttggattgc
atgcgcctagtttatagcgagcgtatgctgatcagtctggtatgg
Linux - Because Mommy taught me to Share.
Will a CD like this get me through the express line at an airport, regardless of whether or not I wear a turban.
/^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,4}$/i
This website says that we have about 3 billion base pairs, 30 thousand of which are genes (the rest is the mysterious "junk dna"). There are 4 base pairs, therefore each base pair is 2 bits of data. That's about 7.5kb for all the genes, and 715MB for every base pair - which after compression should fit comfortably on a standard CD.
Could i get my genome sequenced onto vinyl?
"Sic Semper Tyrannosaurus Rex."
From the Human Genome Project FAQ:
Q. How big is the human genome?
The human genome is made up of DNA, which has four different chemical building blocks. These are called bases and abbreviated A, T, C, and G. In the human genome, about 3 billion bases are arranged along the chromosomes in a particular order for each unique individual. To get an idea of the size of the human genome present in each of our cells, consider the following analogy: If the DNA sequence of the human genome were compiled in books, the equivalent of 200 volumes the size of a Manhattan telephone book (at 1000 pages each) would be needed to hold it all.
It would take about 9.5 years to read out loud (without stopping) the 3 billion bases in a person's genome sequence. This is calculated on a reading rate of 10 bases per second, equaling 600 bases/minute, 36,000 bases/hour, 864,000 bases/day, 315,360,000 bases/year.
Storing all this information is a great challenge to computer experts known as bioinformatics specialists. One million bases (called a megabase and abbreviated Mb) of DNA sequence data is roughly equivalent to 1 megabyte of computer data storage space. Since the human genome is 3 billion base pairs long, 3 gigabytes of computer data storage space are needed to store the entire genome. This includes nucleotide sequence data only and does not include data annotations and other information that can be associated with sequence data.
As time goes on, more annotations will be entered as a result of laboratory findings, literature searches, data analyses, personal communications, automated data-analysis programs, and auto annotators. These annotations associated with the sequence data will likely dwarf the amount of storage space actually taken up by the initial 3 billion nucleotide sequence. Of course, that's not much of a surprise because the sequence is merely one starting point for much deeper biological understanding!
Contributions to this answer were made by Morey Parang and Richard Mural formerly of Oak Ridge National Laboratory; and Mark Adams formerly of The Institute of Genome Research. [01/01]
"For every right, an equal responsibility..."
burning individual human's entire DNA sequences onto shiny compact discs
I can do that for less than $500k:
Ingredients:
One CD (make use of an AOL one for a change).
A skin or blood sample.
Preperation:
Put all the ingredients into a casserole dish, preheat oven to gas mark 9. When ready place casserole dish into oven and leave until black acrid smoke comes out of the oven. Et Voila, your DNA 'burned' onto a CD.
A nice keep sake for years to come! And as Nigella Lawson would say, "Absolutely Scrumptious"!
Are you local? There's nothing for you here!
Sure, it's expensive, but think of the value! I mean, that's just pennies per gene! With all of that information you can.. uhh.... erm..... prove your genetic superiority! I mean, after they sequence your genes and find out that you share 99% of your genes with every other human on the planet, you can use the remaining 1% to find out absolutely nothing that you didn't already know about your phenotypical characteristics! I know -- I'm just getting too excited.
Now I just have to sell my stock in Venter's enterprise to affod it --- oh wait: I OWE 500 grand on ledger. Silly me!
Cheers,
-----[0_o]-----
We are not amused.
Or you could just take high school biology and not worry about these mysterious 'chemicals' not being actually present on the CD. You also can't use the CD to impregnate women with, but I don't think anyone's gonna complain about that.
Endless arguments over trivial contradictions in books written by ignorant savages to explain thunder in the dark.
I wonder if this would violate ebay's rules against biological items or not.
T Money
World Domination with a plastic spoon since 1984
And it's more usable in a biological format too :)
"Backups are for wimps. Real men upload their data to an FTP site and have everyone else mirror it." -- Linus Torvalds
""Venter says he plans to offer the service, with the goal of burning individual human's entire DNA sequences onto shiny compact discs. It will cost about $500,000 per person, says the entrepreneurial scientist who helped decode the human genome."
Even though it's you, you know they will copyright it.
And even though it's you, you know they will prevent you from copying and sharing it.
Bad what people do for money.
@fbiTopTen = wget("http://www.fbi.gov/mostwant/topten/fugitives /fugitives.htm");
foreach $criminal (@fbiTopTen) {
1. Download $criminal iso image from Kaaza
2. Clone
3. Put $criminal in enhanced growth chamber.
4. Get Lunch
5. Take $criminal to FBI and collect ransom
}
Profit!
Live web cams
As another person who replied to this, I'd like to reiterate that the chemical composition of DNA is known. Composed of four different nucloside triphosphates (GATC) in an dynamically ordered structure.
If I follow your train of thought, than all of genomes that are sequenced are worthless to me and the scientific community because we aren't "the same company who made the CD".
Look here at the National Center for Biotechnology Infortaion's Genomic Database. I'd assume you would receive something similar to this from Venter's group.
Also one can FREELY browse the human genome and look for differences between your genome and those used to construct this draft of the genome.
What is the chance that you'll have an archaic CD-ROM drive in your computer to read it????
This looks like one of those old stories about "overcoming his death by sending a message to the future".
Presently the only good potential customer I could see would be the one that dreams on making a copy of himself when medicine gets to advanced to achieve what can't do today. This thing goes on the same wave as the frozen cadavers, frozen human cells and the frozen human DNA. However we know that all this risk to degrade in time. So the idea of writing up one's DNA would be an intersting solution to these drawbacks. Is it?
No. Because CD are also not eternal. And besides there is a huge difference between genes and what comes up after. We humans are the less genetic species on Earth, and every detail on character, behaviour and knowledge is mainly a product of our everyday experience. We are formed under the circumstances we grow up, the conditions of our family, society and the world in the whole. Besides every single piece of experience can be very fundamental to our character.
Let's remeber an old tale that many people used on several SF tales - Adolf Hitler's clones. Would Adolf Hitler revive from his genes? Absolutely not. His copies wouldn't ever seen his strict mother and his father with that very character of an austriac small burocrat. He wouldn't have suffered that poisoning in Ypres battlefield and wouldn't have seen the turmoil of the Russian October Revolution beating on the doors of Germany. He would not be the same racist bastard because his antecessor managed to wipe out a good piece of Jewish population in Germany and this populistic view that "jews are to be blamed for everything" is hardly to be overused today. Who would really be the new Adolf Hitler is hard to predict. However I would believe that his fate would not be shinny. Because he would not have parents, his artificiality would probably hunt him for the rest of his life and society, with its stupidities, faiths and superstitions would always mark him as the "Butcher of the World".
Well, probably soon we will have "an holographic image of your brain on DVD" together with instructions to reproduce it... But even then I would hardly believe that anyone may get ready for eternity. What would happen if I suddenly travel 1000 ahead from now? Well, let's take someone 1000 before us, and think the SHOCK he would get:
Boxes showing people or talking.
Mettalic tubes that spit fire and make huge thunders.
Big metallic things that move without horses, some EVEN FLY like birds.
People talking to each other on distance.
Fire that burns without wood.
Some strange boxes, made of metal and something like glass, where people write some strange symbols that look like letters and pass huge amounts of time on them. Some of these boxes even play songs or seem to talk. Others show demons, dragons and even trolls.
In other terms - The Hell...
Interesting. I had assumed the whole purpose was to make a back up. Compared to a contract with IBM it seemed like the price was justified.
Not quite. The estimate is 30000 *genes*, at an average size of 3 kbp apiece. That means ~90 Mbp of coding sequence.
You couldn't even make a bacterium with 30 kbp of coding sequence.
- This website [ornl.gov] says that we have about 3 billion base pairs, 30 thousand of which are genes (the rest is the mysterious "junk dna"). There are 4 base pairs, therefore each base pair is 2 bits of data. That's about 7.5kb for all the genes, and 715MB for every base pair - which after compression should fit comfortably on a standard CD.
Thanks for doing the calculation. Kind of magical in several ways:- Even including the junk - not very much by today's information-processing standards. Given some outrageous tech, it would be possible to re-create "a" human race with ten or twenty CDs.
- Talking about outrageous tech: a CD full of information squashed together in a space it takes an electron microscope even to see. We have some catching up to do
yes, we have no bananas
I can burn my genes on the stove for free. Heck I can burn my jeans too while I'm at it.
The sequence probably compresses very well, too. My understanding is that human DNA sequences are relatively low entropy--significant portions of human DNA are repetitive sequences that don't encode a protein (i.e. they don't belong to a gene), and the fact that they're repetitive lends themselves well to lossless compression schemes like Huffman coding.
"It take 9 months to bear a child, no matter how many women you assign to the job."
You're confusing encoding with compression. Something composed from an alphabet of 4 letters is already essentially encoded in base 4, and can be encoded in binary using 2 bits per letter.
How much you can compress the resulting bit stream completely depends on the nature of the data, and without knowing something about the actual data patterns in the genome, it's not really possible to know in advance how much it'll compress.
IIRC, the accuracy on the current sequencing, while not awful, isn't perfect. Is the company liable for errors in the half-million dollar CD you're buying?
May we never see th
Anyone who buys it won't find out they have the Sucker Gene until they get their disc.
If I wasn't so lazy, I'd have a sig.
More speculatively, there may be other things we dont know about yet that get a free ride from mother to child. To be very speculative, certain protein sets might very well influence the exprression of your genome. That is to say different developement.
This is not an unreasonable hypothesis, despite its high degree of speculation. Your and my Genonomes are so similar it is reasonable to suppose our differences arrise in part from HOW the genese are expressed. Expression is regulated by proteins in the cell that contains the DNA. Thus implanting your genome in another cell might not produce the same phenotype individual despite the common DNA.
Some drink at the fountain of knowledge. Others just gargle.
Midochondrial DNA has an extremely low mutation rate. I believe that as a result their geneic diversity is pretty small. The differences between your midochondria and mine would be slight.
"Never, never suspect the dreams within the dreams of dreaming children." ~The Amazon Quartet
first of all, we are talking about genotypes and not phenotypes (phenotypes are impossible to put on CD for obvious reasons). secondly, the human genome is roughly 3 Giga Basepairs, which, if you consider that you need 2 bits to store one basepair, gives you just over 715MB, with some compression that happily lives on a CD.
Of course the vast (vast) majority of that is identical for all people, so you only need to store the differences.
sic transit gloria mundi
Yeah? Well I can burn my DNA into a live human being with just a few pelvic thrusts.
eTrade SUCKS
I'm certain that genetics will have a great deal more to do with the field of medicine in the future. I'm not sure why you bring up drugs, except as an inference (there is no reference to drugs in the item). Anyway the "article" is not about drugs nor is it about Genetics, the science. It is a thinly veiled advertisement for Venter's business; a press release and sales pitch disguised as news. It is designed to appeal to the egos of the rich, while suggesting to everyone that it is for the good of all humankind. Bah.
Sigs are bad for your health.
Why pay $500,000 for my genetic code on CD when I can just get it off of kazaa or gnutella for the cost of bandwidth? I mean really.
You see? You see? Your stupid minds! Stupid! Stupid!
This of course brings up a different possibility - watching your gene map on TV. Dark room, techno music, maybe a little beer - quite the show.
Schnapple
Yes and no. I just tried gzipping chromosome 22 (one of the smallest) - it goes from 35MB to 10MB. The entire genome is about 3.5GB. However, keep in mind that the repetition isn't perfect, because from what I understand repeat motifs are more like regex's than simply the same sequence over and over again. A custom compression scheme could probably do much better than gzip.
Interesting, interesting....
AFAIK, chimpanzee DNA has never been fully sequenced(?). Comparisons have probably been made using simpler DNA typing procedures.
However, by now we have at least two complete human DNA's. Has anyone run a diff on them to determine exactly how much difference there is between two not exactly random but anyway humans? Assuming the DNA is 99.9% identical, your chromosomal uniqueness (stored as a diff from some Standard Human DNA) should fit on a single floppy!
Heck, using this scheme, you should be able to store the DNA code of every single human being alive in < 10 PB, soon within reach of SAN storage clusters. The mind boggles... Of course, that would cost < $5e15 with current pricing. Maybe I can get a volume discount.
BTW, is anyone working on mapping mitochondric DNA? How large is that, anyway?
I choose to remain celibate, like my father and his father before him.
It's alway good to have a backup copy of your genes in case you have to reformat / reinstall yourself.
Ideology is for ideots.
And even better is if there were a standard template for the "average" human genome. Then you can just show where your genes differ, and that will uniquely identify you. And you'll cut the size into almost 1/1000th of storing the full data. Of course, you would have to have the standard human template to compare it to, to get your full genome. I just imagine people going to pharmacies or doctors and they swipe their DNA card to get their full genome. If the pharmacy/doctor kept a standard human template, you can cut down the data each person needs to carry by a huge amount.
yeah, i had a friend who partitioned himself wrong and went to unichs (unix)
Disco Stu was talkin' to you.
The backbone is made up of deoxyribose (a sugar) and phosphate. It is quite different from a protein, as you indicated..
Also - don't be scared off by the 3.5GB figure - that is probably encoded as a text file which gives 8 bits per base (so that is 4X space waste right there). Plus it probably has carriage returns and position indexes intersperced, which is some more overhead.
Man, when Jr. destroys that CD (as he has many others), the propagation of my DNA will be halted...
Love many, trust a few, do harm to none.
While the data-density of DNA is pretty high keep in mind that the fidelity of replication is VERY LOW by computer-standards (though quite impressive by just about any other standard). Mutations happen all the time - usually in non-critical regions. Computers are not very fault-tolerant - you would need a lot of error-correction overhead if you wanted to use DNA for data storage. And it doesn't store very well. You can't just take some DNA and stick it on a shelf for 5 years and expect it to make it unharmed. When scientists extract DNA from dead animals it is quite fragmented, but certainly good enough for identification/ classification purposes. Individual strands of DNA are quite fragile. Just moving it around without shearing it is difficult. (Picture a piece of spaghetti a million miles long and try to move it without breaking it - you're talking a length-thickness ratio along those lines.)
Actually it's pretty unlikely, I would guess. It depends strongly upon how the brain has been preserved - if it's in a strong formalin solution then the DNA is largely unrecoverable. There are methods for getting some DNA out of formalin-fixed tissue, but it wouldn't be an easy job.
I don't think the information would be a lot of use anyway until a LOT more is understood about brain development, and that's still assuming that whatever made Einstein's brain so brilliant was completely genetic in anyway. In utero environmental factors and probably lots of other factors we don't even know about yet might play a role. Make a complete DNA copy of Albert and you might just end up with an unusually bright kid, but not a world-class genius.
Before I plop down 500k, does anybody actually know if this sounds good? I mean, I could pay a lot less for bad music.
Yeah, "burn your genes"! Now!
After you have burned your genes on CD put the CD into the microwave.
Oh what fun.
which after compression should fit comfortably on a standard CD.
We're assuming lossless compression, right?
Don't want to lose any important bits....:)
Anyone serious about preservation of his/her DNA would best save $498,000 and bank their blood or other bodily fluids for 10 or so years. Some companies even offer payment plans on that $2000 charge.
In ten years time the technology to sequence quickly will allow for this operation to be done at 1% of today's cost. (Yes, I will put money on that prediction.)
Of course when the time comes, if you really want to keep the sequence a long time, I wouldn't suggest CD-ROM. With a shelf-life of 50-200 years under optimal conditions, you'd be better with a book printed on acid-free paper. There you're looking at a shelf life of half a millenium or more under the same conditions.
Mad scientists create an amusement park with the theme of cloning a post-neanderthal whose DNA was discovered during the government recall of all "Boy Band" music.
When the experiment goes awry, a young female tourist saves the day with her knowledge of Unix workstations*.
*UNIX is a registered trademark of William H. Gates DCXVII
WTF sense does this make. You want mt DNA? OK scratch some skin, take some blood.
Right, I'm not taking the small character set into account, and in this case there are newlines (but not position indices)- which might reduce the apparent repetition as far as gzip is concerned, but doesn't do much to the file size. However, there are not _really_ just four characters in any of the files, because the genome still has so many gaps. Centromere regions are a bitch to sequence, and right now the small arms of even the well-covered chromosomes are virtually blank (as far as I know), mainly because they're mostly rDNA and not as interesting. So you need to add the 'N' character, because for the time being you'll never get a completely sequenced chromosome. Even considering that sequencing technology is getting better and better, some stuff is just a pain in the ass and not really worth the extra effort in most cases.
I guess the obvious snide in joke to make here would be that since Venter's talking about this, we can assume that everyone who goes for this will end up with quite a few N's in their sequences. (sorry, I've heard a few too many Cato Institute types use Celera as an example of why we should eliminate the NIH. morons.)
this is pretty close
http://www.ncbi.nlm.nih.gov/UniGene/
and as for the poster 2 up -- over 98% ov genetic material between humans is the same. current estimates are that only 30-50megabases differentiate any one huamn from another.
I would be REALLY surprised, as his name is J Craig, not Greg.
sorry, I couldn't resist.
what I'd love to see is the "create a kid" feature:
Pop in your CD and one from a potential mate -- then push the button and have the software generate the array of potential vital stats on offspring. propensities, diseases, height and weight, correlations. as long as I'm dreaming, a good face shot too would be nice too.
it's a long way off. probably not in my lifetime, but it'll be here within 100 years.
(the rest is the mysterious "junk dna")
:)
Until we become collectively aware of its purpose, anyway.
Do you like German cars?
It sounds like you're trying to compress an ASCII file with a bunch of letters representing the genome data. This is going to compress a lot better than an encoded file with two-bit bytes due to the large number of repeated "garbage" data in the ASCII representation. That's not to say that the encoded representation wouldn't compress well, perhaps with a customized compression algorithm. Run-length encoding would probably catch a good number of runs.
Josh Woodward
I guess you have to look at the goal. For Ventner, 99.999% is good enough (though it leaves huge regions unsequenced). Actually, just seqencing the ESTs would be good enough - his main goal was to patent important genes and sell important sequence data to companies. Nobody knows enough to market the non-coding regions - though I personally think they'll be just as important once everything has been figured out. But this is more along the lines of basic research - which is of little interest to companies like Celera - and one of the reasons that the NIH is still important...
I couldn't agree more on the "junk" DNA.
I think part of the problem is that DNA has to accomplish a lot of tasks. It has to mutate at just the right rate, it has to be pretty durable, it has to carry data, it has to be regulatable, it has to fit inside a confined space, and it probably has to do a lot of things nobody has thought of yet. Bacteria and viruses go for the most coding DNA in the smallest space. Eukaryotes go for the most versatility. Having all that non-coding DNA is a bit of a luxury when you think about it, but eukaryotes aren't driven by efficiency. Maybe a megabase of seemingly repetitive DNA convers a very slight advantage - maybe due to the physical structure of the DNA, or maybe due to the ability to implement a fancy regulatory system using it. For a bacteria, a megabase of extra DNA would be unthinkable unless it conferred a tremendous advantage. For eukaryotes, the cost of having that DNA is much lower, and so you are more likely to keep it around in case it comes in handy.
Picture these scenarios:
1. You live in a 2 bedroom apartment with a wife and three kids. Property costs a fortune in the area - you couldn't get more space without paying a FORTUNE for it.
2. You live in Bill Gate's house. Land costs 1 cent per acre, and you can build a 500,000 square foot warehouse for $29.95.
If you live in world #2 - would you ever throw anything out that didn't smell? Just the remote chance that it could come in handy later would make things worth saving. Somebody who purused your warehouse might not be able to figure out what some of that stuff is for - you don't seem to use it at all. You know what it is for, but you don't use it since you replaced it with something better years ago and are only keeping it since it doesn't cost you anything.
If you lived in world #1 there would be nothing in your house which isn't absolutely essential. Anybody who studied your house would figure out pretty quickly what everything is for. After all, they could see how you use it every day.
Chimp DNA is being worked on right now; not sure of the ETA, but you can be certain that this will make a big splash when even a draft sequence is released. Most comparison made between chimp and human DNA so far have either been simple hybridization experiments, or comparisons of tens (now hundreds) of genes sequenced in both species. The short story is: wow, these genomes are close, but on the other hand we are *rather* different from chimps in many ways that matter quite a lot.
Babar
human DNA is about 99% the same as any other human, chimps DNA is about 94% similiar then any other chimp.
Chimp are more diverse because they have been around longer.
The Kruger Dunning explains most post on
"Albert and you might just end up with an unusually bright kid, but not a world-class genius."
good enough.
A cd of Alberts DNA might sell. If it was 5 bucks, I might buy it for the novelty sake.
The Kruger Dunning explains most post on
I've heard that humans share something like 97% of the same genetic code with a mouse and like 99% of the same code with a chimpanzee.
If that's true, then you could save a substantial amount of space by just burning the diffs between yourself and a standard chimp.
Well, in some cases around here you might not need a whole CD.
A floppy would probably suffice for all the diffs between some of us and the chimp.
"Provided by the management for your protection."