Using Bacterial DNA For Data Storage
NPV writes "January ACM Communications has an article on the use of DNA in genetically modified bacteria to store information. This is an attempt to achieve the ultimate in archival storage (one of the modified bacteria can tolerate 1000X more radiation than a human being). Now just suppose that the "junk DNA" in the human genome is the documentation package for the machine code. Who wrote that manual?" Here's the article abstract.
Who wrote that manual?
I think the important question is... who has IP rights over it?
The Raelians, duh! That's how come Clonaid is so far ahead of other human cloning efforts... they read the documentation.
Java: the COBOL of the new millenium.
So when one of these engineered bacteria wipes out the human species, and some alien species comes along and ganders a look, the bacteria will be carrying a precise record of how we humans fscked ourselves.
--- Grow a pair, liberals... stop letting the Republicans bully you!
so much for P2P networks, if anyone wants the new Apache release, I just sneezed.......
(one of the modified bacteria can tolerate 1000X more radiation than a human being).
I haven't read the article (don't have access to where I am) nor have I thought about this subject much, but one question I have is how the authors keep the sequences under selective pressure. DNA sequences are only conserved over many years if evolution needs them. Non-coding regions (So called "junk-DNA", poor choice of words, btw) would easily mutate into other sequences. One could imagine sequencing many cells, and infer the original sequence, but this gets more expensive as time goes on (as the number of sequences you need to sequence goes up).
-Sean
Scientist have discovered that humans and all life on earth was just a discarded bacterial disk drive from a geek with pimples living in his mother's basement 5 million light years from the solar system.
That keeps four copies of it's DNA in rings and error checks constantly. They're probably using one of these, as it happens to be very radiation resistant, I'm guessing they used these, and so the mutation rate would be very, very low. So it wouldn't keep forever, but would for a very long time.
You could also put error checking (parity, checksums, etc) so once you found some bactera you could check to make sure they had the right version and not a mutation
autopr0n is like, down and stuff.
Just to be clear, no non-coding segments have been found in bacteria yet (last I heard). So putting data in as 'junk-DNA' in humans is quite a bit different from interrupting a fully functional bacterial DNA segment with the data to be stored.
Also note that the introns in eukaryotes are highly mutable (look up 'tandem repeats' if you have the inclination), so the fidelity of the data would be sacrificed by putting it there. The longest lifetime for the data would be achieved by tricking the replication machinery into thinking the segment was an exon, which would involve tying it to a functional protein that would be absent were the sequence to be mutated.
Duplication of the data would also work, but it would only hammer down the probability of mutation, since the probability of a point mutation of a base at the same location in two widely separated sequences is roughly 10^-18 to 10^-17 per year for exons.
To be entirely fair, they were using a brute force mechanism and dealing with a changing, hostile environment. We can use a controlled environment.
Yet I don't see this hitting the market in the next ten years.
I remember about eight years ago an article about how the future of storage was going to be in a frozen solid containing bacteria that change shape when a certain intensity of light hits them -- two lasers, each with half the requisite amount of light, would shine in to cause the bacteria to change shape where they met. Terrabytes in a little cube. Never happened.
May we never see th
I think that you may have your terms a little mixed up. An intron is the DNA between exons (coding regions) in a gene. i.e.
o n- --junk---junk---junk.
junk---junk---junk---exon-intron-exon-intron-ex
The junk DNA often referred to is mainly intergenic DNA, and this is where most of the non-coding DNA is found. This also makes up the majority of the eukaryotic genome. Prokaryotes (bacteria) do contain intergenic DNA, but no introns.
...if only the machines had used the humans for data storage!
Morpheus coulda pointed to a SAN/NAS box!
Instead they make a duracell commercial and mumble about the "human body generating more bio-electricity than a 120-volt battery and over 25,000 BTUs of body heat."
Ok I'll quit ze bitching... it was spiffy anyway.
Interactive Visual Medical Dictionary
Just to be clear, no non-coding segments have been found in bacteria yet (last I heard).
My first impluse was that this is way off. I'm used to working with plasmids where frequently like 60% of the sequence is junk. They use E. Coli and D. radiodurans in the study mentioned in the article. A brief survey of E. Coli K12 (the parent of most common lab strains) sez that about 5-10% of it is non-coding. The old initial reference claims about 11% is non-coding, but a good chunk of that may be regulatory. The radiodurans genome is about 9% non-coding. The up shot is that there is actually a fair amount of 'junk-DNA' in (at least the Coli) bacterial genomes. Not a lot by human standards but enough to be able to squeeze in a chunk here or there if you're careful.
Another impulse was 'gad... that made it into Nature!?' (the journal, the article cited is a self congratulatory summary of their Nature paper). A lot of it follows a well duh kind of reasoning. 'Well duh' science is often the really good kind, but I wasn't particularily amazed by this. The DNA manipulation methods are beyond standard now, the only really clever thing was proposing the use of radiodurans as the host. Even that was sort of obvious (a blazingly well studied organism that is transformable). The DNA -> text using a 6 bit space? Well if you've ever designed linker regions in proteins I'm sure you were at least thought about spelling out you name or something in amino acids (unless your name is BOB). In part this is because every one learns the amino acids by doing stupid things like spelling out their name. Few people actually do this, mind you, as it usually would have some deleterious effect, but the point is I'm sure they weren't the first ones to try something like this, probably just the first to get funded to do this explicitly. Their big addition was to come up with a 3-letter code that includes all the letters and, ooo, punctuation. Then they spelled out bits of 'It's a small world.' My point is that it's not that far fetched and a bit surprising (to me) that it made it to Nature.
As to the utility of these things for information carriers... Mutation would be a problem in the long term. Sure radiodurans would survive nuclear war (these guys put cockroaches to shame) but they do it using lots of mismatch repair and recombinatorial repair methods. These are not perfect repair systems, they can and frequently do introduce many errors, especially in non-essential DNA space. Tying it to a functional protein isn't a bad idea, but unless the added sequence adds some survival advantage it won't enhance the lifetime of the measage (ie. if uncorrputed data gives an advantage then it is statistically less likely to propagate). Also, as you mentioned, the bacterium might notice long chunks (they're using 100 characters here) of useless DNA and excise it. For that kind of text, it might be better to just etch it into stone or something, at least you have some hope of seeing it intact in 2000 years.
"All right. Which one of you bastards put the penicillin in my hard drive?"
Reminds me of that Star Trek episode The Chase, in which Dr. Galen, Captain Picards old Archaeology professor, found genetic data-blocks from various species around the galaxy stored in the junk portion of each species DNA, including our own. When a sufficient number of these data blocks were put together it completed a stellar map, identifying the precise location of the original origin of life on out planet and countless others. The jury is still out on the Panspermia Theory, but my own hunch is that there is lots of intelligence out there vastly older and greater than we are.
Planet P Blog - Liberty with Technology.
www.enthea.org