Human Genome Contaminated With Mycoplasma DNA
KentuckyFC writes "The published human genome is contaminated with DNA sequences from mycoplasma bacteria, according to bioinformatics researchers who blame an epidemic of mycoplasma contamination in molecular biology labs around the world. The researchers say they've also found mycoplasma DNA in two commercially available human DNA chips made by biotech companies for measuring levels of human gene expression. So anybody using these chips to measure human gene expression is also unknowingly measuring mycoplasma gene expression too. The mycoplasma genes are clearly successful in reproducing themselves in silico raising the possibility that we're seeing the beginnings of an entirely new kind of landscape of infection. One option to combat this kind of virtual infection is to protect databases with the genomic version of antivirus software, a kind of virtual immune system. But this in itself could make things worse by triggering an evolutionary arms race that selects genes most capable of beating the safeguards."
why do i feel like im getting punk'd??
...clearly uses DRM.
Haven't heard that one before...
My doctor has been insisting I have a genetic defect because I lack the gene for flagella.
would they wind up with Swamp Thing?
Look back up at my post, now look back down, you're on the Internet. Now look back up. I'm a signature.
I wonder exactly how far medicine has been set back by this. Researchers trying to investigate genetic or genetically-influenced diseases cannot be happy. They've unwittingly been missing information, and treating false information as valid, for a long time.
But this in itself could make things worse by triggering an evolutionary arms race that selects genes most capable of beating the safeguards.
Why is the word "evolutionary" used here? We're talking about static data that is not "executed" - it does not reproduce, it is only copied verbatim. Invalid data that bypasses filters ("antivirus software") is simply that - corrupt, invalid data that does not belong, but at least there will be less of it after filtering. That doesn't make the data somehow more powerful or adaptive - the filter merely missed it. The key fact is the data does not get to modify itself in an iterative fashion in order to survive or improve.
Better known as 318230.
At first I was relieved that this was a bacteria infecting silicon. Now I'm concerned: When will Avast release an Antibacterial beta? I'm still running Windows, folks! I know I'm vulnerable to this!!!
khasim (12/9/06): In a blind taste test, more people preferred Coke over the Pepsi that I had previously pissed in.
So is this term to be applied both virtually and physically? While it's defined as something via computer simulation, it's also being stated that the mold is physically showing up on the chips. Is this then a concern for labs on a chip, or for that matter, any current fabrication processes? Or did I misinterpret the concept?
No resources are freed for any future generations of database contaminants to breed on by filtering. And, the notion of evolution would also require changes to the contaminants, which don't really happen. So by all means, filter. It will leave harder-to-detect contaminants there, but they won't become more numerous.
Is the human genome contaminated or is it the published sequence that is contaminated?
If only the latter, fix the title.
This was my first thought when reading the summary - there is no source of feedback to make the data filtering mechanisms loop into the evolutionary design of the bacteria involved unless - and this incredibly out there - everyone starts applying methods to kill the bacteria only based on the amount of corrupt data they scan in - then *maybe* it would be possible for those bacteria with lesser differences to make their way into the system while the rest die out, but that is incredibly unlikely given their otherwise blind nature to the whole process and the fact it would impose evolutionary constraints in direct contradiction to what makes them able to survive - not absolutely impossible, but detracting from the actual story here with such nonsensical hype would be like suggesting the same evolutionary arms race in the same situation because darth vador were to rise out of the earth's molten core to proclaim dominance over the moon, leaving some new bacteria behind on his way - it doesn't make any fucking sense.
that part is nonsensical:
The mycoplasma genes are clearly successful in reproducing themselves in silico raising the possibility that we're seeing the beginnings of an entirely new kind of landscape of infection. One option to combat this kind of virtual infection is to protect databases with the genomic version of antivirus software, a kind of virtual immune system. But this in itself could make things worse by triggering an evolutionary arms race that selects genes most capable of beating the safeguards.
static data don't evolve
Jehovah be praised, Oracle was not selected
I'm still enjoying my mycoplasma infection, almost 30 years later.
My understanding is that nowadays new high speed sequencing machine can get an entire human genome processed in a couple of month.
So I would think that after a couple of independent runs one should be able to flush out the non-human DNA assuming the same bacteria contamination is not ever present?
Obviously this is not a cheap endeavor but given that there is quite a bit of commercial interest in using correct human genome data this seems to me to be a worthwhile investment.
I find it puzzling that the abstract of the article does not allude to this.
How in the world will setting filters on a database put a bacteria in a lab half way around the world at an evolutionary disadvantage? The bacteria will still grow, contaminate the sample, and get sequenced, but the sequence will be rejected. There is no feedback mechanism here, no selective pressure.
Genome sequence assembly is pretty far removed from the milieu in which a bacteria must make it's way. And inadvertently including bacterial sequences on a gene expression chip is sloppy science, but hardly news.
Traditional computer viruses are the only things that truly 'reproduce' in silico. Memes are your next best option, but the 'net is just a carrier - they have to infect a human host to reproduce. Stay away from 4chan if you want to avoid infection...
But bacteria? In silico? Where are we going with this strained analogy, anyway?
-V-
Who can decide a priori? Nobody.
-Sartre
For genome 3.1.
They're keeping the database in the form of actual DNA? Why not digitize it?
All humans save three have passed through flagellum stage. Their names were Adam, Eve, and Jesus.
Its not like the Data in the databases is going to reproduce.
So to what extent does this "epidemic of mycoplasma contamination" increase the potential for false-positives on DNA matching tests, such as used in criminal investigation or paternity cases? Does a given lab or lab-equipment manufacturer have a common strain of contamination which increases the number of "always match" markers above the threshhold defined for claiming a match?
Everybody gets what the majority deserves.
Who knew?
=^..^= all your rodent are belong to us
This is EXACTLY what happened in 70s-80s with Henrietta Lacks IMMORTAL 'HeLa' cell. http://en.wikipedia.org/wiki/HeLa
Her cells were the first Human cells to grow outside the human body.
In fact they were so successful, that unbeknown to scientists ALL OVER THE WORLD, her cells had TAKEN OVER all of the cells in their labratories GLOBALLY.
There is an amazing BBC documentary on this by Adam Curtis called "Modern Times: The Way of All Flesh"
wiki quote " Contamination: Because of their adaptation to growth in tissue culture plates, HeLa cells are sometimes difficult to control. They have proven to be a persistent laboratory "weed" that contaminates other cell cultures in the same laboratory, interfering with biological research and forcing researchers to declare many results invalid. The degree of HeLa cell contamination among other cell types is unknown because few researchers test the identity or purity of already-established cell lines. It has been demonstrated that a substantial fraction of in vitro cell lines — approximately 10%, maybe 20% — are contaminated with HeLa cells"
Almost created a COLD WAR incident:
wiki quote:-"The USSR and the USA had begun to cooperate in the war on cancer launched by President Richard Nixon only to find that the exchanged cells were contaminated by HeLa"
In the case of little Jeffery, Mycoplasma, you ARE the father!
So it would seem Evolution is favoring hackers, that breed well...
- Dan.
~ People that think they are better than anyone else for any reason are the cause of all the strife in the world.
As a career microbiologist and bioinformatics geek, the complete and utter scientific inaccuracy of this summary made me want to cry.
The mycoplasma genes are clearly successful in reproducing themselves in silico raising the possibility that we're seeing the beginnings of an entirely new kind of landscape of infection. One option to combat this kind of virtual infection is to protect databases with the genomic version of antivirus software, a kind of virtual immune system. But this in itself could make things worse by triggering an evolutionary arms race that selects genes most capable of beating the safeguards
Mycoplasma is a common contaminant of many human cell culture lines. It is often present in low counts, and is a relatively slow growing organism. This is a problem, because many of the immortal cell lines are passed serially, meaning that the mycoplasma propagates right along with it. Most labs that perform cell culture now do routine PCR testing for mycoplasma markers as a quality control measure.
When it comes to sequencing, and in particular, high-throughput next generation sequencing (Illumina/454/SOLiD/PacBio/whatever), you are shotgun sequencing all of the DNA in a given sample extract. This means that if you had a bunch of human cells, that happenned to be contaminated with low counts of mycoplasma, those mycoplasma sequences would be present to some extent in your final sequencing project. Whether this would factor into the final assembly, or just get thrown out depends on the quality control, experience of the bioinformatics team and assembly software pipeline. I am willing to be that most issues with mycoplasma contamination were during the "formative" years of high-throughput sequencing, but may have lingered in databases. These databases would in turn might used by commercial companies that build microarrays or other high-density tools, so it's feasible that some mycoplasma sequence carried over.
Is this relevant? Probably not. On a microarray, it would most likely be wasted space (eg: always negative during gene expression studies... unless the patient had a mycoplasma infection or something). Furthermore, a simple analysis of the sequence would help to rule out sequences that were clearly prokaryotic.
"In silico" does not mean what you think it means. In fact, this whole bit about in-silico replication and arms races is complete and utter nonsense. In-silico biology usually refers to biocomputing. Eg: analyzing, manipulating and simulating gene/protein sequences, expression, signalling cascades, and the like on a computer system. It does not apply to mycoplasma sequences running around all nambly pambly causing infections that would require some sort of anti-virus software. What they might be alluding to is the fact that a lot of shotgun sequencing libraries are run, as needed, through a vector screen, which is designed to pull out irrelevant sequences that may have been necessarily introduced during cloning or sequencing. Plasmids, cosmids, whatever. These algorithms may need better tuning to do a better job of ruling out mycoplasma in human sequences, but there's no danger of these mycoplasma sequencing replicating and taking over the world.
Unless you happen to be William Gibson.
So file could win against out robotic overlords afterall! So H. G. Wells was right after all.
This article was horribly written. They go between using terms with their literal meaning and using terms in metaphorical creative language but do not differentiate between the two using context at all. It's an incredibly confusing read. Actual ancestral human DNA is not contaminated with actual mycoplasma DNA sequences.
Here's what I gather is going on:
Researchers took a sample of human DNA and sequenced it, while doing so the sample was contaminated with DNA from mycoplasma (possibly from bacteria in the lab or on the researchers themselves). While sequencing it, the data is assumed to be a representation of pure human DNA (which would be incorrect). Other researchers then use this data set as a reference to compare other human DNA samples they sequenced themselves. They use this to test gene expression and so forth. So if their DNA samples show gene expression for mycoplasma they would incorrectly think it was normal human gene expression. What they did is use software to strip the mycoplasma DNA data from the original data set (that had both human and mycoplasma DNA sequences) to only use the actual human DNA data as a reference. The biological contamination was first in the original sample that was tested, and then the contamination referred to elsewhere is computational data "contamination." This is the software they are referring to as antivirus software and virtual immune system (which isn't antivirus software or similar to a biological immune system, it's DNA data filtering software).
These people really need to think about what they're trying to say before puking up jargon salad on the readers' brains.
it was all contaminated with Craig Venter's DNA to start with!
I don't want to be excessively harsh but the summary was seriously a bunch of drivel. In silico either means it's data on the computer, or that you are simulating a biological process computationally. But as other posters have mentioned, unless you are purposely simulating evolution, mycoplasma sequences in your human databases isn't going to cause any "arms race." Yes, it seriously screws with validity, but that's a completely different issue.
This is a generalization, and no offense to fellow Slashdotters, but in my experience most of the computer scientists that I've met have a really crappy understanding of even basic biology. CS concepts don't directly translate to biology ones.
Cogito, ergo sum, fosho!
Myco means fungi right?
i believe the evolutionary arms race was triggered a long time ago (possibly in a galaxy far far away) /. is so full of tripe today its making me question my patronage
Were these the patented sequences?
Paranoia is a Survival Trait!
Am I the only person who read this and thought Parasite Eve? The Game or the Japanese film, either way. Human genome contaminated with mycoplasma = mitochondria taking over the body's genetics (causing spontaneous combustion among other things.)
I did read the article and they're just talking about mycoplasma contamination that winds up being replicated into databases (eg taking over the human genome.)
Someone else mentioned the serious nature of the problem, in that DNA databases can false-positive if the match with both samples contaminated with the mycoplasma.
An actual, working molecular biologist here. Just left the lab, actually. Why the bloody hell are they sequencing from tissue culture? Buccal cells collected and purified or buffy coat derived from whole blood seems to be a more robust method of acquiring gDNA to sequence. I addition, why the f@&$ are they not testing for mycoplasma contamination? PCR primers and probes are widely available in the literature for testing for presence or absence, or just buy a bloody kit from the likes of Sigma or Ambion or Applied Biosystems or Invitrogen or Bio-Rad. It's easy to perform and should be a standard qc procedure wether you're working in commercial r&d or academia. The lack of a simple test that may compromise years of work (not to mention derivative works, too) can be chalked up to pure and simple laziness. As a scientist, all I have is my integrity- nothing else matters to me. To ruin years of data and derivative cell lines that have been passaged to other labs disgusts me, and anyone who isn't routinely testing in research that's meant for publication or production should be ashamed of themselves.
I went ahead and read the original article: More Mouldy Data: Virtual Infection of the Human Genome
The authors found two tiny submissions in GenBank that appear to be of mycoplasma origin, but mislabeled as Homo Sapiens: http://www.ncbi.nlm.nih.gov/nuccore/AF241217 and http://www.ncbi.nlm.nih.gov/nucest/DA466599
Having BLASTed the above sequneces against the Human Genome assemblies, all hits are noise (e.g. Poly-T stretch in the first sequence).
OK...so let me make sure I got this straight.
The germ that causes walking pneumonia is infecting computer database information, and inserting its DNA into the digital record of the human genome sequences. This corrupted information is in turn being forwarded erroneously to scientists around the globe...
Will you crackheads please call Bill Nye before you post any more garbled "Bachelor of Arts and I read this scary medical story in Reader's Digest-last minute term paper idea garbage."
Trust you guys to get anything remotely related to biologic transfer of genetic material correct...sheesh
Human DNA.... microbial DNA.... it's all just a soup of the same 4 damn chemicals anyway.
And it is EXACTLY what happened in JURASSIC PARK! The frog cells allowed some of the female dinosaurs to mutate into MALE Dinosaurs! We COULD be looking at RAPTORS who live in SILICA, and WE will ALL be typing in ALL CAPS out of sheer FEAR!
Gently reply
The whole "evolution" thing is the biggest sensationalist bullshit I've ever heard. Ignore it.
As was mentioned in another comment, it seems like the summary is misleading on the "contamination" actually being in the genome sequence.
The use of terms for sequence data and expression data are not interchangeable. The U133 microarray is for RNA, yes RNA, expression data. RNA microarrays quantify the fold change difference in expression between different subjects. DNA microarrays identify polymorphisms or repeats or the like. While arrays like the U133 rely on sequence level data to create the array, this is not the same as saying that sequence-level data is contaminated. Bottom line, the fact that this is not the cover article for Nature|Genetics this month tells you a lot of the story. Unless you are some sort of conspiracy theorist, or want to get swept up in the usual slashdot "sky is falling" imperative.
Except that glass is made from silicon.
This has been known for a long time. Bacteria and viruses fly around in the air. Anybody's who's sifted through genomic reads knows this. There's a noise level in all this high throughput genomics.
http://slashdot.org/comments.pl?sid=2261720&cid=36545928 Is it because you trolled someone http://slashdot.org/comments.pl?sid=2253808&threshold=-1&commentsort=0&mode=thread&pid=36521452 and they shot you down on every so called point you tried to make and they did it with documented facts anyone could see? I saw you run like a whipped dog 10 times there in fact, and for starting trouble with others you threaten blackmail Very intelligent (not).
So, is it possible a comparison of crime scene DNA with a suspect may match on bacterial contamination of the samples?
So the arXiv article (http://arxiv.org/abs/1106.4192v1) cited in TFA is a little more accurate than the already-much-panned summary. But the authors of the arXiv article still use the term 'virtual infection' which is very misleading at best.
Basically there are a few (actually only two described in the paper) entries in one particular human genome database maintained at EMBL that appear to be mycoplasma-derived. Two out of 45,000 features on the Affymetrix Human U133 +2 oligonucleotide array, used to quantify mRNA levels in a given sample, appear to correspond to the mycoplasma sequences. So, at best, two of the genes you look at (out of 45,000) might be mycoplasma genes in that particular type of experiment.
It's not a big deal, so that's why the work isn't published in a peer-reviewed journal.
once get our replicants out of the lab. Think of what the effects this will be on these new generation of bacteria-mutated "humans".
Seems a relative of this stuff has been used for other strange purposes: http://www.luxefaire.com/devilvision/appxhtml/NappendixNMycoplasma.html
Seems there are other possibly related forms of this beast: http://www.nap.edu/openbook.php?record_id=11765&page=181
Laws are like sausages. It's better not to see them being made. - Otto von Bismarck
True. In the end, mycoplasm is just another contributing factor to signal/noise in your dataset. It's completely illusory to assume that you get noiseless measurements given the amount of data involved.
Ubi solitudinem faciunt, pacem appellant.
These guys have a lot of crap to talk. Maybe they are very idle.http://www.laserqueen.com.au/injectables.html
Hypercubes are known to be close to timecubes..
This is satire, right?