New Science Of Metagenomics to Transform Modern Microbiology?
ScienceDaily has a look at the emerging field of metagenomics that watches the DNA of whole communities of microbes to better understand the microbial world. "Metagenomics studies begin by extracting DNA from all the microbes living in a particular environmental sample; there could be thousands or even millions of organisms in one sample. The extracted genetic material consists of millions of random fragments of DNA that can be cloned into a form capable of being maintained in laboratory bacteria. These bacteria are used to create a "library" that includes the genomes of all the microbes found in a habitat, the natural environment of the organisms. Although the genomes are fragmented, new DNA sequencing technology and more powerful computers are allowing scientists to begin making sense of these metagenomic jigsaw puzzles. They can examine gene sequences from thousands of previously unknown microorganisms, or induce the bacteria to express proteins that are screened for capabilities such as vitamin production or antibiotic resistance."
extract dna from millions of microbes?
I always thought that DNA extraction was a manual process... or at least it required a significant amount of manpower to get.
Sometimes the best solution is to stop wasting time looking for an easy solution.
This is really a new paradigm for microbial ecology. Instead of worrying about how thousands of different species (most of them unknown) are interacting with each other, you can now think about what genomic and proteomic resources are present in a habitat. Think of the organisms themselves as just the bags that contain what you're really interested in looking at, and suddenly a lot of insights and high-throughput techniques open up to you.
The article was taken from a National Academies press release. Here's the full report, parts of which (maybe the whole thing? I didn't check) can be previewed as a pdf if you don't want to purchase the book.
Oh, and here's a brief (4-page summary) of the report.
Woulda been nice to have the source info in the summary...
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
The Craig Venter Institute's Global Ocean Sampling Expedition has been collecting Metagenomic samples for the past couple of years. Among other things the expedition has doubled the number of putative proteins. An excellent video from the expedition is available at http://plos.cnpg.com/lsca/webinar/venter/20070306/ index.html and a set of recently published papers from the expedition are available for free at http://collections.plos.org/plosbiology/gos-2007.p hp
A website hosting the data from the expedition catered towards use on metagenomic samples has been developed by the Venter institute and is available at http://camera.calit2.net/
I think you understand the steps in reverse order.
1. a mutation happen randomly in sperm or egg.
2. a new queen is born from this mutated reproductive cell.
3. mutation is positive (e.g. the slave from this queen are more efficient)
4. the queen give birth to more new queen than one with less efficient slaves
Let's look at it in another way.
Infertile workers are like our cells. You can have one white cells which is resistant to HIV, but this mutation won't be passed to your offspring. But maybe one of your sperm (the one) will have the mutation which give resistance to HIV. So your son will be resistant to HIV. He will have more chance of surviving and also to reproduce.
Evolution is a very slow process. Mutations happen relatively frequently. Positive mutations are rare. It's also rare that it is present in germinal cells and rare that an offspring is produced from it. The advantage gained is not always enough to allow it to be propagated. it's very rare that all the conditions are met. That's why it's so slow process.
Bacteria are a lot more efficient at propagating mutations than us.
And concerning bees, the workers have the same genome as the queen but some genes are only activated by the royal jelly given only to future queens.
The problem with this is that it ignores that not all genes an organism has are necessarily expressed and that, particularly, the expression of genes may be triggered (or suppressed) by environmental conditions or by the presence or absence of other genes. Colony insects have evolved to very effectively exploit this.
So, in short, a queen with the mutation would not die, because the mutation would be dormant in a queen.
Yes, all the genes of the various castes are present in either or both the fertile male and fertile female individuals. They clearly aren't all expressed in the fertile individuals, nor all expressed in any of the various infertile castes; which are expressed and which suppressed depends on the environmental triggers to which the inviduals are exposed (mostly, feeding in the larval stage) which create the castes, and on genetic factors (such as those between males and females).
I'm not sure whether the above post should be marked "astroturfing" but it sure reads a little too positive.
454's sequencing technology is a welcomed addition to existing technologies, but don't believe the hype, particularly when the person talking has stock options.
The analysis of genomic sequencing data (metagenomics or otherwise) is highly benefited by large contiguous pieces or ideally whole contiguous genomes. Related to this and more fundemental is the fact that the shorter the pieces of DNA spat out by a machine the harder the problem of assembling them into larger contiguous chunks. This is due in part to the combinatorics of an alphabet made up of only 4 symbols but mainly the fact that genomic DNA contains many repeat structures even in lower organisms.
Without going into detail, it suffices to say that the longer the pieces (or "reads") produced by a sequencing machine, the easier the problem. Add to this the realities of sequencing errors and throw in metagenomics where you may have many organisms with almost the same genome, the problem gets quite hard.
Currently the large sequencing facilties that use 454 machines use them to complement their existing machines which produce 3-10 times longer reads (depending on who's talking). There are in fact papers investigating the ideal ratio of reads produced by new and old technologies.
Another factor to keep in mind is that, although the new high-throughput technologies (454 is the first to market, but not the only player) hold alot of promise, a large part of their appeal was going to be an enormous cost reduction. The problem is, so far that part of the equation hasn't met expectation. They are quite costly to run due to the cost of consumables and those prices are set by the manufacturer.
only one thing I have difficulty understanding with evolutionism (which I am a strong proponent of)
You're lying. Seriously. Only creationists use phrases like "evolutionism" and "darwinistic evolution."
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
The advent of metagenomics is accentuating a sad trend in science: less lab work, more computers. Do not get me wrong, I feed my kids from the computer desk and I have never touched an "Eppendorf" or "Pipetteman" (not sure about spelling). In the race for grants we are chasing aggrandisement of the projects we are applying to NSF and DOE. More computers, more modeling instead of experimenting.
I have been reading scientific literature for almost 25 years and the tendency is clear: the results of "computer experemints" (read, modeling) are trusted more and more without any experimental verification. The procentage of sequences in GenBank and Refseq which function is determined only by homology to existing proteins grows. That means we are guessing the function of new proteins by comparing them to the proteins which function we also guessed by comparing to earlier proteins, etc...
Number of protein folds is limited: 700, 1000, 30000, does not matter: it is limited, but it does not mean the functions are limited in the same way. How on earth are we going to find out the function of completely new protein that have not enough similarity to anything in the database? We cannot do it on computers.
And obviously we do not have resources to research experimentally 1.5M genes in Refseq. So instead of blindly pumping more and more raw data into our RAID arrays, we need to be more focused on researching the genes, proteins, pathways that have a direct impact on medicine. You know, "stuff that matters".
I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.