International Challenge To Computationally Interpret Protein Function
Shipud writes "We live in the post-genomic era, when DNA sequence data is growing exponentially. However, for most of the genes that we identify, we have no idea of their biological functions. They are like words in a foreign language, waiting to be deciphered. The Critical Assessment of Function Annotation, or CAFA, is a new experiment to assess the performance of the multitude of computational methods developed by research groups worldwide and help channel the flood of data from genome research to deduce the function of proteins. Thirty research groups participated in the first CAFA, presenting a total of 54 algorithms. The researchers participated in blind-test experiments in which they predicted the function of protein sequences for which the functions are already known but haven't yet been made publicly available. Independent assessors then judged their performance. The challenge organizers explain that: 'The accurate annotation of protein function is key to understanding life at the molecular level and has great biochemical and pharmaceutical implications, explain the study authors; however, with its inherent difficulty and expense, experimental characterization of function cannot scale up to accommodate the vast amount of sequence data already available. The computational annotation of protein function has therefore emerged as a problem at the forefront of computational and molecular biology.'"
I guess during the Super Bowl, even a lowly anonymous gets to be the premier comment!
man, my ass feels so empty and relaxed it's like a meditating buddhist!
Your mom interpreted my proteins after I shot my wad into her eye last night.
Here I was, hoping for another Folding@Home.
Your similes are magnificent. They are like eggs on a pancake, butternut waiting to be waffle-ironed.
It's about time we start focusing on the future and on what we know will work. Understanding how matter organizes itself into life is one the biggest challenges ahead. I propose that we understand how life works, how life works and how to extend it before this decade is out.
Yet. Computational power can scale infinitely, and scales geometrically with time. Just wait. It'll be done soon enough.
Or make the problem more efficient. I bet with each completed protein the process gets faster and more efficient. It will be done sooner than expected.
Without a good plan, we'll be at it for decades. Here's what I think genomic researchers should do.
Genes (and proteins) are obviously organized hierarchically. Which means there must be a control hierarchy in there somewhere. To unravel and properly classify the genome, researchers must first identify and understand the hierarchical control system. Only then can they begin to populate the branches with the correct genes.
After the tree is completely built and all the genes have found their correct locations on the tree, then it's a matter of going through the tree from the top down and switching the branches of the tree off/on one at a time to see what happens. It's hard but it can be done.
Don't discount that as stupid. Most of what he said is true. Evolution makes you write code that works, not good or clean code, just code that works. The only time evolution comes into lay is when the code can't even compile.
Indeed there's even some selective pressure for code obfuscation. Viruses take advantage of compression for example. New functions usually evolve from faulty events in old genes. There's no pressure to remove accidental calls to the wrong subroutine if they don't matter, hence a lot of messages go to the wrong place as well as the right place. Even in higher animals you see this (dog's legs that scratch themselves when you scratch their ribs) is probably some back propagation on the nerve network that was not necessary to remove for proper operation of the dog.
Some drink at the fountain of knowledge. Others just gargle.
I am not a biologist so forgive me my ignorance but when people say that DNA is the blueprint for an organism I never understand how a bunch of proteins can determine an organism's shape and behavior. Aren't there more factors that determine those things, like the surroundings in which the DNA is used, like chemicals that the growing organism is surrounded with, temperature, etc?
-- Cheers!
That is all nice, but most of these prediction algorithms are based on one or more of the following assumptions, which are not always true:
So any prediction should be taken with a grain of salt and experimentally verified, which brings us back to " ... with its inherent difficulty and expense, experimental characterization of function cannot scale up to accommodate the vast amount of sequence data already available ...."
Do these algorithms autonomously file for patents on their findings and issue legal threats to competing algorithms as well?
I just realized that the word assassin has ass in there twice. Thank you /. for limiting how much I can write in the title.
...a post-genomic world be one in which we had stopped fiddeling with genes and DNA and such?
Aren't we more in the midst of a Genomics Revolution?
Or more accurately, we are in the infancy of the Genomics Revolution.
THINK! It's patriotic
This TED.com talk by Danny Hillis is informative on this topic, http://www.ted.com/talks/danny_hillis_two_frontiers_of_cancer_treatment.html "Danny Hills makes a case for the next frontier of cancer research: proteomics, the study of proteins in the body. As Hillis explains it, genomics shows us a list of the ingredients of the body -- while proteomics shows us what those ingredients produce. Understanding what's going on in your body at the protein level may lead to a new understanding of how cancer happens."