Celera Maps Entire Fruit Fly Genome
cjoh345 wrote: "Celera Genomics has just sequenced all the genes in the fruit fly. Apparently the scientists involved are amazed at the genes that we share with this dorm-room annoyance. This discovery also validates Celera's "shotgun approach" to mapping out this stuff. And yes, the genome is available free of charge via
Genbank. Good form, Celera!" What would Mendel have thought of this? How about Watson and Crick? This makes me want to break out my copy of The Double Helix .
Actually the sequencing part was completed earlier this year (giving tremendous subsequent rise to Celera stock). What they now did was the mapping - i.e. piecing the small fragments together.
:-)
Moreover, Celera have also completed more than 90% of the human genome within the last year or so. Once complete, this is an indication that the time before the human genome (more than 200 trillion base pairs) are mapped will be shorter than originally anticipated. To compare, the government-funded Human Genome Project has so far spent over 10 years on the same job.
Celera is doing a 4x oversampling on the human genome, unlike the HGP, which does 10x oversampling. This is possible because Celera is sequencing DNA from one single individual (most likely Craig Venter?), thus avoiding the uncertainty of wheter differences are due to sequencing artifacts or personal variations.
Plus, Celera is using our (PE Biosystems) 3700 DNA Analyzer (fully automated, unattended operation 24 hours per day), whereas the Human Genome Project mostly use our older 377 DNA Sequencer, which requires manual reloading of samples after each run (every 2-3 hours).
As originally stated when Celera was created two years ago, the data is going to be publicly available - a point that has gotten lost among very opinionated but not so informed readers of Slashdot. There will be a 3-month lag period, to ensure accuracy of the data, and to see if there is any information that could be used for patentable drugs & applications.
(Mostly, Celera's business model is based on providing the tools that will give access to this database).
And I have stock options!
-tor
Here is a much more detailed link of the story from Celera's site, talking about the similarities between our genes and the fruit fly's. (I've got a dollar that says their computers are all Celerons, ha ha ho ho.)
What's your damage, Heather?
Celera hasn't even by their own definition sequenced the entire genome of Drosophila. What they have done is sequenced most of, or all of the euchromatic region. The highly repetitive heterochromatic DNA that is clustered around the centromeric regions and makes up an estimated 30% of Drosophila genome is not sequenced. There may even be some B-heterochromatic regions which are also unclonable which is a serious problem in trying to sequence highly repetive DNA. While these regions don't have the glamour of the gene-rich euchromatin (it is often referred to as junk DNA for that reason) they can effect everything from gene expression to chromosome pairing.
And to the few posts I have read which think that this is some sort of private enterprise success story versus a slow blundering government, it isn't. It is a classic example of business going after the highly profitable bits and leaving the taxpayer to fund the basic research. The same basic research which incidentally made it all possible in the first place.
This is a wonderful accomplishment - now we can get started on the main problem.
Having a complete map of a creature's DNA tells us, in principle, all of the proteins that it can synthesize throughout its lifetime. This gives us the building blocks that the creature uses to build things, and the chemical signals that it uses to direct internal operations.
This is wonderful, and essential. To use an analogy, this is like a Victorian scientist, after years of studying a 1999 notebook computer, managing to deduce how transistors and the wires that connect them work.
He still needs to deduce a lot about capacitance, resistance, and inductance to tell how signals will propagate and influence each other, and needs to build up from scratch all of the disciplines involved in integrated circuit design before he can understand how it works, but it's a start.
Similarly, we can now move on to the next step in understanding biological creatures - trying to figure out what all of the proteins do, and how the systems built from them operate and interact with other such systems.
This would not be an easy task under the best of circumstances. It's made worse by the fact that evolution puts little value on modularity - the systems will interact with each other to such a degree that it will be difficult to even define individual systems within the chaos that is an evolved being.
I wish them luck. They have opened the door, and made available for study the vast landscape of interacting systems that we'll have to understand to truly understand how living creatures work.
-----BEGIN FRUITFLY-----
ATCGTAGCTAGCTGACTATCGTAGCTAGCTAGCGTATCGTAGCGATCG
GCATCGTAGCTAGCTAGCTACGTAGCTAGCTAGCGATCGTACGATCGA
CGTAGCATCGTAGCTAGCTACGTACGATCGATCGATCGTAGCTAGCTA
GACTAGCTAGCTAGCTAGCTAGCTACGTAGCGCGATCGATTCGATCGC
AGCTTGACTGATCGGATCGTGCTACGGACTGTACGATCGTACGATCGC
------END FRUITFLY------
GATCGATCGATGCTAGCTACGATCTGATCGATCGATCGTAGCTAGCTA