Slashdot Mirror


Celera Completes Human Genome. Sorta.

kovacsp was the first to write to us about the announcement from Celera that they had completed mapping of the human genome. Note: This is /not/ the be-all, end-all. They have finished *mapping* one person's genes. With Celera's approach, this means that they now need to being assembling the information they've gathered. All in all, Celera plans to do the same process with four other people. The Human Genome Project, using a more traditional approach is still a couple years away, but the race is still pretty close.

17 of 126 comments (clear)

  1. Re:NOT sequencing but mapping - worlds apart by CaseyB · · Score: 3
    ...at 2 bits per base pair (4 possible cases), you could fit a complete human genome in under a gigabyte.

    I'm curious: how well does gene sequence information compress? Is this effectively random data, or are there patterns?

    I can see the disclaimers now:

    The GeneStor PeopleBackup(tm) device can now store the complete* contents of your genetic makeup!!

    (* Storage assumes 2x data compression. Results may vary.)

  2. Re:Patents & Intellectual Property, by Just+Some+Guy · · Score: 3

    I think, rather, that the patents should be limited to a company's own database that they've created, not the original stock that they created the database from. I don't see this as being much different from copyrighting a dictionary; both are privately-compiled lists of public domain information. In one case, it's a human language, and in the other, it's the human DNA sequence, but the principal remains.

    In short, then, I believe that companies should have the right to sell any information that they've collected. They should not, however, have any ability to prevent anyone else from collecting the same information, much as Webster cannot prevent Oxford from cataloging essential the same information.

    --
    Dewey, what part of this looks like authorities should be involved?
  3. Finally! Legato WetWorker 1.0, coming soon... by Spud+Zeppelin · · Score: 3

    At least someone now has the technology to do offsite-backups of people... granted, there'll be a certain amount of data loss since the backup fileset was created (birth), but now, at least, there is the beginnings of real disaster recovery technology.

    Imagine: Our friends at Legato could license Celera's technology and produce "WetWorker" -- with the ability to put your genetic data on CD-Rom for easy transport to offsite storage. Then, when your friendly, egocentric ocean liner captain decides to go "All Ahead Full" on a foggy night in the North Atlantic AFTER receiving an iceberg warning, you can rest confident that your family can always recover you from archival backup.

    I'm aware that there are shortcomings (especially the part about "loss of all data accumulated since birth"), but after all, the centerpiece of any backup software isn't ease of recovery, it's ease of deployment. The data can always be reconstructed from "incrementals" (it pays to take good notes...).



    This is my opinion and my opinion only. Incidentally, IANAL.

    --

    MOO;IANAL.
    There used to be a picture linked here.

  4. Re:Clone on the Range by Ralph+Wiggam · · Score: 3

    I don't think you could just "change" a Y chromosome to an X chromosome. I believe they're entirely different. Think about it, Y chromosomes give you the ability to drive well, while X chromosomes just give you the sense not to wear the same socks two days in a row.

    -B

  5. According to Top500 by Robert+Link · · Score: 3
    According to Top500, the fastest supercomputer that was not at a government installation was a Hitachi SR8000/128 at the University of Tokyo, which weighed in at number 5 overall. If you want to discount academe, the fastest owned by a business appears to be at Charles Schwab. It's a 2000 processor IBM SP PC604e, and it rates number 12 overall. So, either Celera got a very big machine since those statistics were compiled, or they are using a different standard of "bigness" than the LINPACK benchmarks used in the list, or they were playing a little fast and loose with the truth. I would tend to bet on option number 2, myself.


    -rpl

  6. [sic] by Monte · · Score: 3

    Literacy is in short supply amongs most around here.

    Q.E.D.

  7. Re:This worries me. by Phrogman · · Score: 3

    The human gene sequence is in the public domain and will remain there - anything else would be ludicrous (although I agree that when it comes to the law ludicrous seems to be perfectly acceptable. Witness patenting software algorithms). What I believe they will get the patent on is their process for deriving the gene sequences - which is perfectly acceptable. They will also have the rights to their database of human gene information, which they can license the access rights to. The Human Genome Project will be making its results publically available, so it might become a matter of whose database provides the most ancilliary information.

    --
    "The first time I got drunk, I got married. The second time I bought a chimpanzee, after that I stayed sober" Arian Seid
  8. That's the problem... by Brand+X · · Score: 3

    Unfortunately, the whole gene patent scandal is because of this point exactly... they *are* being awarded exclusive rights because they got there first. Bad, bad, bad, bad, bad.

    I have every right to do whatever I want with any gene I've sequenced myself, damnit! I shouldn't have to pay royalties to someone because they sequenced it first!

    The analogy is the spanish and portuguese "claims" to the americas, not a translation of Vergil.

    --
    -- Still waiting for the Nike endorsement
  9. Re:Ethics and Genetics by gwernol · · Score: 3

    So, say you could change something cosmetic about yourself genetically for a reasonable price. For example, what if a virus were available that triggered a whole-body genetic mutation, and the end result was a change in your genetic hair color?

    So this raises an interesting question that I've wanted an answer to for some time. [side note: I am not a biologist, nor do I play one on the Net, so please excuse me if this is a dumb question].

    One of the much touted advantages of genetic engineering is the ability to cure genetic problems in living humans. This is distinct from altering the genetic code in cells that will go on to form a viable human foetus.

    So, say I have some genetic disease caused by an unfortunate sequence in my DNA. Assume we know what replacement sequence would cure this problem. On an engineering level, how would I go about making the change in every cell in my body? This is what I would have to do, right? Is this an area where nanotechnology and genetic engineering meet? Or could genetically-modified viri really perform this task?

    I assume that now we are closing in on getting detailed genetic information about humans, people are starting to think about how gene therapy might be applied in practice. Does anyone have anything they can share with us on this subject?

    --
    Sailing over the event horizon
  10. Done! by VAXGeek · · Score: 4

    Great, now how much longer until a public beta release?
    ------------
    a funny comment: 1 karma
    an insightful comment: 1 karma
    a good old-fashioned flame: priceless

    --
    this sig limit is too small to put anything good h
  11. This worries me. by raygundan · · Score: 4

    Is anyone else bothered by the fact that the first group to have a complete sequencing of the human genome is a private company? If anything ought to be in the public domain, all other arguments about software, music, etc... aside, it is the human genome. After all, everybody already has their very own. Celera deserves to reap the benefits of getting there first, but only until somebody else can get there as well. If another group finishes the sequencing, they have just as much right to use it as Celera. It's not like Celera has created an original work-- they've just finished reading through the genome first.

    I really hope that the HGP places this information in the public domain as soon as possible, and refrains from signing any exclusionary deals with Celera that would prevent this information from being free.

    1. Re:This worries me. by zavyman · · Score: 4

      What would you think if you spent a great deal of time and money on sequencing the genes, only to give it away? Of course they are in it for money, and they can keep what they find to themselves. Its not like by being the first to sequence them they get the exclusive rights to the genes.

      Liken it to a translation of an old text such as Vergil's Aeneid. Anyone can translate it and sell the translation -- it is still an original work. There are also many translations, some more correct than others.

      In the same way, they are translating the genome and retaining the rights to that translation. Nothing is preventing someone else from spending the time to get their own. In addition, how do you know what they present are correct. If you need accuracy, do it yourself. The genes are in the public domain: you have your own copy!

      The information like this should be free, but it doesn't have to.

  12. You're being served. by wowbagger · · Score: 4
    Contents of a writ delivered to Celera. I won't say how they were leaked...

    To: Celera, Inc.

    From: God

    Your attempts to reverse-engineer my closed source project "man" are in violation of federal law. The data encoded in the media "DNA" are encrypted, and in circumventing that encryption you are in violation of the Digital Millenium Copyright Act.

    Cease and desist all furthur attempts to decode this information, and destroy all copies. Do not disseminate this information, and notify all mirror sites to do the same.


  13. Clone on the Range by Frank+Sullivan · · Score: 5

    Oh give me a clone
    Of my own flesh and bone
    With her Y chromosome changed to X
    And when she is grown
    My very own clone
    She will be of the opposite sex (hurray!)

    Clone, clone of my own
    With her Y chromosome changed to X
    And when she is grown
    Since her mind is my own
    She'll be thinking of nothing but sex!

    (written by Robert A Heinlein)
    __
    (oO)
    /||\

    --
    Hand me that airplane glue and I'll tell you another story.
  14. NOT sequencing but mapping - worlds apart by orpheus · · Score: 5
    It's very important to make the distinction between mapping and sequencing.

    SEQUENCING means creating a complete list of the nucleotides in order. If you had this information, you could actually synthesize the entire genome of the individual. [There are some sophisticated niceties like methylation that distinguish the synthesized version from one extracted from a human, but it's essentially complete.] There are other factors (like which regulatory binding sites are actually bound, by what proteins; exact state of histone supercoiling, etc.) that control gene expression enough to keep this from being a working human genome, but it's awfully close.

    MAPPING means determining distances between known genes. Using this information, you can deduce where the various genes are, the approximate location of specific unknown genes, and many other useful facts. A detailed map is a good starting place for hunting down a gene, so you can locate and sequence it; it also can tell you what traits are likely to be inherited together, etc.

    A "sequence" is a complete blueprint (though there are details that aren't covered by sequence alone) A map is like a geographical map that shows where all the cities and large towns are. There are still many factories, facilities, and industrial complexes off that map -- not to mention all the roads, rivers, mountains and utility lines. ETC.

    A sequence is a lot more information, and a wonderfully compact database - at 2 bits per base pair (4 possible cases), you could fit a complete human genome in under a gigabyte. (That's only one human, however.)

    Naturally, even once we had the genome (or preferably a few thousand individuals, to let us get a real handle on variations), we could still spend decades or centuries figuring out what it all meant. 3x10^9 bases is a lot of info. You thought it was hard trying to trace western civilization in the first million digits of pi.

    I am not a Molecular Biologist - anymore. But I was, about 10 years ago.

    __________

    --

    If you can go to bed, knowing you did a valuable thing today, you're very lucky. If you can't... it's not bedtime

  15. Celera's definition of "completed sequence" by seaneddy · · Score: 5
    If we use Celera's definition of "complete" then the public project is already done too.

    Any reasonable person would define "complete" as this: there's three billion bases of human DNA in 24 different linear chromosomes. The sequence is complete when you can give me a DVD with 24 files on it, each of which contains a contiguous sequence of a human chromosome.

    That may never happen for any large animal or plant genome. Too many regions of a genome sequence are an ungodly mess, repetitive and difficult to sequence.

    The public worm (C. elegans) project, at 98 million bases, defined "essentially complete" as "we've come as close as we can to complete using existing technology". We have 97 million bases sequenced and about ~50-100 remaining gaps.

    The fly (Drosophila melanogaster) project, at 180 million bases in size, was recently declared "substantially complete" by Celera. They have 120 million bases of sequence, with several thousand gaps. The fly has more extensive regions of repetitive sequence than the worm.

    The human, at 3 billion bases in size, is nowhere near complete, either by the public (us) or by Celera, no matter what Celera press releases say.

    You need the following steps to get close:

    1. shotgun coverage. Technology limits us to reading ~500 bases of sequence at a time, so we have to blow the genome to bits, sequence millions of fragments, then assemble it all back (computationally) into a contiguous sequence. Because a successful assembly relies on deeply redundant overlap amongst the fragments, we need ~8-10x shotgun coverage (24 to 30 billion bases) to try to assemble the human genome. The fly genome was shotgunned to 12x coverage to achieve the results Celera reports.

    2. Assembly. Once you've got shotgun data, you can try to assemble the genome from those fragments.

    3. Finishing. The automated assembly (like the fly genome now) will have a great number of gaps. These must now be closed, more manually, by expert molecular biologists; the gaps represent regions that are biologically difficult to sequence.

    The actual science behind the Celera press release is that they have partially completed phase 1. They currently have 4-5x shotgun coverage of the human genome, about half of what they need for a proper assembly. They intend to get the other 4-5x coverage from the public "rough draft", which is at about the same stage Celera's project is in.

    The two projects (Celera and public) are neck and neck in this "race". The difference is that we acknowledge that our sequence is a rough draft at this stage; whereas Celera claims that their sequence is complete. Celera has every right to spin their project to their investors any way they feel is appropriate, but scientifically, they are being rather disingenuous if not dishonest.

    conflicting oblig. disclaimers: I'm a co-PI on the public project, and I (accidentally, through an acquisition) also hold substantial stock in CRA.

  16. Re:Is this a Non-event? by Signal+69 · · Score: 5
    Yeah, everybody's genes are "different." However, DNA is composed of "introns" (non-coding interening sequences, which make up a majority of genes and aren't don't code for anything) and exons - expressed sequences which do the coding.

    Although DNA fingerprinting is mostly accurate, it is based on differences in the introns, which are highly variable. As far as exons are concerned... You've probably heard that chimps and humans are 98% or so genetically similar, and humans and hamsters are 95% genetically similar.

    If you compared the genes (exons) of any 2 people, you'd find them to be 99.99999999% or more similar. The differences are very slight. What makes people unique is not the genes so much as which ones are expressed.