Slashdot Mirror


Woof! Dog Genome Assembled

Manwe's Herald writes "The first draft of the dog genome sequence has been deposited into free public databases for use by biomedical and veterinary researchers around the globe, the National Human Genome Research Institute (NHGRI), one of the National Institutes of Health (NIH), announced today."

3 of 43 comments (clear)

  1. Re:Genealogy of a Mutt by Otter · · Score: 4, Informative
    IIRC, the primary sequence was from a boxer and they did additional sequencing in 10 other breeds, including a greyhound and a Pomeranian (forget what else) plus coyotes and wolves. Celera, meanwhile, sequenced Craig Venter's poodle.

    Much of the point of looking at dogs is to understand the differences between breeds, like why collies are born knowing how to herd sheep while Shih-Tzus are completely stupid all their lives. So, to the degree that your dog is related to breeds they've looked at, you can get started as soon as they clean the data enough to post it to dbSNP.

  2. Re:Open Source Pets by SB9876 · · Score: 3, Informative

    Actually, the vast majority of gemone sequences are in the public domain in whole or part. Even many of the proprietary genomes ended up being made availabe to the public after a year or two.

    Open source is not a new idea for the academic sciences - they've been operating on that principle for a couple hundred years now.

  3. Degeneracies, IUPAC codes, etc. by dexter+riley · · Score: 4, Informative

    To generate the sequence, all the chromosomes were cut into fragments, cloned, then sequenced. Dogs have two copies of each chromosome (and either an XX or XY pair). The DNA sequence of a region of one chromosome may be different than same region of the other chromosome. In this example, one sequence had a cytosine at that position, while the other sequence had a thymine. To make it easier to decsribe that difference, or polymorphism, the International Union of Pure and Applied Chemistry (IUPAC) created a nomenclature for describing degenerate sequences. The symbol Y represents either a C or a T at that position. To answer your question, Y (C or T) pairs with R (G or A).

    A full listing of IUPAC codes may be found here.