Slashdot Mirror


1000 Genomes Project Releases Pilot Genome Data

eldavojohn writes "Three pilot projects have been completed for the 1000 Genomes Project and as a result, the pilot data has been released. This makes the data of nearly 700 people available for analysis via FTP (Americas mirror, European mirror). Dr. Eric D. Green of the National Human Genome Research Institute said, 'The 1000 Genomes project has a simple goal: peer more deeply into the genetic variations of the human genome to understand the genetic contribution to common human diseases. I am excited about the progress being made on this resource for use by scientists around the world and look forward to seeing what we learn from the next stage of the project.' There's not a whole lot of information on their site about this data, but the repositories have many readme files explaining the data layout."

16 of 55 comments (clear)

  1. Why pilots? by Anonymous Coward · · Score: 2, Funny

    Do we really suspect there are genes for flying planes and bedding down stewardesses?

  2. Stupid question by Locke2005 · · Score: 2, Interesting

    Does the database also include MD5 checksums for each of the genomes, to make sure they don't get corrupted?

    --
    I've abandoned my search for truth; now I'm just looking for some useful delusions.
    1. Re:Stupid question by fatnickc · · Score: 3, Funny

      No, but it probably gives clues about which people are more susceptible to hash, if that'll help you.

    2. Re:Stupid question by jcmurray · · Score: 3, Informative

      Although they offer FTP access to the genomic data--including population, alignment and sequences (traces, calls, etc.)--the NCBI has hosted the files with a README and guide (aspera_transfer_guide.pdf) about Aspera's "fasp technology" that the NCBI claims to incorporate automated checksum verification for both casual downloaders, via a browser plugin, and bulk downloaders, via a cross-platform command-line application. Aspera is new to me; they claim to have some throughput (bandwidth) advantages as well.

      Nevertheless, the sequence data files embed MD5 checksums directly, per NCBI documentation, which I would expect bulk downloaders to take advantage of independent of any third-party "technology."

  3. Double edged sword by Wowsers · · Score: 2, Insightful

    The genome project has the potential to help people with their illnesses, especially with tailored treatment instead of the generic drugs that work on some people, but not on others and might cause toxicity with certain combinations of drugs which you won't know until you are subjected to that combination.

    On the flip side, this could be a governments wonder weapon. Target a specific trait in the DNA, ie. people with black hair, or men, etc. etc. and kill them off, or make them weaker... all sorts of nasty things.

    --
    Take Nobody's Word For It.
    1. Re:Double edged sword by Anonymous Coward · · Score: 3, Insightful

      On the flip side, this could be a governments wonder weapon. Target a specific trait in the DNA, ie. people with black hair, or men, etc. etc. and kill them off, or make them weaker... all sorts of nasty things.

      Stop it with the ill-informed conspiracy mongering. There are downsides to human sequencing, but not for the reasons you've described. A lack of DNA sequencing has never stopped people from killing individuals with the traits you enumerated. So what are the problems?

      • Genetic testing without genetic counseling: What does a 1.2 increased risk of Alzheimer's mean?
      • Genetic discrimination: Sorry, you have a 1.1 OR risk allele for multiple sclerosis. We can't insure you.
      • Racism: Ah, your blood isn't pure enough, your children can't attend our schools.
      • Inconsistent genealogy: You can figure this one out.

      None of these rely on bizarre science fiction weapons to cause harm. Please don't spread FUD.

    2. Re:Double edged sword by gandhi_2 · · Score: 3, Funny

      Except single-edge swords.

    3. Re:Double edged sword by Turbio · · Score: 3, Informative

      Most genes are used to produce proteins. Samples of all the proteins present inside a living human cell are exposed on the cell's membrane, as part of the immune system machinery. (See Mayor Histocompatibility Complex http://en.wikipedia.org/wiki/Major_histocompatibility_complex) Those proteins can be targeted using homing peptides (think of it as a specific antibody) on a liposome (http://en.wikipedia.org/wiki/Liposome). That liposome can contain anything from drugs to viral RNA. Right now, you can make a solution with liposomes (or polymersomes), that when injected only kills the person if he/she has a certain trait. But I gess that is not a weapon of mass destruction... Anyway, making it a disease is another story. It could involve modifying the HIV virus, which already has a lipid bilayer just like a liposome. And then you have the problem of keeping that disease specific to the target population. And viral genomes tend to mutate at a very fast rate.

  4. Re:Would you guys stop messing with the godamn cod by gandhi_2 · · Score: 2, Funny

    Protip: Don't use IE.

  5. Re:Would you guys stop messing with the godamn cod by Monkeedude1212 · · Score: 3, Funny

    I select "funny" and it does nothing.

    That's not an error, Slashdot just doesn't agree with you.

  6. Re:Would you guys stop messing with the godamn cod by Yvan256 · · Score: 4, Informative

    I tried to mod you "funny", but the moderation system is indeed non-working. Also, after hitting "reply" I was sent to a new page with a messed-up "Reply to this" button, instead of simply having a comment textarea below your post on the same page.

    Someone's messing around with the live website.

  7. Re:Would you guys stop messing with the godamn cod by Monkeedude1212 · · Score: 3, Funny

    Did you try clearing your AOL cache?

  8. Re:Would you guys stop messing with the godamn cod by WrongSizeGlass · · Score: 2, Insightful

    AC complaining != /.ers caring. I'm just sayin'

  9. 4% genes of 2% total DNA of 700 people by peter303 · · Score: 2, Insightful

    Tis probably more comprehensive than marker studies, but not really whole genomes. Who knows how important the so-called junk regions will be eventually?

    1. Re:4% genes of 2% total DNA of 700 people by Turbio · · Score: 2, Informative

      Hey, I work with junk regions! (satellite DNA) And I completely agree with you. But I see that centering on the variability of those few regions rather than sequencing a second complete genome will probably be better for health-care research. The project's title is completely misleading. That's for sure.

  10. Re:How much data is this? by karlnyberg · · Score: 2, Informative

    Approximately 3.5TB. That's the math.

    --
    -- Karl --