Slashdot Mirror


Function of 80% of the Human Genome Charted

ananyo writes "In what is likely to be a historic moment in science, ENCODE, the Encyclopedia of DNA Elements, has published 30 papers in Nature, Genome Research and Genome Biology today, assigning some sort of function to roughly 80% of the genome, including more than 70,000 'promoter' regions — the sites, just upstream of genes, where proteins bind to control gene expression — and nearly 400,000 'enhancer' regions that regulate expression of distant genes. The project was designed to pick up where the Human Genome Project left off. Although that massive effort revealed the blueprint of human biology, it quickly became clear that the instruction manual for reading the blueprint was sketchy at best. Researchers could identify in its 3 billion letters many of the regions that code for proteins, but those make up little more than 1% of the genome, contained in around 20,000 genes. ENCODE, which started in 2003, aims to catalog the 'functional' DNA sequences between genes, learn when and in which cells they are active and trace their effects on how the genome is packaged, regulated and read. Nature has set up an ENCODE site with an explorer, that groups the papers by topic, and collects all the papers, which are available free."

24 of 112 comments (clear)

  1. Most of it is control code by bill_mcgonigle · · Score: 4, Interesting

    Just happened to hear an NPR interview on the way back to the office. The researcher described most of the 80% as regulating the expression of the protein codes. Brace yourself Slashdot: he called it the 'operating system'.

    --
    My God, it's Full of Source!
    OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
    1. Re:Most of it is control code by fm6 · · Score: 4, Insightful

      Yep, and the OS can get reprogrammed by viruses.

      Various fields borrow terminology from each other. Not that big a deal. My toaster has a "cancel" button. An old-fashioned "eject" would make more sense to me, but I guess mechanical terminology is less familiar to most people.

    2. Re:Most of it is control code by Fallingcow · · Score: 2

      So when is the gcc port going to be finished?

    3. Re:Most of it is control code by tlhIngan · · Score: 5, Interesting

      Yep, and the OS can get reprogrammed by viruses.

      It's also got fairly good licensing terms - I mean the OS can be replicated (it is billions of times - once for ever cell), and the OS can make copies of itself (mitosis), and even end up altering itself through random changes (mutations).

      A virus only infects one cell to duplicate itself - all the other uninfected cells have their own copy. The antivirus system basically works by killing infected cells.

      So all in all, an interesting OS - security worse than Windows (i.e., none at all - just random strings of genetic code can alter the OS - you don't neve need root! just physical access!), yet it really works by sheer number of copies.

    4. Re:Most of it is control code by grcumb · · Score: 2

      Yep, and the OS can get reprogrammed by viruses.

      Various fields borrow terminology from each other. Not that big a deal. My toaster has a "cancel" button. An old-fashioned "eject" would make more sense to me, but I guess mechanical terminology is less familiar to most people.

      I'd be a lot more impressed if your toaster had an UNDO button.

      --
      Crumb's Corollary: Never bring a knife to a bun fight.
  2. Wait for the lawsuits by viperidaenz · · Score: 4, Insightful

    Wait for the lawsuits when they come across a gene some company holds a patent for. They "invented" it you know.

    1. Re:Wait for the lawsuits by whydavid · · Score: 5, Informative

      This is probably the single-most important factor in determining whether or not we'll see "personalized medicine" within the next 50 years. The fact that a company owns a patent on the idea of testing the BRCA1 and BRCA2 genes for breast cancer susceptibility is absurd.

    2. Re:Wait for the lawsuits by whydavid · · Score: 2

      My understanding of this is quite different. Look here: http://www.genomicslawreport.com/index.php/2010/10/11/a-do-it-yourself-genomic-challenge-to-myriad-the-fda-and-the-future-of-genetic-tests/ If you scroll down to "What This Means, Part I...." there is an explanation of the various patents, including a patent on the nucleotide sequence itself, and the process of comparing the sequence to known mutations. While they speculate that a whole genome sequence would get around the gene nucleotide sequence patents (since you are not sequencing them "in isolation"), you would still need to compare the sequence to known mutations. If that patent is upheld, then it doesn't matter how you go about retrieving the gene. I hope this interpretation is incorrect.

    3. Re:Wait for the lawsuits by Cyberax · · Score: 2

      No, situation is more complicated. Myriad has patented the preparation of the affected gene and then detection of mutations. So if you isolate the BRCA1/2 genes and then do testing on them, you'll be likely infringing Myriad's patent.

      However, if you sequence/genotype your genome and then check it for BRCA mutations then you're OK and do not infringe on Myriad's patent. That has been affirmed by the court decision (the very same one that upheld the patent claims on BRCA gene isolation process), because the process of aligning and analyzing sequences is not transformative and thus not patent eligible. So yeah, it's a mess, but not as bad as lots of news sources present it.

  3. Historic, or a bit arbitrary? by wonkey_monkey · · Score: 3

    In what is likely to be a historic moment in science

    I'm not knocking the achievement, but wouldn't it be a more truly historic moment when they've nailed down the function of 100% of the genome? Where was the big celebration when they got to 64.576%?

    --
    systemd is Roko's Basilisk.
    1. Re:Historic, or a bit arbitrary? by vlm · · Score: 2

      The historic part is its release day. Kind of like that other /. story where gathering 14 million or 13 million or whatever iphone UDIDs into a big pile wasn't the story, the story was releasing them for anyone to look at and download and mess with.

      --
      "Science flies us to the moon. Religion flies us into buildings." - Victor Stenger
  4. The software analogy by realxmp · · Score: 3

    Imagine you were given a slightly buggy 3.2Gig piece of software used to run a group of factories and to managing communications between them. You were told to debug this piece of software, but you had no source code, only the machine code only. You could of course observe it's behaviour and set up situations in various factories to see how it behaved. To complicate matters further, you realised that the factories were all performing different functions and making different things but they were all running the same software, but it wasn't immediately obvious why they behaved differently. This is pretty much a huge oversimplification of the challenge that faces modern genetics. We have the assembler language, but we're still not sure entirely what bits are coding sections, which are data sections and which sections are marked up to act as configurations sections. It's a huge task and I love it.

    1. Re:The software analogy by HeckRuler · · Score: 3, Interesting

      Furthermore a long time ago some hackers got in and mucked about committing changes the the trunk. They didn't seem to break anything vital so it was just left alone and we hope it doesn't bite us in the ass before we retire.

      Oh yeah, btw, we use a distributed revision control with about 7 Billion branches and no one true trunk (although a lot of people claim otherwise). There's a lot of wanton merging (giggity) and branching, and it seems like every time that happens there's a chance that the revision control just fucks something up and makes a mess of it all.

  5. Re:What about the 'junk' DNA? by HeckRuler · · Score: 3, Informative

    It's time you wised up (it's RIGHT THERE). Junk DNA is non-encoding. Meaning it doesn't get used to make proteins. This was thought to be the entire purpose of DNA, so junk DNA was like commented out code or dead code. But we learned that this genes have other uses. Like some DNA affects the trans-coding of nearby genes. And it appears some non-encoding DNA is still selected for, so it probably has some unknown function.

    But nature most certainly deals in junk. And there is DNA in you right now that's non-encoding, non-functional, and not selected for. IE, it's junk. It just happens to be the random uninitialized value or a previous mismatch of old code that was cut long ago. Deal with it.

  6. Re:Junk DNA? by afidel · · Score: 3, Interesting

    Uh, it was called into question over a decade ago. No scientific paper or journal article ever referred to it as "junk" DNA, it was called non-encoding regions and it was understood fairly early on that at least some of the area held a regulatory function. What wasn't realized until the human genome project concluded was just how little of the genome was encoding and how massively important the regulatory regions were (the human body creates a heck of a lot more than 10,000 different proteins so the regulatory regions must be more than simple on/off switches but must also have the ability to affect structural changes in the protein encoding sequence),

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  7. When we understand how this works by anubi · · Score: 4, Interesting

    The sky will be the limit.

    The understanding of how DNA works, ( and correspondingly, how to hack it ) is the ultimate reverse-engineering accomplishment.

    Life is a textbook, full of worked examples. We are at the stage we realize there is an alphabet, the letters mean something, and have the definition of a few words. Kindergarten stuff.

    If we play our cards right, and don't spend all our resources fighting amongst ourselves, the future is incredibly bright. We have worked examples of damn near everything we need... photosynthesis ( solar powered CO2 sequestration and energy storage ) for starters. We have bioluminescence, electric eels, and all sorts of sensor examples.

    I figure we have been given a huge shipment of arduinos with all sorts of accessories, and we have now figured out how to make the light blink.

    We don't know how its wired yet, how the compiler works, and just now figuring out some of what makes the hardware work.

    If our society will value knowledge above greed and accounting, if there is anything limiting our potential, I have yet to see it. However if greed and accounting is all we know, we will soon run into all sorts of limits, imposed only by our inability to adapt. First of these will be exhaustion of the earth's fossil fuels, followed by food and water famines. We will be like the chick that hatched, but failed to scratch, find food, and thrive, living off the energy stored in the egg - until it is depleted.

    The earth is our egg.

    I value highly the knowledge our species acquires. It is our survival.

    --
    "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]

  8. Re:Junk DNA? by whydavid · · Score: 2

    Strictly speaking, it is still universally believed that 'many' non-coding genes are historical junk with no current function, and experiments on organisms with much simpler DNA than ours bears this out (in short, they will scramble suspected 'useless' sections of DNA and look for changes in function; impossible to do in humans, but relatively simple for c. elegans). If you are talking about the antiquated view of _all_ non-coding genes as "junk DNA:" This is not the usual belief, at least not in the molecular biology/bioinformatics community. You may just be seeing the release of tons of ENCODE work today, but this research (and a vast amount of related research) has been ongoing for several years. It is well known, and accepted, that many non-coding genes play important regulatory rules in DNA transcription.

  9. Re:Junk DNA? by robotkid · · Score: 4, Insightful

    Am I misinterpreting this, or is the usual belief that many genes are obsolete sequences that have no current function being called into question?

    I don't think any serious molecular biologist ever thought the majority of DNA had no function, just as no neuroscientist ever believed that we only use 10% of our brain, but that's precisely the sort of sound bite that, when uttered in a press release somewhere, echos around the public consciousness forever and never dies because it provides a conveniently sciency premise for the next batch of rebooted superhero origin stories. The distinction is that before this study, we knew non-coding DNA was involved in regulation but not to what extent; i.e. there were plenty of specific anecdotal findings but nothing this systematic and large scale.

    As for the significance of this sort of work, yes, it exactly like release day for a major software package, it's an anticipatory excitement and not a "we finally found the Higgs Boson after decades of searching" type of achievement. Molecular biologists and geneticists everywhere can now do a simple web search see how this affects the system they are working on without needing to perpetually beg the labs that possess the specialized high-throughput instrumentation to do a one-off experiment just for their favorite gene. . .

  10. Licensing terms, oh my. by Ihlosi · · Score: 2

    > It's also got fairly good licensing terms - I mean the OS can be replicated (it is billions of times - once for ever cell), Yeah, right. But use the wrong process for replication, and you'll end up paying for it for almost two decades! How's that for vendor lock-in?

  11. Re:What about Selection? by rnaiguy · · Score: 2
    Just because it doesn't code for protein doesn't mean it isn't turned into RNA, or bound by some protein to regulate some other part of the DNA. Some of it is selected for, we just didn't know where to look to find conservation, or the nature of that conservation. It's easy to pick out regions coding for protein because there are some fairly strict rules for these, so it's easy to find conservation. For some of these non-coding regions, the precise sequence and location is not important, and many similar sequences spread over a wide location range can work just as well. Alternatively, some of these regions are selected to have a certain structure at the RNA stage, which can be satisfied by many different sequences, and undetectable by most methods. Lastly, and most excitingly, these are regions that are evolving quickly. While the function of a protein may remain similar between two species, it may need to be produced in a different time and place in different species. These may be the places in our genetic code where we find out what makes us human. Thus some of these regulatory elements are not conserved between species (though many are!).

    Now none of that rules out the existence of regions of DNA that do little or nothing and can be mutated without consequence. There are definitely such regions in the genome.

    i hope that cleared something up

  12. Re:What about the 'junk' DNA? by Cyberax · · Score: 2

    Uhm, no. "Junk" DNA is really junk. It consists of repeating regions, transposons, inactive and decaying retroviruses, etc. It has no direct functions, and its indirect usability is questionable. For example, pufferfish has almost no junk DNA while some plants (corn, for example) have lots of it without any visible effects on mutation rates or cell viability.

  13. Re:Junk DNA? by Cyberax · · Score: 2

    Actually, most molecular biologists KNOW that the majority of _eukaryotic_ DNA has no function. It's junk, deal with it. Fairly small parts of non-coding DNA perform useful functions: gene expression regulation (less than 0.1% of total DNA), mechanic 'handles' for cell replication machinery (about 5% of DNA), various RNA enzymes (less than 1%), etc. But most of it is still junk.

  14. Re:What about the 'junk' DNA? by Cyberax · · Score: 2

    Nope. We HAVE confirmed it. Like, we've analyzed them it turns out that about 33% of the genome consists of simple repeating sequences (SINEs and LINEs). Yep, your genome consists mostly of copy&pasted crap. Oh, then there are about 8% of viral remains and lots of useless LTRs.

    They serve no function at all and can be safely deleted without any effect on viability. In fact, pufferfish genome has almost none of them and it has no discernible effect at all. On the converse, plants have in general much more repeated fragments (with great variation between species) and also no discernible effects on their cell viability. But what do I know? After all, my company just produces a novel sequencing method, so it's not like I know what I'm speaking about.

  15. Re:Not what Darwinists predicted... by HeckRuler · · Score: 2

    Gee after decades of Darwinists proclaiming junk DNA to be key evidence of Darwinian evolution,

    It still is. Also, the impact of non-encoding DNA has been suspected for over a decade. These papers just kind of hammer it home. Furthermore, there is STILL junk DNA. As in non-encoding, non-selected, no-purpose sections of your DNA that are remnants from long ago. It's dead code, commented out code, or simply gibberish. (There's just less of it than we previously thought)

    (before really knowing what those non-coding segments of DNA were for)

    There are noncoding segments that appear to be selected for which we still don't understand

    I wonder what they'll say now.

    We're still right. And all of this still helps us prove our case.

    There are more robust theories of evolution than the Darwinian model

    Well, sure. Darwin laid out the basic premise that species change over time and become separate species. That's still very much true. He got some details wrong, which later models improved upon, so I guess that Darwin's specific model is out of date. I mean, he didn't even know DNA existed. But "Darwinian", "Darwinist", and "Darwiniest" are kinda loaded terms only used by fundie creationists. I'd avoid them if I were you.

    but they all suggest design to one extent or another.

    Bwahahahahah! sure kiddo, if you say so.

    the other theories are honest and don't assert a naturalistic origin for things like DNA.

    I'm actually intrigued by this. What would be the origin of DNA be other than nature? Anything that isn't man-made is natural. Kinda by definition.