Slashdot Mirror


Dumb Things With Bioinformatics

PrvtBurrito writes: "About 3% of the human genome is "coded" as genes. The proteins those genes encode can be represented as long sequences of amino acids, a twenty letter alphabet. In an attempt to perhaps prove that nothing is sacred, someone has cataloged all of the english words found in known annotated protein sequences from many organisms. It looks like after cataloging over 37,000,000 characters, the longest word is chapstick and the most common word is kilter."

1 of 30 comments (clear)

  1. Right there by heikkile · · Score: 5, Funny

    near the beginning of chromosome 1, in plain view for anyone to read: Frst Post

    --

    In Murphy We Turst