Slashdot Mirror


Creating Artificial Proteins

Spy der Mann writes "By examining how proteins have evolved, UT Southwestern Medical Center researchers have been able to design genes to create artificial proteins. The researchers have discovered a set of simple "rules" that nature appears to use to design proteins. By feeding these rules into a computer program, they were able to obtain a sequence of artificial genes. These genes were then inserted into laboratory bacteria, producing the artificial proteins as expected."

29 of 180 comments (clear)

  1. That's nothing by Mateito · · Score: 5, Funny

    I've been creating proteins by hand since I was 12!

  2. We're learning the words... by FlyByPC · · Score: 3, Funny

    Hopefully we'll figure out what not to say while learning the grammar, style, and syntax of this new language. It's a bit worrying -- but this really has a lot of potential, I think!

    --
    Paleotechnologist and connoisseur of pretty shiny things.
  3. I hate to turn this into a flamewar so soon, but.. by kyle90 · · Score: 5, Insightful

    Isn't one of the reasons that creationist use when they attack evolution (actually abiogenesis) is that it would take such a long time to generate functioning proteins through random chance that it would be statistically impossible? If there are simple "rules" to create proteins, maybe that's how nature was able to come up with life so quickly.

    --
    Real_men_don't_need_spacebars.
  4. Re:Tinfoil hat, but... by Anonymous Coward · · Score: 5, Funny

    Good lord, what if people had other unique markings that could be tracked... finger/palm prints, DNA, retinas... now THAT would be scary.

    Oh wait.

  5. Name that film... by moviepig.com · · Score: 5, Funny
    From the article: The real test will be to put [the new proteins] back into fruit flies...

    "Hel-l-l-p me-e-e-e-e..."

    --
    Seeing bad movies only encourages them. Watch responsibly
    1. Re:Name that film... by ScrewMaster · · Score: 3, Informative

      Nope. From the original movie "The Fly" with David Hedison.

      --
      The higher the technology, the sharper that two-edged sword.
  6. Creating artificial drugs by Khyber · · Score: 3, Informative

    Well, we know we've been able to modify DNA to produce insulin from bacteria.

    We've got bacteria that crap out metal wires (Can't remember if we discovered them or made them)

    Now where's the bacteria that will make substances like xanax or other drugs, so it can make the entire market cheaper and more affordable to those who need it but don't have insurance, and "naturally" at that? (Naturally as in not needing a buttload of power from a processing plant for the drug and wasting energy uselessly)

    --
    Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
    1. Re:Creating artificial drugs by k98sven · · Score: 5, Informative

      Now where's the bacteria that will make substances like xanax or other drugs, so it can make the entire market cheaper and more affordable to those who need it but don't have insurance, and "naturally" at that? (Naturally as in not needing a buttload of power from a processing plant for the drug and wasting energy uselessly)

      Um.. news flash: Drugs have been made that way for years.

      But first: This works for proteins such as insulin. Most drugs are not proteins, however.

      And for those who are, there is nothing about it which necessarily makes it cheaper or less power-consuming. Bacteria need food. Bacteria need to be kept warm. And most importantly, you've got to seperate and purify your drug from the bacteria and growth substrate and whatnot.

      Of course, for proteins you've got no choice. It's practically impossible to synthesize proteins using conventional chemistry. And it's very very difficult (and likely uneconomical) to use bacteria to produce other organic compounds. So these things are complimentary to eachother, really.

  7. Re:I hate to turn this into a flamewar so soon, bu by salemnic · · Score: 4, Informative

    I think you might be using backwards logic here. TFA states that by examinig 100 proteins they were able to notice some standard common things about the proteins they were looking at. When they made rules around those common things they could make new proteins.

    It's like having 100 pieces of example code to look at before trying to create your own, not generating the code from nothing.

    s

  8. Re:I hate to turn this into a flamewar so soon, bu by haluness · · Score: 5, Interesting

    And the answer to this could be that a lot of rules have been randomly tried out. It turns out that the rule(s) we are seeing/discovering are the ones that lasted - and if they are simple they are probably efficient in some way.

    The creationist/ID policy is to avoid facing unknowns by passing the buck onto a designer. In the current example, just because something appears elegant and simple to some person, it does'nt mean that it could not have naturally occured.

    Our jobs, as scientists, or in the more general case, as people with a scientific temperement, is to uncover how or why this simple and elegant thing is the way it is - not to say, 'It's too tough, lets pass the buck onto the designer'!

  9. Let's not get ahead of ourselves here.. by k98sven · · Score: 5, Insightful

    The researchers believe they may have found a set of statistical rules for determining the tertiary ('overall') structure of proteins from the sequence.

    (Although the summary reads otherwise, creating a 'new' protein with an arbitrary amino acid sequence isn't new at all though. )

    If this pans out, it is of course significant towards the goal of engineering 'new' proteins one day. But there is still a lot to be covered. Even if the relationship between sequence and structure were simple and known (and it isn't, yet), you still have the issue of relating structure to function.

    Which isn't known. And of course, even knowing the structure and function of a single protein doesn't mean you know what it's going to do in a complicated environment such as a cell, where there are thousands of things to interact with.

    It's a step forward, nonetheless. But if someone thinks this means we're going to be tricking-out living organisms with new custom-engineered proteins anytime soon, you'll be disappointed.

    1. Re:Let's not get ahead of ourselves here.. by clem · · Score: 5, Funny

      It'd be as liberating as developing software!

      We were liberated? Does that mean I can go home now?

      --
      Your courageous and selfless spelling corrections have made me a better person.
  10. Almost there! by superub3r · · Score: 4, Interesting

    Artificial proteins! YES! One step closer to Artificial steak!

  11. Information Theory Usages? by Frumious+Wombat · · Score: 4, Interesting

    Can this be used for information compression in any way? After all, it was discovered about 20 years ago that simple fractal equations gave shapes very much like ferns. This could give you a shorthand way of compressing the genome of an organism, then making comparisons.

    It would also, of course, be interesting if you could use this to work backwards through the genome to a set point, and (hypothetically) bring back the Auroch.

    Personally, I want to see how this deals with metal incorporation at the active site, and whether their selection rules work for that as well.

    --
    the more accurate the calculations became, the more the concepts tended to vanish into thin air. R. S. Mulliken
    1. Re:Information Theory Usages? by ScrewMaster · · Score: 4, Funny

      Yes, a protein computer operating in meat-space would hold a lot of advantages over silicon ... until your dog got hold of it. Still, it would make a great excuse for not doing your homework, "I'm sorry, Miss Smith ... my dog ate my computer."

      --
      The higher the technology, the sharper that two-edged sword.
  12. I only have one question. by phxhawke · · Score: 4, Funny

    How long before gcc supports this new instruciton set? :p

    1. Re:I only have one question. by Alsee · · Score: 3, Funny

      Oh, GCC has supported this instruction set for almost 7 years now. It's just that no one has ever written any documentation for using it.

      -

      --
      - - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
  13. Link to Nature article by JuliusSu · · Score: 5, Informative

    Full text of article, institutional/personal subscription required.

    Abstract: Classical studies show that for many proteins, the information required for specifying the tertiary structure is contained in the amino acid sequence. Here, we attempt to define the sequence rules for specifying a protein fold by computationally creating artificial protein sequences using only statistical information encoded in a multiple sequence alignment and no tertiary structure information. Experimental testing of libraries of artificial WW domain sequences shows that a simple statistical energy function capturing coevolution between amino acid residues is necessary and sufficient to specify sequences that fold into native structures. The artificial proteins show thermodynamic stabilities similar to natural WW domains, and structure determination of one artificial protein shows excellent agreement with the WW fold at atomic resolution. The relative simplicity of the information used for creating sequences suggests a marked reduction to the potential complexity of the protein-folding problem.

    From this page : a WW domain is the smallest, monomeric, triple-stranded, anti-parallel beta-sheet protein domain that is stable in the absence of disulfide bonds, cofactors or ligands.

  14. Stupid article by MillionthMonkey · · Score: 3, Interesting

    Why can't these articles include any meaningful information? They refuse to tell you what they're about.

    Earlier research has shown that for a given group of related proteins, or protein family, all family members share common structures and functions.

    What would be an example of a "protein family" in this context? Filamentous? Membrane associated? Globins? Antibodies? No idea. "Common structures and functions" could mean several different things.

    By examining more than 100 members of one protein family, the UT Southwestern group found that the proteins share a specific pattern of amino acid selection rules that are unique to that family.

    This tells us nothing that isn't already known. Of COURSE proteins with related functions share specific patterns of amino acid selection rules or they wouldn't work. WHAT sort of selection rule did this group actually find?

    "What we have found is the body of information that is fundamentally ancient within each protein family, and that information is enough to specify the structure of modern-day proteins," Dr. Ranganathan said.

    He sounds like he's talking to a little kid.

    He and his team tested their newly discovered "rules" gleaned from the evolutionary record by feeding them into a computer program they developed. The program generated sequences of amino acids,

    and how did it do this?

    which the researchers then "back-translated" to create artificial genes.

    i.e. they did a trivial replacement of single amino acid letters with three letter codons in silico, then generated the corresponding DNA sequence.

    Once inserted into laboratory bacteria, the genes produced artificial proteins as predicted. "We found that when isolated, our artificial proteins exhibit the same range of structure and function that is exhibited by the starting set of natural proteins," Dr. Ranganathan said. "The real test will be to put them back into a living organism such as yeast or fruit flies and see how they compete with natural proteins in an evolutionary sense."

    Translation from stupid-articlese: in vitro the translation products of the artificial DNA folded into shapes similar to wild type proteins. I think.

    One can only assume that these guys chose proteins that don't undergo post-translational modification.

    1. Re:Stupid article by QuantumG · · Score: 5, Insightful

      In two papers appearing in the Sept. 22 issue of the journal Nature, Dr. Rama Ranganathan, associate professor of pharmacology, and his colleagues detail a new method for creating artificial proteins...

      That's the sum total of useful information in the article. Go read the full paper in Nature if you want to know more. Scientific reporting at its finest. Now and then I read an article where a "journalist" actually understands what has been written and has something profound to say about it that the scientists themselves didn't even think of (and actually agree with). Unfortunately it's increasingly rare these days. Even rags like Scientific American seem to do more puff pieces and press releases than well researched articles these days.

      --
      How we know is more important than what we know.
  15. Re:Tinfoil hat, but... by xanthines-R-yummy · · Score: 5, Insightful

    Most proteins eventually degrade, if they are not immediately destroyed by the immune system (ie, antigenic). Furthermore, for proteins that don't degrade quickly, how would you detect these proteins? Other than putting radioactive isotopes (try getting on an airplane with that in today's environment!), I don't see how you would detect them other than strapping someone down and getting some blood. I suppose you could always try a gene therapy technique to continually express protein, but gene therapy is still highly experimental and presents its own problems. This sounds way more complicated than just implanting inorganic RFID chips/beacons/whatevers under the skin or in a (cough!) body cavity.

  16. And this is news because? by DrCJM · · Score: 3, Interesting

    If that was all they'd done I find it difficult to see how this differs from doing a multiple sequence alignment for a family of proteins, then making a gene for the consensus sequence.

    Checking the paper (and related News and Views article) in Nature itself (http://www.nature.com/nature/journal/v437/n7058/i ndex.html ) (subscription required) indicates they've done more than that. By including the effects of coevolution - where one position in the protein mutates in concert with another to maintain optimal contacts - they generate a substantially better algorithm for manufacturing particular folds. (ie: 25% success in achieving folding versus 0% for conservation alone. 60% presence of wild-type function in the 'designed' proteins.)

    Interesting, but I'm suprised it made it into Nature. (OK then, jealous...)
  17. An interesting idea, but by Nutty_Irishman · · Score: 3, Interesting

    While I realize this news seems fascinating to some individuals, it is not something so entirely new that people in Computational Biology would consider it groundbreaking. Using the computer algorithms to generate new gene sequences is actually just a matter of running the gene finding algorithms you used to find these genes backwards (in fact, many people have been testing their gene finding algorithms by using their old algorithms to generate pseudo test sets). The only thing new about this paper is that people actually went forward and experimentally validated their results. An interesting find, however, the end result does not provide a huge leap to science.

    Now, if people are really interesting in some neat ways of reengineering genes back onto themselves, then they should take a look at some of the work being done with synthetic circuits. The beauty of synthetic circuits is that since you already know how the genes will function, it's just a matter of setting the circuit up in the fashion that you want so that it produces the end result that you want. There really is no limit to what you can do with synthetic circuits (of course, researchers have a long way to go before they master and understand all the regulatory mechanisms). For example (and these are all very theoretical examples): building a cell circuit to release a drug into a body in a very time released fashion (and perhaps autonomously renewing, for example, building a circuit to release insulin into the body given the sugar level of the individual), designing a circuit to recognize and destroy tumors (or perhaps an even simpler form of designing a circuit to recognize and fluorescently label tumor cells in the body helping in removal/early detection). Of course, one could also build quite malicious synthetic circuits as well. For example, a circuit that would aggregate to the wall of the heart and, after a certain number of other cells accumulate, triggering a signal to all the malicious cells and destroy the heart in unison.

    The other nice advantage of synthetic circuits is that the more we learn out regulatory mechanisms in species, the more we can use them for synthetic circuits. The more we use them for synthetic circuits, the more we understand about how exactly the underlying mechanism works (what causes them to break, how do they deal with differing toxic environments, etc). It creates a nice feedback loop with the progression of science.

    There will come a day where it will be useful to generate new DNA/Proteins in combination with synthetic circuits, but, as noted in a previous post, we don't understand the relationship between protein sequence and structure/function enough for it to be a viable option (and this is just with how the protein folds, we haven't even gotten in to the problem of gene regulatory structures-- multiple gene splicing, chromosome structure elements, binding motifs, translational regulation, etc). In fact, this area is something we probably want to venture into as it provides us with an even finer control over the rate constants for synthetic circuits. But for now, the generation of randomly generated genes based on prior genes will go overlooked for quite some time.

  18. if anyone is interested in the algorithms used by afodor · · Score: 5, Interesting
    We published a series of papers evaluating correlated mutation algorithms, including SCA, which is the algorithm used in this pair of Nature papers. I haven't had a chance to look closely at the two new papers, but we found that SCA performed rather poorly when compared to other algorithms that calculate covariance from a multiple sequence alignment. SCA has a troubling tendency to assign high scores to pairs of columns of a multiple sequence alignment that have random sequence in them.

    PDFs of our papers, and Java code implementing 4 different correlated mutation algorithms including SCA, are at my web site:

    http://www.afodor.net

    The references are:

    Anthony A. Fodor, Richard W. Aldrich. On Evolutionary Conservation of Thermodynamic Coupling in Proteins. JBC 279(18):19046-19050, 2004

    John P. Dekker, Anthony Fodor, Richard Aldrich and Gary Yellen. A pertubation-based method for calculating explicit likelihood of evolutionary co-variance in multiple sequence alignments. Bioinformatics 20:1565-1572, 2004

    Anthony A. Fodor and Richard W. Aldrich. Influence of Conservation on Calculations of Amino Acid Covariance in Multiple Sequence Alignments. Proteins 56(2): 211-221, 2004

    The last paper contains a comparison between SCA and three other correlated mutation algorithms.

    As I said, I haven't had a chance to look carefully or critically at the new papers. (It takes me a LONG time to read a paper critically :-> This Slashdot thread will be likely long archived before I finish thinking about these papers!). But this particular algorithm aside, people who are interested in bioinformatics and contact prediction may find the math behind the correlated mutation algorithms interesting.

    Anthony

    Email: anthony.fodor(remove this and put in an at symbol)gmail.com
    http://www.afodor.net/

  19. Re:I hate to turn this into a flamewar so soon, bu by TuneShark · · Score: 3, Insightful

    The creationist/ID policy is to avoid facing unknowns by passing the buck onto a designer. In the current example, just because something appears elegant and simple to some person, it does'nt mean that it could not have naturally occured.

    I don't think that's quite correct. My understanding is that ID examines a result using statistical or logical tools to see if it could have occurred by chance. It's not a subjective test. A statistical abberration indicates some outside influence. A collection of pre-existing conditions that all must be met at once (and not a step at a time) indicates some outside influence.

    Glib oversimplified statements about ID will only come back to haunt you someday when we realize those ID guys were onto something - even if they don't quite have it all figured out yet.

  20. Re:I hate to turn this into a flamewar so soon, bu by shawb · · Score: 5, Interesting

    Why are there symbiant relationships? It allows for division of labor, essentially. The genetic load of one organism after symbiosis does not have to take care of these certain task that the other is taking care of. Most of the cells contained in your body are not actually yours. The majority of cells in the body are bacteria living in your intestine which each produce proteins which help with digestion. If our DNA had to encode for every one of those digestive and metabolic proteins that are actually used in digestion, we would be selected against compared to an organism that could make more efficient use of its DNA.

    Diversity also leads to a sort of long term stability. If there are different ways to obtain resources, the ecosystem as a whole can adapt to environmental changes far more gracefully.

    --
    I'll never make that mistake again, reading the experts' opinions. - Feynman
  21. Re:I hate to turn this into a flamewar so soon, bu by indifferent+children · · Score: 3, Insightful

    I dunno, GP might be right. The fact that a Scientist *can* drop a rock and have it hit the ground, probably means that rocks *cannot* fall off mountains without the aid of some Intelligent Dropper.

    --
    Censorship is telling a man he can't have a steak just because a baby can't chew it. --Mark Twain
  22. A cure for psoriasis by infinite9 · · Score: 3, Interesting

    I've been on enbrel for six months now to treat my psoriasis (think lizard man from lepar island, not just crusty elbows). It's a protein that I have to inject twice a week since taking it orally would end with my body digesting the meds. I could see genetically engineering a bacteria that could live in the intestine and produce the medicine. That would be awesome.

    --
    Disconnect your television. Do your own research. Draw your own conclusions. They're probably lying. Don't be a sheep.
  23. For the benefit of the slashdot crowd by soren.harward · · Score: 5, Informative

    Since protein engineering is my field of study, for the benefit of the /. crowd (and my karma) I'll fill in the gaping holes left in the New Scientist article, and give you a little more background on the Nature paper. Because the writeup on /. is a perfect example of "scientific telephone": a semi-interesting result gets written up into a paper, which once it's been through several layers of editors suddenly seems like a major breakthrough.

    The Nature paper isn't a breakthrough. It's not even really a major advance. Scientists in my field have been creating artificial proteins for five to ten years now. And yes, even some of them designed completely from scratch (though they're really simple; nothing as complex as, say, ATP synthase) instead of just taking a known fold pattern, known as a "motif." The "WW domain" (domain, in protein parlance, is a small, independent structure within a much larger protein---think of it like a module within the kernel or Apache) is a common fold in hundreds of different proteins. Basically, they analyzed the sequences of all of these WW domains, and figured out which positions were meaningful. It's kinda like reading through some code in a programming language you don't know, and figuring out which lines are comments and which lines are actual compilable code. This group found that the number of interesting positions is small, that they could identify them just from the amino acide sequence instead of having to mess with the whole complicated 3D structure of the domain, and that if they put together a protein with the meaningful amino acids intact and the non-meaningful positions randomized, then in many cases they could still get a pretty decent protein (in terms of structural similarity to the "natural" protein) out of it. Most of the paper is devoted to showing via various methods that they did get a pretty decent protein.

    So what does this mean for me, assuming that this paper is absolutely correct (which I admit is a little hard for me to determine with one quick reading, given that I'm just a first-year grad student)? It means that the number of meaningful amino acids in a protein (at least in terms of overall structure) is pretty low, and that they can be identified without knowing what the full 3D structure is. This is good, because for a lot of proteins, the 3D structure is difficult to get. However, they picked an easy target: a small domain where there are over 100 unique sequences known. We'll see how well this method holds up with longer domains and fewer unique sequences. The S/N ratio won't be nearly as good.