Distributed Computing and the Human Genome Project

Distibuted computer projects... OT by famfurnell · 1999-11-28 16:59 · Score: 2

Are they really worth the effort? RC5 and SETI are both successful, but both of them (in essence) require a permanent connection to the net in order to get the best updates etc. (if ya get what I mean). As this was a UK-based thing, why not send the whole lot around on a CD?

Re:Distibuted computer projects... OT by Kingpin · 1999-11-28 17:30 · Score: 3

All this could be done so much more easily. Use applets: people do not have to understand anything at all in order to help out on a project like this. No need to install obscure clients and what have you. I think the only good use of applets is for easy distributed computing.

--
Unable to read configuration file '/bigassraid/htdig//conf/14229.conf'
Geocrawler error message.
Re:Distibuted computer projects... OT by Anonymous Coward · 1999-11-28 19:41 · Score: 0

> Are they really worth the effort? RC5 and SETI are both successful,
In terms of processing vast chunks of data, I suppose they are: SETI hasn't found intelligent life yet though. A distributed effort on behalf of the Human Genome Project seems more useful to me than looking in vain for signs of ET signals...
Re:Distibuted computer projects... OT by Drog · 1999-11-28 21:39 · Score: 1

With all due respect, it is WAY too early to say that we've been looking "in vain" for signs of ET signals. So far, the SETI@Home project has only been looking for candidates. It is in the next phase that they begin to search just those candidates for repeat events. To quote from their October 22 announcement:

> SETI@home has now accumulated more than 100,000 years of computer time, more than any other computing project in history! We have recorded over 85 million "candidate signals" (spikes and Gaussians) in our database, and we're preparing to start the second phase of analysis, which will search these candidates looking for "repeat events".

As jw3 has posted already, the human genome project is not well suited to distributed computing. SETI, on the other hand, is perfectly suited. Even though we may wind up with nothing to show for it, I think the project is still worthwhile because unlike other distributed computing projects (such as finding prime numbers, which is cool but won't actually benefit society in any way), the potential rewards of finding an ET signal are, quite literally, unimaginable.

Imagine if someone suddenly gave us all the knowledge we ever wanted. Unlimited energy. Anti-gravity. Faster-than-light travel. These are the potential rewards of SETI and SETI@Home. Still think it's not worth looking?

--
Looking for political forums? Check out "The World Forum".
Re:Distibuted computer projects... OT by Steeltoe · 1999-11-29 02:34 · Score: 1

If I were an ET, I sure as hell wouldn't give the human race access to all those things. First we need to learn to coexist peacefully and happily. We can't fix our problems by just fixing the symptoms, i.e. lack.

- Steeltoe

--
http://www.debunkingskeptics.com/
Re:Distibuted computer projects... OT by larkost · 1999-11-29 02:39 · Score: 1

There are a couple of problems here, both of scale. The first is that the Human Genome Project simply has too much data to ever be put on a distributed network. The data involved at each HGR center is in the terabyte range; there is no way that even one center's data could be put onto CDs and distributed that way.
The second problem is that, unlike SETI@Home's data, you cannot break this data up into relatively small packets for processing. The work that is currently being done at the computing centers is one of trying to fit the scraps of data together like a jigsaw puzzle. Imagine the futility of randomly mailing 100 of your friends pieces of a puzzle. There is a chance that a few of them could put together fragments, and help you solve the puzzle that way, but a much more effective way of doing this would be to bring all the participants together, and have them all work on the same huge bin of pieces. This is what they are doing right now. The project is not one that takes very complicated and processor-intensive calculations; rather, the data needs to be massaged en masse, terabyte by terabyte!
And then, once the baseline human genome is pieced together, you start to figure out what all these genes do (the real work in this project). Then you start to pair up real people with the genes that they have, and putting individuals' names or even reference numbers on that data would be a serious invasion of privacy!
Summary: the Human Genome Project will never make a good candidate for distributed processing, for both practical and privacy reasons!
Re:Distibuted computer projects... OT by Anonymous Coward · 1999-11-29 08:11 · Score: 0

I think applets (or more precisely Java) are the way to go. I think the portability advantages in a distributed environment outweigh the performance problems. I also think it may be possible to process the data "en masse" using JavaSpaces. JavaSpaces is Java's answer to distributed computing; the ideas have actually been around for a long time in distributed systems (see Linda systems). Workers can write to the space, read or take from the space, and be notified of space events. Transactions and rollback are supported. Instead of just data being manipulated, the entries in the space are objects. All the infrastructure needed is an internet connection and a 1.2 JVM.
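
To make the write/read/take vocabulary concrete, here is a minimal sketch of a Linda-style space. It is a toy, in-process analogue written in Python, not the JavaSpaces API: the `TupleSpace` class, the `gc_worker` function, the `("task", id, fragment)` tuple layout, and the use of `None` as a wildcard are all assumptions made for illustration, and transactions and event notification are left out.

```python
import threading


class TupleSpace:
    """Toy Linda-style space: write, read (non-destructive), take (destructive)."""

    def __init__(self):
        self._entries = []
        self._cond = threading.Condition()

    def write(self, entry):
        # Add an entry and wake up anyone waiting for a match.
        with self._cond:
            self._entries.append(entry)
            self._cond.notify_all()

    def _find(self, template):
        # A template matches an entry of equal length; None is a wildcard.
        for entry in self._entries:
            if len(entry) == len(template) and all(
                t is None or t == e for t, e in zip(template, entry)
            ):
                return entry
        return None

    def read(self, template, timeout=None):
        # Return a matching entry without removing it, or None on timeout.
        with self._cond:
            found = self._cond.wait_for(
                lambda: self._find(template) is not None, timeout
            )
            return self._find(template) if found else None

    def take(self, template, timeout=None):
        # Remove and return a matching entry, or None on timeout.
        with self._cond:
            found = self._cond.wait_for(
                lambda: self._find(template) is not None, timeout
            )
            if not found:
                return None
            entry = self._find(template)
            self._entries.remove(entry)
            return entry


def gc_worker(space):
    """Hypothetical worker: takes tasks until none are left, writes results."""
    while True:
        task = space.take(("task", None, None), timeout=0)
        if task is None:
            break
        _, task_id, fragment = task
        space.write(("result", task_id, fragment.count("G") + fragment.count("C")))


space = TupleSpace()
space.write(("task", 1, "GATTACA"))
space.write(("task", 2, "GGCC"))
gc_worker(space)
```

Because `take` removes the entry it returns, several workers can pull from the same space without two of them getting the same task, which is the property that makes the model attractive for handing out work.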

perhaps this will be a wake-up call by SEAL · 1999-11-28 17:00 · Score: 5

Patents, in general, have really taken a nose dive since the personal computer achieved widespread use. The original intent of a patent was to allow an inventor to come up with an idea and protect it for a period of time. Whether he profits from it or sits on it is then up to that inventor.

However, with the computer age, the speed of (dare I say) innovation has been astounding. This has produced two detrimental effects. First, the patent examiners simply don't have the niche expertise to scrutinize patents. I'm sure most of us have seen some of the idiotic patents out there. Second, the time span of a patent has become too cumbersome. By the time the patent expires, the invention is often useless.

I sincerely hope that this particular project will be placed under a HUGE spotlight when the patent requests inevitably filter in. I have a feeling it won't hold up, and at the very least, not in some countries.

However, keep in mind that this is scientific information about a human being, not software / computer advances. In that regard, a patent will be cumbersome, but not quashing. The patent (if granted) WILL expire someday. And I'm fairly certain that the information will still be very important and valuable when that day arrives.

Of course I'm all for beating the would-be patenters to the punch, if possible.

Best regards,

SEAL

Re:perhaps this will be a wake-up call by Courier · 1999-11-28 17:43 · Score: 1

I am all for beating THE patenters.
Re:perhaps this will be a wake-up call by PG13 · 1999-11-28 18:10 · Score: 2

>The patent (if granted) WILL expire someday

Technically yes, and the same thing could be said about copyright. Except the industry which holds copyrights has gotten extremely powerful. An interesting trend is that whenever the original Disney copyright for Mickey Mouse etc. is about to expire, the copyright term is extended (yes, for both new copyrights and old copyrights).

This extension of copyright clearly serves no public benefit (these works have already been created, so retroactively extending the copyright doesn't encourage the production of new works) and yet it is enacted! If the biotech industry became large enough, such a scenario is possible (though less likely because of competition within the industry).

For further information about the Copyright Term Extension Act and efforts to fight it, visit copyright commons

--
Marriage is the "pseudo-ethics" that cloaks the messy truth of sexuality in the raiment of propriety -- it's "Don't Ask,

Money by talldark · 1999-11-28 17:01 · Score: 1

More importantly - will there be money on offer?

:)

Sure this is ridiculous by Bouglou · 1999-11-28 17:01 · Score: 1

It's just as if the first man who took a photograph had said:

Now this is MY moon. You can't take photographs of it anymore.

But I wonder in which countries such patents could be valid. For instance, in Europe we are having a discussion about the possibility of patenting algorithms, which has not been possible in the EU until now. I hope ONLY you Americans are allowing companies to patent a thing like our genome.

--
Fetchez la vache !

Sick by Anonymous Coward · 1999-11-28 17:03 · Score: 0

I find the idea that someone can patent human DNA knowledge sickening. That someone can control the use of my own DNA is horrible. Patents have gone mad, this is just sad.

Re:Sick by wangi · 1999-11-28 17:41 · Score: 1

> I find the idea that someone can patent human DNA knowledge sickening. That someone can control the use of my own DNA is horrible. Patents have gone mad, this is just sad.
They cannot control the use of your DNA: you would still be quite able to pass on your DNA (or merge it) by having children.
What other use do you have for your DNA?
The Drug Empires are just looking to guarantee a return on their investments. It sucks, but that's Big Business. I hope the HGP beats them to the key genes/patents.
Re:Sick by Anonymous Coward · 1999-11-28 17:41 · Score: 0

You're right, with this kind of patent, some company might ask you for a huge amount of money because you copied the information contained in DNA they have patented (for instance, when you make kids)...
If somebody manages this stunt, I think the guy can retire to Mars right away; he'll be able to pay NASA's bills for the next millennium...
Re:Sick by Anonymous Coward · 1999-11-29 00:22 · Score: 0

"What other use do you have for your DNA?."

Genetic illness. What if a person can't get a treatment because someone has a patent on that gene which prevents development of a treatment? My DNA, but no cure because of a patent.

Another Distrubuted Project by Keefesis · 1999-11-28 17:03 · Score: 1

I would like to see another distributed project on this. If it gets the same publicity as the SETI program it could have wonderful results. The only problem I could see is that the community's distributed-willing resources are already stretched to the bone. I would say this does take precedence, eh? I don't want no stinkin' corporation owning my genes!

Another thing, you don't need a permanent connection to run distributed. I run SETI@home on a computer with a dialup that goes on about once a week. It works wonders! Cheers :)

Hold on. The seq's can't be patented. by reve · 1999-11-28 17:05 · Score: 3

Okay, before everyone hops on this really popular anti-patent train, let's make sure we note that the sequences can't be patented. Yes, independent companies are gonna beat out the human genome project and have been filing patents. But the patents aren't on the sequences themselves, they're on applications. Whether these applications have to do with more efficient methods of genome-unraveling or whether they have to do with specific uses of the patterns they've found, it's NOT the actual sequences.

In a number of countries it's already quite specifically illegal to attempt to put intellectual property restraints on anything involving human genes. The US is considering some laws as well, but let's just get all the facts straight before panicking, okay?

--
-- r . m o s q u i t o --

Re:Hold on. The seq's can't be patented. by Anonymous Coward · 1999-11-28 17:12 · Score: 0

Once they've cracked a gene, patenting every obvious application of it is only a matter of money. At that point, the rest of our society is well and truly screwed.
Re:Hold on. The seq's can't be patented. by _Marvin_ · 1999-11-28 17:27 · Score: 4

Of course the seq's themselves can't be patented. Otherwise anyone holding such a patent would be (AFAIK) entitled to control the reproduction of the sequences; that is, since we are constantly reproducing them in our bodies, he could charge us for letting us live...
Now, this would make patent law a satire just too obviously.
Still (again, AFAIK, correct me if I'm wrong), patents on gene sequences (that is, their applications) have a new quality: they do not cover only the applications that the patent holder has thought of, they cover all applications that become possible only if you know that gene sequence.
If I remember it correctly, there are already cases where companies hold patents on certain proteins in our bodies (again, not the proteins themselves but any of their applications) and you are not allowed to TEST for these substances without paying them license fees, even if you're using a completely new testing method you developed on your own.

--
"We won't use guns, we won't use bombs, we'll use the one thing we've got more of and that's our minds" - Pulp

This can't be open source! by ghoti · 1999-11-28 17:08 · Score: 5

Well I don't think anybody will say "No, let's not do it, let the big bad corps patent our genes!!".

The only problem I see here is that developing a distributed client for this takes a lot of time and effort --- and it is one which definitely cannot be open-source!

Two reasons:

False results. If the data format etc. are known, it's possible to feed the servers bogus results, which could lead to inconsistencies in the data base. This might even destroy results that are already there (okay, this problem also exists with closed source stuff like SETI@Home, I know).
Data Theft. An open source program could be modified by Big Bad Corporation Inc. to simply harvest raw data and feed it into their own computers, thereby gaining information they would otherwise have to find themselves. Granted, they won't have as much computing power, but when they have their own and the stolen data, they're still saving time. And I am not sure if enough data is produced to keep hundreds of thousands of computers occupied (see the problems SETI@Home had in the beginning).

So, sorry, folks, but I believe this is one of the few things that open source clearly is not suited for. But it would be kinda cool to have a proggy running on my machine that messed with genes ... ;-)

--
EagerEyes.org: Visualization and Visual Communication

Re:This can't be open source! by flux · 1999-11-28 17:25 · Score: 2

False results can be handled easily: just submit the packet to two different places, or to 1.5 places on average, and if they disagree, the system checks the packet by itself (or hands it over to a third machine). Yes, it'll slow things down, but I can't see any other viable alternative.

Data theft... Isn't the idea that the data is already there, but it needs to be processed? There's no point in data theft then. Also, the system could watch for domains or IP address spaces that keep eating the data space faster than anyone else and blackhole them... Or sue them :).
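
A minimal sketch of the double-checking scheme described above, under some assumptions: each worker is modelled as a plain Python callable, `trusted_compute` stands in for "the system checks the packet by itself", and `recheck_probability` is a made-up knob (0.5 gives the "1.5 places on average" case). None of these names come from a real client.

```python
import random


def verify_packet(packet, workers, trusted_compute,
                  recheck_probability=1.0, rng=random):
    """Return a result for packet, cross-checking it some of the time.

    Needs at least two distinct workers whenever a recheck happens.
    """
    first = rng.choice(workers)
    result = first(packet)

    # With probability (1 - recheck_probability) accept the result unchecked.
    if rng.random() >= recheck_probability:
        return result

    # Otherwise hand the same packet to a different worker and compare.
    second = rng.choice([w for w in workers if w is not first])
    if second(packet) == result:
        return result

    # The two disagree, so at least one is wrong: recompute on a trusted machine.
    return trusted_compute(packet)


def honest_a(packet):
    return sum(packet)


def honest_b(packet):
    return sum(packet)


def liar(packet):
    return -1
```

Note the limits of the sketch: two workers that return the same wrong answer would still agree with each other, and anything accepted without a recheck is only as good as the worker that produced it.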
Re:This can't be open source! by Lars+Arvestad · 1999-11-28 17:31 · Score: 4

> Data Theft. An open source program could be modified by Big Bad Corporation Inc. to simply harvest raw data and feed it into their own computers, thereby gaining information they would otherwise have to find themselves. Granted, they won't have as much computing power, but when they have their own and the stolen data, they're still saving time. And I am not sure if enough data is produced to keep hundreds of thousands of computers occupied (see the problems SETI@Home had in the beginning).

The Human Genome Project is extremely open. They try to make all data public as soon as possible, making patents impossible. So data theft is not an issue here.

False results might be a problem, but I would expect it to be relatively cheap (computationally seen) to check a solution to see if it is valid.

A distributed (open source) effort will probably not happen because a computation like this is more difficult to distribute than trying crypto keys etc.

Lars

--
Reality or nothing.
Re:This can't be open source! by ianezz · 1999-11-28 17:36 · Score: 2

> False results. If the data format etc. are known, it's possible to feed the servers bogus results, which could lead to inconsistencies in the data base

Send the same data to multiple receivers (randomly chosen), and see if they produce the same results (or, at least, coherent ones). If not, one (or possibly more) of them is lying. Anyway, a closed-source client does not prevent someone from seeing what it does and sending bogus data anyway. It only makes things harder for the ones that actually want to send correct data.

> Data Theft. An open source program could be modified by Big Bad Corporation Inc. to simply harvest raw data and feed it into their own computers

This is a more realistic issue, but Big Bad Corporation is probably rich enough to reverse-engineer the protocol by itself, and access random lumps of raw data anyway. A closed-source client doesn't make much sense here.

The real point is that modified versions (e.g. to improve performance) could quickly spread so that only a few people use the original client.

If it suddenly turns out that a widespread modified version produces erroneous data from time to time, then probably a large amount of computation has to be thrown away. Of course, you could check for that using the same method you use to check for "bad guys", but it's a serious problem if you have only a few people running the original.

My 0.02 Euro as usual.
Re:This can't be open source! by ghazban · 1999-11-28 18:07 · Score: 2

Plus there is the double dilemma of having the company send bogus results, sabotaging the project, and also using the program to their advantage to add to their databases.
Re:This can't be open source! by John+Allsup · 1999-11-28 18:56 · Score: 2

I get the feeling that the patterns are significantly harder to find than to verify.

This would make false data less of a problem (since it would merely act like any other flooding DoS attack).
John

--
John_Chalisque
Re:This can't be open source! by fpepin · 1999-11-28 20:19 · Score: 2

There is also a slight problem of the practicality of having a distributed client. The problem here isn't really a matter of brute force.

You need to sequence the gene first. This is the long and costly part if I remember well.

The computing power is used mainly to see the similarities with other genes already discovered (in humans and in other species). Here you need more of a huge database holding all the information as you simply search for matches and near matches in the sequences.

I'm not sure it would be very useful to have a distributed client for this. And for myself, I'd rather wait a few more years and be sure that I can trust those results.
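
For a feel of what "search for matches and near matches" against a big database looks like, here is a rough sketch using a k-mer index (every length-k substring of every stored sequence points back to the sequences containing it). This is a generic illustration, not the method any genome centre is known from this thread to use; `build_kmer_index`, `rank_matches`, and the sample gene names are hypothetical.

```python
from collections import defaultdict


def build_kmer_index(database, k):
    """Map each length-k substring to the names of the sequences containing it."""
    index = defaultdict(set)
    for name, seq in database.items():
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].add(name)
    return index


def rank_matches(query, index, k):
    """Rank stored sequences by how many distinct query k-mers they share."""
    hits = defaultdict(int)
    for kmer in {query[i:i + k] for i in range(len(query) - k + 1)}:
        for name in index.get(kmer, ()):
            hits[name] += 1
    # Most shared k-mers first; ties broken by name for a stable order.
    return sorted(hits.items(), key=lambda item: (-item[1], item[0]))


database = {"geneA": "ACGTACGTGA", "geneB": "TTTTGGGGCC"}
index = build_kmer_index(database, 4)
```

The index has to cover the whole database before any query can be answered, which fits the point above that this workload wants one large shared store more than raw distributed CPU time.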
Re:This can't be open source! by Nyarly · 1999-11-28 22:32 · Score: 1

> You need to sequence the gene first. This is the long and costly part if I remember well.

This is silly. The sequencing is the computer-intensive part. You need to take the chunks you've got and attempt to line up the overlaps to make larger chunks. So passing out a hundred or a thousand or however many sequence chunks to be compared, with good matches returned, would be an excellent way of doing things. And it would have the side effect of publishing results; necessarily you'd be handing out copies of the longer sequences you've established, until everyone who wants it has the whole sequence. (Although, I think it's kind of large, bitwise. :)
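
The "line up the overlaps to make larger chunks" step can be sketched as a toy greedy merge. This is only an illustration under simplifying assumptions (error-free fragments, exact suffix/prefix overlaps, all on the same strand), not how production assemblers work; `overlap_length` and `greedy_assemble` are made-up names.

```python
def overlap_length(left, right, min_overlap):
    """Length of the longest suffix of left that is also a prefix of right."""
    for size in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return size
    return 0


def greedy_assemble(fragments, min_overlap=3):
    """Repeatedly merge the pair of fragments with the largest overlap."""
    pieces = list(fragments)
    while len(pieces) > 1:
        best = (0, None, None)
        # Compare every ordered pair; each comparison is independent work.
        for i, left in enumerate(pieces):
            for j, right in enumerate(pieces):
                if i != j:
                    size = overlap_length(left, right, min_overlap)
                    if size > best[0]:
                        best = (size, i, j)
        size, i, j = best
        if size == 0:
            break  # nothing left that overlaps enough
        merged = pieces[i] + pieces[j][size:]
        pieces = [p for idx, p in enumerate(pieces) if idx not in (i, j)] + [merged]
    return pieces
```

The individual `overlap_length` calls are the part that could be handed out as work units, since each one only needs two fragments. Picking the best merge, on the other hand, needs the results for all pairs in one place, which is the "one huge bin of pieces" objection raised earlier in the thread.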

--
IP is just rude.
Is there any torture so subl
Re:This can't be open source! by thogard · 1999-11-29 04:40 · Score: 1

But it is open source....
Kind of...
Some of the code that is still widely used was written at OU (Okie Uni) in FORTRAN back before '85, and that was and still is open source. Most of the modern versions seem to have been just ports of the old stuff.

Prior Art? by JohnG · 1999-11-28 17:11 · Score: 2

Hmmm, does anyone else think God (or Allah or Odin, or the Great Bannanarama, or whoever your supreme being is) will have a problem with these big companies patenting His invention?

Re:Prior Art? by dylan_- · 1999-11-28 18:01 · Score: 2

> Hmmm, does anyone else think God (or Allah or Odin, or the Great Bannanarama, or whoever your supreme being is) will have a problem with these big companies patenting His invention?
Yes, he does. Unfortunately the Other Guy has all the lawyers.... :-)
dylan_-

--
Igor Presnyakov stole my hat
Re:Prior Art? by wocky · 1999-11-28 18:34 · Score: 2

You mean there are no lawyers in heaven?

--
David
Re:Prior Art? by redhog · 1999-11-28 18:49 · Score: 1

Wouldn't it be a really good idea to tell the religious right about that? Perhaps there'll finally come something good out of them after all!

--
--The knowledge that you are an idiot, is what distinguishes you from one.
Re:Prior Art? by molog · 1999-11-28 23:28 · Score: 1

> Hmmm, does anyone else think God (or Allah or Odin, or the Great Bannanarama, or whoever your supreme being is) will have a problem with these big companies patenting His invention?
As a matter of fact, I'm pretty darn angry about it.

--
So Linus, what are we going to do tonight?
The same thing we do every night Tux. Try to take over the world!
Re:Prior Art? by JohnG · 1999-11-29 02:13 · Score: 1

Hey, yeah, somebody get me Jerr... Uh, never mind, the thought of actually agreeing with Jerry Falwell on anything gives me shivers. Better keep this from them. ;)

Patenting genes? by Anonymous Coward · 1999-11-28 17:11 · Score: 1

How on earth would you justify patenting a gene?
It has been around for a _very_ long time. You may as well patent some newly discovered subatomic particle and charge everyone who uses it. Or maybe an even better comparison would be a patent on a microscope vs. patenting everything you can see through it. This is insane.

Ciao, Peter (still without ./ Password :)

Use of patent by Anonymous Coward · 1999-11-28 17:12 · Score: 1

From what I understand the patents apply to potential usage of the sequence rather than just the sequence itself. Unlike the systematic sequencing approach employed by the genome project the "grab and patent" companies often target potentially interesting genes (eg, receptors, particular classes of enzymes) by "fishing methods" such as degenerate PCR of EST libraries.

Patents are a sore point in molecular biology if the companies choose to prosecute those that appear to infringe them. The classic case is the patenting of PCR by Cetus/Perkin Elmer/Roche as the companies made open threats against academic institutions. This was especially sad as the PCR patent was extremely shaky as there is clear evidence of prior art.

Re:Use of patent by Anonymous Coward · 1999-11-28 19:25 · Score: 0

Damn, 3rd try to post this. BUS Error. Netscape SUCKS :( OK, patent genes? Or the use of them? Let's look at that. Suppose I suffer from some kind of genetic disease. It's recessive, so I inherited a defective gene from both my father and my mother. I don't want to pass this on to my children, so instead of 'producing' children the traditional way we go to a specialized doc who splices the offending gene out and uses the thus produced sperm (yes, I'm male, now you know, my GOD!) to fertilize my wife. Fine. Now the gene spliced in (or that was used to replace the defective one) is patented. I paid for it (or maybe an insurance or public health system did). What happens to the child? Does he have to pay for every child he or she has? Maybe he has to pay every time he has sex. Or does he belong to the corporation that has a patent on a part of his genome?

Patents - just a few ideas by CormacJ · 1999-11-28 17:15 · Score: 2

Even if we did have a distributed effort and made advances, someone would still have to patent the discovery.

As we have seen with Y2K fixes and other things, making a discovery does not stop someone else patenting the idea.

An open source body would have to be set up to patent the discoveries just so that nobody else could patent them.

This body can declare their patent open for use.

There are a lot of legal issues here: if you open your patent too much, could you lose it?

Patent law is also a case of boilerplating your patent: you have to ensure that every option is covered and also included in the patent.

This sort of thing is costly, and this is why a lot of companies patent their ideas. Once they have the patent they recoup their investment, and then some.

If an open source patent body is set up, a lot of time will have to be spent considering patent administration and the costs involved.

Re:Patents - just a few ideas by Null_Operator · 1999-11-28 20:20 · Score: 1

Who's to say that if a person (or organization) planning applications for the public turned in a patent application, it wouldn't "disappear mysteriously"? (The patent offices aren't free from greed.) I'm not saying that ALL of the corporations out there trying to patent these applications are bad, because I don't know the whole story as to why each company wants to patent them. (My guess still remains money, but you never really know.)

--
May the source be with you \0perator

DeCode Genetics by lawn_ornament · 1999-11-28 17:20 · Score: 2

I live in Iceland, and here there is the company DeCode Genetics. They are building a huge database with the medical history of every Icelander in it, to be able to trace "bad genes".

The funny thing is, they're a privately owned company and still they are entitled to go through all your medical records at will and put them in a database.

Sure, they say it'll be secure, but what if they start selling info on you to insurance companies?
Imagine this:
you: Hi, I'm (some name) and I'd like life insurance.
insurance rep.: Well... I'm sorry... it's gonna cost you (insert obscene amount here) because your family has a record of heart failures.

These are just my thoughts... check it out for yourself, I think this has made it to most news media in Europe and America. Also check out www.ie.is

--

---
Killroy Woz Here

Re:DeCode Genetics by PG13 · 1999-11-28 18:19 · Score: 1

I understand this company is giving kickbacks or the promise of free use of the results to Icelanders in return for this right.

If they start selling your info to insurance companies they breach a contract they had with your government and you can probably throw them in jail just like you could if your doctor started selling your medical records.

The insurance rep issue just really isn't unique. Eventually some insurance company will begin offering extremely low insurance if you DON'T have a history of heart disease in your family, and these people will be more than happy to hand over records to prove this. Eventually competition will drive the price of insurance for people who don't open their records to insane values. Eventually the solution will have to be either a) let some people die (bad idea) or b) government-guaranteed health care

--
Marriage is the "pseudo-ethics" that cloaks the messy truth of sexuality in the raiment of propriety -- it's "Don't Ask,
Re:DeCode Genetics by Lars+Arvestad · 1999-11-28 18:44 · Score: 1

But aren't the medical records anonymized before DeCode can make use of them? I guess it would be possible for them to deduce whom the data belongs to in some cases, since they have access to a near-full family tree for Icelanders, but it would in the end be quite obvious that they had done something illegal in that case, wouldn't you think?

The main problem in my mind is that they have been given exclusive rights to this data. That is really giving away a gold mine.

Lars

--
Reality or nothing.

HGP almost completed; also, NIH computers? by The_Messenger · 1999-11-28 17:20 · Score: 2

I was privileged enough to actually speak with one of the NIH (National Institutes of Health) scientists working on the project earlier this year. He came to speak at our school Medical Society. Being the geek that I am, I made sure to inquire as to the Y2K compliance of the computers used for analysis and data storage; alas, he wasn't involved in that aspect. ;-) He said he "thought they were", though.

If I remember correctly, and there have been no delays, it's supposed to be finished before 2002.

I tried to tape the whole question and answer session with my microcassette recorder, to put on my webpage (in RealAudio format), but he was against it. Oh well. (I would have tried to sneak it anyway from the back of the room, but my recorder has a crappy mic, so I wouldn't have gotten much by doing so.)

The whole concept is very cool... imagine being able to prevent disease on a genetic level...

Does anyone have any information on the computing systems being used? Come on, there have to be a few NIHers reading /.! ;-)

This is slightly off-topic, but has anyone else heard about this "Soul Catcher" project, which I think is based mainly in the UK? (Based on the concept of recording an entire human consciousness to a traditional physical medium, if I remember correctly.)

--
I like to watch.

Re:HGP almost completed; also, NIH computers? by Anonymous Coward · 1999-11-28 19:08 · Score: 0

As far as I can tell (I did a bit of a course at the Cambridge, UK HGMP-RC) they are using big Unix boxes. We were using them from a load of SGI workstations via telnet and that GCG program... Whoa. It could use a bit of tweaking. Still, I wrote a tutorial for it, so that's OK. Laterz J
Re:HGP almost completed; also, NIH computers? by imac.usr · 1999-11-28 21:17 · Score: 2

>Does anyone have any information on the computing systems being used?
> Come on, there have to be a few NIHers reading /.! ;-)

I work as a Macintosh support tech over at NHLBI (the National Heart, Lung, and Blood Institute) and interviewed recently for a position over at NHGRI (I didn't get it mainly due to non-competition agreements between the federal contractors who supply NIH). Like any good geek, I asked about the machines in use on the project. Apparently, while some processing is done here in Bethesda, a lot of it is done at other sites (universities and such) on Unix boxen, although my interviewer wasn't sure of the specific platform. At the institute itself there's a fairly large number of Macs used for graphic analysis of the data and both Macs and Wintel PCs for basic stuff like writing papers and reports.

I can tell you NHGRI is pretty well funded within NIH, right up there with the cancer institute and the infectious disease institute (which deals with things like AIDS and whatnot). They certainly have more translucent Macs than any other institute. :-]

And yes, they do use Linux there, although from what I gather, it's mostly being used by individuals experimenting with the system, and not for any actual rendering/mapping of gene data. Coincidentally, I took my first Linux support call a couple of weeks ago from somebody here who installed Caldera 2.2 and needed help setting up networking. Got him set up in only minutes, and soon he was enjoying NIH's 300kbps-and-up network connection. Makes watching MacWorld keynotes a lot more viable.

If you check the Netcraft records for NHLBI, NIDDK (National Institute of Diabetes and Digestive and Kidney Diseases), and NHGRI, you'll see that NIH is far from your typical NT government shop. Plus, the NHGRI main website has lots of info on the project and why it's a Good Thing.

BTW, slightly off-topic: there are 12 people in my support group, and of those, I'm the only full-time Mac tech, while two others are mostly PC techs with some Mac skills. Oddly enough, the PC people are always busier than me despite having roughly the same number of machines to support.....

--
I use Macs for work, Linux for education, and Windows for cardplaying.
Re:HGP almost completed; also, NIH computers? by Anonymous Coward · 1999-11-28 21:22 · Score: 0

The sequence data is generated mainly on Applied Biosystems 377 and 3700 laser-based DNA sequencers, with most of the work, though not all, being done at the "G5", the big genome centres, including the Sanger Centre in the UK, Baylor, Wash U and a couple of others in the States. 377s are driven by Macs; 3700s run on Windows NT. Initial downstream analysis is often done on Macs (esp. at the smaller centres), then huge Unix boxes try to piece it all together. Many of these are running Linux; I have to say that Linux and Macs are popular in biological science (thankfully). At the most recent meeting of the American Society of Human Genetics, Francis Collins, Director of NHGRI, promised a working draft of the genome by spring NEXT YEAR!! Basically the race is on with the big bad pharma companies; governments (esp. UK and US) have thrown money at the genome centres.
Re:HGP almost completed; also, NIH computers? by Bryan_K · 1999-11-29 00:24 · Score: 1

I knew a guy who was/is involved with the human genome project. At the time (about a year or so ago), the place he was working at was using a cluster of Alphas for their work with the project. I gathered that at least some of these were running Linux. They may also have been using Digital Unix or another Unix variant on some boxes. I don't know the details of the machines or exactly what they were doing with them.
Re:HGP almost completed; also, NIH computers? by bluets · 1999-11-29 22:30 · Score: 1

In some places within NIH, they have Beowulf clusters installed. I think they have a series of high-end PCs that get refreshed every 2-3 years, with the outgoing boxes becoming desktop machines.

Sex = piracy? by vaxer · 1999-11-28 17:27 · Score: 2

You heard it here first -- intellectual-property idealists will revive a grand tradition by copying their genes without a patent license. Someone will print "Information wants to be free love" on a black T-shirt, and all around the world, geeks will go out into the streets and protest WIPO and the genome barons by having sex. With themselves, mostly, but hey -- it's the thought that counts.

Software Patents in EU by Anonymous Coward · 1999-11-28 17:28 · Score: 2

Since when did they start allowing software patents here ?

If they are indeed allowing them then they are restricting my freedom of expression. My programs are my art (I don't and won't write them for money). Patenting software is like patenting the golden ratio in paintings.

I don't know if you've ever really considered this, but not all software is about money; in my case it's mostly about creativity and art (all non-profit, well, except intellectual profit). In case you want to know, I mostly write sound synthesis and processing software and the field is very heavily patented. What artist makes paintings and doesn't share them with other people ? Imagine if you couldn't show a painting to other people if you painted it with a certain brush unless you pay license fees. This is what software patents are to me and probably many others. They need to be stopped now (or at least make non-profit use legal) !

AC

This was my idea. by michael.creasy · 1999-11-28 17:32 · Score: 0

I submitted an idea like this to slashdot just a few days ago. Completely my idea.

Re:This was my idea. by jimmyCarter · 1999-11-28 17:51 · Score: 1

Your T-shirt is in the mail.

That's what I love about them high-school girls. I get older, they stay the same age... yes they do.
--Wooderson 1976

--

-- jimmycarter
Re:This was my idea. by troc · 1999-11-28 17:54 · Score: 1

I actually submitted this a few weeks ago, but with the huge amount of submissions, things tend to take a while to filter through the system :)

I've had some email from Ewan Birney at ensembl about doing this but it seems they lack experience of client coding! I personally know nothing about that at all; I'm a bender of metals and I can just about write HTML on a good day. If anyone has any help to offer, you could visit their webpage....... I've not added his email address in case he's paranoid, but I can forward stuff to him :)

Cheers

Troc

--
Troc's dubious podcast and blog: http://www.trocnet.net
Re:This was my idea. by ewanb · 1999-11-28 19:26 · Score: 1

Thanks troc - just got around to reading this
comment.

I have sort of appealed at the top to people
to come along. People seem more interested
in writing about patents than getting down to
nuts and bolts of course....;)

If there is anyone out there who would like to
do this coding, as sure as hell I don't know how
to do it ;). But I know what to run...

Open Source Genome Projects by ewanb · 1999-11-28 17:34 · Score: 5

There are some good open source genome projects for doing this efficiently - and we do welcome help of any kind. Here are some open source projects which I know about/work on:

ensembl is an open source genome project designed to get as much data and software into the public domain as possible
EMBOSS
bioperl

All these are well backed, strong open source projects with different strengths. Every time genome stuff comes up on slashdot I try to point these things out to people, but everything gets lost in the noise about people $%!"'ing on about patents (generally without a lot of knowledge!).

Anyway - check out these projects for more information about real open source efforts in biology.

Re:Open Source Genome Projects by bluets · 1999-11-29 22:39 · Score: 2

In evolutionary biology, where we are focusing on reconstructing the tree of life, there are actually very few programs that are licensed under the GPL or the LGPL. There is *one* program (Paup, being distributed with manual by Sinauer) upon which most evolutionary biologists depend that has been in beta testing for 6+ years. With a 30 day expiration built into the binaries (of course, source code is not distributed). The author refuses to license the code under the GPL or the LGPL or any other type of open source licensing scheme. Where I work, we have a cluster of linux systems for this tree of life reconstruction - they are sitting mostly idle because the most recent beta of this program expired last January. The next beta is not even likely to have PVM or MPI support. Anybody want to do some programming for me? :)

Utter BS. by jimmyCarter · 1999-11-28 17:50 · Score: 1

The whole genome sequencing patent race has me mightily pissed off. But as someone mentioned a couple of weeks ago, maybe it's going to take something as ridiculous as this to bring the entire patent-granting operation to its knees.

I think the metaphor used was of a guy walking into a forest and patenting every different type of tree he came upon. Another favorite is the Amazon.com "one-click" patent. That's like Henry Ford whipping up a nice car and patenting the tires!

-not responsible for spelling errors-

That's what I love about them high-school girls. I get older, they stay the same age... yes they do.
--Wooderson 1976

--

-- jimmycarter

TIGR, HUGEP and genomics by jw3 · 1999-11-28 17:53 · Score: 5

Hello, my name is January and the group in which I am doing my Ph.D. thesis sequenced a bacterial genome (Mycoplasma pneumoniae) in 1996. Since we are into genomics, transcriptomics and all the other -omics, I know at least a little about the way it works - although on a much smaller scale.

First issue: could distributed computing help? My answer is a brief "no". First, the bottleneck is on the experimental side - getting the sequences, and not putting them all together. Second, although you need quite a lot of computing power to do so, much of the job must be revised and checked by humans, i.e. there is a lot of skilled manual work to do - you have to have "an eye" for the sequences. But the first point is more important.

Now, TIGR, the commercial alternative to the Human Genome Project, has sequenced more organisms than any other scientific group in the world. J. Craig Venter seems to be a very efficient and hard-working guy. Even if you don't like the idea of making money with patents in this area, the scientific community owes him a lot - he was the one to sequence the first organism, to sequence Helicobacter pylori and many, many others. On the other side... you know, when the M. pneumoniae sequence was about to be published, it was supposed to be the first Mycoplasma sequence. But Venter was faster with Mycoplasma genitalium - and he kept it quiet, so no one involved in sequencing those organisms actually knew there was a race. Now Venter claims to be able to complete the human genome with much less effort and much less $$, and considerably faster than the HuGeP. I'm not sure whether he is able to do so or not, because it depends chiefly on the "hardware" side - the new Perkin Elmer automated sequencers they are supposed to use.

Anyway, the question is whether it is good or bad if Venter sequences the human genome. In my opinion - it's OK. The HuGeP is somewhat different in its purely scientific interest, and I'm convinced that they will produce data of much higher quality. On the other hand, the human genome has considerable variation, so two genomes are better than one. I would not be very concerned about the patent issue, because it will come anyway (because of **!'*%$! American and international patent law) - even if TIGR did not sequence the genome, someone would take the output of the HUGEP project and patent the same sequences Venter would. Venter just wants to gain a little time for evaluating the sequence before releasing it to the public.

And of course, not the _sequences_ are patented - what is patented, is the usage of modification of a certain sequence for medical purposes, or a certain enzyme as an aim in medical treatment.

Regards,

January

Re:TIGR, HUGEP and genomics by kovi · 1999-11-28 18:56 · Score: 1

Hi,

>And of course, not the _sequences_ are patented -
>what is patented, is the usage of modification of
>a certain sequence for medical purposes, or a
>certain enzyme as an aim in medical treatment.

So, what you are telling me is if I'd like to use patented
sequence for non-medical, "basic science" purpose, I don't
even have to ask patent holder for permission? This is as
far from the reality as it could be. I know _SEVERAL_ examples
where people were not permitted to use such patented strings
of AGCT because _their_"basic science"_results_ could
possibly affect revenue of the patent holder or be used by
others to "overcome" patent claims. How good is that?
IMHO patents on the sequences will definitely slow down
progress in the basic research.

Regards,
kovi
Re:TIGR, HUGEP and genomics by Phil-14 · 1999-11-28 19:54 · Score: 1

The examples you cite are violations of patent law; one is always supposed to be able to use results like that for basic research.

--
(currently testing something about signatures here)
Re:TIGR, HUGEP and genomics by jw3 · 1999-11-28 22:39 · Score: 1

Please, give me the examples.
Look, imagine someone patented the modifications of the xxx gene as a target for gene therapy of early-onset Alzheimer's. Of course I cannot try to develop a gene therapy using this sequence; but I can clearly use the sequence for scientific purposes, i.e. researching Alzheimer's disease.
Maybe what you mean is patenting genes which have been artificially modified, e.g. sequences of transgenic enzymes used for research purposes (all those "TM" polymerases and such). In that case the sequence is patented, because it was developed by the company selling it. Since you usually have no access to such sequences, it should not be a problem.
Another idea - if a company sequences a gene, they can keep the sequence and you may not use it if you somehow get your hands on it (I think). But no one can forbid you to sequence the gene for yourself - in fact, that is the case for many bacterial genomes (E. coli has been sequenced several times by different teams, but only one or two sequences were published).
I am no lawyer, so I just present you my general idea of how the things work. I'm not at all sure whether I am right or not.
Regards,
January
Re:TIGR, HUGEP and genomics by donhav · 1999-11-29 00:38 · Score: 1

The two projects are, I believe, going about the whole thing in two very different ways.

HUGO is sequencing the entirety of the human genome in a slow traditional manner. This does not need excessive computing power.

TIGR on the other hand is sequencing ONLY the genes of the human genome. They are using a shotgun approach which involves sequencing bits at random and using computing power to match up all the little bits. They need lots of computing power and would be helped by a distributed computing effort.

TIGR also get to use all the info from HUGO but not the other way around.

Of course working out what all those bases mean needs more processing power than we currently have on the planet. And hopefully distributed networks will start up that try to do protein folding and gene searching and the like.
Re:TIGR, HUGEP and genomics by kovi · 1999-11-30 05:04 · Score: 1

No pro, here is one:
Early 90s, first ESTs. Some seqs from a maize EST library were patented by X from BigCountry (I will not mention the company name here; I don't have money for lawyers), even though EST stuff is usually not full-length sequence. In some other country, a friend of mine worked his way through getting N-terminal sequences of some glucosyl O-transferase (UDPGlu dependent, working on phenols). After some boring work (degenerate primers & PCR) he managed to get a full-length clone. I did the expression system in yeast and some enzymology. Then we had a poster at some meeting; a regular paper was on the way. Unfortunately, at that meeting someone from company X saw our poster. They told us the gene had been picked up as an EST and was patented, and that the company was working on some applications of it. Then we received a note from the company lawyers, then our lawyers responded. Then we were asked to postpone publishing our results for some time.
In science, "some time" usually means never, so the paper never appeared in any scientific journal. I still have a copy of it, if you'd like to read about phenol metabolism in plants.
X produced some herbicide-resistant lines of tobacco a few years later.

Regards,
kovi
(now working on Arabidopsis and poplar - smaller genome and less importance for agriculture)

... by Skinka · 1999-11-28 17:57 · Score: 1

I might be stating the obvious, but this really depends on how much bandwidth is needed, call it some kind of "IO/MIPS-ratio". Three kilobytes worth of keyblocks from distributed.net will keep my computer occupied for two or three days. SETI@home, I've understood, needs a lot more bandwidth, something like 100KB/day depending on the CPU (I've never tried SETI, correct me if I'm way off).

I have no idea as to how much IO these DNA-strand calculations need, but I would be more than happy to ditch d.net and donate my spare CPU time to this project if it is feasible.
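
A rough way to put numbers on that "IO/MIPS ratio", using only the figures quoted above (3 KB of d.net keyblocks lasting two to three days, and the unverified guess of 100 KB/day for SETI@home). The function name and the 2.5-day midpoint are assumptions made for illustration, not figures from either project:

```python
def bytes_per_cpu_day(bytes_transferred: float, cpu_days: float) -> float:
    """Data a client must download for each day of CPU time it contributes."""
    return bytes_transferred / cpu_days


# Figures quoted in the post; 2.5 days is an assumed midpoint of "two or three days".
dnet = bytes_per_cpu_day(3 * 1024, 2.5)
seti = bytes_per_cpu_day(100 * 1024, 1.0)

print(f"d.net:     {dnet:10.1f} bytes per CPU-day")
print(f"SETI@home: {seti:10.1f} bytes per CPU-day")
print(f"SETI@home needs roughly {seti / dnet:.0f}x more data per CPU-day")
```

A genome client would sit somewhere on the same scale; the lower its bytes-per-CPU-day figure, the better it suits dial-up volunteers.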

They cant do it. by Anonymous Coward · 1999-11-28 18:03 · Score: 0

Didn't Levis get there first?

warm and fuzzy by counsell · 1999-11-28 18:05 · Score: 5

It's good that hackers are well-informed and principled enough to think it matters. This happens to be my area of interest; I'm responsible for Bioinformatics at the Institute of Cancer Research in the UK. A couple of weeks back I went to an excellent talk by a clever guy called Ewan Birney from the Sanger Centre near Cambridge, UK. He is writing code to catalogue and annotate the assembled sequences in real time as they come off the mammoth robot sequencing "production line". On one of those rare occasions where the British are leading a "big science" project the Centre has been responsible for the largest fraction of the Human Genome sequenced at any single institute. The code does stuff like figure out which bits of the sequence are real genes and which bits are that 90%+ of so-called "junk DNA" you might have heard of and also attempts to assign provisional functions to the genes by various computational means. Eventually people in white coats will have to confirm such assignments properly, but it's important to beat the drug companies to making good guesses.

Ewan's code and all the data are entirely Open Source. If you've got a good reason and a reasonable Pentium with lots of memory and a 30Gb hard disk you could mirror the human genome and get it updated every night. (I feel strange just typing that sentence and I've been following this story for years). The Wellcome Trust and others (including US and European government agencies) funding the project are keeping everything Open because that's the way science is done and because this will subvert commercial attempts to stake a claim on our species' genetic heritage. (Er, go Wellcome!)

Biochemists often talk about the "rate limiting step" in a reaction---the single point which sets the speed of the whole process---like a bottleneck. As far as I understood Ewan's talk (if you're reading this Ewan, please put me right), the rate-limiting step with the Genome Project isn't the assembly of the sequenced stretches of DNA (or "contigs") as the original poster suggests, but the collection of the data in the first place. At the Sanger they have clusters of PCs and Alphas crunching the contigs---distributing the effort would give us all a warm fuzzy feeling, but wouldn't be essential. Again, I may be wrong about this.

One thing that definitely is a priority is making some sense out of all of this information. What would be great would be if members of the global community of hackers started taking molecular biology and biochemistry classes so they could write code to help people like me make sense of the embarrassment of riches that the project is creating. I'm off to Cambridge in two weeks to the Bioinformatics Open Software Development meeting to listen to some project leaders talk and discuss the existing efforts. Personally, I would love to give crash courses in biology to programmers with time on their hands in an effort to harness their collective genius rather than sponsor an effort to write a contig-crunching client to harness their collective spare cycles, but I have no idea how such a thing could be organised. Any ideas?

Re:warm and fuzzy by ewanb · 1999-11-28 18:13 · Score: 4

Counsell -
Great that you were following the talk. I thought I put everyone to sleep.
The rate limiting step at the moment is effectively the mapping in fact, then sequencing. The interesting thing about the analysis is that the amount of CPU is unbounded. If we have more CPU we just use more accurate algorithms. We can do something within the CPU bounds on the hinxton campus, but if anyone wants to give me a super computer, then we could get more accurate analysis.
I can always use more juice!
Re:warm and fuzzy by Anonymous Coward · 1999-11-28 18:49 · Score: 0

So would there be any scope for implementing this type of analysis as a distributed computing project..? Also, what about using this approach for data mining of the known sequence data? - I suppose you could do quite a lot of complex pattern matching using a distributed project on the scale of Setiathome!
Re:warm and fuzzy by The_Messenger · 1999-11-28 18:52 · Score: 1

Ewan -

You seem to be the guy to ask... can you give us some specifics on the hardware involved? How can I get more info on the systems used for data gathering, cataloging, analysis, storage, et cetera, on projects like these? Even just some CPU generalisations would be wonderful... Drooling over supercomputers is a hobby of mine, see... ;-)

Thanks!

--
--
I like to watch.
Re:warm and fuzzy by ewanb · 1999-11-28 19:01 · Score: 2

Hardware at the moment is generally clusters of Alpha boxes or Intel boxes (running Tru64 or Linux respectively).
The two big drains on CPU for analysis are gene prediction (genscan) and database searching (blast). Database searching can't be distributed easily as you have to worry about the database ;)
However, there are programs like sim4, genewise and est2genome that could greatly help us and could be distributed.
Genewise (I wrote it) you can download at Wise2; est2genome is somewhere around as well.
For the more general overview of the problem - check out ensembl for an idea of the project.
Re:warm and fuzzy by ewanb · 1999-11-28 19:01 · Score: 1

Absolutely - see my reply to the post above yours.
Re:warm and fuzzy by Rumor · 1999-11-28 22:24 · Score: 1

Crash courses in bio for coders? Sounds cool, but how about the same thing for geneticists and biochemists who want to learn to code? Here's a question: what do you think would be more difficult, giving coders a quick, intensive lesson in biology to get them up to snuff for the sort of thing we're discussing, or trying to teach someone like me (a non-coding geneticist) how to code appropriately, as quickly as possible?
Re:warm and fuzzy by foop · 1999-11-30 07:44 · Score: 1

It's definitely easier to teach (the necessary, relevant) biology to a coder than to teach a biologist to code!

Difficult to distribute by Lars+Arvestad · 1999-11-28 18:06 · Score: 4

Common successful distributed projects in cryptography rely on the fact that all you need on a client is the algorithm and a few keys to try. Therefore, clients are really cheap (resourcewise) to distribute and use.

In the case of the Human Genome Project, the situation is somewhat different. A well known analogy is the following: Take a few copies of a newspaper. Feed it through a shredder. Remove a handful or two of paper. Insert errors. Now, piece together one copy of the original newspaper.

In order to make a useful contribution, a client is going to need a lot of data. This means that it will be difficult to distribute (long downloading times for instance) and that few people will appreciate having the client on the machine because the client will be using a lot of memory and the machine might be a bit unresponsive (your HGP screensaver might flush all your apps to disk for instance).
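
For anyone who wants to see the shredded-newspaper problem in code, here is a toy greedy overlap merge. It is only a sketch under generous assumptions: the fragments are error-free, nothing is missing, and there are no repeats. It is not what the genome centres run, and the names `overlap` and `greedy_assemble` are made up for this example.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of `a` that is also a prefix of `b` (0 if shorter than min_len)."""
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(fragments, min_len: int = 3):
    """Repeatedly merge the pair of fragments with the largest overlap."""
    frags = list(dict.fromkeys(fragments))  # drop exact duplicates, keep order
    # Drop fragments wholly contained in another fragment.
    frags = [f for f in frags if not any(f != g and f in g for g in frags)]
    while len(frags) > 1:
        best = (0, None, None)
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best[0]:
                        best = (k, i, j)
        k, i, j = best
        if k == 0:
            break  # nothing overlaps any more: several separate pieces remain
        merged = frags[i] + frags[j][k:]
        frags = [f for idx, f in enumerate(frags) if idx not in (i, j)] + [merged]
    return frags


shreds = ["the quick b", "ick brown f", "own fox jumps"]
print(greedy_assemble(shreds))
```

Note that every fragment is compared against every other one on each pass, so the client needs the whole pile of shreds in memory, which is the data problem described above.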

Lars

--

--
Reality or nothing.

Re:Difficult to distribute by ewanb · 1999-11-28 18:15 · Score: 3

Lars
This is only for the assembly and not for the analysis. With analysis you have a better data/cycles ratio. Assembly is done at the genome centres anyway...
Re:Difficult to distribute by Lars+Arvestad · 1999-11-28 18:35 · Score: 2

Ewan is a very informed and knowledgeable guy at one of the key centers in HGP, so he needs more moderation. Hey Ewan, go get more karma!

This is only for the assembly and not for the analysis. With analysis you have a better data/cycles ratio. Assembly is done at the genome centres anyway...

Then I don't get it. The original submission was about trying "to match the bits up with other bits like a giant jigsaw puzzle". Clearly this is about the assembly problem, no?

What kind of analysis would this be?

Lars

--

--
Reality or nothing.
Re:Difficult to distribute by ewanb · 1999-11-28 18:50 · Score: 2

I assume that the original poster did not understand what was going on ;). Like a lot of slashdot in this case - concerned but not knowledgeable.
Celera always talk about the assembly problem as they have Gene Myers solving it (he has) and think it is pretty cool. It is not trivial, but from my view (an annotation-centric view) not the most important thing.
Re:Difficult to distribute by troc · 1999-11-28 19:41 · Score: 1

I assume that the original poster did not understand what was going on ;). Like a lot of slashdot in this case - concerned but not knowledgeable....

More like, 'not very good at writing things down succinctly' I've spent so long writing my bloody PhD that I tend to add millions of extraneous words to everything I type. I'm also not a biochemist, just a humble materials scientist who makes high-pressure gas cylinders for a living.

:)
troc

--
Troc's dubious podcast and blog: http://www.trocnet.net
Re:Difficult to distribute by Anonymous Coward · 1999-11-28 22:25 · Score: 0

ewan> I assume that the original poster did not understand what was going on ;). Like a lot of slashdot in this case - concerned but
not knowledgeable....

troc> More like, 'not very good at writing things down succinctly'

There are an awful lot of biochemists/biologists who don't understand what is going on with this project either. I have what I would call an outline view and I am supposed to know what is happening..

The newspaper analogy was quite close.
Should be ..

Take 20 newspapers with a small degree of typos (say 1%).
Rip out and throw away a few pages from each newspaper at random.
Put them through a shredder.
Mix thoroughly.
Throw half of the pot away.
Now reconstruct a newspaper.

The human genome contains about 3.5 gigabases (that's 3.5 American billions to you). This is about the size of all the data that has been collected to date in the EMBL/Genbank/DDBJ archive.
It takes a desktop PC about 1 hour to search this data for a similar sequence of about 300 bases.
The algorithm is O(mn). (it takes a $2M 12 processor origin 2000 about 11 seconds)

Now try an all on all match.
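
To give a feel for what O(mn) means here, below is a minimal sketch of a local-alignment score in the Smith-Waterman style, which fills one table cell per (query base, database base) pair. The scoring values are arbitrary assumptions, and real search tools use heuristics on top of this, so treat the cell counts as a naive upper bound based only on the figures quoted above.

```python
def local_alignment_score(query: str, target: str,
                          match: int = 2, mismatch: int = -1, gap: int = -2) -> int:
    """Best local alignment score; visits len(query) * len(target) cells."""
    prev = [0] * (len(target) + 1)
    best = 0
    for q in query:
        curr = [0]
        for j, t in enumerate(target, start=1):
            diag = prev[j - 1] + (match if q == t else mismatch)
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best


QUERY_BASES = 300                # query length quoted in the post
DATABASE_BASES = 3_500_000_000   # archive size quoted in the post

single_search_cells = QUERY_BASES * DATABASE_BASES
all_on_all_cells = DATABASE_BASES * DATABASE_BASES

print(local_alignment_score("ACGT", "TTACGTTT"))
print(f"one 300-base search: {single_search_cells:.3e} cells")
print(f"all on all:          {all_on_all_cells:.3e} cells")
print(f"all on all is {all_on_all_cells // single_search_cells:,} times one search")
```

At the quoted one hour per search, the all-on-all case works out to over eleven million desktop-hours if it scaled naively, which is where the appetite for more CPU comes from.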

I have made a few jumps and inaccuracies but the idea is to give you a flavour. IIRC Sanger aim for an average oversampling rate of something like 30 fold to be reasonably sure of getting most of the DNA sequenced.
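
Why oversample at all? Under the usual idealisation that reads land uniformly at random (a Poisson model, which is an assumption of this sketch and not a description of how Sanger plans its runs), the chance that a given base is missed by every read falls off as e^-c for c-fold coverage:

```python
import math


def expected_uncovered_fraction(coverage: float) -> float:
    """Poisson estimate of the fraction of bases hit by no read at c-fold coverage."""
    return math.exp(-coverage)


GENOME_BASES = 3.5e9  # genome size quoted in the post

for c in (1, 5, 10, 30):
    frac = expected_uncovered_fraction(c)
    print(f"{c:>2}x coverage: {frac:.2e} of bases missed, "
          f"about {frac * GENOME_BASES:.2e} bases")
```

Real sequencing is not uniform (some regions clone badly or repeat), so actual gaps are larger than this model suggests.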

But clustering and getting the sequence is only the start of the beginning.
Now we have to work out what it means. If you think this is trivial (genefinding etc.) then you are welcome to join the effort. It is remarkably complex to find anything that makes real biological sense. The accuracy rate for gene predictions (just sequence, not including function, regulation or role) leaves some distinct room for improvement.

Ewan's game is large scale sequence crunching. Mine is somewhat different but related. (I am a biologist not an informaticist by training, but I pretend to be a bioinformaticist for my day job)

And all our efforts are open source;?)

..d
(EMBnet .. Distributed open source science since 1988)

Re: cycles/data by ewanb · 1999-11-28 18:07 · Score: 1

Bioinformatics generally has a very good cycles to data ratio - i.e. we have algorithms that take a lot of cycles for very little data. So it is feasible...

Does anyone want to write it? If so - I have a lot of CPU hungry algorithms to run.

Bottleneck is somewhere else... by Silicon_Knight · 1999-11-28 18:30 · Score: 1

I believe that the bottleneck is somewhere else - namely, getting the DNA, running PCR on it to amplify it, then cleaving the DNA into chunks, and then running the chunks through gel plates, and then getting the data on the chunks ...

From what I remember and understand about gene sequencing, the process is:

Running a PCR reaction. This induces DNA reproduction. You run this (mostly) by cycling the temperature the DNA is at in a special medium, and the DNA chains cleave and reproduce exponentially (1->2, 2->4, 4->8, etc.).

Cleaving at certain sequences. This breaks the DNA chains into chunks. The chunks are then analysed by gel electrophoresis. In the movies, when you see people hold up two films with bands on them, that's what they are doing - the chunks are of a certain size and migrate through the gel at a certain speed - and when the lines match up in intensity and location, that means the same concentration of a particular block is present in both the standard and the unknown.

Repeating the process, cleaving at different sites, until you have enough information about the "chunks" to reconstruct the sequences.

For a computer to match the chunks up, it's not that difficult. It's just like sorting arrays - not that much processor power is required. Storage, maybe, but not processor power.

The most time-consuming part is running the reactions, spotting the plates, running the plates, blah, blah...

-=- SiKnight

Re:Bottleneck is somewhere else... by greystone · 1999-11-28 19:26 · Score: 1

While I agree that the bottleneck is elsewhere, I do not think they use the procedure you mention. If some of them do cycle sequencing, remember that it is not exponential (unlike PCR). There is no reason to use a thermostable polymerase for anything else here, and I would bet they do 'old-fashioned' sequencing with T7. And if they use restriction enzymes I would bet on SauIII because it might be used to generate more or less random fragments. Others might be useless, because they are not working with plasmids! The sequencing method you mention seems something like Maxam-Gilbert, but you do not seem to remember it well. You do not need external standards (nor internal) for DNA sequencing, and they certainly use the Sanger dideoxy method (Maxam is usually used in footprinting experiments). Finally, nobody does radiographies on film now. You just use fluorescence detection in an automatic sequencing apparatus. I definitely agree with the conclusion: it takes more human-power than computer-power to complete the project.

Who makes drugs now? by lovebyte · 1999-11-28 18:38 · Score: 3

I, for one, don't like the idea of a private company owning my gene sequences. They will be able to limit the use of these so only really rich pharmaceutical companies will be able to develop drugs etc and then sell them at huge profits, which isn't really for the benefit of mankind blah blah blah.

This is an interesting statement. How do you think drugs are made now? Well, they are made by big pharma companies which make (often) a good profit. Drugs are not made for the benefit of mankind. They are made to make money.

When it comes to patenting the use of some genes, we should consider that:

patents are short lived.
A company has no interest in not using its patent. So for some money, other companies will be able to buy patents
patents don't stop anyone from working on whatever is patented. Lawyers always find ways to circumvent patents

On the subject of open source distributed computing for genome data, I am afraid I agree with other people here. There is simply too much data to download. It's a pity, but it won't work. Maybe in a few years time when the problems in genomics will have changed, other problems might be more suitable to this type of computations.

--

I'll do it for cheesy poofs.

Prior Art.... by FooGoo · 1999-11-28 18:44 · Score: 1

I declare myself prior art....use me as you will.

--
People who bite the hand that feeds them usually lick the boot that kicks them

Re:lawyers in heaven by radja · 1999-11-28 18:45 · Score: 1

Does that surprise you?

//rdj

--

No one can understand the truth until he drinks of coffee's frothy goodness.
--Sheikh Abd-Al-Kadir, 1587

I think it's technically unfeasible by kinkie · 1999-11-28 18:48 · Score: 1

All the various distributed computing efforts have something in common: while they all require extensive processing resources, they all need very little data to work on.
Basically they just need to synchronize: in the case of the famous distributed.net RC5-64 contest, they just need to decide who will try a given set of blocks.
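
To make the "very little data" point concrete, here is a sketch of how a key-search work unit can be described by just two integers. The `KeyBlock` type, the function name and the toy sizes are all hypothetical and are not the distributed.net protocol:

```python
from dataclasses import dataclass
from typing import Iterator


@dataclass(frozen=True)
class KeyBlock:
    """A work unit: try `count` consecutive keys starting at `start`."""
    start: int
    count: int


def split_keyspace(total_keys: int, block_size: int) -> Iterator[KeyBlock]:
    """Carve a keyspace into blocks that can be handed to different clients."""
    for start in range(0, total_keys, block_size):
        yield KeyBlock(start, min(block_size, total_keys - start))


# Toy numbers only: a 10-key space in blocks of 4.
for block in split_keyspace(10, 4):
    print(block)
```

The server only has to remember which blocks are handed out and which have come back; the client regenerates every key locally, so hours of work travel as a few bytes.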

Sequencing the DNA seems (I don't have the title to claim otherwise) to be not that much a matter of computing power, but of an immense dataset to work on.

It wouldn't be possible to distribute that much information in terms reasonable enough to make the effort worthwhile.

--
/kinkie

Re:I think it's technically unfeasible by ewanb · 1999-11-28 19:15 · Score: 1

There are aspects of the work which have
a good data/cycles ratio. (surprisingly).

I would read about the subject before you pronounce... ;)

DNA itself is prior art by Anonymous Coward · 1999-11-28 18:51 · Score: 1

They aren't inventing DNA at all; DNA technology
has been around forever. They are just
reverse-engineering something that already exists;
therefore shouldn't be allowed to get a patent.

Re:DNA itself is prior art by Anonymous Coward · 1999-11-28 20:00 · Score: 0

We could let a friendly person patent them before the corps. Setting the licensing terms as free for everyone, except `insert companies trying for patents here`.
We should just start registering patents for everything, until mickeysoft, and friends petition to abolish patents.
Anyone up for patenting the wheel?

Patents are anti-competitive by Morgaine · 1999-11-28 18:59 · Score: 2

I'm surprised that the US in particular hasn't done anything to reduce the most glaring anti-competitive aspects of patenting. Doesn't the free market lobby have anything to say on the topic?

Patents have always been intended to reduce competition for a limited period, so that inventors have an opportunity to bring their research to market during a sort of protected honeymoon period, but in practice that no longer works very well in the modern world. It's all to do with timescales: in the computer age and with instant global communications, timescales for everything are shrinking, and in some areas an advantage period for the patent holder of more than say just a couple of years is starting to become inappropriate, a restraint on progress, development and trade. Although it's impossible to tell what might have been, who knows which entire market sectors might have developed if their pivotal idea hadn't been tied down by patents.

Be that as it may, it's rare for a week to pass without totally ridiculous patents being highlighted here, and the analogy with icebergs definitely applies -- there's vastly more out there that we don't see on Slashdot. The whole area is clearly in utter shambles and needs urgent review.

A "fix" doesn't have to be complicated. As far as I can see, just three things are needed: a ban on patenting algorithms (as enforced elsewhere); a short, strict and non-extensible time limit (possibly related to the field, eg. default 2-3 years but longer in the nuclear power arena, for instance); and an informal "public review" system not unlike Slashdot, run by the patent office and used both to supply niche information and also to weed out the type of nonsense that translates into "how to breathe air".

But of course, something that simple could never come about, because otherwise patent lawyers would be out of a job. Oh well.

--
"The question of whether machines can think is no more interesting than [] whether submarines can swim" - Dijkstra

Why exactly would it help to patent this info? by jemfinch · 1999-11-28 19:06 · Score: 1

When I first read of this, I thought to myself "What exactly is the use in patenting the results of this research?" From the posts I've seen, it seems that companies intend to patent the information they discover about the human genome, which can then be used to create cures for diseases. However, if standard medical law prevails, there's no way they can deny a person access to the information necessary to save that person's life or to prevent his/her disease if that person cannot afford to pay for the information. Basically, just like an emergency room can't turn away people who can't pay, how could a company that patented a human genome withhold that information from people who can't afford to pay?

Jeremy

--
Looking for a Python IRC bot?

Re:Why exactly would it help to patent this info? by Lars+Arvestad · 1999-11-28 20:15 · Score: 2

However, if standard medical law prevails, there's no way they can deny a person access to the information necessary to save that person's life or to prevent his/her disease if that person cannot afford to pay for the information. Basically, just like an emergency room can't turn away people who can't pay, how could a company that patented a human genome withhold that information from people who can't afford to pay?

Hmm, sounds like a good point, but lawyers have probably worked this out already. After all, you can patent compounds that are used for various treatments, and this has been going on since before the discovery of DNA.

I read in a text on patenting that you cannot, for example, patent a surgical method, but you can patent a device that is basically necessary for the same surgical technique.

Lars

--
Reality or nothing.

GIMPS by Anonymous Coward · 1999-11-28 19:07 · Score: 0

Not always - the Great Internet Mersenne Prime Search (GIMPS) can be set up to contact "base" almost as infrequently as you like. A tiny amount of communication can generate weeks of work for the computer.
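
To make the "tiny amount of communication" point concrete, here is a minimal Python sketch of the Lucas-Lehmer test, the standard primality test for Mersenne numbers. The function name and the list of exponents are illustrative assumptions, not GIMPS client code, and a real client would use far faster multiplication. The point is only that the whole work unit is a single exponent p, while the loop performs p - 2 squarings of a p-bit number.

```python
def is_mersenne_prime(p: int) -> bool:
    """Lucas-Lehmer test: for an odd prime p, 2**p - 1 is prime
    exactly when s_(p-2) == 0 (mod 2**p - 1), with s_0 = 4."""
    if p == 2:
        return True  # 2**2 - 1 = 3 is prime; the recurrence below needs odd p
    m = (1 << p) - 1  # the Mersenne number being tested
    s = 4
    for _ in range(p - 2):
        s = (s * s - 2) % m
    return s == 0


# The "work unit" sent to a machine is just the exponent: a few bytes in,
# a yes/no answer out. (Hypothetical list of prime exponents to try.)
exponents = [3, 5, 7, 11, 13, 17, 19, 23, 31]
print([p for p in exponents if is_mersenne_prime(p)])
# -> [3, 5, 7, 13, 17, 19, 31]
```

For exponents in the millions the same loop runs millions of multi-megabit squarings, which is why one short message can keep a desktop busy for weeks.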

ac.uk

I do not think it is necessary... by greystone · 1999-11-28 19:08 · Score: 1

I do not think such huge computing power is going to be necessary. Human brain-power will be far more important IMHO.

Depending on the sequencing methodology used, there are different approaches to assembling the sequence. As far as I remember, the human genome has been cloned into YACs, which may hold some 1,000,000 base pairs each. If these are sequenced with a "shotgun-like" method, each would generate some 20,000 fragments of around 500 base pairs. The whole sequence would then be assembled by matching 'overhangs'. If sufficient fragments are sequenced this should not be any problem at all, something any desktop computer could perform.
Once all YACs are sequenced, they would be assembled into the 23 (+1?) chromosomes. This does not seem too difficult either.
I see two big problems:
1. Debugging. If they use standard sequencing methods, the error rate may be as high as 0.1%. How are they going to cope with this?
2. Sequencing telomeres or regions composed of repeats. This is going to be tricky.

My conclusion is: no distributed computing project is necessary to accomplish the task.

Unfortunately I have never participated in a coordinated sequencing project, and all of the above are just my personal views.
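
As a rough illustration of assembling by matching overhangs, here is a toy greedy assembler in Python. It is a sketch under strong assumptions: error-free reads, no repeats, all reads from the same strand, and no read contained inside another. The function names and the 20-base example are made up for illustration. Note that the figures in the post (20,000 fragments of 500 bp from a 1,000,000 bp clone) imply roughly tenfold coverage, which is what makes overlaps plentiful.

```python
def overlap(a: str, b: str, min_len: int) -> int:
    """Length of the longest suffix of a that equals a prefix of b,
    or 0 if no such overlap is at least min_len long."""
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(fragments: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of fragments with the largest overlap."""
    frags = list(dict.fromkeys(fragments))  # drop exact duplicates, keep order
    while True:
        best_k, best_i, best_j = 0, -1, -1
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            return frags  # nothing left to merge: these are the contigs
        merged = frags[best_i] + frags[best_j][best_k:]
        frags = [f for n, f in enumerate(frags) if n not in (best_i, best_j)]
        frags.append(merged)


# Hypothetical 20-base "clone" cut into three overlapping reads.
reads = ["TAGCCTAGGA", "ATGCGTAC", "TACGTTAG"]
print(greedy_assemble(reads))
# -> ['ATGCGTACGTTAGCCTAGGA']
```

The two problems listed above are exactly where this toy breaks: a single miscalled base destroys an exact overlap, and a repeat longer than a read makes the best overlap ambiguous.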

Oi.... gotta love capitalism by Anonymous Coward · 1999-11-28 19:13 · Score: 0

I would dump my rc5 client and try to get this client running on every box I could. Companies have gone too far: first they want our souls and now they are going for our bodies. Over the past few years I have grown to hate the economic system that exists in most countries today, even my own. Everyone wants to make a buck and some will do anything to get it. It is bad enough knowing that in a few short years I will be working full time someplace and expected to do anything and everything for a corporation, to the extent of sacrificing my personal life. Money doesn't make one a better person or truly better off. My father is successful and makes a decent buck, and I see what he goes through to make his money. To be honest it's not worth it. It changed him: more work, more money, and the more it destroyed his life at home, until he realized that the way he was acting was driving everyone insane and further away from him; all he would ever talk about was how much he was making. He still does talk about how much he makes, but not as much anymore. For me it is a good thing that he makes a decent wage. The thing is, I am not willing to corrupt myself. I like doing what I do; if I make a heck of a lot of money, that is fine with me. Sure, I will be happy and maybe brag a little, but I know not to brag too much. At this current time I must put up with capitalism, but it doesn't mean I have to like it. I prefer to have a society where we can all get what we want and need. I am not talking about everyone having a Viper, the biggest house, etc., nor am I talking about communism; I may be, but never as it was implemented. But then capitalism was never implemented correctly. Whatever happened to "Doing it for the good of everyone"? It seems everyone wants themselves to get up, while having everyone else fall over. Now tell me, do you see something wrong with this picture?? I DO!!

This has been a rant by The Reverend Helix

Re:DeCode Genetics and insurance by AntiNeutrino · 1999-11-28 19:16 · Score: 1

Isn't the whole point of insurance that it costs you money but WHEN something happens you're covered? You're paying for security. If you move more and more to charging people with a family history (of heart failure, for example), that defeats the purpose of insurance - you are charging people for the services they require, not "insurance".

Neutrino

the light at the end of the tunnel is the headlight of the oncoming train.

--
I can't even remember what it was I came here to get away from - Bob Dylan

d.net coders wanted for DNA analysis by ewanb · 1999-11-28 19:17 · Score: 3

It is clear from these postings that people would
like the client to run. If there are people with
experience in writing these sorts of d.net systems
then please drop me a note. We have the problem
for you to work on - it is just a question of
figuring out how to do it.

Drop me a mail (birney@sanger.ac.uk).

Correction by Anonymous Coward · 1999-11-28 19:19 · Score: 0

But then capitalism was never implemented correctly.

that should be:

But then communism was never implemented correctly.

Molecular Biology and BioChem for hackers by Morgaine · 1999-11-28 19:30 · Score: 2

Well, if you do decide to hold such classes then be sure to let us know. If it's anywhere near Cambridge then that means a 2-hour commute for me, but it would be well worth it -- this is an extremely important area.

I sure hope that what you have in mind is evening classes though, as otherwise you'll get just the unemployed to attend, which would be limiting.

Sounds like an excellent project!

--
"The question of whether machines can think is no more interesting than [] whether submarines can swim" - Dijkstra

Re:Molecular Biology and BioChem for hackers by ewanb · 1999-11-28 19:36 · Score: 1

join in with ensembl and help us out. You
would learn *a lot* of biology v.quickly ;)

Re:Who makes drugs now? Point 1. by dodobh · 1999-11-28 19:47 · Score: 1

The problem is not that patents are short lived, they are too long lived.
Now how about defining the time in internet years? Anybody in favour of defining the time based on the technological rate of change?

--
I can throw myself at the ground, and miss.

Decode the sequences? by heroine · 1999-11-28 19:52 · Score: 2

Never knew there was a race to decode gene sequences using computers. There is a race for low paid women to load the sequencers but the "decoding" of the sequence is not the limiting factor. You've got to be damn good to get into those labs. Harvard PhD quality.

Re:Who makes drugs now? Point 1. by lovebyte · 1999-11-28 19:55 · Score: 1

The problem is not that patents are short lived, they are too long lived.
This depends a lot on how long it takes to make a profit from a patent. Drugs are in general patented for 20 years. Since you need 12 years of research and development before putting a new drug on the market, you are left with 5 to 8 years to make back the money you spent in 12 years of R&D plus the money spent on marketing and distribution. If your patent lasts only, say, 5 years, no one will make new drugs, which is bad for everyone.

Now how about defining the time in internet years? Anybody in favour of defining the time based on the technological rate of change?
Real life works differently! Chemicals are still designed by human beings. Yes, a lot of robots are used, but just for dumb things.

--

I'll do it for cheesy poofs.

Distrib client worries: you're looking at it wrong by Morgaine · 1999-11-28 20:05 · Score: 2

The whole area of concern about clients being compromised to return incorrect results stems from the meme-setting effect of dedicated clients like rc5des, seti@home and (it seems) all others currently in existence. Their susceptibility to being cracked and reworked is entirely due to the dedicated nature of their task, as it gives nasty-minded people a visible target.

The problem would not arise if distributed clients were generic, ie. if they would do arbitrary computations on arbitrary data received from arbitrary sources. In other words, if a global distributed computing system accepted numerous different computational tasks from the public and distributed interleaved fragments of them arbitrarily to an undifferentiated pool of clients, it would no longer be possible for clients to be compromised meaningfully. (Clients would really just be maths engines, and you'd be detected pretty quick if your client made 2+2=5.)

Would there be interest in creating such a global computing system as a free software / open source project?

[Note that pretty single-task stats displays would still be available from the task sponsors' site, but that's a completely separate issue to the one of data distribution and computation.]
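
One way a generic system could notice a client that makes 2+2=5 is plain redundancy: hand the same fragment to several clients and compare what comes back. The Python sketch below is an illustration of that idea only, not a design taken from any existing project; the function name, the quorum rule and the client names are all assumptions.

```python
from collections import Counter


def accept_result(reports: dict[str, int], quorum: int = 2):
    """Decide the outcome of one work unit.

    reports maps a client id to the value that client returned.
    A value is accepted only if at least `quorum` clients agree on it
    and it is a strict majority; otherwise (None, []) is returned and
    the unit should be handed out again.
    """
    if not reports:
        return None, []
    counts = Counter(reports.values())
    value, votes = counts.most_common(1)[0]
    if votes < quorum or votes * 2 <= len(reports):
        return None, []  # no trustworthy consensus
    suspects = sorted(c for c, v in reports.items() if v != value)
    return value, suspects


# Three hypothetical clients were all asked to compute 2 + 2.
print(accept_result({"alice": 4, "bob": 4, "mallory": 5}))
# -> (4, ['mallory'])
```

The cost is that every fragment is computed more than once, so the scheme trades raw throughput for not having to trust any single client.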

--
"The question of whether machines can think is no more interesting than [] whether submarines can swim" - Dijkstra

somebody patented Brit's natl dish by Anonymous Coward · 1999-11-28 20:09 · Score: 2

Someone in Japan has applied to patent curry, of all things. If successful, the guy gets a royalty every time the Brits dig in.

And with the WTO, all other countries will have to recognize and comply with the patent.

Can you believe it? Curry, of all things. Wonder what the folks in India think of this. We've lost our minds.

It's in today's London Times.

Patenting their version of the data by archmedes5 · 1999-11-28 20:16 · Score: 1

I believe that by patenting they probably mean patenting their own version of the data. As someone mentioned in a previous Slashdot article (I forget the name), it is like a surveyor's maps: if you put a copy of that map in your report, you should at least acknowledge the source, and even inform them that you're using the maps. Information from the HGC would be free; that information could not be copyrighted or patented by someone else, because HGC got the information themselves.

--
Stupid is as stupid dies.

Publish it. by Ozzy · 1999-11-28 20:40 · Score: 1

AFAIK, once you publish an idea or invention in the public domain it becomes un-patentable. This is what the Human Genome Project is doing. Every gene is published within 48 hours of discovery.

Correct me if I'm wrong but I've seen this mentioned in other YRO and AS articles.

--
Remove the NOSPAM to spam me...

hmm by SKicker · 1999-11-28 20:42 · Score: 2

I worked for a bit as a CO-OP student in this area last summer, which is not to say I know anything about this, but.. :]

While distributed computing would probably benefit the HGP, there are a couple of points to take into consideration.

1) How secure is distributed computing? SETI and RC5 aren't really all that concerned with the integrity of the data they are getting back. They can just re-check a data block if it is a sure sign of ET or whatever. Here there will need to be a guarantee that data has not been tampered with.

2) It seemed to me that some of the tools used could do with some open source style improvement by the hacking(coding) community before throwing lots of computing power at them.

As for the patent stuff... bah!. Let the lawyers mess around with that, everyone else can concentrate on the advancement of the human race.. or something like that.

links:
Genome database
The Sanger Centre
The NCBI

Perfect Opportunity by Anonymous Coward · 1999-11-28 21:18 · Score: 0

For open source and distributed computing to showcase itself and what it exists for!

Re:Hold on. The seq's can't be patented by Anonymous Coward · 1999-11-28 21:25 · Score: 0

You may be wrong. Several companies in the US have *already* tried to patent raw sequences. It is unclear whether they will be granted patents, but unless you're a patent lawyer who has information unavailable to everyone else, you're really overstating your case.

Has anyone in aerospace tried to patent breathing? by Judah+Diament · 1999-11-28 21:29 · Score: 1

I think it has just as much potential as patenting the DNA sequence - just think - the way your lungs force air to pass over your lips may just infringe on patents related to airflow caused by jet engines and wings. Anyone ready for a cease-and-desist-breathing order? :)

How can they patent any of this? by Anonymous Coward · 1999-11-28 21:40 · Score: 0

The idea of using DNA sequences as predictors of diseases/cancer/... has already been thought of. The idea of introducing genes with specific mutations has already been thought of. EVERY technique that can be used to put this data to use has already been thought of. Therefore, any patent that covers any part of the genome must run into prior art. The Human Genome Project was the first of these projects, correct? If that is correct, then we know that they were trying to sequence this so that drugs could be made, and/or genes developed to counteract what they found there, or to enhance something that is found in the genes. Would that not immediately set precedent for prior art?

Just an Observation here but... by Anonymous Coward · 1999-11-28 21:55 · Score: 0

Like the title says, this is just an observation, but how is it that any company could patent an application from the information they discern in decoding our DNA?

It seems only obvious that no one should be able to patent the DNA sequences or chromosomes themselves, but what about applications arising from said decoding?

If any of these applications are medical in nature ( i.e. the treatment of disease, gene based therapies, etc ) are they not banned from being patented given the first section of the patent law which excludes Medical Therapeutics, Surgeries and diagnostic processes?

It seems that many of the currently patented medical "applications" should lose their patents, since they were approved by claiming to patent only "mechanisms", which in effect allows the patent holder to "patent" a therapeutic application.

That being said... is not the patenting of drugs by pharmaceutical companies a similar thing? Are medicines not therapeutic remedies?

Just some thoughts

Jeff

False results (irrelevant) and feasibility (???) by jabbo · 1999-11-28 22:49 · Score: 2

Unless someone has the time and money to distribute microarrays and bench time at a local hospital with a good clinical lab, the clients would be worthless. Venter's efforts are succeeding because of Celera's partnership with Perkin-Elmer.

However, Celera appears to be less than picky about the quality of data they are producing. So the same approach as theirs (multiple shotgun sequencing runs for each block of base pairs) with parity checking and/or some means of verifying data would be fine.

Celera's operation is effectively a distributed effort already, it just happens to be in one building. The government will most likely step in and appropriate the sequences for a reasonable fee if it turns out that Venter et al. have reneged on their promises to distribute the sequences freely.
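
The "multiple shotgun sequencing runs ... with some means of verifying data" idea can be sketched as a per-position majority vote over reads that cover the same block. This Python fragment is only an illustration: it assumes the reads are already aligned and padded to equal length with '-', and the function name and example reads are invented. It is not a description of what Celera actually runs.

```python
from collections import Counter


def consensus(aligned_reads: list[str]) -> str:
    """Column-wise majority vote over pre-aligned reads of equal length.

    '-' marks positions a read does not cover. 'N' is emitted where a
    column has no coverage or the top two bases are tied.
    """
    if len({len(r) for r in aligned_reads}) > 1:
        raise ValueError("reads must be padded to the same length")
    out = []
    for column in zip(*aligned_reads):
        counts = Counter(base for base in column if base != "-")
        ranked = counts.most_common(2)
        if not ranked or (len(ranked) == 2 and ranked[0][1] == ranked[1][1]):
            out.append("N")  # cannot decide from the evidence
        else:
            out.append(ranked[0][0])
    return "".join(out)


# Three hypothetical reads over the same block; the second has one miscall.
reads = [
    "ACGTAC--",
    "ACGAACGT",
    "--GTACGT",
]
print(consensus(reads))
# -> ACGTACGT
```

With only two reads over a position a disagreement cannot be resolved, which is one reason deeper coverage matters for data quality.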

--
Remember that what's inside of you doesn't matter because nobody can see it.

genome mailing list by Anonymous Coward · 1999-11-28 22:54 · Score: 0

i just set up a genome mailing list on HERT (we might rename it Human Emergency Response Team), so we can continue this discussion there and maybe find a way to do something. i can offer web space and cvs space and eventually i can buy 30 gig of hd if we need that too. and btw HERT is non profit.

Do it!!! by DaPhreaker · 1999-11-28 23:01 · Score: 1

Hell I got 6 servers and about 150 workstations available. I would be more than willing to put this on a small chunk of my network. It would be better than SETI.

--
root@localbrain root>ps ax |grep thoughtd ............. 12156 ? S thoughtd root@localbrain root

Software patents by rob_from_ca · 1999-11-28 23:12 · Score: 1

It's crazy what they'll grant a patent on. Does anyone remember the patent Compton's multimedia had? The one something like "Information stored and indexed on a CD-ROM"? They actually had that patented, but kept it pretty underground until CD-ROM encyclopedias really took off, then announced it as some sort of trump card and implied they were going to start charging royalties to every CD-ROM manufacturer. Seems there's a whole bunch of really silly, generic patents these days. I wonder if I, as an individual, can patent some fun ideas. Like "sneaking out the backdoor using yardwork as an excuse to avoid your mother-in-law." Or maybe "accessing informational databases over a computer network." Wouldn't that be wonderful?

The real fix is for the US to stop issuing these crazy patents, however as I recall the courts have done some good when it comes to sanity checking. I think Compton never tried anything because their lawyers decided no judge would let it fly in court. It makes for a nice second line of defense.

What may end up happening... by otis+wildflower · 1999-11-28 23:22 · Score: 2

... is something along the lines of Lexis/Nexis and the law. Nobody can copyright the law, but the indices and commentaries can be copyrighted, and (whether we like it or not) currently the search algorithms to manage those indices can be patented.

So, to carry it over to genetics, the underlying genes (law) cannot be copyrighted (and this is ambiguous still: is it code (copyright) or algorithm (patent)?), but the indices and commentaries on the sequences can be copyrighted, and the search/combination techniques and/or machinery can be patented.

So, we need to make clear and loud the mandate that:

the human genome itself constitutes information that is in the public domain (or, at worst, the property of the person(s) who contributed the gene sample(s))
that while indices and commentaries on the genetic code may be proprietary in a society that protects proprietary intellectual property, equally protected is the privilege of the people to compile a separate set of indices and commentaries, at public expense, of the same public-domain information, or to license (or acquire by legal means) . Assuming, of course, that any research funded by public money is released into the public domain.

We need to define the problem fairly and completely, then fight strenuously to make sure that bad precedent is not set.
Your Working Boy,

Human Genome Project is Patenting too!!! by Anonymous Coward · 1999-11-28 23:29 · Score: 0

People have this stupid idea that it is the corporations VS. the human genome project and if we help the HGP then our genes (aka souls to some) will be saved. Well here is a shocker...you will be helping the government to patent our genes. If you go look you will notice that the HGP has many many patents. Also the government has been known to approach these genomic companies to work as consultants. This is not a government conspiracy theory. I am merely trying to inform you that this is not Microsoft vs. Open source.

Secure Communications by Hallow · 1999-11-29 00:00 · Score: 1

Well, if the packets were tunneled through SSH (built into the client), there wouldn't be much of an issue of data theft now would there?

DC project not needed by bonabo · 1999-11-29 00:07 · Score: 1

The huge processor time needed to assemble the Celera sequence is Celera's problem. It comes about because of how the company decided to sequence: by cutting all the human DNA (3e9 base pairs, or bps) into tiny bits and sequencing ~500-1000 bps of each bit. Celera owns all this sequence, and no doubt has the computing resources to do at least a poor assembly of the mass of sequence data they'll generate.

A more sensible approach to sequencing is being used by the Human Genome Project. DNA clones 10^5 bps in size are cut into small bits and 500-1000 bps are sequenced from each piece. Thus the assembly is of a 10^5 bp clone. Overlapping clones are sequenced to generate larger segments of sequence.

Celera is skimming the human genome. They'll generate a bunch of raw sequence, assemble what they can, patent everything that looks appealing, and then, with 40-70% done, declare the sequencing complete and close up shop. The Human Genome Project will be years finishing things up.

Jim Lund
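
To put numbers on the difference between the two strategies, here is a back-of-the-envelope Python sketch. It assumes reads land uniformly at random, so that at coverage c the expected fraction of bases hit at least once is 1 - e^(-c), and it ignores repeats and cloning bias. The coverage values and the function name are assumptions for illustration, and the clone size is read as roughly 100,000 bp.

```python
import math


def shotgun_estimate(target_bp: float, read_bp: float, coverage: float):
    """Return (reads needed, expected fraction of bases covered at least once).

    Assumes uniformly random read placement (a Poisson model)."""
    reads = math.ceil(coverage * target_bp / read_bp)
    covered = 1.0 - math.exp(-coverage)
    return reads, covered


# Whole genome in one go versus one clone at a time, with 500 bp reads.
for label, size in [("whole genome (3e9 bp)", 3e9), ("single clone (1e5 bp)", 1e5)]:
    for c in (1, 5, 10):
        n, f = shotgun_estimate(size, 500, c)
        print(f"{label}: {c}x coverage -> {n:,} reads, ~{f:.1%} of bases covered")
```

Under these assumptions tenfold coverage of the whole genome means 60 million reads to compare against each other in one assembly, while a single clone needs about 2,000, which is the scale gap the post is pointing at.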

This is a real chance for Linux by Anonymous Coward · 1999-11-29 00:20 · Score: 0

You want to put Linux and Unix in general in the public eye? Stuff like this is the ticket. We map the human genome and publish it, it becomes a VERY public positive pro-open source example. You gotta remember that this issue is very hot at the moment, cuz mom and pop Kettle don't understand what Microsoft's buggy internal code has to do with them, but they SURELY understand what a corporation patenting their flesh and blood does. I heard a debate on NPR recently in St. Louis where I actually listened to a biotech lawyer try to convince people that this was a good thing because the so-called subsidization of said firm's investment capital for research justified further research to 'help us find cures' (hehe it's always for the children ain't it?). They're trying to patent anything they can because the higher prices they can demand for cures and treatments affecting patented genes and cell structures (and perhaps even organs in the future) have led insurance companies to deny treatments and even falsely mislead patients to believe that the treatments are ineffective or do not exist. This is downright WRONG. So is the concept that a company can patent an IDEA about a cure or treatment even though it doesn't exist except in theory in order to justify capitalizing research. People understand personal health issues, and making a public statement against this type of thing in the name of the geek community at large could have much mileage towards common acceptance... I mean CNN is a GREAT place to win the love of the common man IMHO.

-Wanrat

That's a misleading stmt. by Tim · 1999-11-29 01:24 · Score: 1

The sentiment expressed on the above post ("you can't patent sequences") is alarmingly common on /.

Of course, it always gets a lot of attention because:

a) we like to hear it, and
b) it's grounded in truth.

But "grounded in truth" does not equate to "true."

Yes, it's true that you can't strictly patent a gene sequence. So what? You _can_ patent a gene's applications, current or future, discovered or undiscovered. Period. And you don't even need the whole gene sequence to do it.

So....unless your only use for a gene sequence is to make a pretty picture out of it, this means that gene sequences are de facto patentable. You simply can't use them for anything once they're patented (besides, perhaps, abstract art).

--
Let's try not to let fact interfere with our speculation here, OK?

End Nanotech Now! by Anonymous Coward · 1999-11-29 02:06 · Score: 0

One word- nanolawyers.

I remember the good 'ol days when all we had to worry about was death by nuclear fire...

Contigs Schmontigs by Anonymous Coward · 1999-11-29 02:56 · Score: 0

I think just about every biologist out there would agree that assembly and alignment are not the limiting steps in this, but getting the sequence out of the sequencers is. But the intractable problem, on which I have seen very little progress, is going from the flat, genomic data to the folded three-dimensional structure of the protein. This might be a place where a massively distributed energy minimization application might actually work: grabbing known x-ray structures and flat sequence data to postulate hypothetical structures for unknown protein-coding open reading frames. So far, however, none of the models I have heard of have even come close to predicting the structure of a modest-size protein. It is, as I understand it, a hellishly complicated multiple-body problem.

Re:Contigs Schmontigs by elizabeth · 1999-11-29 08:23 · Score: 1

Exactly. That is a cycles problem in a way that sequencing is not.

The problem is how a 1-dimensional sequence of length n, with 21 possible values at each location, becomes a 3-dimensional structure. What is needed is an algorithm to solve the problem though, not just power. Does anyone know of a genetic algorithms approach to this problem?

Right now there are some heuristics: this pattern means a DNA binding domain, this particular sequence means a transmembrane segment, etc. A stopgap, intellectually unsatisfying approach, I must say. :)

Re:Contigs Schmontigs by UWCM · 1999-11-29 19:03 · Score: 1

I think that the best usage for distributed computing in connection to the Genome Project would be a threading analysis:

Predict the coding sequences in a given portion of the genome and then thread them across a representative selection of the PDB, checking which structures represent probable proteins.

The problem here is that the amount of data to download would be rather significant, so we might not get a lot of participants.

Patents by cabr1to · 1999-11-29 03:06 · Score: 1

Companies specializing in DNA sequencing have applied for patents on hundreds of thousands of sequences, including genes and gene fragments. PTO examines all sequence applications for fulfillment of four major patenting criteria: novelty, nonobviousness, usefulness, and enablement (i.e., detailed enough to enable one skilled in the field to use it for the stated purpose).

Human Genome News, July-September 1996; 8:(1)

I am a student at Gustavus Adolphus College (St. Peter MN) and recently attended the Nobel Conference held here annually. This year's topic was Genetics--the opening speaker was Dr. J. Craig Venter from The Institute for Genomic Research, Rockville, Md., also the president of Celera, Inc. It would seem that many /.ers have misunderstood the purpose of patenting gene sequences--a patent on a sequence itself is possible only if the researchers can prove some distinguishing characteristic or application that sets it apart from other sequences. Given that 99.9% of the human genome is identical from one person to the next, it would prove very unlikely that the gene for a trait could be isolated; since many factors contribute to heredity and one sequence can affect a large number of biological variables, patenting the use of any one sequence would be useless in application.

--
---------- ...What I lack in modesty I make up for in everything else. ----------

Re:HGP; also, digital-minds; also, apocalypse? by Anonymous Coward · 1999-11-29 03:46 · Score: 0

> This is slightly off-topic, but has anyone else heard about this "Soul Catcher" project, which I think is based mainly in the UK? (Based on the concept of recording an entire human consciousness to a traditional physical medium, if I remember correctly.)

Ever seen Max Headroom? I think we're still a long way from a dynamic system capable of mapping the intricacies of human thought accurately, let alone mapping an entire consciousness. But it could be closer than I think. I know I'm not ready for it yet in any case, though I think the idea is feasible. I'm sure this is a sensitive subject for many people-- that we can be broken down into so many pieces. Still, this has been a fundamental task of the sciences since Aristotle, and it of course has real-world consequences. All this concern about the mishandling of genetic knowledge is the latest strain of a recurring human condition: people want power. And you thought mustard gas and anthrax were scary; imagine what it'll be like in another twenty years or so when science has had time to study our genetic makeup in some depth [cue thunder]. And people cringe at cloning. 'Tis the season for tales of the coming Apocalypse. By comparison, I could give two egg-rolls if companies want to compete over patent rights.

- Shrift

"It looks like Edison...."

Re:Money (someone missed the concept here!) by Anonymous Coward · 1999-11-29 03:54 · Score: 0

Heeeeelloooo....

The whole idea was to prevent the
capitalistic mega corps from winning this race
and you want money ?

Did you just miss the whole concept here?

And just as a side note, money is NOT the most
important thing...

Re:warm and fuzzy, the crash course. Idea: by Olof+the+Hopeful · 1999-11-29 04:51 · Score: 1

Get in front of a digital video camera and do your crash course. Convert and post the result to a heavy duty server as quicktime or realaudio, etc., then announce it on slashdot and see what happens.

An Apple G4 with lots of memory and Final Cut Pro, plus a Canon XL with Firewire link thereto should be possible for less than USD8k. That and some talent should do it. Apply to Wellcome for funds?

the idea of playing with genes is sickening by Anonymous Coward · 1999-11-29 05:22 · Score: 0

i think people have gone too far with this one. they don't even know what they're getting into. i think a lot more needs to be thought about before people start making spare body parts and recreating themselves.

Anti-patent patent by Anonymous Coward · 1999-11-29 05:36 · Score: 0

I think the cool thing to do would be to beat
them to a whole bunch of major useful
"gene patents", then patent them, and license them
as:

"Any company can freely use this, provided that
the company, all subsidiaries, and all parent
companies, hold no genetic patents".

If you hit really really useful ones, then other
businesses would be able to destroy the ones
that hold other less useful patents.

I think I'll patent this business method, and
call it "The microsloth method of business
negotiation" :-)

Excellent idea by Anonymous Coward · 1999-11-29 06:28 · Score: 0

I think this would be a great use of spare compute cycles everywhere. I am way up on the distributed.net site, but I would switch everything over to this.

Good Idea! Re:Contigs Schmontigs by Anonymous Coward · 1999-11-29 06:36 · Score: 0

Distributed computing is not very applicable to human genome sequencing while de novo structure prediction could need such power in the future.

It appears that computer professionals have as much clue about DNA sequencing and "gene patenting" as biologists have about DNS services and Open source licensing.

FUD Re:That's a misleading stmt. by Anonymous Coward · 1999-11-29 06:54 · Score: 0

Tim said:
>You _can_ patent a gene's applications, current
>or future, discovered or undiscovered. Period...
>You simply can't use them for anything once
>they're patented...

Patent undiscovered applications?! Can you give any examples to support your statements? If you patent a SNP for diagnostic purposes how would that prevent others from, say, marketing anti-sense therapeutics based on the same sequence? Cite a gene-based patent that covers undiscovered future applications - bet you can't.
FUD is bad.

FUD _is_ bad. Good thing this isn't it. by Tim · 1999-11-29 09:13 · Score: 1

Gene patents protect use of a patented gene product in any manner. Just like you can't use a patented oil formula (or whatever) in your new product without licensing it from the patent holder, you can't use a patented gene product without licensing it from the patent holder. This isn't unusual.

Taq DNA polymerase is an example, off the top of my head (but not the best example, I'm sure). To have it, you must be a licensed vendor, or you must buy it from a licensed vendor. You can't legally express the stuff, even if you're not doing PCR with it.

I will note, however, that your example is different, because you're talking about patenting a *sequence*, which is not allowed. This is the truth I alluded to in my earlier post. If that sequence codes for a gene, then your patent covers the polypeptide coded by that gene, not the sequence of the gene itself. Thus, I don't believe one would be restricted from marketing an anti-sense therapeutic based on the sequence. The therapeutic does not make use of the gene, so the patent is not violated.

So perhaps I'll be more exact with my wording:

You _can_ patent a gene product, and that patent will apply to all applications, current or future, discovered or undiscovered.

--
Let's try not to let fact interfere with our speculation here, OK?

Private Information by Anonymous Coward · 1999-11-29 11:42 · Score: 0

Thoughts in my head, dreams I have, and my DNA are things that I own as an individual, and it is unlawful for a company to sell information about me without my consent. This may sound like an ultra-conservative position, because it is.

Gnome by Anonymous Coward · 1999-11-29 13:34 · Score: 0

Oh my god they wanna patent gnome?

DCing the Human Genome by ITShaman · 1999-11-29 20:03 · Score: 1

I give this a serious yes vote. Last thing I want to see are my (or my children's) genes patented for Big Pharm. I'll be first in line for a Linux or a Win95/98 client.

--
I can no longer read Dilbert. It's too depressing, because it is too real. -- Hyperhaplo

Open Source Genome Projects - Phylogeny? by Anonymous Coward · 1999-11-30 19:25 · Score: 0

Does anybody know of some kind of distributed computing project on phylogeny? Phylogenetic reconstruction is something that requires quite a lot of CPU time. Basically, you have to score all trees for a certain number of species, and the number of possible trees grows on the order of N! (N! = N*(N-1)*(N-2)*...*3*2*1), where N is the number of species. There is no efficient algorithm for finding the best tree according to some criterion (the problem is NP-complete), so you cannot do a complete search for more than 20 or so species (there are 3*10^6 to 3*10^7 living "species" on earth).
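To put rough numbers on that growth, here is a minimal sketch. It assumes the trees in question are unrooted and fully bifurcating, for which the standard count is (2N-5)!! (the product of the odd numbers up to 2N-5) rather than exactly N!, though the conclusion is the same. The function name is made up for illustration.

```python
def unrooted_tree_count(n_species: int) -> int:
    """Count the distinct unrooted, fully bifurcating trees on n labelled species.

    Uses the standard (2n - 5)!! formula, which is valid for n >= 3.
    """
    if n_species < 3:
        raise ValueError("need at least 3 species")
    count = 1
    # Multiply the odd numbers 3, 5, ..., 2n - 5 (an empty product for n = 3).
    for odd in range(3, 2 * n_species - 4, 2):
        count *= odd
    return count


if __name__ == "__main__":
    for n in (5, 10, 15, 20):
        print(f"{n:2d} species: {unrooted_tree_count(n):,} possible trees")
```

Running it for 20 species gives a number with 21 digits, which is why an exhaustive search stops being practical at around that size.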

So my questions are:
1) is it possible to do some distributed computing on a problem like this?
2) (if it is possible) does anybody have information on something of this kind?
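On question 1, here is one possible way the search space could be carved into independent work units. This is only a sketch under stated assumptions, not a description of any existing project. It assumes unrooted bifurcating trees built by stepwise addition: start from the single 3-species tree, and attach species k (for k = 4..N) to one of the 2k-5 branches of the tree built so far. A work unit is then a fixed list of branch choices for the first few species, and a client would enumerate and score everything below that prefix. The names `work_units` and `trees_in_unit` are hypothetical, and the actual tree scoring is left out.

```python
from itertools import product


def work_units(prefix_species: int):
    """Yield one tuple of branch choices per work unit.

    Each tuple fixes where species 4..prefix_species are attached;
    species k can be attached to any of 2k - 5 branches.
    """
    choices = [range(2 * k - 5) for k in range(4, prefix_species + 1)]
    return product(*choices)


def trees_in_unit(n_species: int, prefix_species: int) -> int:
    """Number of complete trees a client must score for a single work unit."""
    size = 1
    # The remaining species prefix_species+1..n are still free to go anywhere.
    for k in range(prefix_species + 1, n_species + 1):
        size *= 2 * k - 5
    return size


if __name__ == "__main__":
    n, prefix = 12, 7
    units = list(work_units(prefix))
    print(f"{len(units)} work units, {trees_in_unit(n, prefix):,} trees each")
```

Because the units do not overlap and need no communication, a server would only have to collect the best score from each unit. That does nothing about the factorial growth itself, so it stretches an exhaustive search by a few species at most.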

And no, you cannot patent sequences.

a comment about "patents are short lived" by klode · 1999-11-30 21:54 · Score: 1

[I use many qualifiers in this response, because I can't provide a reference to the original data; it's possible that this information is the product of fevered dreams caused by a late dinner of spicy food. Hopefully some /. reader will be able to point everyone to some concrete data.]

I hope you're aware that the lifetime of patents could change.

As I understand it, patents exist to protect intellectual property. Again, as I understand it, copyrights perform a similar function: they give some entity ownership of the product of their (or someone else's) mind, and that ownership expires after some period.

I seem to recall that in some countries, the lifetime of some (all?) copyrights was extended recently, by many years, and that the change had a grandfather clause: existing copyrighted properties had their copyrights extended. Coincidentally, that extension protects some very lucrative properties, for example the Mickey Mouse cartoon character.

I don't want to sound like a wild-eyed zealot, but if existing copyrights can be extended -- largely for the benefit of large/multi-national corporations -- can we rule out the possibility that patents might be too? If not, then saying "don't worry about them, they'll expire soon enough" is naïve.

Worked at HGP by thej0ker · 1999-12-01 04:04 · Score: 1

I agree that perhaps a distributed system might help, but the time lag in the actual biology of the process allows plenty of time. At Berkeley Labs, there was plenty of time for the computers to crank through the algorithms. The hard part was getting the cells to divide, and to grow, and to cut the DNA at the right length, so the laser readers could analyze the contents. The lasers didn't do the cutting; in actuality, it was enzymes, and there is no real way of speeding those things up. When I left Berkeley, they were implementing robotic procedures for faster throughput, but, once again, the limiting factor wasn't the analysis, but the biology.

patenting of sequences/secrecy/ by genethics · 1999-12-02 05:06 · Score: 1

Open computing won't work because the commercial entities routinely refuse to share the information they have developed until they have had time to file for the patents.

The suggestion that corporate interests are "only" patenting the sequences and not the genes is sophistry. The vast majority of genes will have only one or two "uses": detection, alteration and countermeasures. Under the current patent system, he who sequences, for all intents and purposes, "owns" that gene. Nobody else can use any detection, alteration or counter system using the gene or protein sequence patented without paying royalties.

For the vast majority of disease-causing genes, any kind of gene therapy or other cure is decades away. Meanwhile, we are stuck with the ability to predict without the power to do anything. The genetic information about people which is being collected is by far the most intimate and dangerous information about individual humans that has ever been collected.

Then.. by BedPanDan · 1999-12-04 08:25 · Score: 1

Then you die. It's a process called natural selection, and it's quite natural. It happens to animals all the time, yet with people, everyone MUST be guaranteed a good life. Perhaps it's time to stop diluting the gene pool, allowing those who probably should die to live and procreate. Who knows? Maybe the downfall of the human race will be caused by our incessant need to tamper with life.....

This is not flamebait, nor a troll. Don't bother commenting on my spelling/grammar... it has nothing to do with my argument....

Slashdot Mirror

146 comments