Slashdot Mirror


Decoding the Genome: Serious Infrastructure

Roland Piquepaille writes "The Wellcome Trust Sanger Institute is one of the largest genomics data centers in the world. In "The Hum and the Genome," the Scientist writes about the IT infrastructure needed to handle the avalanche of data that researchers have to analyze. With its 2,000 processors and its 300 terabytes of storage, the data center uses today about 0.75 megawatts (MW) of power at a cost of 140,000 per year (about $170K). But the data center will need more than a petabyte of storage within three years, and its yearly electricity bill will reach 500,000 (more than $600K) for about 1.4 MW, enough to power more than a thousand homes. The original article gets all the facts, but this summary contains all the essential numbers."

175 comments

  1. Imagine... by Ray+Alloc · · Score: 0, Funny

    A beowulf cluster of these... Oh, wait, never mind!

    1. Re:Imagine... by Anonymous Coward · · Score: 0

      BWAHAHA!! OMG WTF LMAO.

      lameness filter counterattack for all those caps..

    2. Re:Imagine... by deliciousmonster · · Score: 0

      That would use approximately 1.21 gigawatts of power...

      1.21 GIGAWATTS!

      --
      I have a plan. Using mainly spoons, we'll tunnel our way out of the city...
    3. Re:Imagine... by Anonymous Coward · · Score: 0

      Jumping Jiggawatts!

  2. Yea baby by DinX · · Score: 0, Offtopic

    Rrrrrr ... I love it when you talk dirty

  3. Breaking news from our reporter Roland Piquepaille by JamesD_UK · · Score: 5, Funny

    Lots of computers use lots of power which costs lots of money!

  4. Amazing! by poopdeville · · Score: 2, Funny

    The Wellcome Trust Sanger Institute is amazing it will-

    - optimize seamless communities
    - generate vertical e-services
    - everage synergistic convergence

    and best of all

    - engage e-business content Perfect solution

    --
    After all, I am strangely colored.
    1. Re:Amazing! by Anonymous Coward · · Score: 0

      with hardware like that it could probably do all of the above.... *AC's head explodes*

    2. Re:Amazing! by Anonymous Coward · · Score: 1, Insightful
      The Trust also believes that the basic DNA sequence of humans and other organisms as such should be placed in the public domain as soon as is practical, without any fees, patents, licences or limitations on use, giving free and equal access to all. Subject to this, the Trust is supportive of patents encompassing genes and their products when there is research data or information indicating that a particular DNA sequence has a utility such that the legal criteria for patenting can be met.

      Exactly the same bullshit is used by European software patent proponents when they say that "software 'as such' will not be patentable". It's such a victory for the public that the wellcome trust does not support patents on DNA sequences, just the practical application of those sequences.

      Wellcome trust, bunch of halfwits or deceptive front for big business? You decide.

  5. Decoding the gnome? by LiquidCoooled · · Score: 5, Funny

    I misread that and thought it involved a spotlight and torture methods to a poor garden gnome :(

    "You will tell us what we need to know. WHERE IS THE LAWN MOWER!"

    --
    liqbase :: faster than paper
    1. Re:Decoding the gnome? by Anonymous Coward · · Score: 1, Funny

      How about decoding KDE?

    2. Re:Decoding the gnome? by Fred_A · · Score: 1

      I misparsed the title as well and figured they had to resort to a super-computer to try and make sense of the Gnome dependencies...

      emerging gnome, package 3 of 46438847...

      --

      May contain traces of nut.
      Made from the freshest electrons.
  6. That's... interesting by Anonymous Coward · · Score: 1, Funny

    I think I'm immune to large numbers. This article summary has absolutely no effect on me whatsoever.

  7. Who owns the results? by Dancin_Santa · · Score: 5, Interesting

    The idea behind all this mapping is to find genetic sequences that can be used to mend ailing people. Using a computer to throw every single combination possible against the wall and seeing what sticks is certainly a way to go about this, but it also raises the spectre of a single large company owning all these combinations. This wouldn't be such a terrible thing if there was some sort of actual science involved, but by brute-forcing results, they are doing nothing more complicated than running a counting program with an infinite number of bits.

    So each result is directly traceable to a number. Will these companies own these numbers? Can you even take out a patent on a number? In the DeCSS case, it was argued that the decoding algorithm was protected even though some implementations of it were nothing more than a carefully crafted prime number.

    I don't like the idea of someone owning numbers any more than I think someone should be entitled to the fruits of their own work. This whole patent "creation/reward" system is getting turned on its head because of the power of computers. What would have been prohibitive even 10 or 15 years ago is possible (even easy) now. How can we keep our rights without sacrificing the progress of science and the arts?

    1. Re:Who owns the results? by Hittite+Creosote · · Score: 3, Informative

      The centre is funded by the Wellcome Trust and the UK's Medical Research Council. The Wellcome Trust Sanger Institute is a non-trading, non-profit making registered charity. And they tend to make their results open - these are the people who said that the genome should belong to no one individual or company. In other words, if you want to keep your rights without sacrificing the progress of science - we need more places like the Sanger centre.

    2. Re:Who owns the results? by Gurdy · · Score: 5, Informative

      > "it also raises the spectre of a single large company owning all these combinations."

      You might be interested to read our data release policy http://www.sanger.ac.uk/Projects/release-policy.sh tml which describes how the finished data is made publicly available, to all, no charge.

      (I work at the Sanger Centre.)

      Dave

    3. Re:Who owns the results? by dan+dan+the+dna+man · · Score: 2, Informative

      Well the nice thing about the Wellcome Trust is that they are an independent charity and the largest non-corporate non-governmental source of biomedical research funding in the UK.

      Maybe you'd like to read their constitution: here

      Sure theres a chance that things can get tied up in the hands of companies - but lets look at the human genome project. The best data came out of the academic sector, the private data (held by Celera) didn't turn out to be too profitable after all (or even better quality) and is now in the public domain. I worry about the commercialisation of science as much as the next man, but lets face it, business just doesn't care unless there's a drug to sell at the end. Data is still just data.

      --
      I don't read your sig, why do you read mine?
    4. Re:Who owns the results? by Petersson · · Score: 1
      This wouldn't be such a terrible thing if there was some sort of actual science involved, but by brute-forcing results, they are doing nothing more complicated than running a counting program with an infinite number of bits.

      This way of genome decoding is much more spectacular. It attracts investors. And doing it the hard way will definitely bring some results.

      But bunch of boring sciencists, writing boring equations, can also result in zero success. On the other hand it could save whole lotta money for power, hardware and time. But it's bit risky.

      And when the job is finished, this cluster will become national monument.

      --
      I'm not insane. My mother had me tested.
    5. Re:Who owns the results? by Anonymous Coward · · Score: 2, Informative

      A lot of the analysis software used is also freely availible as it most of the web display code

      http://www.ensembl.org/

      another sangerite

    6. Re:Who owns the results? by Anonymous Coward · · Score: 0
    7. Re:Who owns the results? by perdu · · Score: 1
      A lot of the analysis software used is also freely availible as it most of the web display code
      Is the browser freely available to commercial users?
      --
      You only use 2% of your DNA
    8. Re:Who owns the results? by Anonymous Coward · · Score: 0

      This wouldn't be such a terrible thing if there was some sort of actual science involved

      As one of the hundreds of lab scientists that work at the institute, i'd like to point out that there is an awful lot of actual science being done at the institute, much of it on the back of the data coming out of the infomatics teams and some of it acutally rather good.

    9. Re:Who owns the results? by sugarmice · · Score: 1

      Yes. It's GPL. Anyone can use it.

    10. Re:Who owns the results? by ckd · · Score: 1

      As noted, the Sanger, like the rest of the public HGP centers, makes their data accessible to everyone for free. You're thinking of Celera, who got out of the business.

      (Claimer: I work at another of the centers. Similar scale, but those folks at Sanger have more server room floor space...sigh.)

  8. Overkill by CleverNickedName · · Score: 0

    With its 2,000 processors and its 300 terabytes of storage, the data center uses today about 0.75 megawatts (MW) of power at a cost of 140,000 per year (about $170K)

    I just use 11 stone of meat and goop to deal with that very same data. It's been running 24 hours a day for 30 years now with no serious bugs... Yet...

    --


    Unfortunately, I am not Wil Wheaton
    1. Re:Overkill by hjo3 · · Score: 1

      > Unfortunately, I am not Wil Wheaton [slashdot.org]. Thank god. I don't think I'd want to live in a world with two of him.

    2. Re:Overkill by Ziviyr · · Score: 1

      Thank god. I don't think I'd want to live in a world with two of him.

      Really, there wouldn't be any decent ST: TNG episodes if that were the case... :-)

      --

      Someone set us up the bomb, so shine we are!
  9. Exchange Rate by squoozer · · Score: 1

    I wish I could get the submitters exchange rate. I'd be rich rich rich. It's currently around 1.9 dollars to the pound meaning anual running costs are more line $260k which could rise to around $1m.

    Having said that everything is cheaper on the US side of the pond so the submitter is probably about right. Sigh.

    --
    I used to have a better sig but it broke.
    1. Re:Exchange Rate by smithberry · · Score: 1

      I think the prices quoted are Euros.
      Currently 1 Euro = 1.23 USD, so I think the article is about right.

    2. Re:Exchange Rate by anaplasmosis · · Score: 0

      Those are Euro, not Sterling.

    3. Re:Exchange Rate by Gwyn_232 · · Score: 1

      The trouble is, the poster was too lazy to a) look up the ascii code for or b) type EUR in front of the figures (as in EUR400,000).
      Both sterling and euros are used when quoting costs in Britain.

    4. Re:Exchange Rate by Gwyn_232 · · Score: 1

      Ok, I'll revise that

      The trouble is, Slashdot is too shit to handle the Euro symbol.

    5. Re:Exchange Rate by Anonymous Coward · · Score: 0

      €, €, €.

    6. Re:Exchange Rate by Gwyn_232 · · Score: 1

      , , It may be that it can't handle if it's a euro keyboard (like mine is)

    7. Re:Exchange Rate by Anonymous Coward · · Score: 0

      The trouble is, Slashdot is too shit to handle the Euro symbol.

      Huh? You mean you can't put in a euro symbol?

    8. Re:Exchange Rate by Anonymous Coward · · Score: 2, Informative
      Ways to put the Euro symbol in webpages:
      • Hex code 0xA4 (decimal 164) in codepage 8859-15 is what you get when you press AltGr+e. This happens to be the general currency symbol in 8859-1, so it's not a good choice if you can't make sure that the document comes with the correct encoding declaration. ""
      • HTML entity € "€"
      • Unicode character reference € ""
      • Hexadecimal unicode character reference € ""
      As you can see, Slashcode filters all but the html entity, so that's your only choice here if you have to have the symbol. Most people simply use EUR.
  10. Enough to power a thousand homes by jurt1235 · · Score: 2, Interesting

    Doing some quick math here: 2000 processors+1petabyte, divide by 1000=
    2 processors + 1TB per house.
    In processors: Way past it
    In storage: Getting there (quick count of harddisks lying around= 750GB at least)

    Since my energy bill is lower, even with the hardware running 24/7/365, are they buying their energy to expensive or what?

    --

    My wife's sketchblog Blob[p]: Gastrono-me
    1. Re:Enough to power a thousand homes by Anonymous Coward · · Score: 0

      It would be cheaper and cleaner to install a few wind turbines and solar panels and reduce the energy drain and costs, and what they overgenerate they can pump back into the grid. But that requires brains, no?

    2. Re:Enough to power a thousand homes by teknofyl · · Score: 1

      Sounds good in theory, however I doubt that either the environment agencies or the local residents would support this. In the case of Wind Turbines we'd be hit by a sudden fit of NIMBY. As for solar panels, well some of the new buildings here have been designed with grass roofs in order to make them blend in with the environment. Again, I doubt we'd get permission to start decking the roofs with solar panels.

      (I work at the Sanger Institute)

    3. Re:Enough to power a thousand homes by hcdejong · · Score: 1

      The computing power density of 1000 homes is a lot lower. 1000 homes + the ground they sit on also cost more to buy/build than the one datacenter.

    4. Re:Enough to power a thousand homes by Anonymous Coward · · Score: 0

      Wave power, Geothermal, solar WINDOW panes, solar WALLS, magnetic induction from the magnetosphere :D There are plenty of non invasive green energy sources.

    5. Re:Enough to power a thousand homes by Anonymous Coward · · Score: 0

      Think about what must be massive air conditioners. Did you see the SAN consolidation diagram? Think about the live redundant mirrors and clusters to feed. Consider that the networking/SAN equipment needs juice too. Last, think about the lighting, the fridges in the break room and all the coffee machines. THINK ABOUT ALL THE COFFEE MACHINES, DAGNABBIT!

    6. Re:Enough to power a thousand homes by jurt1235 · · Score: 1

      It is more about the power consumption. It just seems to expensive. I just tried to compare it in a way which makes their claims just sound to big.

      I do not propose a shared/distributed infrastructure, especially not for the storage (if they use up my 750GB, where do I leave my own data? Offline on DVDs?)

      --

      My wife's sketchblog Blob[p]: Gastrono-me
    7. Re:Enough to power a thousand homes by Tassach · · Score: 2, Insightful
      It is more about the power consumption
      Yep.

      First off, utility companies generally charge a higher rate for business/industrial power than they do for residential power; so even if all things were equal, they'd still be paying more per KW/H than you.

      Secondly, you can't compare a couple of desktop machines running in a home office to a datacenter with multiple fully-populated 72U racks. Running 2 or 3 computers in a 120 ft^2 room isn't going to require any additional cooling. Running 2000 mahines in (say) a 1000 ft^2 data center is going to require heavy-duty air conditioning. Finally, remember that enterprise-grade hardware generally has redundant power supplies, 15000K rpm SCSI disks, and more powerful fans -- all of which draw more power (and throw off more heat) than a typical desktop system.

      --
      Why is it that the proponents of "one nation under God" are so eager to get rid of "liberty and justice for all"?
    8. Re:Enough to power a thousand homes by ciroknight · · Score: 1

      It's simple really. They have to power more stuff than your house does.

      Whereas one computer doesn't really produce enough heat to cause a problem in the house (well, dependingly...), 2000 do. This requires an inbuilding airconditioning system to vent the heat, which adds a LOT to the energy bill.

      The computers themselves are usually a small load when it comes to the utilites of the building. Oh, of course there are monitors, and things you'd find in an ordinary home (well, probably microwaves, coffee pots, crew comforts, florescent lighting..).

      --
      "Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
    9. Re:Enough to power a thousand homes by stilwebm · · Score: 1


      First off, utility companies generally charge a higher rate for business/industrial power than they do for residential power; so even if all things were equal, they'd still be paying more per KW/H than you.


      Actually you that is backwards. Residential power is usually more expensive. Think of it as buying in volume. Additionally, some businesses and many industrial power users negotiate lower rates with the stipulation that in case of a certain stage of power consumption/power shortage, their power will be cut. Many large power consumers, such as automotive plants own their own transformers and receive power directly from the wholesaler (say, TVA) rather than the local distributor.

      For example, compare rates for my area:
      Business
      Residential

    10. Re:Enough to power a thousand homes by BranMan · · Score: 1

      You have to add in the 'other' stuff too - network boxes, firewalls, routers, switches, etc. Also lights and AC for it all in one place. Not all the processors in the houses will be top of the line either, which adds to the AC bill. Maybe factor in redundancy in the network too - I wouldn't want a SPOF in an array that large. Stuff adds up...

    11. Re:Enough to power a thousand homes by jurt1235 · · Score: 1

      The stuff in my house adds up too:
      My wife (means the average temperature in winter needs to be 5 degrees Celcius higher than without a woman
      TV set
      Of course lights
      Fridge
      Cooking on electricity (>2000 Watt for about 30 to 40 minutes a day on average)

      24 hour airco is pretty destructive on the energy bill for these datacenters apparently, but looking at the outside temperature, it might not be needed all the time.
      I still think they can get some more efficiency.
      Spend a few days of CPU time on that, and you can buy yourself some extra machines from just the savings on the energy bill.

      --

      My wife's sketchblog Blob[p]: Gastrono-me
    12. Re:Enough to power a thousand homes by teknofyl · · Score: 1

      24 hour airco is pretty destructive on the energy bill for these datacenters apparently, but looking at the outside temperature, it might not be needed all the time.

      No matter what the temperature outside, without the aircon we have about a half hour before we have to start shutting kit down. Any longer than that and the temperature gets high enough to start causing damage to disks and other components.

      (I work at the Sanger Institute)

  11. Windows by Elshar · · Score: 2, Funny

    They must be using Windows ClusterFun edition.

    1. Re:Windows by Anonymous Coward · · Score: 0

      I hear you can't open more than 256 instances of solitaire on it.

    2. Re:Windows by gstoddart · · Score: 3, Funny
      They must be using Windows ClusterFun edition.

      I think you mis-spelled that. :-P
      --
      Lost at C:>. Found at C.
  12. Re:Roland by Anonymous Coward · · Score: 0

    He gets a shitload of ad revenue from slashdot due to him linking to his own site in every fucking story he posts here. And like by magic, every fucking story he posts is accepted by the "editors".

  13. Big computers = big power by goneutt · · Score: 4, Insightful

    TANSTAFL. This post seems drawn into the spinning power meter dials and not caring about what the computer is. If you want a lot of power, you need a lot of power. Chip scale efficiency could reduce their bill, but its a research foundation crunching numbers all day. If they need more money they just ask their contributors politly.
    How's this stack up with google's server farm bill.

    --
    Bacardi + slashdot = negative karma.
    1. Re:Big computers = big power by Anonymous Coward · · Score: 0

      This post seems drawn into the spinning power meter dials and not caring about what the computer is.

      No, it's used to draw more hits to our "good friend" Roland's website.

  14. Re:Roland by eclectro · · Score: 5, Funny

    What's the deal wiht this roland guy

    They're trying to decode his genome to find the missing link.

    Which will lead to his website, of course.

    --
    Take the cheese to sickbay, the doctor should see it as soon as possible - B'Elanna Torres, "Learning Curve"
  15. have they heard of the petabox? by itsme · · Score: 5, Interesting

    http://www.archive.org/web/petabox.php

    it uses only 60kW for 1 Peta byte

    1. Re:have they heard of the petabox? by Lucractius · · Score: 1

      Certainly a very impressive little rack they have there.

      Shame if ive got a petabyte id rather have my system running a real os

      VMS forever!!!

      Hows them uptime records going eh?

      --
      XML - A clever joke would be here if /. didn't mangle tag brackets.
    2. Re:have they heard of the petabox? by goneutt · · Score: 1

      What structure do they use do manage the data?

      Petafiles

      If you didn't laugh, say it out loud.

      --
      Bacardi + slashdot = negative karma.
    3. Re:have they heard of the petabox? by Anonymous Coward · · Score: 0

      Still not funny.

  16. Whats with the emphasis on power and its costs? by manavendra · · Score: 3, Insightful

    What about the costs of scaling and maintaining such an infrastructure? The routine administrative tasks, reporting, etc? The costs for someone actually looking at the generated results to see if they are meaningful at all, and if it is all going in the right direction?

    --
    http://efil.blogspot.com/
    1. Re:Whats with the emphasis on power and its costs? by jonoton · · Score: 1

      You're quite right - the costs of the science far outway the costs of the IT.

      Within the IT budget the cost of maintaining the equipment & running it (including power & cooling) far outstrip the original purchase price.

      [I work here too]

  17. Re:Roland by Anonymous Coward · · Score: 0

    DO NOT CLICK ON THE SUMMARY.

    It leads to primidi.com, which is his 'blog'. If nobody goes to his site, he gets no money.

  18. Re:Roland by Anonymous Coward · · Score: 0

    MOD PARENT UP!!!
    I'm still in stiches

  19. Math by Alphanos · · Score: 4, Interesting

    Cost of 0.75 MW: ~$170K
    $/MW: ~$227K

    Cost of 1.4 MW: >$600K
    $/MW: >$429K

    Why the difference?

    --
    Alphanos
    1. Re:Math by Hittite+Creosote · · Score: 1

      Part of it is probably inflation, but personally I'm hoping my electricity bill isn't going to double in the next three years...

    2. Re:Math by Walkiry · · Score: 2, Interesting

      >Why the difference?

      Presumably, the infrastructure to get 1.4 MW safely inside the same building and distribute it is more complcated and expensive than what two independent .75 MW would be. Things tend to go down in price when you buy in bulk, until you reach a point where the amount you're asking for is giving more trouble than what is usually dealt with.

      --
      ---- Take the Space Quiz!
    3. Re:Math by Average_Joe_Sixpack · · Score: 1

      Maybe it's a cleaner grade of electron ... you know the stuff audiophiles use

    4. Re:Math by Renraku · · Score: 2, Insightful

      Diminishing returns.

      You've gotta have a lot of infrastructure outside the facility to be able to support 1.4MW. Infrastructure that is probably taken care of by the power company, for a fee.

      And the more power you push down the line, the more power that is lost to the environment. Especially if you're overcharging the lines, which causes acceleration of the loss the more power you pump into them.

      --
      Job? I don't have time to get a job! Who will sit around and bitch about being broke and unemployed then?
  20. Some computers use more power and do less by Colin+Smith · · Score: 1

    I don't know which is the best in the ix86 market at the moment, possibly Via, possibly Intel, used to be Transmeta.

    It's one of the things many geeks tend not to consider when they're dreaming up their ideal ultra powerful, ultra cheap beowulf cluster. The fact that you need a megaWatts worth of power and a megaWatts worth of cooling to go along with those $400 1U high density servers running the latest 4GHz AMD CPUs. Suddenly those cheap servers don't look so cheap.

    --
    Deleted
    1. Re:Some computers use more power and do less by jordie · · Score: 2, Informative

      Good to see you've got your facts straight before you posted.

      AMD does not have a CPU running anywhere NEAR 4GHz, you're thinking of Intel.

      As far as power consumption..
      "Even the Athlon 64 X2 4800+ consumes less power than all single core 90nm Pentium 4 CPUs" - Anandtech

      For more information please see this and this

      For less power, better performance use AMD.

    2. Re:Some computers use more power and do less by ckd · · Score: 1

      If you're doing this seriously, 1U servers are too big. IBM BladeCenter: 14 dual-processor blades in a 7U chassis, 6 of which fit in a rack. The power cables look like garden hoses.

  21. Re:Breaking news from our reporter Roland Piquepai by Anonymous Coward · · Score: 0

    Right, I'm not going to this fuckwit's blog. Can anyone tell me what sort of kit they're using?

  22. Re:Roland by Anonymous Coward · · Score: 0

    How do you know every submitted article is accepted?

    We only see his ACCEPTED submissions.

  23. Have they considered? by Anonymous Coward · · Score: 0

    Chimps on bicycles...

    I look forward to the replies.

  24. Re:Breaking news from our reporter Roland Piquepai by Anonymous Coward · · Score: 1, Funny

    Quit badmouthing my man Roland! He provides a valuable service to slashdot, and he even gave me a blowjob for only $10! Roland will do ANYTHING for many, god bless his 'im.

  25. Re:Fuck Roland by frakir · · Score: 5, Informative

    Mod parent up.

    Just have a look on http://www.google.com/search?query=Roland+Piquepai lle&as_sitesearch=slashdot.org/ or search slashdot articles on roland piquepaille.

    Real whore here is Timothy. I bet he'll post an ad for your site for some change, too.

  26. Re:Breaking news from our reporter Roland Piquepai by matt+me · · Score: 2, Funny

    enough to power 1000 homes with the equivalent power of distributed computing software?

    probably not.

  27. Roland cant get a job by Anonymous Coward · · Score: 0


    not suprising really with webpages like this
    --------

    Decoding the Genome Needs Superpower

    The Wellcome Trust Sanger Institute is one of the largest genomics data centers in the world. In "The Hum and the Genome," the Scientist writes about the IT infrastructure needed to handle the avalanche of data that researchers have to analyze. With its 2,000 processors and its 300 terabytes of storage, the data center uses today about 0.75 megawatts (MW) of power at a cost of 140,000 per year (about $170K). But the data center will need more than a petabyte of storage within three years, and its yearly electricity bill will reach 500,000 (more than $600K) for about 1.4 MW, enough to power more than a thousand homes. Read more...

    Below is a small diagram showing the current IT infrastructure of the Wellcome Trust Sanger Institute, used by the Human Genome Project (Credit: Wellcome Trust Sanger Institute).

    The current IT infrastructure of the Wellcome Trust Sanger Institute

    Here is a link to a larger version of this chart.

    Now, let's look at this IT infrastructure in detail.

    * Computers
    o Today: The datacenter hosts about 2,000 Alpha processors, originally designed by Digital Equipment (DEC), before its acquisition by Compaq, and later by Hewlett-Packard (HP).
    o Tomorrow: The Sanger Institute is looking at cheaper solutions, especially now that HP has officially stopped any development on the Alpha front.
    * Storage
    o Today: Three different computer rooms have a total capacity of about 300 terabytes.
    o Tomorrow: The IT management forecasts about a petabyte within three years -- at least.
    * Databases
    o Today: There are about 40 different databases, and only two of them are in the 50 terabytes area.
    o Tomorrow: One of the databases, the Trace sequence archive currently contains about 700 million entries, and it doubles every 10 months.
    * Power bills
    o Today: The current equipment needs about 0.75 megawatts for a cost of 140,000 per year (about $170K).
    o Tomorrow: The new setup will need about 1.4 megawatts, which will raise the yearly bill to about 500,000 (about $615K today).

    The supercomputer vendors can say all they want about diminishing costs. But they almost never talk about the power bills...

    Sources: Stuart Blackman, The Scientist, Volume 19, Issue 11, Page 15, June 6, 2005; and various websites

  28. Yeah, but... by Anonymous Coward · · Score: 0

    ...do they run on Linux??

    *ducks*

  29. Units by Hank+Chinaski · · Score: 2, Insightful

    They use Megawatts as a measurement of energy consumption? Should't that be Megawatt/hour ? P.S.: Dont click the link. Editors could at least include as "Signup required" warning.

    --
    IAAL
    1. Re:Units by Gwyn_232 · · Score: 1

      No, MW is the correct way to express it.

    2. Re:Units by Anonymous Coward · · Score: 1, Insightful

      They use Megawatts as a measurement of energy consumption? Should't that be Megawatt/hour ?

      No. Didn't you pay attention in high school? A megawatt is a unit of power. Power is energy divided by time. A watt is one Joule per second. A joule is a unit of energy.

      So, watts means joules per second. When you get your household electric bill it is in kilowatt-hours, which is the number of watts multiplied (not divided) by the time you consumed that many megawatts.

      So, since a watt is energy/time, a kilowatt-hour is energy/time x time, which is a unit of energy.

      And while household energy use is normally billed per kilowatt-hour (energy used) when you get to be a large commercial customer, you are billed not only for kilowatt-hours (energy used), but for the maximum kilowatts you draw.

      Example: when a factory starts up at 9 am, it takes a lot of power to start up the all the machinery. To keep the machinery running takes much less power. Drawing all that power from the grid at 9 am will create a big spike and possibly lead to brownouts. To compensate & prevent that big spike isn't cheap, so you're billed for max. power.

    3. Re:Units by Hank+Chinaski · · Score: 1

      mod parent insightful.

      --
      IAAL
    4. Re:Units by tempest69 · · Score: 1
      Megawatts is correct, it the amount of energy that is being pulled continuously. So if you want to multiply the MW*24*365.25 you can get a total MWH/ YEAR, but it's a little silly.

      Saying we use 75 MW Hours/Hour seems a bit retarded.

      Storm

  30. Alpha! by Pegasus · · Score: 1

    It seems all their boxen are based on Alpha processors. Why? Simply, because even today, you can get the most flops per clock tick out of Alpha. It's a shame such a wonderful architecture was burried.

    Anyway, I think I'll be the first in line when they deceide to retire their gs320 servers :)

    1. Re:Alpha! by Anonymous Coward · · Score: 0

      Like Gurdy, I work at the Sanger Institute as a sysadmin, in particular I work in the group which maintains the largest cluster. And it isn't Alpha.

      Alphas were used in the early days, because of their fully 64-bit nature, and it was known that to represent the entire genome in a single file requires 3GB, and at the time few OS's had 64-bit filesystems, let alone 64-bit processors.

      We continued to buy them as server machines, but the compute farm which produces http://www.ensembl.org/ amongst other things, has not had any Alpha CPUs added to it since 2001.

      We got into X86-based blade servers in 2002, and currently have well over a thousand of them, mostly running Debian GNU/Linux.

    2. Re:Alpha! by grouse · · Score: 1

      I use the compute farm the AC refers to and I can confirm that the vast majority of it consists of x86 blades.

    3. Re:Alpha! by rubycodez · · Score: 1

      not any more: the Itanium2 is 2 times as fast by that metric. Alpha @ 833MHz: SPECfp2000 644 SPECfp_base2000 571. Itanium2 @ 1.6 GHz: SPECfp2000 2675 SPECfp_base2000 2675. Alpha was cool in its day, some of those ideas went into Itanium. Itanium will "flop" in market for same reasons 8D

  31. Genome - the dog chasing its tail? by Wayne247 · · Score: 3, Insightful

    The interesting bit about genome research is that suppose we do find what the human genetic code all means. We can then start treatments to correct genetic problems, right? If we do so, and say we correct illness X on some kid. When this kid grows up, becomes an adult and have kids of his own, what kind of genetic heritage will he give his own kids? Will these kids inheric the original bad gene of their parent? If so, we'd be running at our lost since defects would multiply across generations...

    1. Re:Genome - the dog chasing its tail? by J.+Random+Luser · · Score: 2, Informative

      To correct the kid's kids, you need to make the correction in the gamete, before the original kid is conceived. Maybe I'm not reading enough lately, but from Huxley to Gattaca, I don't recollect anyone actually trying that method...

    2. Re:Genome - the dog chasing its tail? by mikael · · Score: 1

      Every person has about 7 major genetic defects in their DNA, which only become apparent when two parents with the same defective gene have children.

      Dealing with genetic diseases relies on three stages:

      1. Identify which genes cause the problem and how they are passed on through the generations; whether they are dominant or submissive.

      2. Create a test which determines which genes each parent has. From this information, it is possible to determine whether the disease will be passed on to their children.

      If there is a risk, then they can undergo further genetic screening to prevent the gene from being passed onto their children.

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
  32. I for one welcome our new Wellcome overlords! by Anonymous Coward · · Score: 0

    n/t

  33. Why not name processor? by Anonymous Coward · · Score: 0

    Why is it that the processor is not named in the articles it if is Alpha or Itanium based. If this was Power/PowerPC/Cell based, there would be a nice big IBM logo on the article.....

  34. Imagine ... by hopethisnickisnottak · · Score: 0

    Imagine a beowulf cluster of these.

    Generating enough heat to provide for the winter time needs of all europe

    --
    -Shaunak
    1. Re:Imagine ... by Anonymous Coward · · Score: 0

      Now we know the true source of global warming.
      It isn't that there are more greenhouse gasses keeping the heat in.
      It's all the overclockers pumping heat into the atmosphere faster than it can readiate away.

  35. Password by jlebrech · · Score: 1

    Dont bother decoding the thing!! ask God for the password!!

    1. Re:Password by loose_cannon_gamer · · Score: 1

      Why ask? It is obviously 42. Somebody should have talked to Him about easily cracked passwords. *prepares to dodge lightning*

      --
      In Soviet Russia, us are belong to all your base.
  36. Yes, but... by Anonymous Coward · · Score: 0

    Is he digitally signed ?

  37. Research target by readerVoice · · Score: 1

    I wonder what the target of this research is. Daily I hear news on TV about people dying of hunger in Africa and other parts of the world. Can't this money be used there? Or am I nuts to think that way.

    1. Re:Research target by Anonymous Coward · · Score: 1, Informative

      The human genome is only one of the many genomes being studied at the Institute.

      One of the organisms being actively studied at the Sanger Institute is Paramecium falciparum, the organism that causes malaria, and Anopheles gambiae, a mosquito. Study of both of these will hopefully reap huge benefits in the treatment, prevention and perhaps eventually eradication of malaria.

      The Pathogen Sequencing Unit that's doing that is also studying other major third world diseases, such as plague.

      And much of what we do is developing technology and software tools which other researchers, both commercial and academic, can apply to their various areas of disease, pharmacology and agricultural research

    2. Re:Research target by LWATCDR · · Score: 1

      So could every dime you spend on CableTV, eating out, video games, video cards, and anything else that you do not need to survive. You could also stop waisting time and get a second job and donate all that money to feed the poor. Remember that when you say "they" should do something about it... You are they.
      That rant aside this research could lead to cures for all sorts of diseases that are currently killing people. So yes you are nuts to think that way.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    3. Re:Research target by XMyth · · Score: 1

      Brand new trolling account?

    4. Re:Research target by Hognoxious · · Score: 1
      people dying of hunger in Africa and other parts of the world. Can't this money be used there?
      Yes, I'm sure $FAVOURITE_KLEPTOCRAT could find the space to squeeze another Mercedes in.
      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    5. Re:Research target by psylent · · Score: 1

      I do not remember my high school biology very well but isn't it "plasmodium falciparum"?

    6. Re:Research target by Anonymous Coward · · Score: 0

      You are correct. Apologies for brain failure.

  38. Imagine ... by Anonymous Coward · · Score: 0

    a beowolf cluster with these..

    BTW, mod parent up.. (oops, no parent)

  39. Re:Breaking news from our reporter Roland Piquepai by Anonymous Coward · · Score: 0

    FUCK OFF TIMOTHY

  40. illegal by Anonymous Coward · · Score: 0

    Hasn't anyone told them that decoding a genome is in violation of the DMCA?

  41. Oh for goodness sake by samael · · Score: 0, Offtopic

    It's an interesting story, which I wanted to see.

    The fact that the same person who submitted it also submitted a whole bunch of other stories is besides the point.

  42. Whoa by grimhammer · · Score: 1

    That remark should be sufficient here. I mean... whoa...

    --
    Physics is the universe's operating system.
  43. Cure your piquephobia by Bemmu · · Score: 1

    Do YOU hate Roland Piquepaille? It doesn't have to be so. With my scientifically proven brainwashing program, you can rid yourself of piquephobia forever!

    http://www.bemmu.com/pique/

    1. Re:Cure your piquephobia by Maian · · Score: 1

      Haha awesome game :p

  44. Big deal! by Anti+Frozt · · Score: 1

    My car requires 1.21 jigawatts and a flux-capacitor.

    --
    In C++, friends can touch each others private parts.
  45. Re:Fuck Roland by MrNonchalant · · Score: 3, Funny
    According to the Google ads the joke might be on him:

    Roland On Sale
    Low Prices, Free Shipping
    12 Months To Pay, Always In Stock
    www.SamAsh.com

    Roland in stock
    Roland sale
    up to 80% off Liquidation Sale
    www.infinitemarketplace.com


    Anyone else want to buy Roland and make him shut up?
  46. Re:Fuck Roland by jemfinch · · Score: 1, Offtopic

    I'll take stories like this (Roland and all) over the consistently boring "Here's what Apple/Microsoft/SCO/Sony/USPTO is doing today!" stories we're inundated with otherwise.

    At least this story is interesting. Why does it piss you off so much that someone makes some money off finding this story? If Roland makes some coin because he's bothered to pay attention to news sites I don't read and report interesting articles to a site I do read, by all means, more power to him! I'm glad he's doing the legwork so I don't have to.

    Jeremy

  47. Re:Roland by metlin · · Score: 2, Funny


    Awww, he's just French... =)

  48. Re:Fuck Roland by OverlordQ · · Score: 0, Offtopic

    I've suggested an option that would let users filter, but it seems to have been ignored.

    --
    Your hair look like poop, Bob! - Wanker.
  49. Compression, people, compression! by seven+of+five · · Score: 1

    Don't tell me endlessly repeated combinations of the same four base pairs needs 300 TB...

    1. Re:Compression, people, compression! by fbartho · · Score: 1
      Don't tell me endlessly repeated combinations of the same four base pairs needs 300 TB...

      I think it needs more than 300TB infact, it probably needs an infinite amount of space
      --
      Gravity Sucks
    2. Re:Compression, people, compression! by ciroknight · · Score: 1

      Compression would add to the need for a) greater data redundancy (because compression errors DO happen from time to time), b) more computational time (unless someone made a Gzip chip and stuck it on an HD controller.. *ponders*), and c) would be terribly cost-inefficient.

      HD's are a dime a dozen. CPUs are not. If you have to have more costly CPUs running your File servers, that means less costly CPUs to run your Genetic Algorithms (pardon the pun).

      --
      "Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
    3. Re:Compression, people, compression! by confusednoise · · Score: 1
      But they're storing MUCH more than just raw NT sequence -- they're storing all the annotation data around the sequence (which includes what genes are where, what papers have been written about those genes etc.) as well as a massive amount of data generated from experiments aiming at figuring out exactly what everything does and how it can be affected.

      It adds up quickly. The nucleotide sequence is just the starting point.

  50. How true by kilodelta · · Score: 1

    Friend of mine manages a cluster that models the worlds oceans. One thing they forgot about when planning it was the cooling needs. That added a nice chunk to the budget.

    I doubt they even looked at the power requirements.

    But it is cool to have access to a super computing cluster.

  51. It's not *that* special by photon317 · · Score: 1


    Those stats sound roughly comparable, if anything slightly lower, than what a private company I know of runs for seismic data processing.

    --
    11*43+456^2
  52. We can do either by cookie_cutter · · Score: 3, Informative
    Will these kids inheric the original bad gene of their parent?

    It depends. If you are doing somatic cell genetic engineering, then you only fix those cells in the patient in which the defect manifests itself, and not the germ-line cells (ie, sperm and eggs), so the 'fix' is not passed on to the next generation. If instead you modify the germ-line cells as well, then the 'fix' is passed on to the next generation.

    One of the main reasons for doing the somatic fix rather than the germ-line fix is that we're still pretty damned new to this genetic engineering thingy, so it's probably a good idea to not fuck with the genetic heritage of future generations just to cure a patient today. However, as the science and technology develops, and we gain more experience with it, our self-assuredness in our abilities will likely increase, and we'll think we know what we're doing enough to risk making 'permanant' changes to the germ-line. I put 'permanant' in quotes, because if we make genetic changes one way, we should be able to turn them back if and when we decide they are mistakes.

    1. Re:We can do either by Anonymous Coward · · Score: 0, Interesting

      Tell that to the sickle cell.

      Look, ever since humans decided that we are going to try to save any human life possible, natural selection ceased to exist. There are going to be all kinds of lovely little traits that will collect in our genome to our detriment. Now we can either decide to do one of two things:

      #1. Select for those traits, also called eugenics. We tried that out before in the 1920's.

      #2. Try to correct for those traits. It looks like we are going to try this out now.

      Now if some poor sap with a degenerative genetic disease gets saved by genetic engineering, it does not give that person the *right* to bring a human being into this world with the chance of having the same disease. As far as I am concerned he/she would have been given a gift from society, and that entitles society to make sure that he does not reproduce. If you are going to play with the cell line sterilization should accompany it.

  53. Re:Roland by tomhudson · · Score: 1
    Aggh! I clicked on the link w/o looking! It was worse than goatse.cx. I am SO ashamed :-(

    Seriously, I'm putting "127.0.0.1 primidi.com" in my hosts file TODAY.

  54. Oh well. by CAIMLAS · · Score: 1

    So much for the whole, "only as complex as a fruit fly" blurb which people use to say humans are simple creatures.

    --
    ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
  55. I haven't a clue... by Martin+Spamer · · Score: 1


    as to what they are actually doing with all this computing power.

    OK I broadly understand 'sequencing the human genome' is mapping out all the combinates of genes. There are 23 chromosomes in the human genome. That chromosomes are a pair of the genes. I understand that each gene is one of four DNA molecules called A,G,C & T. There 16 combinations of those mlecules and I can map those out with a pencil and paper, I can produce all 23 sets with desktop computing power.

    So why does it take so much computing power ?

    What are the really doing with it ?

    Why do they dum down these stories down so much ?

    Facts and figures dont make a science story/article!

    1. Re:I haven't a clue... by oneandoneis2 · · Score: 5, Insightful
      No, no!

      There are 23 chromosomes in the human genome. That chromosomes are a pair of the genes. I understand that each gene is one of four DNA molecules called A,G,C & T. There 16 combinations of those mlecules and I can map those out with a pencil and paper, I can produce all 23 sets with desktop computing power.

      There are 23 chromosomal pairs. Each half of each pair contains the same (more or less) information - you could think of it as a genetic back-up system. (Except for the XY chromosomal pair in males). At the start, one chromosome is maternal, the other is paternal. But over time, they actually swap bits around until there's a mixture.

      Each chromosome contains one immensely long strand of DNA, a double-helix. This double helix is NOT redundant, only one of the two strands contains genetic information: The other strand is only there to make it easier to copy the helix.

      The human genome is approximately 3 billion bases long, and it takes three bases (known as a codon) to code one amino acid. 4 x 4 x 4 = 64 possible amino acids. (Altho they only actually code 20 or so). Then you have to filter out all the codons that don't actually code anything, and are discarded before the gene is transcribed into a protein.

      NOW do the math!

      --
      So.. it has come to this
    2. Re:I haven't a clue... by nounderscores · · Score: 1

      I don't really know what these guys are doing with their computing power, but one cool free bioinformatics resources that allow you to browse the genome is

      http://www.ensembl.org/

      User interface is fairly intuitive and well documented.

      You can see that serving this information is a non-trivial engineering problem.

    3. Re:I haven't a clue... by Anonymous Coward · · Score: 0

      Ensembl is run as a collaboration including the Sanger. The Ensembl analysis pipeline takes up an appreciable section of the compute power at the institute.

      What does it do?

      Well I don't know in detail, but the pipeline includes:

      0 Using learning algorithms to select areas of the genome that potentially represent genes.

      0 Aligning various pieces of experimental evidence for the existance of genes to the genome. The output of these experiments will often take the form of strings of ATG+C of hundreds or thousands of characters, which must be aligned to the billions of characters in the genome, where such alignments may include gaps.

      0 Gathering together the results of predictions and evidence to have a stab at specifying certain region is a gene (remember the predicted 25K human genes only take up a small part of the genome).

      0 Comparing the sequence of genes to each other, in a number of complex ways, to predict relationships.

      This is just a small part of the work done by the ensembl pipeline, producing the info viewed on the website for the human genome. Also Ensembl currently allows the exploration of the genomes of 18 species.

    4. Re:I haven't a clue... by Misanthropy · · Score: 1

      OK I broadly understand 'sequencing the human genome' is mapping out all the combinates of genes.

      Sort of. Sequencing is really just finding the sequence of the DNA bases. (i.e. if a certain piece of DNA is ACCGT that is it's sequence)

      There are 23 chromosomes in the human genome. That chromosomes are a pair of the genes. I understand that each gene is one of four DNA molecules called A,G,C & T. There 16 combinations of those mlecules and I can map those out with a pencil and paper, I can produce all 23 sets with desktop computing power.

      There are 23 pairs of chromosomes. Each human cell (besides sperm/eggs) have 46 chromosomes.
      Each chromosome has many genes. There are about 30,000 estimated genes in the human genome. Each gene can be hundreds or thousands of DNA bases (not "one of four"). There are around 3 billion bases in the human genome. Think your desktop computer will handle that?

      And your math is wrong. Even if you just had four bases, that is 256 combinations not 16.

      So why does it take so much computing power ?
      That is a lot of data to process and analyze.

      What are the really doing with it ?
      Analyzing the raw genome data. Trying to locate genes by itself is a pretty tough job. Trying to match protein sequences with their original gene. This can be very hard because the original DNA sequence does not always match the protein it ultimately codes for. Basically there is a lot of data and a lot of processing of that data.

      Why do they dum down these stories down so much ?
      Do some google searches on DNA, genes, bioinformatics, or gene mapping. You'll find plenty of articles that aren't so "dum".

    5. Re:I haven't a clue... by tempest69 · · Score: 1
      No, no! There are 23 chromosomal pairs. Each half of each pair contains the same (more or less) information - you could think of it as a genetic back-up system. (Except for the XY chromosomal pair in males). At the start, one chromosome is maternal, the other is paternal. But over time, they actually swap bits around until there's a mixture. Each chromosome contains one immensely long strand of DNA, a double-helix. This double helix is NOT redundant, only one of the two strands contains genetic information: The other strand is only there to make it easier to copy the helix. The other strand is cruical to act as a template to repair errors. The best analogy would be a raid-1, there are mechaisms to figure out which strand went bad, so then the information is recovered from the good copy. BTW each chromosome averages 4" in length, calling it immensely long might be a bit of a overcompensation. The human genome is approximately 3 billion bases long, and it takes three bases (known as a codon) to code one amino acid. 4 x 4 x 4 = 64 possible amino acids. (Altho they only actually code 20 or so). Then you have to filter out all the codons that don't actually code anything, and are discarded before the gene is transcribed into a protein. NOW do the math! The big thing that wasnt explicitly pointed out is that each chromosome has around a thousand genes. Genes are sets of hundreds to thousands of base pairs AT TA GC or CG.

      not to knock, but listen carefully to what the person has a clue about, you skipped to the end part without correcting the original assumptions.. so even with the Now do the Math, the guy should assume that desktop power is fine. because you've said that the chromosome is immensly long, but didnt explain a base.. so he assumes that an A is an immensly long molecule composed of billions of bases, connected to an G A T or C also composed of billions of bases.

      The I havent a clue as the headed should have clued you in.

      Storm

      listen next time.

    6. Re:I haven't a clue... by Rikurzhen · · Score: 1

      At the start, one chromosome is maternal, the other is paternal. But over time, they actually swap bits around until there's a mixture. close... they don't swap bits in our body cells, only in the formation of sperm and egg cells. so, for example, the chromosomes your mother gave you thru her egg were a actually a combination of sequence originaly from your maternal grandparents. each body cell carries 2x genomes. each sperm or egg cell gets a randomly mixed 1x version. because the 1x genome is 3e9 bases, any two sperm or egg cells from a single person are totally unlikely to be identical. that's why siblings aren't normally identical.

  56. Bring Back John Katz. by Anonymous Coward · · Score: 0

    Roland can't hold a candle to the master.

  57. Roland Piquepaille Arrested by FidelCatsro · · Score: 1

    Unfortunatly im lying

    --
    The only things certain in war are Propaganda and Death. You can never be sure which is which though
    1. Re:Roland Piquepaille Arrested by tomhudson · · Score: 1
      I heard it wasn't an arrest, but that he volunteered to be a defense witness at the Michael Jackson trial. The deal Piquepaille offered was that in return for testimony admitting to being the real owner of Michael's porn mags, he'd get an exclusive interview on primidi.com, complete with pix of Michael's detachable nose collection, and banner ads from Crazy Glue. Or was it 3M duct tape?

      The defense lawyers said they'd wait and see - they want to hear back from a rebuttal witness (the goat.cx guy) first, because they think he'd have more credibility with a jury.

    2. Re:Roland Piquepaille Arrested by Roland+Picky-Pail · · Score: 1

      I resemble that remark

  58. Who is Roland Pipaquelle? by eno2001 · · Score: 1

    Only one person in the world has ever claimed to have met him - in the pressroom at Microsoft Devshed Conference in Boston complete with a Roland Pipaquelle badge - and described him as a fortyish reddish-blonde who giggled a lot.

    Oh yeah? Wonder what cold crème he uses. Rolland Pipaquelle is a 61-year-old Jehovah's Witness who lives in a shabby genteel garden apartment in desperate need of an interior decorator on a heavily trafficked commercial road at nnnn XXXXXXXXX XXX. XXXXXX, New York. XXXXXX is in XXXXXXXXXXX and XXXXXXXXXXX is Microsoft territory.

    [snip]

    Stop that! This is silly! Really! There is no room for this kind of silliness on Slashdot! Now... go home! And stop it! ;P

    --
    -"...bad old ideas look confusingly fresh when they are packaged as technology" - Jaron Lanier (Digital Maoism on Edge.o
    1. Re:Who is Roland Pipaquelle? by Roland+Picky-Pail · · Score: 1

      And don't you forget it

  59. humans are simple creatures by nounderscores · · Score: 1

    and so are fruit flies.

    An arabadopsis plant on the other hand, that like most plants survives by modifying its cells rather than running away from danger, that's complex.

  60. Sad... by KoReE · · Score: 1

    It's kind of sad that the datacenter I work in does nothing anywhere near as important as genome number crunching.....yet uses a TON more power, and has WAY more storage than this genome DC in this story...

    --
    Instant Karma's gonna get you...
  61. a petabyte? by austad · · Score: 1

    I have a sony DMS-8400 petabyte storage array sitting in storage. They should buy it from me.

    You wouldn't believe how hard it is to sell something like this. It seems like any of the companies that need it have the money to purchase it new. Argh!

    --
    Need Free Juniper/NetScreen Support? JuniperForum
    1. Re:a petabyte? by pcnetworx1 · · Score: 0

      Err, just how exactly did you get one of those arrays "sitting in storage" my friend?

  62. genes run in both directions by nounderscores · · Score: 1

    and overlap. please see my other post linking to the http://www.ensembl.org/ genome browser.

    if you want to see a very dense genome, try looking at some viri. they take advantage of the fact that each amino acid that is used to make the protein machinery are encoded using three bases, and so can put three genes almost on top of each other. It's on the level of funkyness of a programmer writing a sequence of bits in machine language where 8 fully functional programs could be derived depending on whether you shift out one to eight bits from the start of the "program" before loading the program onto the stack of a cpu that has an 8 bit opcode system.

  63. /. malcontents need someone to hate by maynard · · Score: 0, Offtopic

    Why does it piss [some people] off so much that someone makes some money off finding this story?

    When Jon Katz left they decided to hate Michael Sims. Now Sims is gone so they need a new target. Not that Katz, Sims, offered any great insight or content to slashdot, but the hatred and paranoia against them is beyond reason. That said, I don't much like Piquepaille's site and don't click on his links. But he does offer the service of collecting and collating technology information - a service not much different from slashdot I might add - and clearly, some people find it interesting and worthwhile. The claims of some kickback scheme going on really reek of defamation though. Get some proof or STFU. --M

  64. Load Average? by Anonymous Coward · · Score: 0

    I'd like to know the load average on all of those processors or if, for the most part, they're just spinning their electronic wheels.

  65. Uh Oh by ReidMaynard · · Score: 1

    Be careful not to get a Flux Capacitor to close to this

    --
    -- www.globaltics.net

    Political discussion for a new world

  66. hummm by MutteringMonkey · · Score: 1

    The Wellcome Institute... I wonder how they get their samples?

  67. not entitled to the fruits of their own work? by Anonymous Coward · · Score: 0

    I don't like the idea of someone owning numbers any more than I think someone should be entitled to the fruits of their own work.

    I'm not crazy about the patent system either but is that what you really meant? People are not entitled to the fruits of their own work?

    What are you? Some sort of freaking communist?

  68. Clearance on low power chips, Aisle 3 by AirDave · · Score: 1

    They should check out the going-out-of-business sale at Transmeta. Pick up a few dumpsters of low power chips.

  69. Re:Fuck Roland by danharan · · Score: 0, Offtopic

    Using insults and insinuation only serve to make you look bad.

    If you want better content, go submit it.

    --
    Information: "I want to be anthropomorphized"
  70. Cheap electricity by Stankatz · · Score: 1

    That's some cheap electricity. I pay > $600/y for electric in my apartment. They must have their own power plant (no I didn't RTFuckingA, thank you very much.) I wonder how much of that waste heat they use for heating in the winter. ;)

  71. The Relevant Quote Is by Master+of+Transhuman · · Score: 1

    "A shift from proprietary to commodity hardware will also help keep costs down, as will the planned move away from a proprietary 64-bit operating system to open-source Linux. Though the move chimes with Sanger's open-source ethos for its sequence data, Butcher cites solid practical reasons for the change. "HP [Hewlett Packard] pulled the plug on the Alpha chip," he says, "so we have nowhere to go." Moving to another proprietary system means it could happen again, he says. "I want something we can rely on and have control over our destiny for a good few years into the future.""

    Linux rules again, proprietary fanboys!

    --
    Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
  72. Hmmm maybe they should call IBM by Some_Llama · · Score: 1

    I just saw a commercial the other day from IBM, int he commercial 2 scientists were looking for computers to help them map the human genome and replicate the folding of protiens.

    The IBM representative said "Here is Gene, it is able to fold protiens and map the entire human genome". It was a cluster of IBM systems (maybe 40 total).

    I just laughed and tried to explain to my wife how much BS this was (which basically describes all marketing).

  73. Re:Fuck Roland by TheRealStubot · · Score: 1

    Oh, I'm terribly sorry! I clicked the link!

    I admit, I wasn't going to, until you and your nazi friends convinced me that this Roland fellow must have something important to say, since everyone wants to repeal his freedom of speech.

    Congradulations! You achieved exactly the opposite of your intentions.

    Some social pointers:

    1. Using bad language does NOT make you cool.

    2. If someone has a different point of view than you do, it's NOT OK to assault them, verbally or otherwise.

    3. If Roland and Slashdot have a financial arrangement, it's none of your business. Implying that you should have been informed of such an arrangement is conceited.

    Remember Slashdot ( like life ) is what you make of it.

    --
    "I'd rather win in an ugly car than lose in a pretty car" - Jari Lahdenpera
  74. cooling by Dioscorea · · Score: 1

    Friend of mine manages a cluster that models the worlds oceans. One thing they forgot about when planning it was the cooling needs. That added a nice chunk to the budget.

    I used to work at the Sanger Institute (before they were quite so big). One year after the main building was occupied, there was a fire in the main server room, filling the informatics corridor with black smoke. It turned out to have been started by a fan that had been left on all year...

  75. 1.4 MW = over 1000 homes? by phliar · · Score: 1
    That would mean that the mean power consumption of a house is 1.4 kW. A good hair dryer uses that much. Large appliances like air-conditioners, refrigerators, dishwashers, clothes dryers, etc. are all around 1-2 kW.

    1.4 kW is about 2 horsepower. At 110 V, 1.4 kW is a current draw of 12 A. (At 220 V, it's 6 A.) I guess "over 1000 houses" sounds much better than "a few hundred houses."

    --
    Unlimited growth == Cancer.
  76. Re:Breaking news from our reporter Roland Piquepai by Anonymous Coward · · Score: 0

    I want their electricity provider. This works out to less than $.025 per kilowatt-hour. Large industrial plants do not get that low of a rate.

  77. Odd storage requirements. by jesdynf · · Score: 1

    300 terabytes, advancing to a petabyte in three years? I can see it now.

    "Yeah! We got it! The whole Human Genome! We scanned that sucker in... wait, what? You meant... /all/ the genome? Okay. Uhm. Well, we already got Alice scanned in. I guess we can have Bob done in another couple years or so, maybe Charlie after that, but I think we're going to need to look at how we're doing this..."

    --
    Yahoo! Pipes are awesome. How awesome? http://pipes.yahoo.com/jesdynf/slashdot
  78. Re:Research target plenty of waste by Anonymous Coward · · Score: 0

    $200 billion for war in Iraq and nothing to show for it. that;s the real waste. Military spending. It sucks the life blood.

    $200 billion new highway bill. Waste of cash.

    Science is the seed corn of the future. You are not nuts, just not completely informed.

  79. The object (CPU) of my desire (geek lust) by Moekandu · · Score: 1
    Yes, many organizations that "do this seriously" will use blade servers and ridiculous amounts of power.

    However, if I were doing this seriously, I would be shopping here.

    DS-96
    96 CPU's (Efficeon)
    230 GFlops
    1500W Max Power

    Is the hardware cheaper? No. But once you factor in the power and cooling infrastructure and costs, it's definitely the way to go. Especially when you consider that 22 of the above machines will give you 2112 CPU's cranking away with a max power draw of 33,000 Watts, instead of 750,000 Watts.

    Granted, with WTSI's setup, there's still the consideration of power and cooling requirements for the storage arrays. Not an insignificant issue.

    --
    Mediocrity knows nothing higher than itself; but talent instantly recognizes genius. -- Sir Arthur Conan Doyle
  80. I am having a hard time grasping the energy usage by Anonymous Coward · · Score: 0

    How many Library of Congresses would this power?

  81. Exponential Algorithms by SauroNlord · · Score: 1

    I am a comp sci major, just finished my Bsc and continuing towards data mining and bioinformatics for a graduate degree...I do not know as much as I would like to know however I can tell you that the best algorithms are typically brute-force solutions. Why? Because the amount of data and the precision that we require does require all combinations to be processed and analyzed. This all comes back to algorithm efficiency and analysis (big-O a.k.a asymptotic growth rates). Many of these algorithms are of the NP-Hard class, meaning that a polynomial time(read: 'efficient') algorithm in terms of the input (n x m matrix[n dna sequences, each at most m bases long]) are very very very unlikely.

    There are proofs showing that the algorithms are the best (even though they are the brute force) that is possible....so one way or another someone will have to do the 'infinate computations'

    Might as well get started!