Slashdot Mirror


Scientists Discover a New Way To Use DNA As a Storage Device (betanews.com)

Mark Wilson shares a report from BetaNews: Researchers from the Waterford Institute of Technology (WIT) in Ireland have developed a way to use bacteria to archive up to up to one zettabyte in one gram of DNA. The technique uses double-strained DNA molecules called plasmids to encode data which is stored in the Novablue strain of the E Coli bacteria. The Novablue bacteria has a fixed location, making it viable for storage, and the data can be transferred by releasing a mobile HB101 strain of E Coli which uses a process called conjugation to extract the data. The antibiotics tetracycline and streptomycin are used to control this process. The method is currently not only expensive, but also slow. Data retrieval takes up to three days at the moment, but researchers believe it should be possible to dramatically speed up this process. Equipment already exists that can be used to write to DNA in seconds. Stability and security are also an issue right now, but it is very early days for the technique, and these current downsides are not viewed as being significant enough to write it off. Potential uses for this method of data storage that have been suggested include the recording of medical records in human DNA, and increasing the traceability of the food chain.

68 comments

  1. T-Virus by darkain · · Score: 4, Insightful

    Do you want the Resident Evil T-Virus? Because this is how you get the Resident Evil T-Virus.

    1. Re:T-Virus by Anonymous Coward · · Score: 0

      All of your DNA will be encrypted unless you pay $299.

    2. Re:T-Virus by LifesABeach · · Score: 1

      I just thinking that finally someone has addressed my storage needs for all my porn. Now if they could just decrease the time to read or write the data.

    3. Re:T-Virus by tonique · · Score: 1

      If transferring one zettabyte takes three days, you get something like 3 gigabytes per second, which might be enough for your needs.

    4. Re: T-Virus by Anonymous Coward · · Score: 0

      Or the speed to read isnâ(TM)t linear.

      Or worse, itâ(TM)s not enough to satisfy his needs... that would be a case where I really think therapy is probably a good idea.

    5. Re:T-Virus by tlhIngan · · Score: 1

      If transferring one zettabyte takes three days, you get something like 3 gigabytes per second, which might be enough for your needs.

      Meh. That's only about 50-100% faster than what you get with current SSDs (the top end SSDs can do 2200MB/sec reads and 1500+MB/sec writes)

    6. Re:T-Virus by Hal_Porter · · Score: 1

      An ejaculation releases 250 million sperm. Each sperm contains 3234 Mbp. Each basepair contains 2 bits. Male orgasms take 5-22 seconds on average.

      So we can work out the bit rate. (( 250 million * 3234 million * 2bits)/13.5 seconds) in petabytes per second = 14.9722222 petabytes per second.

      Aww yeah! Bandwidth, baby!

      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    7. Re:T-Virus by Anonymous Coward · · Score: 0

      Enough for a Linux ISO on DVD format = 2 seconds

      Please don't sneeze, I'm downloading the latest movies from Netflix.

    8. Re:T-Virus by michelcolman · · Score: 1

      I wonder if some day we'll finally figure out that our "unused junk DNA" portions actually contain an entire copy of something resembling the hitchhiker's guide to the galaxy.

    9. Re:T-Virus by Anonymous Coward · · Score: 0

      >"unused junk DNA"

      Are living in the sixties or seventies? There are no such thing as junk DNA. Non (protein) coding != Junk DNA.

    10. Re:T-Virus by Anonymous Coward · · Score: 0

      Never underestimate the bandwidth of... wait what?

  2. Old and busted: Ransomware. New hotness: Cancer. by Anonymous Coward · · Score: 0

    At least we finally know how the human race goes extinct.

  3. Will it run Windows 10? by Anonymous Coward · · Score: 0

    Might as well let MS-NSA have my DNA too.

  4. Storing Data No by wolfheart111 · · Score: 1

    Generating electricity Yes :) Whole specimens.... :P

    --
    [($)]
  5. Interesting tech all round by TheZeitgeist · · Score: 2

    I recall Harvard guys doing data encoding with DNA a few years ago. Same cost/benefit in their technique (don't recall exact specifics beyond the four bases = quaternary number system in their encoding scheme): The information density is vast, but I/O slooow.

    Their application ideas were interesting though; like tiny cameras with memory integrated into wall paint - it would archive history of the room. All kinds of stuff one would think of needing tape backup circa 1995 these guys were pretty sure could be future niche for DNA memory. Interesting observation was how tough DNA is; million year old examples abound throughout wide variety of earth environments.

    1. Re:Interesting tech all round by Anonymous Coward · · Score: 0

      >Interesting observation was how tough DNA is; million year old examples abound throughout wide variety of earth environments.

      DNA is not tough in the slightest. DNA has an average coherent half-life of 521 years, and less at temperatures above 20c. DNA naturally dissociates at 59c into individual strands. DNA is damaged by all forms of radiation, including some forms of light, x-rays, gamma, etc. It is chemically vulnerable to nearly everything acidic, and many things that are basic.

      There are no DNA sequences from animals that are millions of years old that aren't in tiny little incoherent fragments due to the half-life.

      It's a stupid idea from start to finish.

    2. Re:Interesting tech all round by sheramil · · Score: 1

      I recall Harvard guys doing data encoding with DNA a few years ago. Same cost/benefit in their technique (don't recall exact specifics beyond the four bases = quaternary number system in their encoding scheme): The information density is vast, but I/O slooow.

      Oh yeah.

      "Data retrieval takes up to three days at the moment, but researchers believe it should be possible to dramatically speed up this process."

      I don't think any amount of cache ram is going to help there.

    3. Re:Interesting tech all round by Anonymous Coward · · Score: 0

      >DNA has an average coherent half-life of 521 years

      Seems like hard disk, SSD, RAM, ... have a half-life far lower than 521 years.

      >DNA is damaged by all forms of radiation, including some forms of light, x-rays, gamma
      > It is chemically vulnerable to nearly everything acidic, and many things that are basic.

      Any thing different than with hard disk, SSD, RAM ?

      >DNA naturally dissociates at 59c into individual strands

      I don't see any issue with that. When doing a PCR, the DNA is warmed up to 95C. No issue. Try it with an hard disk.

      And even so, it is very easy and inexpensive to massively replicate DNA (hint: PCR).

      You have no clue what you are speaking about.

  6. Slow? by Mr.+Sketch · · Score: 1

    1 zettabyte = 1e9 terabytes
    3 days = 259200 seconds

    1 zettabyte / 3 days = 3,858 terabytes per second

    That sounds pretty quick to me, considering that the current Fiber Optic speed record is only 1,050 terrabits per second or about 131 terrabytes per second.

    1. Re:Slow? by LifesABeach · · Score: 1

      3 days for a response might not be fast enough. Multiple Read/Write devices could reduce the time. My thinking is that we are observing a Virus, not a Bacteria.

    2. Re:Slow? by Anonymous Coward · · Score: 0

      1 zettabyte = 1e9 terabytes 3 days = 259200 seconds

      1 zettabyte / 3 days = 3,858 terabytes per second

      That sounds pretty quick to me, considering that the current Fiber Optic speed record is only 1,050 terrabits per second or about 131 terrabytes per second.

      I think you'll find that's actually up to three days to do a random-access retrieval, not the throughput.

  7. technique uses double-strained ? by Anonymous Coward · · Score: 0

    "technique uses double-strained ", I think they mean double-stranded

  8. retrival in 3 days for 1 zetabyte? by Anonymous Coward · · Score: 0

    doesn't seem too bad

  9. Not a biological thing at that scale... by RyanFenton · · Score: 2

    For those wondering, there's been studies and research on maximum genome size, and beyond a certain point (a couple hundred picograms) the mechanisms don't work for copying/processing/maintaining DNA. The largest animal genome would be around 132 micrograms (marbled lungfish)

    http://www.genomesize.com/stat...

    A picogram is 1/1,000,000,000,000 of a gram, so yeah, getting up to a full gram of the stuff, you can store a LOT of raw data - but it's not something you're going to have survive on its own without a very specialized sort of engineered life/system, and likely secondary specialized critters to defend it against regular old microorganisms.

    The other problem would be read/write speed - DNA replication isn't fast with current life processes... scale that up a few billion times, and you're definitely going to have to do something different than how life as is does things.

    Ryan Fenton

    1. Re:Not a biological thing at that scale... by LifesABeach · · Score: 1

      Kind of sounds like an Engineering Problem to solve.

    2. Re:Not a biological thing at that scale... by Anonymous Coward · · Score: 0

      Right.

      If only biochemical processes existed to split long chains of DNA and proteins into easily digestible chunks.

    3. Re:Not a biological thing at that scale... by Anonymous Coward · · Score: 0

      >DNA replication isn't fast with current life processes

      It is really fast for data storage. In fact, this is the only thing that is not really a research topic in the domain. We are able to replicate massive amount of DNA fragments with PCR. The replication rate of a DNA polymerase does not seem impressive (~4Kbase/sec for E. Coli) BUT this can be massively parallelized. 1000 fragments of 4Kbase gives 4Mbase/sec. In 2-3 hours, we are able to replicate billion of time a fragment (or multiple fragments).

    4. Re:Not a biological thing at that scale... by Anonymous Coward · · Score: 0

      I propose the name "restriction enzymes" for these machines.

  10. 1 zettabyte sata by Anonymous Coward · · Score: 0

    still wont recommend this if its a seagate

  11. DNA is not stable by Anonymous Coward · · Score: 0

    Why would you store digital information in a medium that has a natural half life, and breaks down under mild heat, exposure to sunlight, x-rays, and other ambient radiation?

    I can see burning information into glass, magnetic media, aluminum discs, carbon, etc. and other static media. But why bother with inherently decomposing forms?

    1. Re:DNA is not stable by Anonymous Coward · · Score: 0

      Some are trying to use a synthetic DNA like XNA..

    2. Re:DNA is not stable by LifesABeach · · Score: 1

      The journey of a thousand miles begins with a single step. And stop interrupting the grown ups, they are currently working on this problem.

    3. Re:DNA is not stable by Anonymous Coward · · Score: 0

      You need to do some serious soul searching kid.

    4. Re:DNA is not stable by Anonymous Coward · · Score: 0

      >mild heat, exposure to sunlight, x-rays

      Like RAM and magnetic storage?

      It seems the DNA in our body did a good job during centuries. How much time for your last HD or RAM?

    5. Re:DNA is not stable by LifesABeach · · Score: 1

      You interrupted again.

  12. This won't end well by Latent+Heat · · Score: 1

    "The antibiotics tetracycline and streptomycin are used to control this process"

    What happens when generations of the bacteria develop resistance?

    1. Re:This won't end well by Anonymous Coward · · Score: 1

      These bacteria's are already resistant. The antibiotics are used to keep a selective pressure. A plasmid is a small circular DNA independent of the chromosomes. These are easily lost for example during cell division. To ensure the presence of the plasmid inside the colony of bacteria, the genes of resistance to the antibiotics are inserted in the plasmid (and the data). Cultivating the Bacteria on a antibiotic rich environment only the one having the gene survive. This process ensures that the colony contains the data plasmid. Classical biotech.

  13. Sue E.Coli for Copyright Infingement? by aberglas · · Score: 2

    If the E.Coli breeds along with its data, who do you sue?

    Much more important question than those of mere technical possibility.

  14. lunch crumbs by Anonymous Coward · · Score: 0

    When I drop crumbs in my keyboard while eating lunch at my desk, I always try to leave them there to let a culture form so I can increase my storage. But there's always someone in the office who comes along with alcohol wipes and deletes my files. Not reliable!

  15. Our storage technique is slow, expensive and diffi by Anonymous Coward · · Score: 0

    But that is not enough to justify a cut our funding? Please?

    Or is the actual research better than the slashdot article?

  16. Antibiotics abuse much? by Bruha · · Score: 1

    Seems an incredibly stupid way to use antibiotics.

    1. Re:Antibiotics abuse much? by Anonymous Coward · · Score: 0

      This is a clever way of using antibiotics. This is a classical biotechnology tool to produce proteins (for example, insuline - you know the thing for the diabetics). The role of the antibiotics is to exert a selective pressure to ensure all the bacteria contains the data plasmid. The plasmid contains a antibiotic resistance gene and the data (For insuline, not the data but the gene coding for insuline). When the bacteria reproduces only the one with the plasmid (and the data) survive.

    2. Re: Antibiotics abuse much? by Anonymous Coward · · Score: 0

      The antibiotics ensure that the data holders, who resist the antibiotics, are not outcompeted by non dataholders.

  17. Using a herpes virus by Anonymous Coward · · Score: 0

    now that would have added a real wow-factor to this research.

  18. Obvious speedup, from management by Snotnose · · Score: 2

    If it takes 3 days for a bacteria to produce a datum, then surely 3 bacteria can produce the datum in a day. Add a million or so and you've got yourself a speedy little data retrieval device. It's nothing but an engineering problem.

  19. 3 rules of DNA as storage by Anonymous Coward · · Score: 0

    whenever DNA is proposed as a data storage means, we find the following 3 rules apply

    1 magnetic storage is cheaper and better then anything else
    2 the proposed system is way more complex, and hasn't actually encoded much data
    3 Slashdot fanboys will slaver over it

    PS
    this is the paper that describes, in very poor english, the method
    https://arxiv.org/pdf/1801.04774.pdf

    PPS
    read the bottom of Page 4 in the pdf
    you have to synthesize the DNA into plasmids
    this is like a total non starter for commercial viability, but, hey, this is slashdot

  20. Future workplace excuses by jrumney · · Score: 1

    Sorry boss, the data storage evolved so I couldn't complete those TPS reports you asked for. And I'm taking the rest of day off because the bacteria in my computer gave me a bad case of the Dreaded Lurgi.

  21. Data Center by hcs_$reboot · · Score: 1

    Have 37.2 TB available.

    --
    Slashdot, fix the reply notifications... You won't get away with it...
  22. They'll have to be careful with the coding scheme. by Ungrounded+Lightning · · Score: 1

    They'll have to be careful with the coding scheme.

    Otherwise it would be possible to write an actual virus just by storing appropriate data into a file.

    Note that both DNA and RNA can have enzymatic activity - including functions that would cut them out of the backbone and form them into a viral genome, along with the molecular machinery to package and deliver it.

    --
    Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
  23. Imagine a Beowulf cluster of these by romit_icarus · · Score: 1

    *ducks*

  24. Error correction by tigersha · · Score: 1

    Can you RAID these things? What sort of error correction exists here? ECC DNA?

    --
    The dangers of excessive individualism are nothing compared to the oppressiveness of excessive collectivism
  25. Facepalms by DivineKnight · · Score: 1

    Yes, we generally know this, tell us something new. Engineering principles haven't changed -> you can create logical circuits out of plumbing, electricity, optics, what-have-you...the same applies for data storage.

  26. Just wait... by Anonymous Coward · · Score: 0

    MS will build an OS that would consume all those spare zetabytes you have.
    Remember those days when an OS would fit on a 3.5 inch floppy disk.
    Even MS-DOS can run in a 5.25 inch diskette where you can then do spreadsheet, word processing and play games.
    Now win10 wanted a USB flash drive with 15.0 GiB capacity just to build a recovery disc!

  27. Viral Videos? by afc_wimbledon · · Score: 1

    The obvious first application perhaps??

  28. Re: dangerous and stupid by Anonymous Coward · · Score: 0

    Nah, put a blockchain in it

    Beastmarkcoin: now required to buy or sell

  29. Re: They'll have to be careful with the coding sch by Anonymous Coward · · Score: 0

    Honestly, I think you actually have a good point- this method does seem vulnerable to some manner of attacks, were it attached to some blind theoretical read/write device

  30. Good news for mad scientists! by BeeArt · · Score: 1

    In the old days, they had to hide their data in their cat's collar, now they can store it in the cat itself!

  31. tl;dr by Anonymous Coward · · Score: 0

    I know y'all are just responding to the gist of what you think the article is about. Not one of you stopped to notice what a horribly written article it is. Double strained? Fixed location? Clearly, the reporter understood the material less well than even all y'all.

  32. Wow by Travelsonic · · Score: 1

    1 zettabyte = 1 trillion gigabytes. That's a lot of data in a small space. If they can make this thing practical, it would really help out in technology sectors that require storage of insanely large quantities of data (like, say, YouTube, to name on example)

    --
    If you believe in privacy, and believe you have "nothing to hide" at the same time, you're a goddammed idiot
    1. Re:Wow by fafalone · · Score: 1

      By the time this technology is commercially available, large companies will still need rooms full of equipment because they now require yottabytes for the 64K SuperHyperHDR 4D Holovideo everyone's iPhone 128 is recording.

  33. What kind of internet do these guys have? by Anonymous Coward · · Score: 0

    "archive up to up to one zettabyte in one gram of DNA....The method is currently not only expensive, but also slow. Data retrieval takes up to three days at the moment".

    Maybe there's something wrong with my maths - but I get that to be around 4000 Gb/s

  34. Gibson by cas2000 · · Score: 1

    Johnny Mnemonic would be a much shittier story if E Coli were involved.

  35. HIT ME! by Joe_Dragon · · Score: 1

    HIT ME!

  36. Silly.. as usual.. by Anonymous Coward · · Score: 0

    So far all these attempts to write "data" to DNA have been nice and all but there are so many reasons not to do this. I'm sure someone will point out how every (now) famous technology started out like this.. being laughed out etc.

    You don't write to DNA.. you write DNA, meaning you have to synthesise the sequence you want, afterwards you have it replicated etc. But to claim "within seconds" is complete nonsense as they don't say what amount it written (how many bases per second?). Any HDD from the last 20 years has higher write speeds than this.

    But yeah this always makes for nice headlines.. and grant money.

  37. one zettabyte? by Anonymous Coward · · Score: 0

    That's a lotta porn, Tony.

  38. a feature of Alan Kay's talk in the 90's by Anonymous Coward · · Score: 0

    E. coli has about 100 gigabytes of info

    https://www.cc.gatech.edu/fac/mark.guzdial/squeak/oopsla.html

  39. Optimization by manu0601 · · Score: 1

    How do they plan to prevent E. Coli to optimize out this useless DNA?