New Method To Revolutionize DNA Sequencing
An anonymous reader writes "A new method of DNA sequencing published this week in Science identifies incorporation of single bases by fluorescence. This has been shown to increase read lengths from 20 bases (454 sequencing) to >4000 bases, with a 99.3% accuracy. Single molecule reading can reduce costs and increase the rate at which reads can be performed. 'So far, the team has built a chip housing 3000 ZMWs [waveguides], which the company hopes will hit the market in 2010. By 2013, it aims to squeeze a million ZMWs [waveguides] onto a single chip and observe DNA being assembled in each simultaneously. Company founder Stephen Turner estimates that such a chip would be able to sequence an entire human genome in under half an hour to 99.999 per cent accuracy for under $1000.'"
Gattica, here we come!
But at least there's Uma Thurman.
That's, what, 28 incorrect base pairs out of 4000? I'm not a biologist, but is this considered an acceptable error rate? Even the hopes of 99.999% accuracy seems really awful when there are about 3 billion base pairs in a human genome.
I realize that we aren't going to be trying to make a cloned copy from this data, but what uses is this "good enough" for?
If it's for-profit but free, you're not the customer -- you're the product (e.g., the Slashdot Beta's "audience").
Sub-$1000 genome sequencing will put the creation of 'designer' kids into the realm of the affordable for much of the middle class. Scary stuff. Now we just need to combine that with cheap and reliable cloning techniques and my plans for world domination will be comlete!
My blog
Abstract:
I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
> Company founder Stephen Turner estimates that such a chip would be able to sequence an entire human genome in under half an hour to 99.999 per cent accuracy for under $1000.
:)
I think this qualifies as a true 'technological singularity'
Is there not some form of error-correction in the sequence itself that could be exploited ?
Something like the error correction on an audio compact disk ?
Nullius in verba
Comment removed based on user account deletion
... slowly leading to an era where helicopters can transform into airplanes, blood sampling is completely painless, brain contents can be downloaded by scanning the eyes, and clones can be produced in an afternoon.
Enter Arnold Schwarzenegger's clone, Adam Gibson!
Using 454 sequencing you get average read lenghts of ~400-500 bp. Read lenghts around 20 bp would be pretty much useless. At least for de novo sequencing..
(from Google Cache) Reading DNA sequences from single molecules of polymerase using nanotechnology
I assume that the hardware at Science can withstand a slashdotting better than the crappy blog linked in the summary:
http://www.sciencemag.org/cgi/content/abstract/323/5910/133
Does anyone remember the movie, "GATTACA"?
1/2 hour for $1000, eh? And in another 5-10 years we'll cut that in half or more, both time and cost. It looks like the instant gene sequencing tech from GATTACA will be with us in most of our lifetimes. But even with this announced breakthough it'll be functionally the same.
Since this technique should be a shoe-in for the Archon X Prize.
Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
I guess they didn't have the foresight to use a real host.
I'll be here all night folks try the steak.
I believe the true benefit of this technology will not be for cloning, but for general medicine. For example, you would go to the doctor with a lump, and instead of him doing a biopsy, find cancer, chemo, invasive surgery etc etc, they first take your DNA, sequence it and then take the biopsy and identify the origin of the cancer (is the lump actually metastasized from your pancreas?). Then work on resolving the cure just based on your genetic makeup, rather than a shotgun approach. Additionally medicines for genetic problems, and a number of other diseases would be custom-tailored for your genetic makeup. If you are prone to hypertension, your DNA sequence could prove if you carry the genes for that malady. Really what this is about is a revolution in medicine. It's a private company now that is snatching up all the biggest heads in silicon valley - if and when this goes public, it could be an amazing investment.
Namaste
It looks to be inaccessible. Here are the abstract and fulltext links.
If you want a vision of the future, imagine a youtube comments section scrolling - forever.
DOI: 10.1126/science.1162986
Forensic genetic identification currently uses about 60 important genetic markers. Thats good enough to convict in a court law since the the chance of a duplicate may be less than a billion to one depending on marker combination.
Although humans differ from one another in about 0.1% base pairs for a total of 3 million, the number of difference that describe human variability may be vastly smaller than this. First you discard non-coding DNA which gets you done to 30,000.
Shotgun sequencing depends heavily on supercomputer. Thats a thousand-fold every 15 years right there. Multiply that by more intelligent software, understanding of genetics, and sequencing hardware, you may be squaring that rate.
This is entirely reasonable and desirable if you replace "spider-thing" with "cancer" or "aids," or even "common cold." Gene sequencing your disease and taking the right medicine for what you *actually* have - instead of today's educated guesswork - will be a HUGE advance. Thousands die every year because they have to guess a year in advance which flu strains will be prevalent and usually guess wrong.
This is just an historical accident; 99.9% was what could be done with what people judged "reasonable" effort and cost a few years ago; unless you know whatyou are going to use the sequence for, you don't know what error rate is acceptable
There are medical test that rely on dna sequence, eg myriad makes a fortune from sequencing the gene that gives women hereditary breast cancer. I don't know what there claimed error rate would be, but that would show you what is acceptable in todays clinical market place.
As for the "de novo" rate, eg the difference between a child and its parents, or between two identical twins - I don't think this has been accurately measured, but I do believe that single base changes (eg AATTC to AAATTC) are not as comon as insertion or deletion of several bases..which goes to show that biology is ocmplicated beyond belief
on the overview side, for those of us who follow this, pacbio has been hyped beyond belief, and the production of actual data eagerly awaited; time will tell if single molecule sequencing is to be the wave of the future, helicos bioscience (HLCS) has had a very hard time selling its system, which has been on market for about a year now; for instance, one problem is "blinking" single molecules that are fluorescent can go into a "dark" state where they don't give a signal - which is kind of an obvious show stopper
One base-pair does not a gene make.
If fate makes you a motorcycle, you become a motorcycle.
While they may have only recently published the article, people in bioinformatics have been going crazy about Pacific Biosciences for at least a year.
I recently went to a series of talks on Next Generation Sequencing, and there was an interesting chart that showed that when you factor in sequencing cost, read length, and accuracy, high throughput sequencing is actually *outperforming* Moore's law by a factor of 5 or so!
Regarding the error rate, just a few years ago, 454 had error rates of almost 5% but with redundancy it became negligible. Since then, error rates have gone down dramatically.
Also, an Anonymous Coward up there is wrong -- (0.993)^3 is *less* than 0.993. It should be (1 - 0.993) vs. (1 - 0.993)^3. I don't know why it's been modded to informative. Check your math!!
Company founder Stephen Turner estimates that such a chip would be able to sequence an entire human genome in under half an hour to 99.999 per cent accuracy for under $1000
Does that mean that the chip costs $1000 or that each human genome processed costs $1000?
[Suspects] that come up positive can ask for the more accurate test.
Umm... kind of like getting a lawyer for free if you need legal representation and lack funds, if you come up positive, it should be the default that they run the more accurate test.
error's
That character you're using... I don't think it means what you think it means...
You must be new here. You see, here on teh interweb, many of us are terribly afraid of word's ending in "s" - Plural's, possessive's, contraction's with the word "is", and occasionally even name's. The apostrophe is a polite way of warning the general reading public that an "s" is approaching so that we can brace ourselve's accordingly.
He's getting rather old, but he's a good mouse.
http://www.sciencemag.org/cgi/content/abstract/323/5910/133
Abstract:
We present single-molecule, real-time sequencing data obtained from a DNA polymerase performing uninterrupted template-directed synthesis using four distinguishable fluorescently labeled deoxyribonucleoside triphosphates (dNTPs). We detected the temporal order of their enzymatic incorporation into a growing DNA strand with zero-mode waveguide nanostructure arrays, which provide optical observation volume confinement and enable parallel, simultaneous detection of thousands of single-molecule sequencing reactions. Conjugation of fluorophores to the terminal phosphate moiety of the dNTPs allows continuous observation of DNA synthesis over thousands of bases without steric hindrance. The data report directly on polymerase dynamics, revealing distinct polymerization states and pause sites corresponding to DNA secondary structure. Sequence data were aligned with the known reference sequence to assay biophysical parameters of polymerization for each template position. Consensus sequences were generated from the single-molecule reads at 15-fold coverage, showing a median accuracy of 99.3%, with no systematic error beyond fluorophore-dependent error rates.
Here is an article in New Scientist about the new process. It explains it fairly well and even defines what a ZMW is.
'The tyrant will always find pretext for his tyranny.' - Aesop's Fables
Many people seem concerned about the reading error rate. However, as it's been pointed out, it should be easy enough to read a DNA sequence multiple times (or read the whole genome multiple times) to decrease the error rate significantly. If you have one chip that can read the entire human genome in 30 mins, you can have the same chip read it twice in an hour, or four chips reading four copies in 30 mins.
Furthermore, if you're using a technique like this to map a person's genome, you can be clever about it. Base pairs code genes, which is something you can take into account. For example, if you're reading the eye color gene, and your machine somehow consistently makes mistakes in that area, you can compare your reads to the few possible known eye color genes, and pick the most likely based on the genetic sequences of the entire gene.
The 454 pyro-sequencer currently produces 400bp reads, not 20bp. Granted, that's still a fair bit shorter than this experimental tech claims, but it's also a commercialized product you can actually buy right now. I think it would only be fair to quote current performance figures.
99.3% of your base are belong to us!
Fixed it for you..
You just got troll'd!
' = "Look out! There's an "S" coming!"
You are not alone. This is not normal. None of this is normal.
One of the real advances here is the ability to do this on a single molecule. Existing DNA sequencing techniques all depend on an amplification step, known as ploymerase chain reaction (PCR), in which the DNA is iteratively duplicated (this is done by basically hijacking DNA replication machinery from bacteria). However, PCR introduces numerous biases in the final population of DNA molecules: shorter segments and certain sequences are easier to duplicate than others. As a result, what you end up sequencing is always skewed. This may not be too important when it comes to (re)sequencing a genome, but there are a whole cadre of experimental techniques that use sequencing to investigate regulation and modification of DNA, and here that bias can really skew findings and generate many false positives (things that amplify too easily) and negatives (things that don't amplify well at all).
In medicine, the cost of a study, as well as its reliability, availability, and predictive value, enters into the decisions made in clinical management.
Comment removed based on user account deletion
Real applications of this, however, include looking for gene sequences in adults which predispose them to diseases (e.g. breast cancer) and then providing counseling and monitoring commensurate with that risk, a far less expensive effort than monitoring everyone for the same disease, even if they aren't at risk. Also, one could use this on embryonic cells obtained through amniocentesis to screen for hereditary diseases is families where there are risk factors.
That's not how significant figures work with multiplication. .007 has only one significant figure (the 7) .007 * .007 = .00005 which also has one sig fig (5). 1-.00005 = .99995, since the 1 is a known integer value (and therefore has an infinite number of significant figures).
That being said, the parent has an incomplete grasp of statistics in this case, and using this formula is inaccurate. However, I am at work, so I will not do an in-depth statistical analysis here.
However, don't you just HATE it when your date asks you, "Before this relationship progresses any further, I'll need a sample of your DNA!"
I've abandoned my search for truth; now I'm just looking for some useful delusions.
Considering current sequencing technology generates terabytes of data per day (see the Sanger center), then wouldn't it be efficient to maximize the amount of information per pixel (i.e. per byte)? This method is actually is much worse (orders of magnitude) than the current method. There are many other problems with what they do, but hopefully the cash infusion can last them another 2 years until the write a paper like this. BTW, the say that appropriate camera tech. will be available in 2-5 years, but they're ready now! They might be buying time...
Just as seeing the moon doesn't require the same amount of effort as landing on it, reading a DNA sequence doesn't mean that selective modification is "just around the corner."
Who said anything about 'selective modification'? Read what you wrote:
Also, one could use this on embryonic cells obtained through amniocentesis to screen for hereditary diseases is families where there are risk factors.
If you can screen for hereditary diseases, you can also screen for desired traits such as hair color, eye color, propensity for obesity (yes, there's a gene for that), intelligence, etc.
In no time flat, we'll have become a race of athletic, attractive, social and congenial people who all get along, but are all little short in the intelligence department.
My blog
Before slashdot posts a science story it should check to see if the submitted story is accurate. The Ti pyrosequencer by 454 has a Q20 read length of 400 bases, not 20 bases. The 99.3% read accuracy described in the story is at 15-fold coverage. This means that the raw base accuracy (sequencing a template once) is much lower than 99.3%. This is in comparison to the Applied Biosystem's SOLiD instrument, which has a raw base accuracy of excess of %99.94.