Human Genome Sequencing Completed
Arthur Dent '99 writes "According to this article at Reuters, the last chromosome in the human genome has finally been sequenced, taking 150 British and American scientists 10 years to complete. The sequenced chromosome, Chromosome 1, is the largest chromosome, with nearly twice as many genes as the average chromosome, making up eight percent of the human genetic code. The Human Genome Project has published the sequence online in the journal Nature, according to the article. It contains 3,141 genes (over 1,000 of them newly discovered), and 4,500 new SNPs -- single nucleotide polymorphisms -- which are the variations in human DNA that make people unique."
I won't bore you with the details, but theres lots of GATCAATGAGGTGGACACCAGAGGCGGGGACTTGTAAATAACACTGGGC type things here
liqbase
Can we start the patent countdown clock?
The simple truth is that interstellar distances will not fit into the human imagination
- Douglas Adams
Now where's my +1 Talent in every base?
if God wouldn't have used LISP to encode the darn sequence in the first place
I'll take my next kid with larger-than-average height, enhanced frontal lobes, a natural resistance to the polio virus, OH and dont forget the 20/10 vision!
Why do one chromosone have more genes than others?
"To map the very stuff of life; to look into the genetic mirror and watch a million generations march past. That, friends, is both our curse and our proudest achievement. For it is in reaching to our beginnings that we begin to learn who we truly are."
-- Academician Prokhor Zakharov,
"Address to the Faculty"
The surprise isn't how often we make bad choices; the surprise is how seldom they defeat us.
Scientists: All your base pair are belong to us!
Your single nucleotide polymorphisms are unique! Just like everyone else's.
ACGATCGTACGcopyrightTAGATCGCGTAGTAGCTAGCTGTbyGGCGG CGGTACGGCTATiehovaAGTCGATCGATGATCG5billionBC-TAGCT AGCTAGCTAGCTAGinfinityTAGTAGTATTTATTTunauthorizedA GGCGGTATGCTAGCTAGreproductionCTGATGTGTAGCCCAprohib itedCCAGCTTAGCTAbyGCTAGCTAGTGTAAATCGCCATCGCGCCTAdi vineTTCTCTAGAGCTTAGCATGCTAlawCGTACGTAGCTA
The grass is always greener on the other side of the light cone.
I'm forced to agree with QuantumG. I'm a Human Geneticist and the genome project is an invaluable tool in the study of human disease. I can understand the fear of the misuse of the technology, but do you think that part of the genome should have been left unsequenced? If so which parts? What would be the benefit of such and action? This technology has allowed for the development of the ability to rapidly screen for the many know disease mutations to assess risk for "genetic" disease. It has also had practical medical impact in daily life. Screen cancer samples for chromosomal abnormalities and mutations has led to the development of rational therapy for specific cancer types. Where everything is leading is rational therapy overall. Individualized medicine and preventative medicine are the goals. I do agree with you that there are dangers associated with such knowledge. The question is whether we can use it to benefit the everyday man or woman to improve the quality of life for everyone.
From the fine article:
"The scientists also identified 4,500 new SNPs -- single nucleotide polymorphisms -- which are the variations in human DNA that make people unique."
There are other variations which make us unique.
Alternate alleles*
Indels (insertions/deletions)
Variable numbers of repeats.*
The genetic code uses 4 letters, but I'll use English for explaination.
A SNP is a single letter which has different values in different individuals: "The cat and the dog" vs "the hat and the dog".
An indel is where letters have been inserted into one sequence or deleted from another (without additional data, we can't distinguish these possibilities.)
"The cat and the dog" vs "the cat and the big dog".
In alternate alleles there are a bunch of changes which always stick together, e.g. we observe "the cat and the big dog" and "the cat and the small mouse", but never (or exceedingly rarely) "the cat and the big mouse" or "the cat and the small dog."
Variable repeats are a special case of indels, but common enough to warrant a category of their own. "The cat and and and the dog" vs "the cat and and and and and the dog".
Quattuor res in hoc mundo sanctae sunt: libri, liberi, libertas et liberalitas.
Why do one chromosone have more genes than others
Why not? It's because it wasn't designed by a computer geek (or anyone/thing else) where you might have said, hrmmm, we need about 30,000 genes for this design, so we'll split that into 26 chromosomes of 1,154 genes apiece. That should do it!
The fact is, we evolved, and so our components are just bits and pieces taken from all our previous ancestors, modified according to whatever was needed to suit the environment we happened to find ourselves in at the time. As with all natural, biological, dynamic processes, what emerges is often bizarrely disorganised, yet somehow works.
This is good news but not too useful until we can model protein shaping.
The AGCT's code for proteins and so far we can only model very short combinations. All you coders keen for a life project have a crack at it. Theres 20 amino acids formed from combinations of three base pairs. The amino acids have attraction and repulsion properties with each other and their environment and form to make a unique shape. Its the analysis of that 3D shape that will solve:
- all cancer - modelling protein shapes means instant cancer cures
- bird flu - again modelling proteins means instant antibodies to diseases
- the most toxic substance ever invented - it will also open up designer drugs
gtcatgcgatacgtaggcaaatcg2tgacggcagt
hmmm i guess its not as funny unless its binary
Just to add on to this
20/20 vision means that when you stand away from something at 20ft, what you see is what the normal person would see at 20ft.
20/40 is, well, if you stand 20ft away, you see what a normal person would see at 40ft
Same goes for 20/10.
You seem to be under the impression that the number 1000 has some special meaning. Let's try your comment again, in octal:
pi * 1750 genes. Got to love those fun coincidences
Not so exciting now, is it? Nature is not decimal-based. The only reason we tend to be is because of the number of fingers we have.
Comment removed based on user account deletion
The basic idea is this. Our cells need a program that tells them what to do. That's the genome. There are a total of 46 chromosomes consisting of two sets of 23 independent chromosomes (1 - 22 and X or Y). DNA makes up the chromsomes. It's just a chemical structure that stores information; the four chemicals that make up DNA are Adenine (A), Thymidine (T), Cytosine (C) and Guanine (G). Every DNA molecule is actually two pieces of DNA that pair together as A binding to T and C binding to G. Sequencing is a chemical reaction that will tell you what the sequences of these four nitrogenous bases are. For example you may end up getting a read of AGTATTACGTATGCATAGGTCCGATG from a sequencing reaction (usu you'll get about 500 - 700 bases in one reaction). This tells you the sequence of ONE of the TWO strands of the DNA molecule. BUT since they pair in a predictable way, you know the sequence of the opposite strand (A-T and C-G). Our genomes are composed of approximately 3.2 billion total As, Cs, Ts and Gs. The goal of the genome project was just to tell us what the sequence of those bases are. That's it. Finding genes and things of that nature are really things that come about from having the primary sequence to reference. If you want to find a mutation you have to know what the sequence is SUPPOSED to be and WHERE IT IS before you can say it is different. That's your quick answer: the genome project sought to determine (1) what the sequence of bases in human chromosomes where and (2) the physical position of these sequences within the chromosomes. They did some other interesting things to prepare for it along the way, but that is a separate matter.
Completing the sequence and actually putting it together are two entirely different affairs. Small sequences called ESTs (Expressed Sequence Tags) were obtained during this effort. The big task after that was to put everything together AND in order. Think of it as a massive puzzle. Even the genome has different "builds" depending on the level of completeness of this work.
A CC-licensed illustrated horror novel
Actually there are a few "numbers" that are "magic" in nature. Depending on the species.
10 certainly is important to us, having 10 fingers and 10 toes. Unless you're carpenter.
Asking a bee, you'd prolly be called crazy and 6 is the perfect number, from legs to comb.
A spider would probably tell you 8 is more important, from legs to their web's segments.
But since this genome has meaning for us, I'd wager that our "magic" applies.
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
There are still large gaps in each chromosome, either due to repetitive sequences, high GC content or closeness to the centromere - basically saying that the human genome is finally done is like saying that 99.9% equals 100%, which it doesn't. This is especially important in cases where you actually NEED to use sequence in areas where it has not been assembled correctly or has not been sequenced... which has happend to me multiple times during the last couple of years... oh and those places in the genome have been unfinished ever since the first installment appeared publicly... they are even lacking in the Celera version of the genome... Finished my ass! -pug
This is by my count the fourth time that the human genome has been announced "finished" - anymore times and they will all be invited to become slashdot editors.
Automated DNA sequencing software