Library of Congress To Archive All Public Tweets
After the recent announcement that Groklaw will be archived at the Library of Congress, mjn writes with word that the push to archive more digital content continues: "The US Library of Congress announced a deal with Twitter to archive all public tweets, dating back to Twitter's inception in March 2006. More details at their blog. No word yet on precisely what will be done with the collection, but besides entering your friends' important updates on the quality of breakfast into the permanent archival record, the deal may improve access for researchers wanting to analyze and mine Twitter's giant database."
Given the signal to noise ratio for most tweets, I'm not convinced this is a particularly good use of resources...
Just because you can do something, doesn't mean you have to!
American Third Position
Finally, a real choice!
Clearly, once they've finished, they plan to destroy the entire world so that they can claim to have truly archived all human knowledge, forever.
I'm thinking the byte limit on tweets is the main factor here...easier to just scoop 'em all up than to figure how to get the "important" ones.
Linux, you magnificent bastard, I read the fucking manual!
all of them???
Disk space is cheap...
They should get a copy of the internet archive while they're at it.
The only time I really actively used Twitter was during the recent LHC 3.5TeV event, because the webstream was completely overloaded. LoC preserving it? Future generations will look back and conclude that some people REALLY did have to TOO much time and trivial stuff to share.
They should get a copy of the internet archive while they're at it.
And alt.binaries too. Think of the "research" potential there... ;)
I want peace on earth and goodwill toward man.
We are the United States Government! We don't do that sort of thing.
http://twitter.com/mzzt/status/12179834899
It had to be done.
I suspect a lot of the interesting information is in the aggregate anyway, not individual tweets: things like trends, analysis of subgroups, linguistic analysis, etc.
10 PRINT CHR$(205.5+RND(1)); : GOTO 10
All 'useless twits' jokes aside, this is pretty interesting. But I wonder if they'd run into any copyright laws.
Reading the Twitter ToS turns up with this:
You retain your rights to any Content you submit, post or display on or through the Services. By submitting, posting or displaying Content on or through the Services, you grant us a worldwide, non-exclusive, royalty-free license (with the right to sublicense) to use, copy, reproduce, process, adapt, modify, publish, transmit, display and distribute such Content in any and all media or distribution methods (now known or later developed).
which looks to me like posters retain copyright, but Twitter retains the right to grant others the same license you've granted them (non-exclusive license to provide their service).
So based on my reading, Twitter (and the LoC) are in the clear?
I have developed a truly marvelous proof of this comment, which this signature is too narrow to contain.
I know you are joking, but this kind of stuff is actually very important to historians. For example, the only reason we are able to reconstruct how many hours a day people worked in the medieval era is by looking at court records - the judge will ask things like "what were you doing at five" and the person will respond with answers like "eating" or "sleeping" or "working", and by going though a lot of court records, we were able to guess at how people lived back then.
This will allow the historian of the future to guess much more accurately.
Math for the day:
Without compression, all tweets in human history will fit on a single hard drive costing less than $100.
http://search.twitter.com/search?q=a (to find the latest tweet number)
http://twitter.com/about (character limit)
http://www.pricewatch.com/hard_removable_drives/ (1.5TB drive)Delete
http://www.google.com/buzz/fulldecent/18tfNfPHSBp/Math-for-the-day-Without-compression-all-tweets-in
-- I was raised on the command line, bitch
They were probably too busy watching Medieval Idol to even realize who Shakespeare or the King was ;)
A jest, I know, but it does demonstrate a serious point.
Our history books are based on records maintained by the winners of wars, the leaders, the successful, etc. We know a lot about Shakespeare. We know relatively little about how his audiences actually felt about his work.
We largely speculate as to how life was for the ordinary folk during historical periods based on writings about them, not writings from them. The exception to this is diaries, and now many people maintain those any more. Twitter can help replace some of that perspective.
Admittedly, Twitter is not an ideal way to get a picture of a society, but you get to hear historical events told from a very different perspective. Actually, you get to hear them from LOTS of perspectives. They may not be an accurate portrayal of the events, but they are a snapshot of how a society reacts to and perceives events.
Who will represent the narcissists in society for future generations?
"This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
I would that a social scientist in the 23rd Century does that think that average human of today posts every triviality in his life like most of the current twitters.
Troll is not a replacement for I disagree.
We learned more about ancient Egypt from their twitter then from all the official records designed to be survive the ages. Sure sure, very interesting to read the "unbiased" record of a pharaoh in his own tomb, but it is from the "trash" notes that were recovered that we learned about how the country itself worked. Including such little details as that the pyramids were not made by slaves.
The official records of the US will be Fox news. Better pray that future researchers have access to some other source, or they will come back in time and nukes us all, causality be damned.
MMO Quests are like orgasms:
You may solo them, I prefer them in a group.
A library archiving your work does not necessarily imply that you don't own the copyright on it.
Soon after, he publishes a paper with his revolutionary new theory: People in the 21st century were so forgetful that they decided to record all details about their daily life in a central database so they could recover it if necessary.
The Tao of math: The numbers you can count are not the real numbers.