British Library to Archive Electronic Resources
An anonymous reader writes "The British Library is a government-owned library that legally has to hold a copy of every book, pamphlet, map, journal, newspaper and piece of sheet music published in the UK. Today, that law changed and now the Library will be able to collect non-paper resources, such as websites, electronic journals, CD-ROMs and microfilms. Obviously, the library won't be archiving everything in these categories (for a start, the Wayback Machine already does a pretty good job of the websites), but will be keeping resources of national, historical or academic interest. There's more specific information in The British Library's press release. BBC News (which will now be archived by the Library) has an article on the changes."
so.... dmca vs british govt?
i got 20 bucks on the brits.
The Swedish Royal Library, which has also stores everything published in Sweden (since 1640) has been archiving all swedish web pages. (since 1996, I think)
There was a small flap about this recently, due to new data privacy legislation. They workaround is that the material is not available on the web, but can be accessed at the library.
Which is of course, a bit silly given things like the wayback machine, which are located in foreign countries where EU privacy directives don't matter.
but will be keeping resources of national, historical or academic interest.
does my personal "watch my litle baby pictures" blog will be archived then?
Well I have to wonder how all this will be stored and made secure for the next 100 years. Its going to take some large scale hardware, with a fast recall mechanism. Whatever company gets/has the contract must be rubbing their hands with glee
Rus
Cheap UK and US VPS
So when all the news web-sites have to pull a story because it relates to a trial... will it be pulled from the archive?
Will it be put back after the trial?
Or will it be a highly biased archive where anything that ever went to trial is strangely absent apart from the verdict.
I used to manage the ananova search engine and it was a royal pain to have to yank spidered stories out of the result set, yet the way some websites work (different urls for same story) it would be back in again after a while. Judges don't care for such technical excuses.
blog.sam.liddicott.com
do they plan to download and archive mp3s? i could help them out there.
The archive will comprise selective "harvesting" from the 2.9 million sites that have "co.uk" suffixes.
.co.uk (or other .uk address, the TLD for the UK) then it's a reasonable assumption that it's content is both British in its origin and intended primarily for a British audience.
If a site is using a
The potential for overlap with content covered by the DCMA seems negligible but even if there was such an overlap I fail to see how keeping a copy of a web page (and not the files that it may link to) would constitute a breech of the DCMA. Remember, the British Library is governed by British law so issues like copyright (over pictures, song lyrics, etc) aren't really issues at all.
Also remember that the British Library isn't concerned with every last printed word, only those that are believed to be of historically significance and/or academically valuable to future generations.
"Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg
What the articles don't make clear is why legislation was needed. If all that will happen is for the British Library to crawl .uk sites, they could do that already.
For print publications it is mandatory to send a copy to the BL. Obviously that would never be workable for websites. But does the law now say that the BL has the right to take copies of what you publish whether you like it or not, as already happens for dead-tree publications?
For example the library might spider even sites with a robots.txt that forbids it, and be protected (in the UK at least) from legal harassment for doing so.
What new powers does this Act give the library that it didn't have before?
-- Ed Avis ed@membled.com
Is it really possilbe to archive all the information?
basically you need to copy all the webpages and documents, and the changes of those documents.
Movies files and such take up a lot of space. so how are they going to manage something like this-- it'd probably use up terabytes upon terabytes of disk space.
Well, how many years till it becomes the worlds largest archive of porn? After all... that's intersting to a lot of people...
*There's Klingons on the starboard bow, scrape em off Jim!*
They better invest in a bigger pipe - the site is terribly slow already :(
goatse.cx so I can look back on my mispent childhood in 40 years time.
I really believe there is too little discussion about issues like this. What you are hitting on is the matter of accountability. It is an extremely important tool for our society. Unfortunately, it usually takes a serious disaster (like the Great Depression) before people realize that accountability is essential to our civilization and something gets implemented. And the situation is even worse with relatively new technology.
/.er's are not immune. Why do we invest so much time into this site without demanding a certain degree of accountability? Is it not possible for our experience with this site to be pretty normal, yet what actually is going on in the background is quite contrary to our very reason for coming here? Without accountability, how will we ever know?
People tend to see technology as a separate "thing" that does not require the kind of scrutiny that other issues get. People only get excited when the technology stops working.
For instance, the majority of users have no problem with using a closed source OS like Windows. There are some really important issues about accountability that get neglected but as long as it works, people don't care. The only time people start to care is when insecure code allows their files to be erased and reality bursts their bubble. But what is the complaint? "MS, you need to get it together!" Unfortunately, the majority of people do not associate "accountability" as the main factor behind insecure code. They blame MS for being lazy (which is absurd, for so many reasons).
It seems that accountability is always an after-thought. If the system appears to be working, noone complains. However, without accountability, it is very easy for the system to be completely upside-down, yet appear to be working fine on the surface (most accounting scams appear flawlessly normal on the service, even when BILLIONS of dollars are being stolen or misrepresented).
This is not purely academic, and us
Sdelat' Ameriku velikoy Snova!
Ever try to get a library card there? Well, you can't. Not without a letter from your university saying there is no place else in the world you can find the material. I would kill for one big, central, public library in London like the New York Public library. But no, there are 10,000 crappy, little one all over the place.
The British Library isn't a public lending library, it's an academic library. It houses one of the most extensive literary collections in the world and it would seem patently obvious to me why it is that you can't just walk in, fill in a form and just take out whatever you like.
Some of its treasures are so delicate that they can't be touched by human hands - is that the kind of item you think should be easily accessed on a whim?
Is getting hold of relevant material at your own university's libraries really that difficult? Or is obtaining a letter of approval from your faculty impossible? I have to doubt that the answer to both these questions is a "yes".
On a parting note, perhaps you should try comparing the British Library to its one true American counterpart, the Library of Congress. The LoC is a fantastic archive, but despite being publicly funded and supposedly open to the public, you can't access it unless you're actually part of the political machine, as Michael Moore once illustrated.
"Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg
The British Library is a government-owned library that legally has to hold a copy of every book, pamphlet, map, journal, newspaper and piece of sheet music published in the UK
Wrong. The copyright libraries (of which the British library is just one - there are also the Cambridge and Oxford university libraries) do not have to hold a copy of every pamphlet published in the UK, only books, defined as more than one set of bound pages (typically anything over 32 pages is bound into sets of 32 pages and then the sets bound into a book).
of course, what seems 'historically significant' may well be very different in the future. People might just be crying out for those personal blog entries that shaped the new, virtual social scene, or the opensource mailing lists that gave birth to a new fundamental structure of work and progress.
If you ask me, it should cover *EVERYTHING*. Resources should be set aside, just as they were way back when british libraries started, and books were expensive. This is the whole point behind the saying that software is information. And it's important.
i wonder if they'll archive the wayback machine.
i wonder if the wayback machine will archive them.
And if anyone tries to sue us as a result, we'll tell them to fuck off! Rule Britannia!
A while back it was posited that sites should actually be reponsible for providing snapshots of sites, though. Fortunately, I believe this was shot down; the cost implications would be mind-boggling.
I'm glad to see proactive steps being taken, however. Current guidelines for selecting content to archive have produced very usable resources in national libraries such as the one in Aberystwyth where I studied. It isn't as if they keep everything, after all...
Ph-nglui mglw'nafh Gates M'dna wgah'nagl fhtagn.
I understand national libraries in Scotland and elsewhere are a lot less friendly with access, but lots of people visit Wales specifically for NL access. Also, there are only, what, five national libraries? That's good enough concentration of resources for most people.
Ph-nglui mglw'nafh Gates M'dna wgah'nagl fhtagn.
It wasn't offsite at the time. The law was passed in 1911. The southern part of Ireland became an independent state in 1920.
GROGGS: alive and well and living in
I always find it a bit difficult to get worked up about British Library. Yes, it's great that it has a copy of every letter printed since God-knows-when, but who actually gets to see them?!?
... or at least reminiscent of the library in Eco's The Name of the Rose. No public access there.
... but I think British Library errs on the side of protection and caution. My husband, who's an Australian who lived in Denmark for a while, says that as a Swede I just don't understand ... it is one thing to provide public access in upright Scandinavia, it's a totally different thing altogether to provide it to Poms (that's Brits to you); in his opinon, granting Scandinavian-type access to British Library books would be tantamount to kissing them good-bye.
The way I understand that access to the Swedish Royal Library works -- and most other libraries in Sweden for that matter as well as every library I have been in contact with in Denmark and I don't suppose the situation is very different in Norway or Finland -- is that everyone has access to it. That is, it doesn't matter who you are or what you do, you can just walk in and ask to see whatever has taken your fancy and voila, the staff will hand it over to you. Now, you may not be able to take it home with you, but you will be able to peruse it at the library premisses.
British Library, on the hand, seems to be the library equivalent of Fort Knox
I understand that there is a trade-off between granting public access and preservation
The liver is evil and must be punished.
Perhaps you mean the Queen of England.
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
Queen of the United Kingdom.