Slashdot Mirror


British Library to Archive Electronic Resources

An anonymous reader writes "The British Library is a government-owned library that legally has to hold a copy of every book, pamphlet, map, journal, newspaper and piece of sheet music published in the UK. Today, that law changed and now the Library will be able to collect non-paper resources, such as websites, electronic journals, CD-ROMs and microfilms. Obviously, the library won't be archiving everything in these categories (for a start, the Wayback Machine already does a pretty good job of the websites), but will be keeping resources of national, historical or academic interest. There's more specific information in The British Library's press release. BBC News (which will now be archived by the Library) has an article on the changes."

76 comments

  1. funny face off by TedCheshireAcad · · Score: 2, Funny

    so.... dmca vs british govt?

    i got 20 bucks on the brits.

    1. Re:funny face off by lanswitch · · Score: 1
      Maybe it's the EU against the DCMA. Brits are EU-members, so I guess that their laws follow EU-guidelines and laws.

      This comment Copyright 2003 The Queen of Britain. If you steal this shit, she'll fucking cut you.

    2. Re:funny face off by TomV · · Score: 2, Interesting

      A very similar requirement benefits the Library of Congress in the USA, under the name "Mandatory Deposit" (here are the rules).

    3. Re:funny face off by Anonymous Coward · · Score: 0

      dmca vs british govt

      where do i collect my 20 bucks?

      (you would've won if you've just used british government instead.)

    4. Re:funny face off by Anonymous Coward · · Score: 0

      And by even more if you'd used UK government.

    5. Re:funny face off by silentbozo · · Score: 1

      I wonder how many times the mandatory deposit requirement has been enforced? Supposedly every item copyrighted in the US since 1978 should be on deposit at the LOC, and available to member libraries on request. How much money is allocated to storing and preserving these works?

    6. Re:funny face off by TomV · · Score: 1

      Well, it's not 100%, certainly. Though given that providing a Mandatory Deposit copy if requested is a condition of Copyright Registration, I'd be surprised if a publisher were to refuse a request.

      From the 2001 Annual Report, the Copyright Office had a budget of $40,896,000 in fiscal Year 2001, though I'm not clear on whether this includes funding for the storage of earlier MD items. Total LOC funding for 2001 was $572M, plus $119M in gifts.

      With the volume of published material increasing exponentially with time, with the best will in the world it's just unfeasible to maintain a comprehensive collection anymore. What Mandatory Deposit gives is the ability, in an environment where the LOC has to make a judgement call, to rely on the availability of any item the LOC really wants (the initial penalty for refusal to supply is a 2500 fine, plus 250 per item refused).

  2. Swedish Royal Library too by k98sven · · Score: 4, Interesting

    The Swedish Royal Library, which has also stores everything published in Sweden (since 1640) has been archiving all swedish web pages. (since 1996, I think)

    There was a small flap about this recently, due to new data privacy legislation. They workaround is that the material is not available on the web, but can be accessed at the library.

    Which is of course, a bit silly given things like the wayback machine, which are located in foreign countries where EU privacy directives don't matter.

    1. Re:Swedish Royal Library too by amembleton · · Score: 2, Funny

      From the Swedish Royal Library: http://www.kb.se/ENG/kbstart.htm Opening hours Friday October 31 the library will close at 16 PM. Saturday November 1 the library is closed Now, thats an intresting time.

    2. Re:Swedish Royal Library too by orangesquid · · Score: 2, Funny

      The PM is to distinguish from 16 AM. Otherwise, confusion could result. ;)

      --
      --TheOrangeSquid Is it any wonder things seem so awry? We swim in a sea of confusion and don't have to think to survive
  3. what about blogs by borgdows · · Score: 1, Funny

    but will be keeping resources of national, historical or academic interest.

    does my personal "watch my litle baby pictures" blog will be archived then?

    1. Re:what about blogs by mlk · · Score: 1

      Nope, but storys about IDS will...

      --
      Wow, I should not post when knackered.
    2. Re:what about blogs by borgdows · · Score: 0

      what about if my baby become the first man on Mars ?

  4. Storage by rf0 · · Score: 4, Insightful

    Well I have to wonder how all this will be stored and made secure for the next 100 years. Its going to take some large scale hardware, with a fast recall mechanism. Whatever company gets/has the contract must be rubbing their hands with glee

    Rus

    1. Re:Storage by Anonymous Coward · · Score: 0

      Nah, just put it on an FTP site and let the web back it up for you.

      Oh wait.

      Damn. :/

    2. Re:Storage by TomV · · Score: 4, Interesting

      Considering the cost of the existing 340km of basement shelving, mostly mobile, in a tightly controlled microenvironment, with fire and flood protection, I certainly wouldn't expect them to skimp on the storage. But I'd expect the competitive tendering process to keep some sort of a lid on the spend.

    3. Re:Storage by bug-eyed+monster · · Score: 5, Insightful

      Yeah it'll be interesting to see how the info will be stored. Looks like they're also collecting CD-ROMs and other "non-print publications." I don't think they absolutely need to store it somewhere that'll last for 100 years. They could store it in redundant media and just replicate them over time as the media's lifespans expire.

      As far as fast recall, the articles don't say if the info will be available on the net. If it's just for archival purposes, they don't need to put it anywhere that's quickly accessible. After all it's a government-run library, so nobody will expect to take less than a day or two to retrieve anything.

    4. Re:Storage by rf0 · · Score: 2, Funny

      Well I worked out where they can put it all on one of those unlimited disk space hosts. An you could probably do it all for $50/mo

      Rus

    5. Re:Storage by IM6100 · · Score: 1

      Or they could just put it on Freenet.

      *rim shot*

      --
      A Good Intro to NetBS
  5. Censorship? by samjam · · Score: 4, Insightful

    So when all the news web-sites have to pull a story because it relates to a trial... will it be pulled from the archive?

    Will it be put back after the trial?

    Or will it be a highly biased archive where anything that ever went to trial is strangely absent apart from the verdict.

    I used to manage the ananova search engine and it was a royal pain to have to yank spidered stories out of the result set, yet the way some websites work (different urls for same story) it would be back in again after a while. Judges don't care for such technical excuses.

    1. Re:Censorship? by squiggleslash · · Score: 1

      I doubt it'll be pulled from the archive as that's not necessary. In order to comply with the law, all that's needed is for the material to be inaccessable until a verdict is reached.

      --
      You are not alone. This is not normal. None of this is normal.
    2. Re:Censorship? by samjam · · Score: 1

      Yeah, but guess whats easiest?

      To remove something
      or
      to remove it and put it back again later
      or
      to remove it and remember to put it back later and put it back later
      or
      to remove something and be bothered to remember to put it back later and to be bothered to put it back later and to put it back later

      Guess whether or not any stories we yanked from the search set were restored, or if we left the cron job running which kept pulling them.
      *cough*
      I don't know, but I suspect the cron job may have been stopped when another was started for another trial a lot later, but certainly its not commercially worth the trouble to build such things into the spider and search specially.

    3. Re:Censorship? by Anonymous Coward · · Score: 0

      boolean show?

    4. Re:Censorship? by squiggleslash · · Score: 2, Insightful
      It's "easiest" just not to archive anything whatsoever - regardless of whether the content's legal or not. However, doing so would be against both the spirit and word of the law.

      The law puts an onus on the British Library to archive everything. It also puts an onus on the British Library not to publish material that might prejudice a court proceeding. The only way to obey both laws is to archive everything and provide conditional access.

      --
      You are not alone. This is not normal. None of this is normal.
  6. what about music? by Anonymous Coward · · Score: 1, Funny

    do they plan to download and archive mp3s? i could help them out there.

  7. RT(F)A... by WIAKywbfatw · · Score: 1

    The archive will comprise selective "harvesting" from the 2.9 million sites that have "co.uk" suffixes.

    If a site is using a .co.uk (or other .uk address, the TLD for the UK) then it's a reasonable assumption that it's content is both British in its origin and intended primarily for a British audience.

    The potential for overlap with content covered by the DCMA seems negligible but even if there was such an overlap I fail to see how keeping a copy of a web page (and not the files that it may link to) would constitute a breech of the DCMA. Remember, the British Library is governed by British law so issues like copyright (over pictures, song lyrics, etc) aren't really issues at all.

    Also remember that the British Library isn't concerned with every last printed word, only those that are believed to be of historically significance and/or academically valuable to future generations.

    --

    "Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg
  8. Voluntary or compulsory? by Ed+Avis · · Score: 4, Interesting

    What the articles don't make clear is why legislation was needed. If all that will happen is for the British Library to crawl .uk sites, they could do that already.

    For print publications it is mandatory to send a copy to the BL. Obviously that would never be workable for websites. But does the law now say that the BL has the right to take copies of what you publish whether you like it or not, as already happens for dead-tree publications?

    For example the library might spider even sites with a robots.txt that forbids it, and be protected (in the UK at least) from legal harassment for doing so.

    What new powers does this Act give the library that it didn't have before?

    --
    -- Ed Avis ed@membled.com
    1. Re:Voluntary or compulsory? by sh4de · · Score: 2, Interesting

      Why is this important? Unless you have "sensitive" data on you web page, storing the contents of your index.[html|shtml|php] is no big deal now, is it? If you do have this "sensitive" data on your web page in the first place, don't you wish it to be archived somewhere. The age-old question of privacy appalls me sometimes. Not everything is government control and big brother watching upon us. Lighten up!

    2. Re:Voluntary or compulsory? by Ed+Avis · · Score: 1

      It's certainly important from the Library's point of view. (BTW, I didn't say in the above post whether I'd be in favour of or opposed to such a policy.)

      --
      -- Ed Avis ed@membled.com
    3. Re:Voluntary or compulsory? by jasoncart · · Score: 1

      I believe the legislation was needed to increase the library's remit to cover electronic documents.

      The library is currently government funded to cover just paper documents.

    4. Re:Voluntary or compulsory? by Freddles · · Score: 1

      I suspect there are two asepcts to this:

      1. New media, such as CD Roms, etc were not previously covered by the mandatory deposit rights the library has, so they may simply be taking this opportunity to make an announcement that covers both CD Roms and the web since that's simpler for the general public to understand.

      2. It may also be that without the law being changed, they would not have received the necessary government funding to create and maintain their web archive.

  9. possible? by mehtars · · Score: 0, Redundant

    Is it really possilbe to archive all the information?
    basically you need to copy all the webpages and documents, and the changes of those documents.
    Movies files and such take up a lot of space. so how are they going to manage something like this-- it'd probably use up terabytes upon terabytes of disk space.

    1. Re:possible? by __past__ · · Score: 2, Insightful

      The existence of the wayback machine pretty much proves that it can be done, doesn't it? Of course, it is inclomplete, but it doesn't restrict itself to the UK either.

    2. Re:possible? by mlk · · Score: 1

      It will not try to, only sites of "historical (british?) intrest".

      --
      Wow, I should not post when knackered.
  10. Historical Interest... by Ceadda · · Score: 1, Funny

    Well, how many years till it becomes the worlds largest archive of porn? After all... that's intersting to a lot of people...

    --
    *There's Klingons on the starboard bow, scrape em off Jim!*
  11. better hardware + bandwidth by millette · · Score: 1

    They better invest in a bigger pipe - the site is terribly slow already :(

    1. Re:better hardware + bandwidth by millette · · Score: 1

      Oups, I take it back, it's not so slow after all. I also found this listing more archives from the same company (Reading Room) that created the BL site.

  12. I hope they archive... by Anonymous Coward · · Score: 2, Funny

    goatse.cx so I can look back on my mispent childhood in 40 years time.

  13. Moving into some scary times . . . by Idou · · Score: 4, Insightful

    I really believe there is too little discussion about issues like this. What you are hitting on is the matter of accountability. It is an extremely important tool for our society. Unfortunately, it usually takes a serious disaster (like the Great Depression) before people realize that accountability is essential to our civilization and something gets implemented. And the situation is even worse with relatively new technology.

    People tend to see technology as a separate "thing" that does not require the kind of scrutiny that other issues get. People only get excited when the technology stops working.

    For instance, the majority of users have no problem with using a closed source OS like Windows. There are some really important issues about accountability that get neglected but as long as it works, people don't care. The only time people start to care is when insecure code allows their files to be erased and reality bursts their bubble. But what is the complaint? "MS, you need to get it together!" Unfortunately, the majority of people do not associate "accountability" as the main factor behind insecure code. They blame MS for being lazy (which is absurd, for so many reasons).

    It seems that accountability is always an after-thought. If the system appears to be working, noone complains. However, without accountability, it is very easy for the system to be completely upside-down, yet appear to be working fine on the surface (most accounting scams appear flawlessly normal on the service, even when BILLIONS of dollars are being stolen or misrepresented).

    This is not purely academic, and us /.er's are not immune. Why do we invest so much time into this site without demanding a certain degree of accountability? Is it not possible for our experience with this site to be pretty normal, yet what actually is going on in the background is quite contrary to our very reason for coming here? Without accountability, how will we ever know?

    --
    Sdelat' Ameriku velikoy Snova!
  14. That's nice but. by CGP314 · · Score: 0, Flamebait

    Ever try to get a library card there? Well, you can't. Not without a letter from your university saying there is no place else in the world you can find the material. I would kill for one big, central, public library in London like the New York Public library. But no, there are 10,000 crappy, little one all over the place.

    1. Re:That's nice but. by TomV · · Score: 2, Informative

      There's nothing in the BL that you can't get within a fortnight by Inter Library Loan from the crappy little library of your choice.

    2. Re:That's nice but. by Daniel+Dvorkin · · Score: 1

      I talked my way into a "British Library card" once. I was stationed in the UK as a USAF medic and was spending the day in London. I told them I was doing a research project on historical military medical techniques, showed them my two ID's (the regular military ID card and the "don't shoot this guy" Geneve Convention card that medics get) and was issued a three-day BL card that gave me access to, well, pretty much the whole damn thing.

      They were watching me pretty closely, though, so, in a profound fit of geekery, I actually did spend the day browsing through old military medical texts. It was fascinating, actually, but ... [shudder] I am really, really glad I was a medic in the late 20th c. rather than any previous era, let me tell you.

      --
      The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
    3. Re:That's nice but. by matthew.thompson · · Score: 2, Informative

      Often it's faster than that and if necessary the copy can come from the BL - I've received a book before now that was held in the BL before being delivered to my local library for my attention.

      --
      Matt Thompson - Actuality - Insert product here.
    4. Re:That's nice but. by TomV · · Score: 1

      Actually, if I dust off my old Librarian degree for a second, the British Library is in fact not a biulding full of books at all, but rather a collection of orghanised knowledge, which can be acesses from any of thousands of crappy little libraries, and at several huge ones too ;-)

  15. Are you kidding me? by WIAKywbfatw · · Score: 3, Informative

    The British Library isn't a public lending library, it's an academic library. It houses one of the most extensive literary collections in the world and it would seem patently obvious to me why it is that you can't just walk in, fill in a form and just take out whatever you like.

    Some of its treasures are so delicate that they can't be touched by human hands - is that the kind of item you think should be easily accessed on a whim?

    Is getting hold of relevant material at your own university's libraries really that difficult? Or is obtaining a letter of approval from your faculty impossible? I have to doubt that the answer to both these questions is a "yes".

    On a parting note, perhaps you should try comparing the British Library to its one true American counterpart, the Library of Congress. The LoC is a fantastic archive, but despite being publicly funded and supposedly open to the public, you can't access it unless you're actually part of the political machine, as Michael Moore once illustrated.

    --

    "Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg
    1. Re:Are you kidding me? by CGP314 · · Score: 1

      I guess I was just dissapointed that there is no huge library in London that I can spend the day in beeing a huge nerd. The closest thing is Borders :/

    2. Re:Are you kidding me? by Anonymous Coward · · Score: 0

      there are a number of pretty damn big ones though.
      east sheen library (in an obscure part of sw london) is quite big enough for all but the most hardcore nerds, and it's a general example, not an exception. they have a big collection of microfiche there :)

    3. Re:Are you kidding me? by Anonymous Coward · · Score: 0

      If you're an academic (you can probably get an entercard if you are, through your institution), and can travel, get into one of the other legal deposit libraries. There are two within striking distance of London:

      the University Library of the University of Cambridge

      the Bodleian Library of the University of Oxford.

      The UK legal deposit libraries are, I believe:

      the British Library
      the University Library, Cambridge
      the Bodleian Library, Oxford
      the National Library of Scotland
      the National Library of Wales,
      and, bizarrely, the Library of Trinity College, Dublin (in Eire).

      I'm impressed that the UK legal deposit system has offsite backups :)

    4. Re:Are you kidding me? by ktorn · · Score: 1
      Some of its treasures are so delicate that they can't be touched by human hands


      Why discriminate humans? Are the hands of a chimp any cleaner?
  16. Point of order by Anonymous Coward · · Score: 0

    The British Library is a government-owned library that legally has to hold a copy of every book, pamphlet, map, journal, newspaper and piece of sheet music published in the UK

    Wrong. The copyright libraries (of which the British library is just one - there are also the Cambridge and Oxford university libraries) do not have to hold a copy of every pamphlet published in the UK, only books, defined as more than one set of bound pages (typically anything over 32 pages is bound into sets of 32 pages and then the sets bound into a book).

    1. Re:Point of order by Anonymous Coward · · Score: 0

      My understanding is that the British Library isn't merely one of the several copyright libraries, but holds a special status; whereas the others are entitled to whatever publications they want, the British Library actually takes them all.

      I'd best back this up: According to the British Library's site: "We receive a copy of every publication produced in the UK and Ireland," and their collection does include maps, music, newspaper, theses, etc.

  17. historical significance... by Anonymous Coward · · Score: 0

    of course, what seems 'historically significant' may well be very different in the future. People might just be crying out for those personal blog entries that shaped the new, virtual social scene, or the opensource mailing lists that gave birth to a new fundamental structure of work and progress.

    If you ask me, it should cover *EVERYTHING*. Resources should be set aside, just as they were way back when british libraries started, and books were expensive. This is the whole point behind the saying that software is information. And it's important.

  18. hmmm by Anonymous Coward · · Score: 2, Funny

    i wonder if they'll archive the wayback machine.
    i wonder if the wayback machine will archive them.

  19. Hooray for we Englishmen! by Anonymous Coward · · Score: 0

    And if anyone tries to sue us as a result, we'll tell them to fuck off! Rule Britannia!

  20. More on the legal implications... by Denyer · · Score: 2, Interesting
    ...no, not obvious copyright ones (the web being a publishing medium no different to any other in this respect; content is copyright but is said to have been published publically unless password-protected. I don't think robots.txt would stand up in court if other agents such as browsers have access.)

    A while back it was posited that sites should actually be reponsible for providing snapshots of sites, though. Fortunately, I believe this was shot down; the cost implications would be mind-boggling.

    I'm glad to see proactive steps being taken, however. Current guidelines for selecting content to archive have produced very usable resources in national libraries such as the one in Aberystwyth where I studied. It isn't as if they keep everything, after all...

    --
    Ph-nglui mglw'nafh Gates M'dna wgah'nagl fhtagn.
  21. National library access is straight-forward... by Denyer · · Score: 1
    ...particularly if you attend a university next to one. For example, the University of Wales, ABerystwyth even allows undergrads access; tutors are generally very willing to sign papers. I've still got a card valid into 2005 which I rarely used because the university's own facilities are more than sufficient for many disciplines.

    I understand national libraries in Scotland and elsewhere are a lot less friendly with access, but lots of people visit Wales specifically for NL access. Also, there are only, what, five national libraries? That's good enough concentration of resources for most people.

    --
    Ph-nglui mglw'nafh Gates M'dna wgah'nagl fhtagn.
  22. It wasn't offsite at the time by marnanel · · Score: 1

    It wasn't offsite at the time. The law was passed in 1911. The southern part of Ireland became an independent state in 1920.

    --
    GROGGS: alive and well and living in
  23. Access To The Books by bettiwettiwoo · · Score: 1

    I always find it a bit difficult to get worked up about British Library. Yes, it's great that it has a copy of every letter printed since God-knows-when, but who actually gets to see them?!?

    The way I understand that access to the Swedish Royal Library works -- and most other libraries in Sweden for that matter as well as every library I have been in contact with in Denmark and I don't suppose the situation is very different in Norway or Finland -- is that everyone has access to it. That is, it doesn't matter who you are or what you do, you can just walk in and ask to see whatever has taken your fancy and voila, the staff will hand it over to you. Now, you may not be able to take it home with you, but you will be able to peruse it at the library premisses.

    British Library, on the hand, seems to be the library equivalent of Fort Knox ... or at least reminiscent of the library in Eco's The Name of the Rose. No public access there.

    I understand that there is a trade-off between granting public access and preservation ... but I think British Library errs on the side of protection and caution. My husband, who's an Australian who lived in Denmark for a while, says that as a Swede I just don't understand ... it is one thing to provide public access in upright Scandinavia, it's a totally different thing altogether to provide it to Poms (that's Brits to you); in his opinon, granting Scandinavian-type access to British Library books would be tantamount to kissing them good-bye.

    --
    The liver is evil and must be punished.
  24. There is no Queen of Britain. by DrSkwid · · Score: 1

    Perhaps you mean the Queen of England.

    --
    There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
  25. I hope that's a joke by Anonymous Coward · · Score: 0

    Queen of the United Kingdom.