Slashdot Mirror


Washington State Archives Go Digital

prostoalex writes "USA Today and dozens of others report that Washington state archives went online. Over the past two years project participants scanned 1 million documents issued by state and country authorities. The archive is located in my alma mater Eastern Washington University (go Eagles!) The 800 terabyte storage system was developed by Microsoft and EDS."

13 of 131 comments (clear)

  1. Well, by chewy_2000 · · Score: 4, Insightful
    Personally I would find this, or something like it, very useful in research, even as just an undergrad History major. The amount of times I've wished for something like this while digging around in musty old archives...

    Although, it has to be said, I hope they make everything accessable for *everyone*, regardless of OS and browser. No doubt a lot of researchers would be using OS X/Linux/Firefox.

    1. Re:Well, by ImaLamer · · Score: 4, Informative

      I'm using Firefox (from Windows sadly) and I can access the content just fine.

      As for OSX and Linux users, there is a plug in for viewing the content needed. But they report to support OSX and "UNIX". The plug-in is called DjVu and has an open source equivalent at sourceforge (with RPMs, OS/2 and even Cygwin support).

  2. WWW address by JamesD_UK · · Score: 5, Informative

    Just in case someone actually wanted the address for the archives it's http://www.digitalarchives.wa.gov/

    1. Re:WWW address by El+Cubano · · Score: 4, Informative

      Just in case someone actually wanted the address for the archives it's http://www.digitalarchives.wa.gov/

      FYI. Turn on cookies or you receive this extremely helpful error message:

      An error occured on the site. Please try again or come back another time.

      Otherwise, it's pretty cool.

  3. Just another link (or two) by Neumsy · · Score: 5, Informative
    --
    %blow
    %blow: No such job

    ^how did the sex change go?
    Modifier failed
  4. NB Archives by X-Phile · · Score: 5, Informative

    The Province of New Brunswick Provincial Archives have been like this for quite some time now, with birth, death, marriage certs and census records. I have been able to search for information about my family history online using their handy dandy search tool, as well as visiting the Archives themselves at University of New Brunswick. It never occurred to me that others might be trying catching up, but I guess that this type of service isn't something that most governments deem necessary for the public.

    --
    "Well you're not Fiona Apple, and if you're not Fionna Apple, I don't give a rat's ass."
  5. Search capabilities by vinukr · · Score: 4, Insightful

    One thing that they have to concentrate on in the future when the number of records grow fast is a nice search strategy. Time taken for search is one thing that can make the mass use this facility.

    As far as i have tried it out in these few minutes, the search strategy is good... there are separate search that researchers can use to know historical data and the like... This is great.

  6. Privacy by chewy_2000 · · Score: 4, Insightful

    The site seems to be slowing a bit, so I can't find details, but surely there are some privacy concerns here. I know that this just replicates the publically avaliable material in the physical archives, but there is a big difference between going to the archives and digging through books, and harvesting info over the web, especially given the sheer amount of info on the site, many of them recent records.

  7. Digital twilight. by haeger · · Score: 4, Interesting
    How about the "Digital Twilight" that people have talked about? One of the big problems with these kind of archives is that they aren't permanent the way that paper is. Washington could very easily end up the way that Stasi did in East Germany. They have several hundred tapes of data with information about every spy in the west on them but the information is still "safe" since noone no longer knows how the data was saved to disk or which file format was used.

    And I'm still ignoring the fact that machines grow old and has to be replaced. It's a known fact that disks break so You'll need backup but how long could You keep an old storage solution around. Sooner or later You'll have to migrate old backup data to newer media.

    Note that I don't think that this is a bad idea, moving everything online, but there are concequences that I don't think that everyone has thought of.

    Where I live one can go into the royal library and find (and read) an official document written by someone in the 16:th century, but can we be sure that 100 or even 50 years from now someone can read a DLT300-tape?

    .haeger

    --
    You are not entitled to your opinion. You are entitled to your informed opinion. -- Harlan Ellison
    1. Re:Digital twilight. by LousyPhreak · · Score: 4, Insightful

      you still can move the data from the old system to a new one if its at the end of its lifetime.

      harddrives can easily be replaced (assuming its a sort of raid with hotswap)

      sql will also stay around really long, and if not there will be at least a gazillion tools to convert to a new format (it is quite sure that the data will be stored on a sql server)

      and as long as the data is safely stored the access mechnism shouldnt be a problem but thats just my .02

      --
      -- Karma: beyond good and evil - mostly affected by posting political
  8. no maps? by Apreche · · Score: 5, Insightful

    Dang, there are no maps in there. The best stuff in the archives at town hall have always been maps of the town and blueprints of various buildings. But nobody scanned those in the archives. Oh well.

    --
    The GeekNights podcast is going strong. Listen!
  9. System Spec by LiquidCoooled · · Score: 4, Funny

    The 800 terabyte storage system was developed by Microsoft and EDS.

    Microsoft was able to confirm the system is expandable, and contrary to previous rumours, will infact have enough disk space to install Longhorn.

    They do however state, that to do anything actually useful, more upgrades will be required.

    --
    liqbase :: faster than paper
  10. Not 800 Terabytes, & using DjVu by illtud · · Score: 4, Informative

    The system isn't 800TB, but will scale to 800TB, according to this EDS press release. In fact, given that they've spent a mere $2.5M (powerpoint!) there's not a hope in hell that they've got 800TB! The powerpoint says it's a 5TB EMC SAN & an ADIC tape library for backup.

    An interesting point is that they're delivering the documents using DjVu by Lizardtech, which is GPLd, and developed by the creators of DjVu in conjuction with LizardTech (after a period of LT not-getting-it). The DjVuLibre home page is here. LizardTech still have the best encoders for the format.