Slashdot Mirror


How the Wayback Machine Works

tregoweth writes: "O'Reilly has an interview with Brewster Kahle about how The Internet Archive's Wayback Machine works, with lots of juicy details about how the biggest database ever built works."

2 of 134 comments (clear)

  1. Not the biggest DB by costas · · Score: 5, Informative

    100 TBs do not make the biggest DB ever. I am personally working on an 60-70TB ERP system that's also writeable; I am sure there are bigger systems out there (e.g. Wal-Mart's or GM's ERP systems come to mind).

    A read-only DB containing highly-compressible text does not really make for a very challenging datamine. Just because it's on and about the Web and sexier than a stodgy ERP system should not make you overlook the real technology.

  2. Government Removed Site still Available by Tazzy531 · · Score: 4, Informative

    A number of you have asked whether the websites taken down since 9/11 are available on archive.org. The answer is yes. One example is:

    DC Air National Guard on Archive

    Same Page - 404

    One of the conspiracy websites that I have read was saying that combat airplanes, normally on 24 hour alert, at this base should have and could have prevented the plane from entering the restricted airspace in DC. They were saying that this site was removed because it provided evidence that somebody dropped the ball.

    --


    _______________________________
    "I'm not Conceited...I'm just a realist..."