Slashdot Mirror


Broken Links No More?

johndoejersey writes "Students in England have developed a tool which could bring the end to broken links. Peridot, developed by UK intern students at IBM scans company weblinks and replaces outdated information with other relevant documents and links. IBM have already filed 2 patents for the project. The students said Peridot could protect companies by spotting links to sites that have been removed, or which point to wholly unsuitable content. 'Peridot could lead to a world where there are no more broken links,' James Bell, computer science student at the University of Warwick, told BBC News Online. Here is another story on it." See also the BBC story.

3 of 212 comments (clear)

  1. German readers... by dukoids · · Score: 5, Informative
    may want to take a look at the master's thesis of Nils Malzahn (from 2003, in German) to see (in detail) how this actually can work:

    http://www-ai.cs.uni-dortmund.de/DOKUMENTE/malzahn _2003a.pdf

    Basically, the thesis evaluates different methods to build a kind of "finger-print" of a page. The finger print is used to find the page with google if it is gone, or has changed significantly.

    The internet wayback machine was used to learn distinguishing disappeared pages from pages changing slightly over the time.

  2. Re:What if the page is deleted, not changed by troon · · Score: 4, Informative

    That's what the 410 Gone HTTP response header is for. If only admins would use it more...

    --
    Ydco co ,df C erb-y go. a Ekrpat t.fxrapev
  3. been there, done that. by Quickening · · Score: 4, Informative

    For those running a real browser, just make this a link, preferably in your personal tool bar.

    javascript:Qr=document.URL;if(Qr=='about:blank') {v oid(Qr=prompt('Url...',''))};if(Qr)location.href=' http://web.archive.org/web/*/'+escape(Qr)

    Now when I click on a link that isn't there, I select my Archive search button and it shows me the Wayback Machine's history of that link. Of course it works only if the url hasn't been modified by the server. If it has it's another couple steps (copy link, ^T, archive search, paste url in pop-up dialog)

    --
    tcboo