Slashdot Mirror


Internet Archive Opens Crawler Code Under LGPL

ramakant writes: "It looks like the Internet Archive, which hosts the infamous Wayback Machine has opened its newest in-development crawler code under the LGPL. From the announcement: 'Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix , or misspelled or missaid as heratrix / heritix / heretix / heratix) is an archaic word for inheritess. Since our crawler seeks to collect the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.'"

2 of 186 comments (clear)

  1. Re:I thought it sounded like... by Anonymous Coward · · Score: -1, Flamebait

    They could have done worse. Like calling it "IKnowKungFu" or something gay like that.

  2. freakin Nazi moderators by Anonymous Coward · · Score: -1, Flamebait

    Htf is that offtopic??