Link Rot Rx: 'Amber' Add-on For WordPress and Drupal

← Back to Stories (view on slashdot.org)

Link Rot Rx: 'Amber' Add-on For WordPress and Drupal

Posted by timothy on Wednesday February 3, 2016 @11:48AM from the nobody-likes-a-rotten-link dept.

David Rothman writes: If you run a WordPress or Drupal site, you can now fight link rot with Amber, a new open source add-on from Harvard's Berkman Center. If links are dead, visitors can still summon up the pages as stored on your server or, if you prefer, outside ones such as the Internet Archive. TeleRead has the details, and the Amber site is here, with download information.

17 comments

Min score:

Reason:

Sort:

let's store the Internet by stooo · 2016-02-03 11:51 · Score: 1

let's store the Internet...

--
aaaaaaa
1. Re:let's store the Internet by Trax3001BBS · 2016-02-03 13:19 · Score: 1
  
  yeah, like why would netflix want to set up servers all over the world and redundantly duplicate all of their content over and over...
  LOL, 7 years on the UseNet with a separate handle, I have over 100K (Google) hits on it and run into myself all the time.
  Understand UseNet only, I never posted on a .COM, .NET, or .ORG site yet that's where all of my post are showing up at.
2. Re:let's store the Internet by i.r.id10t · 2016-02-03 14:00 · Score: 1
  
  But even pre-google newsgroups were available via web browser and services like Deja News ...
  
  --
  Don't blame me, I voted for Kodos
3. Re:let's store the Internet by Trax3001BBS · 2016-02-03 15:21 · Score: 2
  
  But even pre-google newsgroups were available via web browser and services like Deja News ...
  Shame about Deja News http://www.dejanews.com/
People still WordPress? by Anonymous Coward · 2016-02-03 12:14 · Score: 0

I would be interested in any back end frameworks written in Rust if anyone can suggest.
[slow clap] by pushing-robot · 2016-02-03 12:51 · Score: 1

The last link is a nice touch.

--
How can I believe you when you tell me what I don't want to hear?
The Crowd Goes Wild! by Anonymous Coward · 2016-02-03 13:31 · Score: 0

The Amber download link returns 404.
This is the kind of project I want to rely on! /s
Working link for Amber download by DavidRothman9947 · 2016-02-03 14:23 · Score: 2

Try this link: http://amberlink.org/#download and email me at davidrothman@pobox.com if it does not work. Don't blame the Amber. It's possible something happened when I was posting the item here. OK. You may now switch off your irony detectors. Thanks. DR (not associated with Amber but glad it's around!)
Archive.is - better than most options + HTTPS! by Anonymous Coward · 2016-02-03 14:56 · Score: 0

I urge others to try:
https://www.archive.is/
It's free, you have the option of downloading a nice zip file of the page(s) you archive. All pages you submit are searchable through their database and archived for all to see. It works well with sites that The Wayback Machine (TWM) (archive.org/web) just stops for a lot of sites saying robot.txt blah blah or some other stupid error. So while TWB is pretty nice, often saving a lot more content than Archive.is saves [like software for download and other content], Archive.is is quick, easy, and simple to use. The links generated by Archive.is are a lot easier to remember, write down, etc. than TWM.
1. Re:Archive.is - better than most options + HTTPS! by Anonymous Coward · 2016-02-03 14:58 · Score: 0
  
  You may also use archive.is to archive content saved from the wayback machine. that's funny shit!
The Internet Archive is a poor backup by drinkypoo · 2016-02-03 15:34 · Score: 2

On one hand, there's a lot of content in the IA that would have vanished forever if not for their help. On the other hand, I can't access my old content, if they even have it, because the current owner of the domain has configured their robots.txt such that they won't permit me to have it back. If a site goes down, it's highly likely that the new site will get a robots.txt that does not permit archiving, and you won't be able to access the links anyway... because rarely does a domain actually ever go away. Usually, it gets parked and farmed.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
1. Re:The Internet Archive is a poor backup by mujadaddy · 2016-02-04 03:18 · Score: 1
  
  Are there not 'evil' crawlers which ignore robots.txt, serving the public interest? That is a kind of disappointing.
  
  --
  Populus vult decipi, ergo decipiatur...
  "Force shits upon Reason's back." - Poor Richard's Almanac
Robustify,js by tuxzone · 2016-02-03 20:12 · Score: 2

Any website CMS that allows you to specify the JavaScripts you run could use a similar tool "Robustify.js" (https://github.com/renevoorburg/robustify.js), except for that it doesn't archive itself but relies on other web archiving services to have done that for you.
With Robustify.js, if a user hits a links that returns a 404, the user will be redirected using the Memento-protocol to a webarchive that does have a copy.
René
Open Source? by Anonymous Coward · 2016-02-04 01:25 · Score: 0

their page says open source, but there seems to be no links to the code. IsItJustMe?
Future "Mystery of the Universe" by VernonNemitz · 2016-02-04 02:42 · Score: 1

One of the Mysteries of the Universe has the very simple and generic name, "women". Others exist, too. In the future a new one will be added, something like, "If the Internet remembers everything, then why do links go bad?"
Good idea... by Lumpy · 2016-02-04 05:30 · Score: 1

Go ahead and look for Sirius tuner RS232 information online.... most of the links are dead-dead. having non dead links on your site by caching the information is a good idea because your content does not go irrelevant when someone shuts down their freebie page because they got bored.

--
Do not look at laser with remaining good eye.