Slashdot Mirror


Firefox Will Try To Show You Saved Archive Of a Page Instead Of 404 Error (ndtv.com)

Firefox has announced a new add-on dubbed No More 404s in its Test Pilot platform which aims to change the way we see 404 links on the web. The add-on, Firefox says, replaces the Error 404 from missing webpages, and replaces them with saved archives from the Wayback Machine. From a report on Gadgets 360: Normally, when presented with a missing link, the browser shows the 404 error. However, Mozilla's No More 404s add-on will give Firefox users the choice to see old Internet snapshots saved in the Internet Archive's Wayback Machine. This is especially handy for users trying to do research or just digging up some old graves out of curiosity. For now, this add-on is only available in Firefox's experimental Test Pilot platform, with no details on availability for regular Firefox users. Interested users can install the test version here. Apart from this, the Test Pilot platform also introduced improved search results through the Awesome Bar, redesigned the Tabs bar to the side, and even tweaked the history feed.

70 of 119 comments (clear)

  1. Addons don't exist I guess. by xession · · Score: 5, Insightful

    This has already been a fixed issue for probably over a decade with addons for Firefox. The addons are why Firefox became so popular in the first place. While its great that they want to include this functionality in the browser, why not just have a list of suggested addons when you install or update? Waste of time on their part...

    1. Re:Addons don't exist I guess. by Anonymous Coward · · Score: 1

      It sounds kinda neat, if they'd keep it as an addon.

      They could have an entire list of mozilla-developed-and-supported addons! They could get the spotlight/be highlighted/get preference when searching for new addons. They could even be default-installed if they really wanted to.

      But no. They'll end up adding it as an integral part, because they want to feel like their hard work is being used. It would crush their sould if they made an addon and it's only installed by 3 people.

    2. Re:Addons don't exist I guess. by LichtSpektren · · Score: 4, Informative

      TFA in fact says it is an add-on.

    3. Re:Addons don't exist I guess. by wonkey_monkey · · Score: 1

      This has already been a fixed issue for probably over a decade with addons for Firefox.

      Fixed issue? What issue? If a page isn't there, it isn't there.

      There's no issue. If a page can't be found, the browser should tell the user. If the user wants to install an add-on to avoid this for some reason, that's their choice, but it shouldn't be default behaviour.

      --
      systemd is Roko's Basilisk.
    4. Re:Addons don't exist I guess. by xession · · Score: 1

      Doesn't really matter much how they implement it if its included already with the browser. Its just like throwing sand in the face of FF addon devs, when they've already had a beach worth thrown in their faces. My point was, why not better support the community that already exists by including or suggesting addon installs when you install or update FF instead of taking it upon themselves to do it themselves. There are so many bigger issues with FF that should be addressed, especially when numerous other addons have already provided this functionality if people were interested in having it.

    5. Re:Addons don't exist I guess. by xession · · Score: 1

      Depends on your use of the internet. If you are doing research, it actually can be a pretty major issue and time sink to continually go between the broken links and Wayback. I'll give you that it shouldn't be default behavior though. That's why I was suggesting promoting addons at install or during updates if they want to include features like this more universally.

    6. Re:Addons don't exist I guess. by LichtSpektren · · Score: 1

      Who says it'll be included with the browser by default? Mozilla has made a bunch of extensions that aren't pre-packaged.

    7. Re:Addons don't exist I guess. by phishybongwaters · · Score: 1

      let me be the first to throw sand in the face of the FF devs who spend their time making fucking crap useless addons instead of fixing their code base, or you know, implementing any for of modern browser security. And this doesn't even touch on the fact that FF is the slowest browser on the face of the earth. 37 seconds to start streaming netflix? Are you fucking with me? 15 seconds to open, another 5 seconds before it's usable. And no, this isn't on some sinky dink POS. FF is dead, let it die, it stopped being relevant 5 years ago.

    8. Re:Addons don't exist I guess. by LichtSpektren · · Score: 1

      So what features of "modern browser security" does Firefox lack? It can't be anything that the Tor Project or Bruce Schneier considers to be a non-starter.

      I fail to see how a browser with hundreds of millions of users could not be relevant, but ok.

    9. Re:Addons don't exist I guess. by InvisiBill · · Score: 1

      Fixed issue? What issue? If a page isn't there, it isn't there.

      There's no issue. If a page can't be found, the browser should tell the user. If the user wants to install an add-on to avoid this for some reason, that's their choice, but it shouldn't be default behaviour.

      Except sometimes, a page isn't there, but it is, because someone has made a copy of the page while it was still available. Having one-click access to this backup copy can be very handy. In case you didn't RTFA, Mozilla's No More 404s add-on will give Firefox users the choice to see old Internet snapshots saved in the Internet Archive's Wayback Machine. No, just showing an old copy of a page instead of the current one is not a good idea. Giving the user a choice of pulling up an archived copy instead of just an unhelpful error page is a nice thing to do.

    10. Re:Addons don't exist I guess. by Dagger2 · · Score: 1

      Sandboxing.

      Firefox doesn't support sandboxing yet, therefore sandboxing is the security feature that every browser needs to have (for anybody who wants to complain about Firefox).

  2. Wayback machine's banner by Anonymous Coward · · Score: 2, Informative

    This is the one time - exception to the rule - where I would like a banner to be obnoxious.

    I see myself being blurry eyed and getting an archived page with outdated information and thinking it's up to date - and being fatigued enough where I just ignore the Wayback Machine's banner as just another advertisement. Actually, even when I'm alert, I ignore all banners as just being some dipshit advertisement - even with adblock on, some make it through.

  3. Include google cache as well by Anonymous Coward · · Score: 4, Informative

    If you use the search box it's quite easy to get a cached page view via Google. cache: + url in the search box will redirect you to the cached version of the url, if it exists.

    Example:
    cache:https://news.slashdot.org/story/16/08/05/1439229/firefox-will-try-to-show-you-saved-archive-of-a-page-instead-of-404-error

  4. Horrible Idea by Anonymous Coward · · Score: 5, Insightful

    I think this is a horrible idea from the website perspective. What if we had to remove several pages/images from the website due to copyright issues? Or maybe the company is listing clients it doesn't represent anymore? Or other information that is no longer valid? They are assuming is a 404 is a mistake but what if it is intentional.

    1. Re:Horrible Idea by HumanWiki · · Score: 1

      If that's the case, then you should be using something like a 301/302 to redirect to / or another sub folder. If you 404 pages, some searching and indexers will down rate you because a 404 is usually a dead end path. If you 301/302 it, then you're handling the issue and sending the client to something that's actually valid instead.

    2. Re:Horrible Idea by mrbester · · Score: 4, Informative

      If it has truly been deleted then a 410 Gone should be issued. However, that means you have to have a record of what was deleted that is checked to issue that instead of a 404 Not Found. That's more effort with no real gain.

      Users should only get 301 Permanent Redirect or 302 Found if the resource exists elsewhere. If it doesn't then redirectng them to something other than what they were expecting, *especially* the index page of the site, is incorrect.

      --
      "Wait. Something's happening. It's opening up! My God, it's full of apricots!"
    3. Re:Horrible Idea by Anonymous Coward · · Score: 1

      Except the content has not been moved, it's been removed. There's nothing to redirect to.

      404s are frequently very intentional.

    4. Re:Horrible Idea by HumanWiki · · Score: 1

      There's absolutely something you can redirect to. You intercept the request pattern for whatever was there and rewrite it to / or /something/ or /index.whatever.

      and you do this on purpose because of other places that have kept old links or were statically linked to your content and you're not able to get them to change or remove the links (even if it's a search crawler) so this way it will snag the http://www.blahblah.com/someth... that's being called and you back it up a level and drop it to that folder's index.

    5. Re:Horrible Idea by caitriona81 · · Score: 2

      Because of the nature of 404, the only thing that can be inferred from it is that the file is no longer there - which 99% of the time is a result of reorganization rather than deliberate removal. 410 (Gone) is the right code for a deliberate removal. You can throw a 410 via your webserver config (ie, via .htaccess or equivalent mechanisms). This response also serves as a signal to other services such as Google that they should remove access to cached versions.

      Internet Archive has procedures in place for removing items from their archive as well - they will automatically detect certain robots.txt entries, and they
      will also accept manual removal requests.

      However, think long and hard about whether something should be remove from archives - unless there is a compelling legal reason to do so, it's bad form, and if you are high profile, enough to have things to hide, someone will likely have mirrored your content, and they may respond to you taking down content by distributing it as widely as possible.

    6. Re:Horrible Idea by HumanWiki · · Score: 1

      It's tough being 44.

    7. Re:Horrible Idea by mrbester · · Score: 1

      What's worse is these custom 404 pages (not that there's anything that wrong with having one instead of a YSOD) usually don't themselves issue a 404 back to the client, but a 200 OK. This doesn't help the user, useragent, REST services, search engines, etc. as they never get the correct code.

      If the URL gets hosed then they are compounding the stupid by sending a 302 ( even though a 303 was meant, thanks Microsoft) back to the client to start the GET all over again when Server.Transfer should have been used.

      But then Microsoft have never "got" the Web (Bill Gates famously said he didn't see the point, but everybody thinks that is apocryphal like the 640K quote) so they implemented the usual assume infinite resources half-assed measure and never fixed it. This means it's only those who are clued up and want to use codes like they were supposed to be that jump through the hoops to do what should have been baked in from the start.

      --
      "Wait. Something's happening. It's opening up! My God, it's full of apricots!"
    8. Re:Horrible Idea by InvisiBill · · Score: 1

      If you don't want services automatically crawling your pages, configure your robots.txt appropriately. http://archive.org/about/faqs.php#2, http://lmgtfy.com/?q=prevent%20google%20from%20caching%20my%20site, etc. This feature (which has been around in the form of the Resurrect Pages addon for at least 10 years) simply gives users an easy way to access a backup copy of a page when it isn't found live.

  5. In other news... by whoever57 · · Score: 4, Insightful

    In other news, archive.org becomes inaccessible due to excessive load.

    Seriously, it's slow already. Adding millions of hits on the site isn't going to help.

    --
    The real "Libtards" are the Libertarians!
    1. Re:In other news... by NotInHere · · Score: 4, Insightful

      Firefox is number 2 on the desktop. http://gs.statcounter.com/

      It never really was in a minority there, but its almost nonexistent in the mobile world, so its combined stats has gotten much lower due to the rise of the mobile platform.

    2. Re:In other news... by Shompol · · Score: 1, Troll

      Firefox turned against Firefox. It is slow borderline unusable for any pages that stream video or use scripting heavily. When Firefox's own ex-CEO decided to use Chromium for his new project it certifies that there are deep issues under the hood. I only keep it because of bookmarks, it will be purged from my new OS installs.

    3. Re:In other news... by Nunya666 · · Score: 2

      Comments like this are why people feel slashdot has turned against firefox ..

      No, Mozilla turned against Firefox by turning it into a Chrome-clone.

    4. Re:In other news... by LichtSpektren · · Score: 1

      Firefox has about 10% market share (several studies collected here), which is hundreds of millions of people.

    5. Re:In other news... by trawg · · Score: 1

      Wikipedia stats are probably more representative: https://analytics.wikimedia.or...

      Firefox is a bit further down and, looking at the graphs, still decreasing. It's not as dire as GP indicates but it looks a little grim.

    6. Re:In other news... by NotInHere · · Score: 1

      That site doesn't remove the mobile stats. If you calculate them out you get roughly the same value, and iexplore being more is probably a statistical effect. Probably it can't be said whether firefox is third or second, but it is under the top three on the desktop.

  6. At least it's and add-on by The-Ixian · · Score: 1

    and not a core feature of the browser...

    This does not sound like something I would want. Browsers do too much caching already, imo. It makes it hard to troubleshoot issues as it is.

    I am having trouble coming up with a scenario where I would want to see an old version of a page rather than the current page.

    --
    My eyes reflect the stars and a smile lights up my face.
    1. Re:At least it's and add-on by LichtSpektren · · Score: 1

      Instead of hoping, why don't you RTFA and find out?

  7. Let's hope this won't be default behaviour by Alwin+Henseler · · Score: 1

    For several reasons:

    a) Any attempt to access a non-existing page that results in showing a page anyway, is basically fooling the user. Some (ehm.. read: many) users may even think that page still exists even though the original is gone. From a UI perspective that's just wrong even if convenient in many cases.

    b) Access to old / archived versions of pages often comes in handy. And that is what the Internet Archive is for. But sometimes pages (or sites) are pulled for a reason. Sometimes good reason(s). Not all information ever placed on the internet needs to be preserved forever, imho.

    c) If every 404 leads to a request to the Internet Archive, can they handle the extra load? Even if so, would the extra bandwidth / CPU / disk IO etc be a good use of the IA's limited resources? I very much doubt that, and perhaps Firefox maintainers should answer that question first before activating such a feature by default.

    As one of many add-ons: sure why not. As a default feature: bad idea imho.

  8. As long as it's optional ... by golodh · · Score: 1
    As long as it's optional, not in the way, and impossible to activate by accident I don't have a particular problem with it.

    Personally I think it's a terrible idea. What's shown when I enter an URL should be between whoever designed the website and me. If a site is down, or a page is missing, I want to know about it.

    The last thing I need is a bunch of programmers dreaming up ways to divert me from the real website to whatever is their idea of what I should be seeing. A typical example of a group of coders not knowing what to do with their time and messing with the basic functionality of their application, if you ask me.

    So: great to see that you're having fun Mozilla programmers, but make sure to implement this an an optional feature and keep it out of my way unless I explicitly activate it. Otherwise I'll be looking for a new browser. Fair warning.

  9. Re:Melania's website by NotInHere · · Score: 4, Funny

    Yes, it'll be a copy of michelle obama's website.

  10. What happens with web services by oobayly · · Score: 1

    What happens when I use a 404 status in a web service to signal that the requested resource couldn't be found - the front end handles 404 gracefully and informs the user, updates the UI, etc. Will it still return a 404 status, but inject a whole load of unexpected content?

    1. Re:What happens with web services by JustAnotherOldGuy · · Score: 1

      What happens when I use a 404 status in a web service to signal that the requested resource couldn't be found - the front end handles 404 gracefully and informs the user, updates the UI, etc. Will it still return a 404 status, but inject a whole load of unexpected content?

      If I understand the article correctly, web services will be unaffected because this will be baked into Firefox itself.

      So, unless you're running your web service through an instance of Firefox, this won't make any difference.

      --
      Just cruising through this digital world at 33 1/3 rpm...
    2. Re:What happens with web services by oobayly · · Score: 1

      Ah, that's not what I meant. Our web services are used as a back end for our website (as well as through mobile apps & windows clients). That way the majority of the website is static (and cached) and just the dynamic data is fetched (through the abomination that is javascript!). It also means that bugs (or at least some of them) can be fixed quickly with no need to update clients.

    3. Re:What happens with web services by JustAnotherOldGuy · · Score: 1

      Ah, that's not what I meant. Our web services are used as a back end for our website (as well as through mobile apps & windows clients). That way the majority of the website is static (and cached) and just the dynamic data is fetched (through the abomination that is javascript!). It also means that bugs (or at least some of them) can be fixed quickly with no need to update clients.

      If people access your web services through Firefox and there is a 404 error, then I would think that your service will probably appear broken or will malfunction for that user. You may be able to intercept the error (catch the 404 return code) and direct the user to another page.

      Firefox should disable this 'feature' by default, but of course they won't.

      --
      Just cruising through this digital world at 33 1/3 rpm...
  11. Bad idea by paolo.redaelli · · Score: 1

    That's a BAD idea for several reasons: 1 - too much load on archive.org 2 - loss of privacy 3 - most users may want to know if an URL/page does not exist anymore

    1. Re:Bad idea by LeadSongDog · · Score: 1

      Somebody mod parent up, just a little. This will effectively DDOS archive.org, which is already in a pretty feeble condition. That said, it will be obvious to a reader if the page comes from the WABAC.

      --
      Oh, I'm sorry sir, I thought you were referring to me, Mr. Wensleydale.
  12. Firefox: Stop breaking things ! by Anonymous Coward · · Score: 1

    If a page doesn't exist I *WANT* a 404 error. Not some redirected crap. If I want to find a cached version of the page I'll do so myself. Google search results offer a cached version or there's always the wayback machine.

    I wish all these idiots would stop their inane "helping". All their ideas are just "Clippified" crap.

  13. uhh...this is the microsoft way right... by evolutionary · · Score: 1

    is there anyone out there who thinks this is a BAD idea? I mean, an error is an error, hiding it usually causes more harm than good.

    --
    "Imagination is more important than knowledge" - Einstein
    1. Re:uhh...this is the microsoft way right... by LichtSpektren · · Score: 1

      Who says it's hidden? Have you actually used the add-on yourself yet?

  14. Re:Melania's website by JustAnotherOldGuy · · Score: 1

    Yes, it'll be a copy of michelle obama's website.

    Melanoma Trump didn't copy Michelle's speech, she um, "creatively re-purposed" her work by quoting it verbatim!

    Like when I duplicate a $100 bill and spend it at the store, it's like an art project, I'm not copying it!

    --
    Just cruising through this digital world at 33 1/3 rpm...
  15. UGH by Holi · · Score: 1

    I hate when my ISP's DNS redirects 404's, I certainly don't want my browser doing it.

    --
    Sorry, teleporters just kill you and then make a copy. A perfect, soul-less copy.
  16. Re:Who will be resposible? by Anonymous Coward · · Score: 1

    That's really bad. Just imagine you check the conditions of a bank or some prices of a store and it turns out that the information was obsolete.

  17. But why? by jandersen · · Score: 1

    Maybe it's because I am a SW developer, but I want to see errors, when they occur. Otherwise, how can anybody hope to correct them?

  18. I can see it now by Chrisq · · Score: 2

    .... but I checked the Travel advice for Turkey and it said that it was fine!

  19. better idea by e432776 · · Score: 1

    It would be more transparent, and send less load to archive.org for Firefox to give the 404 error but then also include a link to the archived version, clearly marked. There is no good reason to hide the fact that the page is not available.

  20. Either/Or? by Tablizer · · Score: 2

    Typically one would want to know they are looking at a stale copy. I'd suggest a prompt such as:

    Error 404: Page Not Found

    Suggestions:

    - View archives of the page at Wayback Machine [link]
    - Tips for verifying your URL [link]
    - Find similar sites using your search engine [link]
    - Try the page again [link]

    1. Re:Either/Or? by Anonymous Coward · · Score: 1

      That's what MSIE did. Replaced the web site's 404s with its own 'friendly error pages'. The webmasters hated it, and for good reason.

  21. Why? by deadwill69 · · Score: 2

    How about just no. Some things are broken for a reason. Let's leave it that way. Just because you can doesn't always mean you should.

  22. This seems like a terrible idea... by Mysticalfruit · · Score: 1

    How is this NOT going to cause massive amounts of confusion.

    I pity the first person who buys a piece of property that had a popular establishment that's long gone except for it's archived page... who gets subjected to an endless barrage of:

    Person: "Where's wacky world?
    Owner: "Wacky world burned to the ground with no survivors in 2011... Sorry man"
    Person: "But I was just on their website!"
    Owner: "Yeah, your browser just skull fucked you."

    --
    Yes Francis, the world has gone crazy.
  23. How to kill the wayback machine by WaffleMonster · · Score: 1

    The first rule of Usenet applies to the Wayback machine in spades.

    More attention on it especially something like this doing the lookups automatically that make it seem part of browser the more site owners feel compelled to exclude their content from the machine as a matter of course or worse lawyers for sites and third parties third parties try their hand at nonsensical legal theories.

    http://www.netdisaster.com/des...

  24. Untangle your panties, Slashdot by LichtSpektren · · Score: 1

    TFA says it's an add-on. There is no indication that it will even ship with Firefox by default (Mozilla has made other extensions that aren't pre-packaged into Firefox).

    Now, it's totally fine to complain when bloatware is added directly to the browser like Hello and Pocket (I would defend Reader though). But TFA does not indicate in any way that this will be the case.

  25. Re: Exploits by Anonymous Coward · · Score: 1

    What about subtraction?

  26. Re:Time to stop using Firefox? by LichtSpektren · · Score: 2

    There's plenty of extensions in the wild right now that are filled with security and PR problems, why don't those stop you?

  27. Been using Resurrect Pages add-on for years by Solandri · · Score: 4, Informative

    https://addons.mozilla.org/en-US/firefox/addon/resurrect-pages/

    Not sure why it's never been very popular. It's one of the most useful add-ons I have. Unlike the proposed add-on in TFA which only uses the Wayback Machine, Resurrect Pages lets you pick from four possible sources (Google cache in full and text-only mode for those annoying pages which won't show the text until all the nonexistent pics finish loading first, Wayback Machine, WebCite, archive.is) for a cached version of the page. There used to be more, but I guess some of those archiving projects died.

  28. Re:*** FUCK THIS ADD-ON *** by ausekilis · · Score: 1

    Can I suggest removing the "tinfoil hat me" plugin and replacing the "They are always watching me" plugin with "Lets watch cat videos"?

    The combination of tinfoil hat and watching me plugins overadvertises the "building a doomsday bunker in your backyard".

  29. cached pages? by phishybongwaters · · Score: 3, Interesting

    Isn't this the same as google offering to show you a previously cached page, but using the way back archive and offloading the storage and traffic costs on them while providing no revenue or benefit? Yes, that's exactly what this is. Lets let the wayback machine pony up the storage and traffic costs, as well as IO and Compute. That is what some people would call, a dick move. No different than linking images from article A into your article so site A has to pay the hosting and traffic costs, cracked did a funny article on this very subject.

  30. Staying on old site by CanadianMacFan · · Score: 1

    So once you hit the archived site does the add-on change the links so that you don't stay on the archived site?

  31. Re:Firefox better? by LeadSongDog · · Score: 1

    Gordon Bell of DEC had a sign in his office, "Better is the enemy of Good".

    Sure he did. It meant that to sell a VAX architecture, the PDP-11 architecture could not be sustained. Anyhow, despite the beauty of those machines, DEC got sucked into Compaq, and thence into HP. Crap reigns supreme.

    --
    Oh, I'm sorry sir, I thought you were referring to me, Mr. Wensleydale.
  32. Re:Censorship Engine by LichtSpektren · · Score: 1

    Every browser already has this functionality (i.e. arbitrarily alter requests from the server before the user can interact with them).

  33. Re:Melania's website by q4Fry · · Score: 1

    That's not a confidence scheme, Officer, it's performance art!

  34. Re:Melania's website by JustAnotherOldGuy · · Score: 1

    That's not a confidence scheme, Officer, it's performance art!

    That's what I told the judge, and he sentenced me to conduct a 3-to-5 year "performance art" piece in prison with time off for "creative ability" or something.

    --
    Just cruising through this digital world at 33 1/3 rpm...
  35. Re:Melania's website by Darinbob · · Score: 1

    It's not a confidence scheme, Officer, it's a self esteem building scheme!

  36. 404 - 503 by CptLoRes · · Score: 1

    Great, how is 503'ing archive.org any better?

  37. Re:Melania's website by Gr8Apes · · Score: 1

    That's not a confidence scheme, Officer, it's performance art!

    That's what I told the judge, and he commissioned a 3-to-5 year "performance art" piece complete with living accommodations, meals and housekeeping

    Got to keep in character.

    --
    The cesspool just got a check and balance.
  38. Re:Who will be resposible? by InvisiBill · · Score: 1

    That's really bad. Just imagine you check the conditions of a bank or some prices of a store and it turns out that the information was obsolete.

    If you make financial decisions based on old copies of a page, knowing that the current page is unavailable, you deserve whatever you get.

  39. ~!NO!~ please do not attack Wayback, Mozilla by TheRealHocusLocus · · Score: 1

    "Hey I've got a great idea! If we do this little thing our product will be even better!"
    They do some little thing no one asked for, to half a billion installed browsers.
    That thing in no way affects their bottom line.
    But somewhere else, the bottom drops out of something else and something precious is broken.
    Please don't do this.

    So Wayback is going to be the error page for every damned malformed or mis-typed link? That is abuse and attack.

    I can see Wayback hit like a gigaton of bricks as people who have no deep interest in the content they're trying to access say "gee, what's this? All these changes! What did it look like in 2009?" And often the last crawl is NOT the one with real content. There is no substitute for knowing what you're doing and what you are looking for. There is no way to dumb down the process. And what about those [stupid] pre-load and crawler plugins? Will they hit Wayback too?

    And more site owners will drop in that magic blip into robots.txt that will shut Wayback access for good, even the pages in the database that are there ready to serve. It's bad enough that domain squatters often block Wayback.

    And even the constant stream of reflected robots.txt lookups will hit Wayback like a malevolent attack. Do you [Mozilla] even know how Wayback works?

    Add-ons like Resurrect Pages are the way to go. Only folks with a real and deep interest will go further. And THAT level of traffic is what Wayback can support.

    --
    <blink>down the rabbit hole</blink>