Is the Web Heading Toward Redirect Hell?
Ant snips from Royal Pingdom this excerpt: "Google is doing it. Facebook is doing it. Yahoo is doing it. Microsoft is doing it. And soon Twitter will be doing it. We're talking about the apparent need of every web service out there to add intermediate steps to sample what we click on before they send us on to our real destination. This has been going on for a long time and is slowly starting to build into something of a redirect hell on the Web. And it has a price."
Funny just this morning I noticed that it took at least 5 redirects or more for Google to let me login to Analytics. It felt like my browser had a life of its own!
The real problem though are the link shorteners. I'd like to vote with my feet and never click on them, but for many, they are like drugs, because they let you track your influence (how many people clicked) in real-time. It's especially bad on slower connections such as smartphones. Not everyone has 1MB/s.
Any ideas on how to convince people to stop?
--
Don't work on your startup project without a safety net
I refuse to click on any "shortened" link, because I want to know PRECISELY where I'm going to end up. Thank you Slashdot and goatse.cx. If it's important enough to go visit, it's important enough to spell out properly. And thank you, but I don't live my life via SMS, so the few extra characters is worth my piece of mind.
For those of us who use things like NoScript, the price can be that we don't get there. Ever.
I know that when I go to a site that can't work unless I allow a half dozen or more other sites to run scripts, I sometimes decide that it's not worth my time. When I click a link that then has to contact several domains, (sometimes ones I have specifically blocked) I might stop right there and close the tab.
The web isn't just headed towards redirect hell - it's turning into a damn sketchy web of tentacles working their way into every page. When I find ones that I'm not comfortable having around, I don't go back.
I'm not sure I like what the web has become. Thanks to NoScript, I at least know what it's become.
Velociraptor = Distiraptor / Timeraptor
Folks at linuxtoday.com have been doing this for a long time. It's one reason I fled the site. Instead of taking me to where I wanna go directly, they make me click twice on the same site. This I believe, enables them to collect 'vital information' to present to their advertisers.
The bad thing is that they lost me and many others in the process.
By the way, it's intentional for me not to link to them from Slashdot directly.
... so they had to find yet another way to slow things down... so the web could live up to its reputation of "world wide wait" ;)
Never antropomorphize computers, they do not like that
[Wait 30 seconds or click here to skip to comment]
Speaking of redirects how about websites waiting to load content on Doubleclick analytics? Not only do the ads you're serving not load up but I tend to give up on visiting the page altogether. Everybody loses.
The Optimize Google add-in for Firefox gets rid of some of their hellish redirects. Sadly, it doesn't update frequently and seems prone to breaking.
I want to create a redirect loop. Just imagine, google to tinyurl to bit.ly to dilv.it back to google.
Or you could always just make a really long way to get to someone who'll never give you up, never let you down.
=================
Unix is very user friendly, it's just picky about who its friends are.
I am constantly irritated when I copy a link, only to paste a wall of text in my little IM window.
Fnord.
If someone is paying me for the clicks I send to their site, I need to count it so that I know how much I should charge, and they need to count it as well to know I'm not lying. They could make the count on the destination page, but usually it's far more easy to make a special service for it.
A redirect page is usually just a couple of hundred bytes large. Cookies might add some clutter, but probably still less than 1k in each direction, still fits in a single packet. I don't see the problem here.
Google and Facebook both use these "intermediate steps" to weed out malware infested sites and warn the user. Sampling can also be useful in judging if something is NSFW, or more importantly, rickrolling prevention.
Jeff Atwood hit on this issue in a blog post last year: http://www.codinghorror.com/blog/2009/06/url-shorteners-destroying-the-web-since-2002.html
Insert Sig Here
This used to be considered something that was potentially a Good Thing. To help prevent link rot and redesigns from breaking links, people thinking a lot about Hypertext came up with initiatives like PURL's: http://purl.oclc.org/docs/
Now that the primary usage of these redirects are simply to shorten links to something more convenient, we're using the same tech (a 301) and using it in different ways. One question is, how many people use the "custom link name" feature of tinyurl.com vs, simply let a random string of text be used? And, will a service start letting us update link destinations after the fact (like the original purl site did)? If so, how do you prevent nefarious uses of this (like moving it to goatse after it's memeing about)?
In terms of the filtration-for-tracking-purposes? That horse has left the barn already; I'm more concerned with final destinations not being recorded over time for posterity. These redirect services are totally interchangable anyway... as soon as one starts using interstitals, people will move to another one.
Hire a Linux system administrator, systems engineer,
Less fragile and less of an unnecessary intermediary on this Web 2.0 (or whatever) age would be to catch the click of a link with onclick, set a cookie, and open the original, intended link. When user would again come back to the site, this cookie would be dumped to the site that so much wanted the information it was clicked. Even if the user would have some sort of embedded resource from this site open somewhere else, it could harvest the information and send it back.
Instead, we seem to be ending up with endless chains of redirectors and opaque identifiers that are bound to organizations that don't necessarily exist in a year. What a joy to use technology which is driven by needs of utter morons and greed of those interested to press most information out of the morons...
and there is no useful (i.e. non-light-entertainment) content created primarily through advertising revenue. Slashdot developers who have made their money over the last decade producing tat by not overestimating the intelligence of the general public cannot bear to admit this, but you simply cannot produce high-calibre content when your primary aim is to suck in as many as possible of the kind of people who take notice of adverts.
Murdoch, often maligned for his lack of business sense but mysteriously still richer than all of us, seems to have tried and failed at pushing the subscription model. Obviously there are other viable models for producing information on the web such as government sponsorship (BBC, academia) and well organised groups of hobbyists (e.g. ham radio), but how will the sites who do not already have a dedicated subscription base through off-line heritage sustain themselves? Or maybe the answer is that they will not, the moment they take their eye off the advertiser as customer and start worrying directly about satisfying the desire for the reader to intellectually advance himself.
Not only do you not know where you're going to end up, but also the service can track your behaviour. Obviously this latter reason is why all the companies want to do it.
So, how do you get around it? I don't even think we can. I think we're screwed, to be honest. It's just going to be like that, perhaps until the day an exploit comes out and re-targets all of a services re-directs (i.e. tinyurl) to some hostile domain. Then, and perhaps only then, would it get enough attention to bring it to the "mainstream" users that it might be a bad idea.
Not.. that I'm suggesting anything... 4chan.
I didn't even know this was occurring. Guess it hasn't bothered me so far.
A work that expires before its copyright never enters the public domain and thus enjoys eternal copyright protection.
To play the devil's advocate - facebook's redirects started as a way to filter out all the spam links.
(Redirects to this page.)
As I commented on TFA:
So we have jQuery, and we have AJAX. Why don’t they just attach an onClick to their links that sends a quick POST to Google before sending the user on their way, directly to the site in question? It won’t work for people without Javascript on, but that’s such a small percentage that I doubt it matters to them much. The important thing is that they could get their statistics, while still avoiding a redirect. The service providers could argue that they need the tracking even if Javascript is off, but that just seems extreme.
Can someone tell me if there are any issues I'm not seeing with this approach?
It's better to vote for what you want and not get it than to vote for what you don't want and get it.
- E. Debs
That sound you hear is the point, zooming at mach 5 over your head.
(ok, RDR is not that good, but it helps, and I'm sure as this becomes even more prevalent, people will work around it)
On the other side of this are the search engines. They may not follow the chain of links, especially if it involves "cookies". So a reference that uses a redirection service may not be credited as an inbound link for ranking purposes.
Then there's the firewall/proxy issue. Firewalls need to see where you're really going, so they have to run down the link chain. This may result in bogus hits on the end site, if both the firewall and the browser separately do this.
That's not the kind of URL hiding the OP is talking about. If you did this to URLs pointing to external sites, ones not controlled by you or your company, *then* you'd be contributing to redirect hell.
DRM: Terminator crops for your mind!
Oh dear God yes! We can abuse JavaScript more to get more clicks!
Redirects are a minor inconvenience on the net. Much more insidious is the enormous number of sites that have to be accessed to get all the content on many webpages. Add to that the layers and layers of CSS needed to render them. And the massive, often buggy stack of scripts they bring. Not to mention the server-side scripting that slows down fetching the pages and embedded content and CSS pages and scripts before you get them.
It'd be interesting to see the average number of bytes transacted to render one byte of information to the screen. And the average loading time of a page.
Where are you seeing this? I just looked the the html source for the results of a google search and the target url is the target hfre with no google middleman. Am I missing something?
Need Mercedes parts ?
when it was considered a security hole if you DIDN'T use a redirect on your page? IIRC there used to be an attack vector where malicous sites used links from freemail pages to steel session IDs from the referer-headers.
bickerdyke
That you even have to highlight the url shows a lack of imagination on the part of my fellow programmers.
It should be auto-parsed so you just hover and right-click, go to url.
(-1: Post disagrees with my already-settled worldview) is not a valid mod option.
I think we could just write a Java program that will loop through a range of your top 10 urls every minute. Then you would never have to click. Call it autoBrowser. Because no one should actually click on things.
those tinyurl makes archives mailing lists hell also, have anyone seen or code a tool automatic lookup the real url and replace the tinyurl for archival purposes ?
Accused by a site that is dependent on scripts coming from other domains.
Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
That defeats the purpose of search engines. If you don't click on the links, how does google know it is finding the right pages for you?
links you to Jap Porn.
New Economic Perspectives
Moderators, thought this redirection is written to go to goatse.cx
TinyURL does a preview of it and experience surfers will avoid it.
I was thinking of trying this myself but looks like TinyURL is already on the ball to prevent goatse.cx abuse.
Obama's legacy: (N)othing (S)ecure (A)nywhere and (T)error (S)imulation (A)dministration
Exactly. I don't even think advertising should exist in its current form. If capitalism is supposed to be about delivering the best goods and services to people at the best price, then what's needed to facilitate that is an independent directory of goods and services, that doesn't allow any bullshit about what the goods and services really offer for the money.
I'm going to assume that most of these use HTTP redirects. This is where the server returns a 3xx result that tells the browser where to go, as opposed to rendering a full page and using Javascript to redirect.
The nice thing about HTTP redirects is that a service like Twitter can just follow the HTTP redirects for you and cut all of the middlemen out of the chain. Even forthcoming server-side Javascript interpreters could parse out Javascript-based redirects.
No, I will not work for your startup
Anytime I'm on a website, and I go to my iGoogle homepage, and then try and click back, it re-redirects me back to iGoogle. I have to hammer the back button about five times to get it to go back to the site I was at before.
This is the price we pay for using all of these "services" at no direct cost to ourselves. Something has to pay for all of the infrastructure, developers, support, etc from Twiter / Facebook / Google / etc.
Google inserts redirects on a low percentage of its search results. Most of the time you don't get them and they usually slip by unnoticed.
I am becoming gerund, destroyer of verbs.
It is definitely a possibility, although it is tempting to avoid consuming individual nuggets of information on a micro-payment basis unless you already "know" it's going to be useful, i.e. it tells you what you want to hear. Faced with a decent quality publication you have paid for in full, you can enjoy various viewpoints rather than being tempted just to fund/read the ones you like. Hm.
Personally, I find the trend of redirecting to innocent sites via shady URL's much more alarming: http://5z8.info/foodporn_e0g0l_taliban-meetup
(I promise I'll get modded "troll" by someone who glanced at the link and assumed the worst. Hard to blame them, but I do love using those links whenever possible...)
so... product reviews?
Spoon not. Fork, or fork not. There is no spoon.
...is paved with redirections.
Dark Reflection
HTML5 supports a ping attribute on the anchor ('a') tag to notify a second url when the user clicks on the link if the user has enabled this feature (as browsers implementing this should have a configuration option to enable/disable it). This reduces the issues with url redirection on sites like google.com that use the information for analytical/advertising/tracking purposes provided that (a) all or most browsers adopt it, (b) users browse the web using one of these browsers and (c) these websites use the ping attribute in preference to other means when browsers expose it.
This does not solve the other main use of redirects: URL shortening. These sites provide short URLs that redirect you to the full URL when you navigate to them and are used by twitter to keep you within the 140 character limit. These are worse in many ways because:
a/ you don't know where you are navigating to (without using an addon or other utility);
b/ can be exploited in some cases (like tinyurl);
c/ break a lot of pages if the shortening URL site goes down (e.g. most of twitter's information would be useless if bit.ly went down);
d/ don't work well with archiving.
That's stupid. My browser does malware protection, I don't need Google or Facebook getting involved. And as @spazdor says, do that before presenting the link to me.
You're crazy if you think Google and Facebook intercept links for your benefit. They're doing it to track you, pure and simple.
At least I can infer their outbound links. Link shorteners are Russian roulette. If services like Twitter wanted a better user experience, they would unshorten links when they present a feed to browsers.
=S
http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html
Yes you are really missing something! Just by viewing source you should notice on the a tag
onmousedown="return rwt(this,'','','','3','AFQjCNElSuk8pqYMVk5pKG9sycYfDSh7zg','UsteGasJKDRPW0uis7I9Ig','0CCsQFjAC')"
class="l"
href="http://example.com/the/original/URL"
So on mouseover you see the original URL, but on click, function rwt ("rewrite"?) sends you to Google first with all that tracking crap, which then redirects you on your way.
If I right-click and Copy Link Location, I get a Google URL in Firefox with this tracking crap. If I feed that to curl, I don't get a status 301 redirect, I get a small piece of HTML back containing both a script that changes the window.location and a meta http-equiv refresh tag.
Disable JavaScript to disable all this.
=S
Go indirectly to Hell. Do pass Go, and seven other affiliates. Do collect for us $200 in click-through money.
The enemies of Democracy are
I'm pretty sure archive.org said at some point they'd take over URL shorteners which would go dead, so those billions of pests wouldn't all go 404 on our asses
They dont know either way, so this argument makes no sense.
=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Friends don't let friends enable ecmascript.
No, more like technical specs, lab results, and much less image, peer pressure, and other audience manipulation.
use loads of redirects - I have just spent several weeks testing on a big site 5M+ pages removing redirects with the real urls. Having to follow to many redirects is a good way of killing your site for search engines.
We even had to get our devs to go the extra mile and sort the underlying problems as they wanted to fix problems with redirects rather than fix the underlying problems.
That's the nature of things once a system matures. --The ______ Agenda [cafepress.com]
Is it ironic that I clicked on your tee shirt store link and with my NoScript running, could not view, shop for, nor purchase anything? In fact, it is impossible to leave the barren first page. Keep up the good work.
> If capitalism is supposed to be about delivering the best goods and services to people at the best price...
Microsoft.
Your supposition is invalid.
---- MISSING MISCELLANEOUS DATA SEGMENT --- [sigdash] trolololol
It would be trivial to do something with javascript - put an onclick handler that does an xmlHttpRequest to save the "needed" information without even needing to worry about header redirects and the like. The link can be something like
<a href="http://www.thesite.com/path/to/page.html" onclick="return notifyBigBrother(this);">
where notifyBigBrother() is a function that sends the click info to the search engine site. Why isn't this done?
I have no problem with your religion until you decide it's reason to deprive others of the truth.
They get most of the major offenders in the list. Sure it breaks some links, but it's worth it.
...are those that come in perfectly legitimate email, stuff that I actively subscribed to. They already know where I came from, their own damned email. Why does it need to go through a redirecting clicktracker?
Furthermore, it lets even legit emails send me somewhere not only unanticipated but also a pain in the ass, like links that unexpectedly open a whopping great PDF.
Many thanks to folks who posted links to two URL de-obfuscator services, which are now permanently on my toolbar.
http://unshorten.com/index.php
http://www.longurlplease.com/
~REZ~ #43301. Who'd fake being me anyway?
http://userscripts.org/scripts/show/40582
Greasemonkey script with analogous function.
Despite the name, it works for things in addition to TinyURL.
I listen to both RIAA and non-RIAA stuff if I like the music, tangential business/politics nonwithstanding.
Firefox was an early adopter of the <a ping> HTML 5 feature to solve exactly this redirect-for-tracking issue, added in early 2006: https://bugzilla.mozilla.org/show_bug.cgi?id=319368 There was huge controversy that the feature helped sites track users (never mind that you're being tracked as it is, and that the feature let you turn it off) and it was disabled before it ever shipped. We thus continue trudging through redirect hell when the browser could have been doing that for us in parallel while giving us the content we wanted.
The feature would have sold better if it was framed as <a shortcut> or <a dest>. That is, keep the historical href behavior jumping through redirects in old browsers, while new browsers could just load the final content directly from the shortcut (or dest) attribute and treat href as the ping. I'm sure that suggestion gives HTML purist fits on semantic grounds. At least it's backward compatible unlike ping which requires a site to choose between serving different content to old and new browsers, forgoing link tracking on old browsers (the majority? fat chance), or not supporting the feature at all (we have a winner!).
URL-shorteners are a different use-case altogether and not served by <a ping>
I've already posted or I'd mod this up.
This is a very good example of why such addresses may be necessary at times.
The real "overhead" for many, many Web sites now is the linking to fifty ad servers on every page - and THOSE servers are either down or slow, so they don't finish responding to the browser request in less than ten minutes.
Which is why your browser "busy" indicator stays that way even though the page appears to have been fully loaded - or worse, the page never loads.
This makes a difference when you try to save a page on your hard drive - that last little bit won't save and the browser will tell you the save "failed" - in reality you got most of it except for one lousy little ad.
All of this is just the effect of the Internet industry running on too little server horsepower and too little bandwidth - and WAY too little brains.
And yet people think they can run a business "in the cloud" - not with these morons running the cloud.
The reality is that the Internet is now as fast as an old monochrome green screen dumb terminal hooked up to a mainframe circa 1975 - except it's in color. You still spend a minute waiting for a Web page to load, no different than waiting for an overloaded mainframe to respond to a dumb terminal. And this despite the fact that the servers running a Web site are a thousand times more powerful than that 1970's mainframe.
And there may be an entire server FARM running that Web site - it's STILL slow. Because somebody else's server ISN'T.
As Woody Allen summed up the human situation, "Nothing works and nobody cares."
Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
No, that's my point. Capitalism in itself isn't such a bad idea. How it's been implemented by selfish companies using advertising and other tactics to manipulate consumers (microsoft are one of them) is bad though. It wasn't supposed to be that way, and reigning in advertising to a more centralised directory structure would largely solve it. Assuming you could verify the directory system enough to prevent corruption, that is. But it shouldn't be too hard, with something as simple as a directory of goods and services.
What I would like is a dynamic URL rewriter. When my browser notices a redirect, it submits the URL to a central database and asks for the rewrite rule for the rewriter behind that URL. (i.e. find the url=... in the google redirect).
So my browser would build up a locally cached redirect database, of redirecting rules of sites that I use.
The problem is that this will cause an arms race, that is easy to win for the redirectors. If they have "link=1928347234" instead of "url=http://www.slashdot.org/story/10/09/23/1851220/" it becomes difficult to find the destination URL without building a huge database, as big as the search-engine itself.
For some of us less-literate folks, the problem seems even worse. When I try to find something on the web, the first thing I do is enter my quest in the search bar. Soon, a list of 28,132,667,534 hits comes up (in only .00036 microseconds, yet). When I try any of those links, thinking ya, I'm getting somewhere! instead of getting to the place I want, a zillion other websites are there listing all the responses to my original quest. It's like, if I want to buy a pound of apples, I google apples, and then click on one of the resultant links, thinking I will be taken to an apple-provider. No. It's just another site telling me they have a 'better' list of apple-providers! and i never quite get to the store to buy them apples.... (maybe i shudda used as example 'oranges'???) Want another dumb comparison? go to the grocery store, look at the signs hanging from the ceiling for 'soups and veggies'. You get to that isle, and no soups and veggies... instead, shelves lined with ads and instructions on how to get to the 'soups and veggies' isle.
No, he means like this (click link:)
http://3.ly/2halj3u7
Alexander Peter Kristopeit bought his basement from his mommy for one dollar.