Can rev="canonical" Replace URL-Shortening Services?
Chris Shiflett writes "There's a new proposal ('URL shortening that doesn't hurt the Internet') floating around for using rev="canonical" to help put a stop to the URL-shortening madness. In order to avoid the great linkrot apocalypse, we can opt to specify short URLs for our own pages, so that compliant services (adoption is still low, because the idea is pretty fresh) will use our short URLs instead of TinyURL.com (or some other third-party alternative) replacements."
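For readers who have not seen the markup, here is a minimal sketch of what a "compliant service" might do: fetch the page, look for a link element whose rev attribute contains "canonical", and use its href as the short URL. The sample page, the short URL http://example.org/s/526, and the names RevCanonicalFinder and find_short_url are illustrative assumptions, not anything the proposal itself specifies.

```python
from html.parser import HTMLParser


class RevCanonicalFinder(HTMLParser):
    """Collects href values of <link> elements whose rev attribute lists "canonical"."""

    def __init__(self):
        super().__init__()
        self.short_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        # rev may hold several space-separated link types
        rev_tokens = (attr.get("rev") or "").lower().split()
        if "canonical" in rev_tokens and attr.get("href"):
            self.short_urls.append(attr["href"])


def find_short_url(html_text):
    """Return the first rev="canonical" href in the page, or None if there is none."""
    finder = RevCanonicalFinder()
    finder.feed(html_text)
    return finder.short_urls[0] if finder.short_urls else None


# Hypothetical page that advertises its own short URL.
PAGE = """<html><head>
<title>Example post</title>
<link rev="canonical" href="http://example.org/s/526">
</head><body>...</body></html>"""

print(find_short_url(PAGE))
```

A service that finds nothing would presumably fall back to its usual third-party shortener.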
Please, more comments, or I'll be forced to read the actual article. I don't want to be kicked off slashdot for RtFA...
What value are these new URLs if they aren't cute?!?
Seth
I didn't understand a single word of the submission, and I used to teach Web design. Is it too much to ask submitters to define terms they use?
how about we just kill all twitter users instead?
Yes, TinyURL hasn't killed anyone. BUT... any attempt to fix this is entirely missing the point anyway. From the article:
If they fix twitter to support links with proper labels or tag contents --- Oh, I don't know, like HTML has supported from the very beginning --- then there wouldn't be a problem.
Don't work around the bugs, fix the bugs. Links are designed for machines, the higher-level marked up text is for people.
Twitter is essentially an SMS aggregation and redistribution tool. An SMS message is limited to 140 bytes (160 characters in the default 7-bit encoding), and Twitter's 140-character limit follows from that. I do not think you understand the meaning of the word "arbitrary".
On the Twitter /. feed, this of course shows as:
slashdot Can rev="canonical" Replace URL-Shortening Services? http://tinyurl.com/c3j4n8
P.S. Now if you want a really short URL, try http://tinyarro.ws/ (no affiliation; just impressed by the idea)
somesite.org/wiki/index/cool_tips/code/perl/hello_world.php
That's just wrong.
short summary: everyone should adopt this NewTechnology(tm) because it will make twitter work better
1. If everyone uses it
2. if twitter implements support for it
of course it's pretty much useless for everyone else
From tinyurl:
Copyright © 2002-2009 Gilby Productions. All rights reserved.
(2009 - 2002) < 11+
"There are no facts, only interpretations." --Friedrich Nietzsche.
How about Twitter just stops arbitrarily limiting characters? Go by word count, perhaps?
I know some avid twitter users, and the majority of them apparently use the idiotic SMS message system to 'tweet' each other all throughout the day on their phones. Twitter can't abandon the 140-character limit for this reason.
For the record, I am against anything that keeps the SMS system relevant in this day and age. It should have been abandoned long ago in favor of standard data packets on the internet, rather than control packets on a proprietary wireless system. There's no good reason to keep this system alive when it either forces you to pay $X per month for it, or pay $.15 per 140 characters when one of your idiot friends 'texts' you. There's no way (that I know of) to force incoming SMS to route through GPRS, so you are hit with SMS fees even when you already pay for unlimited data. It also invites spam that you actually DO pay for, quite literally, and from which the wireless carrier profits as well. It should be illegal for the carrier to charge you for incoming SMS messages. Anyone who agrees with me should call their congressperson to protest this policy and call their wireless carrier to block all SMS messages.
This story should be tagged Twitter.
This guy seems to be focusing on the meaningful-identifier aspect of URL shortening for use in a space-limited context, without actually confining his suggestion to that sort of environment.
He puts forth other reasons for using this method such as control over the persistence of the shortened URL, but that doesn't make a whole lot of sense to me... and then he goes back to mentioning Twitter.
There's all this talk of URL shortening services - whether third-party, or in-house implementation.
The question here is this: Why are the URLs so long to begin with?
Why does it have to be:
http://shiflett.org/blog/2009/apr/save-the-internet-with-rev-canonical
A full title in the URL is, IMHO, a very inefficient idea. The excuses I've heard are:
Search Engine Optimizations (better performance when keywords are in the URL)
Okay, I can't argue that some search engines do stuff like that. But shouldn't the TITLE or META tags have more bearing on this than how ridiculously long the URL is?
"The URL has meaning, so you know what you're clicking", Context, etc.
I suppose that when I see a URL like
http://shiflett.org/blog/2009/apr/save-the-internet-with-rev-canonical
as opposed to something like
http://example.org/blog/526
I would have a slightly better idea of the article's content before clicking on it. But then again, I can't really say that I've decided against clicking on a link just because of the link URL. I would, instead, decide whether I'd want to visit the link by its link text/description.
So <a href="http://example.org/blog/526">blog on link shortening</a> would still have the same effect on me as a long URL IMO. If it were bookmarked, the same rules would apply.
Hell, if I were handed an obfuscated shortened URL without context, I'd know even less of what I was getting myself into.
I think the proper solution is to just stop making ridiculously long URLs to begin with, so we don't have to rely on obfuscation/hashing/shortening to accommodate services that have character limit restrictions. And we'd save bandwidth too, apparently. Win-win?
LOL! Only in America, the free market bastion of the world, do you have to pay for incoming texts.
Free Manning, jail Obama.
"Because bigger is better, right?" http://www.hugeurl.com/
1999 called, it wants its charges back.
People pay for SMS in your country? Here even pay and go plans have unlimited SMS bundles.
And I can't even parse this statement.. "or pay $.15 per 140 characters when one of your idiot friends 'texts' you"
How can your friends make you pay for SMS? Do you have some way of sending bills over it or something?
All this short URL stuff sounds like some phishing scam if you ask me. Short cryptic URLs obviously exist to make me transpose a couple of letters or numbers and end up at some fake bank site. No, give me large detailed URLs so I can see those dead giveaways like pid=poor_sucker&sid=steal_credit_card_info !
Short URLs indeed... no thank you Nigerian scammers... I won't be transferring any large sums today!
On a serious note, why is this news exactly?
Unfortunately, it's not yet an integral part of web frameworks that I have seen. So I am adding it in a new web site I'm building. It means I have to add the feature to the web server.
It works like this. Every part of the web site code that builds URLs for the same site passes them through the mapping logic first. The mapping logic computes a SHA1 checksum of the canonicalized URL string, then looks that checksum up in a fast database (I'll be using Berkeley DB for this). If the checksum is already there and maps to the same URL, it generates a new URL that references the checksum. If it maps to a different URL, it notifies me that it found a SHA1 collision. If the checksum is not there yet, it adds it. The original URL is thus replaced with the mapping URL.
Code added to the web server will be designed to detect checksum URLs. If it looks like one, it looks it up in the database to get the original URL, and proceeds with the request using that URL. Original URLs would still be processed as usual, in case they leak out, or are intentionally made to bypass the mapping for special purposes. Basically it's like a tiny URL service, but integrated without the need to do a redirect.
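A rough sketch of the scheme described above, under stated assumptions: a plain dict stands in for Berkeley DB, "canonicalizing" is reduced to stripping whitespace, and the /u/ prefix, the UrlMap class, and its method names are made up for illustration. It is not the poster's actual code.

```python
import hashlib


class UrlMap:
    """Maps site URLs to checksum URLs and back (in-memory stand-in for a database)."""

    PREFIX = "/u/"  # hypothetical path prefix that marks a checksum URL

    def __init__(self):
        self._by_digest = {}  # assumption: a dict instead of Berkeley DB

    def shorten(self, url):
        canonical = url.strip()  # placeholder for real URL canonicalization
        digest = hashlib.sha1(canonical.encode("utf-8")).hexdigest()
        existing = self._by_digest.get(digest)
        if existing is None:
            self._by_digest[digest] = canonical  # first time we see this URL
        elif existing != canonical:
            # Same checksum, different URL: report the collision.
            raise RuntimeError("SHA1 collision: %r vs %r" % (existing, canonical))
        return self.PREFIX + digest

    def resolve(self, path):
        """Return the original URL for a checksum URL; pass other URLs through."""
        if path.startswith(self.PREFIX):
            return self._by_digest.get(path[len(self.PREFIX):], path)
        return path


urls = UrlMap()
mapped = urls.shorten("/wiki/index/cool_tips/code/perl/hello_world.php")
print(mapped)
print(urls.resolve(mapped))
```

Because resolve() returns unknown paths unchanged, original URLs that leak out still get processed as usual, which matches the behaviour described above.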
One thing I am looking at doing is shortening even these URLs, even though they should be short enough already. But truncating the checksum raises the chance of a collision to the point where I'll need to add logic to deal with it. I would handle that much like a collision in a hash table, except that I'd resolve it by adding back digits of the SHA1 checksum that were removed to shorten it.
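One way the truncate-then-extend idea could look, as a sketch only: start from a short prefix of the checksum and, on a collision, add back one hex digit at a time until the key is free or already belongs to the same URL. The six-digit starting length, the dict storage, and the hash_fn parameter (there so a collision can be forced for demonstration) are my assumptions.

```python
import hashlib


def sha1_hex(url):
    return hashlib.sha1(url.encode("utf-8")).hexdigest()


class ShortUrlMap:
    """Stores URLs under the shortest free prefix of their checksum."""

    def __init__(self, min_len=6, hash_fn=sha1_hex):
        self.min_len = min_len    # assumed starting key length
        self.hash_fn = hash_fn    # injectable so collisions can be simulated
        self._by_key = {}         # assumption: a dict instead of a real database

    def shorten(self, url):
        digest = self.hash_fn(url)
        # Try ever longer prefixes: each step adds back one removed digit.
        for length in range(self.min_len, len(digest) + 1):
            key = digest[:length]
            existing = self._by_key.get(key)
            if existing is None:
                self._by_key[key] = url
                return key
            if existing == url:
                return key  # already mapped, reuse the same key
        raise RuntimeError("full checksum collision for %r" % url)

    def resolve(self, key):
        return self._by_key.get(key)


# Force a prefix collision with fake checksums to show the extension step.
fake = {"page-a": "abcdef1111", "page-b": "abcdef2222"}
demo = ShortUrlMap(min_len=6, hash_fn=lambda u: fake[u])
print(demo.shorten("page-a"))  # abcdef
print(demo.shorten("page-b"))  # abcdef2
```

Keys are never deleted in this sketch, so a URL always resolves to the same key on repeated calls.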
External URLs to other sites can be handled the same way, though that does add an extra redirect. I could limit this to long external links only, since a web interface should cope with ordinary external links as they are. It could be an option.
now we need to go OSS in diesel cars
A couple of good questions I have seen, and my best attempt to answer them:
1. Don't you mean rel? No, I mean rev. It indicates a reverse link.
2. Why not make your URLs short in the first place? I happen to like my URLs and have made them as short as I want them. They're only too long in some very specific use cases, like Twitter. I could just complain about Twitter, or I could support an idea that makes URL shortening suck less. I chose the latter.
Thanks for reading, and please do feel free to criticize whatever you think is wrong with this idea. I'd like a way to indicate a preferred short URL for my own stuff, and this seems like a pretty good way to do it that makes sense semantically and is easy to implement. For an ongoing discussion about adding an HTTP header to do the same thing (so that only a HEAD request is required), read here:
http://shiflett.org/blog/2009/apr/a-rev-canonical-http-header
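To make the HEAD-request idea concrete, here is a hedged sketch of the client side. The header name is not given in this thread, so X-Rev-Canonical below is a placeholder assumption, as are the function names; the point is only that a client can read a short URL from response headers without downloading the page body.

```python
import urllib.request

HEADER_NAME = "X-Rev-Canonical"  # hypothetical header name, not confirmed by the thread


def short_url_from_headers(headers, header_name=HEADER_NAME):
    """Pick the short URL out of a mapping of response headers (case-insensitive)."""
    wanted = header_name.lower()
    for name, value in headers.items():
        if name.lower() == wanted and value.strip():
            return value.strip()
    return None


def fetch_short_url(url, header_name=HEADER_NAME, timeout=10):
    """Issue a HEAD request and return the advertised short URL, if any."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return short_url_from_headers(response.headers, header_name)


# Offline demonstration with made-up headers.
sample = {"Content-Type": "text/html", "x-rev-canonical": " http://example.org/s/526 "}
print(short_url_from_headers(sample))
```

fetch_short_url() needs network access, so only the header-parsing half is exercised in the demonstration.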
US wireless carriers charge on both ends -- both the receiver AND the sender will pay the 15 cents per message, assuming neither one of them has an unlimited plan. I think this charge used to be 10 cents, but was raised to 15 cents last year. Or maybe it was 15 cents and was raised to 20 cents. I have no idea, but either way it is terrible. I think plans are typically $5/month for 200 'texts' or $15/month for unlimited.
And don't even get me started on MMS messages. I received my first MMS spam the other day. My first thought was "ooh, nice tits", but my second thought was "$#%&, I probably just got charged $3.00 for this spam!"
It wasn't even the DiggBar exactly. Gruber didn't like it for the obvious reasons (it breaks bookmarks and history, hides the site, etc.), but mainly because the DiggBar was turned on by default for all users. Other sites have things like the DiggBar, but no one really complained about them because users had to turn them on themselves.
If he alone had not liked it you would not have seen the rush to block it from all quarters. I as a user despised it myself, and am happy to see all framing mechanisms die a horrible death.
Shortening services that use a redirect, he and others have no issue with.
"There is more worth loving than we have strength to love." - Brian Jay Stanley