Slashdot Mirror


Can rev="canonical" Replace URL-Shortening Services?

Chris Shiflett writes "There's a new proposal ('URL shortening that doesn't hurt the Internet') floating around for using rev="canonical" to help put a stop to the URL-shortening madness. In order to avoid the great linkrot apocalypse, we can opt to specify short URLs for our own pages, so that compliant services (adoption is still low, because the idea is pretty fresh) will use our short URLs instead of TinyURL.com (or some other third-party alternative) replacements."

24 of 354 comments (clear)

  1. but will they be cute? by SethJohnson · · Score: 5, Funny



    What value are these new URLs if they aren't cute?!?

    Seth

    1. Re:but will they be cute? by kongit · · Score: 4, Funny
  2. WTF? by Anonymous Coward · · Score: 4, Insightful

    I didn't understand a single word of the submission, and I used to teach Web design. Is it too much to ask submitters to define terms they use?

    1. Re:WTF? by Renderer+of+Evil · · Score: 4, Informative

      This whole url shortening shit started to pick up steam few days ago when Digg introduced Diggbar - a hybrid of frame and url-shortening that framed other sites and did not display the proper site address. John Gruber went nuts and modified his blog to redirect users to a special page. Then he blogged for 2 days non-stop how to make diggbar go away. Since he's widely read around the web everyone started chiming in with their opinions on the general idea of url shortening services and how it hurts or helps the web.

      Nerd bullshit. And not the good kind.

  3. a better idea by Anonymous Coward · · Score: 5, Funny

    how about we just kill all twitter users instead?

    1. Re:a better idea by YourExperiment · · Score: 4, Funny

      Twitter? That's so first-quarter-of-2009. Everyone's using flutter now you know.

  4. Solving the wrong problem by CarpetShark · · Score: 4, Insightful

    Yes, TinyURL hasn't killed anyone. BUT... any attempt to fix this is entirely missing the point anyway. From the article:

    I happen to think this URL is beautiful. :-) Unfortunately, it is sure to get mangled into some garbage URL if you try to talk about it on Twitter, because it's not very short. I really hate when that happens. What can I do?

    If rev="canonical" gains momentum...

    If they fix twitter to support links with proper labels or tag contents --- Oh, I don't know, like HTML has supported from the very beginning --- then there wouldn't be a problem.

    Don't work around the bugs, fix the bugs. Links are designed for machines, the higher-level marked up text is for people.

    1. Re:Solving the wrong problem by rusl · · Score: 5, Funny

      But then you're going to have the problem solved instead of opening up a new can of worms with lots of jobs and neverending problems to solve. Intelligence is bad for the economy.

      --
      Stupidity is its own reward.
  5. Arbitrary by Senjutsu · · Score: 5, Informative

    Twitter is essentially an SMS aggregation and redistribution tool. SMS is limited to 140 character messages. I do not think you understand the meaning of the word "arbitrary".

    1. Re:Arbitrary by Anonymous Coward · · Score: 5, Funny

      Your mom is an aggregation and redistribution tool. And she certainly didn't limit herself to 140 characters.

    2. Re:Arbitrary by Jamie's+Nightmare · · Score: 5, Informative

      There's no technical reason restricting Twitter from allowing 140 rather than 160 characters, unless there's an issue I am not aware of (perhaps one or more major mobile networks are broken and only allow 140 characters rather than 140 bytes?).

      20 are reserved for the user name. The co-founder mentioned this during his interview on The Colbert nation.

      --
      "When you see a unixer brainwashed beyond saving, kick him out of the door." - Xah Lee
  6. URL shortened, of course by Toe,+The · · Score: 4, Interesting

    On the Twitter /. feed, this of course shows as:
    slashdot Can rev="canonical" Replace URL-Shortening Services? http://tinyurl.com/c3j4n8

    P.S. Now if you want a really short URL, try http://tinyarro.ws/ (no affiliation; just impressed by the idea)

  7. Re:sorry but I dont get... by palegray.net · · Score: 5, Funny

    somesite.org/wiki/index/cool_tips/code/perl/hello_world.php

    That's just wrong.

  8. Re:"Great link apocolypse" WAT? by Feyr · · Score: 5, Informative

    short summary: everyone should adopt this NewTechnology(tm) because it will make twitter work better

    1. If everyone uses it
    2. if twitter implements support for it

    of course it's pretty much useless for everyone else

  9. Re:"Great link apocolypse" WAT? by Sebilrazen · · Score: 5, Funny
    Oh great, mysterious and anonymous time traveler, what year did you start using the internet so that we may know what year you are posting from and get lottery numbers, World Series and Superbowl winners from you?

    From tinyurl:

    Copyright © 2002-2009 Gilby Productions. All rights reserved.

    (2009 - 2002) < 11+

    --
    "There are no facts, only interpretations." --Friedrich Nietzsche.
  10. Re:I have an easier solution: by Christophotron · · Score: 5, Interesting

    How about Twitter just stops arbitrarily limiting characters. Go by word count, perhaps?

    I know some avid twitter users, and the majority of them apparently use the idiotic SMS message system to 'tweet' each other all throughout the day on their phones. Twitter can't abandon the 140-character limit for this reason.

    For the record, I am against anything that keeps the SMS system relevant in this day and age. It should have been abandoned long ago in favor of standard data packets on the internet, rather than control packets on a proprietary wireless system. There's no good reason to keep this system alive when it either forces you to pay $X per month for it, or pay $.15 per 140 characters when one of your idiot friends 'texts' you. There's no way (that I know of) to force incoming SMS to route through GPRS, so you are hit with SMS fees even when you already pay for unlimited data. It also invites spam that you actually DO pay for, quite literally, and from which the wireless carrier profits as well. It should be illegal for the carrier to charge you for incoming SMS messages. Anyone who agrees with me should call their congressperson to protest this policy and call their wireless carrier to block all SMS messages.

  11. Re:"Great link apocolypse" WAT? by ushering05401 · · Score: 5, Informative

    This story should be tagged Twitter.

    This guy seems to be focusing on the meaningful identifier aspect of URL shortening for use in a space limited context - without actually confining his suggestion to use in that sort of environment.

    He puts forth other reasons for using this method such as control over the persistence of the shortened URL, but that doesn't make a whole lot of sense to me... and then he goes back to mentioning Twitter.

  12. Alternative Solution: Implement it Right? by Kupo · · Score: 5, Insightful

    There's all this talk of URL shortening services - whether third-party, or in-house implementation.

    The question here is this: Why are the URLs so long to begin with?

    Why does it have to be:
    http://shiflett.org/blog/2009/apr/save-the-internet-with-rev-canonical

    A full title in the URL is, IMHO, a very inefficient idea. The excuses I've heard are:

    Search Engine Optimizations (better performance when keywords are in the URL)
    Okay, I can't argue that some search engines do stuff like that. But shouldn't the TITLE or META tags have more bearing on this than how ridiculously long the URL is?

    "The URL has meaning, so you know what you're clicking", Context, etc.
    I suppose that when I see a URL like
    http://shiflett.org/blog/2009/apr/save-the-internet-with-rev-canonical
    as opposed to something like
    http://example.org/blog/526
    I would have a slightly better idea of the article's content before clicking on it. But then again, I can't really say that I've decided against clicking on a link just because of the link URL. I would, instead, decide whether I'd want to visit the link by its link text/description.

    So <a href="http://example.org/blog/526">blog on link shortening</a> would still have the same effect on me as a long URL IMO. If it were bookmarked, the same rules would apply.

    Hell, if I were handed an obfuscated shortened URL without context, I'd know even less of what I was getting myself into.

    I think the proper solution is to just stop making ridiculously long URLs to begin with, so we don't have to rely on obfuscation/hashing/shortening to accommodate services that have character limit restrictions. And we'd save bandwidth too, apparently. Win-win?

  13. Re:I have an easier solution: by he-sk · · Score: 5, Insightful

    LOL! Only in America, the free market bastion of the world, do you have to pay for incoming texts.

    --
    Free Manning, jail Obama.
  14. I prefer hugeurl by LittleBigScript · · Score: 5, Funny

    "Because bigger is better, right?" http://www.hugeurl.com/

  15. Well, I call for long URLs by athlon02 · · Score: 4, Insightful

    All this short URL stuff sounds like some phishing scam if you ask me. Short cryptic URLs obviously exist to make me transpose a couple of letters or numbers and end up at some fake bank site. No, give me large detailed URLs so I can see those dead giveaways like pid=poor_sucker&sid=steal_credit_card_info !

    Short URLs indeed... no thank you Nigerian scammers... I won't be transferring any large sums today!

    On a serious note, why is this news exactly?

  16. URL mapping is the answer by Skapare · · Score: 4, Interesting

    Unfortunately, it's not yet an integral part of web frameworks that I have seen. So I am adding it in a new web site I'm building. It means I have to add the feature to the web server.

    It works like this. Every part of the web site code that builds URLs for the same site passes them first through the mapping logic. This basically builds an SHA1 checksum of the canonicalized URL string. Then it looks up the string in a fast database (I'll be using Berkeley DB for this). If it's already there, and is the same URL, it generates a new URL that references the checksum. If it was a different URL, it notifies me that it found an SHA1 collision. If not already there, it adds it. The original URL is thus replaced with the mapping URL.

    Code added to the web server will be designed to detect checksum URLs. If it looks like one, it looks it up in the database to get the original URL, and proceeds with the request using that URL. Original URLs would still be processed as usual, in case they leak out, or are intentionally made to bypass the mapping for special purposes. Basically it's like a tiny URL service, but integrated without the need to do a redirect.

    One thing I am looking at doing is shortening even these URLs, even though they should be short enough already. But this raises the chance for a collision to the point I'll need to add logic to deal with it. How I would do that is similar to a hash data structure collision, but by expanding on the SHA1 checksum by adding back digits that were removed to shorten it.

    External URLs to other sites can be done the same way. This does add the extra redirection. I could limit the use of this only to long external links, since this being a web interface, should handle long external links OK. It could be an option.

    --
    now we need to go OSS in diesel cars
  17. A Few Responses by shiflett · · Score: 5, Interesting

    A couple of good questions I have seen, and my best attempt to answer them:

    1. Don't you mean rel? No, I mean rev. It indicates a reverse link.

    2. Why not make your URLs short in the first place? I happen to like my URLs and have made them as short as I want them. They're only too long in some very specific use cases, like Twitter. I could just complain about Twitter, or I could support an idea that makes URL shortening suck less. I chose the latter.

    Thanks for reading, and please do feel free to criticize whatever you think is wrong with this idea. I'd like a way to indicate a preferred short URL for my own stuff, and this seems like a pretty good way to do it that makes sense semantically and is easy to implement. For an ongoing discussion about adding an HTTP header to do the same thing (so that only a HEAD request is required), read here:

    http://shiflett.org/blog/2009/apr/a-rev-canonical-http-header