URL Shorteners Get Some Backup
URL shorteners are problematical, as everybody knows, but with the rise of Twitter and its ilk they seem to be a necessary part of the landscape. Some of the biggest questions around services such as bit.ly, TinyURL, and is.gd is what happens when they go out of business (as tr.im did last August). Now a group of such companies, organized under the auspices of the Internet Archive, has formed a non-profit entity to hold URL-shortening databases in escrow, with the intent of continuing to resolve a member company's links should it get out of the business. At announcement, the 301Works organization has 21 URL-shortener members, including the largest, bit.ly. Many others are not (yet) on board. The members have agreed to cede control of their domain names to 301Works.org should they exit the field, and to back up their URL mappings regularly to the organization.
URL shorteners are problematical, as everybody knows, but with the rise of Twitter and its ilk they seem to be a necessary part of the landscape.
Seriously?? I know editors frequently get grief for this sort of thing, but come on... the word is problematicalic, for crying out loud. ;)
They inevitably generate link rot.
A governing body should not have to step up to preserve these databases.
I have a great proof why this won't work, but it's too long to fit in into a URL :(
Sorry, I mean srs bsns.
Hopefully bit.ly's commitment will force the other common players (tinyurl, tr.im, etc) to join as well. Bit.ly was the only main player on their list so far. A great next-step would be to get the twitter image sites (twitpic, img.ly, etc) on board as well.
URL shortners only server for twitter posts and other place where you need to count characters, these links become pointless within days of a post (some think they become useless even earlier than that), so why bother preserving them after that? let alone when a provider goes bankrupt.
p.s I'm only posting this so i can get some karma to go troll apple ;)
IranAir Flight 655 never forget!
If one of these companies goes bankrupt, their creditors will demand the only valuable asset: the domain name. Does their agreement with 301Works overrule the creditors claims?
qkd2f
My first program:
Hell Segmentation fault
running these things?
$6.99 a year for the domain with free standard hosting from GoDaddy and you're set.
It's not like it's a particularly difficult task to create and run these types of sites. With a simple cron script to clear out links which haven't been clicked in X amount of days you won't even have to worry about your DB ballooning out of control.
Throw up Google AdSense on the user facing side to draw in funds and point both GoDaddy and Google at the same bank account. Google giveth and GoDaddy taketh away. Throw in a hundred to start and you're good to go for a decade.
Work Safe Porn
but with the rise of Twitter and its ilk they seem to be a necessary part of the landscape.
The fact that a website based on the horridly outdated USA SMS system needs URL shorteners does not convince me for a second that they're a necessary part of the landscape. Everywhere else on the internet, a few bytes aren't an issue and we can copy+paste long links anyway.
I'm not sure what the point of this is - most links that would use a shortener are those with limited lifespans anyway. Twitter et al are generally of the "look at the cool article/video clip" The use in printed works I can somewhat see but even the underlying website for these things is likely to have changed during the intervening period if these links are more than a year old. This seems a solution in search of a problem.
If bit.ly is the largest, I wonder why I haven't heard of it. Granted, once you find one, then it's golden. But I've never seen a bit.ly shortener. I've sone loads of tinyurl.com hashes, but never any bit.ly.
URL shorteners are a scourge. As someone else pointed out, they're only really useful for Twitter, with its artificial post-length constraint. Anyone who links to a tinyurl on an actual Web site (such as Slashdot) should automatically be assumed to be a troll, because the only reason to use an URL shortener is to conceal what you're actually linking to.
Breakfast served all day!
Funny, I still have 4 urls shortened by them and they all still work.
These private URL shortening sites shouldn't exist anyway. They're just a hack to support long urls on mediums that can't handle proper html-style linked text (aka hypertext). Those mediums are buggy should be upgraded (if only by footnote style guidelines).
The bigger issue is private databases, and that all these sites are independent, with separate domains and slightly different urls. The proper solution to that is probably to replace shortlinks with URNs, some DNS/directory/mirror protocol extension that allows browsers to find the nearest server that handles SHORTNAME://MYURN style links, and to essentially query it for "whatever webserver(s) can provide the file referred to by MYURN".
Now, I may be missing something here...
But can someone enlighten me why it would be "problematic" if such a service would go out of business?
All they do is redirect to the original url. So where is the loss?
The original url is still there. If no one is able to find it without using the shortened url chances are pretty big it isn't much interresting anyway.
Life starts at the end of your comfort zone.
Does anyone else find it odd that the White House's twitter page uses bit.ly urls when .ly is the top-level domain code for Libya?
But then what happens when 301Works dies?
That stupid characters limit is self-imposed, they're the ones who created the problem. The easiest solution is to not count HTML characters, meaning you could have a 200 characters URL but be limited in length for the anchor text itself.
It also does unnecessary traffic and adds a delay. Doesn't seem like much, but let's say in five years everybody does that:
1. You click a link
2. Your browser does a DNS request for the shortener domain
3. The shortener website takes the short URL, fetches it in its database
4. Your browser receives a redirect to a new domain
5. Your browser does a DNS request for the actual domain
6. The actual website is sent to your browser
Steps 1-4 are added only for the benefit of an artificial, self-imposed limit from ONE website?
Fuck Twitter.
From what I see they are just for transient things such as Twitter or blog posts - why would someone permanently link using one of these services?
These private URL shortening sites shouldn't exist anyway. They're just a hack to support long urls on mediums that can't handle proper html-style linked text (aka hypertext). Those mediums are buggy should be upgraded (if only by footnote style guidelines).
Have fun upgrading SMS in the network back end, all the handsets that support it, and how it's billed across all carriers globally.
Not that the web should downgraded to support inferior systems, but mobile does have full web support these days, and there is such a thing as MMS these days.
Some websites have user-friendly URLs, such as "www.apple.com/macmini/". You don't even need to click that link to know what it's about.
Other websites have dumb, half-friendly URLs, where they add the backend technology inside the URL, such as "http://www.logitech.com/index.cfm/mice_pointers/" (what's with the "index.cfm" in the URL?). If they fix that problem, all the links pointing to the current URL will break. If they ever change technology, it's also going to break the links from other websites.
And then we have the URLs designed by web monkeys, such as the link for Dell's new Adamo laptop: "www.dell.com/content/topics/topic.aspx/global/products/adamo/topics/en/us/adamo-onyx?c=us&cs=19&l=en&s=dhs". What the HELL is that thing? Even if we forget the parameters at the end, look at the path of that thing! I don't care how your crap is organized on the server, the URL should be much simpler than that!
And last, we have completely brain-dead URLs that seem to be created for computers only, without any chance of figuring out what kind of content is waiting for us on the other side of that link. Crap like "http://www.sonystyle.com/webapp/wcs/stores/servlet/CategoryDisplay?catalogId=10551&storeId=10151&langId=-1&categoryId=16154&SR=nav:electronics:computers:notebook_computers:shop_compare:ss". We're lucky to see "notebook_computers" in the parameters, sometimes it's just a database reference number.
But even with crap URLs like that, unless you have to spell it, write it down or read it on a (paper) page, such links can be hidden behind simple anchor text such as Sony Laptops.
Twitter is its own problem, they should be the ones to fix their own mess. Someone could start a service similar to Twitter but without counting HTML code as being part of the X characters limit, which seems to be what the fuss is all about.
Maybe we could just issue unique IDs for everything on the Internet. I'm not sure how many would be enough. It could be 64-bits, or perhaps even 128, although you can be sure that if we did that some comittee would probably come up with a reason to gobble up bits. Then of course you'd need some bits for private URLs.
I'm not sure what you call it, but plainly some protocol is needed for all these URLs on the Internet. A kind of... "Internet Protocol", or IP for short. Yeah, that's it. Is anybody working on that?
(end sarcasm)
For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
Make the URLs/URIs/IRIs short in the first place, and get rid of the problem. (.html, leading double slash, trailing slash, www...) /saves/ network traffic in the end.
And use title of page instead of plain IRI.
No need query servers with cryptic fake-URIs to get the real ones. Sending the full URIs actually
One reason the link-rot threat is very real is the little guy.
I run a url-shortner (ish) service because it's fun and I can.
While, I would love to defend url shortners, my advice to a friend would be : don't use these for anything important. They are not to be used in place of bookmarks. If you have a site or a blog.. just use the real URL in the href. You can beautify it any way you would like inside the "Anchor" tag itself. We've been doing that for two decades now.
Also, the link-rot threat is quite real. SoCuteUrl is simply a fun way to send an otherwise cumbersome link. It's more memorable.. easy to write down, text, etc.
I run the site because it costs very very little to do so and is a very easy to thing to have set up. And, it's fairly easy to maintain.
This is where the problem lies. These are so easy to engineer that virtually anyone can do it. Yes, even slackers like myself with a tendency to flake out on personal projects.
301Works Looks like a decent solution. I will be evaluating it for my own site (socuteurl.com).
However, the membership fee, which does not exist now could prove problematic. My site makes no money. $1,000 a year may not be a lot of money for a site that makes some kind of profit, but it's a lot to support a hobby.
I think 301works may have to come up with a better way to support their costs. Since the biggest threat to link rot.. are the sites that don't make money! I think the membership fee if instated should be optional, and donations should be accepted. Or, perhaps the membership fee can be scaled down for sites with small dbs to upload.
Slashdot covered the benefits of using url-shorteners to reduce bandwidth waste only last March! Everyone is so eager to prove how sophisticated they are and toss hate on an admittedly stupid fad (twitter) that they're prepared to pretend there are absolutely no benefits to using the types of services talked about in the article. I thought this was supposed to be a geek site, not some silly MMORPG where the only thing that counts is how high your comments get rated?
Remember this?
There's a reason why people use url-shorteners, and that reason is because they have a benefit to their use! Many of the more savvy tech sites have begun using them internally to save that 'as much as 75MBit/sec of bandwidth' mentioned in the Slashdot headline. If there is a group getting together to ensure this usage can continue to live on even after the death of the individual services, so much the better! This should be seen as good news...
Instead you half-wits decided to forsake any semblance of geek cred you may have had to whine about Twitter... stuff like this and I wonder why I even come here any more!
--bornagainpenguin
Have a Virgin Mobile USA smartphone? Give VMRoms.com a try!
You don't want one from California dude, they're all in the union. Cheaper to hire one from Alabama and fly him to Cali than it is to pay those union wages! As a bonus the one from Alabama usually will have a better gun and you can probably get a discount if you tell him the target is a Yankee.
ACs don't waste your time replying, your posts are never seen by me.
They also give you a way to send people to places they would normally have the good sense not to.
Cool!
What? There's no need to change any of the "means" of transmission. When a person reads an sms and a) copies the link to his/her PC browser or b) tells the phone to follow that link, does it matter that the TEXT in the link is "http://bit.ly/asdfasdf" or "short://0923abf84"? Nope, it doesn't. Any device connected to the internet can use any of the theoretical URN to URL mappers (there are tons and tons of research papers written on this and how DNS is crippling the web, etc.) I'm just way too tired to explain any further, but, please, next time you try to make a point, make sure you do have one.
let's play spot the idiot who doesnt "get" IP networking and the web in general. oh look, it's you!
Are you serious? HTTP is a text based protocol. If you really want to optimize that much forget about shortening your urls and turn on compression.
So do it internally. There's no reason slashdot couldn't turn all the URL's on their own page into
http://slashdot/a
a-z 0-9, then add 2 digits, then 3, then 4. That should last you a while.
I'm not a nig fan of url shortening services, but sometimes it makes sense.
Sometimes I just want to send a link to my friends with my dumb phone. The problem is that I don't want to type the whole stupid url and I don't want my friends to ask me if it's an Oh or a Zero. In that case, SocuteURL rocks!
Some of the biggest questions around services shch as bit.ly, TinyURL, and is.gd is what happens when they go out of business (as tr.im did [CC] last August).
Yeah, it's out of business, but now it's open-source and not-for-profit (IIRC). I use tr.im a lot and it works just as good, if not better, than any other URL shortener I've used even.
"Our country is not nearly so overrun with the bigoted as it is overrun with the broadminded." -Archbishop Fulton Sheen
Firstly, tr.im is still working, and even better, it's now open source (on github). Secondly, although this backup stuff sounds good, there's probably a much easier solution: put your link database online and release it under a free license. That's exactly what ur1.ca did (and this one is open source too).
These private URL shortening sites shouldn't exist anyway. They're just a hack to support long urls on mediums that can't handle proper html-style linked text (aka hypertext). Those mediums are buggy should be upgraded (if only by footnote style guidelines).
Could you clarify this? How does what you're suggesting help me read a long URL over the phone? Or type one from memory? Or paste one into an IM or IRC chat window?
It's not difficult to write your own. I did it (not going to link to it because my server probably won't handle the /. effect.)
They can't even decide on the name. In their Terms of Participation, they refer to
themselves as 310works, not 301works. Later they refer to themselves as 201works.
This does not appear to me to be a very professional company if they can't even proofread their own page...
And this part gets me:
Participating companies will be encouraged to place a ‘301Works’ badge on their websites, indicating that they are operating in accordance with these terms of participation. We will generate these badges so they will include the 301works logo and the company’s logo.
They get free advertising on all of these sites. And last section says they *MAY* impose a fee later, like a $1000/year....
I'm providing my services for free, no guarantees, warranties or promises. If I go belly up, well, to bad... But with their proofreading "skilz" and free advertising, and possibly charging a fee later on, I think I'll pass.
The day Microsoft creates a product that doesn't suck, it will be known as the Microsoft Vaccuum Cleaner!
Hi, grammar (and spelling!) fails in sig, but judging from post content, it doesn't matter.
I feel fantastic, and I'm still alive.
./ seems biased against long URLs. When I try to paste one from http://iliil.com/ or http://011010.com/ I get: "Filter error: That's an awful long string of letters there."
Years ago I ran a web server off a home connection using Windows Me and Apache.
I logged over 1 million page views in a month.
As someone else mentioned, it's (possibly) all the click tracking and storing of links perpetually which will eat up database space which results is lower response time without more or better hardware.
Work Safe Porn
I think the article you cite criticize the use of ridiculously long url such as http://www.google.com/search?q=slashdot&ie=utf-8&oe=utf-8&aq=t&rls=org.debian:en-US:unofficial&client=iceweasel-a which is not even that bad. If you want short URL to save bandwidth you can change the URL for the one you manage and run a short url service yourself for the one you do not manage.
The main feature (redirecting) is trivial and capable of being handled with a GoDaddy account. The economy package (free with domain) gives you 300GB per month of transfer and 10GB of space to work with. GoDaddy claims to limit the size of your DB to hundreds of megs or so but the real limit is several GB.
I can see that the additional features of tracking clicks and what not (which should be handled on the individual sites, not by the url redirector) would result in far more resources being needed.
If these sites don't want to go out of business due to increasing costs then the issue is trying to offer more than they need to. Cut features and stick to the core functionality.
But, I guess some people would rather drive themselves into the ground than look for ways to save money and resources.
Work Safe Porn
Bit.ly is currently the number one URL shortener because Twitter automatically uses them when a non-shortened URL is sent.
Work Safe Porn
I'd like to see that shortener database stored as a distributed hash table, and clients can either query it directly by joining the DHT or, as you said, use a service that exposes an simple API that queries DHT.
Heck if they go bankrupt why not just release their database as a torrent and let other people get it if they want? Not like it'll do them any good to keep it private anyway.
The source code is available on e44.us/1.
You can "log in" with your gmail account so one day you can edit your short links.
Anyhow, it's a simple app for now but if there is interest in a "community" OSS project, we can add cool features like, make personalized forms of the app (urls like e44.us/fred/1) or even your own domain (which you can do now with a Google Apps account), optimize it for mobile phones, validate access to URL's etc etc. If you're interested, let me know.
http://2sa.me/ is my own URL shortener service I just whipped up.
It's running off a GoDaddy account.
It's using a two hash method of storing and retrieving URLs quickly. The URL is stored once based on the MD5 hash of the URL (for finding URLs that are being submitted) and once based on the MD5 of the short code (for fast lookup when redirecting). Given one you can find the other. URLs are stored across 4096 tables.
We'll see if it manages to earn $9 over the next year.
Work Safe Porn
How's about Twitter just stops imposing a stupid arbitrary limit on post size, and then we wouldn't need these horrible services?
The SMS message length is a red herring - when was the last time you saw a phone that couldn't handle multiple messages strung together? And I know it has the side benefit of encouraging brevity and stopping people using it like a full-blown blog, but honestly, there's no need - Facebook status messages don't have a length limit (that I've hit, anyway) and I don't see anybody knocking out War And Peace in there, because it's just not the medium for that sort of thing.
I think the OP is referring to a compression algorithm, instead of a cryptographic hash. It sounds like a good idea.
Perhaps this RFC can recommend the use of reserved domain names for compatibility purposes (i.e., http://x.invalid/foobar)
Cool, now you can get your OH JOHN RINGO NO goodness at http://www.socuteurl.com/cheekyfuzzydoo
My heart is about to explode from the cuteness.
Why can't someone build a purpose-built compression algorithm for URLs, so we can skip the URL shortener providers entirely? URLs contain lots of oft-occurring constructs, so I would think a reasonably good compression ratio could be attained.
Take a URL like http://is.gd/XXXXX - that's 18 characters where only 5 are being used to reference the URL. Couldn't a generic URL compressor do a better job on most URLs of reasonable length? Then we could build inflate support directly into the browser and skip the URL shortener entirely.
If Twitter is the main consumer of these, and they depend on them, why doesn't Twitter go into the shortening business? Buy one of those short domain names and run it themselves. They clearly have the resources, and the motivation. Seems obvious to me.
J