Slashdot Mirror


Are Long URLs Wasting Bandwidth?

Ryan McAdams writes "Popular websites, such as Facebook, are wasting as much as 75MBit/sec of bandwidth due to excessively long URLs. According to a recent article over at O3 Magazine, they took a typical Facebook home page, looked at the traffic statistics from compete.com, and figured out the bandwidth savings if Facebook switched from using URL paths which, in some cases, run over 150 characters in length, to shorter ones. It looks at the impact on service providers, with the wasted bandwidth used by the subsequent GET requests for these excessively long URLs. Facebook is just one example; many other sites have similar problems, as well as CMS products such as Word Press. It's an interesting approach to web optimization for high traffic sites."

379 comments

  1. Can they not use... by teeloo · · Score: 5, Insightful

    compression to shorten the URL's?

    1. Re:Can they not use... by Anonymous Coward · · Score: 0, Informative

      Sure they can

      TinyURL

    2. Re:Can they not use... by Anonymous Coward · · Score: 0

      compression to shorten the URL's?

      Maybe, but removing redundant data will certainly help.

    3. Re:Can they not use... by Skal+Tura · · Score: 2

      handshake for the compression, and packet headers would probably become more than the potential benefits, not worth the effort.

    4. Re:Can they not use... by dotgain · · Score: 5, Funny

      No, they cannot use TinyURL (read: goatse, tubgirl et. al) thank you very much.

    5. Re:Can they not use... by corsec67 · · Score: 1

      You mean something like mod_gzip?

      That leave only the url in the request header, the rest should (already) be compressed by mod_gzip.

      --
      If I have nothing to hide, don't search me
    6. Re:Can they not use... by tepples · · Score: 1

      Take a page full of short URLs and a page full of long URLs. Run them both through mod_gzip. The page with short URLs will still probably come out smaller.

    7. Re:Can they not use... by unlametheweak · · Score: 1

      compression to shorten the URL's?

      No, throttle these Web sites. Throttling is a more traditional approach to bandwidth management.

    8. Re:Can they not use... by truthsearch · · Score: 4, Funny

      They should just move all the GET parameters to POST. Problem solved. ;)

    9. Re:Can they not use... by Anonymous Coward · · Score: 2, Informative

      That's not compression, that's hashing.

    10. Re:Can they not use... by slummy · · Score: 1

      That wouldn't be very UX-centric.

      If pages continually POST to each other, hitting the browser's back button will display the annoying alert asking you to "Resend POST data".

    11. Re:Can they not use... by MoFoQ · · Score: 1, Interesting

      because they got more requests than the number of unique things TinyURL or whatever can handle.

      Better is to use a better way of doing AJAX other than using GET....they can use POST and make sure gzip is on.

      I think if they really put their minds on it, they can also implement clientside JSON compression using some of the javascript compression libraries that are out there (or use a simple flash wrapper to do the dirty work).

      Just throw a bunch of kiddies (or 21yr olds) in a room and offer them free pizza/beer whatever.....it'll get done.

    12. Re:Can they not use... by jd · · Score: 4, Informative

      Most of the time, yes, but then there's a question of trade-off. Small URLs are generally hashes and are hard to type accurately and hard to remember. On the other hand, if you took ALL of the sources of wastage in bandwidth, what percentage would you save by compressing pages vs. compressing pages + URLs or just compressing URLs?

      It might well be the case that these big web services are so inefficient with bandwidth that there are many things they could do to improve matters. In fact, I consider that quite likely. Those times I've done web admin stuff, I've rarely come across servers that have compression enabled.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    13. Re:Can they not use... by jd · · Score: 1

      Then dump CGI-like syntax completely and use applets that send back data via sockets.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    14. Re:Can they not use... by master5o1 · · Score: 1

      Your URL violated the compression filter. Try less whitespace and/or less repetition.

      --
      signature is pants
    15. Re:Can they not use... by gbh1935 · · Score: 5, Funny

      This thread is wasting more bandwidth

    16. Re:Can they not use... by rackserverdeals · · Score: 1

      Those times I've done web admin stuff, I've rarely come across servers that have compression enabled.

      Not sure why you would see that. Even for small sites that don't come close to hitting their minimum bandwidth allocation, using mod_gzip increases the visitor's experience because the HTML and CSS files download a lot faster and the processing overhead is minimal.

      As for this story, I think whoever wrote it had this epiphany while they were stoned. There are so many other ways that Facebook could save bandwidth if they wanted to that would be easier.

      75Mb/s is probably nothing to a site like Facebook. Let's assume $100/mbs which is probably high. That's $7,500/month. That's peanuts compared to their revenues which are in the hundreds of millions.

      --
      Dual Opteron < $600
    17. Re:Can they not use... by guyminuslife · · Score: 5, Funny

      That's nothing. This is the most disgusting shit you'll ever see on the Internet.

      --
      I don't believe in time. It's a grand conspiracy designed to sell watches.
    18. Re:Can they not use... by darkpixel2k · · Score: 1

      compression to shorten the URL's?

      And while we're at it, DNS traffic is starting to look a bit bulky. Forget dumping 'friendly' URLs, let's dump friendly domains too.

      From now on, please access Slashdot via http://216.34.181.45/

      --
      There's no place like ::1 (I've completed my transition to IPv6)
    19. Re:Can they not use... by Nutria · · Score: 3, Insightful

      Sure they can TinyURL

      No, because the long URL is still out there. For example: http://tinyurl.com/c9fjov translates into http://www.nerve.com/CS/blogs/scanner/2008/11/16-22/pervert.jpg.

      --
      "I don't know, therefore Aliens" Wafflebox1
    20. Re:Can they not use... by FredFredrickson · · Score: 4, Funny

      I won't lie. I was partly relieved, but partly dissapointed when I clicked that link.

      --
      Belief? Hope? Preference?The Existential Vortex
    21. Re:Can they not use... by Chabo · · Score: 1

      Only 4 bytes required, as long as we keep using IPv4!

      --
      Convert FLACs to a portable format with FlacSquisher
    22. Re:Can they not use... by x_MeRLiN_x · · Score: 4, Informative

      Using a cookie, TinyURL allows you to enable previews, i.e., view where a TinyURL points to before following the link.

    23. Re:Can they not use... by Anonymous Coward · · Score: 0

      Hell, I shit 75Mb/s in my sleep. And this is for all of facebook? Who gives a shit?

    24. Re:Can they not use... by dgatwood · · Score: 4, Insightful

      Depending on your network type, you may not get any benefit from shorter URLs at all. Many networking protocols use fixed-size frames, which then get padded with zeroes up to the end of the frame. For example, in ATM networks, anything up to 48 bytes is a single frame, so depending on where that URL occurs relative to the start of a frame, it's possible that it would take a 48 byte URL to cause even one extra frame to be sent.

      Either way, this is like complaining about a $2 budget overrun on a $2 billion project. Compared with the benefits of compressing the text content, moving all your scripts into separate files so they can be cached (Facebook sends over 4k of inline JavaScript with every page load for certain pages), generating content dynamically in the browser based on high density XML without all the formatting (except for the front page, Facebook appears to be predominantly server-generated HTML), removing every trace of inline styles (Facebook has plenty), reducing the number of style sheet links to a handful (instead of twenty), etc., the length of URLs is a trivial drop in the bucket.

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    25. Re:Can they not use... by Anonymous Coward · · Score: 0

      That's exactly what I did for a site I work on. We had lots of long query strings (some of which contain redirection URLs), so we tried deflating the whole of them, then using a modified base64-like encoding on the result. Turns out these compressed + base64'd URLs (remember: they still need to be passed somehow, so base64 or long hex-based url encoding are your options) have only been "shorter" about half the time or less. We got better "compression" by putting a character on the end to say "does this param contain an http://www./ prefix?" than we did trying to deflate things.

    26. Re:Can they not use... by dgatwood · · Score: 4, Informative

      And even with the wink, this still got initially moderated "Interesting" instead of "Funny".... *sigh*

      To clarify the joke for those who don't "GET" it, in HTTP, POST requests are either encoded the same way as GET requests (with some extra bytes) or using MIME encoding. If you use a GET request, the number of bytes sent should differ by... the extra byte in the word "POST" versus "GET" plus two extra CR/LF pairs and a CR/LF-terminated Content-length header, IIRC.... And if you use MIME encoding for the POST content, the size of the data balloons to orders of magnitude larger unless you are dealing with large binary data objects like a JPEG upload or something similar.

      So basically, a POST request just hides the URL-encoded data from the user but sends almost exactly the same data over the wire.

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    27. Re:Can they not use... by smellotron · · Score: 4, Informative

      You're missing the joke... GET requests look like this:

      GET /url?a=b&c=d HTTP/1.0

      POST requests look like this:

      POST /url HTTP/1.0
      a=b&c=d

      Same amount of content... URL looks shorter, but the exact same data as the querystring gets sent inside the request body. Thus, switching from GET to POST does not alter the bandwidth usage at all, even if it makes the URL seen in the browser look shorter.

    28. Re:Can they not use... by Anonymous Coward · · Score: 0

      Probably best to think the modder liked the joke so much, he decided to give it a karma raising score.

    29. Re:Can they not use... by travd · · Score: 1

      Deflate doesn't work well on shortish strings because (a) it has some 11 or bytes of overhead you pay no matter what (b) you are no likely to find a lot of long-run duplication in short strings (c) the relative cost of including the huffman tree in the output is large, so it is likely that a less efficient static variation will be chosen.

    30. Re:Can they not use... by JustKidding · · Score: 1

      she's kinda cute...

    31. Re:Can they not use... by Directrix1 · · Score: 1

      Is this order of maginitude 33% bigger?

      --
      Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
    32. Re:Can they not use... by Sparr0 · · Score: 2, Insightful

      How do you bookmark a specific lower level page if no variables are stored in the URL?

    33. Re:Can they not use... by Directrix1 · · Score: 1

      Why would they? URL size is miniscule compared to the page size.

      --
      Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
    34. Re:Can they not use... by Anonymous Coward · · Score: 0

      I won't lie. I was partly relieved, .

      Thank god we didn't see what would happen if you were fully relieved.

    35. Re:Can they not use... by mattwarden · · Score: 1

      I think this is one of the most witty comments I've ever read on slashdot. Kudos. If I had mod points, I'd use them.

    36. Re:Can they not use... by Anonymous Coward · · Score: 0

      My bad on the modding, meant to click insightful, not redundant.

      I agree completely with this comment. Unless the site is deliberately trying to obscure some data, it is poor design not to allow it to be indexed by URL, no matter how shiny the interface is.

    37. Re:Can they not use... by slashchuck · · Score: 1

      I had a client who wanted to save disk space by using smaller fonts

      --
      $sig not found
    38. Re:Can they not use... by Sneeka2 · · Score: 1

      Moderation system needs +1 Funny & Hitting the Nail on the Head

      --
      Bitten Apples are still better than dirty Windows...
    39. Re:Can they not use... by jesset77 · · Score: 1

      Who, Emilie Watson? (And watch the click-through rate soar! ;D)

      --
      People willing to trade their freedom of expression for temporary entertainment deserve neither and will lose both.
    40. Re:Can they not use... by Dan541 · · Score: 5, Funny

      http://tinyurl.com/6rywju

      Tiny url is not all bad, this is one example of a positive use.

      --
      An SQL query goes to a bar, walks up to a table and asks, "Mind if I join you?"
    41. Re:Can they not use... by Anonymous Coward · · Score: 0

      Implement your recommendation then try hitting "Back" in any mainstream browser.

      Have fun with that.

    42. Re:Can they not use... by asplake · · Score: 1

      He may also have been referring to the practice of "tunelling through POST" requests that would be much more appropriately done as GET (SOAP for example makes this too easy), thereby negating caches and the like. I have done the opposite, moving addressing parameters from payloads to URLs. If your payload is (for example) XML, this is much more space efficient and you get informative URLs as a nice byproduct.

    43. Re:Can they not use... by dextromulous · · Score: 1

      Consider a page that is full of URLs. Think about how many URLs are transmitted to your computer right now just to load this page. I count 1295 right now, just in tags.

      Personally, I'm not concerned, but if you want to see how many tags are in any page, paste this into your address bar and press Enter.

      javascript:alert(document.getElementsByTagName("A").length)

      I've only tried this in FF3, and of course URLs can be more places than in an href="" string of an tag...

      --
      There are two types of people in the world: those who divide people into two types and those who don't.
    44. Re:Can they not use... by rdebath · · Score: 1

      This is the site you were looking for.

    45. Re:Can they not use... by Anonymous Coward · · Score: 0

      compression to shorten the URL's?

      Yes,

      Use the gzip compression test on a typical web page
      http://www.gidnetwork.com/tools/gzip-test.php

      Slashdot for example is coming up as NOT compressed. A page load therefore takes 100 bytes.

      Turn on gzip (server side compression) and you reduce the typical page load (of text) to 20 bytes.

      So says the gzip test at least.

      With GZIP turned on, you're likely not going to notice a few extra characters in a long URL

    46. Re:Can they not use... by Anonymous Coward · · Score: 0

      Just throw a bunch of kiddies (or 21yr olds) in a room and offer them free pizza/beer whatever.....it'll get done.

      And Twitter was born!

    47. Re:Can they not use... by SenseiLeNoir · · Score: 1

      I cant see how a 160 char URL, is worse than the sometimes 2MB resulting page!

      --
      Have a nice day!
    48. Re:Can they not use... by Antique+Geekmeister · · Score: 1

      They should just use throw out all the personal pictures of fat people. By displaying only slim pictures, I'm sure they could save a great deal of bandwidth.

    49. Re:Can they not use... by dave420 · · Score: 1

      It's even worse than that - a POST request describes the length of the POST-ed data with a Content-Length header and includes a Content-Type header to describe the fact it has data in the request, so it takes up even more space than a GET.

    50. Re:Can they not use... by thomthom · · Score: 1

      Doh!

    51. Re:Can they not use... by Phreakiture · · Score: 1

      How about we get real? What fraction of the bigger bandwidth picture is this 75 Mb/sec? Is this equivalent to jettisoning a big bag of feathers in order to keep a balloon aloft?

      Anyway, if shortening the URLs is really going to save that much, then the headers need to be reined in, too -- they use far more bandwidth than the URL.

      Compression is probably not the best solution, rather, a lookup table would do a nice job. If the URLs were mapped to a base-64 number, (note, mapped to it, not converted to it) then the bandwidth required on the URL could be cut down pretty well. You would also remove all meaning from the URL.

      --
      www.wavefront-av.com
    52. Re:Can they not use... by FnH · · Score: 1

      You're wrong. It's compression. With a very large (and growing) dictionary on the TinyURL servers.

    53. Re:Can they not use... by Teun · · Score: 1

      Hey thanks, I knew Slashdot would one day bring me a useful tip :)

      --
      "The likes of Facebook and WhatsApp are free to those whose privacy is of zero value."
    54. Re:Can they not use... by PRMan · · Score: 1

      Aren't ethernet packets about 1500 characters in length?

      So, I'm guessing that a 150 vs. 50 difference in URL length makes exactly ZERO difference in the initial call of the page. Now, maybe logging and links within the page will chew up some bandwidth. Maybe that's what they were getting at, since Facebook pages have about a billion links each.

      --
      Peter predicted that you would "deliberately forget" creation 2000 years ago...
    55. Re:Can they not use... by jcwayne · · Score: 1

      Still on dial-up here. You wouldn't believe the savings from using HTTP://3626153261.

      --
      Failure to follow this advice may result in non-deterministic behavior.
    56. Re:Can they not use... by Anonymous Coward · · Score: 0

      I was just disappointing once I realized it was only a face.

    57. Re:Can they not use... by Anonymous Coward · · Score: 0

      If writing good HTML were easy, any monkey with a keyboard could do it. Instead, we have a near-infinite number of monkeys with keyboards writing bad HTML, with the occassional random good site thrown in.

    58. Re:Can they not use... by Directrix1 · · Score: 1

      The <a> tag is also used for internal document references (anchors) and javascript calls. So, your measurement is not completely valid. Also, this is slashdot. Where every single comment has like 8 links on it.

      --
      Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
    59. Re:Can they not use... by Ihmhi · · Score: 2, Funny

      I was disappointed, too. I was expecting a link to idle.slashdot.org.

    60. Re:Can they not use... by svank · · Score: 1

      This sure is a useful Greasemonkey script.

      (It's even better, in my opinion, if you comment out the block of code that replaces the link text with the real URL.)

    61. Re:Can they not use... by svank · · Score: 1
    62. Re:Can they not use... by malkir · · Score: 0

      The variables are there in the back end - consider the Google Maps method of linking to a specific map search... its all still maps.google.com until you click 'Link' to generate the URL equivalent.

      CakePHP has a tons of features to speed up this process - check it out!

    63. Re:Can they not use... by Anonymous Coward · · Score: 0

      But there are some \r\n inbetween there. Also it would make it impossible to bookmark a page. And at last, everytime you hit "back" in your browser it would prompt you whether you want to send the POST again. So POST would be a great disadvantage, not only from the traffic point of view.
      Except you're using Ajax and the visitor never leaves the one page he or she is on.

    64. Re:Can they not use... by tonycheese · · Score: 1

      I actually clicked that, and went, "Hey, I thought I was logged in..." and logged back in to see what you were trying to show me. Eventually I realized what was going on.

    65. Re:Can they not use... by Anonymous Coward · · Score: 0

      As for this story, I think whoever wrote it had this epiphany while they were stoned.

      To be honest, he's pretty much always like that. He tends to focus on tiny things that he thinks are so important while ignoring the really important stuff and, as a result, getting nowhere...

    66. Re:Can they not use... by TheLink · · Score: 1

      1) Compression is very useful. I helped save a lot of bandwidth at my prev company just by suggesting and helping to enable mod_gzip on just one server.

      2) Long urls might be wasting bandwidth, but they're "small potatoes" in the greater scheme of things. Just one big fat image could use more bandwidth than 100 urls, and typically be less compressible.

      --
    67. Re:Can they not use... by ncc74656 · · Score: 1

      From now on, please access Slashdot via http://216.34.181.45/

      strlen("216.34.181.45") > strlen("slashdot.org")

      Since HTTP 1.1 sends the requested hostname along with the path (it's how a server can offer up multiple websites from one IP address), your proposal, serious though it isn't, wouldn't actually save bandwidth.

      --
      20 January 2017: the End of an Error.
    68. Re:Can they not use... by darkpixel2k · · Score: 1

      From now on, please access Slashdot via http://216.34.181.45/

      strlen("216.34.181.45") > strlen("slashdot.org")

      Since HTTP 1.1 sends the requested hostname along with the path (it's how a server can offer up multiple websites from one IP address), your proposal, serious though it isn't, wouldn't actually save bandwidth.

      I noticed my screwup a few seconds after I posted. I was hoping no one else would.

      ...but this is slashdot.

      --
      There's no place like ::1 (I've completed my transition to IPv6)
    69. Re:Can they not use... by TheoMurpse · · Score: 1

      Your comment seems to imply that you don't know who she is. Here you go.

    70. Re:Can they not use... by Nicolay77 · · Score: 1

      That doesn't happen in Opera.

      --
      We are Turing O-Machines. The Oracle is out there.
    71. Re:Can they not use... by jgrahn · · Score: 1

      Small URLs are generally hashes and are hard to type accurately and hard to remember.

      I think it's the other way around. Super-long URLs tend to be alphanumeric garbage and/or long lists of undocumented CGI-style variables. Someone who cares about URLs as names for resources/documents in the original spirit of HTTP (e.g. the Wikipedia) will come up with much shorter names, like a short sentence.

    72. Re:Can they not use... by atraintocry · · Score: 1

      You can also add "preview" to the url, like

      http://preview.tinyurl.com/6rywju

      And if you let TinyURL store a cookie then you can turn previews on for everything.

      I like the greasemonkey script you linked to better. But turning on previews for all TinyURL links has the advantages of catching them if they're coming in from, say, a twitter client.

    73. Re:Can they not use... by dgatwood · · Score: 1

      The amount of expansion depends on the size of the data elements, but it wouldn't be at all surprising to see MIME encoding waste a hundred bytes or more to store a one or two digit number. MIME encoding increases the size of the data itself by a relatively small amount. The problem is that it also adds field boundaries between each data field that are usually HUGE relative to the size of the data. By contrast, the field boundary for a GET or URL-encoded POST message is a single byte containing the ampersand character....

      GET request (59 bytes total, 13 bytes for data fields):

      GET http: //www.example.com/x.cgi?name=Phil&score=3 HTTP/1.0

      POST request (373 bytes total, 326 bytes for data fields):

      POST http: //www.example.com/x.cgi HTTP/1.0 Content-Type: multipart/form-data; boundary=-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-323947234702342374243
      Content-Length: 209

      -3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3 23947234702342374243
      Content-Disposition: form-data; name="name"

      Phil
      -3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3-3 23947234702342374243 Content-Disposition: form-data; name="score"

      3

      (Added bogus spaces in URLs so Slashdot wouldn't try to create links out of them. Oh, and apologies for the extra spaces in the middle of the boundaries that Slashdot adds for line wrapping purposes. There's nothing I can do about them.)

      The boundary value used here is entirely made up, but it is in the same ballpark length-wise as some that I've seen generated by actual web browsers. I couldn't use a real one because Slashdot says it contains too many "junk" characters.... Yeah, that's kind of my point. :-) :-) :-)

      So no, I was not exaggerating when I said it can balloon by orders of magnitude. In this case, it would be more than two orders of magnitude if you only look at the form data portion of the request. Thus, as the number of fields grows relative to the URL and initial protocol negotiation (which make up a significant portion of this trivial request), the bloat would tend to converge towards that limit, give or take....

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    74. Re:Can they not use... by dgatwood · · Score: 1

      I should clarify that it would converge towards a couple of orders of magnitude for typical data, i.e. data fields containing just a few bytes apiece. That's why I said MIME should only be used if you are uploading large binary data blobs.

      It doesn't really make sense to use MIME encoding until the overhead of URL encoding (two extra bytes per binary byte) exceeds the total length of all of the boundaries (including the extra copy at the start where the boundary is being defined) plus the total length of all of the content-length/disposition/type noise, so probably on the order of sixty or seventy bytes of binary data per field on average plus an extra fifty or sixty bytes for good measure. (This is all half-asleep math, so take it with a grain of salt.)

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    75. Re:Can they not use... by dgatwood · · Score: 1

      Ethernet uses a variable packet size that can be up to 1500 bytes. I don't think it gets padded out to 1500 bytes. I could be wrong, though. I am not that familiar with what happens down at that layer in Ethernet, and I had to look up the packet size/constancy even for ATM to confirm that I was remembering that correctly....

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    76. Re:Can they not use... by will_die · · Score: 1

      Skip tinyURL, socuteURL is the place to be.
      For Example

    77. Re:Can they not use... by manif3st · · Score: 1

      Bitch.

      --
      http://www.collude.biz - Ignore this, it's for Project Honey Pot.
  2. Wordpress has the option by slummy · · Score: 5, Informative

    Wordpress by default allows you to configure URL writing. The default is set to something like: http://www.mysite.com/?p=1.

    For SEO purposes it's always handy to switch to the more popular example: http://www.mysite.com/2009/03/my-title-of-my-post.html.

    Suggesting that we cut URL's that help Google rank our pages higher is preposterous.

    1. Re:Wordpress has the option by Anonymous Coward · · Score: 0, Insightful

      Yeah, exactly.
      And since I've read somewhere that Wordpress isn't the best CMS for a high-traffic site, it doesn't really matter too much.

    2. Re:Wordpress has the option by blanks · · Score: 1

      The querystrings/all the get prams are still being passed, there just passed in a "visually" pleasant way for the user.

      All the data is still there meaning mod_rewrite dosen't help with the "bandwidth" issue at all. It just looks pretty.

    3. Re:Wordpress has the option by macraig · · Score: 0, Offtopic

      Unless your blog endures tens of millions of page hits every day, TFA authors weren't even talking to you. Can you say n-o-n s-e-q-u-i-t-u-r?

    4. Re:Wordpress has the option by Kugrian · · Score: 1

      Maybe one day soon Google will have some way to expand mysite.com/5sfg to mysite.com/my_title_of_my_post.html. Having said that, how much of the importance of pagerank (and similar techs) is based on the url rather than title tags or links to it?

    5. Re:Wordpress has the option by slummy · · Score: 1

      Truth. :)

    6. Re:Wordpress has the option by master5o1 · · Score: 1

      With mod-rewrite It can be simply http://example.com/1

      --
      signature is pants
    7. Re:Wordpress has the option by master5o1 · · Score: 1

      On something like .../1/edit/blah
      could be .../?page=1&action=edit&blah=blah
      Since only the /1/edit/blah is sent between the computer and server, the reduction of having /?page= + &action= + &blah= is a slight save in bandwidth...(assumption)

      --
      signature is pants
    8. Re:Wordpress has the option by Thinboy00 · · Score: 1

      The problem is I don't feel like going to Google when I could just change
      http://www.example.com/2009/3/foo.html
      to
      http://www.example.com/2009/3
      which will DTRT in Wordpress AFAIK unless your server is broken.

      --
      $ make available
    9. Re:Wordpress has the option by macraig · · Score: 0, Offtopic

      How exactly is my objective comment supposed to hurt your feelings?

      You weren't really considering mine, though, were you? You deliberately ran off to my personal blog to see what dirt you could dig up and, finding what you perceived as humiliating dirt, you then deliberately posted it here in an attempt to humiliate me. You didn't hurt my feelings much if any, but if I were a different person you might have.

      I didn't even look at your "homepage" at all; all I did was check the URL to confirm that it's not some Facebook-scale site. I didn't attempt to shame you, I merely pointed out the obvious fact that your site isn't relevant to the contents of TFA, and thus that your dismissive remark was also not relevant.

      Perhaps it is you who should be less reactive and recognize that people have feelings?

    10. Re:Wordpress has the option by Anonymous Coward · · Score: 0

      All the data is still there meaning mod_rewrite dosen't help with the "bandwidth" issue at all.

      Sure it does. Consider the following (fictitious) URI:

      http://example.com/query.php?q=slashdot&sort=alpha&coords=2.45,4.68

      mod_rewrite can rewrite it to this:

      http://example.com/query/slashdot/alpha/2.45,4.68

      You've just shaved off 18 characters.

    11. Re:Wordpress has the option by slummy · · Score: 1

      You are right about me being reactive, although "digging up dirt" isn't exactly what I was doing by quoting you off your public blog.

      The original intent of my comment was to point out the fact that shortening URLs could have a negative impact on a site pondering SEO.

    12. Re:Wordpress has the option by michaelhood · · Score: 1

      PageRank (tm), et al. - none.

      That said, Google does place a lot of value in determining the relevance of a page/URL on its' keyword contents (and/or lack thereof.)

    13. Re:Wordpress has the option by rufus+t+firefly · · Score: 1

      The querystrings/all the get prams are still being passed, there just passed in a "visually" pleasant way for the user. All the data is still there meaning mod_rewrite dosen't help with the "bandwidth" issue at all. It just looks pretty.

      Moving persistent data to sessions is probably a better way of doing it. My guess would be that only a few things are going to be passed from one page load to another. Whether you GET or POST the data, passing it during every request is a waste of time, not to mention more difficult to sanity check everything.

      --
      "He may look like an idiot, and talk like an idiot, but don't let that fool you. He really is an idiot." - Duck Soup
    14. Re:Wordpress has the option by Zerth · · Score: 1

      what, like putting <link rel="canonical" href="http://www.example.com/product.php?item=swedish-fish" /> in the HEAD section of a page to let search engines know what they should use over the actual link that brought them to your page?

      I think they do that already.

    15. Re:Wordpress has the option by bananaquackmoo · · Score: 1

      A rather large portion of it, actually...

    16. Re:Wordpress has the option by macraig · · Score: 1

      Sure, but you mentioned Wordpress and only Wordpress, and most Wordpress blogs don't have a scale that would warrant such a consideration. That was my point. My blog software allows that URL rewriting, too, as I think does Apache, but I don't use it.

      BTW, I knew my blog doesn't warrant the consideration, either (and it's self-hosted, so not even Wordpress). My observation wasn't motivated by my-blog-is-better-than-yours egotism or any sort of comparison.

      As for whether any huge site that's actually doing similar URL rewriting should be doing it at the expense of some bandwidth, there are reasonable arguments to be made both supporting and against it. I'd even toss in a "social responsibility" angle regarding the collective bandwidth cost. If everybody avoided URL rewriting, then it wouldn't be "unfair" to expect it; the SEO playing field would still be level.

    17. Re:Wordpress has the option by mattwarden · · Score: 1

      The SEO issue is secondary. The ?p=1 URL is not nearly as usable as the URL that indicates the page's content.

    18. Re:Wordpress has the option by cgenman · · Score: 3, Insightful

      Do we know what 75MBps as a percentage of total site traffic is? It seems like if that number is 1% or less, there would be more important areas to optimize. A little slack can be more valuable than bandwidth in a complex system.

    19. Re:Wordpress has the option by BRSQUIRRL · · Score: 1

      I'm sorry, but it seems ludicrous to me that Google would consider the URL (see Tim Berners-Lee's thoughts on what URLs should and should not contain) when ranking pages. Is there any proof or documentation that this assertion is true?

  3. Who knows? by esocid · · Score: 4, Funny

    Are forums (fora?) like these wasting bandwidth as well by allowing nerds, like myself, to banter about minutia (not implying this topic)? Discuss amongst yourselves.



    Read the rest of this comment

    --
    Absolute power corrupts absolutely. indymedia
    1. Re:Who knows? by phantomfive · · Score: 4, Insightful

      Seriously. No one better tell him about the padding in the IP packet header. A whole four bits is wasted in every packet that gets sent. More if it's fragmented. Or what about the fact that http headers are in PLAIN TEXT? Talk about a waste of bandwidth.

      In reality I think by watching one youtube movie you've used more bandwidth than you will on facebook URLs in a year.

      --
      Qxe4
    2. Re:Who knows? by PolygamousRanchKid+ · · Score: 1, Funny

      One man's waste is another man's treasure. Some say, "The world is my oyster." I say, "The world is my dumpster."

      Wasted bandwidth, indeed.

      --
      Schroedinger's Brexit: The UK is both in and out of the EU at the same time!
    3. Re:Who knows? by jd · · Score: 1

      I discussed it with myselves, but there was no agreement. Well, other than the world should use IPv6 or TUBA and enable multicasting by default.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    4. Re:Who knows? by cashman73 · · Score: 1

      If you're talking about wasting bandwidth, then I'd say most of these sites are wasting bandwidth over pointless discussions by tween girls over who's f*cking who, or by sophomoric guys debating who's penis is bigger. The length of the URL in these discussions seems rather minor, IMHO,...

    5. Re:Who knows? by Anonymous Coward · · Score: 0

      That depends on if URL in that sentence stands for "urinal/reproductive lance" or not.

      (posting AC as I've moderated on this thread)

    6. Re:Who knows? by Anonymous Coward · · Score: 0

      Dude, he's not worrying about YOUR bandwidth. He's saying *facebook* could shave 75mbits/sec off their bandwidth consumption by shortening URLs.

                Reagarding what slummy says, that's also very true. Dont' know about SEO, but I do like a nice-looking URL. This is where I was going to say something about Facebook saving so much cash... but I looked online and realize a dedicated 100mbit/sec line is like $600/month (quantity one, and I'm sure facebook buys in bulk.)

    7. Re:Who knows? by Tokerat · · Score: 1

      Win.

      --
      CAn'T CompreHend SARcaSm?
  4. Better way of doing it by Foofoobar · · Score: 4, Informative

    The PHPulse framework is a great example of a better way to do it. It uses one variable sent for all pages which it then sends to a database (rather than an XML page) where it stores the metadata of how all the pages interelate. As such, it doesn't need to parse strings, it is easier to build SEO optimized pages and it can increase page load times by 10 times over other MVC frameworks.

    --
    This is my sig. There are many like it but this one is mine.
    1. Re:Better way of doing it by kopo · · Score: 1

      You mean decrease?

    2. Re:Better way of doing it by Foofoobar · · Score: 2, Informative

      heh, yes. I was thinking increase numbers of pages loaded and decrease page load at same time. But yeah, the benchmarks are impressive. I even tried testing the system using it with a HEAP database just to see how fast it would be and there was no difference; all the data is indexed and cached so well that HEAP was pointless.

      Of course, it's a totally different paradigm that requires a database instead of XML for the page metadata. But what it enables in being able to relate the sections of the site to one another and the pages is a an increase in functionality and speed over conventional methodologies.

      --
      This is my sig. There are many like it but this one is mine.
    3. Re:Better way of doing it by Anonymous Coward · · Score: 0

      I fail to see how using one variable to define a page is considered SEO. Instead of naming files with keywords, you're replacing it with a variable. How is '/?area=89' any better than '/about/company' from a SEO perspective? Plus, it's way easier to remember '/about/company' than some arbitrary value in some variable.

    4. Re:Better way of doing it by mattwarden · · Score: 2, Insightful

      You mean that ?area=51 crap? How is http://mysite.com/?area=51 usable?

      (Unless the page is about government conspiracies, I guess.)

    5. Re:Better way of doing it by Foofoobar · · Score: 1

      Well to answer your question in a less snarky way, websites are heirarchical in nature and most frameworks try to use XML to load page data (or some other method based on strings) and to relate pages to one another... but can XML maintain relationships? Is XML reliable in checking key relationships? Not as well as a database. Databases rely on integers and enforce relationships. XML relys on strings and the relationships cannot be enforced.

      So the way the creators of PHPulse did it, is they recreate a heirarchical system in the database and use a key to relate to every webpage. This allows the system to dynamically create URL's and breadcrumbs from one key as that key has a parent which in turn has a parent and so on. This also allows a plethora of other functionality like dynamically managing user privileges, generation of sitemaps and so on.

      The system doesn't have to read 5 or 6 different XML sheets or parse a whole bunch of strings to get this information. It just queries precached information stored in memory and as a result is 10 times faster than conventional methods.

      --
      This is my sig. There are many like it but this one is mine.
    6. Re:Better way of doing it by corychristison · · Score: 2, Informative

      This is a very, very simple method. You seem to want to make it out to be the best thing in the world. The problem is, it needs some form of descriptive characteristic.

      In my own little personal CMS/framework I do it similarly, except with a 1-16 character string. This way I can set some form of description.

      It's really, very easy to do. Basically need a table with (id,parentid,page_title,page_content). parentid is the id of the parent section, leave NULL if it is the top level. This way you can seek in the DB with a simple SELECT * FROM `pages` WHERE `id`="aboutus" LIMIT 1.

      You can use parentid to form a breakcrumb to trace back to which section it relates to... this maintains hierarchy. An easier way is to do a select all and hold it in an array (or hash table -- depending on your language) to make it really speedy. Hell, skip the DB. Store it in a file as a serialized string. (for VERY low traffic sites, anyway)

      This method also works VERY well with URL rewriting.

    7. Re:Better way of doing it by Foofoobar · · Score: 1

      Don't use strings as keys in databases. They are far slower to index. In general you should never use a string as a key. You CAN do it... you just shouldn't especially on something that gets called that often.

      And I'm just impressed with it because I don't see any other framework doing this: Zend, Struts, Rails, Cake. They all use strings and/or XML which are far slower and can't maintain the page relationships.

      --
      This is my sig. There are many like it but this one is mine.
    8. Re:Better way of doing it by mattwarden · · Score: 1

      Sir, I'm getting the distinct impression that you don't know what you're talking about.

      > Don't use strings as keys in databases. They are far slower to index.

      What does that mean? A given row is slower to add to an index, or a search on an existing index is slower? Further, a search on 50 pages that you have on your website, probably will not even USE an index, even if it exists. If it does, the 0.001ms difference on a string vs integer lookup is going to make no difference... certainly not enough difference to forgo usability.

      > In general you should never use a string as a key.

      This is hogwash.

      > They all use strings and/or XML which are far slower and can't
      > maintain the page relationships.

      XML is just a format. I could just as easily do what you are describing in XML by taking the table format described in parent and create an XML format. Further, you don't even know if the DB is storing the data in XML in the background.

      I think you are excited over a whole lot of nothing (except loss of usability).

    9. Re:Better way of doing it by ukyoCE · · Score: 1

      To reiterate/summarize mattwarden's point, URLs that select paths/pages with numeric IDs are Very Bad. You gain a modicrum of bandwidth, and lose a huge amount of descriptiveness in the URL.

      The same goes for TinyURL. Unless you're writing URLs on paper, it's nonsense.

      If someone sends me a link, I don't blindly click on it. I read the link, and sometimes realize I know what it is, or that I don't want to go there. Any miniscule gains from having a short URL are overwhelmed by the benefit of having a user-readable URL.

    10. Re:Better way of doing it by Foofoobar · · Score: 1

      Well I don't know how to be any clearer; varchars(strings) have a variable length which causes them to take longer to return as well as requiring pattern matching if using as a key. All RDBMS's (and DBA's) will suggest using integers over anything else as keys as anything else is slower to search, slower to index, slower to return result sets, etc. All keys are indexed and when the RDBMS has to look up a string as a key, it has to do pattern matching which takes longer. An integer just requires the comparison operator. I have seen people use decimals and floats as primary keys as well and due to their variable length, they take longer to return result sets as well.

      If you think I am blowing hot air (as you obviously do), google it. I don't think you will find any professional who will tell you that using a string over an integer as a primary key is a good idea.

      --
      This is my sig. There are many like it but this one is mine.
    11. Re:Better way of doing it by mattwarden · · Score: 1

      As with every other absolute, it's always wrong. If you are choosing whether to add a new key field that is a varchar or a number, of course you use the number. But if you already have a varchar field that you must store and which is the business key, should you add a new field that is an integer? Maybe, maybe not.

      And you didn't address my point that the database probably won't even USE an index given the small set of data, which of course was the biggest point.

      > Well I don't know how to be any clearer

      Indeed.

    12. Re:Better way of doing it by Foofoobar · · Score: 1

      You are confusing the point... that integers make better keys than strings. You are NOW talking about an entirely different topic of surrogate keys vs. natural keys wherein a surrogate is usually always the better choice because the database maintains the control over the key and key relationships whereas natural keys are prone to human error, repetition, duplication, etc as they are based on an external system. For example, given your previous example wherein you used a webpage name as the key, can't two web pages have the same name? As such, you're natural key fails (as most do). The surrogate makes your system self contained and controlled.

      But I digress. As I mentioned, you confused the original topic which was about whether strings were as good as integers as keys. And they are not.

      My suggestion to you is to take a class on database administration. It really helps to round out your experience and I think you will find it helpful in the long run to avoid anymore errors like the ones you are running in to.

      --
      This is my sig. There are many like it but this one is mine.
    13. Re:Better way of doing it by mattwarden · · Score: 1

      > You are confusing the point...

      If I am, it's only the point you want to make, not the one I'm making.

      > that integers make better keys than strings. You are NOW talking about an
      > entirely different topic of surrogate keys vs. natural keys wherein a
      > surrogate is usually always the better choice because the database
      > maintains the control over the key and key relationships whereas natural
      > keys are prone to human error, repetition, duplication, etc as they are
      > based on an external system.

      Um, what? The database does not "keep control over the key". Your application does. In both cases. No matter what data type your key is. The only thing the database does is guarantee uniqueness in the column, and that does not change whether you use a string, an integer, or a date, or any combination of these.

      You're making a whole lot out of nothing, here. And you still didn't address the real point, which was that in your example the index would likely not even be used, because the data set is so small.

      > For example, given your previous example wherein you used a webpage name
      > as the key, can't two web pages have the same name?

      They can't in normal file systems! What requirement do you have for needing two different pages to have exactly the same name? You are grasping at straws, my friend.

      > As such, you're natural key fails (as most do).

      Sir, you just do not know what you're talking about. The natural key is the natural key no matter what. If the column(s) is not unique, then it is not the natural key. If you're saying the column is not unique, then of course you cannot use that as a unique key! But to say "the natural key is failing" makes no sense. If it is not unique, it was never a natural key. If it is a natural key, it will never have duplicates.

      You are doing what all "DBAs" do and throw numeric surrogate keys everywhere because it's easier than having to be an actual data modeler. But the data is the data is the data. You say "well, if I use a surrogate key, then I can have two pages with the same name." I say that is a data modeling error. Adding a surrogate key does not change the meaning of the rest of the data.

      > The surrogate makes your system self contained and controlled.

      It doesn't make anything any different. It is a decision of DATA TYPE. It is a column like any other column, and your application has full control over it whether it is an integer or a varchar or a date. You are making a lot out of nothing. Are you sure you understand what a primary key is? It is not an autonumber field in Access that magically populates the next available number. It's just a column that holds a piece of data, whatever that data might be and whatever data type it might be.

      > My suggestion to you is to take a class on database administration. It
      > really helps to round out your experience and I think you will find it
      > helpful in the long run to avoid anymore errors like the ones you are
      > running in to.

      Guess I should have done that before I wrote that book... thanks for the suggestion.

    14. Re:Better way of doing it by Foofoobar · · Score: 1

      Guess I should have done that before I wrote that book...

      It obviously wasn't about database development. Good luck with your next childrens book.

      --
      This is my sig. There are many like it but this one is mine.
    15. Re:Better way of doing it by mattwarden · · Score: 1

      > It obviously wasn't about database development

      Feel free to look it up.

      > Good luck with your next childrens book.

      All this back and forth and still no response to the main point, which was your entire argument is moot because the database optimizer will likely not use the index with such a small data set anyway. Therefore, removing usability for the sake of a faster index lookup is STUPID since there IS no index lookup. Your "rule of thumb" mentality that you still haven't given any real reason for (except "go google it!!!1") just shows you don't have any real understanding of how databases work, as well as how to think about the application as a whole rather than "how can I squeeze an extra 0.01ms out of this query and save 12KB on disk", which is decidedly oversimplistic.

      Instead you offer a bunch of childish jabs and "look at me, I'm the expert" positioning without anything to back it up. Thanks for being a typical slashdot idiot who is more interested in stroking his own ego by putting strangers down than responding to the issue in question.

      Now if you'll excuse me, I need to go buy those books you mentioned so I can be as "enlightened" as you are.

    16. Re:Better way of doing it by Foofoobar · · Score: 1

      For someone who believes database keys are controlled and maintained by an outside application, you sure do think you know alot about databases. Truly sad really.

      Sadly, I have answered all your questions; you just are so sadly misinformed (or ignorant) that you fail to see the answers. Keys are always indexed. Look it up. SQL Server doesn't index foreign keys (oddity) so you do have to force that RDBMS to index foreign keys. But in general, all keys are ALWAYS indexed when they are defined as keys. MySQL, DB2, SQL Server, Oracle, etc. I suppose somewhere in the config, you MIGHT be able to turn this off... but why? Whats the point of having keys that aren't indexed. It's like having a cow that doesn't give milk and you can' make hamburgers out of.

      The truly sad thing is you think you know something about databases and yet I'm having to explain these beginner concepts to you. You have my sympathy.

      --
      This is my sig. There are many like it but this one is mine.
    17. Re:Better way of doing it by Foofoobar · · Score: 1

      LOL. Knew it. Your 'contribution' to the book was on AJAX... had nothing to do with databases. And it's WROX... who pays crap and let's anyone publish. Your resume is bloated and shows your lack of experience (who lists HTML over and over). Case so VERY VERY closed, junior. You aren't even worth my time. This is what qualifies as a professional? I weep for your employers.

      --
      This is my sig. There are many like it but this one is mine.
    18. Re:Better way of doing it by mattwarden · · Score: 1

      Wow, this just keeps getting better and better.

      > Sadly, I have answered all your questions;

      I have asked you only one and you haven't answered it: what relevance does the speed of an index lookup have on a data set so small that the database optimizer will not choose to use the index?

      > Keys are always indexed. Look it up.

      You're talking out of your arse. First of all, I don't even know how this has any relevance, because I was not talking about whether or not the column was indexed.

      I was talking about -- assuming it IS indexed -- whether the database will USE the index or decide to perform a full table scan. With small unjoined data sets, it is LESS EFFICIENT to use the index. I'm sure that is difficult for you to grasp, because someone told you a "rule of thumb" that indexes make searches faster, but that's why we pay smart people to build our database optimizers for us, and you can just let it do its magic without understanding it in the least (which is good, because you clearly do not).

      Your knowledge of databases reads like a bulleted list in a beginner book intro. Perhaps some day you will discover that these rules of thumb are just that: rules of thumb. And when you begin to look into how databases actually operate, you begin to realize that there are a lot of exceptions to your rules of thumb.

    19. Re:Better way of doing it by Foofoobar · · Score: 1

      Like I said AJAX boy, 3 years experience (none of which was fulltime database development) does not make make you a DBA. Go learn something. You aren't worth my time. You are a waste of my time, junior.

      --
      This is my sig. There are many like it but this one is mine.
    20. Re:Better way of doing it by mattwarden · · Score: 1

      It's impressive that even in your childish personal attacks you can find room for significant inaccuracies.

      > AJAX boy

      ajax is a hobby I messed with on the side. I stopped right around the time people started calling it 'ajax' in 2005

      > 3 years experience

      10 years in June

      > none of which was fulltime database development

      Bzzt

      > does not make make you a DBA.

      I never claimed to be a DBA, nor do I ever want to be a DBA. If I want to know what I should set my Oracle heap size, I will ask a DBA. The fact that you think DBAs model data explains a lot.

      > You are a waste of my time, junior.

      I think if you were worried about wasting time, you would have answered my simple question about 7 replies ago, instead of consistently evading the question by employing personal attacks. (I guess that assumes you have the ability to answer it, though.)

    21. Re:Better way of doing it by Foofoobar · · Score: 1

      I never claimed to be a DBA, nor do I ever want to be a DBA.

      And it shows. Buh-bye

      --
      This is my sig. There are many like it but this one is mine.
    22. Re:Better way of doing it by mattwarden · · Score: 1

      The chapter I wrote on my own was on ajax, yes, because none of the other co-authors had ajax experience. I don't really care what your opinion of wrox is; they made the best offer so we selected them. If you're talking about the resume that is posted on my website, that is out of date by about 4 years because, as I say on that very page, I am not considering any offers of employment from people who just happen upon that page. If there is an opportunity that I am interested in, I send them my resume. I suppose I probably should just remove that page on my site, but other than morons on slashdot who can't answer simple database questions and would rather take cheap shots, I didn't really see a need to.

      Anyway, I'm sure I'll see you eventually. Whoever decided to employ someone of your maturity will likely need to call my company in at some point to clean up your mess. Until then, keep on with the cheap shots...

    23. Re:Better way of doing it by Foofoobar · · Score: 1

      Lots of excuses. But I don't expect anything else. And I doubt anyone with any credibility would hire you so why would I care. Keep working on that HTML there juniour.

      --
      This is my sig. There are many like it but this one is mine.
    24. Re:Better way of doing it by Foofoobar · · Score: 1

      Oh I'm sorry. That was your attempt at asking for a job wasn't it? Well send me your UPDATED resume. I do the interviewing for my team in the corporation where I work. Obviously, you lack the database experience (per your own confession) but I can see if there is a junior position for an HTML person.

      --
      This is my sig. There are many like it but this one is mine.
    25. Re:Better way of doing it by Foofoobar · · Score: 1
      And to recap, the best of your comments... (and true classics they are), you said:
      • strings work just as well as integers for keys with no significant loss in query speed. (I believe you used the term 'hogwash')
      • loss of speed in query loading has no effect on page loads (and in effect no effect on scalability)
      • keys are managed by outside applications rather than the RDBMS
      • indexing does not happen on primary keys

      I mean seriously... Wow. The company that pays you minimum wage must be lucky to have a 'genius' like you. We're all laughing our butts off here. I really have to thank you for providing our office with hours of enjoyment.

      --
      This is my sig. There are many like it but this one is mine.
    26. Re:Better way of doing it by mattwarden · · Score: 1

      I didn't say any of those, except for #3, which is true. There is nothing about primary keys that is "managed" by the RDBMS except for making sure the value is unique. This is called a constraint, and it is a function that the database performs on specified columns regardless of whether they are primary keys are not.

    27. Re:Better way of doing it by mattwarden · · Score: 1

      Congratulations on doing "interviewing" for your "team". Perhaps one day you will be responsible for hiring the people you are managing, as well as deciding how many people you need to execute your project. If you ever get to this point and need some advice, I'll try to lend you some based on my "3" years experience doing "HTML".

    28. Re:Better way of doing it by mattwarden · · Score: 1

      Jesus... 3 different replies over a span of almost 4 hours? Your ego must be pretty fragile.

    29. Re:Better way of doing it by Foofoobar · · Score: 1

      You actually keep track of the time and number of replies? Move out of your moms basement and get a boyfriend already.

      Actually, we're having fun laughing at you and taking turns responding.

      --
      This is my sig. There are many like it but this one is mine.
    30. Re:Better way of doing it by Foofoobar · · Score: 1

      Poor child doesn't understand the process. Maybe we should enlighten him. You see in BIG BIG companies, there is a chain and technically HR or uppermanagement does hiring. You'll understand this when you work for someone besides you relatives. How's that going by the way? Get any other jobs designing websites for your mom's friends? LOL.

      --
      This is my sig. There are many like it but this one is mine.
    31. Re:Better way of doing it by Foofoobar · · Score: 1

      OMIGOD! Now he's in denial! LOL. What a tard! Did you just pick up that HTML book, read two pages and start posting on Slashdot. Classic. You really should go back and read your posts (we compiled those from your previous posts junior). Is there some elementary school we should be notifying that you're playing hooky from?

      --
      This is my sig. There are many like it but this one is mine.
    32. Re:Better way of doing it by mattwarden · · Score: 1

      I hire the people I manage. HR does not know what people need to be on my team. Companies that work the way you are describing end up hiring people who don't fit. Sorry that you have to deal with that.

    33. Re:Better way of doing it by Foofoobar · · Score: 1

      LOL. Oh really? and how exactly does a contractors get to hire people? Hmmm? Wow. A bad liar as well as a bad developer. You aren't smart in any sense are you?

      --
      This is my sig. There are many like it but this one is mine.
    34. Re:Better way of doing it by mattwarden · · Score: 1

      "a contractors" do hire people in some cases. One of the people at our client is a contracted project manager, although I always thought that was a little odd. Anyway, I am not a contractor.

    35. Re:Better way of doing it by Foofoobar · · Score: 1

      Uh huh. Cause Deloitte says you are a contractor. God you're a bad liar. Seriously, who are you trying to fool? Each time you post you look more and more stupid. First you show you don't know what you are talking about, then you try to say 'I published a book' when you just contributed and it had nothing to do with databases which is what the discussion was about and then I call you out on your experience and you say 'I have 10 years experience' when you only graduated a couple years ago.

      Oh but thats the beginning. Then you say you lead a team and do hiring, and you're nothing but a contractor. You're pathetic. You're the classic 'NO HIRE' (which explains why you're still contracting). Hell, you have to put the stuff you did while in college on your resume... sad.

      Go ahead. Can't wait for the next response. Ought to be a classic like all the rest. Give it up and become the janitor you were always meant to be. Everything else is beyond your intellectual reach.

      --
      This is my sig. There are many like it but this one is mine.
    36. Re:Better way of doing it by mattwarden · · Score: 1

      You make a lot of assumptions based on half of the facts, just like your misunderstanding of simple data modeling concepts. Keep living your life by rules of thumb, and you will continue to look like a jackass when you come across the exceptions. Reading between the lines, it sounds like you are older than I am and have significantly less responsibility at your job, otherwise I can't reconcile why you would be so sure that I am overstating mine. Perhaps spending less time with personal attacks on slashdot and more time making yourself valuable would be more productive in getting you out of the cubicle farm that you are surely in. If you ever need a job, I've got a couple bullshit testing roles on my team, and I'd be willing to negotiate a rate upwards of $8/hr. It would be annoying to deal with your personality, so I would probably have an intern manage you.

    37. Re:Better way of doing it by Foofoobar · · Score: 1

      Nice dodge. I especially liked the way you took everything i just said about you and said it about me. How creative.

      Enjoy your contract job doing HTML. :)

      --
      This is my sig. There are many like it but this one is mine.
    38. Re:Better way of doing it by Foofoobar · · Score: 1
      The fact that you are still here and playing with us obviously means...
      1. I'm right (and have been right all this time)
      2. ... and we struck a nerve.

      Truth hurts don't it. Hey but at least you can design web pages for your mom's friends when your contract job ends soon. :)

      --
      This is my sig. There are many like it but this one is mine.
    39. Re:Better way of doing it by mattwarden · · Score: 1

      I'm not sure what I'd do if you accidentally made sense in one of your comments. I didn't repeat anything you said. But, I guess being accurate doesn't really matter.

    40. Re:Better way of doing it by Foofoobar · · Score: 1

      Uh huh. Whatever you say, Mathew. Your entertainment factor is dwindling, Mathew. C'mon, act like you know something about databases again; that was hilarious. Act like you can fire people and lead a team. We were cracking up over that one. Why not tell us all about how you were unemployed for two years after graduation? And how all you can hold down is a contracting job? Tell us how awesome you are that you are just a contractor? Tell us how 'lite that is. And how your resume consists of a task you did in college for the computer lab? LOL.

      Wow. You are uber lite dude. We all worship your l33t coding skillz Mathew. You are our idol.

      --
      This is my sig. There are many like it but this one is mine.
    41. Re:Better way of doing it by mattwarden · · Score: 1

      > Why not tell us all about how you were unemployed for two years
      > after graduation?

      Where are you getting your information? I have been employed by the same employer since day 1 after graduation.

      > And how your resume consists of a task you did in college for
      > the computer lab?

      I don't even know what this is referring to???

      You're still not making sense.

    42. Re:Better way of doing it by Foofoobar · · Score: 1

      Uh huh. Sure you don't. Caught in yet another lie huh? Classic.

      It's rather easy Mathew. Your name is freely given. Your picture is on the book you contributed AJAX for. And everything else is available in DATABASES. Something you know nothing about. LOL. From there we can tell you exaggerate your accomplishments, have an overblown ego and are consistently confrontational but wrong. No one would recommend you to save your life and not even the girl you had a crush on in college. And Deloitte is the best job you can get for someone who thinks he is so friggin smart.

      Sheesh, I've turned down the Microsoft Open Source Development Labs and have been offered a job at Google and that's all you can get. Sad really? You really are a pathetic human being. And that fake psychology degree didn't help you when padding your resume either did it. :)

      Do you have any friends?

      --
      This is my sig. There are many like it but this one is mine.
    43. Re:Better way of doing it by mattwarden · · Score: 1

      I'm not confused about how you can get information on me. Obviously I don't have a problem with that information being available. What I'm confused about is how you have such wrong information despite the correct information being freely available.

      Congratulations on turning down a job at Microsoft. I would too.

      I don't know what accomplishments you think are exaggerated. My resume is admittedly modest and I don't even list anything I've done in years. I have no reason to look for a job, so I have no reason to brag about my accomplishments. Even in this thread with you consistently saying I'm an "HTML coder" and a "contractor", I haven't talked about anything I've done in the last 4 years. I really don't care whether you think I'm "leet" or whatever.

    44. Re:Better way of doing it by Foofoobar · · Score: 1

      Admittedly modest and yet in your words, you have '10 years experience' (must have went back in time after graduating 3 years ago... hmmm.). And no reason to brag yet 'I wrote a book' (when you merely contributed). Sounds like an attempt to brag... a sad, lame and pathetic attempt but an attempt to brag nonetheless. LOL. Wow, you are a whole book full of contradictions. God this is hilarious.

      Please keep going. You're killing me. :)

      --
      This is my sig. There are many like it but this one is mine.
    45. Re:Better way of doing it by mattwarden · · Score: 1

      > Admittedly modest and yet in your words, you have '10 years experience'

      I don't know how correcting you is immodest. I guess just saying "you're wrong" would have been more modest. Anyway, if you have my resume, you already know how many years experience I have, unless you cannot do math.

      > And no reason to brag yet 'I wrote a book' (when you merely contributed).

      I don't think that's too big of a deal and nothing to brag about. I'm not sure what your deal is with the "you merely contributed" stuff. I was an author along with 3 other co-authors. We also had editors, publishers, graphic designers, marketers, and a whole host of other people who were on the book project. This is how book publishing works. I don't know if you think there are people out there who write books "all by themselves", but I can tell you there aren't, unless you're talking about indie releases that sell tens if not hundreds of copies.

      The book was 4 years ago and it's on my resume, so whether I mention it or not is irrelevant. The comment, to which you (not surprisingly) only paid half attention, was that I have not talked about anything I've done in the last 4 years even though you repeatedly call me an "HTML coder" and a "contractor". I don't really care what you think I've done over the period of time that isn't reflected on the resume you apparently spent time googling for.

      > Please keep going. You're killing me. :)

      Ok. It doesn't take any effort to respond to your misrepresentations and inaccuracies. I do wonder how you have so much time to come up with them, though.

    46. Re:Better way of doing it by Foofoobar · · Score: 1

      It doesn't take any effort to respond to your misrepresentations and inaccuracies. I do wonder how you have so much time to come up with them, though.

      LOL. It takes us a minute and 3 sentences to show how you discredit yourself. It takes you 20 minutes and 4 paragraphs of bullshit to come up with an excuse to justify your lies. Who's the real person with something to prove? I'm putting my money on the person who just graduated and is merely a contractor.

      --
      This is my sig. There are many like it but this one is mine.
    47. Re:Better way of doing it by mattwarden · · Score: 1

      > I'm putting my money on the person who just graduated and is merely
      > a contractor.

      And who is that?

    48. Re:Better way of doing it by Foofoobar · · Score: 1

      Not too bright are you? Can't put two and two together now. Getting dumber by the second since you graduated. What kind of community college was that?

      --
      This is my sig. There are many like it but this one is mine.
    49. Re:Better way of doing it by mattwarden · · Score: 1

      Sorry, all your inaccuracies are confusing.

    50. Re:Better way of doing it by Foofoobar · · Score: 1

      I can see how confusing living a lie can be confusing. I'll bet your easily confused alot. I also love the fact how your responses have become far less verbose the second I gave you shit about them. Apparently I do know how to motivate you. There's hope for you yet, HTML BOY.

      --
      This is my sig. There are many like it but this one is mine.
    51. Re:Better way of doing it by mattwarden · · Score: 1

      > I can see how confusing living a lie can be confusing. I'll bet
      > your easily confused alot.

      I get particularly confused when people can't structure a sentence.

      It's "you're" and "a lot", by the way. Hope this helps.

    52. Re:Better way of doing it by Foofoobar · · Score: 1

      LOL. Attacking typos. The last refuge of the desperate.

      --
      This is my sig. There are many like it but this one is mine.
  5. Depending on your viewpoint by markov_chain · · Score: 5, Insightful

    The short Facebook URLs waste bandwidth too ;)

    --
    Tsunami -- You can't bring a good wave down!
    1. Re:Depending on your viewpoint by Dreen · · Score: 1

      I very much like this being Insightful =)

    2. Re:Depending on your viewpoint by FooBarWidget · · Score: 3, Informative

      I've always found stories along the lines of "$ENTITY wastes $X amount of $RESOURCE per year" dubious. Given enough users who each use a piece of $RESOURCE, the total amount of used resources will always be large no matter how little each individual user uses. There's no way to win.

    3. Re:Depending on your viewpoint by Anonymous Coward · · Score: 1, Funny

      I also like his username.

    4. Re:Depending on your viewpoint by jd · · Score: 1

      For most users, anything they can access on Facebook is already present on 127.0.0.1.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    5. Re:Depending on your viewpoint by Minwee · · Score: 1

      There's no way to win.

      Of course there is. It's a strange game, and the only way to win is not to play.

    6. Re:Depending on your viewpoint by Directrix1 · · Score: 1

      Professor Falcon?

      --
      Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
    7. Re:Depending on your viewpoint by KahabutDieDrake · · Score: 1

      I have to throw my 2 cents here. Nothing facebook is doing is useful. So therefore, everything facebook is doing is wasting bandwidth. I think the point here is that not only are they wasting bandwidth, but they are paying for it, so what is the problem?

    8. Re:Depending on your viewpoint by noidentity · · Score: 1

      Yeah, my mother always used to use this to argue against some family cost. "What, $1 a month? That's $12 a year, $120 in ten years!!! Think of what you could do what that amount of money!" She never seemed to be able to grasp comparing some cost against ALL THE OTHER costs for that same time period. When you compare $1 with $1000+ in living expenses every month, it's a miniscule 0.1%.

    9. Re:Depending on your viewpoint by Anonymous Coward · · Score: 0

      not just the URLs

    10. Re:Depending on your viewpoint by TheRaven64 · · Score: 1

      Yes, can we have these expressed as a percentage? 75Mb/s is, what, 1%? 0.1%? of the total bandwidth used by these sites? Less? I'd suggest that more aggressive cache headers for a single commonly-loaded image could save a lot more bandwidth for any of these sites than any amount of URL munging.

      --
      I am TheRaven on Soylent News
    11. Re:Depending on your viewpoint by DarkOx · · Score: 1

      I fail to "grasp" the relevance of your comment. Were you suggesting you were so rich that it was never worth sacrifice for a .1pct of the buget?

      I could a cup of coffe for $4 at Starbucks on the way into the office every morning. Its a in the corse of and in the corse of a month its still a vanishinly small number compared to say my house payment, but its still $1040 a per year. Seems to me I could buy some nice kit for that. Its worth considering. Just because you spend more of it somewhere else does not remove the value of the other money!

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
    12. Re:Depending on your viewpoint by ukyoCE · · Score: 1

      There should be a law, like Occam's Razor, about this.

      Multiple anything by a big enough number, and you end up with a big number.

    13. Re:Depending on your viewpoint by Anonymous Coward · · Score: 0

      I fail to "grasp" the relevance of your comment. Were you suggesting you were so rich that it was never worth sacrifice for a .1pct of the buget?

      I could a cup of coffe for $4 at Starbucks on the way into the office every morning.

      A cup of coffee at Starbucks costs $1.50 ($1.35 with black card, $1.25 with black card and own cup), NOT $4.00. Please don't speak of things you know not.

    14. Re:Depending on your viewpoint by noidentity · · Score: 1

      The point is that if your goal is to noticeably reduce monthly expenses, you don't focus on 0.1% contributions. Same as when optimizing a program; you first find the biggest contributors.

    15. Re:Depending on your viewpoint by Anonymous Coward · · Score: 0

      Please don't speak of things you know not.

      http://www.mchodessa.com/bistromenu/starbucksmenu.pdf

      I know this is just one menu for one company that happens to use the Starbucks logo..... BUT

      note the price for a Venti CARAMEL MACCHIATO. Just saying:

      Please don't speak of things you know not.

  6. Wordpress? by BradyB · · Score: 3, Informative

    By default Wordpress produces short urls.

    --

    Good is never enough, when you dream of being great!
  7. Waste of effort by El_Muerte_TDS · · Score: 4, Interesting

    Of all things that could be optimized, urls shouldn't have a high priority (unless you want people to enter them manually).
    I'm pretty sure their HTML, CSS, and javascript could be optimized way more than just their urls.
    But rather than simply sites, people often what it to be filled with crap (which nobody but themselves care about).

    ps, that doesn't mean you should try to create "nice" urls instead of incomprehensible url that contain things like article.pl?sid=09/03/27/2017250

    1. Re:Waste of effort by JCY2K · · Score: 5, Insightful

      Of all things that could be optimized, urls shouldn't have a high priority (unless you want people to enter them manually). I'm pretty sure their HTML, CSS, and javascript could be optimized way more than just their urls. But rather than simply sites, people often what it to be filled with crap (which nobody but themselves care about).

      ps, that doesn't mean you should try to create "nice" urls instead of incomprehensible url that contain things like article.pl?sid=09/03/27/2017250

      Of all things that could be optimized, urls shouldn't have a high priority (unless you want people to enter them manually). I'm pretty sure their HTML, CSS, and javascript could be optimized way more than just their urls. But rather than simply sites, people often what it to be filled with crap (which nobody but themselves care about).

      ps, that doesn't mean you should try to create "nice" urls instead of incomprehensible url that contain things like article.pl?sid=09/03/27/2017250

      To your ps, most of that is easily comprehensible It was an article that ran today; only the 2017250 is unmeaningful in itself. Perhaps article.pl?sid=09/03/27/Muerte/WasteOfEffort would be better but we're trying to shorten things up.

    2. Re:Waste of effort by krou · · Score: 5, Interesting

      Exactly. If they wanted to try optimize the site, they could start looking at the number of Javascript files they include (8 on the homepage alone) and the number of HTTP requests each page requires. My Facebook page has *20* files getting included alone.

      From what I can judge, a lot of their Javascript and CSS files don't seem to be getting cached on the client's machine either. They could also take a look at using CSS sprites to reduce the number of HTTP requests required by their images.

      I mean, clicking on the home button is a whopping 726KB in size (with only 145 KB coming from cache), and 167 HTTP requests! Sure, a lot seem to be getting pulled from a content delivery network, but come on, that's a bit crazy.

      Short URIs are the least of their worries.

      --
      'If Christ had tweeted the sermon on the mount, it might have lasted until nightfall.' - John Perry Barlow
    3. Re:Waste of effort by krou · · Score: 1

      * Small clarification: by 20 files, I mean 20 Javascript and CSS files.

      --
      'If Christ had tweeted the sermon on the mount, it might have lasted until nightfall.' - John Perry Barlow
    4. Re:Waste of effort by HeronBlademaster · · Score: 4, Informative

      This very type of analysis is what YSlow is for :)

    5. Re:Waste of effort by gknoy · · Score: 1

      Depending on the link density of one's pages that are actually served out to users, the bits used by the links themselves might be a large proportion of the page that is served. Yes, there's other stuff (images, javascript), but from the server's perspective those might be served someplace else -- they're just naming them. If the links can be shortened, especially for temporary things not meant to be indexed, it can save some bandwidth.

      I'm not saying it's a primary way to save bandwidth, just that it's an interesting one.

    6. Re:Waste of effort by Anonymous Coward · · Score: 0

      Not even that, just ditch the whole article.pl crap.
      Hell, get rid of all nonsense "page links", just use virtual directories on server-end.

      All it needs to be is subdomain.slashdot.org/value, where value can either be an article ID, or whatever else lays beyond the "/" (seriously, I'm not even sure anymore)

      Also, what's the point in having those dates there?
      Does it even do anything functional? Whoopty doo if it was posted today, nobody should ever need to know that just by looking at the URL, it is pointless.

    7. Re:Waste of effort by Anonymous Coward · · Score: 0

      Yeah, FB really should be using a single sprite for all their graphics. Maybe they figure it just doesn't matter since there's about 400 user uploaded pictures on each page anyway.

    8. Re:Waste of effort by Anonymous Coward · · Score: 0

      When I saw the 'yahoo.com' next to 'YSlow', I thought it was another low blow aimed at yahoo..

    9. Re:Waste of effort by Zadaz · · Score: 1

      Why don't we also process outgoing HTML to get rid of tabs, linefeeds, and more than one concurrent space? It's all just wasting bandwidth!

      And before you think I'm being facetious, there was a reasonably popular and just as ludicrous movement to do just this in the late 90's. Back when we were going to run out of bandwidth by 2003. One place I worked (as a freelance HTML coder) was religious about this. No page went live without going through a script that stripped rendered characters and, even image names and links were shortened to 3.3 characters. Until they lost the human-readable source and link database for one site and had to essentially de-obfuscate the entire site by hand.

    10. Re:Waste of effort by SethJohnson · · Score: 1



      I was there with you in the late nineties. I had a client perpetually complaining because our web publishing product would insert an extra carriage return into every HTML page whenever an SSI was performed. Annoying.

      Seth

    11. Re:Waste of effort by JCY2K · · Score: 1

      The benefit is navigability. I can't guess at what article number something is but (gods willing) I can pull up the directory listing of sitewhatever.com/archives/2009/03/27/ and find what article I want. Boingboing is an excellent example. Although I suppose googling would be easier and probably faster to boot...

    12. Re:Waste of effort by xouumalperxe · · Score: 1

      Of all things that could be optimized, urls shouldn't have a high priority (unless you want people to enter them manually). I'm pretty sure their HTML, CSS, and javascript could be optimized way more than just their urls.

      Out of curiosity, I just tried saving the discussion attached to this comment. That yielded 375KiB worth of data. That makes the rather heinous 108-character URL that links to it amount to under 0.03% of the total data transfered. Assuming everything but the HTML was cached, it's still a 40 KiB download, so the URL represents 0.28% of the total data. Forget Knuth's prematurity, misguided optimization is indeed the root of all evil.

    13. Re:Waste of effort by DKlineburg · · Score: 1

      I say we optimize Flash out of websites. . . .

      --
      Memory is deceptive because it is colored by today's events. - Albert Einstein
  8. Irrelevant by Skal+Tura · · Score: 5, Insightful

    It's irrelevantly small portion of the traffic, while at the scale of Facebook, it could save some traffic, but does not make any impact on the bottomline worthwhile the effort!

    150 chars long url = 150 bytes VS 50KILObytes + Images of rest of the pageview....

    I'm throwing out of my head that 50kilobytes for the full page text, but a pageview often runs at over 100kb.

    So it's totally irrelevant if they can shave off the 100kb a whopping 150bytes.

    1. Re:Irrelevant by CannonballHead · · Score: 1

      ya. i hav better idea. ppl shuld just talk in txt format. saves b/w. and whales. l8r

      Seriously, though, I don't exactly get how a shorter URL is going to Save Our Bandwidth. Seems like making CNET articles that make you click "Next" 20 times into one page would be even more effective. ;)

      The math, for those interested:

      So to calculate the bandwidth utilization we took the visits per month (1,273,0004,274) and divided it by 31. Giving us 41,064,654. We then multiplied that by 20, to give us the transfer in kilobytes per day of downstream waste, based on 20k of waste per visit. This gave us 821293080, which we then divided by 86400 which is the number of seconds in a day. This gives us 9505 kilobytes per second, but we want it in kilobits, so we multiply it by 8. Giving us 76040, finally we divide that by 1024 to give us the value in MBits/sec. Giving us 74Mbit/sec. One caveat with these calculations is that we do not factor in gzip compression. Using gzip compression, we could safely divide the bandwidth wasting figures by about 50%. Browser caching does not factor in the downstream values, as we are calculating the waste just on the HTML file. It could impact the upstream usage as not all objects maybe requested with every HTML request.

    2. Re:Irrelevant by Skal+Tura · · Score: 3, Interesting

      So to calculate the bandwidth utilization we took the visits per month (1,273,0004,274) and divided it by 31. Giving us 41,064,654. We then multiplied that by 20, to give us the transfer in kilobytes per day of downstream waste, based on 20k of waste per visit. This gave us 821293080, which we then divided by 86400 which is the number of seconds in a day. This gives us 9505 kilobytes per second, but we want it in kilobits, so we multiply it by 8. Giving us 76040, finally we divide that by 1024 to give us the value in MBits/sec. Giving us 74Mbit/sec. One caveat with these calculations is that we do not factor in gzip compression. Using gzip compression, we could safely divide the bandwidth wasting figures by about 50%. Browser caching does not factor in the downstream values, as we are calculating the waste just on the HTML file. It could impact the upstream usage as not all objects maybe requested with every HTML request.

      roflmao! I should've RTFA!

      This is INSULTING! Who could eat this kind of total crap?

      Where the F is Slashdot editors?

      Those guys just decided per visit waste is 20kb? No reasoning, no nothing? Plus, they didn't see on pageviews, just visits ... Uh 1 visit = many pageviews.

      So let's do the right maths:
      41,064,654 visits
      Site like Facebook would probably have around 30 or more pageviews per visit. let's settle for 30.

      1,231,939,620 pageviews per day.

      150 average length of url. Could be compressed down to 50. 100 bytes to be saved per pageview.

      123,193,962,000 bytes of waste, 120,306,603Kb per day, or 1392Kb per sec.

      In other words:
      1392 * 8 = 11136Kbps = 10.875Mbps.

      100Mbps guaranteed costs 1300$ a month ... They are wasting a whopping 130$ a month on long urls ...

      So, the RTFA is total bullshit.

    3. Re:Irrelevant by Chyeld · · Score: 0, Troll

      ya. i hav better idea. ppl shuld just talk in txt format. saves b/w. and whales. l8r

      times 17.3.84 bb speech malreported africa rectify

      times 19.12.83 forecasts 3 yp 4th quarter 83 misprints verify current issue

      times 14.2.84 miniplenty malquoted chocolate rectify

      times 3.12.83 reporting bb dayorder doubleplusungood refs unpersons rewrite
      fullwise upsub antefiling

    4. Re:Irrelevant by Anonymous Coward · · Score: 0

      Yes, you stole my idea.

      Seriously this article is ridiculous. Premature optimization is the root of all evil.

    5. Re:Irrelevant by Anonymous Coward · · Score: 0

      Consider caching. Your browser is going to ask facebook whether or not it has a newer version of an image file every time it loads a page with that file. Even when the file is not out of date and all that the facebook server sends back is the last modification date. In that scenario a very long URL could actually be the largest part of the http transaction.

    6. Re:Irrelevant by Anonymous Coward · · Score: 0

      Also consider that each HTTP request contains the browser's user-agent string. That's 93 chars long for the browser I'm currently using. URLs are twice as bad though, because they're in the GET line and also in the "referer" header. People who are bothered by the length and repetitive use of these strings should probably not use HTTP. It's a very chatty protocol.

    7. Re:Irrelevant by EvanED · · Score: 1

      100 bytes to be saved per pageview.

      Keep in mind that the URL in the request is not the only place you'd get savings. Let's say the average page contains 9 links that can each be shortened by 100 bytes. You're an order of magnitude off then -- they'd save $1300.

      In actuality, 9 is pathetically low and I'd estimate the main page of Facebook when I sign in has closer to 150, but saving 100 bytes is also very high in that case, since most of those URLs aren't even close to being that long, so these things probably roughly balance each other out.

      Evan

    8. Re:Irrelevant by BikeHelmet · · Score: 1

      It might not save anything if the packets are the same size.

      I think sites like Youtube waste more bandwidth.

      1) Watch vid.
      2) Post comment. (Page reloads)
      3) Vid re-downloaded. (in part, or in full)

    9. Re:Irrelevant by Anonymous Coward · · Score: 4, Informative

      You missed the previous paragraph of the article where they explained where they got the 20k value, perhaps you should read the article first. :)

      They rounded down the number of references, but on an average Facebook home.php file there are 250+ HREF or SRC references in excess of 120 characters. They took that these could be shaved by 80 bytes each. Thats 80 bytes x 250 references = 20,000 bytes or 20k.

      Your math is wrong, its taking into account just one URL, when there are 250 references on home.php alone! They did not even factor in more than one page view per visit. If they did it your way, you would be looking at far more bandwidth utilization that 74MBit/sec.

    10. Re:Irrelevant by Anonymous Coward · · Score: 0

      But you haven't considered that all the links would have these short urls too, thus saving even more bandwidth ;)

    11. Re:Irrelevant by Mozk · · Score: 1

      11136Kbps = 10.875Mbps

      11136 kb/s = 11.136 Mb/s

      Bitrates are measured in thousands.

      Get your k's, K's, Ki's, b's, and B's straight.

      --
      No existe.
    12. Re:Irrelevant by Tokerat · · Score: 1

      Two words: Proxy cache.

      facebook.com/home.php != facebook.com/home.php?ref=logo

      --
      CAn'T CompreHend SARcaSm?
    13. Re:Irrelevant by Skal+Tura · · Score: 1

      NOoOOOOooooo!

      IT Thousand = 1024

    14. Re:Irrelevant by Mozk · · Score: 1

      Conventionally, clock rates, bitrates, bandwidths, and other networking and data rates are understood to use decimal prefixes (i.e., 1 kb/s = 1000 b/s), RAM is understood to use binary prefixes (i.e., 1 MB/MiB = 1024 kB/KB/KiB), and storage devices use both in different places (the OS or the packaging).

      Random sources that back this up:
      http://www.speedguide.net/read_articles.php?id=115
      http://www.pcguide.com/intro/fun/bindec-c.html
      http://www.cknow.com/refs/BitsBytesandMultipleBytes.html
      http://en.wikipedia.org/wiki/Bit_rate#Prefixes
      http://en.wikipedia.org/wiki/Binary_prefix#Usage_notes

      --
      No existe.
    15. Re:Irrelevant by Mozk · · Score: 1

      (Replying to myself)

      As a sidenote, I'll say that it is indeed confusing. It's probably best to use units and prefixes recognized as the standard, though. In addition, most standards organizations recommend putting a space between measurements and their units (e.g., 4 kb/s and not 4kb/s).

      In any case I think the most important thing you missed is that b is bit and B is byte, though you did multiply by 8. Just get the units right.

      --
      No existe.
  9. imho by Rikiji7 · · Score: 1

    Those FacebookApps that spread through innocent people's notifications waste more.

    --
    slashwhat?
  10. Facebook? Go after Twitter. by Anaplexian · · Score: 2, Interesting

    Twitter clients (including the default web interface) auto-tinyURL every URL put into it. Clicking on the link involves not one but *2* HTTP GETs and one extra roundtrip.

    How long before tinyurl (and bit.ly, ti.ny, wht.evr...) are cached across the internet, just like DNS?

  11. Most likely insignificant by nysus · · Score: 3, Informative

    This is ridiculous. If I have a billion dollars, I'm not going to worry about saving 50 cents on a cup of coffee. The bandwidth used by these urls is probably completely insignificant.

    --

    ---Technology will liberate us if it doesn't enslave us first.

    1. Re:Most likely insignificant by Psychotria · · Score: 1

      That's a funny way to look at it. If I save 50 cents a day on my cup of coffee I will have another billion dollars in just 5479452 years (roughly). And that's excluding compound interest!

    2. Re:Most likely insignificant by scdeimos · · Score: 4, Interesting

      I think the O3 article and the parent have missed the real point. It's not the length of the URL's that's wasting bandwidth, it's how they're being used.

      A lot of services append useless query parameter information (like "ref=logo" etc. in the Facebook example) to the end of every hyperlink instead of using built-in HTTP functionality like the HTTP-Referer client request headers to do the same job.

      This causes proxy servers to retrieve multiple copies of the same pages unnecessarily, such as http://www.facebook.com/home.php and http://www.facebook.com/home.php?ref=logo, wasting internet bandwidth and disk space at the same time.

    3. Re:Most likely insignificant by Anonymous Coward · · Score: 0

      yeah, it be far more worthwhile to optimize the webpage as a whole. why try to optimize the 1% of the data tranfer when they can optmize the part that really matters.

      this is of course assuming the bandwidth cost savings would be worth the cost to assign people to optimize and test it which I doubt it.

      But your analogy is pretty bad where saving $0.50 cents can mean alot when you selling in the millions. 1 million sales of coffee (for a large chain) would mean the difference of $500,000 dollars.

      Bandwith is a bit different where you have to optmize quite a bit to recoup loss from assigned man hours due to low bandwith costs.

    4. Re:Most likely insignificant by hldn · · Score: 1
      --
      http://www.accountkiller.com/removal-requested
    5. Re:Most likely insignificant by XanC · · Score: 2, Insightful

      You can't ever rely on the HTTP-Referer header to be there. Much of the time, it isn't; either the user has disabled it in his browser, or some Internet security suite strips it, or something. I'm amazed at the number of sites that use it for _authentication_!

    6. Re:Most likely insignificant by jd · · Score: 1, Offtopic

      Just how interesting are the compounds in coffee, anyway?

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    7. Re:Most likely insignificant by Plaid+Phantom · · Score: 1

      As I understand it, caffiene is actually very popular nowadays.

      --
      All comments are properties and trademarks of the voices in my head. Not like I'm gonna claim them.
    8. Re:Most likely insignificant by patternmatch · · Score: 1

      Ref tagging of that nature is not designed to track what page someone is coming from (which is what the HTTP referrer header gives you), but rather to track where users are clicking on the page. Did they go to their home page by clicking on the logo, or by clicking somewhere else? This can be useful usability data.

    9. Re:Most likely insignificant by jd · · Score: 1

      Celebrities are popular but exceedingly boring, so popularity may not be a good measure.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    10. Re:Most likely insignificant by Tokerat · · Score: 1

      ...and you have hit the nail on the head. Well put.

      --
      CAn'T CompreHend SARcaSm?
    11. Re:Most likely insignificant by Anonymous Coward · · Score: 0

      An interesting analogy, to be certain.

      If you're the CEO of a company that serves coffee to its employees, perhaps you'll be interested in saving 50 cents on each cup - then you'll reach your next billion even quicker!

      For the record, I think the argument is bogus on the website, but the point here is scale is a bitch.

  12. Really? by kenh · · Score: 1

    How many times are the original pages called? Is this really the resource hog?

    What about compressing images, trimming them to their ultimate resolution?

    How about banishing the refresh tags that cause pages to refresh while otherwise inactive? Drudgereport.com is but one example where the page refreshes unless you browse away from it...

    If you really want to cut down on bandwidth usage, eliminate political commenting and there will never be aneed for Internet 2!

    --
    Ken
    1. Re:Really? by jd · · Score: 1

      Replacing all the images with random links to adult sites would save considerable bandwidth and I doubt the users would notice the difference.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    2. Re:Really? by blackest_k · · Score: 1

      That refresh rate can be a killer, i've a spike in my bandwidth where the refresh was set to once a second, whoops.

      I guess the biggest waste is loading images then rescaling them in the browser. Another good one often missed is using a single line instead of a block of color I took a banner that was plain on the left and detailed on the right and set the background color of the div to take the 1 line jpeg and floated the main part of the image right, resizing the column width became easier no loss of impact and a much smaller image file. Another fairly obvious thing is to separate the content from the styling with style sheets, although that is more a time safer than a bandwidth saver when it comes to changing a site.
       

  13. Wow. Just wow. by NerveGas · · Score: 3, Informative

    75 whole freaking megabits? WOWSERS!!!!

    They must be doing gigabits for images, then. Complaining about the URLs is complaining about the 2 watts your wall-wart uses when idle, all the while using a 2kW air conditioner.

    --
    Oh, you're not stuck, you're just unable to let go of the onion rings.
  14. Mental Masturbation by JWSmythe · · Score: 5, Insightful

        This is a stupid exercise. Oh my gosh, there's an extra few characters wasted. They're talking about 150 characters, which would be 150 bytes, or (gasp) 0.150KB.

        10 times the bandwidth could be saved by removing a 1.5KB image from the destination page, or doing a little added compression to the rest of the images. The same can be said for sending out the page itself gzipped.

        We did this exercise at my old work. We had relatively small pages. 10 pictures per page, roughly 300x300, a logo, and a very few layout images. We saved a fortune in bandwidth by compressing the pictures just a very little bit more. Not a lot. Just enough to make a difference.

        Consider taking 100,000,000 hits in a day. Bringing a 15KB image to 14KB would be .... wait for it .... 100GB per day saved in transfers.

        The same can be said for conserving the size of the page itself. Badly written pages (and oh are there a lot of them out there) not only take up more bandwidth because they have a lot of crap code in them, but they also tend to take longer to render.

        I took one huge badly written page, stripped out the crap content (like, do you need a font tag on every word?), cleaned up the table structure (this was pre-CSS), and the page loaded much faster. That wasn't just the bandwidth savings, that was a lot of overhead on the browser where it didn't have to parse all the extra crap in it.

        I know they're talking about the inbound bandwidth (relative to the server), which is usually less than 10% of the traffic. Most of the bandwidth is wasted in the outbound bandwidth. That's all anyone really cares about. Server farms only look at outbound bandwidth, because that's always the higher number, and the driving factor of their 95th percentile. Home users all care about their download bandwidth, because that's what sucks up the most for them. Well, unless they're running P2P software. I know I was a rare (but not unique) exception, where I was frequently sending original graphics in huge formats, and ISO's to and from work.

    --
    Serious? Seriousness is well above my pay grade.
    1. Re:Mental Masturbation by Skal+Tura · · Score: 2, Informative

      it's actually not even 0.15Kb, it's 0.146kb >;)

      and 100mil hits, 1kb saved = 95.36Gb saved.

      You mixed up marketing, and in-use computer kilos, gigas etc. 1Kb !== 1000 bytes, 1Kb === 1024bytes :)

    2. Re:Mental Masturbation by Anonymous Coward · · Score: 0

      "So on a single day, the folks over at facebook have wasted roughly 783GB downstream and 469GB upstream."

    3. Re:Mental Masturbation by drinkypoo · · Score: 1

      While you have a good point, your argument can be summed up as "I've already been shot, so it's okay to stab me."

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    4. Re:Mental Masturbation by JWSmythe · · Score: 1

          Nah, I just never converted the KB (Bytes) of file size and string size (8 bit characters are 1 byte), so I never converted it down to the Kb/s (kilobits per second) for bandwidth measurement. :)

      --
      Serious? Seriousness is well above my pay grade.
    5. Re:Mental Masturbation by JWSmythe · · Score: 1

          Naw, it's more like, I'd rather be poked with that blunt stick than shot with a cannon. :)

      --
      Serious? Seriousness is well above my pay grade.
    6. Re:Mental Masturbation by value_added · · Score: 1

      This is a stupid exercise. Oh my gosh, there's an extra few characters wasted. They're talking about 150 characters, which would be 150 bytes, or (gasp) 0.150KB.

      Perhaps, but I'm reminded of the time when I started getting into the habit of stripping Unsubscribe footers (and unecessarily quoted Unsubscribe footers) from all the mailing lists (many high volume) that I subscribed to. During testing, I found the average mbox was reduced down in size by between 20 and 30%.

      If you accept the premise that waste is waste, then it's only a matter of perspective. If something doesn't affect you personally, it doesn't mean that it doesn't affect someone else.

      Overlong URLs may not waste bandwidth to any degree, but they sure as hell are wasteful in other respects, if not outright idiotic. Or is there no one who has ever copied/pasted a URL?

    7. Re:Mental Masturbation by Anonymous Coward · · Score: 0

      You mixed up marketing, and in-use computer kilos, gigas etc. 1Kb !== 1000 bytes, 1Kb === 1024bytes :)

      no need for special operators here! the majority of slashdot-reader-brains parse code type safe!

    8. Re:Mental Masturbation by Anonymous Coward · · Score: 0

      It's worse:

      Kb never used (Kelvin bit)
      kb kilo bit (1000 bits)
      KB Kilo byte (1024 bytes)
      kB rarely used (1000 bytes)

      There's no similar distinction for mega or giga, but b always means bit, and B always means byte.

    9. Re:Mental Masturbation by excesspwr · · Score: 1

      Badly written pages ... not only take up more bandwidth because they have a lot of crap code in them,

      This has always been my argument against commenting code. No comments passed = less bandwidth.

    10. Re:Mental Masturbation by roc97007 · · Score: 1

      > I took one huge badly written page, stripped out the crap content (like, do you need a font tag on every word?),

      Every string, more likely. No doubt created by Frontpage. I've had to clean up pages like that also. And the pages really did render in less time.

      --
      Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
    11. Re:Mental Masturbation by JWSmythe · · Score: 1

        I don't believe in commenting my **displayed code**. I do strongly believe in commenting my parsed code, so those never show to the users. In know in theory even comments can slow down the code as it's being parsed, but that's so trivial it doesn't matter.

          I am lazy on rare occasions, and will temporarily comment out HTML in displayed code, but that's always a temporary thing. It always makes me laugh when I read the source of big sites, and they have comments still in there. News sites have been fun for it, where they say "story here" and "image here", and other crap like that.

          In PHP, I may have "// story here" and "//image here", but those never show up to the user. :)

          Since this conversation started with Facebook, I just went to their main page. it has these comments in it...

      I won't even attempt to guess at all the javascript crap they have on the home page, but I'd guess its not necessary. :)

      --
      Serious? Seriousness is well above my pay grade.
    12. Re:Mental Masturbation by JWSmythe · · Score: 1

      Actually, I think the original layout in that particular page was done in Macromedia Dreamweaver. But ya, a WYSIWYG editor.

          I write my code by hand. It always comes out better, and it's easier to maintain.

      --
      Serious? Seriousness is well above my pay grade.
    13. Re:Mental Masturbation by Anonymous Coward · · Score: 0

      Network bandwidth uses SI prefixes. (i.e. 10^3N) Only a few holdouts in the RAM and OS fields continue to foist the abomination of "computer" prefixes upon us as they become increasingly a less clever approximation.

    14. Re:Mental Masturbation by dimethylxanthine · · Score: 0

      Well done, you have just wasted in the order of 35KB do make your point, and another 30 for me to point that out to you.

      What would Slashdot be if it weren't for the waste...

    15. Re:Mental Masturbation by Anonymous Coward · · Score: 0

      Actually, 1Kb equals something else.

      kilo is k not K

      byte is B not b

      bit is b

      Kelvin is K

      so 1Kb is 1 Kelvinbit, don't ask me what it means.

    16. Re:Mental Masturbation by PRMan · · Score: 1

      But Facebook's whole point is that it usually links to Fackbook. So it is going to have 100 links per page that could be reduced by 100 characters each. So it's outbound bandwidth too.

      --
      Peter predicted that you would "deliberately forget" creation 2000 years ago...
    17. Re:Mental Masturbation by AtomicJake · · Score: 1

      To the author: I am sorry to tell you that you are wrong:
      Kilobit.

      To the moderator: If you don't know what's wrong and what's true, just don't moderate.

    18. Re:Mental Masturbation by Anonymous Coward · · Score: 0

      You are also wrong. 1Kb is 1 Kilobit, not 1 Kilobyte. 1KB is 1 Kilobyte. So 1Kb = 1024 bits, _not_ 1024 bytes as you suggest. 1Kb = 128 bytes, since there are 8 bits per byte.

  15. what is that as a proportion? by wjh31 · · Score: 1

    but how much is that as a proportion of their total bandwidth usage, if they were worried about bandwidth im sure they could just compress the images a little more and save much more

    1. Re:what is that as a proportion? by Bloke+down+the+pub · · Score: 1

      Or mek it so evrbdy az 2 rte lyk dis.

      --
      It's true I tell you, feller at work's next door neighbour read it in the paper.
  16. Re:WTF by Skal+Tura · · Score: 0, Offtopic

    No, those guys wanting to ban black cars are saner people than writers of this article ...

    The black car thing atleast is somewhat significant! For example, see when mythbusters tested white vs. black car.

  17. BS by Anonymous Coward · · Score: 0

    BS

  18. tag: dropinthebucket by RobertB-DC · · Score: 4, Insightful

    Seriously. Long URL's as wasters of bandwidth? There's a flash animation ad running at the moment (unless you're an ad-blocking anti-capitalist), and I would expect it uses as much bandwidth when I move my mouse past it as a hundred long URL's.

    I'm not apologizing for bandwidth hogs... back in the dialup days (which are still in effect in many situations), I was a proud "member" of the Bandwidth Conservation Society, dutifully reducing my .jpgs instead of just changing the Height/Width tags. My "Wallpaper Heaven" website (RIP) pushed small tiling backgrounds over massive multi-megabyte images. But even then, I don't think a 150-character URL would have appeared on their threat radar.

    It's a drop in the bucket. There are plenty of things wrong with 150-character URLs, but bandwidth usage isn't one of them.

    --
    Stressed? Me? Of course not. Stress is what a rubber band feels before it breaks, silly.
    1. Re:tag: dropinthebucket by Skal+Tura · · Score: 1

      lol, i used to run Wallpaper Haven :)

      People came to me complaining "it's not haven, it's heaven!" Ugh ... Didn't know what Haven means :D

    2. Re:tag: dropinthebucket by basementman · · Score: 1

      Actually ads are usually served by the advertising company, not the websites they appear on.

  19. 5kb per typed page by TinBromide · · Score: 1

    If you take and type a full page (no carriage returns) into notepad and save it, you end up with 5kb per printed page at the default font/print settings. When was the last time that a web page designer cared about 5kb? If 150 bytes (yes, 150 char's) is a concern, trim back on the dancing babies and mp3 backgrounds before you get rid of the ugly url's.

    Besides, if not for those incredibly long and in need of shortening URL's, how else would we be able to feed rick astley's music video youtube link into tinyurl and expect people to click it, expecting it to be a real URL?

    --
    Is it sad that I am more likely to recognize you and your posts by your sig than your name or UID?
    1. Re:5kb per typed page by Overzeetop · · Score: 2, Interesting

      Actually, when I had my web page designed (going on 4 years ago), I specifically asked that all of the pages load in less than 10 seconds on a 56k dialup connection. That was a pretty tall order back then, but it's a good standard to try and hit. It's somewhat more critical now that there are more mobile devices accessing the web, and the vast majority of the country won't even get a sniff at 3G speeds for more than a decade. There is very little that can be added to a page with all the fancy programming we can put into them. Mostly, I (and my clients who need to find me) want information, and one of the best ways is simply readable text with static pictures. For the web, you can really compress the heck out of an image and still have it look crisp on a monitor.

      --
      Is it just my observation, or are there way too many stupid people in the world?
  20. more like CSS, Javascript, and by Anonymous Coward · · Score: 0

    umpteen ad links, crapromedia, and god knows what else. Seriously, the URL size is like complaining while you urinate into the sea then saying it's affecting the tide level.

    1. Re:more like CSS, Javascript, and by Tokerat · · Score: 1

      umpteen ad links, crapromedia, and god knows what else. Seriously, the URL size is like complaining while you urinate into the sea then saying it's affecting the tide level.

      Point taken, but it's certainly a metaphor. If we spend that much on the URL, imagine what the wasteful content with all the JavaScript comments and extra whitespace is taking up...

      --
      CAn'T CompreHend SARcaSm?
  21. Compared to what? by dmomo · · Score: 1

    What's the percentage savings? Is it enough to care or is it just another fun fact?

    Simplifying / nanoizing / consolidation javascript and reducing the number of sockets required to load a page would probably be more bang for the buck. Is it worth worry about?

    1. Re:Compared to what? by Skal+Tura · · Score: 1

      Simple javascript compression would probably save them 1000 as much as shortening urls from 150 chars to 50.

    2. Re:Compared to what? by HeronBlademaster · · Score: 2, Interesting

      Interestingly (or maybe not), Google doesn't gzip their analytics javascript file...

    3. Re:Compared to what? by jholster · · Score: 1

      True & interesting, but browser will cache the ga.js anyway. Generally, if you want to save some real bandwidth, the first thing to do is make sure that your web server (and your CMS!) supports If-Modified-Since HTTP request header.

    4. Re:Compared to what? by Anonymous Coward · · Score: 0

      Unpatched IE 6 (original XP version) has a bug in handling compressed javascript files.

      While nobody should be running unpatched IE 6, it isn't good business practice to not support the default install of the most common OS of your customers.

    5. Re:Compared to what? by HeronBlademaster · · Score: 1

      So you're arguing against gzipping websites? You realize webservers can decide whether or not to gzip a file based on what browser the user is running, right? All Google needs to do is have their webserver not gzip the file if the user is running a sufficiently old IE6.

  22. absolute number and 'wasted' by fermion · · Score: 1
    First, absolute numbers mean nothing. It is like 200 million for this wasted federal program on the 20 I waste on coffee over equal period. Without know the percent of total, or how much it would really save, or even if the problem can be fixed. As it is, this is just random showboating, perhaps interesting from a theoretical sense if the math is correct, but given the absolutes I doubt the author can really do the math correctly.

    Second, define 'waste'. Most rational people would argue that facebook is itself is a waste of bandwidth, and that getting rid of it would leave more bandwidth for what people really want, which is p0rn, unless the rumors extorted in the previous article is true which is that facebook is really about such amateur barely legal material.

    But even if we assume that Facebook is wasted, the percentage of bandwidth used is probably not excessive given it's entertainment value. I mean, it would be like getting rid of the department of homeland security. Sure it would lower the taxes we pay by 2%, but don't we already have enough unemployed executives complaining about how hard it is to live on a 1 million dollar severance package?

    --
    "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
  23. Long urls good for search rankings by Anonymous Coward · · Score: 0

    The reason many sites have long URLs like that is so they can be explicit and do better in Google search rankings. As long as Google values the actual URL for search rankings, URLs will remain long.

  24. Whatever by Bloke+down+the+pub · · Score: 1, Funny

    75 MBit/s? What's that in Libraries of Congress per decifortnight?

    --
    It's true I tell you, feller at work's next door neighbour read it in the paper.
    1. Re:Whatever by geekoid · · Score: 1

      Giraffe

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  25. Its all about the evolution of "it" by Rooked_One · · Score: 1

    it goes in cycles... you get better hardware, then you saturate it with software. Then you get better software and you saturate it with hardware.

    Currently, we can apply said metaphor with internet connections. We started with jpegs. We had low baud modems. We then moved on to moving pictures we needed to download. They upped it to cable. Now we are to the point where the demand for fiber to your house is going to be needed in most situations.

    Think how we've moved from dumb terminals to workstations and now we are using more dumb terminals (ie - VM's) and it will just keep cycling.

  26. At this point... by MatthewAnderson · · Score: 0

    At this point we may as well start harping on engineers about TCP/IP packet overhead if we're concerning ourselves with this water under the bridge...

  27. I can top that. Try the Globe and Mail! by Anonymous Coward · · Score: 5, Interesting

    For an even more egregious example of web design / CMS fail, take a look at the HTML on this page.

    $ wc wtf.html
    12480 9590 166629 wtf.html

    I'm not puzzled by the fact that it took 166 kilobytes of HTML to write 50 kilobytes of text. That's actually not too bad. What takes it from bloated into WTF-land is the fact that that page is 12,480 lines long. Moreover...

    $ vi wtf.html

    ...the first 1831 lines (!) of the page are blank. That's right, the &lt!DOCTYPE... declaration is on line 1832, following 12 kilobytes of 0x20, 0x09, and 0x0a characters - spaces, tabs, and linefeeds. Then there's some content, and then another 500 lines of tabs and spaces between each chunk of text. WTF? (Whitespace, Then Failure?)

    Attention Globe and Mail web designers: When your idiot print newspaper editor tells you to make liberal use of whitespace, this is not what he had in mind!

  28. Re:Wow. Just wow. by Guysmiley777 · · Score: 1

    Typical half-assed slack-alism. HEY! If I take a really small number and multiply it by a REALLY HUGE number, I get a REALLY BIG NUMBER! The end is nigh! Panic and chaos!!!

    --
    Coding with assembly is like playing with Legos. Coding an application in assembly is like building a car with Legos.
  29. ffs by stonedcat · · Score: 0

    Yea that's it... URLs are wasting bandwidth... never mind the massive amounts of useless garbage on the Internet no it's definitely long URLs.

    --
    You can't take the sky from me.
  30. Yes, of course it's waste. by Anonymous Coward · · Score: 0

    If they were using that space for descriptive purposes (like long file names) there might be an arguable tradeoff, but most URLs are full of illegible encodings that mean nothing to anyone except the people managing the service. This is all fine, but why not encode all the info and send it in one fat lump? Most users, perhaps with the exception of some nerds who hangout here at /. don't navigate by editing the URL directly. They press the big shiny buttons like normal primates.

  31. Customer bulletin by kheldan · · Score: 4, Funny
    Dear Customer,
    In order to maximize the web experience for all customers, effective immediately all websites with URLs in excess of 16 characters will be bandwidth throttled.

    Sincerely,
    Comcast

    --
    Are YOU using the TOOL, or is the TOOL using YOU? Think about it!
    1. Re:Customer bulletin by Anonymous Coward · · Score: 0

      Inserting that text in every page would be a good start in saving bandwidth.

  32. High Hanging Fruit? by Anonymous Coward · · Score: 0

    For laughs, use Yahoo's YSlow on the article. The site makes a stupid amount of requests for CSS and JS, doesn't GZIP, doesn't use ETAGS, etc. They ignore almost every other bandwidth saving technique, but at least their URLs are short!

    1. Re:High Hanging Fruit? by HeronBlademaster · · Score: 1

      This is precisely the kind of thing that makes me annoyed with people in general... if people didn't do this all the time (complain about the dike leaking a drip when six feet over there's a ten-foot hole) I wouldn't be so anti-social.

    2. Re:High Hanging Fruit? by Lennie · · Score: 1

      Also, Yahoo has been using short URL's on it's frontpage for god knows how long. 10 years atkeast, that it not got included in YSlow just shows you how much this really matters.

      --
      New things are always on the horizon
  33. i just got off the toilet by Anonymous Coward · · Score: 0

    www.ishitoutanobama.com

  34. An elliptic assault on Net Neutrality. by tjstork · · Score: 2, Interesting

    Has anyone here even looked at what the real motivation behind this study is? It's to create this idea that web hosts, are, surprisingly, wasting the valuable bandwidth provided by your friendly ISPs. Do a few stories like this over a few years, and suddenly, having Comcast charge Google for the right to appear on Comcast somehow seems fair. The bottom line is, as a consumer, its my bandwidth and I can do whatever I want with it. If I want to go to a web site that has 20,000 character URLS, then, that's where I'm headed.

    --
    This is my sig.
  35. more bandwidth wasted by this thread! by Anonymous Coward · · Score: 0

    honestly, is this really an issue when people are streaming entire movies?

  36. It's not the URL in the GET, it's URLs in the HTML by rbrome · · Score: 2, Insightful

    I hope this is obvious to most people here, but reading some comments, I'm not sure, so...

    The issue is that a typical Facebook page has 150 links on it. If you can shorten *each* of those URLs in the HTML by 100 characters, that's almost 15KB you knocked off the size of that one page. Not huge, but add that up over a visit, and for each visit, and it really does add up.

    I've been paying very close attention to URL length on all of my sites for years, for just this reason.

  37. Better idea by Anonymous Coward · · Score: 5, Funny

    Just use a smaller font for the URL!

    1. Re:Better idea by v(*_*)vvvv · · Score: 1

      Ya, whatever. That will only save black ink for your monitor.

  38. Re:tag: dropinthebucket... Hmmm, i was thinking by davidsyes · · Score: 1

    ...

    I am wondering if this is more about exploting the fact that such long and exacting URLs might serve as a form of security through obscurity...

    --
    Previously: "Linux... Toward the Sunrise..." Now: "Linux... Toward the-- No, now, part of Every Sunrise"
  39. African or European? by Anonymous Coward · · Score: 1, Funny

    75 MBit/s? What's that in Libraries of Congress per decifortnight?

    Depends. Is it an African or European Library of Congress?

  40. Re:Mental Masturbation Try the new ebay by thoglette · · Score: 2, Insightful

    Badly written pages (and oh are there a lot of them out there) not only take up more bandwidth because they have a lot of crap code in them, but they also tend to take longer to render.

    ebay has "upgraded" their local site http://my.ebay.com.au/> and "my ebay" is now a 1M byte download. That's ONE MILLION BYTES to show about 7K of text and about 20 x 2Kb thumbnails.

    The best bit is that the htm file itself over 1/2 Mbytes. Then there's two 150K+ js files and a 150k+ css file.

    Web "designers" should be forced to develop on a 128M P3 machine with VGA screen and dial up modem

    --
    -- Butlerian Jihad NOW!
  41. No by kpang · · Score: 5, Insightful

    Are Long URLs Wasting Bandwidth?

    No. But this article is.

  42. Focusing on the wrong problem... by hrbrmstr · · Score: 3, Insightful

    Isn't Facebook itself the huge waste of bandwidth as opposed to just the verbose URLs it generates?

    --
    Mind the gap...
  43. It's on static pages too... by Anonymous Coward · · Score: 0

    I used to be the sysadmin for a public high school. The school's website was 100% static pages, and the Webmaster/Web design teacher was thoroughly incompetent. She pretty much read "Web Design for Dummies" and used Macromedia Suite MX to design the worst and slowest possible Java or Flash crap. Poor layout too--it looks like MySpace was rewritten by teenagers.

    The school website URL was kind of long to begin with: http://www.school.county.k12.fl.us/

    Here's where it got fun. First, she could not comprehend the concept of relative paths, so every single link was an absolute path.

    For the school calendar, I wanted to use http://www.school.county.k12.fl.us/calendar. She could not have any of that, and insisted on http://www.school.county.k12.fl.us/DailyUpdates/calendar/calendar/calendar.htm. Her argument was that users should not memorize addresses of things they go to frequently--they should go to the main page and link through.

    My personal favorite URL of hers? http://www.school.county.k12.fl.us/StudentParentInfo/PhoneList/PhoneList08-09.htm.

    Every attempt I made to organize the webspace was met with her hysterically screaming and making it a mess again. She also insisted on uploading .psd files along with their resultant .jpg files--her "new and improved" website started at 900 MB and grew to 40 GB in three years.

  44. A great symptom of poor design: eBay v TradeMe by ykiwi · · Score: 1
    It isn't just the URL, but the URL is a great symptom of a site where being lean is not a design priority.

    eBay url for an electronic item for sale:

    http://cgi.ebay.com.au/SONY-5-1-Home-Theatre-Amplifier-Receiver-STRDE497_W0QQitemZ180339459830QQcmdZViewItemQQptZAU_HOME_CINEMA_SYSTEMS?hash=item180339459830&_trksid=p3286.c0.m14&_trkparms=66%3A2%7C65%3A1%7C39%3A1%7C240%3A1318

    NZ's Trade Me url for a piece of electronic equipment:

    http://www.trademe.co.nz/Electronics-photography/Home-audio/Headphones/auction-210021888.htm

    There is no excuse for the difference in length, and eBay is not only confusing search engines and us, but is also making their pages slower to load.

    Note that the ebay.com.au and Trademe sites have about the same number of listings. The url is a symptom, and a cursory analysis of the rest of the page and the site will see plenty of other examples of poor design for page loading speed.

    1. Re:A great symptom of poor design: eBay v TradeMe by Lehk228 · · Score: 1

      ebay is a giant active database serving real time data and images in every direction. if they can save a small percentage CPU or memory load by having long URLs it will be worth it.

      in addition if they can expedite diagnostics when problems arise by having meaningful (for the devs) urls they save on s/downtime/money

      --
      Snowden and Manning are heroes.
    2. Re:A great symptom of poor design: eBay v TradeMe by Anonymous Coward · · Score: 0

      I "like" how ebay seems to replace a semantic "=" with "Z" and is using "QQ" as a delimiter in URLs.

      This smells of half-assed braindeadness right away.

  45. Re:Wow. Just wow. by Dynedain · · Score: 1

    Even worse, it's like complaining about one person's wall-wart in an entire city of homes using air-conditioners.

    --
    I'm out of my mind right now, but feel free to leave a message.....
  46. Stupid questions are the real culprit by SleepyHappyDoc · · Score: 1

    I bet a lot more bandwidth was "wasted" here talking about it. Facebook could "save" even more bandwidth, if they shut down all their servers and returned nothing to every request. This is a silly discussion.

    --
    Stasis is death. Embrace change.
  47. Notsomuch by EkriirkE · · Score: 1

    Outside of meaningless SEO strings, a lot of the data in the URL would have to be transmitted otherwise, in cookie or POST data

    --
    from 09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
    to 45 2F 6E 40 3C DF 10 71 4E 41 DF AA 25 7D 31 3F
  48. AJAX by Gnaget · · Score: 0

    If you want to save bandwidth charges, there are much better ways to do it which will drop bandwidth by a lot more than the length of a URL. Move to AJAX calls, replacing only the parts of the DOM which need to be updated, instead of refreshing the entire page.

  49. Are bloated HTML files wasting bandwidth? by Anonymous Coward · · Score: 0

    Yes! 10 to 400 times more so that long URLs! The IQ of your average Slashdot poster seems to be slipping lately.

  50. Cookies are a much bigger problem IMO by Anonymous Coward · · Score: 0

    Many sites today have insanely large cookies, which are sent with every request. Some sites are so large that the HTTP request won't even fit in on TCP packet....

    Clearly these sites ought to be able to use much smaller session key cookies of some sort (obviously properly cryptographically signed). All this other junk they jam in their cookies they can / should just store on their server.

  51. Actually, it does help bandwidth a little. by Millennium · · Score: 1

    The querystrings get passed to the CGI script, but that's done completely server-side: the bandwidth isn't wasted because the querystring never goes through the pipes. So you end up saving a little bit bandwidth-wise

    In any case, this is a case of grossly premature optimization. Very, very few URLs even come close to the bandwidth of a favicon.ico file, which is itself considered a pittance. There are far more effective ways to cut down on bandwidth than these near-trivial aspects.

  52. Damn Right! by failedlogic · · Score: 1

    After all these years of using Slashdot, I cannot access the site with just /. in my browser.

    Here's the math:

    Slashdot.org
    - /.
    ___________________
                                      10 wasted characters.

  53. Re:Damn waste of effort by Anonymous Coward · · Score: 0

    Web-bloat is as prevalent as any software-bloat. Chief cause: laziness.

    The sub-market is at fault for offering me short-URI services, so that I can Twitter more and SEO be damned.
    The search engines are at fault for inventing SEO and making the content of URIs matter.
    The merchants are at fault for making my shopping cart expire, after I go away for five damn minutes.
    The advertisers are at fault for making me pay extra for bandwidth, because I use a plugin to hide their content.
    The webmasters are at fault for needing revenue, and using interstitial pages as though they were a damn magazine.
    The browsers are at fault for demanding a page hit, instead of trusting the cache from 10 seconds ago.
    The programmers are at fault for including bells-and-whistles AJAX and kitchen-sink CMS overhead, irrelevant to my interests.
    The designers are at fault for using tables instead of CSS.
    The coders are at fault for embedding scripts, styles, and too many indents for readibility.
    The W3C is at fault for pushing HTML, XML, and XSL tags to be longer, not shorter.
    The technology is at fault for not setting a threshold on how long a URI can be.
    The mouse is at fault for letting me click a link instead of typing everything in.
    The gopher protocol is at fault for letting http win.
    The internet is at fault for being so old it can't evolve past 7-bit ASCII.

    Okay...I have other pointless articles to go read.

  54. And that wouldn't have mattered... by coryking · · Score: 2, Insightful

    ...except they aren't using mod_gzip/deflate. At first I thought you browsed the web RMS style and maybe wc* didn't support compression** and you were just getting what you deserved***, but then I checked in firefox and lo and behold:


    Response Headers - http://www.theglobeandmail.com/blogs/wgtgameblog0301/

    Date: Fri, 27 Mar 2009 23:39:54 GMT
    Server: Apache
    P3P: policyref="http://www.theglobeandmail.com/w3c/p3p.xml", CP="CAO DSP COR CURa ADMa DEVa TAIa PSAa PSDa CONi OUR NOR IND PHY ONL UNI COM NAV INT DEM STA PRE"
    Keep-Alive: timeout=15, max=100
    Connection: Keep-Alive
    Transfer-Encoding: chunked
    Content-Type: text/html

    200 OK
    No compression!

    If they had been using compression, it would have made all that whitespace fairly negligible.

    Probably a result of how their template system stitches everything together. Still, that is pretty bad. There is no excuse to run a webserver and not turn on compression. It is the single biggest way to boost page-load and decrease bandwidth.

    * wget 4 lyfe!
    ** compression is probably evil and Anti-Freedom(tm) somehow, kinda like images are evil or fads like "graphical user interfaces" are evil. In otherwords,anything that makes things easier or faster for a user is basically evil and Anti-Freedom****.
    *** braindead comment spamming bots are the only thing not using compression (except RMS, probably)
    **** I'll leave it to you, dear reader, to deduce if I'm serious. Hint: no hint.

    1. Re:And that wouldn't have mattered... by Anonymous Coward · · Score: 0

      wc == word count. Basic unix tool. It doesn't fetch webpages. He probably did use wget to get the page in the first place.

  55. nah urls are fine by Anonymous Coward · · Score: 0

    if we need to free up some bandwidth how about cutting out all the stupid ass ads everywhere, who clicks on those anyway?

  56. Re:I can top that. Try the Globe and Mail! by LateArthurDent · · Score: 5, Interesting

    ...the first 1831 lines (!) of the page are blank...Attention Globe and Mail web designers: When your idiot print newspaper editor tells you to make liberal use of whitespace, this is not what he had in mind!

    Believe it or not, someone had it in mind. This is most likely a really, really stupid attempt at security by obscurity.

    PHB:My kid was showing me something on our website, and then he just clicked some buttons and the entire source code was available for him to look at. You need to do something about that.
    WebGuy:You mean the html code? Well, that actually does need to get transferred. You see, the browser does the display transformation on the client's computer...
    PHB:The source code is out intellectual property!
    WebGuy:Fine. We'll handle it. ::whispering to WebGuy #2:: Just add a bunch of empty lines. When the boss looks at it, he won't think to scroll down much before he gives up.
    PHB:Ah, I see that when I try to look at the source it now shows up blank! Good work!

  57. Usability is the problem, not bandwidth by Bill+Dimm · · Score: 2, Insightful

    The problem isn't bandwidth, it is that long URLs are a pain from a usability standpoint. They cause problems in any context where they are spelled out in plain text (instead of being hidden as a link). For example, they often get broken in two when sent in plain text email. When posting a URL into a simple forum that only accepts text (no markup), a long URL can blow-out the width of the page.

    Where does this problem come from? It comes from SEO. Website operators realized that Google and other search engines were taking URLs into account, so CMSs and websites switched from using simple URLs (like a numeric document ID) to stuffing whole article titles into the URL to try to boost search rankings. One of the results of this is that when someone finds a typo in an article title and fixes it, the CMS either creates a duplicate page with a slightly different URL, or the URL with the typo ends up giving a 404 error and breaks any links that point to it.

    What I don't understand is why search engines bother to look at anything beyond the domain name when determining how to rank search results. How often do you see anything useful in the URL that isn't also in the <title> tag or in a <h1> tag? If search engines would stop using URLs as a factor in ranking pages, people would use URLs that were efficient and useful instead of filling them with junk. The whole thing reminds me of <meta> keyword tags -- to the extent that users don't often look at URLs while search engines do, website operators have an opportunity to manipulate the search engines by stuffing them with junk.

    1. Re:Usability is the problem, not bandwidth by Lehk228 · · Score: 1

      i saw one site using a scheme of example.com/000000/articletitle.html where the numbers were the document ID and the article title could be any valid URL and get to that document.

      --
      Snowden and Manning are heroes.
    2. Re:Usability is the problem, not bandwidth by Bill+Dimm · · Score: 1

      i saw one site using a scheme of example.com/000000/articletitle.html where the numbers were the document ID and the article title could be any valid URL and get to that document

      Yes, I've seen that before. Someday the people that do that will realize what a bad idea it is. They'll probably figure it out when people start linking to:
      example.com/000000/get+your+free+kiddie+porn+here.html
      and their pages start showing up in the search engines for things they don't want to be associated with.

    3. Re:Usability is the problem, not bandwidth by Lehk228 · · Score: 1

      you can already do that with a ? after the URL.

      --
      Snowden and Manning are heroes.
    4. Re:Usability is the problem, not bandwidth by Bill+Dimm · · Score: 1

      you can already do that with a ? after the URL.

      Good point (assuming it isn't a dynamic page that checks for extraneous parameters), and yet another reason why search engines shouldn't be paying attention to anything in the URL other than the domain name -- the rest is too easy to manipulate.

  58. Re:I can top that. Try the Globe and Mail! by the_one(2) · · Score: 2, Informative

    Have you tried compiling the whitespace =)

  59. Plus by coryking · · Score: 2, Insightful

    The HTTP-Referer isn't designed for ?ref=somesource

    Your stat software wants to know if more people click to your page through the logo ?ref=mylogo or through a link in the story ?ref=story. The Referer can't give you that info.

    The HTTP-Referer also is no good for aggregation. It only give you a URL. If you didn't append something like ?campaign=longurl, it would be almost impossible to track things like ad-campaigns.

    HTTP-Referers *are* good for dealing with myspace image leeches. If you haven't I suggest you read thorough you log files right now--I bet you'll find 20% of your traffic is myspace idiots leeching your images. Redirect those guys to something more... tasteful, and enjoy the bandwidth savings.

  60. Is facebook wasting bandwidth by Anonymous Coward · · Score: 0

    The better question is, how much bandwidth would be saved if we got rid of facebook altogether ?

    I banned this POS site on my network, and suddenly real sites are much more responsive & everyone is more productive!

  61. Re:I can top that. Try the Globe and Mail! by DavidD_CA · · Score: 2, Interesting

    Wow. Judging by the patterns that I see in the "empty" lines, it looks like their CMS tool has a bug in it that is causing some sections to overwrite (or in this case, append instead).

    I'd bet that every time they change their template, they are adding another set of "empty" lines here and there, rather than replacing them.

    --
    -David
  62. no, they are not by cecil_turtle · · Score: 1

    The core answer to "are long URL's wasting bandwidth" is no, they are not. The extra bandwidth used is much less than the general background noise of the Internet. There are so many other things that waste more bandwidth than long URL's that it's not worth spending any time worrying about them. Think server headers ("x-powered-by: asp.net" is really annoying), spam, extra email headers, long email threads where the entire original thread is copied each time, un-optimized graphics on web pages, un-optimized javascript/css/html, un-followed prefetching and DNS prefetching, viruses spreading, port scans, etc. The list goes on and on. Watch a raw firewall log on an active connection for about 5 minutes to see what I mean about "background noise". Long URL's can only go to about 2,000 characters (~2KB) before you start running into compatibility issues (IE), and most "long URL's" are much shorter than that even. This just isn't worth worrying about.

  63. Re:I can top that. Try the Globe and Mail! by Anonymous Coward · · Score: 0

    What do you think inband http compression does to those blank lines?

  64. It keeps countlessl idiots off the streets... by AmazingRuss · · Score: 1

    ...so no.

    American Idol might yield a better "idiots off the street" to bandwidth ratio, though.

  65. Re:Mental Masturbation Try the new ebay by mikael_j · · Score: 1

    As a web developer who was using a P3 with 256 MiB of RAM at work up until recently I think I'm going to have to disagree.

    How about an example of the software I tend to be running at the same time:

    1. gVim - Not very resource intensive but it all adds up, at least when you have a dozen or so files open at once.
    2. MS SQL Enterprise Manager
    3. MS SQL Query Analyzer
    4. Internet Explorer
    5. Firefox with firebug, Web Developer and a few other addons
    6. Safari
    7. Opera

    Clearly all of this eats more than 256 MiB of memory, on a CPU that's a few generations out of date, let's just say there's a lot of swapping...

    Then there's the matter of the VGA screen, last time I checked the stats less than 4% of the users were using a resolution of 800x600 or less, why would anyone care to develop for machines running at 640x480 with 4 bit color? And how do you expect anyone to get useful work done on a machine like that?

    As for the javascript, sometimes it is, unfortunately, necessary to send fairly large amounts of javascript to the clients but if done correctly then the client should cache the scripts after loading them the first time.

    That said, a 0.5 MiB html document is pretty huge, at least for what is essentially a start page, I'm sure they could trim that down if they really wanted to.

    /Mikael

    --
    Greylisting is to SMTP as NAT is to IPv4
  66. These long URLs are generated by the server by Skapare · · Score: 1

    Instead of sending them to the client (waste step 1) which in turn has to send them back to the server (waste step 2), what should be done is keep them on the server. Generate an MD5 hash of the URL string. Use the MD5 hash as an index in a Berkeley DB file, storing the time last created, and the whole URL (maybe compressed). Make a replacement URL with the MD5 hash. When a request comes back with the MD5 hash, look up the URL to use for it, much like having your own tiny URL tool. A reaper process can run in the background, gradually stepping through the indexes in some order, and deleting entries considered to be too old (whatever is appropriate for the site).

    This is actually a good idea for security, too. By refusing to accept the full URLs with all those variable fields that people could modify, you'll have more of a shield against hackers trying to tweak with the system.

    --
    now we need to go OSS in diesel cars
  67. Excessive URL? How about some of those review site by rrossman2 · · Score: 1

    Just take a look at some of those review sites people post links to right here on slashdot. They contain so many ads from different addresses they make the client look up, plus a review of TWO items tend to be split over 10 pages, which each page being a single paragraph or ONE PICTURE. Slashdot! Save the bandwidth! Don't like to those sites, or if someone does, don't follow! Think of the URL requests we as slashdot could save if we didn't all follow a link to a site that has 100+ URL requests per page for 10 pages!

  68. Mod parent up by Anonymous Coward · · Score: 0

    This is the best feature of TinyURL, and there's no need to get goatsed or tubgirled again.

    1. Re:Mod parent up by Anonymous Coward · · Score: 0

      he only crappy thing is is that is not default... for some reason, firefox likes to delete cookies... I irregularly have to re-sign into igoogle, etc. Though it could be because of DHCP

    2. Re:Mod parent up by iago-vL · · Score: 1, Troll

      Google and the like don't care what your source IP is, just that you have the proper cookie. Something else is causing your problem.

      (If you want proof, drag a laptop to your friends' houses, and you'll still be logged in)

    3. Re:Mod parent up by RoFLKOPTr · · Score: 1

      for some reason, firefox likes to delete cookies...

      Actually, Google just requires you to reauthenticate every couple of weeks. If you notice, when it asks you to login again, it will show your email address in static text and only ask for your password. It has nothing to do with cookie deletion or IP address changes.

  69. I found this looong URL the other day by Skapare · · Score: 1

    Slashdot won't even let me type (really: paste) it in. So I have to enter it via tinyurl. The preview link is here [http://preview.tinyurl.com/dapfpm]. That URL was found traversing through the proxy when accessing a Youtube video. Later I repeated the same video (which was rather lame) and didn't get anything as long. I think they dynamically insert all the ads and stuff in the URLs.

    --
    now we need to go OSS in diesel cars
  70. bad reporting by roc97007 · · Score: 1

    Without context, this is meaningless. 75 Mb/sec as a percentage of what? "As much as" -- translating to "less than, probably significantly less than". In context, this is probably not news. Which is usually why context is left out -- because in context, there would be nothing significant to report.

    "Up to 1000 people had pianos dropped on their heads". Over what time period and geological area? If worldwide since 1698, it's a curiosity. If in my town since last Tuesday, it's cause for concern. But without context, I have no idea whether to get excited about the Piano Ban of 2009.

    --
    Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
  71. Wasting Bandwidth by Anonymous Coward · · Score: 0

    Let me get this one straight.
    He is saying that with long web names it is waisting bandwidth.
    Now, with SPAM, VIRUS, MalWare, SpyWare, Bots, Trojans, Rootkits, and any other band wasting junk out there these people have the gull to complain about long URL's wasting bandwidth.

    Go Figure!!!

  72. Information Underload! by malevolentjelly · · Score: 1

    Maybe we should consider not using a bunch of ridiculous and frivolous web-based application crap like ajax in order to serve information and images.

    I mean, if we're going to nitpick, isn't the entirety of web 2.0 sort of retarded? We complain about web standards and yet no one is willing to write things in a simple and expressive enough manner that old or low end hardware such as cell phones are relevant to the mainstream web. Maybe URL's aren't really the big problem here.

    Hell, if we are translating this to energy use, imagine how much electricity would be saved if everyone stopped using inefficient and expensive languages like Ruby... cities worth.

    Why don't we stop serving video in flash and go back to simplistic plugin players or *gasp* ... Java! If we want to cut bandwidth, look to the past and narrowband.

    If we want to cut energy or bandwidth use, it's quite simple. The modern web is full of ridiculous waste.

  73. Oh come on... by Anonymous Coward · · Score: 0

    hugeurl.com is there for a reason.
    I don't know which one, but I'm sure there is one.

  74. Subject by z-j-y · · Score: 1

    What is next? XML is too wasteful?

  75. Re:I can top that. Try the Globe and Mail! by coryking · · Score: 1

    I bet you are right. Hundred bucks says the HTML for the textbox where you change your templates has a couple newlines in it. SOmething like:

    [textarea] ... some extra newlines....
    {$OLD_TEMPLATE} .... some extra newlines...
    [/textarea]

    Sorry I couldn't make that more clear, but slashdot would have ate the HTML (moreso than it already has)

  76. And following this line of "thought" ... by ignavus · · Score: 1

    do long filenames waste disk sectors?

    (Answer: not as much as long files do).

    --
    I am anarch of all I survey.
  77. Didn't this thread load really fast? by chord.wav · · Score: 1

    i jst svd u 9 byts of bndwdth, thts why

  78. How about the other 1000 things you could do? by v(*_*)vvvv · · Score: 1

    Like:
    - naming all your variables and functions really short
    - persistent use of classes and css to avoid common attributes
    - grouping/normalizing css properties
    - using js to generate repetitive code
    - not using any carriage returns
    - removing every js function and css declaration you don't use
    - removing all comments
    - properly compressing images
    - proper caching declarations
    - doing all browser optimizations on the server

    And you know, URLs usually contain important information. I've never seen someone put garbage in a URL. And if it's information that needs to be passed, it doesn't matter where it is placed. It will take up the same bandwidth.

  79. So I remember an old Microsoft joke by microbee · · Score: 1

    It was said by replacing "Microsoft Office" by "MSOF" will make certain office releases from 10 floppy diskettes to 9, saving tremendous manufacturing cost.

  80. Re:It's not the URL in the GET, it's URLs in the H by v(*_*)vvvv · · Score: 1

    Ya, and this is a design decision I think. Each link contains a hash to a cached node of some sort, for direct access to each resource.

    FYI: Director of Engineering discusses Facebook's architecture (1hr)

  81. the answer by gEvil+(beta) · · Score: 1

    Maybe. But I'm sure we can waste more.

    --
    This guy's the limit!
  82. Thanks filter for ruining my answer... :P by Anonymous Coward · · Score: 0

    1

    (Slashdot is killing the Earth's bandwith!)

  83. They get charged for outbound, not inbound by grantsucceeded · · Score: 1

    It only matters whether the greater of the two directions is reduced, if you do 8gig out and 1gig in you get charged for 8gig out, even if you reduce 1g inbound to .925G. (also it's 95th percentile typically, but that shouldn't matter in this case)

    It would cost more to do the refactoring than the ever could hope to recoup, even if shorter urls also decrease outbound traffic.

  84. Re:I can top that. Try the Globe and Mail! by f00Dave · · Score: 1

    I compiled whitespace from the Haskell source (could have apt-get install whitespace'ed it, but this was more fun =] ). Then fed it that page's source ... no output. Anyone want to reverse engineer some probable CMS-noise?

    --
    .f00Dave
  85. Re:Wow. Just wow. by Macthorpe · · Score: 0, Offtopic

    Why did I read that as "panic and cheese!!!"?

    --
    "It does not do to leave a live dragon out of your calculations, if you live near him." - Tolkien
  86. what else is wasteful? by cratermoon · · Score: 1

    Sure, they are wasting bandwidth. In fact, so are vowels. Lets remove all the vowels from text, that'll save even more!

    1. Re:what else is wasteful? by ptudor · · Score: 1

      Vowels are excluded in non-elementary/non-Quranic Arabic writing. And it works, in the way that when you see "Blvd" or "Pkwy" you already know how to pronounce it.

  87. CAPTCHA by Joebert · · Score: 1

    But how much is being wasted on CAPTCHA challenges ?

    --
    Wanna fight ? Bend over, stick your head up your ass, and fight for air.
  88. Size doesn't matter... by Anonymous Coward · · Score: 0

    It not the size of the URL that matters but what you do with it.

    At least that's what my GF says.

  89. pr0n, pix, imgs, flash, m$.aspx, botnets, spam by Anonymous Coward · · Score: 0

    penny wise, pound foolish.

  90. Re:Can they not eschew apostrophe's... by aqk · · Score: 0

    I say, for one thing, that THOUSAND'S, perhaps million's of byte's of storage could be saved if we all stopped using these idiotic and gramatically incorrect apostrophe's when describing URLs (instead of url's)

  91. absurd by rbunker · · Score: 2, Insightful

    This is silly. The URLs, even "long" ones are miniscule compared to the pictures, streaming video, music, javascript etc. on these pages. To worry about them is like worrying about the lint on a suit of clothes making them too hot. This is just absurd.

  92. Probably the point, but... by Tokerat · · Score: 1

    ..if URLs are such a huge waste, image the bandwidth used by the rest of the data in the HTTP protocol alone, content excluded.

    Since web applications are getting so advanced, isn't it time we moved beyond the "interactive document" paradigm and developed something a little more efficient for the internet applications of tommorow?

    DISCLAIMER: I myself have no idea where to begin on such a task, but it does seem that we've just been patching on to a technology that isn't mean for what we do with it; HTTP/HTML has it's place for certain, but we could and should do better, don't you think? Is anyone working on an alternative? With service providers constantly screaming about bandwidth, there must be a way to tighten the belt, so to speak...

    --
    CAn'T CompreHend SARcaSm?
  93. Re:I can top that. Try the Globe and Mail! by noidentity · · Score: 1

    ...the first 1831 lines (!) of the page are blank. That's right, the &lt!DOCTYPE... declaration is on line 1832, following 12 kilobytes of 0x20, 0x09, and 0x0a characters - spaces, tabs, and linefeeds. Then there's some content, and then another 500 lines of tabs and spaces between each chunk of text. WTF? (Whitespace, Then Failure?) Attention Globe and Mail web designers: When your idiot print newspaper editor tells you to make liberal use of whitespace, this is not what he had in mind!

    Dude, that's just an embedded Whitespace script. You need to upgrade your browser!

  94. Common Misconception by rrp · · Score: 1

    The URL of your page has nothing to do with the Google pagerank. See: http://googlewebmastercentral.blogspot.com/2008/09/dynamic-urls-vs-static-urls.html

  95. Re:I can top that. Try the Globe and Mail! by The+Moof · · Score: 1

    Or, a more likely explanation, is server side script blocks.

  96. Acrnym by noppy · · Score: 3, Funny

    To fthr sav our bdwdth, ^A txt shld be cmprsd into acrnyms!

  97. Reducing bandwidth can be sensible by Anonymous Coward · · Score: 0

    Don't be quite so dismissive. We have a high traffic site which features a real-time data widget. This polls for a small data file very frequently. Eliminating the majority of the server response headers, changing to use a shorter domain name and page, and ensuring we polled on a different domain (so that cookies weren't sent) reduced our bandwidth usage by just over a third. That's not to be sniffed at.

  98. Re:Mental Masturbation Try the new ebay by thoglette · · Score: 1

    Hi Mikael,

        thanks for the considered response. Let me drag you back to the real world.

        First, I usually don't run anyone's app/website fullscreen. 2ndly, even if I do have 1/2G RAM, you can bet your bottom dollar there's half a dozen other apps fighting for it. Slashdot, ebay, twitter, aardvark. cyclingnews.com are all sideshows predominantly providing c. 10K text. Work?

    Now, my home machine has only 388M of RAM and 1500 x 1024 pixels. My work machine has more ram (1/2G), but much less screen (1240 x 1024). Yes, that's my brand new corporate PC. My mother=in-law (and mother) have less than both of these. My kids have subnotebooks with VGA and sub VGA screens. Half my extended family are still on dialup (or dialup paced wifi). Let's not get started on the early adopterkindens with their wunderwebphones!

    Now, I have a number of supplier/client sites whose first useful hyperlink is more than 1000 pixels away from top left. Painful. Pretty, pretty graphics above. But not user friendly.

    The bottom line is - for most of us, www browsing is only one of the thomgs we are doing (I currently have gimp, xsls,open project, 5 browser panes, email, a stats package, two dos windows, one running perl, RDP and word open. Yes I'm mildly http://www.randsinrepose.com/ ADD )

    And, as you said, the my.ebay start page is surreal.

    --
    -- Butlerian Jihad NOW!
  99. I/dont/think/that/urls/are/too/long... by gooneybird · · Score: 1

    I/dont/think/that/urls/are/too/long/I/think/that/they/should/be/as/long/as/they/need/to/be/after/all/it's/only/bandwidth/and/bandwith/will/always/increase/until/it/doesn't/anymore..

  100. Leaking faucet next to Niagra Falls. by cskrat · · Score: 1

    We're talking about a leaking faucet next to the font of data that Facebook delivers.

    It's interesting that they can quantify the amount of data they're receiving in GET requests but I'd suspect that using shorter URLs would still transmit the same number of packets over the internet AND would incur an extra database hit or decompression to translate the *slightly* shorter URL into something meaningful.

    Are we really worrying about byte level optimizations here?
     

    --
    My God! It's full of eval()'s.
  101. Re:Can they not use... SocialDNS.net by Anonymous Coward · · Score: 0

    Just use go://target instead of long http://...

    http://www.socialdns.net

    It is an open alternative that creates a novel namespace using the go:// scheme.
    Names are open and free and every user can manage his web TLD: go://blog.pedro

  102. Actually, you may be missing the point by PRMan · · Score: 1

    Let's say a Facebook page has 100 links on it.

    Using GET syntax, each of the 100 links must contain all the metadata about the user's current state, etc.

    Using POST syntax, the links each contain only the page information. The metadata WILL be added to the call, the same as the GET syntax, but not to each <a> tag on the page.

    --
    Peter predicted that you would "deliberately forget" creation 2000 years ago...
    1. Re:Actually, you may be missing the point by autocracy · · Score: 1

      Using GET syntax, each of the 100 links must contain all the metadata about the user's current state, etc.

      No. Session information is usually stored server-side, and a key lookup is passed. Whether it be via GET, POST, or a cookie, the same amount of data is transferred. In some cases, the client-side has the information, but there is no magical change that's different between those three methods. Whatever the server doesn't know has to be sent regardless of the method, whether it's a reference to a key value for info on the server side from the last form, or the entire contents of the form.

      --
      SIG: HUP
    2. Re:Actually, you may be missing the point by smellotron · · Score: 1

      Hyperlinks are always GETs. The only way to get a link that POSTs is by using a hidden form and javascript that submits the form onclick... which is counterproductive.

    3. Re:Actually, you may be missing the point by Anonymous Coward · · Score: 0

      But still commonly abused. You'd think people would realize that when they have to go through that much trouble to accomplish things in that way, it's probably not the right way to be doing it anyway.

  103. The search engine problem with short URLs by pcause · · Score: 1

    One of the issues with shortening URLs is that the search engines look at the URL and the words present in the URL to determine how to rank the URL. This works against the desire to shorten. For example having: "baseball/redsox/beckett" is important to get a higher ranking.

  104. I'm in no position to speak on the matter... by John+Pfeiffer · · Score: 1

    Given that my personal website is 'giantpachinkomachineofdoom.com' and the website for my upcoming project is 'tangibleimagination.com', I think it would be a terrible conflict of interest for me to comment on this issue.

    --

    Friend: "The NIC is misconfigured..." Me: "No prob, I'll just telnet in and fix it." *Silence*
  105. IPv4 v IPv6 by reddn · · Score: 1

    want to complain about a small amount of usage.. how about what will be used when we start using 128bits instead of 32 bits for IP addresses in IPv6

  106. .NET VIEWSTATE even worse by Anonymous Coward · · Score: 0

    I am part of a team redesigning my company's website. I used to be a web developer, but am now an infrastructure and system (read "general" system, as in systematic, not specifically computer systems) junkie. Anyhow, I did a source view of the generated HTML and how the VIEWSTATE "gibberish" for maintaining state in ASP.NET. I copied the value and pasted it into Notepad and saved it... it was 48k... just the VIEWSTATE value.

    There is a workaround that stores it in a session on the server, which alleviates this. However, if I had not discovered this, it would have made it to production. How many instances of things like this exist out there in the open?

  107. pointless by canuck08 · · Score: 1

    A typical web server will have a symmetric network connection and will be using almost entirely the outbound bandwidth.

    If you are sending 90Mb/s and receiving 5Mb/s on your heavily loaded web server reducing theat 5Mb/s to 3 or 2 is utterly useless.

  108. More waste due to cookies for static content by Lazy+Jones · · Score: 1
    Many cookie-ridden websites nowdays still force client browsers to send all site/domain cookies for every request to static content. This is a much bigger problem, esp. in the age of asymmetric (DSL) connections.

    For example, the numerous images slashdot uses for the design are all on the .slashdot.org domain, for which my browser has 7 cookies stored (some of them from Google Analytics, one of the biggest culprits). This means that for each request (even HEAD) to these images and other static content, all those cookies are sent to the slashdot servers, wasting precious outgoing ("upload") bandwidth.

    --
    "I love my job, but I hate talking to people like you" (Freddie Mercury)
  109. Re:I can top that. Try the Globe and Mail! by skeeto · · Score: 1

    HTML tidy immediately chops 60kB off of it. So that's 60kB that wasn't doing anything at all.

  110. Who cares. by Anonymous Coward · · Score: 0

    $15/Mbps/mo * 75 == $750/mo. Yawn. Get a life people. Have you seen the other waste and spend involved in a large site?

  111. I suggest... by drolli · · Score: 1

    a lot of other reasons for waste of Bandwidth 1) The idea that no instead of a link URL we need usuall a javascript onclick event 2) The idea that a really good layouted requires nesting 15 levels of components. 3) The ideas that we first need to test if flash is installed. 4) The idea that even just figuring out the start date of a movie nowadays requires to load and skip a flash video. 5) The idea that things which would be perfectly fine without AJAX now require extra request after loading the website. 6) the idea that every wesite needs special menu bars on each end of the page instead of a unified, simple menu with one hierachy level more

  112. Re:Mental Masturbation Try the new ebay by mikael_j · · Score: 1

    Not to nitpick but I'd like to correct some parts of your post...

    VGA resolution means 640x480 with 4 bit color (16 colors) or 320x200 with 8 bit color ( 256 colors), most cellphones sold in the last few years are capable of graphics better than that and I sincerely doubt you know a lot of people running computers with "sub VGA screens".

    • Your computer most likely has 384 MiB of RAM, not 388.
    • 1240x1024 sounds like a very odd resolution, are you sure you don't mean 1280x1024?
    • The closest standard resolution to 1500x1024 that I can think of is 1400x1050.

    /Mikael

    --
    Greylisting is to SMTP as NAT is to IPv4