Slashdot Mirror


Is RSS Doomed by Popularity?

Ketchup_blade writes "As RSS is becoming more known to the mainstream users and press, the bandwidth issue reported by many sites (Eweek, CNet, InternetNews) related to feeds is becoming a reality. Stats from sites like Boing Boing are showing a real concern regarding feeds bandwidth usage. Possible solutions to this problem are emerging slowly, like RSScache (feed caching proxy) and KnowNow (even-driven syndication). RSScache seems to offer a realistic solution to the problem, but can this be enough to help RSS as it reaches an even bigger user base in the upcoming year?"

25 of 351 comments (clear)

  1. Push by Phroggy · · Score: 5, Insightful

    Remember all the hype about "push" technology back in the mid-nineties? Nobody was interested, but RSS feeds are being used in much the same way now. I'm thinking there are two significant differences: 1) with RSS, the user feels like they're in control of what's going on; with push, users felt like they were at the mercy of whatever money-grabbing corporations wanted to throw at them, and 2) a hell of a lot of people now have an always-on Internet connection with plenty of bandwidth to spare. When you've got a 33.6kbps dialup connection, you use the Internet differently than when you've got DSL or cable.

    How much bandwidth does Slashdot's RSS feed use?

    It looks like the RSS feed on my home page has a small handful of subscribers. Neat.

    --
    $x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
    $x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
    1. Re:Push by Anonymous Coward · · Score: 5, Insightful

      Pointcast sent way too much data at the time, and now we all have orders of magintude more bandwidth.

      Most of the problem come from a few older RSS readers that don't support Conditional GET, gzip, etc. With modern readers, there's essentially no problem (I've measured it on a few sites I run). Yes, they poll every hour or two, but the bandwidth is a tiny, tiny fraction of what we get from say, putting up a small QuickTime.

      There seem to be lots of people who freak out way to quickly about a few bytes. RSS sends to unnecessary data, but if you've configured things correctly, it's much smaller than lots of other things we do on our networks...

    2. Re:Push by sploxx · · Score: 2, Insightful

      Yes, maybe this way 'feels technically different', but if you have an RSS aggregator/news ticker applet whatever on your desktop, it usually hides the implementation details completely from the user. Do you really think of "ok, now my client makes a http request, that travels through the call hierarchy of the libraries, gets a tcp socket open, gets a kernel call of the driver to send a SYN packet??". Even if I may have detailed knowledge about the inner workings of an application, I usually don't care about it.

      BTW, it's the same about eMail and another good reason why the SMTP/POP suite should be replaced soon (besides spam).

    3. Re:Push by kimba · · Score: 2, Insightful

      DNS expiries and retries are completely configurable. You can set your zone to expire every 5 minutes if you want to. That is how these dynamic DNS places do it.

      Just because you have set up your zone to refresh every 12 hours doesn't mean its mandatory.

  2. They just need to follow ./'s lead by Neil+Blender · · Score: 5, Insightful

    And institute jackboot banning policies if you access them more than x times per y hours.

    1. Re:They just need to follow ./'s lead by Hatta · · Score: 2, Insightful

      And institute jackboot banning policies if you access them more than x times per y hours.

      I don't know much about RSS, but it seems kind of silly to have the user refresh. Doesn't that defeat the purpose? Why not just have the server send out new news as it gets it?

      --
      Give me Classic Slashdot or give me death!
  3. RSS readers don't cache! by IO+ERROR · · Score: 4, Insightful

    One thing that would help immensely is if RSS readers/aggregators would actually cache the RSS feed and not download a new copy if they already have the most current one. I could go through my server logs and point out the most egregious problem aggregators if anyone's interested.

    --
    How am I supposed to fit a pithy, relevant quote into 120 characters?
    1. Re:RSS readers don't cache! by gad_zuki! · · Score: 4, Insightful

      Sometimes you can't tell if you have the newest file, depending on the web server/config.

      The problem, is of course, server-side. For instance, the GPL blog software Word Press doesnt do ANY cacheing. Its RSS is a php script. So if you get 10,000 requests for that RSS, then you're running a script 10,000 times. That's ridiculous and poor planning. Other RSS generation is guilty of this crime.

      Yes, there is a plug in (which doesnt work at nerdfilter nor at the other wordpress site I run) and a savvy person could just make a cron job and redirect RSS requests to a static file, but that's all besides the point. This should all be done "out of the box." This is a software problem that should be addressed server side first, client side later.

      Not to mention, a lot of these RSS readers are big sites like bloglines, newgator, etc who should be respecting bandwidth limits, but really have no incentive to do so. RSS really doesnt scale too well for big sites. What they should be doing is denying connections for IPs that hit it too often or change the RSS format to give server instructions like "Dont request this more than x times a day" in the header for the clients to obey. x would be a low number for a site not updated often and high for asite updated very often.

  4. Doomed? It's barely got off the ground... by WIAKywbfatw · · Score: 5, Insightful

    What you're seeing right now are teething troubles. Nothing more, nothing less. The bandwidth and consumption experienced right now will be laughed off a couple of years from now as miniscule.

    Take the BBC News website for example. On September 11th 2001 its traffic was way beyond anything it had experienced to that point. Within a year or so, it was comfortably serving more requests and seeing more traffic every day. Proof if it was needed that capacity isn't the issue when it comes to Internet growth, and won't be for the foreseeable future.

    RSS is in its infancy. Just because people didn't anticipate it being adopted as fast as it has been that doesn't make it "doomed". By that rationale, the Internet itself, DVDs, digital photography, etc are all "doomed" too.

    --

    "Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg
  5. Re:Welcome to the internet by Svet-Am · · Score: 2, Insightful

    depends on your perspective. If I imagine myself to be a server, I'm pushing information to a client and pulling information from a client, like the name implies.

    you're interpreting it from the client perspective, which is not where the name came from.

    --
    [move .sig! for great justice, take off every .sig!]
  6. Re:Welcome to the internet by Anonymous Coward · · Score: 1, Insightful

    And the funny thing here is, if RSS had-- at its conception-- included caching and push-based update notification and all the other smart features that would have prevented this sort of thing from becoming a problem now

    What are you talking about? RSS had caching built-in from day one - it uses HTTP as its transport mechanism.

    Frankly, I fail to see the point of this article. It already mentions that there are significant ways around the bandwidth problem. Any decent non-polling technique such as pubsub offers a solution, not to mention the fact that virtually everybody complaining about RSS bandwidth use hasn't bothered implementing best practice - things like 304 Not Modifed, Cache-Control headers, etc.

    So the executive summary of this story is: some people are complaining about RSS bandwidth. Here are links to solutions. Oh no, sky is falling because I just have to include an ominous yet completely unsubstantiated prediction about next year.

  7. Bittorrent by Jherek+Carnelian · · Score: 3, Insightful

    Seems like bittorrent, or a bittorrent-alike protocol would be useful here. Turn the RSSfeed into a tracker/seed and then all it has to keep track of is who has the latest version of the content and it could redirect feeders to each other, always preferring the latest updated version. Eventually, you will have the same scaling problems that bittorrent has (single tracker), but at least you stretch things out a few months or a year until a better solution ocomes around.

  8. Re:Not a problem with RSS.. just humans. by kardar · · Score: 2, Insightful

    I wonder if advertising has anything to do with it - if you go to a news site just to see "what's up", you might get banner ads, google ads, so on and so forth - but RSS just makes a nice neat webpage for you or something similar.

    I have to point out how much I love "Sage", the Mozilla Firefox plugin for RSS - you can even rightclick on that XML thing that tries to tell you to save the page and bookmark it under "Sage Feeds" and then Alt-S and you have your RSS.

    I started using Sage for /., Groklaw, and a couple others and it's very cool. Very very cool. I hope the advertising revenue doesn't hurt people or whatever, but it's almost one of those things that would be worth money in how much time and aggravation it saves you having to deal with web designs that aren't as great as they could be.

    I've heard a lot about how people complain about Slashdot and the interface and the web design and so on, but Sage cuts down significantly on the time spent here, more or less - or anywhere, for that matter - I think it make the ./ or Groklaw or whatever experience BETTER, not worse.

    Only thing I can think of is advertising revenue.

  9. This issue was previously discussed elsewhere by Paul+Bain · · Score: 4, Insightful

    As RSS [becomes] more known to the mainstream users and press, the bandwidth issue reported by many sites . . . related to feeds is becoming a reality. Stats from sites like Boing Boing are showing a real concern regarding feeds bandwidth usage. Possible solutions to this problem are emerging slowly, like RSScache (feed caching proxy) and KnowNow (even-driven syndication). RSScache seems to offer a realistic solution to the problem, but [will it] be enough . . . ?

    Slashdot user GaryM posted a related question elsewhere about 20 months ago. At that time, in that forum, commenters dismissed his proposed solution, the use of NNTP, on the grounds that NNTP is deficient, but others continue to see NNTP as a possible solution nevertheless.

    --

    A lawyer & digital forensics examiner. Also an expert on open source software (OSS).
  10. If-Modified-Since, User-Agent by pbryan · · Score: 3, Insightful

    I'd be interested in seeing how many of these hits are for complete feeds rather than If-Modified-Since the last time it was downloaded. I suspect that if the RSS readers were behaving like nice User-Agents, we wouldn't see such reports.

    Perhaps particularly offending User-Agents should be denied access to feeds. If I saw particular User-Agents consistently sending requests without If-Modified-Since, I'd ban them.

    --

    My car gets 40 rods to the hogshead, and that's the way I likes it!

  11. Re:Not a problem with RSS.. just humans. by Anonymous Coward · · Score: 1, Insightful

    RSS feeds are meant as a way to strip all the nonsense from a site and offer easy syndication, right? Basically, present the relevent news from a full-fledged webpage in a smaller file size? If such is the case, this isn't an RSS issue, really.

    RSS has different use patterns to normal website visits.

    If you visit a website on a daily basis, you might pull down a single hit. You might lose interest after a couple of days. You might not visit at weekends.

    But if you subscribe to the feed (RSS, Atom, whatever), chances are, you'll be requesting that feed every hour for as long as you have your mail client/newsreader/web browser open. And even if you lose interest, a lot of people will still remain subscribed and just skim over what doesn't interest them.

    Newsfeeds make visitors "sticky". Normally, that's a good thing, but it's actually far better at doing so than need be, "capturing" visitors that really aren't all that interested, and inducing normal visitors to "visit" far more frequently than usual.

    That's the inherent difference between serving, say a cut-back XHTML Basic document, and serving an RSS feed that people can subscribe to.

  12. Slashdot's RSS blocking policy-$$$$ Kaching. by Anonymous Coward · · Score: 4, Insightful

    "Slashdot's RSS traffic, like Boing Boing's, is huge, and blocking broken readers has saved us a ton of bandwidth, which of course means money."

    So's using correct HTML, and CSS.

  13. corporate caching by chiph · · Score: 2, Insightful

    I wouldn't doubt that eventually someone will build a RSS caching device & sell it to the corporate market. Given how big a drain as RSS is to the supplier, the corporate market has the money and determination not to permit it to become a problem for them.

    Chip H.

  14. Re:Slashdot's RSS blocking policy by jamie · · Score: 3, Insightful
    Is there a reason Slashdot doesn't cache better? I'd think that'd save a lot of bandwidth.

    Not really. Our cache hit rate would be about zero. We update the homepage about once a minute, and the same goes for any page that any reader would be likely to reload within a reasonable time.

  15. RSS has already failed. by Anonymous Coward · · Score: 1, Insightful

    It was meant for syndication. So that one website could gather syndicated news from other sites. It was not meant for individual readers to use it as a news update service. Simply using an appropriate protocol would solve this problem, but do to the blogtard community, this will never happen. And so RSS is doomed to be used stupidly just like it is now.

  16. Re:Swarming (Like BitTorrent) is the answer by Quixotic · · Score: 2, Insightful

    i'm not sure the overhead of maintaining a torrent would be less than just serving up a single rss feed (or webpage, image, whatever small file). if i'm not mistaken, each client still needs to download the torrent from the main site to determine where it should download the payload from... and if you're going to do that, you might as well just serve up the small file.

    also, using a torrent might not work so well for sites like slashdot, which allows users to customize the homepage and/or feeds...

    --
    --
  17. Compression by yem · · Score: 2, Insightful

    I assume the complainers are using it?

    51894b boingboing.rss.xml
    17842b boingboing.rss.xml.gz

    --
    No, I did not read the f***ing article!
  18. Re:RSS + Bittorrent -- works for Podcasts... by Wesley+Felter · · Score: 2, Insightful

    Too bad podcasts are totally different from normal RSS feeds, because podcasts are about 100x larger. BitTorrent doesn't work for normal RSS feeds because they are too small and change too often.

  19. RSS hits that directly hit databases are flawed by smagruder · · Score: 2, Insightful

    I've seen many RSS URLs pull from a site's database to build the XML each time it's hit. This is fixed simply by creating a CRON job that builds the RSS XML on a periodic basis, then serving the resulting file. If you're just throwing a file back, then server bandwidth isn't as much of a problem, especially when you consider that browsers themselves cache files.

    --
    Steve Magruder, Metro Foodist
  20. It's not RSS's fault! by Anonymous Coward · · Score: 1, Insightful

    That's XML for ya!

    the best way is to optimize your rss feeds to a max of 10 items, and stick to TITLE and LINK fields only.

    Tom's hardware had a feed that was over 500kb, and they wonder why they had bandwidth issues.