Slashdot Mirror


Netscape Restores RSS DTD, Until July

Randall Bennett writes "RSS 0.91's DTD has been restored to it's rightful location on my.netscape.com, but it'll only stay there till July 1st, 2007. Then, Netscape will remove the DTD, which is loaded four million times each day. Devs, start your caching engines."

44 of 134 comments (clear)

  1. Redirect by cynicalmoose · · Score: 3, Insightful

    And they can't set up a redirect to the new hosting location?

    --
    Exercise your right not to vote. thinkoutside.org
    1. Re:Redirect by Otter · · Score: 2, Insightful

      Wouldn't they then be serving 4 million redirects per day? The point is that they need to eventually break it to make people stop relying on that path.

    2. Re:Redirect by AndroidCat · · Score: 4, Funny

      HTTP/1.1 301 Moved Permanently
      Content-Type: text/html
      Location: http://127.0.0.1/
      --
      One line blog. I hear that they're called Twitters now.
    3. Re:Redirect by werewolf1031 · · Score: 5, Insightful
      And they can't set up a redirect to the new hosting location?
      What in the world would be the point? That would merely duplicate the problem to a different location. As was clearly stated in the article by Mr. Finke, four-million hits every day is a crapload of bandwidth wasted re-downloading a file that will never change. The RSS 0.91 spec is finished, complete, and yes, for all intents and purposes, written in stone. Stop looking at it every damned day. It will not change. Ever. It's truly stupid for client-side software to be accessing it over the Internet to read its forever-static contents. That's like checking the writings of a dead poet every day to see if anything's changed.

      And any dev who codes his app to check a file like this every day instead of caching it client-side should be smacked oh-my-god-so-frickin-hard.
    4. Re:Redirect by AndroidCat · · Score: 5, Funny
      And any dev who codes his app to check a file like this...
      They might not even know that they're doing it if they're using Microsoft's Swiss Army Chainsaw XMLHTTP COM object and set the flags wrong.
      --
      One line blog. I hear that they're called Twitters now.
    5. Re:Redirect by BobNET · · Score: 2, Funny
      And naturally that's Microsofts fault? Not the developer who doesn't know anything about their tool?

      I wouldn't worry about it, many developers have firsthand experience with their tools...

    6. Re:Redirect by AndroidCat · · Score: 3, Informative

      I didn't say that it was Microsoft's fault. It's just that it's a powerful tool with thousands of uses that's simple (on the surface) to use, but it pays to read the fine print carefully because many things aren't obvious. (/me remembers wasting time wondering why my XPath queries weren't working...)

      --
      One line blog. I hear that they're called Twitters now.
    7. Re:Redirect by naChoZ · · Score: 2, Interesting

      And any dev who codes his app to check a file like this every day instead of caching it client-side should be smacked oh-my-god-so-frickin-hard.

      Ironic because Netscape is guilty of this poor practive themselves. I have an old sun u2 box that I recently revived. I had a copy of netscape messaging server/netscape enterprise server on it (used by the isp where I worked at the time). I wanted to archive some old mail off of it before I wiped the drive. I couldn't start it up because there were so many files containing references to http://developer.netscape.com/products/servers/ent erprise/dtds/nes-webapps_6_1.dtd which of course doesn't even exist. Couldn't even start up until I replaced all references to that file with local file uri links.

      --
      "I can be self-referential if I want to," said Tom, swiftly.
    8. Re:Redirect by Albanach · · Score: 2, Insightful

      To be fair, the article points out that they have already put in place a redirect.

      They point out that it might not be entirely sensible for millions of newsreaders to rely upon downloading a static file from the web each time they open a feed. Most newsreaders (like the one built into Firefox use a local cached copy.

      They restored the file so these newsreaders will continue to work for a period long enough that they can be altered to use a local copy.

      Whether it's reasonable or not for them to remove the file is, I guess, up to the reader to decide. Personally though, I think it's a fair point that you should never rely on a file hsoted on a server which you have no control over - the file can be altered, vandalised, or in this case simply removed without warning and without you being able to do anything about it.

    9. Re:Redirect by doom · · Score: 2, Funny

      It's truly stupid for client-side software to be accessing it over the Internet to read its forever-static contents.

      Hey, you're challenging one of the cherished principles on which the web was based.

      The next thing you know, you're going to be talking about the separation of document id from location.

  2. Not enough time!!!! by Anonymous Coward · · Score: 5, Funny

    Developers who made the mistake to use that external resource in their code most likely don't have the brain resources to adapt until July.

    (This is not a troll. Resignation and bitterness, maybe. But not a troll.)

    1. Re:Not enough time!!!! by Anonymous Coward · · Score: 5, Funny

      That is kind of like declaring PI to be a volatile double variable, in case it changes in real time...

  3. Why can't we just move it? by Anonymous Coward · · Score: 4, Interesting

    Developers should take the opportunity to move to Atom. In the mean time we could use something as simple as round-robin DNS to share the load or have Mozilla, Google or the internet archive host it. It's a historical document and should reside at a permanent URI.

    1. Re:Why can't we just move it? by Paulrothrock · · Score: 2, Interesting

      Kind of like Example.com. That was set up in RFC-2606.

      --
      I'm in the hole of the broadband donut.
    2. Re:Why can't we just move it? by metamatic · · Score: 2, Informative

      You know, if you're gonna be a smartass on this topic, you should at least understand the difference between a URI and a URL.

      There's nothing flawed about the notion of a permanent URI. A permanent URL is the tricky bit.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
  4. CmdrTaco by MagicM · · Score: 5, Funny

    Netscape Restores RSS DTD, Until July - from the that's-kinda-lame dept.
    Two Stargate SG1 Films Announced - from the good-for-them dept.
    Linux: x86 Linux Flash Player 9 is Final - from the i-still-hate-flash dept.

    Looks like somebody is having a case of the mondays.

    (On Wednesday.)

    1. Re:CmdrTaco by Valthan · · Score: 5, Funny

      I believe you would get your ass kicked for saying that to someone.

      --
      --Valthan
  5. I don't get it by Thansal · · Score: 4, Interesting

    I admit, I am not familiar enough with RSS. However this is a 2.3KB file that is not supposed to change. Why would developers NOT hardcode it into their RSS tools?

    --
    Do Or Do Not, There Is No Spoon, There Is Only Zuul. Everything in the above post is probably opinion.
    1. Re:I don't get it by jrumney · · Score: 5, Informative

      Developers use off the shelf XML parsers, which generally take care of validation for you. Netscape created this problem themselves when they stated in the spec for RSS 0.91 that well-formedness was not enough, RSS 0.91 feeds should be validated against the DTD. They then specified that document authors must use a PUBLIC doctype specifier, so the option of using a SYSTEM one (where the DTD is looked up in a local catalog) is not an option.

    2. Re:I don't get it by Anonymous Coward · · Score: 3, Insightful

      PUBLIC doctypes simply give the URI of the DTD, and are exptected to always resolve to the same content. But there's no requirement that you use the default resolver.

    3. Re:I don't get it by rholliday · · Score: 2, Interesting
      I'm also not an expert, but from what I know about DTDs they are supposed to be referenced when the content should validate against them. For example:
      <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
      "http://www.w3.org/TR/html4/strict.dtd">
      This is at the top of every Slashdot page. Should IE or FF break if the W3 were to remove that file? Certainly not. But should it be loaded and validated if possible? I believe so.

      If any XML or RSS gurus want to correct me on this feel free.
      --
      Xbox reviews.. We think they're funny.
  6. mirror ;) by jaredmauch · · Score: 2, Funny
    1. Re:mirror ;) by geoffspear · · Score: 5, Informative

      Great, the entire internet community can rely on one random person's server instead of on one really big corporation's server. That should fix things.

      --
      Don't blame me; I'm never given mod points.
    2. Re:mirror ;) by CokeBear · · Score: 5, Funny
      i registered nether.net before aol registered aol.com

      But you waited until (UID 633928) to register on Slashdot?

      Newbie.

      --
      Reality has a liberal bias
    3. Re:mirror ;) by RabidMonkey · · Score: 2, Funny

      "newbie"?

      thats so oldschool now, it's noob or newb now

      y'old fart.

      --
      We emerge from our mother's womb an unformatted diskette; our culture formats us. - Douglas Coupland
  7. Probably by Anonymous Coward · · Score: 2, Funny

    As that would give Google another way to track your every online move.

  8. Re: Because software evolves by mutation by 140Mandak262Jamuna · · Score: 4, Insightful
    No one ever writes a new XML (and most other Web2.0) application from the scratch. They all take an app they are familiar with and modify it to do new things. And some of the initial boot-strap processes are never looked into. If it works, dont mess with it attitude is pervasive. So someone long ago may be in a galaxy far away wrote an application that replicated and mutated by developers and others took it and did more mutations and it propagated. One side effect of this and similar cut&paste code development tactics is that bugs, security holes, inefficient algorithms, brain dead implementations also propagate.

    Richard Dawkins asks this very fundamental question, why reproduce (sexually or asexually) using seeds and embryos? Why not propagate by cuttings and cloning? It happens in nature. Many fern like plants do it. Bananas have been reproducing by new shoots. Then he discusses how harmful mutations too propagage and how going back to the basics and recreating the embryo selects the beneficial mutations and puts a check on deletrious mutations. Books The Selfish Gene, Climbing the Mount Improbable.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
  9. Let's be Evil by hackershandbook · · Score: 2, Interesting

    .. and I thought it was only Microsoft and Google that tried to "break the web" on purpose ....

  10. "Caching" not the answer by KrisWithAK · · Score: 5, Informative

    As I replied for the previous Netscape RSS DTD article http://slashdot.org/comments.pl?sid=216818&cid=176 03480, caching DTDs from the network is not the answer if there is the possibility they will not be there in the future:

    The proper thing to do is for your application to use an XML catalog for resolving entities/URIs and bundle the DTD files with the application. There is a good article at http://xml.apache.org/commons/components/resolver/ resolver-article.html that helped me out. In addition, if you are using Eclipse with the web tools platform, you can customize the catalog so it resolves DTDs and entities locally. See http://wiki.eclipse.org/index.php/Using_the_XML_Ca talog.

  11. Re:whoops by drinkypoo · · Score: 4, Funny
    Cant they sue Microsoft for stealing bandwidth, and bad design?

    Uh, if Microsoft could be held liable for bad design, their buildings would already have been burned to the ground, their women stampeded, their cattle raped, the ground sown with salt and the wells poisoned.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  12. Technical vs. Emotional by mmurphy000 · · Score: 5, Interesting

    (I tried posting this as a reply to the blog posting, but I'm not getting the confirmation email, so I'll post it here)

    From a purely technical standpoint, I agree with your assertion that, for well-baked files like RSS DTDs, clients should not be relying on a file hosted by an arbitrary service.

    That being said, please understand that the emotional message you're sending is: "Don't rely on Netscape".

    Why?

    Back when RSS was first starting out, Netscape's documentation said to use Netscape URLs for the RSS DTDs. Witness this page, published by Netscape, from late 2000:

    Now, a shade over six years later, Netscape is saying "Oh, yeah, what we told you to do? Never mind. We're not supporting it any more."

    If Netscape/AOL was shutting its doors, that'd be one thing. If the service in question was obviously onerous, that too would be understandable. Or, if Netscape told people "For the love of all that is holy, don't use our URLs for your DTD needs!" from the get-go (based on that document, you didn't), any such reliance would be our own fault.

    But, because AOL does not want to serve up two static files, each of which is smaller than the "Netscape Reports" graphic on the netscape.com home page, Netscape is abandoning a service they told people to use.

    So what are we to think about Netscape's current services and their long-term usability?

  13. Re:Not really by Bogtha · · Score: 2, Interesting

    I bet the bandwidth costs from attempted email delivery are huge even though there are no MX records and the server doesn't accept SMTP connections. In addition to spam harvesting, people like me have been using xyz@example.com to satisfy email address requirements for years.

    That's what the .invalid TLD is for, also defined in RFC 2606.

    ".invalid" is intended for use in online construction of domain names that are sure to be invalid and which it is obvious at a glance are invalid.

    --
    Bogtha Bogtha Bogtha
  14. Re:Spelling issue by JasonKChapman · · Score: 2, Informative
    What's so fucking hard about spelling "its" correctly?

    An old Jedi mind trick:

    Its apostrophe is missing, because it's been moved over here.

    --
    Sorry, I'm a writer. That makes you raw material.
  15. Re: Because software evolves by mutation by Vreejack · · Score: 2, Interesting

    That was insightful (hint to mods).

    Now we need software that can breed sexually.

    Or, more realistically, software that has a finer granularity and greater modularity so that the piece of ancient code that does this can be easily identified and swapped out, without needing to be understood by developers.

    --
    "Will future ages believe that such stupid bigotry ever existed!" -- Ivanhoe
  16. First woodpecker... by kabdib · · Score: 2, Insightful

    This is why whenever I hear the words "architecture" and "web" in the same sentence that I snicker. Unpolite, but OMFG who designed this junk?

    Oh, right. Nobody, really. It's amazing it works at all (... and sometimes it doesn't!)

    Djikstra's quip, "If programmers build houses they way they built programs, the first woodpecker to come along would topple civilization" was and remains insightful.

    --
    Any sufficiently advanced technology is insufficiently documented.
  17. Re: Because software evolves by mutation by Zak3056 · · Score: 2, Funny

    Now we need software that can breed sexually.

    Nahh, the risk of virus transmission is too high...

    --
    What part of "shall not be infringed" is so hard to understand?
  18. they don't by jonasj · · Score: 2, Insightful
    until this story broke, I didn't realize they still existed.
    They don't. They haven't existed since 2003. AOL is just using the name for a portal and IIRC a dial-up ISP service.

    http://www.google.com/search?q=%22Brand+Necrophili a%22&safe=off
    --
    You know, Microsoft's street address also says a lot about their mentality.
  19. Re:Why is it done this way? by Anonymous Coward · · Score: 2, Insightful

    You need to put a certain DTD URI into your documents because they essentially act like "magic cookie" values in binary file formats. It's the only way to tell if you're supposed to treat a document as HTML 1.0, 2.0, 3.0, 4.01, XHTML, HTML strict, HTML transitional, whatever. That information isn't encoded in the DTD, so there's no way to identify a file format simply by pointing at a random location with the identical DTD.

    The point of the URI is to act as an opaque identifier for a particular file format. Being able to fetch it is just a bonus, and a good programmer shouldn't rely on the resource being there at run time. URIs are used because the domain name system already delegates responsibility for namespaces; a different scheme could be used, but using DNS leverages the existing infrastructure. It's not perfect (as the RSS 0.91 example shows), but it works 90% of the time.

  20. Re:pi meter by Anonymous Coward · · Score: 2, Informative

    pi = any circle's circumference / diameter. At least on a Euclidean plane, anyway. It's a bit incorrect to think that the value of pi varies, as it's defined in a particular type of geometry. There are also purely mathematical ways to define the value of pi, independent of geometry. Still, the idea of measuring space to get the value of "pi" is a valid way to think about measuring the curvature of space.

    Now, let's say space is curved, like a sphere. (Like, oh, the one we live on.) If you draw a circle, say, the circumference of the Earth (along the equator), and then try to measure the "diameter" on the sphere (over one of the poles), you'll find it's much larger than the actual diameter (straight through the planet's core), and hence the "value of pi" will be much different. (In fact, it'll be 2, give or take a few decimal places since the Earth isn't a perfect sphere.)

    To create a "pi meter", you might think of a device consisting of a fiber optic loop, like in a laser ring gyro; you simply measure the amount of time it takes for light to go around to measure the circumference of the loop, as well as using another pulse to measure its diameter. If space curves (as indeed it does, although not in any way that's noticeable far from, say, a black hole), you'll find a discrepency between your measurement of circumference / diameter and the defined value of pi.

    You could also do the same thing with a piece of string and a ruler, but it wouldn't be convenient enough to call it a "pi meter".

  21. URLs, URIs and URNs 101 by metamatic · · Score: 4, Informative
    URLs are a subset of URIs. A URL defines a location where a resource can be accessed. A URI may merely be the name of a resource, i.e. a URN.

    For example, globally unique IDs in Atom feeds are often URNs, and hence URIs; but URNs aren't URLs, and you shouldn't need or want to try to connect to something just because it's used as a globally unique identifier in an Atom feed and looks a bit like a URL.

    This is relevant because many Internet specifications use URNs (or in the case of HTML, FPIs) as spec identifiers. For instance, XML namespace identifiers are URIs; and while some of them happen to be URLs too, the XML namespace recommendation says:

    The namespace name, to serve its intended purpose, should have the characteristics of uniqueness and persistence. It is not a goal that it be directly usable for retrieval of a schema (if any exists).

    In the case of RSS 0.91, Netscape wrote the spec, and they used a URL and told people to connect to it to fetch the necessary information to parse the file. They could have used a URN, but I'm guessing they wanted to keep their options open as far as changing the spec on the fly.

    (Of course, Dave Winer has a different approach to changing RSS specs on the fly...)
    --
    GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
  22. Or mitigate with cache headers by Kelson · · Score: 2, Insightful

    Sending Expires and Cache-Control headers that say "Don't bother retrying for 3 years" might help mitigate some of the bandwidth waste.

    That said, he's got a point that the feed readers should work if the DTD isn't retrievable -- but deliberately removing it looks like a great way to say "Netscape isn't reliable."

  23. Re:URIs by Schraegstrichpunkt · · Score: 4, Informative

    No. This is the perfect example of why a URI is not necessarily supposed to be treated as a URL. http://my.netscape.com/publish/formats/rss-0.91.dt d is just a unique identifier for the RSS DTD. It used to also be hosted there as a convenience, but your software isn't supposed to rely on that.

  24. Re:pi meter by alienmole · · Score: 2, Funny
    You could also do the same thing with a piece of string and a ruler, but it wouldn't be convenient enough to call it a "pi meter".
    Yeah, but if you attached the string to a sleeping cat's tail, then when the value of pi changes it would pull the cat's tail and the cat would jump, hitting the lever above its head, which would release a ball which would roll down a spiral ramp into a container of water balanced on a thin beam, so that when the ball sinks to the bottom of the container it would tip it over onto the cat, which would release an ear-splitting yowl... What was the question again?
  25. Re:whoops by symbolset · · Score: 2, Funny

    It would be returned for a refund.

    --
    Help stamp out iliturcy.