Netscape Dumps Critical File, Breaks RSS 0.9 Feeds
An anonymous reader writes "In the standard definition of RSS 0.91, there are a couple of lines referring to 'DOCTYPE' and referencing a 'dtd' spec hosted on Netscape's website. According to an article on DeviceForge.com quite a few RSS feeds around the web probably stopped working properly over the past few weeks because Netscape recently stopped hosting the critical rss-0.91.dtd file. Probably someone over at netscape.com simply thought he was cleaning up some insignificant cruft." Some explanation has been offered by a Netscape employee.
I would've seen this post sooner, but my RSS feed was broken... something about a 404?
I don't see how this would break RSS readers. DTDs pretty much never get read except by validators. Normal SGML and XML parsers just treat the DTD URL as an opaque string, not as something that can be retrieved.
It's almost as if the team retooling netscapes web presence has no idea about what a DTD is. I'd say it begs the question but in this industry reckless stupidity from clueless morons is the norm. Glad we switched to atom.
what is the probably web?
According to an article on DeviceForge.com quite a few RSS feeds around the probably web stopped working properly over the past few weeks because Netscape recently stopped hosting the critical rss-0.91.dtd file.
STOP, Grammar time. Ooooh whoooaaa oh oh...
Probably someone over at netscape.com simply thought he was cleaning up some insignificant cruft."
Or Netscape got tired of people using their bandwidth. Regardless of the reasons: if you reference a file on someone's site, it's hardly their fault if they move/change/delete it, and it breaks your stuff.
Please help metamoderate.
blame the people (programmers/management) who based their business operation and website programming solely on the security of a third parties business and infrastructure being available
DTD's should be held and referenced locally within your infrastructure and your control
"A distributed system is one in which the failure of a computer you didn't even know existed can render your own computer unusable."
RSS readers don't do anything with the DTD or try to access it at all. This didn't break anything.
Of course they retrieve it - unless they already have a local or cached copy. How else would they be able to parse a document marked up using a custom DTD?
Don't answer - go hang your head in shame.
Or maybe some smart person at Netscape decided to teach some people a lesson about using a 3rd party as a single point of failure?
a few RSS feeds around the probably web stopped working properly.
*blinks*
And if so, why would anyone rely on AOL to make something on the web work?
http://bgcommonsense.blogspot.com
Suck that, Web 2.0!
If you had even the slightest understanding of what a DTD is your opinion would be different and may even be valid.
Is to have a common component shared among many documents without replication.
Class paths is java are the perfect example to say how it *should* work. Java CLASSPATHs in every application/installation I have seen are site-local, all paths accessible without going over the internet to another site to get classes.
To be similar, an RSS site should copy this DTD to their local server, or to a server with which they have a concrete understanding of the relationship. Either a commercial agreement with a peer or at least using a server from an organization who explicitly defines the purpose of hosting to be a common place to promote it as a standard.
Did netscape promise itself to be an organization sharing that DTD explicitly, or did site developers get in the practice because 'it just always was there'?
XML is like violence. If it doesn't solve the problem, use more.
This is the precise reason why I host everything myself including my own series of tubes, dubbed the Internets. I host not only every file that my site uses, but I also have a program that regularly crawls the entire Internet and compresses it onto my own distributed system. That way I can browse the Internet by myself without worrying if someone else's system will fail. Although I do need to replace systems every now and then. But that's not a problem, b/c the distributed system has 3-5 copies of the Internet, each copy in a different place. Wait, isn't their some other company that does that? I can't quite place the name.
Seriously though, relying on some other system so your site will work is a recipe for disaster. It's similar to relying on someone to take you to work everyday. After a while, you get used to that fact that someone else is driving you that you don't even think about it. Then your driver gets deleted somehow. And you're stuck with no way to work.
Funny createSig(Witty remark, Odd reference)
{
return (Funny)remark + (Funny)reference;
}
Whenever someone accesses a RSS file, Netscape would know the IP for every access? How stupid can that be? Why don't the readers just cache the DTDs and fetch only if there's a problem?
It is expected that DTDs are hotlinked. For example, if you ever look at html source of a web page, you would see: on the top, and the hotlink goes to somewhere on w3.org. That is because W3 is the authority body that defines the html.
Since Netscape is the authority body that defines RSS 0.91, it is a bit strange how they stopped hosting the definition.
In any case, the missing definition won't affect software that processes RSS feeds. It only affects software that checks whether a SGML document is structured properly according to that missing DTD.
The main interest to this article seems to be the speculation how a deprecated web 1.0 company could end up hiring a clueless webmaster who deletes important files without recognizing its importance.
I once had a signature.
That's just wrong, you should try software authored by folk who know what they're doing. atom is XML, even Microsoft MUST parse it as an XML doc before internally converting to RSS for IE7, right?
Do daemons dream of electric sleep()?
This could seriously affect both of the guys using Netscape.
What? You've never heard of a probability web before? ...Holds the improbability drive at NS in place?
I'll believe in corporations having personhood when Texas executes one... - advocate_one
Non-validating processors are not required to read any external DTD subset.
Isn't w3.org supposed to host DTDs? Sigh.. what is this, I don't understand? Every "doctype" there is links to w3.org. Seriously, NETSCAPE?! A commercial organization!
How is this any different than the W3C doctypes ala
? Most web tutorials tell you to use the remote file....but if it ever goes down what happens? Is there a reason for it being taught this way?
Once you start despising the jerks, you become one.
From April 2001, "Netscape removed the RSS 0.91 DTD from their website. This means that all RSS feeds which depend on the RSS 0.91 (many, MANY news sites) cannot be used with a validating parser."
/. discussion (which, um, I haven't read) remains.
It seems as though it just took them 5+ years to follow up on the threat? Primary links are broken, but of course the lively
Non-validating parsers are still required to check documents against an internal DTD subset. Who builds an RSS reader using a non-validating parser without support for all 9 incompatible RSS versions? Obviously a reader attempting to retrieve this RSS DTD is built on a validating parser.
my.netscape.com is undergoing a redesign, and when we announced the redesign about 10 days ago, the DNS entry for my.netscape.com was changed to point to the new server where My Netscape will be living. This had the effect of making anything under the old my.netscape.com unavailable, since the only thing public on the new server is a splash page. (Nobody on the team was especially aware of this DTD file since all of the old Netscape employees were let go last year around the time Netscape.com was redeveloped; anybody working at Netscape now was hired since then.)
Now, why this file was living under my.netscape.com is anybody's guess, but we'll have it restored ASAP. I only wish that someone had brought it to our attention so that I didn't have to find out about it from Slashdot.
Christopher Finke
Netscape Developer
Netscape? What's a Netscape?
End of Line.
If I would create a reader that was dependent on version 0.91 of the distribution, it sure as hell would include the DTD in local storage. It makes no sense to create a reader that can also use, say, version 0.92 since you would not know what had changed (and there is no such thing as inheritence between versions of a DTD afaik). Actually, as other readers noted, it would be terribly stupid to make your web-server or client rely on a third party computer for which you cannot guarantee the uptime.
These URL's are mainly there for their Uniqueness, not so much as for a place of quaranteed storage. Of course, they are also a nice place to look for the actual definition, but after that you would need a local repository. This is the first thing an XML library should support, and the first thing a moderately intelligent programmer should look at. I get *very* annoyed if this kind of basic rules are ignored. And I've even seen them ignored by people pointing to the XML digital signature definitions, where security and reliability should be the first requirements in the design.
Also, what would happen if w3c.org or netscape.com go the way of the Gopher? If they go bust? It's a quickly changing world out there.
This blast is not squarely aimed at you, but you triggered it. Treat this in the spirit it is meant please (if I didn't give a crap at all, I wouldn't comment. Show this to your insulated bosses who don't know the first thing about community and transparency. Kudos to you BTW for showing initiative and acting on a Slashdot post. Honestly, I would not have given the "new Netscape" that much credit.).
:-)
>I only wish that someone had brought it to our attention so that I didn't have to find out about it from Slashdot.
This rankles.
Have you EVER tried contacting Netscape from the outside world? Seriously, I can count the number of times:
*) When my.netscape.com locked out Konqueror (1998?)
*) When my.netscape.com WITHDREW the ability to embed RSS feed on your "my" page -- actually this was PRE-RSS if I recall. Way before it was commonplace, you could embed Slashdot and Linux Today feeds. Then they killed it, presumably because they got enough users or some pointy haired reason. 1999.
*) When my.netscape.com adopted a shitty policy of DELETING all your mail if you don't login for 30 days. This did not seem to be publicised by an actual email. They don't seem to delete the mailbox itself, which violates RFCs I'm sure and basically insinuates the mailbox is active. I lost tons of mail from 1996-2003 (yeah yeah backups. Some things I didn't think I would need later). ?? Happened in 2003. Note that mailboxes were only 5MB still, so I quickly bailed for a 100MB Yahoo account.
*) The 2001 deletion of Netscape Developer. This lost a ton of Netscape copyrighted Javascript documentation.
Just TRY contacting Netscape from their page. The best you can do is use the WRONG FORM to submit to some contracter who won't forward it. Or, oh yeah - there's a 900 number for by the minute Support.
Back when it mattered, there was no 'Google Guy' for Netscape, who would act as an unofficial liason. After Jamie Z left, no one internally tried to fill the shoes of a community facing employee.
While I'll be eternally grateful for Netscape's open sourcing of their browser. What a different world it is now. Too bad that step is something the current management would never have allowed (that's the perception). I can't think of a more opaque Internet company than today's Netscape. I'm sure there are people who disagree or wish it could be changed (you're here..) but that and a $1 gets you a cup of black coffee. Show this to your boss - there are suggestions here
It is not the first time that Netscape dropped important files and Netscape didn't care in the past. E.g. the RDF schemata http://home.netscape.com/WEB-rdf and http://home.netscape.com/NC-rdf bite the dust some time ago and Netscape gave a fucking fart. Other format specs are also gone for years, but I am to lazy to look up what whent missing years ago.
But hey, from where I sit Netscape is anyhow run by a bunch of liberal arts graduates and doesn't have any technical competence left.
not trying to be a troll here.. but.. one would think that that file would have been accessed quite often and that would have shown up in the logs...
If I was a new hire at some old company where everyone else had been let go, I'd at least check out the logs and see what is being used? and then if some file is being hit 1,000's of times a day.. maybe ask a few questions..
http://www.hawknest.com/
..the other one ?
Last time I looked, if the RSS 0.91 feed references the DTD, IE7 refuses to display it anyways... it's only of the DTD references is removed that IE7 "works" "properly"
Does reading in the bathroom count as multi-tasking?
Once upon a midnight dreary, while I websurfed, weak and weary, Over many a strange and spurious bookmark of 'free news galore', While I clicked my fav'rite feed, suddenly there came a warning, And my heart was filled with mourning, mourning for my dear amour. "'Tis not possible," I muttered, "give me back my free news source!" Quoth the server, "404".
If you were offended by anything I said... No, I'm not sorry. Please lighten up.
Anyone remember the incident, maybe 8 years ago, when the root DNS servers dropped the entry for localhost, 127.0.0.1? We had a lot of random code break because of that.
Wow.
/. frontpage from 4+ years ago...
1 1
/. again in 2013.
I'd swear today I was looking at the
After a little searching, I've found the exact same story, from April 2001:
http://slashdot.org/article.pl?sid=01/04/28/21192
Those who forget history are doomed to repeat it.
I look forward to reading this story on
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
The web needs some scheme for content based addressing. Like the urn:sha1 scheme used in gnutella. This (and some sort of reasonable caching scheme) would do a lot to alleviate problems like this. It could also help a lot with the Slashdot effect.
Need a Python, C++, Unix, Linux develop
RSS readers just cache the DTDs and refer to them locally instead? It's not like they're going to change or anything...
Referencing the other topic today...
You mean to tell me that every RSS reader references - and actually tries to FIND and DOWNLOAD - a specific SPECIFICATION hosted on ONE SITE ON THE PLANET?
Are you people utter fucking morons or what?
I can't believe design decisions like this.
I'm especially irritated because I have just spent the last week trying to find an rdiff-backup or rsync that functions on Windows WITHOUT A FUCKING 2GB or 4GB FILE SIZE LIMITATION! Even the Cygwin people could only tell me to "try it"...How about putting it in the fucking FAQ what the file size limits are?!
We HAVE LARGE FUCKING FILES THESE DAYS, MORONS!! We've HAD THEM FOR FIVE YEARS!!
Oh, nobody can be bothered to support librsync anymore...right...
Only Fedora Core 5 bothered to patch it so rdiff-backup can handle large files...
Backup isn't "sexy" like 3D wobbly windows, so nobody cares...
Chimpanzees. Jesus Baron von Fucking Christ!
Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
OMGZ! Netscape is still around?
How critical is the guarantee that the local copy is identical to the remote copy in this case? If not, then why not cache a known good copy locally and then use the cached copy when the remote one is not accessible?
so what was the inspiration to sack the entire netscape team?
And this is not the first time it has happened. Every time I install netscape on an old iMac, it stalls out for like 90 seconds on initia launch because something else on your server the browser depends on has been deleted. A few times I had a very hard time installing Netscape because it refused to quit digging for that page on launch. I never did figure out exactly what it was it was looking for. (something to do with first run registration I would guess) This happened about what, a year ago.
So, how many other things in Netscape will blow up if netscape.com goes away? Since that would mean netscape was out of business, does anyone at netscape care? I've had to throw out software I paid for because the author's web site went away and prevented me from reinstalling the app, and that's REALLY annoying.
I work for the Department of Redundancy Department.
I'm no RSS expert, and I don't spend that much time dealing with it, honestly.
However, I do spend a LOT of time dealing with XML. And many environments offer quite an elegant way of dealing with this: the catalog file.
The catalog file provides an alternative mapping between the public identifier and a local, system referenced file SO THIS SORT OF THING DOESN'T happen.
If the RSS standard is to be dependant on external doctype definitions, the RSS readers should either:
(1) Include the doctype hosted on their own system and change all of their doctype public URI's or (2) The RSS readers should include the requisite files locally, and contain a catalog that maps the public URI's to the locally deployed model.
Like I said before, I'm not so into RSS, so I'm not sure which is applicable, but certainly, I don't think that Netscape should have to host this...
Small correction: It actually has 5 (< > & " '). The last one wasn't in HTML4.
You know, Microsoft's street address also says a lot about their mentality.
I thought slash would escape the entities. Anyway, here they are: < > & " '
You know, Microsoft's street address also says a lot about their mentality.
The file is now restored, but it will not be available forever. See this post at the Netscape blog for the full details.