XML 1.1 Spec Hits Some Snags

← Back to Stories (view on slashdot.org)

Posted by Hemos on Friday October 18, 2002 @02:17AM from the battle-in-the-works dept.

oever writes "News.com reports that the new XML 1.1 specification defines a new newline character, making it incompatible with the 1.0 specifiation. Apparently, IBM has been pushing the new character to avoid having to modify their software, thereby invalidating everybody else's XML software."

7 of 257 comments (clear)

Min score:

Reason:

Sort:

It's only a candidate specification. by tomhudson · 2002-10-18 02:21 · Score: 5, Insightful

This specification is being put forth as a W3C Candidate Recommendation of XML 1.1.
If you don't like it, keep in mind that you CAN bitch about it and help change this.
Considering ... by DigitalDreg · 2002-10-18 02:30 · Score: 5, Insightful

That IBM gave the world SGML and XML by derivative ....

That a lot of useful data exists on IBM mainframes ....

That EBCDIC doesn't "cleanly" map into Unicode by design like ASCII/UTF-8 does ...

That this benefits IBM users and customers, not IBM because there is no strategic market position related to new-line characters ...

That this was a recommendation reached by a group ...

Let it live and get a life.
Re:One tiny little update ??? by PainKilleR-CE · 2002-10-18 02:33 · Score: 5, Insightful

IBM has contributed so much, it's only natural that some changes might be characterized in the news as benefitting them more than other parties. Is anyone that worried about adding a new EOL character in 1.1 that XML 1.0 "chokes" on ?

and, as an IBM rep pointed out in the article, XML documents are supposed to specify what version they're using at the top of the document. Any proper XML parser should read that it's 1.0 and interpret the newline character as 1.0 would.

--
-PainKilleR-[CE]
2 line summary by Shagg · 2002-10-18 02:39 · Score: 5, Insightful

1) XML 1.0 does not follow the Unicode spec
3) XML 1.1 makes a change so that it does follow the spec

What's the complaint again?

--
Unix is user friendly, it's just selective about who its friends are.
What do they mean, "XML 1.0 chokes"? by st.+augustine · 2002-10-18 02:42 · Score: 5, Insightful

Does anyone have a link to a page explaining what's really going on? Last I heard, XML doesn't even have a concept of newlines -- most of the time all white space gets normalized (collapsed). The only problem that I could see is if the character wasn't part of the spec for white space. Now, people may have written XML software that chokes, but I think that's a slightly different story. So is the problem that the new character shows up as bogus text content in elements? And is that true for all XML processing software, or does software that relies on a proper Unicode engine not have the problem? What's the deal?

--

-- Some things are to be believed, though not susceptible to rational proof.
Re:Read the Unicode spec.... by Anonymous Coward · 2002-10-18 02:52 · Score: 5, Insightful

Like the man says, read the Unicode specification! Unicode defines a far wider range of characters than simple 7 or 8bit ASCII text can cover, and the à is simply mapped into another Unicode byte pair. You won't loose the ability to use à in your XML documents, you just use Unicode.
*Shrug* by Fweeky · 2002-10-18 02:53 · Score: 5, Insightful

If you're using the XML prologue like you're supposed to, your XML 1.0 documents will have:
<?xml version="1.0" ?>
At the top. The parsers will then parse using the XML 1.0 specification and you won't notice a thing.

If you don't use it, tough luck, you should have followed the original recommendation more closely. Lucky for you it's not exactly difficult to automatically process XML documents and add the prologe later.