Tim Bray On The Origin Of XML

← Back to Stories (view on slashdot.org)

Tim Bray On The Origin Of XML

Posted by Zonk on Friday March 18, 2005 @02:35PM from the makes-feed-users-happy dept.

gManZboy writes "Queue just posted an interview with XML co-inventor Tim Bray (currently at Sun Microsystems). Interestingly enough the interviewer is none other than database pioneer Jim Gray (currently at Microsoft). Among other things, in their discussion Tim reveals where the idea for XML actually came from: Tim's work on the OED at Waterloo."

8 of 218 comments (clear)

Min score:

Reason:

Sort:

SGML by Anonymous Coward · 2005-03-18 14:50 · Score: 3, Interesting

I think it's very funny that XML looks like it is based on SGML.

But according to the interview, it seems that the similarities are merely coincidental.
This is article is amazingly honest by tabkey12 · 2005-03-18 14:59 · Score: 4, Interesting

JG I assume that the burning issue was keeping it simple.
TB And we missed. XML is a lot more complex than it really needs to be. It's just unkludgy enough to make it over the goal line. The burning issues? People were already starting to talk about using the Web for various kinds of machine-to-machine transactions and for doing a lot of automated processing of the things that were going through the pipes.
Amazingly, for such a popular method of 'communication' between and within applications, XML is admitted by most to be rather flawed and bulky...

--
Get a free iPod Nano 4GB!
1. Re:This is article is amazingly honest by Camel+Pilot · 2005-03-18 15:18 · Score: 3, Interesting
  
  I current working on a project that is doing machine-to-machine transactions. We started off using XML to bundle and unbundle the data. However as the data rates went up performance went south.
  
  Some bright bunny came up with the idea of using perl stringified data structures instead using Data::Dumper.
  
  On the receiveing end the data structure is Safe eval'ed and viola there is the data - orders of magnitude faster and there is still the ability to read or edit the data via text editor.
  
  XML is just a representation of hierarchy data via named parameters and list. Perl (or Python if want) or very adept at parsing code strings.
  
  Also with code structures you can add dynamic functionality like
  
  'rsv_time' = localtime(time)
  
  which you can't with XML...
Why, oh why, did they have to repeat the tag name? by Anonymous Coward · 2005-03-18 15:18 · Score: 3, Interesting

I work with XML every day. And every day I wonder the same thing: why the hell does the end tag name have to be repeated? Why can't it just be optional? In other words, why can't it just be abbreviated as: <tagname>data</> ?

Oh MAN I wish they could have done just that one little thing for us. It would cut our datagram size down by at least 30%, maybe more.
Right in front of you, Tim! by Anonymous Coward · 2005-03-18 15:36 · Score: 4, Interesting

You know, the people who invented XML were a bunch of publishing technology geeks, and we really thought we were doing the smart document format for the future. Little did we know that it was going to be used for syndicated news feeds and purchase orders.

The most amazing thing is that back then in 1995-1996 at Open Text we were already using SGML as a data exchange protocol. All of us there (including Tim) ought to have known that XML would also have a life as a computer-to-computer communication protocol. Problem was that at the time so much of the SGML discourse was wrapped around the content versus format debate that we missed the obvious: the main of use of XML was not a replacement for HTML as a text format for the web, but as a kind of uber ASCII to allow the ready exchange of data between disimilar applications (just like ASCII in its time had eased the transfer of data between dismilar hardware and/or software platforms).
Semantic web snake oil... by Alomex · 2005-03-18 15:45 · Score: 5, Interesting

TB: I spent two years sitting on the Web consortium's technical architecture group, on the phone every week and face-to-face several times a year with Tim Berners-Lee. To this day, I remain fairly unconvinced of the core Semantic Web proposition.

Everyone who has actually done work on knowledge representation in the real world knows that this is a huge, difficult problem, unlikely to be solved anytime soon, as Tim Bray claims.

The only people who claim otherwise are either frauds or ignorant. The Semantic Web initiative has both: Tim Berners-Lee is very smart, but not a computer scientist, so he's not aware of the size of the challenge, plus he's a genuinely nice person, so he tends to trust others too much.

He has surrounded himself with the snake oil AI salesmen from the early 1980s who had promised us impending ubiquitous intelligent computers. Those fraudsters got found out back then, and spent the next fifteen years in academic limbo, only to be rescued by Tim Berners-Lee naivete.
Re:Oh boy... by Evil+Grinn · 2005-03-18 15:46 · Score: 3, Interesting

replacing compact, binary config files with 'human-readible', resource-intensive XML

Like what, the Windows registry? Don't say shit like that or ESR will shoot with one of those guns he collects.

http://www.faqs.org/docs/artu/ch03s01.html#id288 82 98

--
where there's fish, there's cats
Intra-vendor XML is (usually) stupid by mi · 2005-03-18 16:18 · Score: 5, Interesting

It drives me up the wall, that my employer is using XML to let parts of their own application communicate with other parts. DTDs are not used and all parts still need to be modified/recompiled whenever one of them changes. Same people maintain both ends of the communication.
Theirs is, in reality, a proprietory format, but to stay buzz-word compliant they use XML, which hurts performance -- sometimes dearly...
For example, to pass a couple of thousands of floating-point numbers from front end to a computation engine, each is converted to text string with something like <Parameter> around it. The giant strings (memory is cheap, right?) are kept in memory until the whole collection is ready to be sent out... The engine then parses the arriving XML and fills out the array of doubles for processing.
It really is disgusting, especially since freely available alternatives exist... For instance, PVM solved the problem of efficiently passing datasets between computers a decade ago, but nooo, we only studied XML in college -- and it is, like, really cool, dude...

--
In Soviet Washington the swamp drains you.