Slashdot Mirror


XML and Transcoding - How Would You Do It?

morzel asks a doosy: "XML is one of these words everybody's talking about yet no-one really knows how to use it in specific applications or server technologies. At the Apache XML Project, some work is being done on integrating XML/XSL in the server itself, but personally I like IBM's idea of a transcoder in between a range of (XML) servers and a range of clients. But... how can it be done?" (More)

"Suppose you have to develop an on-line application, and you'd want to go with XML on the server side, and everyday browsers on the client side. Portable platforms like Palm and WAP-enabled phones will probably be a client platform that is being used frequently.
What tools -open source or commercial- are available to accomplish this?

The elements of the system are:

  • XML Enabled Database system: Data is retrieved by the transcoder using HTTP or your favorite protocol
  • Transcoding gateway: should translate the XML data using XSL (or another way) to a form readable by the client. The exact translation or the XSL to use can be set by the server (included in the XML source), or be detected by the gateway.
  • Browsers of all colours and kinds.
A typical usage of this system would be the publishing of an on-line application without having to bother with client troubles except for writing the XSLs. I do web development, and the amount of work that goes into making sure every platform works as it's supposed to be is way too much in comparison to the functionality of the system. Specially when exotic clients like PDAs and WAP mobile phones are requested client platforms (e.g.: a sales follow-up app), the burden of getting everything working and having a UI that does the job is a real nightmare...

XML is the wave of the future, that's for sure... But what tools are available to actually incorporate XML in a system that can do all things we poor webdesigners dream of?

All suggestions welcome! "

44 of 139 comments (clear)

  1. Standard formats needed... by pb · · Score: 3

    Lisp has been doing this stuff forever. Maybe it'd be a good idea to look into the formats that expert systems use to exchange data; I bet they're pretty generic.

    Of course, that won't happen, we'll all make our own stripped-down, human-readable versions, with big gaping flaws, until someone either standardizes it, or hides something nasty and binary with a GUI and dominates the market (*hint* I wonder who wants to use XML and "open standards"....) So let's try to come up with a real open format now, instead. :)
    ---
    pb Reply or e-mail; don't vaguely moderate.

    --
    pb Reply or e-mail; don't vaguely moderate.
    1. Re:Standard formats needed... by jilles · · Score: 2

      We don't need another data exchange format. XML is pretty adequate for more advanced stuff we have SGML.

      What we do need are tools to manipulate XML. The tools for reading and writing XML are already there. What we need next is tools to transform XML documents (the standard to specify these transformation already exists: XSL).

      I think there are several initiatives in this direction. (sorry I don't have any references).

      Like many people I see a great future for XML but I think the coming few years will be characterized by a lot of redundant programming since everybody will individually attempt to implement more or less the same components. It would be nice to see some reusable components on the serverside.

      --

      Jilles
  2. Mino XML parser by ChipX86 · · Score: 3

    Well, this is kind of a shamless plug, but I'm developing a XML parser at http://mino.portaldesign.net. It is LGPL. The library can be used in any programs and the parser that comes with it can be used for converting XML files to HTML on-the-fly.

    I'm working on XSL support (so people can easily say what XML tags should become in HTML), so that should be done in the (hopefully) near future. For now, feel free to download the latest alpha and play with it.

    In the near future, I plan to have support for databases, CSS, XSL (as mentioned above), and a few other XML-related technologies.

    People familiar with C/C++ should easily be able to write custom modules for converting from XML to HTML using the library by looking at the examples in xmlhandlers/. Anyone want to help develop this?

    1. Re:Mino XML parser by ChipX86 · · Score: 2

      XML is useful for many things. In fact, the next version of HTML (called XHTML) will be based off XML.

      One use for XML is that you can develop entire sites using your own tag set instead of HTML. For example, if you want to represent a list of books in HTML, you would probably setup a list of items. In XML, you can do:


      <book>
      <name>Some Book</name>
      <author>Some Author</author>
      </book>
      ...


      Which is much easier to understand. Using XSL (a stylesheet language for XML) or a parser built specifically for your tag set, that <book> tag and its subtags will actually mean something.

      Writing your entire site in XML has other advantages. For example, let's say you have 100 pages on your site, all written in HTML. Now you want to change the layout of the entire site. You would have to modify the HTML of all 100 pages. If all those pages were written in XML, however, you would have to modify only one file, the XSL stylesheet.

      XML also has support for namespaces. A namespace (in XML) is a group of tags. Each namespace has a URI. For example, the upcoming XHTML 1.0 namespace is http://www.w3.org/1999/xhtml (that link does not actually exist though). Namespaces are very useful. If you were writing a document in XHTML and wanted to include tags from your own tagset, you would call in your namespace, and you would suddenly be able to use your own tags.

      My parser will have XSL support soon. For now, you can write the modules in C/C++ and the parser will load them automagically using the namespaces and parse the XML.

      I have a few articles/tutorials I've written over at gelicon.com on XHTML, XML, DTDs, and namespaces. Hopefully they will offer a better understanding.

  3. XSLT is a Great Idea by costas · · Score: 2

    Probably one of the few truely great ideas in the Web development industry. It means freedom from client peculiarities --forget about all writing for all those different browsers again and again, just one huge translator template will (e.g. XML->Opera-compatible HTML, or IE-compatible HTML or AvantGO, etc). It means that potentially the same server can be serving not only PCs, laptops, PDAs and the like, but also other software, by reading plain XML, or some subset of it.

    In the OSS arena, the best example of XML on the server=>HTML (or for that matter anything else) on the client is Cocoon. I played around with Cocoon 1.x a little bit and it's very impressive architecturally, but even the principals agree that the performance isn't there yet. I am eagerly awaiting for Coccoon 2 though ;-)...




    engineers never lie; we just approximate the truth.

  4. Related: Client-side data on demand? by Amoeba+Protozoa · · Score: 2

    One thing that I heard the wonderful-world of XML was supposed to allow was data on demand. A user clicks an XML/XSL defined element such as a button or piece of hypertext and the page updates without reloading.

    This was the theory anyway...has anybody heard of such an implementation, or does anybody know if it is in a future spec?

    One application (which is badly needed on the web, I think) is a dynamic collapsable tree. Imagine if you will a SlashDot comments page (not to hard, as you are looking at one!). Now, instead of getting a page-full of comments that take a healthy amount of time downloading (depending on your threshold settings): imagine clicking on a message to expand more comments in the thread which are fetched dynamically. You could resort, change moderation thresholds, and lots of other nifty dynamic operations without having the server do all the work.

    -AP

    1. Re:Related: Client-side data on demand? by jilles · · Score: 2

      XML will actually allow for less network trafic. The reason for this is that you will be able to manipulate an XML DOM tree on the clienside with for instance javascript. This means you won't need to contact the server for simple things like sorting a list.

      A second reason you'll have less network trafic is that you don't have to put layout information in the XML files. Rather you download a separate XSL file (which can be cached). Subsequent communication consists of data only.

      Microsoft has some nice demos on their site (yes I know it's propietary and all but it's there) and I think mozilla also has a few nice demos.

      --

      Jilles
    2. Re:Related: Client-side data on demand? by jilles · · Score: 2

      Convert the demos you saw to ordinary HTML (without losing features) and you'll see the amount of communication increase since you can't do much at all on the clientside. With HTML you always have to transfer layout information since that's all you got. You can to some extent manipulate the client side DOM model but in practice you'll let the server handle more complex things like sorting data (resulting in a reload of the entire page rather than a small XSL file).

      Anyway, I don't think that the bandwidth problem is caused by either HTML or XML. The real problem is the objects that are referenced like for instance gif or jpg images and that won't change I'm afraid.

      --

      Jilles
  5. On the browser by Gleef · · Score: 3

    Ideally, browsers should develop to the point where they understand XML as well as HTML and XSL as well as CSS. There has been significant effort to do this in the Mozilla browser, the XML/CSS combo works quite well, and the person developing an XSLT (XSL Transformations) engine for Mozilla is talking about having something useful around May. Similarly, Internet Explorer 5.0 has a base understanding of XML (styled with CSS), and surely plugins for decent XML/XSL encoding for IE are likely to appear soon after Netscape shows that it's a feature people demand.

    In the meantime, there are some Java Servlets out there to do the transformation on the server side. The server will grab the XML and XSL file, do transformations, and output HTML (or whatever format) to the client. I haven't played with them enough to recommend one as being particularly better, but there's some handy stuff out there.

    ----

    --

    ----
    Open mind, insert foot.
    1. Re:On the browser by morzel · · Score: 2
      Ideally, browsers should develop to the point where they understand XML as well as HTML and XSL as well as CSS.
      That's the ideal situation...

      The current situation however is that there is a plethora of browsers, which is growing rapidly, with big differences among them, between OSes and even between versions...
      To develop a number of websites, one can simply not assume that users will have a specific browsers on a specific OS of a specific version...

      We might evolve to a pure XML/XSL/CSS browser eventually, but until then, there has to be a different solution that can serve today... You would be amazed how much people still use Netscape 3, just because they don't have the urge to upgrade...

      Java servlets are a technique, but again: it's built into the server. There are a number of servers out there, that don't have these servlets, so that another solution would come in reaallllyyyy handy.


      Okay... I'll do the stupid things first, then you shy people follow.

      --
      Okay... I'll do the stupid things first, then you shy people follow.
      [Zappa]
  6. Uses of XML in the real world... by Augury · · Score: 2

    The reason we use XML in our multi-tier solution is simple. ADO cannot support detached, hierachical record sets.

    In our case, this meant we had to find a way to store that hierachical information, which is vital to the front end, in an intermediate format that did not put load on the database itself.

    The reason for that, of course, is that when you're running a distributed application to potentially thousands of clients, you want any database hit to be as few, fast and clean as possible.
    That means we can't sustain connections to the DB.
    That means we have to use disconnected record sets.
    Disconnected recordsets don't hold hierachy information, and that means that we have find some other way of hitting the database once, getting enough data to build the hierachy externally, then shutting down the DB link.

    XML provides the functionality we need to parse a flat recordset back up to a hierachical structure, without hitting the database again. It also has the added bonus that when it comes to presenting the front end in a browser, we can feed it directly to the browser if it's "XML compliant" (IE5, though there is a patch for IE4).

    B.

    PS: You'll also find that XSL can do similar things to your XML as CSS does to HTML ;)

    1. Re:Uses of XML in the real world... by X · · Score: 2

      In my experience you can do the same thing if you have a proper middle tier (application server) between the database and the client. There are roughly 1000 products that do this, and none of them need XML to do it.

      XML is just a consistent way of presenting information, not some major enabling-technology.

      --
      sigs are a waste of space
  7. server to server / business to business by SPorter · · Score: 2
    XML is a killer technology in business to business / server to server communications. Example? I'm involved in a project designing a website selling widgets and widget service plans. (The product name has been changed to protect the innocent.) We use XML to:
    • communicate with the order fulfilment organization to check on inventory
    • gather shipping information from the fulfilment organization
    • submit order requests to the fulfilment organization

    The widget order fulfilment organization has a server that speaks XML over HTTP. We created a widget on our server to talk XML over HTTP to it. Instead of spending weeks to work out how to communicate with some proprietery server in proietary format we spent a few days interfacing our servers.

    XML = server to server / business to business killer technology

    The consumer may someday directly use XML but I don't see that coming soon on a broad scale. HTML (with Java, Javascript, CSS, etc.) will (IMHO) be the way consumers work the web for the near future.

    Of course, I could be wrong.

  8. XML FAQ by jkorty · · Score: 4

    The XML FAQ is here.

  9. Beware XSL by rambone · · Score: 2
    I would advise against anyone using XSL.

    Looking at any non-trivial XSL stylesheets, you can see what a generally bad idea it is.

    My advice would be to use a real programming language with DOM bindings.

    XML.com has a good article regarding XSL:XSL considered hamrful.

    Note that XML.com also has some pro-XSL articles listed, but they aren't nearly as persuasive.

    The bottom line is that the W3 "ordained" XSL to be part of the grand scheme of things, although the technology hasn't been developed in response to any particular problem.

    1. Re:Beware XSL by anthonyclark · · Score: 3

      Looking at any non-trivial XSL stylesheets, you can see what a generally bad idea it is. My advice would be to use a real programming language with DOM bindings.

      I wouldn't write off XSL on the strength of that article at xml.com...

      When I first looked at XSL some months ago, I thought that it would be a messy and difficult language. I was wrong. XSL, IMHO, is the right solution for translating XML into pretty much anything. Yes, it does have a steep initial learning curve (much like our favourite OS :-) but once that is out of the way, you understand why the language is so useful. Why does it look so unwieldy? Because it's a "dialect" of XML. (Which I think is a good thing - it shows how flexible XML is) Typical XSL is as simple as saying "if you encounter this XML element, do this with it." Editing XSL text is really quite easy with the correct syntax highlighting. (TextPad is a good editor under windows)

      As for non-trivial XSL stylesheets? On our project, we have written XSL to transform our XML data into binary outputs. The stylesheets used ran into tens of thousands of lines! I think that qualifies for non-trivial in anyone's book. I admit that the XSL is difficult to read, but show me any source that is easy to read when >10k lines...

      XSL as a complete solution? No. Even in a relatively simple XML to HTML documentation tool I wrote, I called the XSL from a JavaScript app that handled things like file access and other helper functions. This was under Win2k, using the built in script engine to call the XSL via COM. (yes, even MS get's things right sometimes) The point is that XSL is better for tranforming XML than trying to use a DOM-manipulating language binding...

      On another note, why does everyone assume that XML is solely for exchanging data on the web/net? I've used it for documentation, log files, test cases, application persistence and application exchange formats. It's a lot more useful and flexible than people think.

      --
      ----- Documentation is worth it just to be able to answer all your mail with 'RTFM' - Alan Cox.
  10. Perhaps you should take a look at M$ ! by Taco+Cowboy · · Score: 2




    Believe it or not, the open-source bug has biten M$ !

    Look into M$'s sponsorship of the Schools Interoperability Framework (www.schoolsinterop.org) and maybe you can see how M$ plans to use XML (and its derivative) in real world application.


    --
    Muchas Gracias, Señor Edward Snowden !
  11. XML by trance9 · · Score: 2

    The key insight into XML is that it should be used only where other solutions fall apart. XML is one of those technologies that is so general, so abstract, and so powerful that you can construct a solution for ANY problem.

    The downside is that the solution will involve extra processing steps, extra stuff to be implemented, and impose on you a development model that might not always be convenient (not everything wants to be a document, or a conversion or transcoding between document formats).

    However, there are many cases where XML is the only viable solution, and in those cases you're just glat you can solve the problem at all! A typical example is when you have documents coming from multiple sources, and you publish them to multiple targets. It's easy to see what the XML solution would look like--but the problem doesn't even fit into the other ways of doing things.

    With WebMacro a common implementation strategy is to drop key XML objects into a template that is otherwise created through ordinary WebMacro HTML template gunk.

    The advantage of this approach is that you can create the bread-and-butter stuff like shopping carts, authentication, login/logout, using ordinary Java servlet code and templates. (These things are nasty when you try and force them into a document model).

    Then in the middle of your page somewhere you have your XML document, rendered using XSLT or something. You have other targets, besides your servlet, where you publish that same XML document, so the whole thing winds up being a rather pleasant mixture of two different programming paradigms.

    Again, the key insight in this strategy is that you use XML for the parts of your problem where it is the only viable solution--and you do everything else the normal way (without the extra costs imposed by XML, since you don't need the extra power).

    I worked in an SGML shop for a couple of years, and became smitten with SGML/XML. I set out to do absolutely everything I could in SGML/XML for awhile, before realizing that a traditional template tool (like WebMacro) was far more useful for typical bread and butter servlet programming.

    I still use XML a lot, but now I use it intelligently, where it's needed!

    1. Re:XML by trance9 · · Score: 2


      XML doesn't solve this problem either. Writing a different stylesheet for each browser winds up being just as much work. The key is to get all of that work out of your source code, so that it is independent of the application. You can do that by using a template system.

      The IBM example has multiple sources of documents feeding multiple target formats, where those targets are diverse--not just different forms of HTML, but different media altogether. In those cases XML is a big win.

  12. Re:Yes but surely only for READ-ONLY resultsets. by Augury · · Score: 2

    Hmm, in this case, XML is purely used as a transfer agent, not to hold concurrent data, directly effect writes upon the database.
    I agree that that would be very very ugly, but then, I also don't think that a system should necessarily be trying to provide concurrency on the client side, especially if the client base is expected to be extensive.
    In this case, as you say, record locking and concurrency handling problems would all but preclude the use of anything but the most 'beefy' RDBMS's.

    In my case, perhaps I am lucky in that user interaction is not 'live', but transactional. I just present some output, and wait for the user to respond in whatever way. Once that response comes in, I have a heap of middle-tier business logic handling exactly what we should do with it.
    Record locking and such issues are dealt with at that level, rather than in the backend.

    And yes, I do believe that SQL Server could handle such a solution, coupled with MTS and perhaps using a little DCOM :P

    In any case, transactions can do nothing but help the cause :P

  13. XML and XSLT are the way to go by __donald_ball__ · · Score: 3

    Hiya. I'm one of the authors on the cocoon project and I admit my biases upfront. I think, and many of you seem to agree, that the web publishing industry (more generally, the electronic information publishing industry) is in desperate need of a standard way of seperating (and mixing) content and design. XML (a generic tree description language) and XSLT (a generic tree merging and transformation language) offer a very elegant way of accomlishing that goal. The cocoon project is currently focused mainly on two goals: creating (and implementing) a standard way to create XML fragments dynamically, and determining (and implementing) the best way to maintain a site back-ended by XML and XSLT. I encourage brave developers to come check it out - the basic stuff (XML+XSLT -> HTML) works very well, the more elaborate stuff (SQL,LDAP,POP3 -> XML+XSLT -> HTML) is coming along very well, and we're playing with a very interesting take on the whole *SP paradigm called XSP - I was personally highly skeptical at first but am beginning to see the light.

    As far as IBM's product goes - once you drill down into the technical details, it looks very much like cocoon. Interestingly enough, some of the closed source components that IBM's product relies on were donated a few months back to jump start the xml.apache.org site (namely, the XML4J parser and the Lotus XSLT processor). The main thing that IBM seems to be offering here is its 'transcoder' technology - which may be interesting and certainly bears investigation, but for my money, you're better off checking out (and having a voice in the development of) the open source apache projects.

  14. i'm workin' on it, dammit. by Uberdog · · Score: 3

    xml rocks. every piece of online information should be in xml. usability on the web is horrible right now. the fact that search engines and yahoo-style directories are the main entrances to the web is horrific. the fact that google can't find me a single page on gkrellm (a kick-ass system monitor for linux) pisses me off to no end when i'm bored with my current skin. with everything in xml the extraction of data would be much simpler and therefore the interfaces to the web would be much more effective.

    the current problem is that

    1. lots of people know what xml is, but don't really know what to do with it.
    2. the processing of xml data at this point is very intense. rendering an xml web page (or add in the scaling of images, too, and call it transcoding as ibm does) takes a lot of work on the server side and there's not currently a way for it to be rendered on the client-side (browsers don't support this yet).

    i'm working on a solution and need help...so it's actually pretty smooth that this article came out in ./ at this point.

    in a huge blow to problems #1 and #2 above (as well as quite a few others), i am initiating the creation of Uberbia, the most open source of web sites. the backend is zope, which is a tres cool open source web application environment which can conveniently output its internal data as xml. what this allows is for information to be created in zope and stored in zope's native db format and served up as web pages (for instance) quickly, but then also output as xml. problem #2 solved. and when browsers can handle the xml...shove it out that way.

    zope also allows for information to be very easily created and shared. this is one of the main goals of Uberbia.

    the idea for Uberbia was born out of the fact that the Open Source community has been living in an environment of relatively closed content management on the internet. Sure, one could create a web page and post a HOWTO they just wrote. And then post a message to a relevant mailing list letting everyone know that resource is available. And then submit the HOWTO to the LDP and wait for it to be approved and posted on the LDP page. Uberbia will remove a lot of this hassle and allow the Open Source community to easily create and manage it's content. and the data will go into an xml-aware application. problem #1 solved, at least for the Open Source community. well, okay...so i'm still workin' on it, but it'll get solved, dammit.

    on trying to figure out what i was talking about, Ethan (a friend and to-be-developer of Uberbia) wrote:

    sounds to me like you want to build an open-content information space. am I totally off-base? Bring "source" up to the next level of abstraction? Collaborative environments of information?

    yup. he gets it. but the possibilities that arise from having such a body of contributors and open content in xml are insane. for example, imagine turning on a "newbie" feature in Uberbia that automagically inserted links to the proper entry in the jargon file for every word that was defined there. not difficult with zope and the data in xml

    so, essentially i'm responding to this ask slashdot question by calling out for help with an open source project that wants to solve this problem and others. some work has been done, but there's a lot more to do. sourceforge is graciously both hosting the development of this and hosting the project itself. if you are interested at all in the development of something like this or have some really smooth-ass ideas, let me know or join the mailing list.

    i hope some of that made sense.

    word, Uberdog

  15. IE5 already does XML+XSL by Dacta · · Score: 2

    It isn't too bad, either.

    If no XSL stylesheet is applied then it displays the XML document using a "TreeView" default style sheet.

    Also, because the XML parser & XSL thing is COM based you can use it in any language that supports COM - like Javascript/VBScript/ASP. I hate to be a MS lover, but unless you go to Java there isn't much that can do it better than that.

    The new XML parser that comes with Win2000 is supposed to be 5 times faster, too. See MSDN.

    As far as I know there is no support in IE5 for XML+CSS. I may be wrong, there, though.

  16. But that is true for any web based system by Dacta · · Score: 2

    You couldn't do it with HTML, either, could you?

    Any server that uses stateful connections like that is going to have to be big & powerful.

  17. You're looking at the problem the wrong way by X · · Score: 3

    I think you're not looking at the problem the right way. Typical applcation development breaks things up into domains. These layers usually include a persistence domain (your database), a business logic domain, an application domain, and and a presentation domain.

    XML really doesn't change any of the domains EXCEPT the presentation domain. You don't need an XML enabled DB, as you NEVER want to have the outside world talking directly to your DB. XML (combined with HTTP or whatever else) is one way of presenting your application. The various transforms that you would do using XSL are just "aspects" of the same presentation. So this doesn't completely change the way you build applications, just how you do your presentation.

    I've written more than a few apps that were available both as GUI applications and web servers. Both versions shared the same code base up until the last layer.

    As far what you need to do an XML system, I think it's a lot like an existing HTML system. With HTML, you need a database server, an app server, and a web server for an HTML system. The web server is normally scripting enabled so you can do handy transforms with the raw data.

    With XML, it's basically the same concept, except your "XML server" needs to be using XSL to script transforms of the XML data. What we currently don't have is a very good way of doing this. Ideally you'd actually want the CLIENT to do the transforms as the XML data is usually much terser than whatever the XSL will generate. However, nobody trusts the clients to do this, so you might as well go with the XSL engine on the server.

    --
    sigs are a waste of space
  18. Some examples... by evlist · · Score: 3
    Hi,
    <quote>
    But what tools are available to actually incorporate XML in a system that can do all things we poor webdesigners dream of?
    </quote>

    There are many tools available to build such a system.

    To mention only Open Source projects, I could suggest using Apache JSERV with Apache Cocoon as a framework, Castor or Quick to bind XML data to Java objects and a OODBMS like ozone or a RDBMS like PostgreSQL.

    These are my favorites ;)

    They are very powerful and highly flexible, but the price to pay is that they are rather complex to use, that you need time to get on speed with them and that you loose focus on the core techniques behind them.

    To try to get a good understanding of these core techniques, I have set up some simple examples showing how one can bind XML documents into java objects, store these objects in a OODBMS and use them in a XSLT sheet both in standand alone mode or as a servlet.

    These examples are available on our web at http://downloads.dyomedea.com/java/ and a mailing list has been created to exchange and discuss such basic tips.

    Hope this helps.

    Eric van der Vlist

  19. XML Script by rjb · · Score: 3

    You might like to check out this page. One of the things they have is an interpreter (X-Tract) that reads a template (written in XML!) and performs pretty much arbitrary transformations on XML input data based on this template. Looks pretty cool and simple to use. X-Tract is free for download. Funny I didn't find any info on license terms though.

    I tried doing some very simple stuff with the Linux version, and the only complaints I have are:

    • fetching the input data via HTTP doesn't seem to work (as it should according to the docs)
    • when I tried calling it from a CGI it freaked out, seems that env variables override explicit XML Script commands in the template -- not what one would expect. Fixed it by clearing the environment
    • the docs, though pretty exhaustive, are not very reader-friendly (to me)
  20. XML and MetaHTML by hqm · · Score: 3

    You should take a look at MetaHTML (www.metahtml.com), which is a sort of macro
    like programming designed to emit HTML (it
    was developed before XML was invented). It
    was developed by Brian Fox and myself when
    we had a company called Universal Access (ua.com). MetaHTML
    is superior in some ways to XSL, because it is
    more a general purpose programming language, yet
    it's evaluator does a lot of the work of parsing
    XML syntax expressions. We used to use it
    to do many XML-ish things, such a generate the
    MetaHTML documentation automatically from a
    structured representation in the database.

    MetaHTML has also been under GNU public license since about 1996.

  21. Re:Grr.. by PigleT · · Score: 3

    Well that's unfortunate. A very quick trip straight to the Web Consortium shows their pages on XML straight up, complete with links to the XML FAQ and of course, just what you always wanted, the XML 1.0 Spec. If that's not an adequate definition, read the source for your favourite parser!

    --
    ~Tim
    --
    .|` Clouds cross the black moonlight,
    Rushing on down to the circle of the turn
  22. 10,000 line stylesheets by SurfsUp · · Score: 2

    On our project, we have written XSL to transform our XML data into binary outputs. The stylesheets used ran into tens of thousands of lines!

    This is supposed to be good? Something is horribly broken. Perhaps a different tool would be more appropriate? How about a parser generator? (see Jikes)

    --
    Life's a bitch but somebody's gotta do it.
  23. We already do this. Our website is live. by evilandi · · Score: 4

    I work for AssureSoft whose AssureWeb website is live (work out the URL for yourself, it's not obscure but we don't want to be slashdotted). The site provides financial information to subscribers. You have to have a username and password to get the full range of services- we dole out passwords free to British independent financial advisors.

    Our first XML-based service is a quotations system which allows users to get a quote for a pension or mortgage from a wide range of companies in real time (typically 5-20 secs).

    Why we needed XML

    Our problem was that each company had a slightly different way of asking for customer details. We decided to create an XML data type definition, now adpoted as industry standard by UK financial standards body Origo. This standard means that we can present pretty much the same input form, with a few optional extras, for any financial product.

    The main use of XML is in passing the input data from our web server to the companies' quotes servers.

    Layer 1: Client Browser
    Layer 2: AssureWeb server
    Layer 3: Company Quotes server

    The XML goes back and forth between layers 2 and 3. We compile standard CGI GET/POST client requests into XML on the webserver and fire them at the quotes server. The quotes server fires back a response as XML again, and we parse this and present it to the client as a standard HTML web page. There is no XML on the client side.

    Provided the company quotes server conforms to our XML standard, we can use that server for quotes. Adding new products or companies becomes a lot easier- typically we can go from scratch to beta with a new product within days. Previously it would have taken many months to write and test each individual product. XML allows us to re-use both code and input/output standards to a level never seen before.

    Our next step will be a comparative quotes service. Users will be able to enter one set of data, and fire it at multiple companies. They will then get back multiple quotations, from which they can select the best based on their criteria. Effectively we will be having multiple concurrent layer 3 transactions.

    --

    --
    Andrew Oakley - www.aoakley.com
  24. We already do this. Our website is live. by evilandi · · Score: 2

    I work for AssureSoft whose AssureWeb website is live (work out the URL for yourself, it's not obscure but we don't want to be slashdotted). The site provides financial information to subscribers. You have to have a username and password to get the full range of services- we dole out passwords free to British independent financial advisors.

    Our first XML-based service is a quotations system which allows users to get a quote for a pension or mortgage from a wide range of companies in real time (typically 5-20 secs).

    Why we needed XML

    Our problem was that each company had a slightly different way of asking for customer details. We decided to create an XML data type definition, now adpoted as industry standard by UK financial standards body Origo. This standard means that we can present pretty much the same input form, with a few optional extras, for any financial product.

    The main use of XML is in passing the input data from our web server to the companies' quotes servers.

    Layer 1: Client Browser
    Layer 2: AssureWeb server
    Layer 3: Company Quotes server

    The XML goes back and forth between layers 2 and 3. We compile standard CGI GET/POST client requests into XML on the webserver and fire them at the quotes server. The quotes server fires back a response as XML again, and we parse this and present it to the client as a standard HTML web page. There is no XML on the client side.

    Provided the company quotes server conforms to our XML standard, we can use that server for quotes. Adding new products or companies becomes a lot easier- typically we can go from scratch to beta with a new product within days. Previously it would have taken many months to write and test each individual product. XML allows us to re-use both code and input/output standards to a level never seen before.

    Our next step will be a comparative quotes service. Users will be able to enter one set of data, and fire it at multiple companies. They will then get back multiple quotations, from which they can select the best based on their criteria. Effectively we will be having multiple concurrent layer 3 transactions.

    --

    --
    Andrew Oakley - www.aoakley.com
  25. IE5 XSLT is not standard. by dwalsh · · Score: 2

    IE5 XSLT is very different from the W3C recommendation. It is a partial implementation of a 1998 working draft.
    Do not assume this to be a case of embrace & extend. Microsoft just implemented XSL before the spec was finalised. They say they will bring out a compliant version soon.

    --
    ${YEAR+1} is going to be the year of Linux on the desktop!
  26. A small warning... by tgd · · Score: 3

    A small warning for those thinking about moving down the XML/XSL route who haven't done any testing on it:

    Its slow. VERY slow.

    Most XSL implementations have significant performance and scalability issues as compared to more common custom technology for producing dynamic web pages.

    There's no argument that its a better technology, but I've known several commercial web sites that have spent considerable resources developing XML/XSL implementations and having to roll back the technology when they discovered they needed four or five times the number of servers to be able to use it.

    Anyone know of any top-tier sites that are actually using the technology?

  27. NNTP? by Matts · · Score: 2

    If someone wants that they can either use NNTP or develop it using the current mod_perl + HTML route - there's no need for XML there.

    XML should be used where its appropriate. I'm unconvinced that client-side transformations are the right thing.

    --

    Matt. Want XML + Apache + Stylesheets? Get AxKit.
  28. Why? by Matts · · Score: 4

    Mind if I ask why you're doing this? XML parsers are off-the-shelf free commodity tools now.

    Spend your time working with those tools (XML4C, expat, rxp to name a few) to create higher level tools. Don't re-implement an XML parser - I can guarantee you it will be full of obscure bugs where you didn't understand the spec, didn't understand how to cope with character encodings, or just did something wrong. This stuff, despite the XML spec suggesting that a graduate could write a parser in a matter of weeks, is hard, and experienced people (such as James Clark) have put out excellent products for all to use under non-restrictive licences. Theres even an LGPL parser already out there called libxml (ships with gnome).

    If you don't believe you'll create a broken parser, see the recent XML conformance tests on XML.com.

    I'd also love to see you move from a non-working XML parser to something supporting XSL "in the near future". I appreciate your enthusiasm, but the XPath spec has some tough little nuts to crack (I know - I'm cracking them right now) and then implementing XSLT from an 80-odd page spec - wow - good luck to you!

    (I'm not trying to poo-poo your project, but so many people start working on stuff that's already being worked on in the open-source community that it's just wasted effort).

    --

    Matt. Want XML + Apache + Stylesheets? Get AxKit.
    1. Re:Why? by Matts · · Score: 2

      Download expat. Download the C++ bindings for it. It supports any XML format you want to write code for, has full (and correct) namespaces support, and is very much free software.

      If that doesn't bake yer noodles, download rxp which also does validation against a dtd.

      Really, work on providing XPath and XSL support for expat - the community will thank you _much_ more for it.

      --

      Matt. Want XML + Apache + Stylesheets? Get AxKit.
  29. XML applications *do* exist by wfrp01 · · Score: 2

    XML is one of these words everybody's talking about yet no-one really knows how to use it in specific applications or server technologies

    I disagree. Check out the W3C's SVG standard. This is for real.

    If you've ever had to muck about with all of the different proprietary flavors of vector graphics formats, you know what a great thing this will be.

    That said, I personally *don't* believe in across-the-board XML standardization panacea. Some things deserve standardization, others don't.

    Accountants all adhere to accepted standard accounting practices. This is what makes it possible to encapsulate their work into shrink-wrapped database products that pretty much any accountant can use. But this only works because the process is so well known.

    So I disagree vehemently that business-to-business transactions, for example, are ripe for XML standardization. Why? Because who the heck is such an expert on these kinds of transactions to be telling everyone else how to do it? There's a lot of trial-and-error to go through before anyone should start proposing standards.

    And remember: "You can't vote for anarchy". ;~)

    --

    --Lawrence Lessig for Congress!
  30. 10,000 line sheets necessary due to dumb syntax by rambone · · Score: 2
    XSL provides no features for reuse or modularity, so any useful stylesheet ends up being monstrous.

    For someone who uses a language like Python or Java, I can't imagine why they would find anything compelling about XSL. It really is a dog language. Most people are just too ga-ga over the fact that it is encoded in XML to see how lame it really is.

    Thankfully, few people are rallying behind it.

  31. XML Summary and History -- Comments on Transcoding by EverCode · · Score: 2

    I have written an article that will help you XML-newbies get up to speed on the idea of XML and some of the sub-specs. The Promise of XML.

    I believe eventually we are going to get to a point where server-side transcoding will not be necessary. However, this will be several years, and we are going to have to learn how to do all of this efficiently.

    I am even developing my own transcoding software process because I belive I have a better method of doing it than what is currently available. If and when I do succeed it will be closed-source because I want to make money off of my product, not just give away all my hard work.

    Anyway, the next few years are going to be very interesting.

    E

    --

    EverCode
  32. XML and Transcoding - How IBM would do it by Ralph+Case · · Score: 2

    The decomposition into three system elements (XML content source, Transcoding gateway, and browser) makes a lot of sense. That way the content source can focus on what it does - deliver content - and the transcoding gateway can handle the customizing the content for presentation on whatever device is making the request. The IBM Transcoding Technology (see http://www.ibm.com/software/secureway/transcoder/) is an example of a tool for building the transcoding gateway. You can download and try the beta code now. There are additional notes at this web site about other tools that may be useful in developing this kind of application. There is a short write-up on XSL at http://www.ibm.com/software/secureway/transcoder/x sl.html.

    As you hinted in your note, it can sometimes be a challenge to select the best stylesheet to apply to a given XML document. The gateway may want to choose a stylesheet based on the source document and the destination browser or device. In addition, different stylesheets may be better suited to specific user preferences or network connections. The IBM transcoding technology includes a way to select the "best" stylesheet to apply in a given situation.

    The Transcoding technology can also adapt content other than XML for different clients. HTML requires special processing because you can't apply stylesheets to directly since it's not well formed. Images also require special handling to adapt them for the destination device. The whole transcoding gateway may be a separate component, installed as an HTTP proxy, or it may be configured as a servlet on the same server that is the content source.

  33. We have it in production by Morendil · · Score: 2

    MultiMania's site has most of its content stored in XML. The main HTTP servers are Apache+PHP; we have a JVM running the SAXON stylesheet processor, and a MySQL database with "glue" data, telling the system which XSL stylesheet to apply to wich XML document to generate which HTML page. Some neat hacks and some smart caching even let us deliver 'semi-dynamic' pages - content stored as XML, interpreted as PHP on delivery.

    XML rocks. You don't need to stuff your head full of theoretical debates about namespaces, general entities, etc. All you need is vi (or Notepad) and Saxon. To learn XML syntax, just write XML files by hand and feed them to SAXON until it no longer reports XML errors. To learn XSL, just write XSL files until you get SAXON to actually spit out some HTML. Lots of examples are available to accelerate the trial and error process.

    When you are finally ready to integrate the whole shebang into actual applications, there are tons of open-source tools to choose from. Look at the list above again - Apache,PHP,MySQL,SAXON - cost zero - this combo drives one of France's most popular Websites.

  34. Re:The Road Ahead --- and some pitfalls. by Pfhreakaz0id · · Score: 2

    You can connection pool with SQL server 7, btw. Also, M$ says COM+ will be available "A month or so" after Win2K