Slashdot Mirror


WYSIWYG Editor for DocBook DTD Content?

Saqib Ali asks: "This week I saw a demo of the Tagless Editor by i4i. The editor is a plugin to Microsoft Word, which can be used to create XML based content. The plugin can handle various custom DTDs. However it can not properly handle the DocBook DTD. I was wondering if there is any WYSIWYG XML editor that can be used to edit DocBook DTD based content? Any ideas?"

30 of 51 comments (clear)

  1. Conglomerate, maybe? by DeadMoose · · Score: 4, Informative

    I was looking for the same thing not too long ago, and came across Conglomerate, which despite its web page, is no longer dead, and back under development.

    I've had a few problems getting it compiled/running well, but from what I've seen, it looks like it's a fairly decent bit of code, so once it gets some polish, it could be pretty handy.

    1. Re:Conglomerate, maybe? by DeadMoose · · Score: 3, Interesting

      Where's some info to show that it's active again? I might like to get involved.

      Well first off, the page has finally been updated since I last looked at it (looks like just today actually), but even with that, it's out-of-date & misleading.

      There's been a decent amount of activity on the developers' mailing list however, so you should check out the archives/subscribe to that.

  2. Re:not necessary by DeadMoose · · Score: 4, Insightful

    One of the great things about XML is that it's human-readable and -writeable. You don't need a WYSYWIG editor, because you can just edit it directly. Once again, we find that open standards are superior to the proprietary alternatives pushed by the likes of Microsoft and Sun.

    Except for the fact that it's really convenient to not have to edit the raw XML. Yes I can, but I really don't want to; I'd much rather save myself that agrivation and just write what I'm trying to write.

    It's just like a lot of other things that people go "well, you can do it manually, so why do you need a fancy tool?" about. Yes, I can manually patch/build/blahblah my own kernel. Do I want to? No, I want to just have things work. Same story here: do I want to sit here remembering what all the docbook tags are? No. Did I want to learn them in the first place? No. There's a time and a place for good tools to make tedious tasks easier.

    Just because I could edit a file and manually set all the colors for all the pixels, doesn't mean there's no need for Photoshop/The Gimp.

  3. have you tried by Hubert_Shrump · · Score: 5, Informative

    LyX? I know it's not a true WYSIWYG, but it does have a DocBook mode. I haven't tried it in awhile (went back to xemacs), but it might have all sorts of new goodies.

    --
    Keep your packets off my GNU/Girlfriend!
  4. Docbook and WYSIWYG... by Papineau · · Score: 3, Insightful

    The only problem I see is that Docbook doesn't have a visual representation, it has many, depending on the backend you want (HTML, PDF, PS, TXT, etc.). So a WYSIWYG editor would only show one type of representation.

    Besides, Docbook (as many other document formats) is meant to separate the visual from the info. Linking the visual to the edition would only make people try to make it present the info in (what they beleive to be) pretty layout, when Docbook's goal is to concentrate on the structure of the document, which the backend then translates to HTML tags, or PDF fonts and layout, etc.

    1. Re:Docbook and WYSIWYG... by booch · · Score: 3, Interesting
      When talking about semantic structured documents (like DocBook) it's true that WYSIWYG really doesn't have any meaning. But when someone asks for a WYSIWYG editor for such a document type, they really want something that shows a representation of the semantics. In other words, the tags themselves don't appear. Instead, the tags are used to apply (user-definable) stylesheets (usually CSS) to the document as you type. So you still have to deal with the underlying XML structure and tags, explicitly putting the tags where you want them. It's just that you get to deal with the tags as if they were styles in a word processor. So I wouldn't ever make something bold. Instead, I'd tag it as a new term, or emphasis, or a command, or whatever, and let the stylesheets make it bold, or bold and green, or a different font or background -- whatever helps me make the distinction in my head without having to specifically think about the tags.

      Believe me, it's much easier to edit large amounts of DocBook without having all the tags mixed in. To me, it actually keeps the content separate from the presentation -- the tags are really just explaining how to format things, by giving things semantic meanings. Removing the tags from view lets me concentrate on the content. But the tags are still visible enough that I can add them when I change semantic contexts. (Typing in all the content and adding tags later tends to take almost twice as long.)

      See my post on Morphon for a list of programs that support this way of editing XML.

      --
      Software sucks. Open Source sucks less.
  5. Structured Editor by Crutcher · · Score: 5, Insightful

    I agree, you don't really want a WYSIWYG editor for docbook, as that violates the Information/Representation sepperation.

    But what would be _really_ useful would be a structured editor, which provided a good mapping at several levels (ala Mozilla's composer, where I can turn on and off tabs). The point of such an editor's representation would not be for final production, but to unambigously display the information's _structure_ to the user, and to facilitate manipulating that structure.

    It would be like a very ugly word processor, where the tables would always have borders, etc.

    If that editor was then linked with a set of generation tools, to make it EASY to genearte and view PostScript, HTML, XHTML, etc. productions, then you'd really have something. I'd use it, for sure, and I don't mind using docbook tags now.

    In fact, would mozilla's composer be a good place to start with a docbook editor? Mozilla has good DOM tools, like the inspector, which loads XML DocBook files just fine.

    --

    -- Crutcher --
    #include <disclaimer.h>
  6. XMLMind by jdurham · · Score: 2, Informative

    http://www.xmlmind.com/xmleditor/ Not sure what the license is.

  7. Re:not necessary by GigsVT · · Score: 2, Insightful

    You are missing the point, the whole point is to seperate the content from presentation. WYSIWYG is totally contrary to that. I'd go as far as to say that it's entirely impossible to have such a Docbook editor, since you have pretty much no say in the presentation when you write it. It's the same with any good implementation of XML.

    --
    I've had enough abrasive sigs. Kittens are cute and fuzzy.
  8. Re:not necessary by Twirlip+of+the+Mists · · Score: 4, Insightful

    You are missing the point, the whole point is to seperate the content from presentation.

    No offense intended, but I think it is you who is missing the point. The average person doesn't think in XML. The average person thinks in formatted text. When I'm writing, I don't think "<ital>foo</ital>," I think "foo." Being able to see foo on the screen, but get "<ital>foo</ital>" in the output is a big win.

    I'd go as far as to say that it's entirely impossible to have such a Docbook editor, since you have pretty much no say in the presentation when you write it.

    Oh, sure you can. The editor doesn't need to display it the exact same way the end-user will display it; it just needs to display it in a way that makes the user feel comfortable. On your machine, chapter titles are 24 point serif type; on mine, they're 18 point italic sans-serif type. I don't care that yours are different from mine; I only care that mine look the way I expect them to.

    --

    I write in my journal
  9. Re:not necessary by GigsVT · · Score: 4, Informative

    The presentation is more than just typefaces and formatting, in docbook it goes as far as what gets put on what HTML page, and how they are linked together, or even if some structures are omitted from the presentation.

    You have one docbook file that get broken up into make multiple HTML files, or a single TXT file, or whatever.

    You designate sections/chapters/whatever, and the presentation decides how to parse those. The point is that only the structure of the document is defined, nothing is assumed about presentation. Nothing at all!

    A tool could be developed to assist this, by basically making sure you don't nest improperly, or use tags in an invalid context, or even give you a hint about what some things will look like on the final output using a common style sheet, but ultimately there is no way to approach even close to WYSIWYG.

    And no offense intended likewise, but the "average person" shouldn't be trying to write XML documents if they can't understand the concept of seperating content from presentation.

    --
    I've had enough abrasive sigs. Kittens are cute and fuzzy.
  10. Re:not necessary by JohnFluxx · · Score: 2

    It isn't a WYSIWYG editor if it isn't displayed the same way the end user will display it. And so you can't have a WYSIWYG editor for DocBook.

  11. XMetal by Ovidius · · Score: 2, Informative

    If buying it isn't a problem, XMetal works pretty well for DocBook. You can view your document as a tree (i.e., structured); with minimal block formatting and visible tags; as something like WYSIWYG, formatted with a CSS stylesheet; and, preview it as HTML in browser (IE for sure -- I don't know if you could get it to use Mozilla) formatted with an XSL stylesheet. The new version adds PDF preview which I assume is done through XSL-FO, but I haven't used it.

    It's not the fastest or smoothest editor to use, but it does a good job of balancing the spirit of XML with the niceness of seeing formatted text as you work.

  12. Re:not necessary by JohnFluxx · · Score: 2

    But docbook has no display info at all! At least with HTML it is fairly close. With docbook you don't even know if it will have a contents page, etc.

    And who said anything about editing the raw xml? I'm just saying what he is asking for is just a GUI editor of some kind - not a WYSIWYG. - See lyx for example.

  13. Re:not necessary by Twirlip+of+the+Mists · · Score: 3, Interesting

    ultimately there is no way to approach even close to WYSIWYG

    I think you need to go take a look at FrameMaker. It predates all this new-fangled XML hoo-hah; its native format is SGML. It is entirely WYSIWYG. Your point is thus demonstrably false.

    And no offense intended likewise, but the "average person" shouldn't be trying to write XML documents if they can't understand the concept of seperating content from presentation.

    Ah, the ugly face of snobbery and elitism raises its ugly head once again. Thanks for the input, GigsVT; since I'm obviously not wanted here, I'll just go back to using Microsoft Word for my documentation. You XML folks have fun playing in your little sandbox all by yourselves.

    --

    I write in my journal
  14. Re:not necessary by King+of+the+World · · Score: 2, Interesting
    I think you need to go take a look at FrameMaker. It predates all this new-fangled XML hoo-hah; its native format is SGML. It is entirely WYSIWYG. Your point is thus demonstrably false.
    When displaying a docbook file tell me what font I should use, or where I should get this font information from?

    Then how can what I see be what's actually encoded in the file?

  15. Re:not necessary by Twirlip+of+the+Mists · · Score: 3, Funny

    When displaying a docbook file tell me what font I should use, or where I should get this font information from?

    Sensible default, with user-configurable options.

    Then how can what I see be what's actually encoded in the file?

    Magic.

    Sometimes I think people just don't get it. It saddens me... but only a little, because I soon get pulled back into my real life where this "docbook" thing pales in comparison to whether table 11 got their watercress soup or not.

    --

    I write in my journal
  16. Re:not necessary by sporktoast · · Score: 4, Insightful

    When I'm writing, I don't think "<ital>foo</ital>," I think " foo."

    Ok then. Why should foo be italicized? Is it important? Is it an example some something? A variable? A citation? A proper name? A bit of sarcasm? Maybe you just *like* the way italics look and don't really mean anything more by it. Then, perhaps "<ital>foo</ital>," is something of a win. But if "foo" means more than that, then "<ital>foo</ital>," actually loses that meaning.

    Oh, sure you can. The editor doesn't need to display it the exact same way the end-user will display it; it just needs to display it in a way that makes the user feel comfortable. On your machine, chapter titles are 24 point serif type; on mine, they're 18 point italic sans-serif type. I don't care that yours are different from mine; I only care that mine look the way I expect them to.
    But that's not really WYSIWYG then, is it? (Unless you want to get all semantic about the Ys in the acronym.) The thing is, if you lobotomize the meta-data so that what you create as a <chapter title> is encoded as <ital>, then you are pretty much forcing the hand of my machine. Your "big win" becomes my loss. Sure, I can remap <ital> so that it renders as 24 point serif, but then everything else that you've flattened down to <ital> for the sake of how it looks (footnotes, dates, marginal commentary, etc.) gets the 24 point serif treatment, too.

    WYSIWYG for DocBook? You're gonna need a bigger wrench if you want to pound in that screw.

    --
    In a related story, the IRS has recently ruled that the cost of Windows upgrades can NOT be deducted as a gambling loss.
  17. Re:not necessary by King+of+the+World · · Score: 2
    you hard-headed nitwit,
    Oh grow up you brat.
    Yet WYSIWYG HTML editors have been around for nearly a decade now. In HTML one encloses a paragraph in "P" tags, and the browser handles the rest. WYSIWYG HTML editors pick a sensible default (this sounds familiar) and let the user change things around in the preferences (where have I heard this before?).
    1. HTML "wysiwyg" editors were lying. Everyone knows this. If I call a horse an HTML WYSIWYG editor it doesn't mean that there's ever been one.

    2. HTML with CSS has some style information, and anyone can define entirely physical real-world measurements. With newer editors such as VS.NET and Dreamweaver MX it's possible (although usually not desirable) to produce onscreen WYSIWYG content that prints the same.

    Docbook however can never be WYSIWYG without additional style information.

  18. Re:not necessary by Twirlip+of+the+Mists · · Score: 2

    Why should foo be italicized?

    That was just an example, and maybe, in retrospect, not the best one. Yes, in a real case the tag wouldn't be <ital> but rather <cite> or <xref> or some such. Same idea applies. I would want an editor to show me foo instead of <cite>foo</cite>, using whatever the appropriate default rendering style is for <cite>.

    --

    I write in my journal
  19. Re:not necessary by King+of+the+World · · Score: 2
    Let's just agree that we're talking about a program that shows formatted text instead of XML tags, and that allows the user to apply XML tags using buttons or mouse gestures or speech recognition or synchronized farts or some other method that doesn't involve typing the fucking things out.
    Now that's comedy.
  20. There is a commercial product to avoid.. by molo · · Score: 3, Informative

    We use a commercial product to do this at work. It is called Epic, and is available from Arbortext. We've had some real problems with it though, so I think our tech writers are moving to a plain ascii editor. I can't recommend it, but I thought I would provide a data point.

    Good luck.

    -molo

    --
    Using your sig line to advertise for friends is lame.
  21. Re:not necessary by Twylite · · Score: 2
    But if "foo" means more than that, then "<ital>foo</ital>," actually loses that meaning.

    No, it only "loses" that meaning if you want to parse it. While that may be useful for certain computer applications, or if you need to reformat the text into another style, ultimately the message conveyed (visually) to the reader is the same.

    --
    i-name =twylite [http://public.xdi.org/=twylite], see idcommons.net
  22. Re:Buzzword Stew makes me sick by SwellJoe · · Score: 2

    English is for secret agents and English teachers.

    Nerds on the other hand know what a DTD is. Nerds that don't know, know what Google does. "DTD" aint a buzzword, it's a technical abbreviation in common use among nerds.

  23. Re:jEdit support DocBook by Big+Sean+O · · Score: 3, Informative
    Yep, jEdit is a total pig. It's slow, takes forever to load, and is quite simply the best free XML editor I've ever used. I know it's not magic, but it does the following really well:
    • When I start a tag, it opens a contextural menu that displays all the valid tags.
    • As I type the name of the tag, the list changes until I get the unique tag I want.
    • When I type a close-tag, it fills in the close-tag information.
    • It's XML validator does a pretty good job and has the sensible trait of stopping after listing 100 errors.
    • Poorly-formed and invalid XML is redlined, like a spelling mistake.


    I don't think it's productive to get into a match about XML/WYSIWIG. I like structured XML editors (like XML Spy). They definitely have their place, I especially like the table format. But for 60% of my XML-editing needs I'll be quite happy with a text editor with syntax coloring (ahh, BBEdit).

    I guess it's all about the tools. The right tool in the right place. And for me, when I'm writing a DocBook, it's jEdit. It's not perfect, but it's the best tool I got.
    --
    My father is a blogger.
  24. OpenOffice by Matts · · Score: 3, Informative

    Try OpenOffice. My company sells an XSLT based filter that will turn OpenOffice documents (if using sensible styles) into DocBook XML. You may have to tweak it a bit to get exactly what you desire, but that's going to be the case with any tool.

    --

    Matt. Want XML + Apache + Stylesheets? Get AxKit.
  25. Re:not necessary by Twirlip+of+the+Mists · · Score: 2

    I would consider that it is not beyond the wit of even "the 'average person'" to understand the concept of separating content from presentation

    The concept? No. But expecting a technical writer to wade through a sea of angle-brackets to produce his documentation is laughable at best.

    The vast majority of technical documentation in the world is written by technical writers, not programmers. (I played around as one myself for a while.) That's why the majority of documentation is written with user-friendly tools like Word and FrameMaker. Programmers, of course, will choose things like XML because... I don't know, maybe it makes them feel superior or something. Couldn't say why. It certainly doesn't offer any useful benefits over Frame.

    Like so many other "open source" technologies, "Docbook" is irrelevant to most of the world because it's more trouble to use than its worth. Why do you think the submitter is looking for a WYSIWYG tool?

    Oh, and elitism is telling someone, "the "average person" shouldn't be trying to write XML documents if they can't understand the concept."

    --

    I write in my journal
  26. Re:not necessary by Twirlip+of+the+Mists · · Score: 2

    You strike me as someone who has never even written a docbook.

    Once. Well, half of once. One day in a fit of blinding rage, I called up Adobe and moved the whole damn thing over to FrameMaker and SGML the next morning.

    --

    I write in my journal
  27. Re:not necessary by King+of+the+World · · Score: 2
    WYSWIYG doesn't just mean any graphic editor where when you make a table you see a table. If you throw the term around like that then it loses all meaning. There are other, better, names such as the WYSIWYM mode of Lyx and Conglomerate. But to call it wysiwyg you've got to at least have a canvas!

    I think we're in agreement that you shouldn't expect people to bother with XML tags themselves.

    All authors I have found have loved to express structure through their editor (be in styles and headings in MSWord, or their software of choice) that we can then later map to Docbook or TEI elements. Structure helps them edit as they can see how many chapters at a glance, and drag around branches of the document tree easier than cutting and pasting. They can know that 'MSWord styles' will be converted to a certain visual output and that unlike the past they won't have to trust the editor - who often doesn't understand the content - to style it correctly.

  28. Morphon by booch · · Score: 2
    The canonical list of DocBook editors is here. The best program I've seen for editing DocBook is Morphon. It does a really good job of styling tags with CSS in real-time so that you can edit the document with various tags, but see the output in a WYSIWYG-like way. There's also a tree view. Another good program is XMLmind's XXE XML Editor. Both are Java apps, so will work cross-platform. They both come with good DocBook configurations, and are primarily used for DocBook. They've got free evaluation copies, and are reasonably priced at $100-$200.

    I also looked at ArborText and FrameMaker. They claimed to support DocBook, but they supply config files only for (much) older DocBook versions. I found the out-of-the-box support for docBook to be sorely lacking. It looked like it was possible to configure them for better support, but it would have taken many hours to do so.

    XML Spy and XMetaL looked pretty good. I don't remember how well they did with DocBook, but they are geared more for data-oriented XML, whereas Morphon and XXE are more suited for document-oriented XML, such as DocBook.

    --
    Software sucks. Open Source sucks less.