Slashdot Mirror


Opera CTO Hits Back at Microsoft's Standards Push

Michael writes "Opera CTO Håkon Wium Lie hit back today at Microsoft's push to fast track Office Open XML into an ISO standard, in a blistering article on CNET. He also took a swipe at Open Document Format: 'I'm no fan of either specification. Both are basically memory dumps with angle brackets around them. If forced to choose one, I'd pick the 700-page specification (ODF) over the 6,000-page specification (OOXML). But I think there is a better way.' The better way being the existing universally understood standards of HTML and CSS. Putting this to the test, Håkon has published a book using HTML and CSS."

20 of 246 comments (clear)

  1. Re:Is it mature enough? by willy_me · · Score: 5, Informative

    I'm a latex junkie. Latex though is a PITA to create templates and styles for. Someone willing to take up the task to modernize latex or completely replace it?
    Done. It's called ConTeXt.
  2. Re:huh? by 8-bitDesigner · · Score: 5, Informative

    Actually one of the highlights of the CSS spec is support for non-standard display types, such as screen readers, projectors, PDA, and yes, print. CSS is a rather brilliant standard, but since W3C hasn't really seen fit to publish a reference platform for it, there's no real compliance checking in the major browers.

  3. Re:CSS for Documents? by Coryoth · · Score: 3, Informative

    Ever since I started using word processors (which for me was a long time after I started using web browsers), i've always thought, why doesn't updating this style make all text with that style update? Why do I always have to change the same thing over and over again?

    Such things exist. TeX provides a decent the base for such things, so it's a matter of finding a TeX centric editor. LyX would be a good example, and indeed it has the sort of functionality and general approach to document creation that you seem to be after. Of course it doesn't necessarily have all the other features that other word processors might have (like mail merge or what have you).
  4. Re:Is it mature enough? by indiechild · · Score: 4, Informative

    Tables are not obsolete. Tables are still used for tabular data, which is what they were originally intended to be used for, and that has not changed.

    Tables shouldn't be used for page layout -- that's what CSS is for. It's as simple as that.

  5. Re:fsck'n ugly by Anonymous Coward · · Score: 3, Informative

    You're entirely right. Word/OOo aren't used for pro typesetting and page layout. But if we exclude that, then we still have many, many other formats, like RTF too (or why not even BBCode while we're at it?). Yes it's quite ugly, but I don't see (x)html + css as being the answer either:

    -too many versions of html (4, and perhaps 5 soon) and xhtml (1.0, 1.1, strict, transitional, etc)
    -different versions of CSS, browser support for it varies quite a bit (and is pretty much non-existent for CSS3)
    -too many rendering engines, css hacks required so the content displays the same in most of them, etc
    -html/css sucks at MANY things - how about a self-updating TOC? (don't even try to say some javascript parsing the DOM for header tags with certain IDs to generate it dynamically!) Hell, how can you even tell the page numbers in a html "document" anyways?
    -while word/OOo formats aren't real typesetting (like InDesign CS2 would do), at least they have half-way decent typography. Yeah, no fancy glyphs or super precise kerning, but it's still usable. On the web there's only a handful of "just OK" fonts one can use (unless everything is rendered server-side as images).
    -if people use html/css, there would basically be no standards *at all* or anything even resembling it (much like anything we see on the web). And I'm not sure the W3C is really going to help much here... Not that their recommendations are implemented very quickly (so many nice standards, but with basically no support e.g. xforms). And I'm not sure they're really being too helpful anymore either - more like slow and misguided IMO.

    At least with the new formats you're starting fresh, with the chance to have most features (like a Table of Content), and have them implemented properly. Mind you I'm not saying the new word/OOo XML formats are perfect - nor even the answer to the problem in the first place...

    And yeah, it's not like (x)html has angle brackets either ;)

    Looks to me like Opera has only one tool: a hammer (or is that a web browser?) and everything is strangely starting to look an awful lot like a nail?

  6. Re:fsck'n ugly by EvanED · · Score: 5, Informative

    html/css sucks at MANY things - how about a self-updating TOC? (don't even try to say some javascript parsing the DOM for header tags with certain IDs to generate it dynamically!)

    This would have to be done by the tool displaying it, same as a self-updating TOC in a Word or OpenOffice Writer document. The information is present in a correctly-structured HTML document in the form of Hx tags.

    Hell, how can you even tell the page numbers in a html "document" anyways?

    The same way you would in a Word document. It doesn't make sense if you're looking at it as a web page in your browser, but if your editor used HTML it would work the same way. (This also partially alleviates the rendering issues.)

  7. Re:Is there a FOSS way to make PDF from XHTML/CSS? by vtrac · · Score: 1, Informative

    I don't know if there's an automated way, especially because you run into the problem of differences in rendering. But, if you are on Linux, just install CUPS-pdf or on Windows, use PDFCreator (http://sourceforge.net/projects/pdfcreator/). Both are print drivers so you can use the HTML/CSS rendering engine of your choice (pick a browser), then print.

  8. Re:Is it mature enough? by EvanED · · Score: 2, Informative

    File | Export to Web? Am I missing something here?

    Yes, the fact that he used a program called Prince to generate a reasonably professional-looking "book". Not "printed web page". Book.

    Funny, he didn't mention that he "wrote" the book in HTML, just that he "published" it in HTML.

    "It is now possible, even feasible, to use HTML as the document format for books." (Granted, that's two links off the /. summary.) But "To prove how powerful it can be, the authors decided to use CSS in the production process" is following only one link.

    That PDF posted above was generated entirely from an HTML + CSS document.

  9. fonts by cybpunks3 · · Score: 2, Informative

    The problem with using HTML for publishing is that to this day there is no viable downloadable font system. So you are limited to a lowest-common-denominator list of 2-3 fonts like verdana and new times roman. With Flash and PDF you can do a lot more, but obviously authoring becomes a problem.

  10. Re:CSS for Documents? by Haeleth · · Score: 2, Informative

    Latex: its not that hard to learn.
    But it is tricky to use for any language other than English. Out of the box, it's English or nothing. Other European languages are complicated; more complex languages like Arabic, Hindi, or Chinese require some very involved hacks indeed.

    It can be done, some of the time, but it's very, very easy to mess up. I have tried numerous times to get Japanese support, using one of the several special Japanese versions that exist (it seems it simply can't be done with standard TeX), and only once did I manage to generate a DVI - which I was unable to convert to a usable format, because doing so always stripped out all Japanese text, for some reason I never managed to fathom.

    And this is all fair enough, because TeX was written to scratch Knuth's itch, and therefore it does what Knuth needed very well: it's brilliant for typesetting English and mathematics. Unfortunately that doesn't make it the solution to all the world's typesetting problems.

    I hate to say it, but "inferior" products like MS Word, OpenOffice.org, etc. have supported Arabic, Hindi, Chinese, and Japanese perfectly for as long as I can remember. Largely because they use Unicode internally, rather than one of the numerous inadequate and non-standard encodings that TeX and its derivatives rely on.

    To be fair, there's a Unicode version of TeX called Omega or some such. I'd doubtless have found it very useful if I'd ever managed to get it to work at all.
  11. I wrote my thesis book this way by Rudd-O · · Score: 3, Informative

    And it worked out great.

    http://software-libre.rudd-o.com/

    Used MediaWiki to write the chapters, wrote a small python proggie (available there) to consolidate the wiki into a single HTML file (mostly conforming to the Boom! microformat), then used Prince and Hakom's book CSS to generate the PDF.

    Great typesetting, collaborative book editing, screw LaTeX!

    Hakom was right.

    --
    Rudd-O - http://rudd-o.com/
  12. Re:Is it mature enough? by Anonymous Coward · · Score: 2, Informative

    If you want to stay in Latex use the memoir document class.

  13. Re:How come? by PCM2 · · Score: 4, Informative

    So I'd ask Håkon, "how come?" :)

    Since nobody gets it, I'll spoil it: That's how Håkon advises people to pronounce his name. It's even on his business card.

    --
    Breakfast served all day!
  14. Re:Is there a FOSS way to make PDF from XHTML/CSS? by smoker2 · · Score: 2, Informative
    CSS not withstanding, you can use HTMLDOC to produce PDFs from html pages. If you are creating reports etc dynamically anyway, just create a temporary html file and convert it through HTMLDOC. I use Perl to generate reports and interface with HTMLDOC, but YMMV.
    An example of the HTMLDOC specific code used in the conversion :

    # Run HTMLDOC to provide the PDF file to the user...
    system "htmldoc --continuous --browserwidth 800 --landscape --size A4 --header ... --left 1in --embedfonts -f $fileref.pdf $filename";
    (the command is all on one line)This is running on RHEL 3 through Apache 2 and Perl 5.8.0
  15. Um... NO by salesgeek · · Score: 4, Informative

    ODF is not about web pages or word processing. It's a standard for office documents including spreadsheets, presentation and word processing. That's a big difference from what Opera's CTO is talking about. CSS/HTML might make a good format for one part of the suite (word processing) with a lot of work on the standard. The issue: that's not what is needed for a standard. It's about doing for office documents what HTML did for websites. ODF is actually an opportunity for opera - extend the browser to support ODF so people can post ODF documents, make dynamic applications render to ODF and so on. It takes the web to the next level and further erodes the big monopoly.

    --
    -- $G
  16. Re:Is it mature enough? by TheRaven64 · · Score: 2, Informative

    Why anyone writes books in anything else is beyond me. I couldn't agree more. I am currently writing a book, and I can't imagine how people use tools like Word. It has a lot of technical content, particularly code snippets. With LaTeX, I can easily insert a few lines from a code file, and have it automatically syntax highlighted. I never have to worry about copy-and-paste errors, since the source code is included directly from the source files, which I can compile and test.

    I can also define short commands like \code{} for inline code snippets (e.g. variable or structure names) and then decide how I want them typeset later. I have a \note{} command defined, that puts the note in square brackets, blue underlined when I compile a draft, and doesn't display it at all when I compile a final version.

    The other nice thing about LaTeX is that it works with all of my standard development tools. I can keep my document in subversion, and have human-readable output to svn diff. I have a Makefile that makes generates all of my inclusions (e.g. graphs from gnuplot, images from OmniGraffle) and then typesets my document.

    The only problem with LaTeX is that it's not really a well-defined format. A LaTeX document is basically a program that generates a document. My source code if pretty easy to read, because I use English words for typesetting commands and then define them in my document class. Without the document class file, someone would be able to extract most of the semantic information from my source, but they would find it hard to generate my output.

    --
    I am TheRaven on Soylent News
  17. Re:fsck'n ugly by EsbenMoseHansen · · Score: 4, Informative

    So, McSmarty, how do I
    - position an image on page 4 of my document?

    You don't, nor do you want to. But you can anchor, float or bind the images to the text easily enough. This would be handled by css... for the HTML side, it would just be div and object tags --- not that you would ever see them, since this is an word app.

    - add footnotes?

    <p class="footnote">My footnote</p> with the appropriate CSS rule (presumably something like float: page or whatever.)

    - embed fields (date, last editor...)?

    Using XML entities, presumably

    - mark the embedded TOC as TOC so that it gets regenerated on reload?

    Regenerated on reload? Come on, have some ambition.. it should be in sync at all times. Anyway, by keeping tracks of the header tags, presumably.

    HTML is *not* a description language suitable for word processing in its current state, and it is unclear it can be made so without sacrificing device indepence.

    XHTML+CSS would need some expansions... but probably not much. A good layout program propably doesn't care about the device, but if it did, there are already @media tags to handle this situations. There are also a couple of other truly dedicated layout namespaces on w3 to consider.

    But all this matters not. This is politics. Sadly.

    --
    Religion is regarded by the common people as true, by the wise as false, and by rulers as useful.
  18. Re:CSS for Documents? by TheRaven64 · · Score: 2, Informative

    But it is tricky to use for any language other than English. Out of the box, it's English or nothing. Other European languages are complicated; more complex languages like Arabic, Hindi, or Chinese require some very involved hacks indeed. Really? All of my LaTeX files are UTF-8, and most include some non-English characters. I tend to use the raw unicode, rather than the LaTeX sequences because they are easier to type on a Mac. I'm not using a custom version of LaTeX although I vaguely remember having to include a package that told LaTeX to use UTF-8. Things like Greek letters and accents just work. I've not tried Arabic, Hindi or Kanji, however.
    --
    I am TheRaven on Soylent News
  19. Re:I don't know that I agree completely by TheRaven64 · · Score: 2, Informative

    PostScript (and PDF) have the adobe problem, but there is a better format that doesn't: DVI (the device independent format created by Donald Knuth). The DVI format doesn't even have the capability to include bitmap images. LaTeX cheats and uses the comment section to point to an external encapsulated postscript file. dvips will read this and include the EPS, and so will some DVI viewers but this can lead to all sorts of hard-to-track-down bugs. I ditched latex for pdflatex a while ago, and haven't looked back.
    --
    I am TheRaven on Soylent News
  20. Re:And wrong at that by temcat · · Score: 2, Informative

    I suggest that you read a bit about both formats and how they were developed. And actually look at the XML samples of both. Google it, it's not so hard.