Slashdot Mirror


Converting TeX to Microsoft Word?

belmolis asks: "For many years I've done almost all of my writing in TeX. This has increasingly caused problems with publishing in journals. For a long time, many journals reset what you sent them, so they didn't care what program you used. More and more, I find, they do, and in most cases, what they want is MS Word. Is there any good way to convert TeX to Word?" "I've seen some advertised. Some only work with LaTeX, which doesn't help. One claims to use a full-scale TeX interpreter, but my queries as to whether it can handle home-brew Metafont fonts, PIC graphics etc. have gone unanswered. These products also all seem to be plugins for MS Word. I don't use MS Windows or any other MS products, and hate WYSIWYG word processors (I hated Bravo before it was reincarnated as Word) so a Word plugin is not a great solution, even if it works.

Furthermore, I wonder what exactly these programs do. If they interpret the TeX and then generate very low level Word, that may result in a document that looks similar, but a journal editor probably won't be able to edit it the way he wants to. In some cases the editor can be persuaded to accept a camera-ready PDF, since it turns out that the publishers often want PDF and the reason the editor wants Word is so he can edit the text, but when the editor can't or won't budge, is there any alternative to reformatting the document entirely in Word or a clone?

The larger question this raises is, where are we going? Even if formats are open, translation is difficult if they are only commensurable at a very low level. Is the solution to write in something very abstract like DocBook? And if so, will the market go this way?"

11 of 89 comments (clear)

  1. LaTeX2rtf by Noksagt · · Score: 4, Informative

    The F/OSS LaTeX2rtf is probably your best bet. Coverts cross-references, eps pictures to jpeg, or png (pdflatex users will be happy to know rtf supports jpeg and png), equations to either an EQ field or to a bitmap picture, and does tables right. It isn't perfect, but it is good.

  2. What journals? by epsalon · · Score: 4, Informative

    Most journals I've worked with accept TeX/LaTeX or PDF files, given that you use the journal's .sty file (which they supply). I've never seen a scientific journal which doesn't accepd LaTeX output. Some don't accept MS-Word.

    If it's only a few journals, I guess no respectable researcher would submit to those, so just submit to better journals.

    1. Re:What journals? by biodork · · Score: 4, Insightful

      I would guess you are in the Computer field and not in a biologically oriented field. In those, Word is pretty close to the only answer and TeX is an unknown. I would say TeX is VERY restricted to the fields it is accepted in, and pretty much unknown outside of those.

      --
      Gavin Fischer
    2. Re:What journals? by belmolis · · Score: 4, Informative

      I'm in Linguistics, which covers a lot of areas with different publishing needs and different journals. Some journals fall more-or-less into the math and CS camp and want TeX. One leading journal the last I knew preferred Postscript (I wonder if they now prefer PDF - have to check). Some of them until fairly recently didn't make any specific demands because they still remember the days when everything came in on paper or in a zillion incompatible word processor formats. The problem is that MS Word has so dominated the market outside of some technical fields that they just assume that everybody uses Word. One editor asked me for the electronic version of the paper, without saying anything about the format. When I emailed him the TeX file, he had no idea what it was.

      To some extent of course you can favor journals that accept convenient formats, but there are a lot of limitations on that. Sometimes the paper really should go to a particular journal in order to reach the appropriate audience and/or in order to get the most brownie points. Sometimes you commit the paper before you know who the publisher will be and what format it will want. That happens with conference proceedings, Festschriften, edited collections, and so forth. And some journals don't say anything up front, so if you don't think to ask in advance, you end up in the situation in which you've invested a lot of time and energy getting the paper revised and accepted, the journal has also invested time and energy, and you really don't want to pull out at that point.

  3. Let them know. by Noksagt · · Score: 4, Informative
    This has increasingly caused problems with publishing in journals.
    You are a contributor to the journal. You donate content gratis, which makes Elsevier/Oxford/whoever fat. Let them know you are distressed they don't take LaTeX submissions and/or at-least camera-ready PDFs. Any who have recently stopped supporting LaTeX can be encouraged to start again. Even some journals which haven't taken LaTeX submissions recently have switched due to scientist-demand.
    but my queries as to whether it can handle home-brew Metafont fonts,
    Yeah--good luck with that. metafont->ttf conversion is very tricky. Furthermore, the journals don't really like weird fonts (once they get the DOCs, they often strip ALL formatting). You can go metafont->postscript image->wmf/emf. It is far from ideal
    but when the editor can't or won't budge, is there any alternative to reformatting the document entirely in Word or a clone?
    Ask them what formats they will accept and for which reasons. Many are happy as long as they are able to extract plain-text from your document.
    Is the solution to write in something very abstract like DocBook?
    This would be an O.K. solution. It would allow you to go to RTF or typeset with LaTeX. But it is both less powerful than LaTeX & less "friendly" than Word.
    And if so, will the market go this way?"
    The publishers are dependent on content. A lot of your peers probably do use Word. It is important to know that you can influence which way the market goes & to let them know your preferences.
  4. Use your own! by Ratso+Baggins · · Score: 5, Funny
    1. Print you document's TeX source code on rice paper.

    2. Eat printed code.

    3. Wait 12-24 hours.

    4. Collect the word docs at "the other end".

    --

    --
    "we live in a post-ideological world..." - Billy Bragg.

  5. Re:Grow Up? Is that an option? by BinLadenMyHero · · Score: 4, Interesting

    Write what? It's not that Word is a bad wysiwyg, it's that wysiwyg is bad per se. It's not a matter of taste. LaTeX is MUCH more productive, gives better result, and you concentrate on content, rather than fighting with Word about format details. Fighting, because Word keeps changing the breaks, formatting, and stuff.

  6. Dear slashot.. by Anonymous Coward · · Score: 5, Funny

    I have a large application written Common Lisp. It makes heavy use of macros and is written in a functional paradigm. Also, it uses a sophisticated code-walker macro to optimize the code and convert it to CPS style, and includes a full Java JVM written in Lisp to ease training new hires, as well as a type inference engine. About 50% uses CLOS multimethods and "around" methods.

    However, my new manager only knows Visual Basic on Windows 95. How can I translate? I'm pretty sure it's not a "1-to-1" port. For instance, how do I do continuations in VB? Thanks!

  7. Keep using LaTeX by mcelrath · · Score: 5, Insightful
    LaTeX, quite simply, is the world standard for scientific documents, and it should stay that way. You simply cannot enter complex mathematics in any word processor.

    If your journal is telling you that they won't accept latex, tell them you won't submit your articles anymore, thank you very much.

    In physics we have it good due to the existence of the arXiv, where we put our articles first. Therefore journals are already limited by the fact that your article is already published on the web, and they have to accept the consequences of that. e.g. they cannot have too draconian copyright terms. I know in many disciplines the situation with journals is much worse. But remember, journals are totally dependent on us, the scientists, and not the other way around. With the advent of the web and email we can diseminate our work to our colleagues and perform peer review all without the intervention of a journal.

    The physics community accepts latex as the standard, and people are (rightfully) suspicious of articles which appear on the arxiv in only .doc or .pdf format.

    So, I suggest you keep using latex, investigate adding a section to the arxiv for your specialty, and tell your journal that they will accept latex or be replaced.

    -- Bob

    --
    1^2=1; (-1)^2=1; 1^2=(-1)^2; 1=-1; 1=0.
  8. Inevitable Frustration by Noksagt · · Score: 4, Insightful

    Unfortunately, most of the converters will do only a subset of the markup languages & so few (if any) will work well with custom macros.

    The Chikrii TeX2Word MIGHT do it. TeX4ht may also be worth a try (->HTML/XML, which can easily become other formats). Can't comment on TeXPort. Those are really your only options. If worse-comes-to-worse, you can also look fo ps/pdf->word solutions, but those are just as bad as (La)TeX->Word.

  9. Productivity != Shallow Learning Curve by Noksagt · · Score: 4, Insightful

    The first key to productivity is that you are comfortable in the environment. Additional keys are that it is expressive & doesn't force you through tedium & allows you to script away as much tedium as possible. Certain people ARE more comfortable with LaTeX & know it well enough (and use the right tools) such that it isn't tedious. The most tedious parts about LaTeX are not knowing how to do something (which is combatted by knowledge or good tools or good code to steal) and compilation errors (which is combatted by knowing the syntax well, by using editors that prevent/fix/point out errors, and by compiling frequently (sometimes in the background)). LaTeX is CERTAINLY more scriptable than Word & automating references & formatting can be quite trivial. An example I recently used was a solution to placing a series of dozens of figures & captions. It is easy to generate the plain text code to do this. Less easy to write a VBA script in Word. LaTeX is also more reusable & versioning CAN be better. In short, people CAN BE PRODUCTIVE in LaTeX

    Products with shallow learning curves have simple interfaces. It is true that Word has an easier-to-understand GUI than many of the LaTeX GUIs. More importantly, it is (whether we like it or not) omnipresent & most administrative assistants already have some experience with (or at least knowledge of) it. Shallow learning curves do mean increased productivity for the novice. They don't translate to increased productivity for ALL users or ALL applications.