Slashdot Mirror


Sanely Moving from Word to the Web?

FooAtWFU asks: "I have a job for a web site (no link for you, Slashdot hordes!). A lot of it is systems administration and development, but I have to routinely post content which comes from a myriad of other sources. Usually they are from academic users, come in Word format, and ultimately need to be posted in HTML. The problem is that Word has all sorts of tricks up its sleeve to throw off the font, layout, size, and so forth. To achieve any sort of visual consistency on the site these various formatting tags all need to be scrubbed, but even using other office suites with better HTML export (OpenOffice.Org) to do the dirty work, it's often easier to recreate the formatting by hand from a plain-text version than it is to clean up a sea of messy tags. Does anyone have any advice (or magical tools) to help me deal with this sort of tedious cleanup?"

24 of 547 comments (clear)

  1. Tedious cleanup? by Timesprout · · Score: 1, Funny

    Sounds like a job for Mrs FooAtWFU

    --
    Do not try to read the dupe, thats impossible. Instead, only try to realize the truth
    What truth?
    There is no dupe
  2. One suggestion by Da+Fokka · · Score: 3, Funny

    You might consider a pack of monkeys and typewriters. They can ultimately reproduce Shakespeare so maybe, maybe they might be ablt to properly reformat the HTML gibberish Word produces.

    Of course, you could also outsource to India but that's unethical to both the monkeys and the Americon economy.

    1. Re:One suggestion by Anonymous Coward · · Score: 3, Funny

      It's hard to find qualified monkeys - most of them already have jobs editing /. and cnn.com...

  3. One Word... by ScentCone · · Score: 5, Funny

    ..."Intern"

    --
    Don't disappoint your bird dog. Go to the range.
    1. Re:One Word... by Cerdic · · Score: 5, Funny

      No, no, no...

      Usually they are from academic users

      It sounds like this might be a university environment. The correct answer should be grad students .

      --
      Advice for my fellow geeks: before seeking out that threesome you dream of, you might see what a TWOsome is like first.
  4. Re:hi by Anonymous Coward · · Score: 2, Funny

    hello. how are you?

  5. Re:hi by Anonymous Coward · · Score: 2, Funny

    I'm fine thank you

  6. Re:no link for you, Slashdot hordes! by Dunbal · · Score: 5, Funny

    Which only goes to show:

          There is NO WAY the slashdot effect can be avoided. Resistance is futile...

    --
    Seven puppies were harmed during the making of this post.
  7. Re:hi by Anonymous Coward · · Score: 2, Funny

    I'm fine too.

    I'm glad we have these little discussions. It makes my day so much more interesting.

    Let's do lunch.

  8. Re:Dreamweaver by drmike0099 · · Score: 3, Funny

    The OP is correct: 1) Open Dreamweaver. 2) Commands > Clean Up Word HTML... 3) Rejoice

  9. Elaine's by Anonymous Coward · · Score: 1, Funny

    I've got a table at Elaine's. Can you make it?

    1. Re:Elaine's by Anonymous Coward · · Score: 1, Funny

      Not sure; I don't know Elaine and I'm not very handy with tools.

  10. Amen by Quadraginta · · Score: 5, Funny

    Jesus, tell me about it. I get 30kb attachments merely saying "Got your email, thanks!" with "thanks" done up in some odd curly red font and a six-line sig, not to mention the twenty-seven 8x10 colored glossy JPG attachments with circles and arrows and a paragraph on the back of each one...

  11. Re:no link for you, Slashdot hordes! by slo_learner · · Score: 2, Funny

    I can understand why you would hunt this information down for your own demented purient interest, but why did you have to post it?

    Didn't he clearly state that he didn't want to be slashdotted? This just seems like a perfect opportunity for the application of a little common sense along with just a hint of courtesy.

  12. Re:PDF? by cloudmaster · · Score: 2, Funny

    I hate you and your kind. Yes, hate. :)

  13. Re:Resign from your executive position by VGR · · Score: 5, Funny

    You think that's bad?

    I was given 61 screenshots (blithely dubbed "program requirements"), each its own Word document. Each containing only a (weirdly scaled) picture, of course.

    61 Word documents.

    --
    The Internet is full. Go away.
  14. Re:Sounds like you should release on sourceforge by extrasolar · · Score: 2, Funny

    See, if we were really elite, we would all automatically know how to do stuff like this with our favorite editor. There would be no Ask Slashdot. There's a reason why emacs is one of the most popular editors around and it's because it saves us from having to do this kind of repetative text work that should be done autonomously.

    But we're not elite and I'm now going to learn how to do macros in emacs :)

  15. Re:Resign from your executive position by Detritus · · Score: 2, Funny

    Because it isn't a "real memo" unless it is printed on company letterhead, formatted according to the company's style guide.

    --
    Mea navis aericumbens anguillis abundat
  16. You can get anything you want.... by Quadraginta · · Score: 2, Funny

    But that's not what I came here to tell you about.

    I came to talk about the draft.

  17. Re:Dreamweaver by bach37 · · Score: 1, Funny

    in Dreamweaver, there's a command "Clean up MS Word HTML".

    You mean the menu option that says 'Delete All'?

  18. Re:no link for you, Slashdot hordes! by jalefkowit · · Score: 4, Funny
    This just seems like a perfect opportunity for the application of a little common sense along with just a hint of courtesy.

    You must be new here.

  19. Common... what? by DragonHawk · · Score: 2, Funny

    "... a perfect opportunity for the application of a little common sense..."

    What is this "common sense" of which you speak? Where may I download it from?

    --

    dragonhawk@iname.microsoft.com
    I do not like Microsoft. Remove them from my email address.
  20. Re:Actually, an NDA probably doesn't matter. by mrchaotica · · Score: 5, Funny
    Doubtless he couldn't post the _documents_ that he converted.
    You realize he was converting them for the purpose of putting them on a website, right? ; )
    --

    "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

  21. Re:Resign from your executive position by Larry+Lightbulb · · Score: 2, Funny

    You get specs? And you're complaining?