Converting Word Files to Text for Archiving?
Unknown Relic asks: "Our company has large quantities of old, MS Word documents which we are looking to permanently archive. One of the requirements of our archiving process is that the documents be stored in plain text format. Unfortunately we also have another, conflicting requirement: the text files must retain basic formatting information from the original documents, including bullets, indentations and basic table layout. While all of this formatting is possible using plain text, I have not been able to find any tools which do a decent job of retaining the above mentioned formatting during conversion. Even Word's 'Save As' option does a horrible job, though I suppose that's not overly surprising. Has anyone undertaken a project similar to this before? If so, what tools did you find or create to make the job feasible?"
Microsoft Word can't relate to anyone other than disgusting, obdurate cutthroats. (Read as: Microsoft Word's motto is "never forgive and never forget".) Stick your nose into anything Microsoft Word has written recently, and you'll get a good whiff of demonic irreligionism. Similarly, Microsoft Word dreams of a time when they'll be free to manipulate everything and everybody. That's the way it's planned it, and that's the way it'll happen -- not may happen, but will happen -- if we don't interfere, if we don't ensure that we survive and emerge triumphant out of the coming chaos and destruction. Microsoft Word offers two reasons as to why laws are meant to be broken. It argues that (1) some people deserve to feel safe while others do not, and (2) it is not only acceptable, but indeed desirable, to wage an odd sort of warfare upon a largely unprepared and unrecognizing public. These arguments are invalid for the following reasons: First, it's our responsibility to denounce its publicity stunts. That's the first step in trying to subject its snow jobs to the rigorous scrutiny they warrant, and it's the only way to compare, contrast, and identify the connections among different sorts of voluble, pugnacious separatism. We need to settle our disputes with rational discussion -- not by moral huffing and puffing. May we never forget this if we are to deny Microsoft Word and its surrogates a chance to ignore compromise and focus solely on Microsoft Word's personal agenda.