Slashdot Mirror


Formats for Electronic Forms?

Bifurcati asks: "I'm a grad student at the University of Queensland, Australia. I am frequently required to submit forms (e.g., annual reports) which are sent as Word or RTF documents and must be filled in electronically. However, these are almost impossible to use under Linux (e.g., StarOffice) because the tables and formatting are just too complicated and get mangled. Even Word for Mac sometimes has problems. Can anyone suggest a better, cross-platform format? Could PDF files with forms do this? What are the costs & learning curve? User friendliness is vital for both admin and student. Alternatively, can anyone suggest ways they could make their Word files more compatible across platforms?"

38 comments

  1. HTML? by Anonymous Coward · · Score: 3, Insightful

    Add a bit of PHP, maybe a database, and easy form processing.

  2. HTML by Fred+Nerk · · Score: 5, Insightful

    What about good ol' online forms? They are accessable from almost any PC, the users can't go changing the layout of the forms, data can be real-time, and there's no need for someone to parse the forms by hand.

    --
    Anything is possible, except skiing through revolving doors.
    1. Re:HTML by MasterDirk · · Score: 2, Insightful

      This is just what the net is good at. XML!

      Honestly, it is!

      --

      "Programming is like sex: one mistake and you have to support it for the rest of your life."

    2. Re:HTML by Anonymous Coward · · Score: 2, Insightful

      One problem is printing HTML. It's hard enough to make a good screen layout. Making it look good on paper, so that certain parts are guaranteed to be on certain pages, is very difficult.

    3. Re:HTML by slittle · · Score: 2, Insightful

      Once you've snarfed all the data from a form into a file or DB, it can be barfed back out in whatever format and layout you want.

      --
      Opportunity knocks. Karma hunts you down.
  3. Crossover Office by nri · · Score: 1

    I'm sure theres a older warez version out there somewhere (given that uni student have a tight beer budget).

    --
    if :w! doesn't work, try :!cvs commit -m""
  4. Loads fine in OO 1.1.1 by e_AltF4 · · Score: 0, Troll

    Don't care for formatting, load in OO, fill in and save.
    It's not you who has to wade through the bad formatting
    once you saved as rtf (or .swx) and mailed back the result.

  5. PDF forms would seem to be a perfect fit. by Talonius · · Score: 4, Insightful

    They allow you to save the data with the form, transfer the data to a remote source without any loss, and at the remote site you can process the PDF form with additional software. Individuals can save the form and take it with them to have the data entered, removing the requirement for a connection to the network (ala HTML forms).

    Acrobat allows you to easily specify the types of data you want them to allow to input. There's quite a few PDF form creation software packages available as well allowing you to do to this.

    We use them at my place of employment and have had only one problem: data entry sections that can widely vary. There's no way to make the section grow or shrink that we've found so if the form creator specified area isn't large enough to hold your data you could be out of luck when you go to print.

    In that same vein they don't deal well with ad hoc data being added to the beginning or end of the form as a Word or RTF file would. The purpose of a form is to get away from that sort of data, but it happens.

    --
    My reality check bounced.
    1. Re:PDF forms would seem to be a perfect fit. by VasiliTerkin · · Score: 2, Informative

      I thought you can't save the changes in the "READER" (free, aka "the one used by everyone (on windows)") version of the Acrobat. + Very much readable + easy to use + Can be saved off-line and emailed - Requires expensive full version of Acrobat to be truly a 2 way communitacation tool. - Can be parsed for data, but not an easy/cheap task. - The fields are not flexible (dont expand in relation to data volume) i.e. I always have to tweak them in full Acrobat for my last name to fit.. grrrr... InfoPath though XML-based, seems to require Office 2003 - the hole you seem to be trying to avoid. + XML-based, easy data parsing. + Fields are flexible. + Can be filled out off-line - Need extensive MS infrastructure for support of these. - Talking about a niche market! Geesh. To view forms, you NEED InfoPath. HTML forms. well... you know the story... + Can look and work in any way you want. + Parse\store in any way admin wants. - Server\Live connection required. Simple Text files: i.e.: (field code1) answer (field code2) answer (field code3) answer + Universally readable, save-able, flexible "field size". - Doesn't look good, users can mess up the filed codes.

    2. Re:PDF forms would seem to be a perfect fit. by 2TecTom · · Score: 1

      I'm terribly sorry to have to be the one to inform you of this, but PDF sucks. Perhaps it's proprietary existence just isn't well enough understood yet?

      --
      Words to men, as air to birds.
    3. Re:PDF forms would seem to be a perfect fit. by CRCulver · · Score: 3, Insightful

      I'm terribly sorry to have to be the one to inform you of this, but PDF sucks.

      Some of the arguments on that site you cite are silly. I mean, the guy writes: "Have you ever looked inside a PDF file? It may preserve the document's data, but it does it in a non-standard way." What is a "standard way"? Furthermore, PDF doesn't do anything different than PostScript, and PostScript is revered in the printing industry because it is so dependable, for it will always print the same way.

      The author then goes on to say: "Printed documents are unreadable until you track down the correct printer driver." Duh. This isn't a PDF problem, it's a printer problem. I'm sure MS Word, DVI documents etc. look like crap if you can't set up your system correctly.

      Personally, I would like to see XMLFO become the usual method of interchanging printable documents online because then one can approach it in many ways through standard XML tools. But PDF isn't bad for documents one expects to be printed.

    4. Re:PDF forms would seem to be a perfect fit. by Anonymous Coward · · Score: 0

      Standards, as in IEEE or W3C etc... oh, and btw, have you tried using PDF's without having an installed printer driver? I don't think so.

    5. Re:PDF forms would seem to be a perfect fit. by einhverfr · · Score: 1

      Even with the reader, you can fill out forms and send back the PDF with the added information in the forms. These are more similar to HTML forms in that only a few parts allow for editing. But it is possible.

      Also, LaTeX can create PDF Forms, iirc. I don't have experience with this aspect of LaTeX yet.

      --

      LedgerSMB: Open source Accounting/ERP
    6. Re:PDF forms would seem to be a perfect fit. by einhverfr · · Score: 2, Insightful

      xpdf requires printer drivers? So does ggv?

      Actually, I have read PDF's without printer drivers installed for quite a while. What you are describing is an Acrobat Reader (or perhaps Windows) problem rather than a PDF one.

      --

      LedgerSMB: Open source Accounting/ERP
    7. Re:PDF forms would seem to be a perfect fit. by einhverfr · · Score: 1

      It is only proprietary to the extent that Adobe has not taken it to standards organizations. However, they have published the spec, and been happy with third party implimentations. So in that regard, it is an open standard, albeit not an official one.

      --

      LedgerSMB: Open source Accounting/ERP
    8. Re:PDF forms would seem to be a perfect fit. by Anonymous Coward · · Score: 0

      Perhaps it's proprietary existence

      "its".

    9. Re:PDF forms would seem to be a perfect fit. by 2TecTom · · Score: 1

      ouch, sorry

      --
      Words to men, as air to birds.
    10. Re:PDF forms would seem to be a perfect fit. by uits · · Score: 2, Insightful

      Yes, because everything proprietary "sucks". Regardless, it's an open format, it's ubiquitous, and it fills a need you obviously don't understand.

    11. Re:PDF forms would seem to be a perfect fit. by 2TecTom · · Score: 0, Redundant

      Obviously from your lofty position, a Technical Writer, who works with PDFs daily, wouldn't have any perspective on this issue.

      People who feel the need to resort to insulting others speak volumes only of themselves.

      --
      Words to men, as air to birds.
    12. Re:PDF forms would seem to be a perfect fit. by 2TecTom · · Score: 1

      Dood, please, don't waste your mod points on me, find someone who cares.

      --
      Words to men, as air to birds.
  6. PostScript by Cranx · · Score: 2, Insightful

    You can't just edit PostScript directly?

  7. Tested it with Word 2004 for Mac... by mTor · · Score: 2, Informative

    ... and they open just fine.

    As for compatibility, Word 2004 has a nice feature called "Compatibility Report" which analyzes a file before saving it and warns you which versions of Word might have problems.

    My general advice for forms is to implement an XML based form server. I know that Adobe is pushing their PDF forms server but that's really an overkill. If you have money to burn, Microsoft's InfoPath is a good choice as well.

  8. What kind of people submit to ask slashdot? by 7-Vodka · · Score: 1, Funny
    • I'm a grad student
    • I am frequently required to submit forms (e.g., annual reports)

    How often do anual reports come round again you lazy bastard?!
    Oh wait, you posted to: ask slashdot. Duuh.

    I jest, I jest.

    --

    Liberty.

  9. MOD PARENT UP by blackcoot · · Score: 2, Insightful

    i was going to suggest something along these lines. if you're serious about it, you may want to check out xforms especially if you're doing any major processing of those forms once all is said and done.

  10. Learning Curve for PDFs? by rmohr02 · · Score: 2, Informative

    There is essentially no learning curve for using PDFs. In MS Office (or, for that matter, any Windows application), use PDFCreator. On OS X, exporting to PDFs is a function of the operating system. In Unix/Linux, there's ps2pdf (and just about any application will print postscript to file), as well as KOffice and OpenOffice.org.

  11. XML by jefu · · Score: 3, Interesting
    This is the kind of problem that XML solves perfectly.

    XML would allow you to define (in a DTD/Schema...) the kinds of data that the form should be collecting and do it in a format neutral way. Then you could use web pages (translate the XML automatically to XHTML, grab the data and translate back). This can be fairly easily automated as could other methods to handle the input. PDF and DOC (and its cousins) are poor substitutes as you can't as easily identify the important information in the document, you can't store it concisely and you can't then do semantic level searches on it. Furthermore, in XML processing you can do consistency checks and so on.

    In a web setting (or similarly "connected" kind of configuration) you could pre-populate much of the data for the user. You could even "compile" the xml to a set of online forms (XML -> GLADE or the MS .NET XML window description thing).

    Once the data is entered into XML it can be massaged and output in any needed format (I don't know of any free XML to DOC format converter, but I suspect that the XML enabled MS Office stuff can do it if needed).

    By the way, while that first step is easy to say, actually defining the DTD/Schema/... is likely to be rather difficult. (Look up sometime what it takes to specify an address.) But this difficulty pays off immensely in that you know much more about your data, and much more about the ways it might be used. Once this is done though, the other parts are really pretty straightforward.

    It might take a bit of work, but in the long term coding this up in XML is likely to save far more work and money.

    1. Re:XML by AdamPiotrZochowski · · Score: 1, Informative


      Another XML fanboy.

      First you create XML. Then you build DTD/Schema to enforce a set
      of constrictions on the original XML (note: DTD/Schema were added
      later to XML).

      Then you 'automatically' convert the said XML to XHTML.

      right, you build another XML to represent how to display the
      original data within original XML/DTD/Schema. Such display XML
      shall not be called XML, lets call it different things, for there
      are different DTD/Schemas for display XMLs. The Mozilla crowd
      wants you to use XUL, the ms camp wants you to use XAML, and
      ofcourse, others will say XHTML + Javascript + some server side
      sollution. When I say display I really mean display with ability
      to edit/add/delete.

      Anyways, since XAML/XUL are esoteric we are left with XHTML +
      javascript + server side scripting (jsp/asp/php). This, as you
      say, is quite bad for true controlled non editing display. We
      want to export to DOC (be it RTF, or ms office XML), or to
      ascii or PS/PDF. Good, now hope that your original display XML,
      afterall from which you generated XHTML, can also be used to do
      the conversion to other formats. Here again you will come to
      problems, as you need to keep playing and writting converters
      from one XML format to another, massage their content, until the
      PDF/DOC exporters work like you want.

      Now, you have gone through tons and tons of extra hoops, with a
      fairly new technology. And it took you how long?

      Here is somethings that much faster:
      1. sql -> setup tables / foreign keys / check constraints
      2. ms access -> link tables
      3. ms access -> auto wizard the forms
      4. ms access -> export forms to ASP

      and you are done. And anything to do is few clicks away. Now,
      its not perfect, actually far from it (ms-access exported forms
      to ASP leave ALOT to be desired) but atleast its not hodge podge
      as it is with the magical XML.

      I cringe when people say XML is a sollution to every data related
      problem. The only thing that XML by itself that is amazing is
      the nested data structure support. Everything else the good
      old tab/comma delimited format already supported!!

      --
      /apz, XML is good and all when used right


  12. It's not just a linux problem ... by jc42 · · Score: 2, Insightful

    It's perhaps worth pointing out that it's also a problem for Windows users. If you have a version of Word that's very different from the one used to generate the form, there's a good chance that it'll be garbled for you, too.

    The funny thing is that university admin types tend to use ancient, unpatched versions of W98 (or even W95), so it's students with an up-to-date XP machine that are likely to have problems. OO on linux can often read such files better than recent versions of Word.

    Of course, the real solution is to somehow educate them to the risks of using Word docs. But they're university people; they probably can't be educated.

    --
    Those who do study history are doomed to stand helplessly by while everyone else repeats it.
  13. PDF by kronsrepus · · Score: 1

    PDF forms are easy as, only problem reader will let you enter the data, but only Acrobat will let you save the data back into the PDF.

    The learning curve for setting up a PDF form isn't too bad, whoever creates these forms in the first place just needs to take their Word file, convert it to PDF and mark the form fields in Acrobat.

    For the end user putting data into the form - if you've got a PDF printer driver under Windows then you can just print them back to a PDF, or under linux print as postscript and convert to PDF.

    It all comes down to the initial effort, and if its worth it for those creating the form.

    1. Re:PDF by Anonymous Coward · · Score: 1, Informative

      We've just done a pilot of exactly what the questioner seems to be asking for - and we used online forms (written in XHTML), and a stylesheet for rendering the form to FOP, and thence an Open Source product (forget which) to render to PDF forms. User gets on to the website, fills in the form online, submits and gets a filled-in PDF back, which they can sign and send in by snail-mail if they want to. Glorious thing is we've already got all the data stored from the online form, so we're only checking for their signature.

  14. XFDL and soon XForms by bwt · · Score: 3, Informative

    XFDL works for this, but unfortunately there is only one vendor: PureEdge. The US Air Force is using this for online/offline forms capabilities. XFDL was sort of a predecessor to XForms (one of the PureEdge lead technicals is on the XForms WG).

    Eventually, XForms should have enough support to category kill this problem. It's just taking a while because it has a lot of dependencies on other XML specs that make it difficult for implementors. I was SO glad to see IBM and Novell step in to provide resources to Mozilla implement XForms.

    1. Re:XFDL and soon XForms by Korgan · · Score: 1

      Doh! You beat me to it! ;-) Is exactly what I was going to suggest as well.

      This XForms is perfect for this kind of application.

  15. PDF-Scribus and OO. by Anonymous Coward · · Score: 2, Informative

    Well two things. One scribus will let one create PDF forms. Second OpenOffice can create it's own editable forms, and then print that to PDF.

  16. No -- we need *fillable* forms by JavaRob · · Score: 1

    I've run into the same problem as the questioner... I don't have Adobe Acrobat (and don't really want to buy it for personal use, at $250+), and making locked forms in Word can be irritating to get working right, and many people can't fill in the form.

    Sure, I can create PDFs with no problem using various software (it's built-in to OpenOffice, for one)... but I haven't seen free or open-source software that will let me create PDF *forms*, where the user can fill in the fields before printing it out.

    Actually, that brings up another issue -- even if I had Acrobat, would it let me generate PDF forms that could be saved? Most of the ones I see (I'm thinking tax forms here, actually) only let you fill in the fields and print out the result, not save your modified document with the field data.

    This really strikes me as a hole in available software (proprietary or not!).

    Here's my personal example: For a recent school reunion, we wanted to build a "face book", where people could fill out and return (via email) a form with a photo, current name and contact info, recent history, etc... and it was a real pain to do. I ended up sending out a Word doc, with *parts* of the document locked (with form fields) and parts unlocked (for inserting photos and giving users font control in parts). Then I also sent out an OpenOffice-generated PDF, for people to print out, fill-in by hand, scan, and email back as an image. Some people who had Acrobat installed managed to edit this file and send it back.

    Needless to say, there were plenty of people who had a hard time with either format. Some just emailed me back the text they wanted to enter and attached photos to the email, since they couldn't get them properly into the Word doc. Some people gave up, and didn't get a page in the book. And this is ignoring all the hoops *I* had to jump through to open the attachments from people using Outlook (winmail.dat, anyone?). [well, that last complaint is a rant for another day]

    Why should this be so complicated? Yes, I considered setting up a simple webapp to collect the info. But of course every entry needed to fit on a single page... and HTML is not at all designed for specific control over that kind of thing. Plus I'd need at least some control over what I'd let people upload as a "photo", for security reasons, and have to automate cropping or at least resizing of the image... it got way too ugly way too fast.

    I could have given people the password to unlock the Word doc. But I'd have to warn them that once you unlock the document, you can't edit the form fields anymore. Oh, and if you re-lock it, it kindly deletes everything you previously entered in those form fields. Ugh.

    1. Re:No -- we need *fillable* forms by Your+Pal+Dave · · Score: 1
      Sure, I can create PDFs with no problem using various software (it's built-in to OpenOffice, for one)... but I haven't seen free or open-source software that will let me create PDF *forms*, where the user can fill in the fields before printing it out.
      There is a method called 'pdfmarks' that allows you to use little snippets of postscript code inserted into the print stream. This can be done by embedding them in little EPS files and inserting them into your wordprocessor document as graphics. This isn't as covenient as the drag-n-drop method in Acrobat, but I imagine that it might end up being easier to maintain a document marked up in this method than having to fire up Acrobat and re-do the forms markup every time you change the base document.
      Actually, that brings up another issue -- even if I had Acrobat, would it let me generate PDF forms that could be saved? Most of the ones I see (I'm thinking tax forms here, actually) only let you fill in the fields and print out the result, not save your modified document with the field data.
      For a small mountain of money, Adobe will sell you something called Adobe Reader Extensions Server which flags a PDF as being savable to the free Adobe Reader. I believe it uses some sort of crypo-drm method to do so, but that's just a SWAG.
  17. InfoPath by DreamTheater · · Score: 1

    Information collected with InfoPath 2003 can be integrated with a broad range of business processes because InfoPath 2003 supports any customer-defined Extensible Markup Language (XML) schema and integrates with Web services. As a result, InfoPath 2003 can help you connect directly to organizational information and then act on it, which leads to greater business impact.
    http://www.microsoft.com/office/infopath/prodinfo/ overview.mspx

    1. Re:InfoPath by Anonymous Coward · · Score: 1, Informative

      Um, thanks for the advertisement, but the poster asked for a CROSS-PLATFORM solution. InfoPath is Windows-only.

      I've used InfoPath in the past, and while it got the job done, there were about 10 different other ways to go which would have gotten the job done in the same amount of time AND been cross-platform.

  18. PDF editing by BakaBaka · · Score: 2, Informative

    As a designer, I've had to make forms for a variety of idiots. Lots of idiots choose MS Word/Works, which is fine if you lock the documents and provide very specific instructions. For personal and in-office stuff, I use Excel. It's ugly but functional. For most other forms, I prefer the PDF format. For fancy stuff, a lot of design software (including Quark, Photoshop, etc.) can export to PDF, and you can drop the form boxes in later. Although Acrobat lacks the tools you might need, PitStop Pro by Enfocus will give you neat and necessary tools. It makes Acrobat more crash-prone, but it's worth the pain. If the creator of the forms has PitStop, the users just need Acrobat and a readme. Short of encrypting the data or sending it as graphics, this is (in my opinion) really the best way to send semi-sensitive form data over the internet.