Slashdot Mirror


Ask Slashdot: Best Way To Store Data In Hard Copy?

First time accepted submitter bmearns writes "I have some simple plain-text files (e.g., account information) that I want to print on paper and store in my firebox as a backup to my backup. What's the best way to encode the data for print so that it can later be restored to digital form? I've considered just printing it as text and using OCR to recover it. The upsides are that it's easy and I can even access the information without a computer if necessary. Downsides are data density, no encryption, no error correction, and how well does OCR work, anyway? Another option is printing 2D barcodes. Upsides are density, error correction, I could encrypt the data before printing. Downsides are that I'll need to split it up into multiple barcodes due to maximum capacity of popular barcode formats, and I can't access the data without a computer. Did I miss any options? What do slashdotters suggest?"

16 of 329 comments (clear)

  1. Whatever you do.... Use papyrus by pollarda · · Score: 4, Funny

    The Egyptians used hand written papyrus and we still have copies to look at. The laser printed paper copies of the Book of the Dead simply didn't survive.

  2. Use the correct font for OCR by chill · · Score: 4, Informative

    Google for OCR-A and OCR-B as TTF. There are freely available versions. I use them for mailing labels, along with PostNet bar codes to make it as easy as possible for the Post Office.

    --
    Learning HOW to think is more important than learning WHAT to think.
  3. Re:Text, but why? by Z00L00K · · Score: 4, Interesting

    And as long as a decent font for OCR is used - like OCR-B then it should be feasible.

    The reason for doing it - well, if you want to preserve something for a few decades then it's printing on lint paper and using ink that can survive a long time. The latter is probably the hardest since nobody really knows which kind of ink used in computer printers that's able to survive for centuries.

    My suspicion is that the dot matrix printers are better off than lasers and inkjets.

    --
    If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
  4. Re:Text, but why? by Anonymous Coward · · Score: 5, Insightful

    Why, the answer is simple, there is no standard for Digital backup. zero zip. There are only two methods of time test backup.
    1) Text printed on no acid paper.
    2) Microphish. or film.

    I suggest you print it with ocr readable characters with a pigment based ink. If you are that serious about backup, take it to a printer and have them printed with good ink on the best paper you can find. store the copies in two separate locations.

    Remember every one, there is NO standard on digital backup medium.

    Text printed correctly on zero acid paper or film is the only time test way.

    IMHO

  5. Re:Text, but why? by Hans+Lehmann · · Score: 5, Insightful

    How many accounts can anyone have that they actually need to have bar codes or some other such nonsense to be able to regain entry to them? Print out you account information, user names, passwords, etc., and put the printout in your fire-resistant safe. If your house burns down, or some other calamity happens, and you need to regain access to all of your accounts, then you'll just re-enter tha passwords for each one. This can't possibly be more complicated than setting up some OCR / Barcode / Rube Goldberg solution.

    --
    09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
  6. Engraved to stone by vasster · · Score: 4, Insightful

    Engraved to stone. Guaranteed for centuries.

  7. Re:Google by sideslash · · Score: 5, Funny

    I think using a proprietary standard for this has potential for disaster in the long term. QR codes would be much better. Scenario: the author of "Paperbak" discovers a huge improvement in his algorithm and deprecates the old version. 20 years into the future somebody needs to decode their stuff, and they search for the source code to "Paperbak" and realize that the only version they can find on the future internet is the "new/improved" version that can't read their stuff. So they are just the lucky owners of some paper decorated with a very specific arrangement of dots.

    With QR Codes, on the other hand, it is difficult to believe that the knowledge of their format will be lost in our lifetimes. They have their own Wikipedia entry describing their structure, for example.

  8. Re:Text, but why? by fuzzyfuzzyfungus · · Score: 4, Informative

    No need to worry about ink: even the cheapest and nastiest laser printers use toner, and a mixture of thermoplastic and carbon black thermally fused to your paper isn't going anywhere(in fact, if you use lousy enough paper, some lucky future archeology intern may have the... unmixed pleasure... of picking the little plastic character glyphs out of the pile of dust, trying to keep them in their original order!).

    His data-restore needs probably don't extend to truly epic lengths in any case, so it shouldn't be a big deal.

  9. Print it out with check digits by SSpade · · Score: 4, Informative
  10. Re:QR codes? by bill_mcgonigle · · Score: 5, Informative

    Here's a link for Paperbak: http://ollydbg.de/Paperbak/

    PaperBack is a free application that allows you to back up your precious files on the ordinary paper in the form of the oversized bitmaps. If you have a good laser printer with the 600 dpi resolution, you can save up to 500,000 bytes of uncompressed data on the single A4/Letter sheet. Integrated packer allows for much better data density - up to 3,000,000+ (three megabytes) of C code per page. ....

    Actual version is for Windows only, but it's free and open source, and there is nothing that prevents you from porting PaperBack to Linux or Mac, and the chances are good that it still will work under Windows XXXP or Trillenium Edition. And, of course, you can mail your printouts to the recipients anywhere in the world, even if they have no Internet access or live in the countries where such access is restricted by the regiment.

    --
    My God, it's Full of Source!
    OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  11. Text. It's the only way. by roc97007 · · Score: 4, Insightful

    This is a backup to your backup, so digital means must have failed before you'd consider using it. Text is low density, but it has an advantage that any encrypted barcode or other high tech means do not have -- it can be read by human eyes. When you're huddled in a rough lean-to roasting a feral cat over the campfire amid the wreckage of civilization, you will still be able to read your backup. That might come in handy.

    --
    Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
  12. Re: One word by Anonymous Coward · · Score: 4, Insightful

    Dude, this happens to stuff that is very valuable to you? You either have an awesomely interesting life or are a weird mix of paranoid and careless.

  13. Re:Text, but why? by plover · · Score: 4, Informative

    Sadly, many of my old dot matrix and teletype printouts have faded as much as any other liquid ink I've used. It depends entirely on the ink in the ribbon. The liquid ink present in ordinary ribbons was often of wildly varying quality, and most people who bought those ribbons in bulk sought out the cheapest possible ribbons. I wouldn't bet on their longevity.

    Laser printed pages consist of carbon in plastic, and there's no reason they shouldn't last a century or more, as long as certain conditions are met: if the toner is properly fused to the paper, if the paper doesn't degrade beneath it, and if the facing page doesn't adhere to the toner.

    1. Your printer should have the right temperature set in the fuser, and that's probably not even adjustable to you. If the toner comes out dusty or smeary, it's too cool. If it comes out brown and crispy, it's too hot :-) You should recognize it immediately if the print quality is poor.

    2. Store the paper properly. Heat is your enemy: don't let it get too hot, and don't store anything you want preserved in sunlight. Don't let it get damp - mold will destroy paper. Don't use crappy paper that will disintegrate - acid free is always the recommendation for long term storage. Horizontal stacks of paper will apply a lot of pressure to the sheets near the bottom of the stack, vertical hanging files reduce this pressure.

    3. Watch out for printed sheets facing other printed sheets, (like double sided printing) where the toner from bottom side of the upper sheet can stick to the toner on the top side of the lower sheet. A horizontal stack of paper, especially in a hot environment, will apply a lot of pressure that cause the toners to fuse together where they touch. I've also had problems with toner adhering to vinyl sheets commonly found in 3 ring binders or binder covers.

    --
    John
  14. Re:Text, but why? by __aaltlg1547 · · Score: 4, Funny

    Do a screen shot of it, overlay it with a picture of you and your girlfriend or boyfriend having sex and upload it to a revenge porn site, then publicly complain about it having been uploaded without your consent. That guarantees it will be available from any computer for at least 100 years.

  15. Re: Easy by hamster_nz · · Score: 4, Insightful

    He is making a valid point.

    Soon IDE will be a thing of the past, and maybe SATA will be replaced with something better... maybe native USB3.0 to the disk?

    When I cleaned out my garage I found some old floppies (5.25" and 3.5", including compilers and OSs that cost me quite a lot), some DAT tapes, some Jaz disks, some zip disks, some audio cassettes and some MFM disks too.

    All of which are pretty much junk.

    An interesting aside - when I looked up the specs for a 20Meg MFM disk I found I was surprised that a then current PC was able to read the entire contents in less than half a minute. If only we could do that with today's 3TB drives...

  16. Re: Easy by hamster_nz · · Score: 4, Informative

    Spoken like somebody who was born in the plug and play PC age...

    The MFM controller actually controlled the disk positioning, and so you need to know the physicals of the device to access it, hopefully avoiding trashing the disk.