Ask Slashdot: Best Way To Store Data In Hard Copy?
First time accepted submitter bmearns writes "I have some simple plain-text files (e.g., account information) that I want to print on paper and store in my firebox as a backup to my backup. What's the best way to encode the data for print so that it can later be restored to digital form? I've considered just printing it as text and using OCR to recover it. The upsides are that it's easy and I can even access the information without a computer if necessary. Downsides are data density, no encryption, no error correction, and how well does OCR work, anyway? Another option is printing 2D barcodes. Upsides are density, error correction, I could encrypt the data before printing. Downsides are that I'll need to split it up into multiple barcodes due to maximum capacity of popular barcode formats, and I can't access the data without a computer. Did I miss any options? What do slashdotters suggest?"
The Egyptians used hand written papyrus and we still have copies to look at. The laser printed paper copies of the Book of the Dead simply didn't survive.
Google for OCR-A and OCR-B as TTF. There are freely available versions. I use them for mailing labels, along with PostNet bar codes to make it as easy as possible for the Post Office.
Learning HOW to think is more important than learning WHAT to think.
And as long as a decent font for OCR is used - like OCR-B then it should be feasible.
The reason for doing it - well, if you want to preserve something for a few decades then it's printing on lint paper and using ink that can survive a long time. The latter is probably the hardest since nobody really knows which kind of ink used in computer printers that's able to survive for centuries.
My suspicion is that the dot matrix printers are better off than lasers and inkjets.
If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
Why, the answer is simple, there is no standard for Digital backup. zero zip. There are only two methods of time test backup.
1) Text printed on no acid paper.
2) Microphish. or film.
I suggest you print it with ocr readable characters with a pigment based ink. If you are that serious about backup, take it to a printer and have them printed with good ink on the best paper you can find. store the copies in two separate locations.
Remember every one, there is NO standard on digital backup medium.
Text printed correctly on zero acid paper or film is the only time test way.
IMHO
How many accounts can anyone have that they actually need to have bar codes or some other such nonsense to be able to regain entry to them? Print out you account information, user names, passwords, etc., and put the printout in your fire-resistant safe. If your house burns down, or some other calamity happens, and you need to regain access to all of your accounts, then you'll just re-enter tha passwords for each one. This can't possibly be more complicated than setting up some OCR / Barcode / Rube Goldberg solution.
09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
Engraved to stone. Guaranteed for centuries.
I think using a proprietary standard for this has potential for disaster in the long term. QR codes would be much better. Scenario: the author of "Paperbak" discovers a huge improvement in his algorithm and deprecates the old version. 20 years into the future somebody needs to decode their stuff, and they search for the source code to "Paperbak" and realize that the only version they can find on the future internet is the "new/improved" version that can't read their stuff. So they are just the lucky owners of some paper decorated with a very specific arrangement of dots.
With QR Codes, on the other hand, it is difficult to believe that the knowledge of their format will be lost in our lifetimes. They have their own Wikipedia entry describing their structure, for example.
No need to worry about ink: even the cheapest and nastiest laser printers use toner, and a mixture of thermoplastic and carbon black thermally fused to your paper isn't going anywhere(in fact, if you use lousy enough paper, some lucky future archeology intern may have the... unmixed pleasure... of picking the little plastic character glyphs out of the pile of dust, trying to keep them in their original order!).
His data-restore needs probably don't extend to truly epic lengths in any case, so it shouldn't be a big deal.
http://www.pgpi.org/pgpi/project/scanning/
Here's a link for Paperbak: http://ollydbg.de/Paperbak/
My God, it's Full of Source!
OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
This is a backup to your backup, so digital means must have failed before you'd consider using it. Text is low density, but it has an advantage that any encrypted barcode or other high tech means do not have -- it can be read by human eyes. When you're huddled in a rough lean-to roasting a feral cat over the campfire amid the wreckage of civilization, you will still be able to read your backup. That might come in handy.
Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
Dude, this happens to stuff that is very valuable to you? You either have an awesomely interesting life or are a weird mix of paranoid and careless.
Sadly, many of my old dot matrix and teletype printouts have faded as much as any other liquid ink I've used. It depends entirely on the ink in the ribbon. The liquid ink present in ordinary ribbons was often of wildly varying quality, and most people who bought those ribbons in bulk sought out the cheapest possible ribbons. I wouldn't bet on their longevity.
Laser printed pages consist of carbon in plastic, and there's no reason they shouldn't last a century or more, as long as certain conditions are met: if the toner is properly fused to the paper, if the paper doesn't degrade beneath it, and if the facing page doesn't adhere to the toner.
1. Your printer should have the right temperature set in the fuser, and that's probably not even adjustable to you. If the toner comes out dusty or smeary, it's too cool. If it comes out brown and crispy, it's too hot :-) You should recognize it immediately if the print quality is poor.
2. Store the paper properly. Heat is your enemy: don't let it get too hot, and don't store anything you want preserved in sunlight. Don't let it get damp - mold will destroy paper. Don't use crappy paper that will disintegrate - acid free is always the recommendation for long term storage. Horizontal stacks of paper will apply a lot of pressure to the sheets near the bottom of the stack, vertical hanging files reduce this pressure.
3. Watch out for printed sheets facing other printed sheets, (like double sided printing) where the toner from bottom side of the upper sheet can stick to the toner on the top side of the lower sheet. A horizontal stack of paper, especially in a hot environment, will apply a lot of pressure that cause the toners to fuse together where they touch. I've also had problems with toner adhering to vinyl sheets commonly found in 3 ring binders or binder covers.
John
Do a screen shot of it, overlay it with a picture of you and your girlfriend or boyfriend having sex and upload it to a revenge porn site, then publicly complain about it having been uploaded without your consent. That guarantees it will be available from any computer for at least 100 years.
He is making a valid point.
Soon IDE will be a thing of the past, and maybe SATA will be replaced with something better... maybe native USB3.0 to the disk?
When I cleaned out my garage I found some old floppies (5.25" and 3.5", including compilers and OSs that cost me quite a lot), some DAT tapes, some Jaz disks, some zip disks, some audio cassettes and some MFM disks too.
All of which are pretty much junk.
An interesting aside - when I looked up the specs for a 20Meg MFM disk I found I was surprised that a then current PC was able to read the entire contents in less than half a minute. If only we could do that with today's 3TB drives...
Spoken like somebody who was born in the plug and play PC age...
The MFM controller actually controlled the disk positioning, and so you need to know the physicals of the device to access it, hopefully avoiding trashing the disk.