National Archive File Format Time Bomb
geordie_loz writes "The BBC is reporting that the UK National Archive is warning of old formats being a 'ticking time-bomb' where data is going to be lost because of incompatibility in newer versions of software, and software not existing at all. More surprisingly, Microsoft has offered a solution via the OOXML format."
Just make a torrent.
ITSATRAP!
Red to red, black to black. Switch it on, but stand well back.
There are so many idiots in this state of affairs:
1. the idiots who decided to build a huge archive in an undocumented proprietary format
2. idiots who believe they can't find even a single copy of the software they need
3. idiots who didn't store a single copy of the software that reads the format together with the archive (hardly far from obvious, is it?)
4. idiots who want to convince other idiots that OOXML is an open format (rather than a straight XML serialization of whatever the binary DOC format looked like in the MS source code base at the time)
I can't believe the National Archives partnered with the company that caused this mess in the first place, i.e. Microsoft.
Second, why on earth do they think virtualisation is a long-term solution? Sure, you can emulate Windows 95 within Windows XP today, but what happens in another ten years? Another layer gets wrapped around XP? So in 100 years, you're relying on a stack of emulators to access the old software. You better hope Moore's law holds up, because you're going to need it. Also, who will know how Word 95 worked in 10 years, let alone 100?
IMO translation of the old documents would be a better solution. Translate the documents into a well-documented, open format, and throw away all of the old formatting idiosyncrasies while you're at it. That way, you only have to maintain one way to access the documents with the software-du-jour, instead of having to prop up the entire teetering stack of virtualisation layers.
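For the simplest case, plain text saved in an old DOS code page, the translation step might look something like the sketch below. The function name `migrate_text` and the default of `cp437` are my own assumptions for illustration; a real archive would have to identify the encoding of each file first, and binary formats like DOC need a proper converter rather than a re-encode.

```python
from pathlib import Path


def migrate_text(src: Path, dst: Path, legacy_encoding: str = "cp437") -> int:
    """Re-encode one legacy text file as UTF-8 and return the character count.

    legacy_encoding is an assumption: the caller must know (or detect)
    which code page the old file was written in.
    """
    text = src.read_bytes().decode(legacy_encoding)
    # Normalise DOS line endings and drop the trailing Ctrl-Z that some
    # DOS tools appended as an end-of-file marker.
    text = text.replace("\r\n", "\n").rstrip("\x1a")
    with open(dst, "w", encoding="utf-8", newline="\n") as out:
        out.write(text)
    return len(text)
```

Run it once per file and keep the original bytes alongside the converted copy, so a bad guess about the encoding can be redone later.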
What's surprising about that? Someone in MS Spin Control and Public Relations is worth his salary. The story could have exploded into an "avoid MS products if you want your data accessible some years down the road" fiasco (we all know that MS is the worst offender when it comes to changing the document formats, usually undocumented). Instead, it was turned into another push for their next format.
Brilliant.
"What, the shit I sold you yesterday stinks? Try this new shit, it's great and it has none of the problems of the old one."
That's what you hire PR people for.
Assorted stuff I do sometimes: Lemuria.org
Rather than bitching about Microsoft making an offer of 'help' which is just thinly disguised marketing (I mean, come on, par for the course, no?), could we get a discussion about real solutions? I know MS bashing is fun, but come on, we do it on just about every other thread... let's have a day off.
To kick things off here's one:
Keep EVERYTHING in the simplest possible format. ASCII would seem sensible, since it's the content we care about, not the formatting (although that wouldn't help our Asiatic brethren much). Then keep decent records of HOW you can read that format, with examples of the software and hardware. Do this bit on PAPER. V. tough paper (or rock, or plastic, or whatever). Update the explanations every other year, to put them in language the next gen will understand. Maybe also have instructions on how to translate the simple format to less simple things.
I guess, basically, it's a case of KISS and then *provide a persistent and regularly updated 'Rosetta Stone'* for latecomers to work from.
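A digital stand-in for the 'how to read this' record could be a small sidecar file stored next to each document. This is only a rough sketch: the function name `write_sidecar`, the `.README.json` suffix and the field names are all made up for illustration, and it complements the paper copy rather than replacing it.

```python
import hashlib
import json
from pathlib import Path


def write_sidecar(doc: Path, encoding: str, how_to_read: str) -> Path:
    """Write <name>.README.json next to doc, describing how to read it.

    The field names below are hypothetical, not any archive standard.
    """
    data = doc.read_bytes()
    record = {
        "file": doc.name,
        "bytes": len(data),
        # The checksum lets a future reader confirm the file is undamaged.
        "sha256": hashlib.sha256(data).hexdigest(),
        "encoding": encoding,
        "how_to_read": how_to_read,
    }
    sidecar = doc.with_name(doc.name + ".README.json")
    # json.dumps escapes non-ASCII by default, so the sidecar is pure ASCII.
    sidecar.write_text(json.dumps(record, indent=2, sort_keys=True), encoding="ascii")
    return sidecar
```

Regenerating the `how_to_read` text every couple of years is the 'regularly updated' part; the checksum and byte count never need to change.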
As a side branch, this kind of reminds me of discussions I read a while back about how to warn future generations about nuclear waste dumps (y'know, the really nasty stuff with half-lives in the thousands of years range). I don't think anyone ever came up with a decent answer....
'Speak softly and carry a beagle'
As for the next century, most of this material will lose value, but the important stuff will get backed up professionally and successively remastered on new media (esp with things like the UK National Archive). And amateur historians, genealogy buffs and private collectors will have their hands full in the future with stuff that you can't find in the official archives but in people's attics, just like people are fascinated with Stone Age, Roman or Victorian artifacts today.
Go somewhere random
Hmm. Completely failed to tick the 'post anonymously' box :) Good job I held back from expressing opinions in there.
I have run that same version of Visicalc, in DOSBox, on a PowerPC Mac. Actually, I've run a few programs in that environment that don't run on Windows without the aid of DOSBox. To me, this says that third parties are better than Microsoft themselves for backwards compatibility with Microsoft programs. I wonder how long it will be before WINE has better support for old Windows apps. I think this is already the case for a few win16 programs...
I am TheRaven on Soylent News