Lockheed Chosen For Electronic Records Archives
TrentL writes "How will we be able to read 1990's email messages in the year 2090? Will GIF files still be accessible in 2105? The US National Archives - tasked with preserving records "for the life of the republic" - has chosen Lockheed Martin to solve exactly this problem. Lockheed was awarded the $308M Electronic Records Archives contract after a year-long design competition. Full Disclosure: I worked on Lockheed's demo team."
Analog media couldn't be restored because the machines that read it broke (couldn't they make new ones?) but as long as the specs exist, I don't see why they won't be able to read the digital data (assuming we still use two bits in the future).
Send email from the afterlife! Write your e-will at Dead Man's Switch.
So it just stores the data? Or is it also required to be able to view all the types of media that will be put into it?
We're just lucky that Walt didn't dream up LZW compression while he was working on Steamboat Mickey, or we'd have patents lasting for the author's life plus 90 years!
-paul
Pistol caliber is like religion: everyone has their favourite, and theirs is the only right choice.
This has a fundamental chicken and egg problem: So you store the information, you also need to store the format of that information. So then how do you read "format of the information" document? What format is *that* in?
... Do you carve it into stone?
:-(
You see; whatever format you used for anything has to be documented and you can't use paper because it won't last as long
Worse still you need some computer science grads to write up exactly the format down to how long a char is and the bit/byte order. It is a extremely difficult task even if you don't take into consideration finding a storage medium that will last that long.
Comment removed based on user account deletion
It's not just the government that needs this. Since we're funding this effort with our taxpayer dollars, I'm hopeful that some of the results from this work will lead to the availability of tools us normal folks can use to make sure our precious data can be preserved and passed down from one generation to the next.
Not sure where I read it, but there was an article I read about using good old cheap IDE Raid as a tape replacement. Some guy did it on a large scale for university, and a (relativly low cost). Considering the low cost per GB, and easy scalability, why not?
and so poorly documented. I can see why we'd spend hundreds of millions making sure we preserve the formats.
Personally, I'd be more worried about proprietary formats.
"I'd rather be a lightning rod than a seismometer." -Ken Kesey
Hmmm... Offsite storage for the United States... Satellite Launch?
Seriously though, what medium would work the best for this? At this point I think that hard drives cost just as much as Ultrium tapes, for just as much storage. Seeing as tapes die so quickly, you may as well back them up onto true hard drives, then just let them sit for a few years. After ten to twenty years, carry it forward to the next big storage medium.
Victory is gained, not in knowing your opponents next move, but in preempting them.
Alright, but why are you posting after the fact on slashdot? Maybe your efforts would have been more effective if you had started this campaign a year ago to the people who make these decisions.
Are you against the National Archives? This program enables the National Archives, into which we've already sunk billions over the centuries, to continue to be (even more) useful in the Information Age. That's our information. Why should we throw it away now?
I'm curious, did you have any criticism for the $300M "bridge to nowhere" in Alaska when it was reported in the new budget this year? And where are you on the $200B+ we're spending in Iraq?
--
make install -not war
Doesn't matter on slashdot, as any anti-social program post gets modded flamebait lately.
/. is for -- honest and open debate from all 3 sides of an issue.
Nonetheless, I believe the "Why?" question should be asked before AND after the fact, continuously. I don't believe we should just roll over when the majority says its ok to tax-and-spend. I've met many people through slashdot who have come to agree with the non-authoritarian positions I've stated in the past, and I love the debates I usually get before I get modded down.
That's what
For a start, they should stop using stupid proprietary formats like Real Video (the Press Conference Video on their website is only available for Real Player).
I remember reading an article about the archival of scientific research; many researches involved in the discovery of DNA's structure didn't keep their (hand-written) notes, but they were later recovered by others who saw the value of such significant documents to future generations.
Nowadays, of course, we can just trash something by clicking the delete button, and one the hard drive's formatted, it's gone. This does make me wonder how much historical and scientific information will be lost to future generations simply because of this ease of deletion.
While it's nice that his memory looks attractive in photographs, it'd probably be more useful if it were photographic :)
tasked with preserving records "for the life of the republic"
Task completed......
Service guarantees Citizenship! Questions Guarantee GITMO.... Amerika Uber Alles!
In 2105: "Yessir, your documents are available in GIF format, unfortunately the 1024 by 768 resolution is almost unviewable on today's 1,048,576 by 786,432 document readers."
A feeling of having made the same mistake before: Deja Foobar
Yes, that's true, I've seen pictures of his memory. Looks quite nice :p
Send email from the afterlife! Write your e-will at Dead Man's Switch.
...all the 1990's pr0n! We need to keep that in a repository for the benefit of mankind for generations to come!
It sometimes amazes me that LMCO manages to build planes that actually fly. But then I have to remember that the people designing the planes apparently aren't the ones designing their software.
If they build aircraft the way they build software, their planes would make these look like this.
"The system's "initial operating capability" should be available during Fiscal Year 2007. Weinstein noted that "the system's architecture makes it flexible enough to accommodate evolving policy change," including the importance of "providing public access while protecting privacy and sensitive information.""..HAHAHAHAHA! Anything even *remotely* important or interesting, paid for by tax payers or not, sorry, "terrorism, security", yada yada yada.
Just look back at how much technology has changed in the past 10 years. We had 5.25" Floppy drives used back in those times, and 3.5" floppies were used as well, and CD burners were just starting to come available at the speedy rates of 1-2x, not to mention hard drives were so small compared to the 500gb drives we have today... and Windows 95 was just released, wonderful system based on FAT architecture... not NTFS like we have today...
Computer technology is increasing at such a rapid rate these days. I can only imagine how it will be in 10 years, much less 100 years from now. I am sure by then that clock speed will be in hundreds of gigahertz, memory in the terabytes, and storage in the petabyte range... if not even higher... who knows...
I also wonder, if in 2090, will their CD-ROM equivalent even exist to read this storage library? They may have long ago abandoned CD-ROMs for being too slow, and if data is stored in this format, how will it be read? Also, as hard drives get larger and larger, am sure the IDE, SCSI, and SATA drives of today will not be readable by the BIOS of tomorrow... much less have connectors to fit...
This is a huge undertaking... good luck Lockheed Martin...
Need a Nerd?
Nerd Systems
Papyrus. It's simple and cheap. Hell, etch it into a clay tablet... or glass or something.
Photogenic memory? What an intriguing concept. Something like this, perhaps?
This has a fundamental chicken and egg problem: So you store the information, you also need to store the format of that information. So then how do you read "format of the information" document? What format is *that* in?
Latin, videlicet.
But seriously the problem in records is not going to be collecting the data, but turning it into knowledge. Meaning that humans in the future are likely to seriously misinterpret or be unaware of the intended meanings and social and political contexts of the preserved data.
This is not a technology problem.
They ought to make sure that real professional historians are there.
This is not nearly as difficult as you make it seem: implement the parser in a standardized language. The formal specification of the standardized language can then be included with the source of the parser.
Getting code to run on later architectures is not usually very difficult. I am fairly comfortable with the proposition of porting any code to any future architecture -- the "emulator scene" testifies to the viability of this strategy. The biggest problem to be solved is reading storage media for which no hardware exists.
For example, how do I get to my college research stored on AmigaDos floppies? Tragically, the easiest solution is to try to get my Amiga running again, and then move the data over a serial cable with kermit. I'm awfully glad I have kermit on that computer, because I don't think I'd be able to find any 2400 baud Amiga BBSes around to download it.
Liberty you never use is liberty you lose.
goatse and tubgirl shall be archived, in all their digital glory for the ages to see.
Would love to meet this Mr. Martin...
maybe you meant photographic memory...
I modded you up :)
Did Google compete for this contract? They're the ones with the largest infrastructure for such a project and the brains to give us a really slick interface to it all. Not to mention that they could probably have faster response times than archive.org which totally fuckin' blows.
What is your penile percentile?
there should have at least been some kind of public contest where other companys can prove themselves worthy of this contract. -acidjazz http://www.litebay.org/ - the most popular movies,music,games,software,etc.
The Bridges name is the Don Young Bridge, after the Alaska Congressman that had the forsight to find a prime piece of real estate where he can build a Manhattan to the west.
"Nimis exaltatus rex sedet in vertice - caveat ruinam!"
While looking through the documentation http://www.archives.gov/era/about/documentation.h
I found a link to the project requirements : http://www.archives.gov/era/about/requirements.cs
Which contains the following line
I know, one typo in one line in several hundred, but why that line ?
Technically I don't see any problem with storing 100PB of data in the next decade, and keep it safe from natural disasters. But how about unnatural disasters, such as an evil administration changing the entire archive to reflect better on itself or protect itself from criminal prosecution? Copies of the archive packages need to be suitable dispersed in multiple jurisdictions or even shot into space in order to make this kind of data destruction infeasible.
Provided you are not lying, you just reversed your moderation.
Now let's get the Federal, State, and local governments to create standards-based web sites as policy. (Remember FEMA and the US Copyright Office.)
http://narnia.dnsalias.org/freegovernment/
Next, let's go for standards for electronic office documents (word processor, spreadsheet). I doubt Lockheed Martin's system here is going to understand every obscure format---say Microsoft Word 5?
"How will we be able to read 1990's email messages in the year 2090?
Mebbe if we kept email as plain text we wouldnt have to ask this question. Im fond of mbox myself. However any sufficiently documented format will only leave us with storage media issues.
Photographic memory? You mean one that fades over time and undergoes color shifts?
There is no God, and Dirac is his prophet.
The National Archives are important, except so much that SHOULD be in the Archives is not, for whatever (illegal) reason:
http://www.lewrockwell.com/orig/stinnett1.html
http://www.lewrockwell.com/pilger/pilger17.html
http://www.lewrockwell.com/rogers/rogers40.html
And some of those items took decades to make it. If they're going to keep certain government information in the Archives, make it available immediately.
I'm against the $300M bridge to nowhere -- I believe in privatized roads funded by local businesses, industry, and homeowners divisions.
I'm against ALL unconstitutional wars (every one since WWII has been unconstitutional).
"The Electronic Records Archives. By the same man who gave us the Stealth(TM) aircraft".
Hhmm...
The US Government and its agencies will have a perfect tool of mine for your interests, associates, lovers, sexual orientation etc. Make a join with Google's database, and they got 50% of your life covered. Enough to blackmail or nail you over and over.
1) Give me the $308M.
2) I'll bank the cash and cream off some interest to keep me in luxury - let's say I can get 5% interest PA, that's $15.4M a year so I'll take, say, $5M/year for myself and my efforts.
3) I'll Spend up to $10M/year maintaining a secure storage facility and purchasing 3 units of every storage device that comes to market together with a range of media, host systems and documentation on acid-free archive paper.
4) There will be an annual charge for subscribers to the facility.
5) Profit!!
AT&ROFLMAO
What a waste of money. Massechussets is doing it for free. All you need to do is make sure that all your document file format standards are free and open. After that, it doesn't matter if they exist in 2090 or not. If they don't exist, it's because no one has really needed the documents. If some historian comes along and does need the document, then a document import filter can be easily created because the document's file format was based on a free and open standard.
Suddenly the volume of documents you need to maintain drops from all of them, to just the documents that describe the file format standards.
This example of format obsolescence just popped into my head. Back when Commodore-Amiga was a going concern, the IFF-ILBM graphics format was pretty widely used. It was a nearly universal standard on Amiga.
A fair number of artists and video producers used Amigas. One of Amiga's advantages was that practically all the graphics programs used ILBM format, which meant you could easily feed the output from one application into another, and then into another. It was good, and it wasn't all that many years ago.
Just trying finding a program on Mac OS X or Windows today that can read IFF-ILBM files! Go on, try it! Photoshop, for one, doesn't have a clue about them. The best you can hope is to find some obscure freeware IFF-to-PNG converter that someone has hacked together.
Another example: It's getting harder to find apps that play "tracker module" music, and the programs that are available tend to be awkward and unreliable. Everything went to MP3, and mod music was quickly forgotten.
So if the idea of today's commonplace formats becoming unknown in the future sounds far-fetched at all. . . It's not.
"Would love to meet this Mr. Martin..."
He's good buddies with Jethro Tull.
YES! Finally a job after all those years studying Akkadian! Clay tablets are some of the most durable media I know. At least they have a proven record. Vast numbers of documents illustrating the fascinating world of accounting, esp. Sumerian sheep and goat transactions, is available thanks to the scribal choice of clay (combined with hot arid conditions). Will soon Lockheed HR soon be seeking 8-10 years of prior "Cuneiform/Pictographic" scribal experience? I can also read omens in the entrails of an ox. That can come in handy.
Yes, resize it.
Send email from the afterlife! Write your e-will at Dead Man's Switch.
I have been saying for years that the DoD should make an initiative to move towards open standards for this exact reason. The document retention requirements they have are incredible, and yet nearly all the documents generated are saved in proprietary formats. Now with the OASIS (OpenDoc) format solidifying and there is more than one implementation of it, they wouldn't even have to define a standard for word processing or spreadsheets.
Obviously, open standards are not a panacea. There are countless standards created by the military that never really spread farther than that, and therefore the support for them is limited (and thus companies that do support it can charge a pretty penny for it). And with open standards, at it is much easier to write an implementation if you need to. Compare this to MS Word, which is a pain to reverse engineer now, just imagine having to do so in the distant future, when it is not as widespread. And of course, for the very long term, nothing is more certain (and more inconvenient) than printing everything out and storing it in a warehouse, which is what is done now. But the longer that can be postponed, the more money can be saved.
As an added bonus, just imagine the competition that would spring up in the word processor market, if the DoD mandated that all new word processor documents generated internally or by contractors be in OASIS format, starting 5-10 years from now. Microsoft would have to support it (and well) or throw away a huge number of Office sales. The DoD would no longer be locked into a single vendor, saving them money upfront in addition to the money they saved on document retention.
Until then, the best plan is likely to convert as much as possible to a few standards like PDF, which is what I expect will happen here.
I'm trying to find out where in our Constitution does the Federal Government find an enumerated power to pay for this.
Wow, you can access the Constitution? I mean it was written in 1776. That's a long time ago. Good thing somebody thought to save it!
We're saving lots of data, because 1) lots of it is important and 2) we have very little perspective on it yet. In 200 years we might very well have a very different idea of what was important today.
So close and yet so far from the world's perfect ID number
When it comes to constitutionality, it doesn't matter IF we need it, it doesn't matter WHY we need it, and it doesn't matter if the MAJORITY thinks we need it. The federal government has certain enumerated powers, and the rest is left to the States and the people.
Why can't private companies do it? $300M in every pork barrel program adds up quickly, and government just inflates more currency into creation whenever things get tight.
Our economy is heading to the gutter fast once the housing bubble bursts, and all these projects are part of the problem.
Of course they need a national digital archive since the death of the general purpose computer has been already decided non DRM'd content at some time in the future it will no longer be accessable. We must protect the corporate entities by making sure that only government has access to non DRM'd content.
"It's so convenient to have a system where everyone is a criminal" - A. Hitler
the works of william shakespeare didn't start out their star studded lives as computer code. They went through a lengthy evolution through at least several iterations of printing presses before they made their way to a digital medium.
Data evolves. If theres one thing that google searches have shown it is that the demand of information is always changing, people arent going to want the same chicken noodle soup that they had fifty years ago because we make better chicken noodle soup now.
If the census 2000 information really deserves saving it will be copied hundreds of times into different isolated servers by individuals or companies who use the information. it will be ported to a new format, it will evolve.
-----
Save the data in XML and save the format in XML Schema and/or Relax NG. Please send your check for $308M to synesis...
Seriously, for $308M what hairbrained scheme is Lockheed going to dream up to justify the price tag.
Great. Now we can be sure that the only way to read any of these documents will be with IE 4.0 since they will prohibit even the sound of the word open source (sorry PDF. See clause 11) and given that they are a defense contractor you can bet they will lock in to some proprietary SW version that is 4 versions older than what is current.
--- Liberty in our Lifetime
100k is an estimate of the national archive's Network Administrator's sallery.
Seriously. I'd do this for free. Many people probably already have.
step 1. Massive raid 5 array with a mirror array off site and pleanty of hot spares.
step 2. Fix date stamping on whatever image format, sound format, text format you decide on.
step 3. Don't use lossy compression.
No offense to the poster but it's not rocket science to create a compression free data format with proper date formatting built in. This sounds like yet another way that the alledgedly conservative Bush administration is simply finding ways to line its friend's pockets with our tax dollars.
I'm sure that Lockheed came up with a much more complicated solution to justify their large check but I fail to see how this is a 300 M dollar problem.
@LogicalMethods | www.sneaksneak.org
>Our economy is heading to the gutter fast once the housing bubble bursts, and all these projects are part of the problem.
Wow, maybe I'll be able to afford a house in Denver after that happens....
Centuries? NARA has only been around for a few decades, and has only been its own agency since 1984.
Believe it or not alot of companies that have tasked with data retention of 100 years+ have adopted microfiche as their standard for data.You don't need to store any programs or anything just dump all the data out onto microfiche and you have a format that will in theory indefinitly
blah blah blah the federal government is supposed to provide for the general welfare of the nation, and i'd say important information about this nation is part of it's welfare.
You're not posting "any anti-social program post". While you may not, in fact, be a Randroid, your post is indistinguishable from one written by a Randroid.
Happy to help!
--grendel drago
Laws do not persuade just because they threaten. --Seneca
Offer void where prohibited, tax and shipping not included.
"Speaking the Truth in times of universal deceit is a revolutionary act." -- George Orwell
I worked with them for a while, as a data entry person back in the early 90's. Basically, we were responsible for keying in a parcel's 5-9 digit Zip code after it had been scanned into the system. By scanned, I mean the front of the package or envelope showing the send-to and return addresses was presented on a monochrome display, which allowed the person operating the terminal to enter the zip codes for the parcels. Then you'd hit a key and move to the next one, and so on and so on.
The bizarre thing is that I found out a few of the invididuals would "pad" their PPM (Parcels Per Minute) by typing in zipcodes they were familiar with instead of reading what was on the display, just to enter a dozen or so really quickly. It didn't happen often, but it helped them keep up the pace and "clear" the system queue more quickly, thus gaining them and their workmates an early break. However, I've no idea what damage may have occurred by their lax attitudes, and I really don't want to know now.
Which brings me to my point (I think): how can we be certain the data they're entering is one-hundred percent accurate, regardless if the medium lasts a century?
The Chronic *WHAT* les of Narnia!
They must not have a digital archives division... ...yet.
the results may not be pretty by the standards of 2105 but they will certainly be readable.
just as old archive film looks pretty shitty by modern TV standards but you can still see whats going on just fine.
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
Or are they just subcontracting it out, as usual.
word.
I have code on a modern HDD that I typed into a BBC computer 15+ years ago fro ma magazine.
I took it off of tape, via the BBC and a serial lead, I have all my chickens and all my eggs. So long as you move to a newer form of media before the old one perishes then your going to be OK.
I think it's a Chinese whisper problem not a chicken and egg problem, what happens when inaccuracies are introduces
e.g. Someone writes a file in a odd charset, nobody notices that the charset is different from ASCI when they convert the file into unicode. In 20 years time will someone notice that the file has been converted badly or will they think it's corruption? What happens when there are lots of tiny conversion errors like this.
thank God the internet isn't a human right.
Regarding this $300,000,000 "bridge to nowhere," it is obvious that you (and most of the world) only know half of the story.
The piece that you are missing is that it is significantly cheaper to fix the damn thing than it is to tear it down.
Just think how many other things folks have a knee-jerk reaction to that they know nothing about.
I'm pretty sure LockheedMartin IS a private company.
Oh, did you mean you want them to front the cost, too? And then who pays them for their services of preserving the National Archives. I expect that would be the government.
Complex data backup solutions and the use of lossless formats has not, for example, kept the critical Pioneer space probe data available, after less than 30 years (http://www.planetary.org/news/2005/pioneer_anomal y_faq.html)
How in the fucks sake do you expect this to last 100+ years? Don't use lossy compression? How is that a solution?
Take Windows Bitmap image format. It's not lossy. That doesn't mean that we won't forget how to display the damn thing...
Raid 5? What problem do you think you're solving? Keeping the data around, or making the data accesible for (as the OP makes clear is the LoC's responsiblity) as long as the United States exists?
Great! Anything we can do to speed the collapse seems like a good thing to me, because we sure as hell ain't going to fix the political climate.
Of course, I'm a known nihilist, so take my opinion with a grain of salt. Or don't. What do I care?
Will they store my favorite porn stuff?
and it WILL NOT last too long (now that morons @ fox are running it).
This type of project requires a continuous life-cycle of evolving, or else it is as pointless as archiving film-slides like the FBI was doing years ago (and probably still uses when they have too).
the only permanence in existence, is the impermanence of existence.
Rule #1 - with just a high-powered optical microscope, you should be able to see the data, preferably in a human-readable form.
Rule #0 - you can still do so 100 or better yet 10,000 years from now.
Back in the days of punch-cards, you COULD see the bits.
I've heard stories, perhaps apocryphal, of taking 60s-era computer tape, exposing it to certain chemicals, and being able to visually see the bits.
Why is this important? One day our civilization will die out and after a time another will rise in its place. We owe it to their anthropologists to make it easy on them.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
We agree on all of that you just mentioned, except the roads which seem to me a perfect "natural monopoly": government territory.
But since you see the value in the National Archives, why don't you see the value in digitizing it for archive and access by the people who own it?
Personally, I expect that Lockheed will bungle the job. They're not experts in the archive business, no matter how much of their own they do, or how much they "demonstrated" to the government. They're in the "blow things up" business.
Really, they're in the "get government contracts" (corporate welfare) business, which is how I expect they were chosen for this project. And why I don't expect them to pull it off, putting our own archives at risk the same way Bush put the Iraqi "archives" (7000 years of it) at such risk that it's now destroyed.
But that's why Lockheed is suspicious. The project itself has merit, and $308M seems worth achieving its goals, especially if it's really $30M for a decade (or even the $60M a decade it will turn out to be).
--
make install -not war
In Germany, we already do that: Zentrale Bergungsort Bundesrepublik Deutschland (German, but pictures). Here is a short description in english. All the documents are kept on microfilms, but I don't know what they do with audio/video material.
I'm not going to go into Lockheed's plan in detail, but you may be happy to know that my teammates were well aware of the OASIS standards and their value.
A lot of people in these comments keep saying, "I can solve this problem for $10k! Convert everyone to Open Office!" That's all well and good, but people, we are already DECADES behind on this problem. Whether you like it or not, there's a boatload of Word95/Excel/BMP/etc files out there (and worse).
Hmmm - I'd better email myself the GIF spec - maybe along with some source code to read it with - and a C compiler to compile the source with. Ah WTF - I'll just email myself the Linux sources. ...but seriously...there won't be any problem with reading GIF if anyone actually wants to - the file format is documented all over the place and in 100 years, if there are still GIF files on some kind of readable media - then the odds are very good that those documents will be easy to find. Programming a GIF reader (or a reader for almost any documented file format) is easy - presuming you are sufficiently motivated. A historian who is interested in 100 year old documents shouldn't have any problems getting them converted to whatever format is needed.
The HUGE concern is the undocumented, encrypted or (worse) DRM'ed files. Reading those in 100 years may be exceedingly difficult.
We can read documents written in heiroglyphs around 2000 years ago. The only problem is with languages for which no translations *ever* existed.
Survival and longevity of antique media are a much bigger problem.
www.sjbaker.org
I'm trying to find out where in our Constitution does the Federal Government find an enumerated power to pay for this
"To make all Laws which shall be necessary and proper for carrying into Execution the foregoing Powers and all other Powers vested by this Constitution in the Government of the United States, or in any Department or Officer thereof."
You need to keep official records of things like: court rulings, legislation, federal expenditures, etc.
D6 63 0D 70 89 81 BB 8E 7B 7C 5F 5D 54 EA AB 73
They ought to make sure that real professional historians are there.
Your point is very valid, as far as correctly interpreting or getting meaning out of the data. However, this project is for archiving the raw data.
Someone now should fund the project for historians to create historical knowldege using this project's raw data output as it's input.
Roads are indeed one of the hardest points for me to debate with even myself in terms of "who provides?"
As a multi-business owner, I can tell you that I'd rather be paying for the roads that come to my store from the highway, and I'd rather co-op it with other businesses by finding the BEST builder and maintainer of the roads. Today, we all pay hidden gas and other taxes for use of the roads, but the costs are crazy (I should know, I've worked for a highway contractor).
If we could should the average taxpayer what we pay in FEDERAL taxes per mile of driving, they'd be outraged.
I'll grant you the roads, this time, as a government responsibility, but how about we leave it up to the towns or maybe the precincts, rather than the State or Federal governments?
Coming soon to a bookstore near you! Reserve your copy online at lockwoodtoilet.com.
I went browsing through some of your older comments and I came across this one which contains this broken link! Get it? archives.gov -> broken link? Ha ha ... heh.
Test 1 2 3 4
HP and MIT have been working on this same issue with the DSpace Project.
$308M would sure go far if doned to this open source federation!
The antidote for misuse of freedom of speech is more freedom of speech.
-- Molly Ivins
I have no idea what you're talking about. I pretty sure that the original poster was talking about the Alaskan bridge to Gravina Island. It is a $300 million bridge connecting a town of 8,000 to an island of 50. The island currently has no bridge, it has no paved roads for that matter. It is served by a ferry. Residents are quoting as saying they like their ferry, and didn't ask for a bridge. This is the project that the nation got up in arms about. What project are you talking about? Or are you just making things up? And as for how I know? Well I spent the five minutes to look it up. Looked to see what the project was called in the transportation bill and then found the actual information. Research, there's no substitute.
Ryan Stultz
* Nitpick: U.S. Constitution was written in 1787; ratified in '89.
1776 was when the Declaration of Independance was written. Don't forget about the Articles of Confederation between the two.
The articles were light (to the point of vacuum) on details about the approach proposed by the company.
From the article: "The system's architecture makes it flexible enough to accommodate evolving policy change," including the importance of "providing public access while protecting privacy and sensitive information." From the sound of that I'm betting its some wonky and ridiculous XML format infected with a sadly pathetic little DRM imp.
The fact is that I can read anything if I have a copy of the software that originally viewed/created it and the machine (or an emulation of the machine) on which the software ran. Adding one more format to the mix just means we have to emulate one more machine and keep track of one more piece of software and all the doubtlessly expensive effort which will be spent in conversion is wasted.
It's great to see the National Archives working on this but I would rather see the tax money farmed out in challenge grants to organizations like the
Long Now that have a chance in Hell of delivering something useful than pouring money into yet another defense company to ensure that whatever technology we use to store records can be properly sanitized and locked away according to the whims of government and "changing policy."
The biggest issue facing us right now is that most of the music, words and images created by our civilization are illegal to preserve. Ridiculous copyright extensions have ensured that the huge mass of data for which no rights owner can be found will simply rot instead of being digitized and stored.
A software emulator can ensure that historic file formats are readable in the future, but Big Media would rather squeeze our history to death before it letting go of the rights.
This is like 1000 fires at the Library in Alexandria. Future generations will curse us for every scrap of information we allow to rot while we squabble.
[-- Trust the Monkey --]
print the files to tiff format, then extract the text and metadata, and toss that into a searchable database, and then keep the orig file around for good measure
I wasn't the one that modded him down in the first place :) it was -1 when I got to it, and I made it 0 :) Now it is at 2...I am happy
Will we need old information in digital format?
It's not just old information. The Archives have a great deal of material from the second Bush administration already, and the rate of data accumulation is increasing steadily.
The reason to go digital is to reduce storage costs. Paper is a very low-density medium and it needs to be kept in climate-controlled warehouses if you don't want it to rot away.
-jcr
The only title of honor that a tyrant can grant is "Enemy of the State."
I have always tried to get my data off old technology and move it to new technology before unpluging the old system. I think as newer technology comes out, designers are making it easier to move data. Now days since everything connects to eachother or the internet it is a piece of cake to accomplish this. Every time I move to a new system with a harddrive 5 to 1o times larger, I always have room to completely back up all the old data. For my life time, I am not worried.
We might even be able to read between the lines 100 years from now and insert text like "separation of church and state" into various documents where it never existed before in order for some people to get their way about things and of course by that time no one would be around from this time to contradict the meaning.
this nation, under God, shall have a new birth of freedom. -- Lincoln, Gettysburg Address
RTFA: "The search for a solution began seven years ago"
If I remember correctly, seven years ago was before Bush's presidency began.
You mean code everything in MIX? It seems the most stable widely spread interpreted environment around. I suppose it could be done in p-code, that's pretty stable, and it used to be widely spread, so with a bit of work it could be so again.
The jvm on the other hand keeps changing every few years or so, ditto the Python interpreter. Parrot isn't ready and the C# interpreter has no track record (but since it comes from MS, I'd bet on it changing more often than the jvm + the Python interpreter combined).
I think we've pushed this "anyone can grow up to be president" thing too far.
That must be similar to my pornographic memory.
I worked on Lockheed's demo team and like to wear women's underwear.
Walt's testimony to the House Committee on Un-American Activities, 24 October, 1947
The republic isn't in the best of health lately. There are far too few people in the country that even understand what the republic was and why it was important. The education system seems purely concerned with the faults of the republic and the men that created it. The current media runs cowering away from anything good done in the nation.
How many people do you know that can say what the separation of powers between the federal government, state governments and local governments is meant to be. How many people do you know that understand why highway and education programs are little more than subversions of the constitution by the federal government. What is the commerce clause actually meant to regulate and what is it being used for.
...Lockheed's getting $308M to keep a roomful of Pentium 200s running for eternity? :-)
Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
Check out the micro-etched data disks used by the Rosetta Project. Their goal is to create a long-lasting archive of the basic elements of 1,000 different languages. The storage medium they're using involves etching readable words on to metal disks. The words are not readable by the naked eye, but all you need to read them is a decent optical microscope -- no special hardware or software.
The Rosetta Project's customized "Rosetta Disk" adds another clever innovation: naked-eye-readable words around the edge of the disk get smaller as they spiral inward, making it clear to anyone who might find this disk in the future that there is more information to be read at greater magnifications.
"Research, there's no substitute."
Dig a little further. That island has the regional airport for that section of Alaska. Ferry service in that region is out a large fraction of the time due to fog.
I still don't think the bridge is justified but you are leaving out some relevant facts
Oh great, now future generations can be guaranteed the chance to see late 20th/early 21st century porn... ;)
--- Users are like bacteria -> Each one causing a thousand tiny crises until the host finally gives up and dies.
They should just use Linux. It might not support wireless networking or motherboard RAID properly, but it certainly supports all manner of hardware from 30 years ago that most users can't even remember, and archaic file formats are positively encouraged, half of them are part of the OS.
Only on slashdot would such an ignorant post be considered "insightful."
Anonymous Cowards suck.
Rock! nothing beats good old rock....
same with Aristotle and a lot of the greeks. Just because most of the stuff made it 500 years more or less intact doesn't mean that there wasn't stuff that was forgotten.
Just because it falls out of use for some years isn't a good reason to let it become forgotten.
Why did they pick a Defense contractor for this and why was it open only to Harris and Lockheed, and not other technology companies?
Also, is the implication here that the historical archives of our country (I'm in the US) are now controlled by the department of Homeland Security?
Don't think that a small group of dedicated individuals can't change the world. It's the only thing that ever has.
Yeesh, can't believe this is the only comment pointing out that booboo. Makes me wonder if most nerds are guilty of a poor knowledge of American history. :-P
If they were so unconstitutional, perhaps the supreme court wouldn't have refused to even hear challenges against them...
Humans will be extinct in 100 years. How many of us think we can really last for 100 more years before we have another couple world wars?
If humans do exist in 100 years, I can guarentee that this economy wont.
So, the company that printed a word doc, deleted the file, and then scanned the 100-odd page doc as IMAGES and handed it to my wife to make some changes, is going to preserve our info until the end of time.
The same company that backed up F-22 radar system designs to an obsolete tape, and then threw away every device in the company that could read the tape, is saving our data.
What could possibly go wrong?
While I sympathize with your libertarian critique of government spending, I do take issue with your free market fundamentalism. Free markets can be good at efficient allocation of resources. However, research is notoriously inefficient.
The more you outsource research to companies, the more incentive you have to offer them in terms of monopolistic protections such as copyright laws and patents. Whereas if the government sponsors this kind of work, it ultimately can be used for the benefit of society - by making the innovation available to the citizenry faster than when technology is developed by private industry given the monopolisitc protections they would need to be viable.
The Internet is the quintessential example of how this works. Would we be using Slashdot right now if we used your preferred method of outsourcing this work to private companies that "do it better"? I'd wager we would not. We would either have one company dominate the market a la Mircosoft or we would have many companies with only large corporations being able to pay the cost involved in all the duplication of effort and monopolisitc licensing fees.
Also, government monopoly is one way to deal with the inefficienies of free markets and can drive adoption rates. You wouldn't want 20 sets of telephone poles in your neighborhood for 20 different telephone companies. Being able to concentrate infrastucture enables you to distribute the technology to a wider population, faster. Of course, monopolies also have their inefficiencies. They have their place - just as free markets do.
On an unrelated note, The Constitution says in Article 1, Section 8 - Powers of Congress:
If you believe, as I do, that the history of the Republic is important for its future and its current Welfare, then it doesn't seem too much of a stretch that this covers the taxes and spending of it on a project like this one.
Then, after you've written those five or ten million lines of reliable code, you will have written about 1/100th of the reliable operational code we probably have, and you can tell us how our software sucks.
The jokes sort of write themselves, don't they.
Many /.-ers would be interested in the Rosetta Project which aims to preserve many world languages using an extremely failsafe medium. defintiely a cool read -- check it out.
sure, it may not be terribly convenient, but it's certainly going to be readable 100 to 1000 years from now (by which point we should have adequate OCR to complete the task of reading the disc automatically)
-- If you try to fail and succeed, which have you done? - Uli's moose
Yeah, I suppose that's for "personal-use only"? Dude, they are so sued.
don't spend too much time on archiving the porn, i got that covered.
His memory looks good in pictures? Excellent! I'm sure it has something to do with homeland security.
We're going sign a data archiving contract to the tune of $300 million with a company that has repeatedly demonstrated it can't find its own ass with both hands? Did anyone even look at their record with DOD contracts?
Simply amazing. I wonder if they'll be archiving the job offer letter from Lockheed that the government contracting officer probably has in his back pocket right now.
On the other hand, I am a little surprised, did Haliburton forget to bid or something?
Since you were involved, could you let us in on what some of the strategies are/were? There are a lot of people pulling wild guesses out of their asses; it'd be nice to know what they actually came up with.
--grendel drago
Laws do not persuade just because they threaten. --Seneca
But, hey, that's just me.
No, more like code everything in ANSI C (which, as far as I can tell, is the most standard, portable language currently in existence). Either that, or some other language with a formal ANSI or ISO specification (maybe something garbage-collected? I don't think C# is widely supported enough yet, not to mention the issues with non-Free libraries...).
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
That's all well and good, but people, we are already DECADES behind on this problem. Whether you like it or not, there's a boatload of Word95/Excel/BMP/etc files out there (and worse).
:) Sorry if I came across as trivializing your work, I definitely didn't mean to do so. Good luck, and hopefully the job will get easier in the future and not worse.
Tell me about it. It would take more fingers than I have just to count all the proprietary formats we have for engineering drawings around here
Contrast this to the good old "Kodak Gold" CDs I was burning onto back in 1996, almost all of which are still readable with 0% errors...
American Digital
Mitsui MAM-A CDRs
Kodak probably 'rebranded' the MAM-As and sold them under the Kodak name years and years ago.
I miss the Kodak gold disks and still have a few around brand new and ready to burn.
As far as I know, the Kodak CDRs I've burnt are still 100% error free after having some of the burnt CDRs for well over 5 years. I simply stored them in a box out of direct sunlight and did what I could to keep them safe from extreme temperatures....
Some of the goals of this project parallel those of the Long Now Foundation: http://www.longnow.org/ Seems like they should have been able to bid, if they did not.
Most roads are city, county or state projects, for design, production, maintenance and financing - though the feds subsidize even state highways, and often local ones. Interstates require larger scale planning, though the Federal planning is usually influenced by local politicians and contractors, almost always for patronage and other favoritism, rather than design optimization. That corruption usually is worse when local bosses are in control, though this year's Congressional Highway Bill is one of the greatest travesties since Robert Moses remade New York in his own twisted image.
--
make install -not war
Yeah, what exactly is your response to the comment replying to yours, with the facts about the $300M bridge to nowhere? What is this "knowledge" you have, that justifies spending $300M on a bridge serving 8050 Alaskans? Isn't it really just lying, to cover up some project that you have some obscured interest in? Don Young, is that you?
--
make install -not war
The National Archives is composed of many materials we've accumulated, often at great cost in collection and curation, over the life of our Republic.
--
make install -not war
Trying to keep the family electronic archives requires active maintainance (basically copying to a new redundant array every few years). I'd love to have access to a well backed electronic safe deposit that didn't require my maintainance, on whom I could rely on to provide at least 100 years worth of archiving
-1 stupidest comment ever moderated up on slashdot
Yeah, all that really sounds like it will be free. Whatever you say, boss.
... and then they built the supercollider.
I can tell you right now that when the Mechas of the future finish off the carbon-based units infesting the Earth, the only thing left will be a little bit of porn and a little bit of plastic. Thats right folks that's gonna be our lagacy.
(and just to make sure I do my part, i am now going to go take a picture of my dick, place the photo in a ziplock and bury it out back.)
Have a nice day!
IrfanView is an excellent free Windows program that makes converting large numbers of image files a breeze and of course it can convert from IFF. Plus it's my favorite image viewer.
On se Internetz nobody noes your German.
This is not nearly as difficult as you make it seem: implement the parser in a standardized language. The formal specification of the standardized language can then be included with the source of the parser.
That's just weird. Now you have to interpret the document format that the language specification is written in, write a compiler for the language, run the compiler for the language on the documents and then interpret the output of the parser. Alternately you could document the document format that the content is in and let later consumers write parsers in whatever programming language they feel is best.
I'm not against providing implementations of parsers as documentation for the document format. They might still have a C or Python compiler around and it could be handy. But it doesn't make sense to focus on documenting the parser implementation language rather than the content format language.
That's funny. As an example of OBSCURITY, IFF is a poor pick. It's an expansion of the Macintosh resource concept that, like XML, allows for a hierarchical organization of data, where if you simply can't understand (or use) one of the internal data chunks, you can more or less safely skip over it (since each chunk has a length attached to it).
And, if you really need to understand it, I can probably spare one of my old Addison-Wesley Amiga guides that details IFF and its various implementations (it was also used for word processors, musical composition and multimedia presentations). ILBM is simply the Interleaved BitMap format.
The Microsoft WAV file format, is, in fact, a slightly perverted form of IFF.
Now if your Windows apps can't READ IFF, I'd take it up with the vendor.
Pretty funny, considering Lockheed just instituted a corporate policy that *all* employee emails be deleted after 90 days.
No joke.
XML Schema is in XML. XML is a text format where the tags indicate the form and the attributes and values provide the content.
Not to say that XML is a self-explanatory panacea - Microsoft is already hard at work trying to ensure that never happens with their patented XML document format which doubtless will be full of the indosyncrasies and obscurities for which the DOC file format is infamous.
But don't make life more difficult than it needs to be.
Individuals input is periodically checked for accuracy. As more mistakes are found, you get less and less work (as in, you get fired).
When a data entry clerk is let go, all of their previous work is re-worked.
In the future, I would want to not be isolated from my friends in the Space Station.
The answer of course if the egg. Eggs have been around millions of years longer than the chicken.
This actually brings up a very good point relating to TFA and preserving accuracy: Mikmod seems to be the standard library for tracked music, with Modplug libraries a close second. The problem is that neither of them is entirely faithful to some formats, Impulse Tracker files in particular. It gets the point across OK but there are sometimes very obvious differences between the songs as played in Impulse Tracker and one of these other players.
The only software I know which is meant for modern computers that does a 100% accurate job is XMPlay http://www.un4seen.com/ which is sadly windows-only. Though there is an XMMS plugin.
I was on the team for another major aerospace firm that considered bidding on this work. It is a huge, huge, gigantically awesome problem. The expected amount of data to be archived and indexed and protected is mind boggling -- many terabytes the first year, then it really grows!
We decided to not bid on it. Too much technical risk.
READ the US Constitution, the Bill of Rights and the other amendments! http://lcweb2.loc.gov/const/const.html
BTW.. Only the ACTUAL Supreme Court can decide what is Unconstitutional.
Unacceptable.. Hmmm Lockhead wouldn't be my first choice either, especially if the goal is to keep costs down.
Unnecessary, Come on man. We lose information on a daily basis. We've already paid for it. You want it to decay on magnetic tape? NASA Photos, Log data, NOAA, ARMY, Nuke teting, all of this should be preserved and made available. I for one don't want to have to load the Reels to access that info anymore. Might as well ask my kids to load the punch cards for census data.
OSGGFG - Open Source Gamers Guide to Free Games
Digipress had some interesting solution for long-term storage called "eon disk".
http://www.mosarca.com/CDINFO/CENTURY.htm
also you could consider storing information in the DNA of some insect or fish. Assuming it won't become extinct you'll only have to catch one in order to get your information back.
The advantage is you don't have to make the backup copies yourself.