Slashdot Mirror


Bush Administration's E-Mail Deluge May Overload Archive System

Lucas123 writes "The Clinton administration generated 32 million e-mails. Bush's administration has generated 50 times as much data — 140TB, 20TB of which is email — which soon will have to be archived through a new government-built records management system. The new system may not be up to the task because the technology behind it may not be able to handle the sheer volume of data along with the fact that the Bush administration has been slow in providing the National Archives and Records Administration (NARA) with needed information about the records, according to a Computerworld story. Questions have also been raised about millions of missing e-mails from between March 2003 and October 2006. 'It wasn't until this summer that an intensive effort began to share information,' said Ken Thibodeau, director of NARA's Electronic Records Archives."

41 of 169 comments (clear)

  1. It is Clinton's porn stash! by Anonymous Coward · · Score: 5, Funny

    The other 120 TB was probably just Clinton's porn stash that the Bush administration found while purging off records.

    1. Re:It is Clinton's porn stash! by Ethanol-fueled · · Score: 4, Funny

      Probably 120 TB because of the larger image sizes needed to accommodate fat chicks.

  2. Number of emails generated. by Ice+Wewe · · Score: 5, Insightful

    "The Clinton administration generated 32 million e-mails. Bush's administration has generated 50 times as much data -- 140TB, 20TB of which is email -- which soon will have to be archived through a new government-built records management system.

    Well, to be fair, email wasn't quite as popular during Clinton's administration as it is now. Then again, the 400GB of e-mails that the Clinton administration must have generated (if it is 50 times less than 20TB) must have been rather hard to store when he left office.

    1. Re:Number of emails generated. by GMonkeyLouie · · Score: 3, Interesting

      Well, most of that 400GB from Clinton's administration was dirty pictures of interns. In all seriousness, though, I don't think the problem will be finding a way to store all that data. The real kicker will be finding information you need in it. Seems to me like the best way to hide relevant and/or damaging e-mails would be to have them stored right alongside truckloads of chain letters.

    2. Re:Number of emails generated. by hedwards · · Score: 2, Interesting

      It isn't storage and it isn't finding it, the problem is preserving it long enough to look through and index it. I'm sure that Google and companies that do similar work have the technology to do it. I'm also quite sure that for the right price the Federal government could obtain software to do most of the heavy lifting.

      The problem is that the Bush administration deliberately migrated only partially to a new system leaving it in a state of constant risk for bit rot and corruption. It's hard to say how much of it has already been lost due to incompetence.

      And remember this is tax payer dollars and a Republican President, I'm sure he's OK with us writing a check for millions upon millions of dollars to correct his inept decision.

    3. Re:Number of emails generated. by geekmux · · Score: 2, Funny

      Well, to be fair, email wasn't quite as popular during Clinton's administration as it is now.

      Good point. I mean hell, Al Gore had just invented the Internet the year prior. Cut the guy some slack.

  3. First, not enough emails... by Anonymous Coward · · Score: 4, Insightful

    ...Now too many many emails.

    Whining is Washington's most favorite thing to do.

  4. Text only, no html by Teun · · Score: 5, Insightful
    Start by mandating text only mail.

    No more fancy signatures and html crap will cause a 60-80% drop in volume if not more.
    Mandate the Usenet way with replies after the original, (it will) teach people to cut irrelevant repeats.
    Stop the addition of stupid and ineffective disclaimers.

    Teach the use of (ftp) servers for sharing large documents, no more Microsoft sized attachments, send a link.

    --
    "The likes of Facebook and WhatsApp are free to those whose privacy is of zero value."
    1. Re:Text only, no html by ai3 · · Score: 4, Insightful

      I would rather buy another hard disk than waste precious time editing the mail I'm replying too, in most cases it simply isn't necessary. For Usenet it's a different story as many people read it, so it's worth the effort.

    2. Re:Text only, no html by Neoprofin · · Score: 5, Informative

      If it anything like our corporate mail server I would bet you the number one space filler is people making minor changes on documents then reattaching them and forwarding them back to the same 50 people who just got the previous version of the document, repeated over 100 iterations as the email soon becomes a 2GB mess.

    3. Re:Text only, no html by EvilRyry · · Score: 2, Informative

      Some database driven mail servers like Citadel, Exchange, Zimbra and probably Domino support only storing the message and attachments once no matter how many people it was sent to.

      It goes a long way in preventing the attachment * user mess.

    4. Re:Text only, no html by malkavian · · Score: 5, Insightful

      Longer email threads seem to end up forwarded and brought to the attention of many people you never expected at the outset.
      Judicious editing of the emails to include only the relevant sections for the replies, giving the context of the emerging thread of conversation means that someone being brought up to speed with that segment of the conversation doesn't need to trawl through masses of irrelevant junk to get at the meat of the issue.
      I tend to do it as an efficiency gain, rather than taking storage space into account. All comes back to that quote you hear people come out with after sitting through a bad movie "Well, that's an hour of my life I'll never get back". It may only be a few minutes at a time, but they mount up over time. Plus, crafting things to cut to the heart of the matter puts things into sharp perspective, and means people are far less likely to digress, saving even more wasted time.

    5. Re:Text only, no html by Teun · · Score: 2, Informative
      When not on Slashdot we're expected to read the message we reply to.

      Deleting the bit that's already answered, not relevant or whatever can hardly be called 'editing', it has more to do with comprehension.

      One of the worst things for the latter is a typical corporate Outlook mail exchange (I know that word...) with at the bottom text that hasn't been read for the last ten replies.

      --
      "The likes of Facebook and WhatsApp are free to those whose privacy is of zero value."
    6. Re:Text only, no html by kevin_conaway · · Score: 4, Insightful
      Did you stop paying attention to email in 1995?

      No more fancy signatures and html crap will cause a 60-80% drop in volume if not more.

      I know you hate it when your mom or the boss' secretary at work sends out a cutesy formatted email but some people can actually use HTML email effectively in lieu of sending a document or a link

      Mandate the Usenet way with replies after the original, (it will) teach people to cut irrelevant repeats.

      Irrelevant repeats for you may be important context for someone else.

      Stop the addition of stupid and ineffective disclaimers.

      Often times, those disclaimers are required by law. Most people don't add them for fun or to make themselves feel important.

      Teach the use of (ftp) servers for sharing large documents, no more Microsoft sized attachments, send a link.

      FTP? Are you serious? Sending documents by carrier pigeon is more secure and reliable than FTP

    7. Re:Text only, no html by Reality+Master+101 · · Score: 4, Insightful

      Sheesh. I call this phenomenon "technological puritanism". All tech must be ugly! 80 columns should be enough for anyone! Fixed-width fonts were good enough for my granddaddy, they were good enough for me, and they should be good enough for everyone! Words are worth a thousand pictures! Get off my damn lawn!

      Nothing personal, but if people like you were in charge of the world, we'd all be living in gray, cast concrete cubes. Think of the efficiency! No more wasted paint. You can just make a bigger house by stacking the blocks and adding a ladder.

      Most of us *like* color, pictures, paragraphs, and most of all, convenience. Use FTP when I can just add an attachment that goes directly to the source? Give me a frickin' break. No one gives you respect points when you prove how miserably you can live.

      Let's put this in perspective... that 120 terabytes costs 12,000 dollars in hard drives. Retail at Fry's. The entire output of the Bush Administration costs less than what they probably spend on coffee in a month.

      P.S. And, yes, this is from someone who used a teletype in high school, and was ecstatic when we got a 300 baud modem (whoa! It's almost 3 times faster than the ol' 110!) and a Televideo terminal. Those days were not better.

      --
      Sometimes it's best to just let stupid people be stupid.
    8. Re:Text only, no html by Joce640k · · Score: 2, Insightful

      140 1TB Hard disks (plus another for RAID) probably costs less than a couple of government office chairs so what's the problem?

      [Most likely the fact that it's in secret, proprietary formats and spread across hundreds of PCs instead of being archived by the mail gateway]

      --
      No sig today...
    9. Re:Text only, no html by Vancorps · · Score: 2, Informative

      Your statement doesn't make sense. Exchange supports and automatically takes advantage of single instance storage right out of the box. What do you need 3rd party software for that disables it?

      I run Exchange on a NetApp SAN so everything gets deduped and archived to tier 2 storage if it hasn't been accessed within 90 days. Tier 2 is a lot SATA disks that are backed up to tape. It's not even an expensive solution when you start talking about the cost of enterprise storage.

    10. Re:Text only, no html by the_other_chewey · · Score: 2, Insightful

      Stop the addition of stupid and ineffective disclaimers.

      Often times, those disclaimers are required by law. Most people don't add them for fun or to make themselves feel important.

      I don't know about the situation in the USA, but in most parts of the world, this is exactly the reason.
      "Because everybody else does it" is another.
      In multiple european countries, those disclaimers are entirely worthless, and even in some cases came back
      to bite those using them in court by proving that the sender was aware that some piece of information might
      end up in the wrong place.

      Disclaimers don't replace common sense or encryption.

    11. Re:Text only, no html by jesterzog · · Score: 2, Insightful

      If it anything like our corporate mail server I would bet you the number one space filler is people making minor changes on documents then reattaching them and forwarding them back to the same 50 people who just got the previous version of the document, repeated over 100 iterations as the email soon becomes a 2GB mess.

      In our organisation (government but non-US) we just give people a document management system and we educate people about why they should use it. If that doesn't work we point out it's policy and make them use it, because if they don't then it means that important records might go missing and we could end up in trouble if anyone officially requests the records we hold on any particular topic.

      Nobody sends attached documents, they send links to documents in the DMS. After using it for a short time everyone seems to appreciate the benefits of only having a single master copy of documents.

  5. What's up? by Anonymous Coward · · Score: 3, Interesting

    It hasn't helped that the Bush administration has been slow in providing NARA with needed information about the types and volume of data that will need to be archived. It wasn't until this summer that an intensive effort began to share information, Thibodeau says.

    I can understand the reasoning that for national security, some information needs to be kept secret. The thing is, the more I hear of this administration's obfuscation of their communications and dealings, I can't help but wonder what in the World they are hiding.

  6. Shadowy Government by GMonkeyLouie · · Score: 4, Interesting

    Whenever I receive news that information that we're supposed to have access to from the Bush administration has gone missing, it makes me queasy. There's so much secrecy surrounding random little things that it's started to make me paranoid. Maybe it's just me wanting to blame the last eight years on a scapegoat, but I feel like someone at the top is trying to hide something really big and succeeding.

    1. Re:Shadowy Government by StopKoolaidPoliticsT · · Score: 2, Interesting

      Perhaps you're too young to remember, but Clinton's administration had a problem with missing emails during investigations too (Lewinsky, why hundreds of FBI records on their political enemies ended up in the White House, illegal campaign donations from China, etc).

      I'd say it's par for the course and if you think just one side is doing shady stuff, it might be because you're a bit partisan.

      --
      Stop Koolaid Politics
    2. Re:Shadowy Government by GMonkeyLouie · · Score: 2, Interesting

      Well, Clinton never tried to insist that his VP wasn't part of the executive branch, never tried to put Harriet Miers on the supreme court... Actually I think the shadiest person in the administration is Cheney. He's certainly one of the only members of the 2001 Bush team left, and he keeps so many secrets! Also, he shot a man in the face one time. I love adding that to the end of my Dick Cheney rants. Is that too partisan?

    3. Re:Shadowy Government by StopKoolaidPoliticsT · · Score: 3, Informative

      Well, Clinton never tried to insist that his VP wasn't part of the executive branch,

      It's called "Unitary Executive." That is, there's only one guy in the executive branch that gets to make the decisions. It's entirely up to the President how much of a role he gives the Vice President. Under George Washington, John Adams lamented that the only thing he could do was preside over the Senate and then, he had no say on anything unless there was a tie. It drove him nuts.

      If you read the Constitution, Article II groups the Vice President in with the executive branch, but the ONLY place it provides a job description for him, other than sitting around, waiting for something to happen to the President, is in Article I, Section 3 where it says he is to preside over the Senate and break ties.

      As for the rest of your comment, every administration keeps secrets and covers things up. All of them. It's not just a Bush thing or a Cheney thing, it's a Bush, Clinton, Bush, Reagan, Carter, Ford, Nixon, Johnson, Kennedy, et al thing. Some are better at hiding it than others... and some people simply refuse to open their eyes if it is "their guy" in the White House.

      --
      Stop Koolaid Politics
    4. Re:Shadowy Government by rtfa-troll · · Score: 5, Insightful

      No; you are partisan when you think an accusation against one side can be answered by an accusation against the other side. They are both bad (they are US politicians; corruption is so endemic that it's legal and called lobbying), but Clinton's presidency ended about eight years ago and isn't something worth discussing now.

      The questions are; how to make sure Bush follows the law for what he still does? How to make sure Obama doesn't start off like Bush?

      --
      =~ s,(.*),<sarcasm>$1</sarcasm>,g if any_point_you_wish();
    5. Re:Shadowy Government by mhollis · · Score: 4, Interesting

      I have understood this outgoing administration to be more than secretive. they're positively paranoid and the only administration in memory that was similar was Nixon. All internal memos have been classified first. Declassification only happens when there is a strong and abiding reason why the memo should be declassified. Contrast that with Clinton, where all internal memos are not classified, unless there was a strong and abiding reason why the memo(s) should be classified.

      When Bush announced that his administration would immediately prepare for a transition (before the 4th of November, which was election day in the US), I assumed that the first course of action was that this Bush administration would do what the last Bush administration did: [Rip] the hard drives out of their computers and tried to erase "sensitive" computer files in the White House and West Wing.

      To say that the Clinton Administration started with a "clean slate" was an understatement. Later, Clinton lawyers ignored the dangers of historical archive deletion when faced with Republican destruction of historical records. Presumably, they wanted a "pass" from future Republican administrations.

      Republican administrations tend to be very secretive. Democratic administrations tend to not. I shall expect the Obama administration shall have to purchase all new computers -- or at least hard drives -- in order to simply start up in their first week. This is a horrid waste of taxpayers' money all in the name of whitewashing one's past deeds (for good or ill).

      Due to record-keeping, we now know that Nixon did know about the Watergate break-in. And we do know that he was very interested in its coverup. Nobody can be prosecuted at this time for that (those who were found guilty have all ready served their time). I would be very interested to know if Reagan's CIA planted the stacks of AK-47s used as evidence by his administration that the attack on Grenada was justified. And we still do not know everything about the Iran-Contra affair. These historical records are worth keeping because, well after the Statute of Limitations, America gets another look at how an administration dealt with the world.

      It is a shame that any Administration is that interested in "rewriting history" in order to unfairly burnish a legacy, which in the case of "W" is hardly salvageable.

      --
      Gods don't kill people, people with gods kill people.
    6. Re:Shadowy Government by StopKoolaidPoliticsT · · Score: 2, Informative
      Yes... the same ones that think there is a Constitutional role for the government to provide health care and retirement to the people of the United States, while clearly ignoring the Tenth Amendment.

      It's real simple, the Constitution isn't that hard to figure out.

      Here is every mention of the Vice President in the Constitution:

      Article I, Section 3:
      The Vice President of the United States shall be President of the Senate, but shall have no Vote, unless they be equally divided.

      The Senate shall chuse their other Officers, and also a President pro tempore, in the Absence of the Vice President, or when he shall exercise the Office of President of the United States.

      Article II, Section 1
      The executive Power shall be vested in a President of the United States of America. He shall hold his Office during the Term of four Years, and, together with the Vice President, chosen for the same Term, be elected, as follows:

      ...
      In every Case, after the Choice of the President, the Person having the greatest Number of Votes of the Electors shall be the Vice President. But if there should remain two or more who have equal Votes, the Senate shall chuse from them by Ballot the Vice President.

      In Case of the Removal of the President from Office, or of his Death, Resignation, or Inability to discharge the Powers and Duties of the said Office, the Same shall devolve on the Vice President, and the Congress may by Law provide for the Case of Removal, Death, Resignation or Inability, both of the President and Vice President, declaring what Officer shall then act as President, and such Officer shall act accordingly, until the Disability be removed, or a President shall be elected.

      Article II, Section 4:
      The President, Vice President and all civil Officers of the United States, shall be removed from Office on Impeachment for, and Conviction of, Treason, Bribery, or other high Crimes and Misdemeanors.

      The office is also mentioned in the Twelfth Amendment (which changes how the VP is selected), the Fourteenth (regarding who can vote for VP and that they can't have been part of a rebellion against the US), the Twentieth (defines the executive term and succession to President), the Twenty-third (giving DC electors), the Twenty-fourth (banning poll taxes), and the Twenty-fifth (succession).

      That said, none of those amendments gives the office any more of a defined role, so we must go by the Constitution itself... which is very straightforward and if there's any doubt, look at the first Presidency, which ultimately defined the office in front of the very people that drafted the document creating it.

      So yeah, those "Constitutional lawyers" that see the Vice President's role defined otherwise have a pretty bad reading comprehension problem.

      --
      Stop Koolaid Politics
  7. Re:What the hell does the summary say? by jimicus · · Score: 4, Insightful

    Besides, only 140TB (or 20 TB)? That's child's play for any competent DB admin, never mind only about $2k worth of hardware to hold it.

    Assuming that none of it's been put into the archival system yet, that means they're dumping 140TB on it in one go.

    You index 140TB on $2k worth of hardware and come back to me when you're done. Hopefully I won't have died by then.

  8. How much is spam? by houghi · · Score: 3, Interesting

    How much of that is spam? I can imagine they are not allowed to delete spam. Spam has increased, so this would mean that all of it is still there.

    The rest can mean a lot of different things. I am forced to work (otherwise no food) with 150MB excel files that I would love to put in a database and would take up at least 10 times less space. And I am not even talking about speed increase and ease of use, because somebody else has the file open, so I can not change the content.

    Or perhaps Clinton did not keep everything. Or ...

    --
    Don't fight for your country, if your country does not fight for you.
  9. If they hadn't gone to exchange.... by CFD339 · · Score: 5, Informative

    The Bush administration moved the White House from a Notes/Domino based system to a Microsoft Exchange based system.

    Before moving, they'd had no downtime -- even when congress was taken out for 2 days by the code red word (they were on Exchange).

    In moving, they mysteriously 'lost' all their backups for a period of time that was suspicious as hell, and now they can't scale to handle the capacity issues they face.

    In a Notes/Domino world, this kind of archiving problem wouldn't be all that hard to deal with. You'd just need enough storage for it, and create archives per week/month/year (or an archive per individual's mailbox, or whatever) to put on as much hardware as was required. I single checkbox would be all that was needed to have it encrypted as well.

    Oh well. I guess if conveniently "loosing" mail when you don't want it found is one of your design goals, than you probably want to migrate to something less reliable.

    --
    The problem with quotes on the internet, is that nobody bothers to check their veracity. -- Abraham Lincoln
  10. Dear staff by keraneuology · · Score: 4, Insightful

    It has come to my attention that as I prepare to leave office my previous instructions to make all email and other documentation available to the shredder was incorrect. The correct policy is to make everything available to the archiver. If you have any concerns please feel free to pick up a copy of the standard presidential pardon boilerplate from my secretary's desk. Thank you, W

    --
    If the g'vt kept the data on you that google does you'd better believe you'd be calling it "doing evil"
  11. Re:What the hell does the summary say? by Kent+Recal · · Score: 2, Insightful

    Maybe not $2k worth of hardware but $200k will do. Which is still peanuts in government terms. They probably spend that amount on paperclips and toilet paper in the pentagon alone.
    Honestly, storing and indexing 140TB of e-mail is a trivial task when you can apply a six digit budget to it.

    If their "archival system" blinks at the sight of 140TB of mostly text then it doesn't even deserve the name.

  12. Not the same thing. by Frosty+Piss · · Score: 4, Insightful

    Perhaps you're too young to remember, but Clinton's administration had a problem with missing emails during investigations too (Lewinsky, why hundreds of FBI records on their political enemies ended up in the White House, illegal campaign donations from China, etc).

    Yes, but there is a magnitude of difference in importance between lost emails about blow jobs and a little dirty money, and emails about the loss of privacy and civil liberties of US citizens, torture of POWs, and the various other nastiness that GWB et al are suspected of. Much different.

    --
    If you want news from today, you have to come back tomorrow.
  13. The problem is Law, and convention, not volume by omb · · Score: 2, Insightful

    As with almost all problems where electronic/internet technologies bump into real life issues eg privacy, non-repudiability and simple confidence it is because the Law has not kept up with technology, and that in the USA is the responsibility of the Congress. Writing was thousands of years old, and the printing-press more than 300 years old when the Constitution was adopted in September 17, 1787. The drafters understood the technology.

    Today we are blessed with ignorant self serving legislators who do not, and are far too happy to follow hard-case makes bad law hurd thought, eg children, porn, paedophilia, drugs and terrorism. The courts have long held that you can read post-cards, but that if your letter-in-an-envelope is opened then a felony is committed or the information is normally in-admissible.

    For this to work people have to start encrypting and signing their e-mails and the Congress and the SCOTUS must enforce identical rules for electronic and hand-written communication.

    Specifically you can not go out and discover the entire contents of someone's library and papers in a law suite, and expect to go on a search-engine enabled fishing expedition.

  14. Desired Outcome on a Platter by RulerOf · · Score: 2, Informative

    Riiight... Blame it on Exchange.

    Seriously, if "conveniently [losing] mail" was the goal of the transition, they could have moved from Exchange to Domino and gotten the same effect.

    Forget not, throwing storage (read: money) at any system tends to fix the problem given a competent staff. You don't make a very compelling argument.

    --
    Boot Windows, Linux, and ESX over the network for free.
  15. Re: Juducious editing FTW! by TaoPhoenix · · Score: 2

    Yea, and there's an aesthetic feel to it too. If I'm in a 20 reply discussion, I like to edit out anything more than 2 exchanges old, and I change the subject title every two mails.

    Nothing annoys me more than 20 mails titled "re: call"

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  16. Should've used ZFS by An+dochasac · · Score: 2, Funny

    At the current presidential email growth rate, NTFS isn't gonna cut it for Obama.

  17. Throwing storage won't solve Exchange's issues by CFD339 · · Score: 2, Interesting

    There's an inherent architectural difference between storing mail in a database built on Microsoft's JET technology, and one which stores its data in something that is (although distinctly odd) very much like an xml data store. The Domino architecture makes segmenting the archive into manageable parts by date, by person, or by any combination thereof much simpler.

    Essentially, the Domino architecture results in exactly what you describe -- throw more storage space at it and you can keep storing more data. The Microsoft architecture does not.

    --
    The problem with quotes on the internet, is that nobody bothers to check their veracity. -- Abraham Lincoln
    1. Re:Throwing storage won't solve Exchange's issues by dbIII · · Score: 2, Informative
      The theoretically competant staff with perfect hindsight and unlimited budget could just alias all the emails to also go to a decent system that actually works properly (and can be backed up easily) - however release cycles with MS Exchange are fairly short and the advertising is so good that people could be convinced that it is a half decent system THIS time. Backups have been a horrible problem with MS Exchange for years and it just seems bandaids have been placed over the problems to keep things going in most situations instead of actually getting it into a reliable state.

      For some ideas of how badly things can go take a look at an MS Exchange users mailing list archive. The thing is so fragile that it doesn't take a lot to lose emails. Those who wish to make personal attacks should consider that I'm talking about more competant and experience MS Exchange admins than myself. I personally haven't touched it since 2002 where the users thought it was reliable because there were really three servers doing the job that any other mail system could be doing on a single server, and where multiple backups and/or mirroring got around the problem of backups being unreliable.

      Apparently the current version is not a steaming pile of crap, however I only have MS advertising and fanboys to assure me of this and I also doubt that the White House actually upgraded to the current version. They are most likely running whatever version was installed at the start of the administration. If it's MS Exchange version 5.5 all bets are off and all emails could collapse into a hole of random data and the backup tapes might only contain the statement "file in use, unable to copy".

  18. Moore's Law: 16x in 8 years by SamuraiMike · · Score: 2, Informative

    It's been eight years since the Clinton administration. This is 4x the doubling period based on Moore's Law. While Moore's Law relates to transistor density, Wikipedia says that it's roughly similar to gains in disk storage. So in the last eight years, we could estimate disk storage gains of 2^4 = 16x. This doesn't get you all the way to 50x, but it cuts out a big chunk of the gains.

  19. Well yeah... by RyuuzakiTetsuya · · Score: 2, Funny

    Of course that happens when you embed the 1600x1200 raw image of dick cheney giving everyone the finger with each email

    --
    Non impediti ratione cogitationus.