Slashdot Mirror


You've Got Mail -- Tons Of It

Daniel Goldman writes "The Baltimore Sun has an article about the City of Baltimore's email problem." A snippet: "Millions of old e-mail messages are clogging Baltimore's municipal computers, so the city is going to start automatically deleting any messages older than 90 days. A common practice in private business, the move raises questions when made by a municipality, which has a responsibility to retain certain public records." Goldman points out "Just think about all the potential law suits; 'if it's not there, they can't subpoena it.'"

249 comments

  1. Great way to ignore your customers by Anonymous Coward · · Score: 0

    The tax payers.

    1. Re:Great way to ignore your customers by ePhil_One · · Score: 2

      If they haven't read it in 90 days, they've already ignored it.

      --
      You are in a maze of twisted little posts, all alike.
    2. Re:Great way to ignore your customers by No.+24601 · · Score: 4, Insightful
      If they haven't read it in 90 days, they've already ignored it.

      I don't know what business you work in, but if they haven't read it in 3 days, they've lost my business.

    3. Re:Great way to ignore your customers by wizard992 · · Score: 1
      I don't know what business you work in, but if they haven't read it in 3 days, they've lost my business.


      Well, we are talking about the government here...

    4. Re:Great way to ignore your customers by NetGyver · · Score: 2, Insightful

      I hate it when people associate taxpayers + government with customers + business. The two relationships are very different.

      There are no laws I know of that tell me I have to pay Company X for products. If I don't want any products from Company X, I won't buy anything from them. I'm not going to be breaking any laws because of it. However, if I don't pay my taxes I'll get hounded to death with the possibly of being tossed in jail.

      See the difference?

      --
      A Penny for my thoughts? Here's my two cents. I got ripped off!
    5. Re:Great way to ignore your customers by Kjella · · Score: 2, Insightful

      I don't know what business you work in, but if they haven't read it in 3 days, they've lost my business.

      Let me guess.... you're emigrating a lot, yes? Otherwise you might have to have "business" with the government. Good luck getting a reply in three days there.

      Kjella

      --
      Live today, because you never know what tomorrow brings
  2. Simple... by Anonymous Coward · · Score: 5, Funny

    Outsource each employees email to GMail. Problem solved.

    1. Re:Simple... by hype7 · · Score: 3, Funny
      Outsource each employees email to GMail. Problem solved.


      yeah, and if the budget's looking a bit bad for that year, they could always put a few of the email accounts up on ebay.

      -- james
    2. Re:Simple... by Simon+Lyngshede · · Score: 3, Insightful

      Gmail wouldn't solve problems like this, they only offer one 1GB. I work with secretaries who would use up 1GB of storage a year if they didn't delete any emails. The organisation I do systems administration for isn't even that big, so I could easily imagine that other people running into problems earlier.

    3. Re:Simple... by Nutria · · Score: 0

      I work with secretaries who would use up 1GB of storage a year if they didn't delete any emails.

      That's truly stunning.

      The 36,000 email archive of the Evolution mailing list is only 105,500,082 bytes. Many of those emails are HTML, to boot.

      --
      "I don't know, therefore Aliens" Wafflebox1
    4. Re:Simple... by stilwebm · · Score: 2, Funny

      Yes, but people don't send one page, 100KB Word Documents as attachements to the Evolution mailing list. Secretaries do.

    5. Re:Simple... by BasilBrush · · Score: 3, Insightful

      That's fine. Disk storage is cheap. Certainly cheaper than paying hundreds of staff for the time taken to go through all their old mail sorting the wheat from the chaff. The right solution to running out of disk space for email is to add more disks.

    6. Re:Simple... by zonker · · Score: 0

      i have a gmail account and one of the things i find annoying about it is that you can't easily 'empty spam'. i get tons of spam and while the majority of it is caught by gmail's spam filter, i still can't get rid of it with a single button.

      i have over received 5000 spams in the last 1.5 weeks which would require me to delete 100 spam messages at a time, 50 times to get rid of it all... only to have to repeat the process in another few weeks. while this isn't a big deal in that i have 1 gig to use, it is quickly being eaten up by spam. i'm already using about 2% of my space, all by spam. at this rate i expect that by the end of the year (no i haven't done the math but if you are so inclined, go ahead) i'll have used the majority of it with spam...

      i reported this 'oversight' as a bug. hopefully they'll add an easy to use empty spam button soon...

      btw, the reason why i get spam isn't due to gmail (like various hotmail and yahoo accounts i've had over the years) but because i redirect unrouted mail from my domain to my gmail account so it doesn't get mixed in with mail i want to get. so my gmail account is sort of my spam account...

    7. Re:Simple... by Anonymous Coward · · Score: 0

      heh-- I worked for a company where every press release was sent to every employee in the company in Word format.

      The sysadmins pleaded for years, then gave up.

      The bonehead marketing guy sending the press release could have just sent the url to the press release on the corparate web page.

    8. Re:Simple... by Lord+Apathy · · Score: 3, Funny

      I can beat that. A few years ago this bitch at work clogged up the mail system with a 50 mb zip file containing pictures from the corp. picnic. She sent it to every employee in the company.

      Stupid bitch

      --

      Supporting World Peace Through Nuclear Pacification

    9. Re:Simple... by Lord+Apathy · · Score: 1

      Yeah but 20gb dat tapes are cheaper and have better shelf life. I bet all the data in these emails could fit on one or 2. This is a none issue. Stick a crowbar in the public coffer and buy a offline tape storage.

      --

      Supporting World Peace Through Nuclear Pacification

    10. Re:Simple... by Doppler00 · · Score: 2, Insightful

      How to save 90% of disk space:

      Sort all users e-mail recieved by size for a given year.

      Delete 5% of the largest e-mails. These will probably account for around 90% of all disk usage. They probably represent file attachments which should have been stored on a server instead of in an e-mail account anyway.

      Just think, when you mail a 2MB attachment to 3,000 people in a division, that could use quite a bit of disk space.

    11. Re:Simple... by devilspgd · · Score: 4, Insightful

      With a properly designed mail system, only one copy of the message would be stored on disk, with pointers from each mailbox to the single central copy.

      *shrugs*

      --
      Give a man a fish, he'll eat for a day, but teach a man to phish...
    12. Re:Simple... by Anonymous Coward · · Score: 1, Informative

      only if your mta sucks anus. with a rational one, it'll recognize that the same email is going to 3000 people and only store one copy with a reference count of 3000. when they've all deleted it, it'll get punted, until then it'll use 2MB.

    13. Re:Simple... by SoLoatWork · · Score: 1

      secretary_betty@gmail.com forward to -> secretary_betty_2004@gmail.com

      Dec. 31, 2004, change forward to secretary_betty_2005@gmail.com.

    14. Re:Simple... by Meski · · Score: 1

      Well, you could let it happen automatically, as long as you don't get a gig of spam a month... From Gmail help: How do I delete messages? The Gmail system will automatically empty your trash and spam every 30 days. --------------- Although I do think a select all that did what it says would be useful. Or a select mail that meets a search criteria.

    15. Re:Simple... by SlamMan · · Score: 1

      In a properly designed mail system, it would have spit the message back out at her, saying it was too big. It would also have violated email policy by sending out mass mailing to all staff without supervisor approval.

      --
      Mod point free since 2001
    16. Re:Simple... by UID1000000 · · Score: 1

      Gmail wouldn't solve problems like this, they only offer one 1GB. I work with secretaries who would use up 1GB of storage a year if they didn't delete any emails. The organisation I do systems administration for isn't even that big, so I could easily imagine that other people running into problems earlier.

      I agree. What the users need to do is to move the files off of their Outlook account on to a PST file. They could then eat up local disk storage.

      (Apologies for mentioning MS)

      Me, I've been at my current position for two years and I've used about .96 GB of personal space. This is attritubed to large attachments, etc (maxed out at 4mb). Our email accounts are limited to 65 MB and then the emails don't get deleted but as motivation to move your files you are locked out from being able to send emails. That is definitely a motivational tool.

      PS. I'm a beta tester for Gmail. There is no way that Gmail compares, for business use, to using an application. (Expect in portability)

      --
      UID 1000000 is just around the corner.

    17. Re:Simple... by zonker · · Score: 0

      thanks. didn't see that in the help. fortunately i don't get that much spam, so it should help. :)

    18. Re:Simple... by Anonymous Coward · · Score: 0

      I'm curious. Which major email servers do this? I understand Exchange does something like this. UW-IMAP and Courier-IMAP do not. Does Cyrus IMAP do it? A couple years ago a friend and I looked in to creating a similar design, hoping to use a storage independent (ODBC or BSD DB or perhaps even flat file) back end. We quickly realized that it would take every ounce of free time we had to implement it with adequate performance without a large software engineering team to back us up.

    19. Re:Simple... by devilspgd · · Score: 1

      I'm not aware of any that do at the moment, although it wouldn't surprise me if Exchange does.

      Well, other then mine that is -- But in fairness, that only applies when I send out a newsletter (and yes it's a waste of time when I do it that way for what usually amounts to a 5KB message including headers) -- In fairness, my server doesn't do it natively, but I've managed to get it working as a proof of concept in to a closed source SMTP/POP3/IMAP/Webmail server.

      Personally, I prefer storing messages in individual RFC822 .MSG files rather then an mbox format. My server generates a couple indexes of messages which means that the messages themselves aren't parsed every time the user hits "check mail" in webmail, or accesses the mailbox via POP3/IMAP, the only hit is against the filesystem, and typically most of the file allocation tables stay in the disk cache.

      Once you're using individual files, it's relatively simple to create "copies" of messages as hard links rather then actual files. This lets the OS handle the tough stuff, the mail server never knows the difference.

      Whether it would scale or not with the mail server's current design, I'm not sure, but I know the developer's ears perked up when I mentioned my own tests using my own delivery agent were successful, and ultimately was just some code to parse the userfile and to create hardlinks, little more then that.

      --
      Give a man a fish, he'll eat for a day, but teach a man to phish...
    20. Re:Simple... by pappin · · Score: 1

      actually, if they deleted any email that didn't come from a short-list of addresses (or domains) and archived the rest, the would likely keep most of the stuff that should be kept.

    21. Re:Simple... by bluebagger · · Score: 1
      We have heard a lot about the service, but how is the spam filtering specifically?

      __
      Discount web hosting from $4.

    22. Re:Simple... by Anonymous Coward · · Score: 0

      Fine, go buy everybody a properly designed mail system. Advice is cheap. Rebuilding a system isn't, even a poor one.

  3. Beowulf cluster by b0lt · · Score: 2, Insightful

    This might be a practical use of one, determine which emails are valid, and which aren't, like a spam filter. Allow users to flag 25% or so emails as important, and archive those.

    --
    got sig?
    1. Re:Beowulf cluster by Limecron · · Score: 0, Troll

      Ugh...

      I'm not sure if its worse that this comment might be serious or that someone actually modded it Insightful.

  4. Just load them into Google or the Archive by Animats · · Score: 2, Funny

    Either Google or the Internet Archive would be happy to archive that data for the City of Baltimore and keep it available for public reference.

  5. Bayesian Filter to Identify Officail Mail by Dave419 · · Score: 5, Interesting

    Since they need to delete tons of old messages spam included, but want to save official email, why don't they train a Bayesian Filter to sort through and save as much as possible. Since they can't rely on their employees actually saving each message which was official to their hard drives.

    --
    ~ there are 10 types of people in this world, those that can read binary and those that can't
    1. Re:Bayesian Filter to Identify Officail Mail by Kenja · · Score: 5, Insightful

      because even one false positive can get them in trouble?

      --

      "Have you ever thought about just turning off the TV, sitting down with your kids, and hitting them?"
    2. Re:Bayesian Filter to Identify Officail Mail by Anonymous Coward · · Score: 1, Insightful

      Deleting all the mail... or delete a few false positives. Hrm, tough call...

    3. Re:Bayesian Filter to Identify Officail Mail by llamaguy · · Score: 2, Insightful

      You could have a manual meta-check on all the positives to make sure they aren't anything vital. Computers don't make mistakes, they just don't think like we do so sometimes it's necessary to sort through it all.

      --
      HAH! I just wasted a second of your life making you read this, but I wasted a minute of mine thinking it up. DAMN.
    4. Re:Bayesian Filter to Identify Officail Mail by abhisarda · · Score: 2, Insightful

      "because even one false positive can get them in trouble?"

      You should probably go take a class on probability. When you're dealing with millions of email, there are going to be some false positives.
      What's the alternative, hand sort them?
      Yeah, that's a good idea right? But with bayesian filtering, you can do a lot of refining when you're dealing with millions of email.
      And who says that you need to use the same filters for the health dept and the transport dept.

      Jesus christ, there are lots of companies that already do this. Its not like Baltimore's the only city with millions of old email.
      This is not a mars mission, this is judicious use of existing technology- bayesian filters(or whatever fits the profile) and enterprise storage solutions.
      Its better off spending a few hundred thousand(or less) on archiving the mails than spend a million or two on lawyers and court 5 years later defending the decision to delete the data after some citizen sues them for records etc.

    5. Re:Bayesian Filter to Identify Officail Mail by asit+ler · · Score: 3, Interesting

      Won't work. The point of purging the 90+ day old messages is so that noone has to meta-check them for importance. Unless you want to hire a cadre of Trained Monkeys to look at the positives. It'd be a 1-banana job, and would have to pay bargain-basement peanuts.

      --
      This is not the sig you're looking for.
    6. Re:Bayesian Filter to Identify Officail Mail by Anonymous Coward · · Score: 0

      So, have you done anything other than write a 1995-era HTML editor, with non-standard widgets?

    7. Re:Bayesian Filter to Identify Officail Mail by cortana · · Score: 1

      I would stress that there would be fewer false positives with a Bayesian filter than there would be if a team of people were employed to classify all mail by hand.

    8. Re:Bayesian Filter to Identify Officail Mail by pipingguy · · Score: 1


      Since they need to delete tons of old messages spam included, but want to save official email, why don't they train a Bayesian Filter to sort through and save as much as possible

      Mostly, corporations do not want competent customer support (unless it's a big client).

      Why?

      - Truly good customer support costs money and requires that the CS people know what they're talking about. That costs money.
      - Corporations want script-trained "shooters" that can deflect resposibilty.
      - Corporations think that they cannot afford to pay competent people.

      Creative and innovative people are just an unpleasant necessity on the way to commoditizing things for the most profit.

      And they are right, given the rules.

    9. Re:Bayesian Filter to Identify Officail Mail by Raven42rac · · Score: 4, Insightful

      Five points for excellent use of buzzwords. I would say compress messages older than 90 days and save them. The government is not supposed to just willy nilly throw things away. I would invest in more hard drive space to hedge against lawsuits.

      --
      I hate sigs.
    10. Re:Bayesian Filter to Identify Officail Mail by pHDNgell · · Score: 1

      Five points for excellent use of buzzwords. I would say compress messages older than 90 days and save them.

      I get something like 50% compression on all of my mail with my zlib patch to cyrus. This does pretty good stuff for me. I batch compress them nightly since decompression comes at a very low cost, there's no need to wait 90 days.

      The cyrus guys have rejected my patch a few times because ``disk is cheap.'' Personally, I don't understand the sentiment. Disk is cheap, but it's twice as cheap with 50% compression for people who want it. It's half as cheap for those who don't.

      --
      -- The world is watching America, and America is watching TV.
    11. Re:Bayesian Filter to Identify Officail Mail by unmuzzled+and+mean · · Score: 1
      I say throw it away. We horde to much crap as it is and who the hell will realistically go back through it?

      On the other hand because it is electronically sortable perhaps it will be an indispenisble archive for future historians if we don't delete any of it.

    12. Re:Bayesian Filter to Identify Officail Mail by Anonymous Coward · · Score: 0

      Also determine who is responsible. If using filters that have been set up by the IT dept, is the IT dept responsible for false positives? If using hand-checking, who is responsible? When someone comes to sue, make sure you know where you stand.

  6. Fun! by Anonymous Coward · · Score: 1, Insightful

    Now we get to subpoena entire hard drives so we can run data-recovery software on them. It would be smart of any operation, public or private, to wait out the statute of limitations (which I realize may vary) of any states with which they have substantial contacts before they start deleting data.

  7. Why not... by Izago909 · · Score: 4, Insightful

    figure out what percentage is spam, and sue spammers to recover damages for lost resources.

    1. Re:Why not... by AvantLegion · · Score: 4, Funny
      Maybe just figure out what percentage is spam, and delete that percentage of mail. Ehh, that was probably the right 30% to delete....

    2. Re:Why not... by Sinful_Shirts · · Score: 0, Offtopic

      I vote for cutting off their testes.

  8. so? buy some storage, stick them in there. by Anonymous Coward · · Score: 4, Insightful

    can't they, like, just buy a big hard drive and stuff?

    If the average message is 10kib (10,000 bytes, make the math easier), and compresses down to 3kib (probably even better if you compress a bunch together), then you'd need roughly 30gib to store 10 million of them. Can you even buy hard drives that small any more?

    Add some search index, throw a crappy web interface on it, and call it a day. Never delete an email again!

    1. Re:so? buy some storage, stick them in there. by yppiz · · Score: 1

      The math is more like this.

      With gzip -9, you get either 20% or 10% of the original size of a text document (this is what the Internet Archive used for its archive of HTML when I was there).

      An index into a text collection is on the order of 10% - 100% of the size of the original collection, depending on what features you want to offer at speed. 10-50% is a reasonable size.

      So for 10M messages at 10k each, assuming the compression ratio above (which might not hold for MS Word attachments - a big caveat) you have 100G of source, 10-20G compressed, with a 10-50G index.

      Total storage is then between 20G and 70G, or 20-70% of the original uncompressed text.

      --Pat

    2. Re:so? buy some storage, stick them in there. by ThisIsFred · · Score: 3, Insightful
      can't they, like, just buy a big hard drive and stuff?
      Here's the problem with this: The longer the stuff is retained, the more expensive it gets to hold on to it. IT is usually a very low budget priority to government agencies, so it's going to be hard to purchase high-reliabilitly mass storage devices every couple of years. Since the goal is permanent archival, cheap, high-cap ATAPI fixed disks are going to be the last thing you want to store the stuff on. The other issue is that the user of the mailbox has complete control over the contents, so retaining everything is going to be really difficult to do, and accidental deletion will be a very credible alibi.

      There are rumblings about FOI and permanent archival among my Governmental Overlords, so I'm thinking hard about potential solutions to the problem. Trust me, it's very complicated issue, more so than I care to illustrate here (especially considering my habit of rambling on).

      The simplest solution is responsibility. If it's official policy, it's on dead-trees and filed away.
      --
      Fred

      "A fool and his freedom are soon parted"
      -RMS
    3. Re:so? buy some storage, stick them in there. by name773 · · Score: 2, Interesting

      what about cds as archival media?
      something about breaking down, but is that real?

      then there's dvds and magneto-optical (my personal favourite)

    4. Re:so? buy some storage, stick them in there. by Anonymous Coward · · Score: 0

      You know, you're on a freaking computer. Have you never heard of Google? Have you never had a CD go bad on you? Have you never had a drive go bad? Are you a frigg'n moron? Yes. Yes you are.

    5. Re:so? buy some storage, stick them in there. by ThisIsFred · · Score: 1

      He's definitely not a moron, and he brings up a good point. If I can get a guaranteed minimum of 10 years out of a CD-R or DVD-R, I will consider it. The problem with CD-Rs is that they are relatively low capacity, which makes for a logistics problem when archiving them. DVD recordables are a possibility.

      The discs are prone to damage when handled on a daily basis, but are much less so when recorded and stored. Of course, in ten years there will be the cost associated with the next generation of reliable storage, and all the work involved in transferring or converting the data. :o(

      --
      Fred

      "A fool and his freedom are soon parted"
      -RMS
    6. Re:so? buy some storage, stick them in there. by Anonymous Coward · · Score: 0

      Don't forget hard drives are getting cheaper and bigger.

    7. Re:so? buy some storage, stick them in there. by wired_parrot · · Score: 3, Insightful

      The problem is, the average e-mail is not necessarily 10kb. While HTML can be part of the problem by making e-mails several times bigger than need be, my experience is that large attachments are generally the biggest culprits. A 20Mb powerpoint presentation sent by pointy-haired manager to all his minions can easily swamp the system. And trust me, there are plenty of clueless managers out there sending out Very Large Attachments. I've received 50Mb Excel spreadsheet once, which contained nothing but a single image of a chart scanned at a ridiculously high resolution. It's crap like this that swamps mail servers, not the two paragraph responses.

    8. Re:so? buy some storage, stick them in there. by HermanAB · · Score: 1

      "the user of the mailbox has complete control over the contents" Hell no! You have to archive every message incoming and outgoing right on the server. You can't trust the users to do that. The bigger the organization, the bigger and more frequent the lawsuits.
      It is not worth saving pennies on disks/tape if you get hammered with a billion dollar lawsuit that you can't win 'cause all the e-mail was on some secretary's machine that was trashed two years ago...

      --
      Oh well, what the hell...
    9. Re:so? buy some storage, stick them in there. by Anonymous Coward · · Score: 0

      A. The administrative control tells me that they definately use Lotus Notes.

      B. I can also tell that they use it to transfer large files(ave size of my inbox was ~5Mb) to one another

      My old comapany was in the same situation(you could save your mail by replicating locally, Notes users know how non-intuitive this is)The best soultion is to get their data in order so they are not emailing each other large files.

    10. Re:so? buy some storage, stick them in there. by sfjoe · · Score: 1

      can't they, like, just buy a big hard drive and stuff?

      Are you kidding? The next election the Republicans go batshit because the city spent tax dollars saving porn-related emails. Rush Limbaugh could turn that into a week of shows.

      --
      It's simple: I demand prosecution for torture.
    11. Re:so? buy some storage, stick them in there. by Anonymous Coward · · Score: 0
      Since the goal is permanent archival, cheap, high-cap ATAPI fixed disks are going to be the last thing you want to store the stuff on.

      Use a redundant solution of some sort (some kind of RAID, possibly). Should a drive fail, you can replace it easily enough (cheap, high-cap ATAPI disks are easy to get on short notice). Periodically upgrade to a larger system. Simple, though it would require a larger IT budget.
  9. 100 gig of text? by Neuroelectronic · · Score: 0, Redundant

    with hard drive prices so low, i don't see what the big deal is. im sure if they droped $100 into each computer for a 80gig+ drive there would be plenty of space for -gasp- email. it took me a year and a half just to fill my 60 gig drive with MP3's and pron

  10. IMHO this sounds perfectly reasonable by Richard_L_James · · Score: 5, Insightful
    You wouldn't expect a public office to hang onto every piece of paper, so why should they be expected to hang onto every email they have ever received?

    There are always going to be things like replies to an original question and subsequent follow up questions going back and forth, so normally hanging onto the latest/final reply would be sufficient (providing it had the previous history - clearly showed the conclusion).

    Now if they were to use this as an excuse to accidently lose records that would be a different matter. This however is where auditors should be playing a role to ensure that they are keeping the right records and discarding the rubbish.

    1. Re:IMHO this sounds perfectly reasonable by Anonymous Coward · · Score: 0
      You wouldn't expect a public office to hang onto every piece of paper, so why should they be expected to hang onto every email they have ever received?

      Umm, because that would be an advantage of using the new technology.

    2. Re:IMHO this sounds perfectly reasonable by Twylite · · Score: 1

      Unfortunately "one size fits all" solutions seldom work. My SO worked in the buying department of a large retail group, who decided to start clearing out e-mail over 60 days old. Problem is the department has to place orders on suppliers 6 to 9 months before delivery. By the time delivery rolls around, most tracability is lost, including date estimates which suppliers swear they didn't commit to.

      Now all of this can be handled by printing out the e-mails (ack!), but you sort-of need to know to do this before the delete everything, right? Go figure.

      --
      i-name =twylite [http://public.xdi.org/=twylite], see idcommons.net
    3. Re:IMHO this sounds perfectly reasonable by Anonymous Coward · · Score: 0

      They need a place to store important e-mails. Some kind of file system would work great. I've never understood people who store every e-mail they get in the inbox.

  11. And there are no privacy concerns? by oneiros27 · · Score: 1

    And I'm sure they'd love to offer it up to the general public, as well. The question comes -- should all of it be public? I'm guessing that there are bits of it, which shouldn't be, and it's be more costly in the long run to try to analyze it, and determine what would have to remain confidential, then to just store it all in the first place.

    I'd prefer that people who are familiar with the actual data being stored make the determination if it should be publicly available.

    --
    Build it, and they will come^Hplain.
    1. Re:And there are no privacy concerns? by innocent_white_lamb · · Score: 3, Funny

      The question comes -- should all of it be public?

      They are public records. So, yes it should all be public.

      Simple, no?

      --
      If you're a zombie and you know it, bite your friend!
    2. Re:And there are no privacy concerns? by GlassUser · · Score: 1

      The problem is that not everything written down (on paper or virtually) is a matter of public record. Employees of the State of Texas (for example) are allowed reasonable personal use of resources under some conditions (it doesn't generate profit, it's not costing the State anything, etc). There is also personal information that's part of record that is not released under public information requests (usually a taxpayer's information, for example).

  12. incremental backup by Anonymous Coward · · Score: 5, Insightful

    "Baltimore officials, who approved the new e-mail policy at a Board of Estimates meeting last month, say they have no choice but to delete old messages, which are slowing city computers to a crawl. They say the system is so overburdened that creating a daily backup has become impossible; there is so much data that it takes more than 24 hours to copy it."

    What?!? What's wrong with an incremental backup? Surely all those millions of messages aren't *changing* every day?!?

    Think of all the children that will suffer from this!!!

    1. Re:incremental backup by Saeger · · Score: 1
      You expect something more than incompetence from government workers in their cushy jobs?

      --

      --
      Power to the Peaceful
    2. Re:incremental backup by Piquan · · Score: 3, Insightful

      What?!? What's wrong with an incremental backup? Surely all those millions of messages aren't *changing* every day?!?

      That depends on how their email system works. If it stores each user in a single file, then that file is changing every day. If they're using a file-based backup system...

    3. Re:incremental backup by Anonymous Coward · · Score: 0

      You can have incremental backup for single files as well. Look at all our version control systems. Even if they're binary files it can work (though email file formats like mbox are likely to be text-based). In other words, there's no excuse I can think of not to have it.

    4. Re:incremental backup by Anonymous Coward · · Score: 0

      Incremental backups are not always possible. Imagine a mail system that keeps each email in a seperate file (think SunOne Messaging / iPlanet). Hundreds of small files kill the performance of a backup solution, each file is a huge overhead. So in this situation, companies usually do a volume level backup. It costs more in tape (you back up the entire device, not caring if the block is empty or full) and you lose the ability to restore a single file (without special software), but you can actually do a backup in a reasonable amount of time.

      Or, if the mail system stores the emails in a large file (think Lotus), then as soon as any email comes in, the file has changed and you are backing it to tape.

    5. Re:incremental backup by Anonymous Coward · · Score: 0

      you only have to back up the new files each day.

      as individual files (e.g. maildir): that's easy.

      as one big file (e.g. mbox): version control will handle it, it's also easy.

      if they're getting too many NEW emails each day to handle, then that's another story, but the article doesn't make that out to be the case.

    6. Re:incremental backup by wired_parrot · · Score: 1

      Even incremental backup won't help if you have a very large volume of data. We have an incremental backup system in our company, and we often run into this sort of problem - a few days out of the month the backup won't be completed because it'll take longer than 24 hours for it to run through. Since backup and recovery isn't that visible until you have a big problem, less money has been put into it than our file servers and application servers, with the result that our backup servers are now having trouble handling the huge amount of data being spewed forth from our file servers.

    7. Re:incremental backup by Piquan · · Score: 1

      You can have incremental backup for single files as well. Look at all our version control systems.

      These rely on being able to diff the last recorded version with the version on-disk. This means having one tape drive reading your old backup, while another one writes to the new tape.

      Most incremental backup systems work off of timestamps, which means all you know is that the file changed. You don't know what changed.

      To put it another way, find me a real-life (commercial or free) backup suite that does this. While it may be possible in theory, I've never seen one in practice, and doubt it would go well.

      Of course, I'm presently referring to non-FS-aware systems. It may be possible with some types of journalling FSs with support from the FS itself, but I'd have to research that.

    8. Re:incremental backup by HermanAB · · Score: 2, Insightful

      Sure, but saving the user's copies of the e-mail is friggen retarded. It is much easier to store the incoming and outgoing streams of e-mail on the server and use logrotate to create a new file every week, or every day even. As soon logrotate moves a file to backup, it won't change anymore - ever.
      With Postfix, use always_bcc to forward all outgoing mail to a user called outlog, then use procmail to save all outgoing mail to a log file.
      Likewise, procmail can save all incoming mail - after the crap filters - to an incoming log file called inlog.
      Finally, use logrotate to archive the logs periodically.
      Then you don't have to worry about the users and they can do with their copies of the mail whatever they damnwell please.

      --
      Oh well, what the hell...
    9. Re:incremental backup by Anonymous Coward · · Score: 0

      Outlook files are the bane of backups everywhere
      because people leave outlook running overnight and
      changes the file all the bloody time. Every backup
      has "file changed while reading" errors.

    10. Re:incremental backup by surprise_audit · · Score: 2, Funny

      Just print the damn things out and file them. Anyone who wants to subpoena them had better have a fleet of trucks and hundreds of spare staff...

    11. Re:incremental backup by Rich0 · · Score: 1

      Anyone who wants to subpoena them had better have a fleet of trucks and hundreds of spare staff...

      Uh - if you are subpoena'ed the onus is on YOU to comply, and you can only charge reasonable costs for this service.

      So if I ask for one particular email that there is reason to believe that you have, you can charge me $5 for having the secretary copy it and mail it to me. You can't charge me $50,000 because you were an idiot and have your emails stored on paper in boxes in random order in a heap that weighs 100 tons.

      They should just archive email to long-term storage media and delete it. How much space are we talking about, anyway? Hard drive space costs less than a dollar a gigabyte - at retail prices....

  13. An opening for Google? by Anonymous Coward · · Score: 1, Interesting

    As public employees working on publicly owned systems, this sounds like a job for a Google appliance. Of course, the existing email would likely have to be processed in some fashion--possibly manually.

    Henceforth, though, any email correspondence, unless explicitly marked as "personal" or "sensitive" should be placed in an archive searchable by the public. The "sensitive" messages should be placed in another archive not open to searching by the public. How "personal" messages are handled, and what constitutes "sensitive" should be based on policy established by legislation (of course, open to public debate).

  14. Blame On-Line Storage by buelba · · Score: 5, Interesting

    There are two technical culprits here:

    1. On-line storage. There's no reason to keep all of everyone's mail on-line on the server (a la IMAP or proprietary MS Exchange) instead of offline on their PC's (a la POP, most often seen with Eudora for non-techies). With offline storage, the servers don't clog, and you can keep as much mail as you like.

    The biggest rap agains off-line storage is that you can't control what people do with their mail or how they store it. My old job had a neat solution for this: Eudora downloaded your mail, but stored it on a file server. Each employee had 100 GB or something very large. It worked great; the SMTP/POP servers were never full, and everyone could keep their email.

    2. Ridiculous stupid bullshit HTML rich-text mail crap. Can you tell I have a bias here? Aside from being annoying, HTML mail can take up to ten times the size of plain old text. Some of the HTML generated by common email programs is just terrible; filled with repeating tags for every line, and just wasting an incredible amount of space for absolutely zero benefit. (Outlook is bad, but there are others that are just as bad.)

    There's no excuse for not fixing these problems. Someday someone's going to tell a court they had to delete mail for these reasons, and someone else is going to explain exactly why they're wrong. Until then, people who want to delete mail for legal reasons will hide behind false technical reasons.

    1. Re:Blame On-Line Storage by Anonymous Coward · · Score: 4, Informative

      1. On-line storage.
      Actually, storing the messages on local computers in an organization is about the worst thing to do. Most/all user computers are not backed up the way the servers are.

      For legal requirements for some organizations, various backups must be maintained. Just because the active mailstore does not maintain messages older then X days in it does not mean that the data is lost forever (and thus, subpoena-able).

      To do this right, first, the City needs to create a policy that establishes that active e-mail messages will not be retained in the "inboxes" more than 30 days. They should also set up mailstores for everyone in a different area on the same or different server (but NOT to user PCs. they need to define a policy against this, also, because user computers can be subpoenae'd, so if a user has been retaining e-mail messages on their own computer, this could undermine the overriding policy, aka "Smoking Gun").

      HTML/Rich-text e-mail messages
      No argument there!

      It is LEGAL to not retain e-mail messages past a reasonable amount of time as long as there is an organization-wide POLICY in place and reasonably applied over the entire organization, but the policy has to be in place first.

      There is lots of information on the net about this already. I would maybe google for "email retention policy"...

    2. Re:Blame On-Line Storage by buelba · · Score: 1

      Actually, storing the messages on local computers in an organization is about the worst thing to do. Most/all user computers are not backed up the way the servers are.

      As I said in my message, what I meant by "off-line" was "not on a mail server." I specifically suggested putting "off-line" storage on a file server.

      Whether you and I like it or not, there are no mail servers on the market today that are good at storing large quantities of mail for large quantities of people. I wish there were, but there aren't. The best solution is a kludge, but a kludge that works: force people to download their email (making it "off-line" to the mail server) but store it on a file server that is properly backed up.

    3. Re:Blame On-Line Storage by vladj · · Score: 2

      Eudora downloaded your mail, but stored it on a file server. Each employee had 100 GB or something very large. It worked great; the SMTP/POP servers were never full, and everyone could keep their email. Why couldn't you have a 100GB account for each employee on the mail server instead? - what is the bloody point to get mail from one network server and move it to another one?

    4. Re:Blame On-Line Storage by buelba · · Score: 2

      Why couldn't you have a 100GB account for each employee on the mail server instead? - what is the bloody point to get mail from one network server and move it to another one?

      Lots of reasons. First, mail servers just don't work very well when storing large quantities of mail for large quantities of people. I've never seen one that works well. If I'm wrong, please tell me. Second, the file server model is much more flexible: you can spread the accounts out across lots of file servers, but still have one email server for foo.com or whatever. I'd rather be able to store everything on one email server and have everyone have quick access to it, but I've never seen a solution that works.

    5. Re:Blame On-Line Storage by imroy · · Score: 2, Informative

      Check out project Cyrus. I haven't used it for large projects, but I notice it does support distributing mailboxes across multiple backend servers (The Murder stuff).

    6. Re:Blame On-Line Storage by vladj · · Score: 2, Informative

      Try CommunigatePro: it's not open source but extremely reliable and flexible and can handle huge volumes of mail, and can do clustering over multiple servers too. I've had experience with it on couple of large-scale installations, 50K accounts+ with millions of msgs per day.

    7. Re:Blame On-Line Storage by Anonymous Coward · · Score: 0

      Only one problem: a policy that is against state or federal laws is illegal, and WILL get you into trouble.

  15. wrong approach by yppiz · · Score: 5, Insightful

    This has to be the stupidest approach to the problem. Their networks are too slow, so instead, they're going to have each employee go through their old email and save individually important messages to their local hard disk? Not only are they going to tie up employees with this manual effort, they're also going to lose key documents and a key service - the ability to centrally search and reply to requests for information. In the future, each department will have to search their local hard drives for this information.

    They've taken a simple problem of old or improperly speced equipment and turned it into a manual labor solution instead. That's an insane waste of time and salary. They should just upgrade their network and storage. If I can build a 4 terabyte RAIDed PC for a few thousand dollars, they can centralize their mailserver and back it up for say a hundred thousand, even with extra redundancy and inefficiencies and admin costs.

    By contrast, forcing every current employee to perform a task that would eat up weeks of time per employee per year, in a city of Baltimore's size, will cost tens of millions of dollars.

    Dumb, dumb, dumb.

    --Pat / zippy@cs.brandeis.edu

    1. Re:wrong approach by Anonymous Coward · · Score: 0

      Now with your signature you'll be able to experience the City of Baltimore's woes first hand!

    2. Re:wrong approach by ameoba · · Score: 1

      The problem is that using MBOX format for mails, it -really- bogs down the mail server when it has to parse that whole thing.

      Moving older stuff into folders (that are still on the server) would probably make more sense.

      --
      my sig's at the bottom of the page.
    3. Re:wrong approach by fermion · · Score: 1
      This is the exact right approach. There is no reason to upgrade network speed or increase mass storage. It will be a waste of money and possibly open the city to unnecesary legal liabilities.

      What should be done is what it being done. Each employee must be responsible for thier email. If certain email must be saved to satisfy open records policies, those should be shunted to a specific place for achived storage. Current cases must obviosly be kept local for review. All other email that can be deleted should be deleted in accordance to written policies. If the policies state that all email not marked for archival or pertaining to ongoing cases are to be automatically deleted after a certain time, then that is ok.

      As I mentioned before, Companies regularly get in trouble for keeping too much information about. Companies get in trouble for not following thier own retention policies or not having a retention policy. I think we are far enough into this IT era and have enough court cases so that no one would think of keeping any more documentation than absolutely neccesary. We remember a certain accounting firm that no longer exists because the did not wish to spend the employee time or extra money to follow thier own retention money. I bet the partners with they would have spent the tens of millions of dollars.

      --
      "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
    4. Re:wrong approach by Rich0 · · Score: 1

      We remember a certain accounting firm that no longer exists because the did not wish to spend the employee time or extra money to follow thier own retention money.

      I'm not aware of any such company, however I am aware of a company which abetted accounting fraud which is no longer what it once was.

      Surely you're not suggesting that they'd still be in business if their staff were simply better at shredding documents? It might have gotten them out of court (if they were perfect at it), but they'd still go out of business. Would you buy stock in a company which retained the we-hire-great-shredders accounting company?

      The only reason to delete human-typed data in this day and age is because you are doing something that you're concerned might not look good in a court of law. Now, if NASA is collecting 35GB of radio data per hour for the SETI program I can see deleting it once it is processed and found uninteresting. However, if you pay somebody to type something or read something, you can afford to store that document forever. The only question is why are people afraid of doing it?

      With paper shredding made sense - it would cost a fortune to comply with a subpoena which required searching a warehouse with 100 years worth of paperwork. On the other hand, searching 100 years worth of email for a list of keywords is a simple matter.

    5. Re:wrong approach by Anonymous Coward · · Score: 0
      Look at the legal cases over the past few years and look at what the charges were. Increasingly prosecutors are going after people and companies based on the processes they follow. These parties may have in fact committed other illegal acts, but they are destroyed or in jail because of the processes. In business crimes are constantly prosecuted. The key issue is always were processes followed, and people often fall because too much information is kept. This is why long court battles are fought over what is allowed in court. Presidencies rest on what is released.

      Prudence therefore dictates that you have processes that limit your exposure and follow those processes to the letter. This is why certain standards like the ISO-9000 are so important. It is not that they dictate what you do, but the do dictate you do it consistantly.

      It is like driving a car. Prudence dictates you have all your papers and follow all the road rules. There is no reason to give the cop an opportunity to stop you. You may be doing nothing wrong, perhaps you are doing a couple things wrong. In either case, it is best to follow the rules you can, and pray that no one notices the ones the can't.

    6. Re:wrong approach by Rich0 · · Score: 1

      If there is a law which states that companies are liable if an employee looks cross-eyed out the window without first filing a request to do so with the head of HR, the solution is not to make sure that such documents get shredded weekly so that nobody can show non-compliance. The solution in fact is to change the stupid law.

      If a company is breaking the law it should be a result of their doing something harmful to the common good. People shouldn't have to be worried every second of every day that they could be arressted if a police officer decides to selectively enforce some law that everyone violates.

      I'll be the first to agree that we have some dumb laws out there. The solution to that problem is to fix the dumb laws. Not to have companies hide all their activities so that you can't tell whether they are violating either the bad laws or the good ones...

  16. Well... by Ikn · · Score: 1

    I don't really trust any entity but myself to make sure I have important information archived/encrypted/etc. For reasons like this, and this recent Slashdot story

    --
    I know nothing
  17. E-Pencil Pushing by llamaguy · · Score: 0, Redundant

    Yet another example of buereocracy getting in the way of everyday things. It's basically a lose-lose situation - they can't keep on accumilating email, but they can't delete it either for fear of losing anything important. So the solution? Just add a little disclaimer: "Any email stored in this system is liable to be deleted at any time. By using this system you agree unconditianally to this." Voila. No more problem.

    --
    HAH! I just wasted a second of your life making you read this, but I wasted a minute of mine thinking it up. DAMN.
  18. Google to the rescue! by interociter · · Score: 1, Redundant
    Seems like a perfect application of Google's mail technology. Baltimore has tons of mail which needs to be searchable. Google has a scheme for holding and searching large quantities of mail. Plus, even the spam is worth keeping, should Baltimore decide to file suit against someone for attacking the City's technological infrastructure by flooding their servers with spam. Scott Richter, I'm looking in your direction.

    --
    Interociter
    -=What do I want? I'm an American. I want more.
  19. Emails "Flags" by Anonymous Coward · · Score: 0

    Unimportant emails should designate that they are unimportant, by having the last line say "UU" or some other quick, simple system (what's the chance of doing that by mistake?). This would immediately cut out a lot of email so it would not have to be kept.

    There should also be similar flags for emails that need to be kept. This way, the only mail that will have to be processed is the email sent by people who don't know the rules. When the people in charge of email recording start bothering them, everyone will learn pretty quickly.

    Then software can simply look at the last line/letter of each email, and send it where its supposed to go.

  20. Another Solution for Baltimore by superultra · · Score: 1

    Blatimore should offer some muncipal service, say free waste disposal for a year in Baltimore, per gmail swap. Problem solved.

  21. Temporary Fix by Rie+Beam · · Score: 5, Insightful

    Backup all e-mails from the last 4½ years into permanent storage, and then from there, get organized. Put spam filters on, force people to sort any important mail or else it gets deleted after, say, two weeks. People always seem to want to "start from scratch". without looking at the situation rationally. Five years of documents, gone overnight. How can anyone not be at least outraged by that?

  22. Old e-mail - it's a resource by Anonymous Coward · · Score: 2, Funny

    I'm posting anonymously because this may risk my relationship with my employer.

    We see old e-mails as a resource to be harnessed and turned into profit. Thanks to old e-mails we can ensure that no employee leaves with a spotless record since everyone always e-mails something incriminating sooner or later from the company e-mail address.

    We also find that the e-mails are great for data repositories; we fill all of our databases with text and when our clients come in, we tell them that those data warehouses contain terabytes of information.

    1. Re:Old e-mail - it's a resource by Anonymous Coward · · Score: 0

      Wow, it must suck to work for a company that is so fearful and hateful of its employees.

  23. That's what a personal folder is for... by jwcorder · · Score: 1

    Any email that is worth keeping should be moved to a personal folder. I don't see how this is any different then putting a 10 MB limit on an account. Definitely not front page material either.

    --
    http://jayceecorder.blogspot.com
    1. Re:That's what a personal folder is for... by geoffspear · · Score: 1

      Why would the city government care about saving people's personal messages? The problem is that important government documents are going to get deleted, not not their employees might lose 4 year old messages from friends that they were never going to read again anyway.

      --
      Don't blame me; I'm never given mod points.
    2. Re:That's what a personal folder is for... by jwcorder · · Score: 2, Interesting
      A personal folder isn't just for "personal" messages. I am talking about a Personal Folder. As in a *.pst file. As in a place, where you can store messages in an archived format in a file that Outlook treats like that file cabinet that can be saved locally on hard disk and backed up to the server at the end users leisure.

      Any of these so called "important governement documents" shouldn't be stores in an email archive anyway. They should be on a network drive getting backed up.

      My point is that a better solution is to put the email storage in the end user's hands. Set file size limits on their accounts and have them move all important mail off of their server mailbox and into a Microsoft PST file...aka Personal folder.

      I work for Fortune 50 company, running in an exchange environment and this is the method we use for about 4000 corporate employees. They have 10 MB mailbox limitations that will not allow them to send any email when their account reaches 10 MB. We then shut the accounts off when they reach 50 MB and kick messages back to the sender.

      Users who have important email setup Personal folders in Outlook and move messages from their inbox to their PST file. This file is stored locally on laptops (for travel purposes) and on the user's network home drive for desktop PCs.

      We run standard incremental backups daily and full backups once a week. The only problem we have with this is that MS's PST files have a 1 gb limitation before they get corrupt so some of the legal and credit employees have three or four personal folders normally sorted by year. So you would have one file for 2004, one for 2003, etc etc. Works for us, has to work for the government.

      --
      http://jayceecorder.blogspot.com
    3. Re:That's what a personal folder is for... by devilspgd · · Score: 1

      Let me guess... HP?

      10MB is obscenely small, especially with Exchange, since that suggests you use Outlook and Outlook encourages managers to add images, backgrounds, and tons of other painful useless garbage.

      Perhaps what you really need is a more competent admin who is more familiar with Exchange and can run a fully redundant set of distributed mail servers allowing users to keep all their mail in one place, and allowing IT to keep full backups of all email in one place, rather then scattering it around user's hard drives and hoping that they create a backup.

      How, pray tell, is a user expected to access the PST on their desktop when they VPN into the office from home? Or when they use a desktop and a laptop?

      --
      Give a man a fish, he'll eat for a day, but teach a man to phish...
    4. Re:That's what a personal folder is for... by jwcorder · · Score: 1
      We actually don't have a competent admin, but a team of competent admins and a full time Microsoft Employeed Engineer onsite all of the time. We also run about 10 mail servers all in redundancy. No it's not HP. We do not allow users to gain VPN access from non corporate machines. We also don't allow users to connect outside PCs to the network here at the office.

      If they need VPN, their desktop is replaced with a laptop that has the standard corporate image on it. Their pst is kept on their laptop and they are encouraged, but not required, to run a batch file on their desktop that copies all data of importance to their home drive. We leave the responsibility of the user's data in the user's hands. It is their duty to make sure that they keep anything of importance whether in electronic format or printed.

      My point is that if the city government would do the same thing, they wouldn't have this issue.

      --
      http://jayceecorder.blogspot.com
  24. Overload probably not the only reason by GarbanzoBean · · Score: 1

    I've lived in Baltimore for many years and it is obvious that the government. will use any excuse to cover up the extereme corruption. When I was there, a superintendent gave a building contract to his uncle who wasted $100,000 and then $100,000 was spent fixing the errors.

    As an aside:
    If you are gov't employee, it is your responsibility to use email for official business only. All communication should also use proper English (these are not posts to Slashdot) and all the emails and memos should be self-containing (and not include 100k messages of pretty formated reminders of lunches). I don't work in the gov't, but I still follow these rules. This cuts down on most junk.

    1. Re:Overload probably not the only reason by Altima(BoB) · · Score: 1

      "If you are gov't employee, it is your responsibility to use email for official business only."

      Well for some government employees, penis enlarging Vigralisis pills ARE official business.

      --
      Yup...
    2. Re:Overload probably not the only reason by wass · · Score: 1
      Just curious - when did you live here (Baltimore)?

      I'm going on my 4th year here now, and it does really look like Baltimore is turning around. Of course there are some problems, like this email thing as well as financial incompetency problems w/ the public schools. But many other problems seem to be finally coming around.

      For the first time in decades the population in Baltimore is actually increasing, and many formerly bad or sketchy areas are actually quite nice now.

      There's still a bunch of problems but even in the 4 years I've been here I've seen major improvements. I think in 5-10 years Baltimore will be a pretty schweet place to be.

      --

      make world, not war

  25. It's not too hard... by caffeinefiend · · Score: 1

    Clearly those people are abusing the email system; otherwise it would not be such a problem. What they need to do is get the BOFH to keep people's personal email's out of the system... :)

    1. Re:It's not too hard... by ilctoh · · Score: 1

      For the Baltimore Municipal Government? I'd hate to have that job, given the mass of people and machines involved.

      --
      How many slashes would a slashdot dot, if a slashdot could dot slashes?
  26. This shouldn't be a problem. by Entropius · · Score: 0, Redundant

    Millions of old emails?

    They say it "could be as high" as ten million emails. Well, the mean size of an email is probably around 10k, so that's 100GB of old mail.

    100G of storage costs about a day's wages for a city bureaucrat.

    The main problem they mention is that "it takes too long to make daily backups." That doesn't seem to be the mail system's problem--why are they making *daily* backups of static data?

    If you want to make daily backups of your mp3 collection, you don't copy the whole mess every day. You look for new files and copy only those.

    I'm not saying they necessarily need to keep all that old mail. But there's no technical reason why they can't.

    1. Re:This shouldn't be a problem. by Entropius · · Score: 1

      D'oh, everyone beat me to the 100G math. But I still wonder: why make daily backups of all that shit?

      If they do daily backups of everything, seems like that's one hell of a RAID array to store all the bureaucrats' PowerPoint bloat on.

    2. Re:This shouldn't be a problem. by Firethorn · · Score: 1

      Because you can't do an incremental backup on many types of mail server. For example, I don't know of any way to do an incremental backup on exchange's information store.

      Of course, from what I've read, Exchange's storage works like a database system. You have no guarantee where new stuff is stored, as it ends up overwriting old deleted data. It's like a file system that way.

      Now that's a thought, anybody know of a mail server solution that uses SQL for it's datastore?

      --
      I don't read AC A human right
  27. Millons of old spam, most likely. by jafo · · Score: 5, Interesting

    The spam problem is unlikely to go away until people start treating it like the attack on the Internet that it is.

    I've noticed an annoying trend lately that e-mail sent to businesses is frequently getting just ignored. Certainly it seems much more frequent this year than in the past. I've wondered if this is simply because so many e-mail boxes are getting filled up as fast as the spammers can send.

    I'd suspect that the city of Baltimore wouldn't be having any problems if spam weren't such a problem. If the number of messages they had to deal with dropped by 5 to 20 times (depending on which estimates of current spam levels you believe), they could probably just leave the mail where it is.

    This is all something I've been struggling with, being a small business owner doing business on the net. My company of 5 people gets between 4,000 and 20,000 borderline spams per day. By borderline, I mean that we throw away obvious viruses and things which score above a certain score in SpamAssasin (I think it's 9). So, that doesn't count the super spammy messages.

    If it weren't for our fairly strict and complicated spam blocker setup, and a very powerful machine, we couldn't get the few hundred messages per day that are of interest to us. Spam is killing e-mail. I'm not sure why more people aren't treating it as an attack, but it's really hard to get anyone's interest to take some action. Canceling accounts doesn't even begin to solve the problem.

    In the mean time, the City of Baltimore is suffering...

    Sean

    1. Re:Millons of old spam, most likely. by tarks · · Score: 1

      Why should the (end) user treat spam so sereously?

      I never understood this. First of all most spam is easily recognizable by the subject line. Back in the early days of spam I got like 10 spam messages a day and it took me only about 10sec of my precious time to delete them.

      Then came the spam filters. No most of my mail is filtered by gmx then by my university's email relay and finaly by Opera/Mozilla. That leaves me with like 3 spam messages a _week_. Still fairly easy detectable. And all this based on these general filters that surely are more designed not to block _any_ valuable mail. If I sereously trained my Opera filter I surely could get even better results.

      So where is the problem for me? There is none. I can see that it might be a problem for the providers which don't want the additional traffic. But it's none of their bussines. It's not up to them to judge if the traffic is valuable or not. I even might like to recieve spam. There are people that read printed advertisments too. Even tough I do not.

      Bottom line: Install a decent filter and forget about spam

    2. Re:Millons of old spam, most likely. by Wateshay · · Score: 1
      Bottom line: Install a decent filter and forget about spam

      Hmm... that's not a very good solution when you run your own mail server (which is very reasonable for a small company), and have to invest in more server hardware, and more bandwidth just to accommodate the spam. Secondly, not all spam is "easily recognizable by the subject line". Spammers are starting to get clever, and there are a lot of messages where I suspect the message is spam, based on the subject line, but the possibility that it's not necessitates me opening it up just to make sure.

      --

      "If English was good enough for Jesus, it's good enough for everyone else."

    3. Re:Millons of old spam, most likely. by Tony+Hoyle · · Score: 2, Insightful

      Spam is easily recognized by the subject line? Boy I wish I was getting your spam instead of mine!

      Mine's full of:

      hi
      how are you?
      Please Complete and Return
      I miss you
      Fwd: I need your help
      Re: Your Account

      etc... etc...

      Any one of these could be legitimate (occasionally you get a headline that's so inocuous I think the spam filter has got it wrong... until I actually read the email).

    4. Re:Millons of old spam, most likely. by Anonymous Coward · · Score: 0

      Well. 99.5% of all my spam has Subject: [SPAM]blahblah thanks to spamassassin that I run on my mailserver. It is not that difficult to run spamd on a separate server, or increase the CPU in the mail server...

    5. Re:Millons of old spam, most likely. by ModMeFlamebait · · Score: 1

      I guess I'm lucky that English is not my native language then. I almost never get mail with English subjects that are not spam (except from clearly marked mailing list messages).

      Still, I don't really have the spam problem (several spams a week, at most, 99% neatly filtered into trash)

      --
      Pavlov. Does this name ring a bell?
  28. 1984 anyone? by mtg101 · · Score: 0

    If email is deleted without record, what's to stop the powers that be claiming anything they want, and simply rewriting history? Did that email say "mission acomplished"? Or did it say "major combat over"? There's no record. But trust us, we said it: we said "major combat over". If you don't belive us, you're a terrorist.

  29. Dude, you'd think they would have the sense... by DAQ42 · · Score: 2, Interesting

    to dump it off to tape and then just store the tapes instead of just deleting it. Though they are probably running an Exchange server so offloading data stores wouldn't be the easiest thing to do. If they were using something with a simple mbox store, they could easily just parse it through a date filter and dump the older than 90 day stuff to tape. At least then it could be retrieved at a later date.

    Oh, wait, let me guess, they aren't using tape backups...

    --
    Don't Ask Questions. I don't know the answers and even if I did I wouldn't tell you.
    1. Re:Dude, you'd think they would have the sense... by teg · · Score: 1

      to dump it off to tape and then just store the tapes instead of just deleting it. Though they are probably running an Exchange server so offloading data stores wouldn't be the easiest thing to do. If they were using something with a simple mbox store, they could easily just parse it through a date filter and dump the older than 90 day stuff to tape.

      It's a lot tougher on backup systems to deal with mbox systems, because every time a flag is changed in a mail or a mail is added, the entire mailbox is added. Also, not possible to save a little bit of space by saving just one copy of a mail if there are multiple recipients handled by the same server.

      Speaking of that, anyone know of a good, working database-based mail server for Linux?

    2. Re:Dude, you'd think they would have the sense... by DAQ42 · · Score: 1

      Re-read my comment.
      If they were using something with a simple mbox store, they could easily just parse it through a date filter and dump the older than 90 day stuff to tape.
      If the creation date is older than 90 days, off load to tape and delete original. End of story.
      Sure, means they can't go through stuff older than 90 days, but if they need it, restore from tape. Geez.

      --
      Don't Ask Questions. I don't know the answers and even if I did I wouldn't tell you.
  30. not all people quote the entire post... by The+Monster · · Score: 1
    There are always going to be things like replies to an original question and subsequent follow up questions going back and forth, so normally hanging onto the latest/final reply would be sufficient (providing it had the previous history - clearly showed the conclusion).
    I'm old-school when it comes to email (probably because I've been a BBS sysop who had to worry about bandwidth consumption), but you've touched on one of the two big problems with most corporate email cultures:
    • Top-posting a reply, while quoting the entire original message, which recursively contains everything back to the start of the thread (I have actually caught crap sometimes for NOT re-quoting the entire message, because people don't know how to use Outlook to follow a thread any other way.)
    • Gratutious, inefficient use of HTML (e.g. FONT tags everywhere instead of stylesheet-based markup) and graphics ('stationery')
    These bloat emails by entire orders of magnitude over plain text with minimal quoting, which is sufficient in virtually all cases, and could be retained forever with no problem. Even with all the fluff, at current disk prices, I'd say that archiving old messages (and compressing them in the process, even if all of them were converted from HTML to plaintext) on a server with backup tapes that could be pulled out in case an investigation were conducted, would be pretty darned cheap.
    --

    [100% ISO 646 Compliant]
    SVM, ERGO MONSTRO.

    1. Re:not all people quote the entire post... by Hatta · · Score: 1

      Any HTML at all is unacceptable in an email. And you forgot the most egregious error of all, word attachments.

      --
      Give me Classic Slashdot or give me death!
  31. Archive it by jdkane · · Score: 1

    Archive the whole lot of it, and/or copmress it and store it. Don't even try to sift through it all. If and when it is needed, then get it out and pay somebody to sort through it.
    Then it's not clogging anything anymore, and also it's there if you ever need it.

    1. Re:Archive it by lordscotus · · Score: 0

      Definitely! Archive it, and then delete it -- after a specified period. .... That stuff should probably be backed up somewhere anyway, in case they need to rebuild a crashed system. That way we'll be able to recover the one few pieces we eventually need to keep some public servant serving the public,

  32. Doesn't that defeat the purpose... by JimmyJava · · Score: 1

    of going paperless? Surely the reason for using email instead of memos and letters is for cutting costs, environmental protection, etc. With data storage so cheap, why not archive all the old mail instead of trashing it?

  33. Deleting older than 90 days common? by swb · · Score: 3, Interesting

    OMFG, we nearly had a lynch mob attack us when we began deleting mail older than *two years* -- it eventually took the intervention of the CFO and a faked mail system "crash" to make 2-year max retention work, and even then there are people still pissed about it, or who claim that "the client" requires them to retain all correspondence (nope, sorry, we checked the contract).

    90 days seems both unrealistic to implement and way too much reliance on .PST files, which often max out at 2 gig and can get corrupted way too easily, not to mention being fdisked into eternity by clueless helpdesk people.

    1. Re:Deleting older than 90 days common? by geekoid · · Score: 1

      " "the client" requires them to retain all correspondence (nope, sorry, we checked the contract)."

      Yes, clients love it when you've been doing something for them(even if not in the contrat) and then take it away. Great customer service you got there. People who deal with clients do all kinds of thing that aren't in the contract to keep the client happy.
      If I was doing business with someone and they sudddenly stopped giving my the service I have come to expect, I would spend my money elsewhere.

      Just burn them to glass, and warehouse them.

      Part of the issue is the damn format emails are kept in, I perfer them to be kept in a nice neet text file that I can read with any text program, compress, encrypt delet, what ever, keeping them all in one file has never been a good idea. Thats niether here no there.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    2. Re:Deleting older than 90 days common? by swb · · Score: 1

      Yes, clients love it when you've been doing something for them(even if not in the contrat) and then take it away. Great customer service you got there. People who deal with clients do all kinds of thing that aren't in the contract to keep the client happy. If I was doing business with someone and they sudddenly stopped giving my the service I have come to expect, I would spend my money elsewhere.

      "The client..." is typically a knee-jerk reaction by the marketing droids whenever they want something they can't otherwise get. In this case, they're too lazy^H^H^H^Hbusy to manage their business and they're afraid if they delete *anything* the client will ask for it and they'll have to do more work.

      In reality, the client seldom asks for old email generally or some old email specifically -- since THEY typically have a more reasonable retention policy and assume that we do too.

      Just burn them to glass, and warehouse them. [...]

      .PST files (at least in Outlook 2K) aren't openable off of optical media, as Outlook wants R/W access to the file. .PST files are also limited to sizes less than 2GB in our experience, and we've had users with as many as 3 .PST archive files. Archiving also just moves the problem from the email server to a file server or to the help desk (who will lose it in a machine swap) if it sits on a hard disk.

      Part of the issue is the damn format emails are kept in[...]

      I'd say it's almost all of the problem. The problem is that in an organization and with any substantial quantity of email, you need to have it stored in a DB. Plain text is nice for a single, savvy user, but even IMAP and Pine can choke on huge mbox-format files. If MSFT put their SQL2000 engine in Exchange, it might solve a lot of problems for both archiving and for better long-term retention.

  34. In Texas you can't do that legally by Anonymous Coward · · Score: 1, Informative

    In Texas, email to any state or local public official (either elected or appointed), and certain categories of state and local government employees constitutes both a "public meeting" and "public record". The state's record retention laws say that the email must be kept a minimum of three years, and depending upon content up to 7 or 10 years, or perhaps even forever. If an email is deleted prematurely, then state law provides various levels of punishments for different degrees of tampering with, or destruction of public records, which can be as severe as state jail felony hard time if you've destroyed any email that could be construed as evidence in any criminal court case.

    1. Re:In Texas you can't do that legally by Rahga · · Score: 1

      I live in Texas, and if that's the case, I strongly advise state or local public officials to close all of their public e-mail accounts, and only use web-based contact fill-out forms.

    2. Re:In Texas you can't do that legally by surprise_audit · · Score: 1

      So any company dealing with state officials should print out each email and "file" it by sending it to the official's office with a note: "Hey y'all, here's that record you want to keep forever. Look after it yourself."

  35. Client-side storage is not a good solution by Vandil+X · · Score: 0, Flamebait
    i don't see what the big deal is. im sure if they droped $100 into each computer for a 80gig+ drive there would be plenty of space for -gasp- email
    Obviously you have never had to manage an IT budget in the commercial or government sector. IT spending is way down. Buying (and installing) a $100 hard disk drive for each employee at the City of Baltimore would be a horrendous expense, let alone for your typical company of 200 users.

    There's also a bigger problem with client-side archiving: workstations go down. Be it from OS/software failures to hardware failure, the client-side solution is a nightmare waiting to happen when it comes to the protection of important data.

    A better idea would be to write a script to go through each user's mailboxes every month, export any old emails to text, store the files on a server that uses a journaling filesystem, index the emails, and compress them.

    One or two XServe G5s could do the trick quite well.
    --
    Up, Up, Down, Down, Left, Right, Left, Right, B, A, START
    1. Re:Client-side storage is not a good solution by Donny+Smith · · Score: 4, Insightful

      >A better idea would be to write a script to go through each user's mailboxes every month, export any old emails to text, store the files on a server that uses a journaling filesystem, index the emails, and compress them.

      No file system will save you from multiple HDD failures; they should save old (>12 months) data to DVD burners and/or tapes or cheap SATA storage. One can buy 1TB of external SATA space for couple thousand dollars.

      >One or two XServe G5s could do the trick quite well.

      What do XServe boxes have to do with generic application like email? Besides, they're more expensive than comparable Intel+Linux servers (especially considering the fact that CPU perormance is unimportant for most mail servers).

    2. Re:Client-side storage is not a good solution by dasdrewid · · Score: 1

      I think he means the XServe RAIDs, not the servers themselves. Even so, those things are still a bit overpriced for the storage needs here. The only real advantages of those things I can find is if you live in a pure Mac environment, they're probably easier to admin or something.

      --
      No trespassing. Violators will be shot. Survivors will be shot again.
    3. Re:Client-side storage is not a good solution by Neuroelectronic · · Score: 1

      god damn. you get the idea. storage is cheap, thats my point. besides they probably dropped a $1000 on each of these employees computers at the time. now you can get an awsome client computer for $300. like they really mind blowing our tax money

    4. Re:Client-side storage is not a good solution by Donny+Smith · · Score: 1

      Oh yeeeeaaah, I've seen those in action - quite slow - 40MB/sec per channel - the way of SATA...
      I think they'd be painfully slow for email, especially Qmail/Postfix type that uses the Maildir format.

      Yes, they're easy to administer and have nice blue LEDs.

  36. Do they understand the value of data? by amichalo · · Score: 1

    I question wether the article author understood 'take offline' and 'delete' as the same thing, though they are so very different.

    Data is valuable, and Sysadmins know it. (Values such as when combating a lawsuit as the poster suggests or for trend analysis, contact information, or other historical purposes.)

    That said, hard drive space is inexpensive and archiving to optical medium is even LESS expensive. When 47 GB of DVD media can be had at Target for less than $10, it makes NO sense to destroy this data.

    --
    I only came here to do two things; kick some ass, and drink some beer...looks like we're almost out of beer.
    1. Re:Do they understand the value of data? by Anonymous Coward · · Score: 0

      what Target do YOU shop at? at the Target's here in my town, a spindle of 10 DVD's is minimum $15-$20, for the higher end ones it's like $25.

      10 DVD's for less than $10...bah

  37. Nothing new to see, move on.... by InternationalCow · · Score: 1

    So, what's new? The community I live in is famous for losing hard copies of just about anything you can imagine. I'm not sure whether it has really been lost or whether it was decided that it was better to be without certain possibly troublesome papers. So now they're intentionally losing stuff in order to avoid being drowned in cruft. It just goes to remind that whenever you deal with authority, you should keep hard copies of your correspondence :)

    --
    ----- One learns to itch where one can scratch.
  38. Some gov't org do this on purpose by Anonymous Coward · · Score: 2, Interesting
    Where I work, which is government, we only keep backups for about a week, since they are public records.

    We don't want someone to be able to request something from backups that the user thinks is gone.

    This way it's up to the user to decide if they want their data archived. And the onus is on the user to comply with however long the data is supposed to be kept before being destroyed.

  39. Problem with email by sydbarrett74 · · Score: 2, Insightful

    This highlights a fundamental problem with email -- many people pass documents as attachments, or in the body of the email, instead of using email as a sort of metadata describing their works in progress. Documents shouldn't be passed around in email; they should be stored on a network share, where proper controls for mutual exclusion and such can be employed.

    --
    'He who has to break a thing to find out what it is, has left the path of wisdom.' -- Gandalf to Saruman
    1. Re:Problem with email by Anonymous Coward · · Score: 1, Interesting

      There's another problem with keeping so much e-mail.

      When the agency *DOES* get sued (and it will sometime in the future, everyone gets sued), those users will have to sort thru *ALL* Those messages. Who's going to decide what's relevant and what's not?

      Who's going to decide what's attorney/client privelage? I don't think any public agency can have a bunch of *lawyers* go thru 100,000's of e-mails.

      What happens the next time they're sued? They have to go thru them again!

      The problem is that with most government agencies (and I work for one), is that the records retention policy/schedule doesn't specifically address e-mail, and never envisioned the mass quaintity that it could generate.

      Our agency as in the process of adopting a similar plan. If the message is relevant, it gets printed and put in the paper file. If there's an electronic folder for the project, it gets saved to disk there (and would be alot easier to turn over in case of a lawsuit). All e-mail is deleted off server after 90 days.

    2. Re:Problem with email by cavebear42 · · Score: 1

      I back you 100%. I can not seem to drill into the heads of my customers that they can not be sending files by email. We have tried everything, we have given secure drop boxes to delver files to people, we have put up a public access scratch drive (w/ 7-day auto delete) to dump jokes and such, we have posted step by step instructions to make links to files in an email. We limit live mail boxes to 100 MB w/ a nag screen and 200 MB send privleges taken away. We limit the attachement size to 30 MB external and 50 MB internal. If you are over limit, you must store mail to local drive. If you want it saved, you must move to CD, DVD, or tape. Even beyond this all, mail server (like all other servers) has minimum 1 year of incremental backups stored to tape off-site. Even with all this info and restrictions over 50% of the users see the nag screen on a daily basis and people (esp HR) still attach 3MB word docs and send to over 100 people.

  40. Email == offical documents ? by drgonzo59 · · Score: 2, Insightful

    I am not sure if they can use email as official communication? There would be problems with repudiation ("we never received it"), privacy ("someone intercepted it who was not supposed to") and authentication ("it wasn't me who sent it, it was my dog"). Can they use an email in the court then? What would have to be done is to have all the messages signed and encrypted with a public key, and perhaps have some way for the sender to get a receipt back when reciever reads the message.

  41. Some info on record keeping in Maryland by Chatmag · · Score: 1

    According to Georgetown University that I found regarding retention of records in Maryland:

    "8. Has any public records legislation/administrative regulation been proposed calling for "permanent public access" to electronic public records? _x__ Yes ___ No a. If "Yes," cite to and briefly discuss the legislation/proposed regulation; what was the outcome? Arguably, Maryland has such a provision in MD. REGS. CODE tit 14.18.04. Certain electronic records may be considered "permanent electronic records" in they have "sufficient historical, administrative, legal, fiscal, or other archival value to warrant preservation by the Archives beyond the time that the record is needed by the agency that created it." MD. REGS. CODE tit 14.18.04.03(B)(15). Nevertheless, many electronic records will not rise to the level of importance that will ensure permanence."

    The hard part is determining what is important to save and what is not. In general, 7 years is the standard retention time. In our litigious world, keeping anything to prove your case until the statute of limitations runs out is a wise move. Losing emails you don't want your g/f to see is technically called an "oops, I accidently hit delete"

    --
    Pete Carr Owner Chatmag.com
  42. Amen brother! (nt) by Anonymous Coward · · Score: 0

    nt

  43. Try 30 days by Anonymous Coward · · Score: 2, Informative

    Though I don't work in the auditors office in my state, here is what they implemented. Any document (digital or not) over 30 days must be made public. Solution, any e-mail over 30 days is deleted. It allows them to not worry about keeping all e-mail till the end-of-time and not worry about making e-mail public. Great solution in that scenario.

  44. Did they even Look for offline soultions? by Ohm2k · · Score: 5, Informative

    Working at a law firm we have to keep everything for 7 years. We have a system in place that takes all mail over 90 days old pulls it out of exchange and move it to the SAN. As a plus it puts a link back into the information store to make it look like the message is still there. User wants a Old message he can still get it himself w/o a IT person having to do dig up a tame, restore the file and the e-mailing it to him (Thus creating MORE mail). The messages are still searchable and it makes retrival when needed a snap.

    Mind you, we are only a 700 user shop. But nothing gets deleted. If it gets buy the spam filter it gets saved.

    --
    People find it strange that I don't know how to juggle or tap dance.
    1. Re:Did they even Look for offline soultions? by dcsteve72 · · Score: 1

      I'd be interested in learning a little more about the solution that you are using, so obviously, offline would be a better option for communication. I can be reached at dcsteve72 at yahoo.com. =Steve

    2. Re:Did they even Look for offline soultions? by Anonymous Coward · · Score: 0

      I'm also interested in this solution. Please contact rucool at rucool.no-ip.com

  45. This issue isn't limited to the City of Baltimore by Flounder · · Score: 4, Interesting
    I work in the IT department of a county close to Baltimore. Our server can retain e-mail indefinitely (there is a space limit per mailbox, but not a time limit). However, our backups only go back 30 days. This is stipulated by the county lawyers.

    As far as I've been able to figure out, this arose from a lawsuit against the county where an e-mail retrived from two years previous proved a county commissioner to be taking bribes in a zoning issue.

    Rather than fix the corruption, just ensure that it's covered up more efficiently. Gotta love local governments.

    --

    No boom today. Boom tomorrow. There's always a boom tomorrow. - Cmdr. Susan Ivanova

  46. A mark or procedure for official business by jhines · · Score: 4, Insightful

    Once an actual human person has read and acted on the mail, they should be able to mark it "official business" and/or move the email into an "official business" folder which does get kept as required.

    Better procedures and training goes a long way here. These same folks have no problems with snail mail.

    1. Re:A mark or procedure for official business by Anonymous Coward · · Score: 0
      Once an actual human person has read and acted on the mail, they should be able to mark it "official business" and/or move the email into an "official business" folder which does get kept as required.
      We use SpamCop and ORDB, ClamAV, and SpamAssassin. Anything that fails the DNSBL test gets bounced, anything that ClamAV or SpamAssassin doesn't like gets marked as spam (viruses get stripped). Anything marked as spam is deleted after 3 weeks unless the user moves it out of the spam folder.

      Basically, we're doing the reverse of your suggestion; using software to mark it "not official business" and auto-discarding it.
    2. Re:A mark or procedure for official business by Anonymous Coward · · Score: 1, Funny

      1. All people are honest and therefore this is a great idea. Only companies like Microsoft, Enron, AOL, CA, and the entire mutual fund industry are stupid enough to communicate incriminating information via email.

      2. History has proved again again that people are very very good at figuring out what information will be important to the future.

      3. All we need do is train people. Once trained no one will ever make a mistake.

    3. Re:A mark or procedure for official business by globalar · · Score: 1

      You mention a good point. Email messages have different purposes. Some are chit-chat, some worthless jokes, some spam, some messages external to the organization, some internal, some from your boss, some from your employees, etc. But it's all the same in the inbox, governed by the same disk quota and archival policies. Perhaps the file/folder dynamic for email is a little lacking (I think of IBM's Remail, though I'm not sure if this really hits the problem). We just recently got "junk" as a new category with our filters. People use email differently of course, but we have few methods of treating messages differently outside of the user's direct intervention.

      There are different ways to improve this. Perhaps one is to look at making email messages more like documents (like XML) and give the user more options in creating, saving, and searching their messages. Another is to improve indexing and categorization apart from the user entirely (something like Google). Another is efficient and longterm storage (like seperating attachments from mail).

      Right now we have some more crude solutions like saving or deleting mail based on spam-targeted filters, timestamps, etc. But you are right, I think the solution has to start at the inbox and the user. We may need some fundamental improvements to email. Of course, we risk overcomplicating our busy and over-technical lives.

  47. Re:This issue isn't limited to the City of Baltimo by harlows_monkeys · · Score: 1
    Rather than fix the corruption, just ensure that it's covered up more efficiently. Gotta love local governments

    I'm surprised that there aren't any state laws that would override that local limit.

  48. problems with that by drgonzo59 · · Score: 1

    The problem is that their average might not be quite that. Usually the file sizes in large systems (operating systems and web) follow a long tailed distribution, that is the chance of seeing a very large file all of a sudden is not too low. So most messages during a certain day might just be about 5k each but then someone sends a 5MB PDF brochure or a zipped folder of "really cool images of sunsets or mountains" that are worth a thousand smaller 5k messages.

  49. Require mail from the public be encrypted by harlows_monkeys · · Score: 1
    The way to deal with spam in a situation like this, where you may be legally required by state record keeping laws to archive records, including email, for a long time, is to only accept encrypted mail from the public.

    Actually, it doesn't have to be encrypted--any hoop that you can people jump through to mail you is fine, as long as it isn't something that spammers will be able to automate. For example, you could also use a randomly generated email address that changes frequently, and provide a website with a "mailto" link and a challenge image. (However, the challenge image might provide problems for blind people, so that might not fly).

    This should cut out almost all the spam, cutting mail down to a managable volume for archiving, so you can get back to worrying about the other problem with satisfying record retention laws: finding a way to keep the data. Some states require records be maintained for a long time--long enough that you have to worry about media life, and the availability of readers even if the media lasts long enough.

    1. Re:Require mail from the public be encrypted by Tony+Hoyle · · Score: 1

      How much of the public though have the slightese clue about encryption?

      I once tried using X509 to everyone, but Outlook express just refuses to display the message and puts up a huge warning about a corrupt email (all other mailers handled it fine - OE just doesn't support X509 correctly), so I'd just get an email back that said 'your mail was corrupted and I couldn't read it'.

      PGP is worse. It isn't supported by *any* mailer widely uses mailer (installing an extra 'plugin' does not count - most of the people I talk to have absolutely no idea what a plugin is let alone how to install one).

      I can try that on my home email and get away with it for a week or two (before being forced back to plaintext)... if a local government tried it it'd piss off thousands (possibly millions) of people and make the local/national papers.

    2. Re:Require mail from the public be encrypted by gerardrj · · Score: 1

      I think the last think the government wants to do in encourage MORE of it's citizenry to start communicating via encrypted mechanisms. The police would never allow such a policy to be enacted, it would severely limit their ability to "gather information to prevent terrorist and criminal activity" or some such bullshit.

      --
      Article X: The powers not delegated... by the Constitution...are reserved...to the people
    3. Re:Require mail from the public be encrypted by mikael · · Score: 1

      Actually, it doesn't have to be encrypted--any hoop that you can people jump through to mail you is fine, as long as it isn't something that spammers will be able to automate. For example, you could also use a randomly generated email address that changes frequently, and provide a website with a "mailto" link and a challenge image.

      Would it be possible to assign an E-mail address on a per-query/per-case basis. That would allow you to sort correspondence by thread automatically.

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
  50. Re:HTML in email? Forbidden! by Kent+Recal · · Score: 1

    Occassionally, we need to send documents to each other, or to our clients. These are not just simple text documents - they are design specs, product proposals, and other docs that contain images, graphs, charts, and other multimedia content.

    Sounds like you want to learn about pdf.

    Why shouldn't we just email them the document, if we know that everyone in the circle has Word?

    Because many word documents contain a history of the last n changes that were made to them. So your client might get to see some "old" figures that they're not supposed to see. Or the name of that other client that you've sent that .doc to the other day...

  51. Why is there a problem with retention? by phr1 · · Score: 2, Interesting

    Just dump the old email to DVD-R and archive it somewhere. If someone wants to subpoena it, burn off copies and wish 'em luck. Even if the city is getting a million pieces of spam a day, at 5kb each after data compression, that's just one DVD-R per day at a buck or so each, peanuts compared to what the city already must spend xeroxing memos for records retention purposes.

  52. Removing old messages isn't the best option by crimoid · · Score: 5, Insightful

    A better option would be to archive old messages rather than remove them entirely. From the article it sounds like they are keeping ALL messages active all the time. For example:

    "They say the system is so overburdened that creating a daily backup has become impossible; there is so much data that it takes more than 24 hours to copy it."

    So, it seems like the solution would be to periodically lop off old messages to offline storage (tape, spare drives, whatever). In the event of a lawsuit the old messages could be reasonably recovered and the cost for such a system would be extremely minimal.

  53. Cheap solution by MikeCapone · · Score: 1

    Just rzip (better than bzip2 with large files) the email archive and burn it to DVDrs. So in case of real legal necessity, it's possible to access it and the whole setup (DVD-burner + media) probably cost around 100$.

  54. Re:HTML in email? Forbidden! by Anonymous Coward · · Score: 0

    Sometimes an editable format is needed because you want the recipient to edit the document. Retaining the change history of document is also important if the sender expects to recieve changes.

    Jackass.

  55. Complying with Public Records Acts by EconomyGuy · · Score: 4, Insightful

    Unlike a legal office where communications are governed by extensive regulation, governments are really only required to keep records of official documents and decisions. The myriad of e-mails leading up to a decision are not generally protected under such an act, nor are snail mail or phone conversations. In fact, the whole idea of there being a digital trail to follow for governmental decision making is really very new. Does it makes sense to change that practice? Do we really think our government officials should be so closely watched that EVERY e-mail/phone conversation/smoke signal should be recorded and exposed to public scrutiny? Talk about making an unattractive job even less inticing.

    In responce to the posters question about all those subpoenas: welcome to the world of civil litigation, where the first one to destroy the evidence wins!

    --
    Only 120 characters... who can summarize their entire world understanding in 120 characters?!
    1. Re:Complying with Public Records Acts by gerardrj · · Score: 2, Insightful

      Yes. When a government official is supposed to be acting in the best interest of the people they should be subject to scrutiny at any level that is reasonably available.
      Storing older emails is a rather trivial issue of collecting, compressing and copying to an inexpensive tape or hard drive which can be archived. A 250GB IDE drive is quite inexpensive and could probably archive several hundred million emails, many more than the city is claiming it will delete.

      In a time when the government is fading further from the ideal of a democracy, or even the republic which it's supposed to be, I think that accountability of the elected and non-elected government workers is critical.

      In this particular case, it seems that all of the city's email is in one central location, otherwise how could they just delete it without putting that responsibility in the hands of users, or sending support people to each and every desk.

      Pick a day, halt all email access inbound and outbound (government usually doesn't work on Sunday), copy all the existing email to a drive, then start the deleting process.

      --
      Article X: The powers not delegated... by the Constitution...are reserved...to the people
    2. Re:Complying with Public Records Acts by EconomyGuy · · Score: 1

      I think spending a few years in government might get you to sing a slightly different tune on the question of scrutiny. In my experience most government employees are really trying their best to do what they can with limited budgets. When we start to place bureaucrats under heavy inspection (different from elected officials, who have a whole other set of methods for evaluation) we end up having employees who are more worried about how they appear than the actual quality of the work preformed.

      There is a critical balance to be struck between fairness/openness and efficiency. I am certainly one who falls to the fairness/openness side of the equation, but it has to have limits. Otherwise we'd just have elections every day and he who fakes it best wins.

      Not my ideal, and certainly not the ideal envisioned by our founders or those who adopted the 10th amendment.

      --
      Only 120 characters... who can summarize their entire world understanding in 120 characters?!
    3. Re:Complying with Public Records Acts by DerekLyons · · Score: 1
      Unlike a legal office where communications are governed by extensive regulation, governments are really only required to keep records of official documents and decisions. The myriad of e-mails leading up to a decision are not generally protected under such an act, nor are snail mail or phone conversations.
      Basically, that's nonsense. Goverments are required to keep whatever documents and records their regulations require them to keep. That can very from 'dammnearnone' to 'everydammthing'. Here in Washington State certain types of phone conversations, snail mail, etc. are actually *forbidden* to be used by public servants because the law requires them to occur in open and public meetings, the transcripts of which are official documents and are required to be kept essentially forever. There have been several cases where retained emails and phone logs have been used to show that the individuals in question knowingly violated these requirements.
    4. Re:Complying with Public Records Acts by EconomyGuy · · Score: 1

      Washington State, my home state, operates under the Open Public Meetings Act which requires quite of bit of record keeping... however, it does not include the recording of bureaucratic conversations unless integral to the decision making process. The e-mail between one secretary to another is not part of the public record, whereas the minutes of a School Board are. Concerning phone calls, the one example I know is that conferences calls where a majority of a decision making body can speak to eachother all at once are forbidden because they could effectively hold their meetings beyond the public eye. But this is very different from a phone conversation between an employee and his or her spouse.

      --
      Only 120 characters... who can summarize their entire world understanding in 120 characters?!
  56. EMC Centera by neillt · · Score: 1

    Isn't this exactly what products such as EMC's Centera were designed for? No, I don't work for EMC, but I have worked with the Centera... it does the job well.

  57. Re:HTML in email? Forbidden! by Hatta · · Score: 4, Insightful

    I'll handle these in reverse order.

    Word attachments are acceptable when they are just a means of moving files around, and not the entire content of the email. What is not acceptable is expecting me to load a large word processor just so you can use the company letterhead. In my experience the latter type is far more common. Besides the security implications (macro viruses, etc), I do not have a gui on the computer I read my email. Nor should I need one.

    As for HTML email, I'm simply not going to render strange IMG tags. They could lead to goatse, or back to a spammer's site, and now they know my email is active. HTML email generally looks like it was designed by an 8 year old with downs syndrome anyway. Plain text is just more readable for nearly every email. Check out HTML email is STILL evil!!! for more.

    --
    Give me Classic Slashdot or give me death!
  58. email being useful by zogger · · Score: 1

    how many actual business inquiries do you get out of all that email? If it's not too large a number, wouldn't it be easier to just switch back to POTS for your business, and just trash the concept of email as being unworkable at this time without a ton of headaches? Just a POTS and an answering machine might be all you need.

    Another alternative might be to use webforms instead of email, and indicate any replies back to the prospective customer will be done on your nickle, on the phone, and they should give you a time and date to return the call.

    1. Re:email being useful by jafo · · Score: 1

      The problem with POTS is that some people DO NOT LIKE TALKING ON THE PHONE. It actually took me quite a long time to realize that you have to be careful to communicate with people in the way in which they are most comfortable.

      I'm one of these people who absolutely hates talking on the phone. Which is a challenge, becaues we have clients who absolutely hate communicating in e-mail. We've realized that we were losing business because we weren't communicating in the right way with certain of our clients.

      Switching over to only POTS communication, even with us having an 800 toll-free number, just isn't an absolute solution. Particularly as I really like working in coffee shops and other locations, to get out of the office.

      Don't suggest "Get a cell phone", I hate talking on the cell phone more than on a landline, because of the whole "Hello? Hello?" shite. I carry a cell phone to let clients contact me in emergencies, but I always try to conduct non-urgent business on a landline because I always seem to end up in crappy coverage areas.

      I'm moving in a direction of becoming much more anal about what e-mail is allowed through. Changing the default from "allow mail in" to "block mail" and then whitelisting clients, addresses we send to, mailing lists, etc. Others can contact us by phone or through a web form.

      I need to think about that more, but something needs to be done. It's just impossible for us to deal with an average of 2,000 or more spams per day per person AND get our job done.

      Sean

    2. Re:email being useful by zogger · · Score: 1

      using the webform for the first contact, then a whitelist with the customers email, and blocking/filtering everything else sounds like the best solution for you then, for those who don't want to use the phone. I agree, wading through 2000 spams a day is nuts, it's like a totally second job that doesn't pay, it just costs.

      And geez, it just doesn't seem all that long ago when you never got much spam if any. I've almost given up on personal email, I think at most about a dozen people have my addy now, and I stopped being on mail lists for various topics.

      Anyway, good luck however you work it, it's hard enough now to just run a business without all the other sideways hassle.

  59. Inept IT by dtfinch · · Score: 1

    They say the system is so overburdened that creating a daily backup has become impossible; there is so much data that it takes more than 24 hours to copy it

    I find that rather hard to believe. They only need to back up the new emails, then they can delete them at any time without actually losing them. I doubt they see many terabytes of new email every day. Nine times out of ten, any IT tech who says something is "impossible" is just lazy and/or incompetent.

    1. Re:Inept IT by surprise_audit · · Score: 1
      Dunno if this is still true, but some time back I read a rant about Microsoft Exchange mail servers where the ranter was bitching about the email being saved in one single file... You have to archive the whole damn thing as one lump, even if it gets to 50Gb. There's no "search the filesystem and just save files added/modified today" feature. And when a user wants a mailbox restored, guess how much has to be restored?? Yep, the whole damn file.

      The article isn't specific about what the mail server is, unfortunately, so we don't know if we have reason to blame Microsoft... :)

    2. Re:Inept IT by SlamMan · · Score: 1

      Nine times out of ten, any IT tech who says something is "impossible" is just lazy and/or incompetent.

      Or doesn't have the budget.

      --
      Mod point free since 2001
  60. Saving to local drives? by Gonoff · · Score: 4, Informative

    We have to spend a lot of time telling people to **NOT** save to local drives. If it is important or confidential, or may be in the future, this should not be saved locally unless you want to loose it or explain to an enquiry why it was found on sale in a car boot sale after a break in. This is what a network is for.

    The answer to the problem in the article is quotas. *nix has them, Novell has them and even Windows has them. Our email quota works as follows
    Limit 1 - email user once per day marked high importance that they are getting close.
    Limit 2 - disable sending and continue with (2k) warning message.
    Limit 3 - disable receiving apart from one final message saying that it would all start working again when the user clears some space

    When they can't send/receive, they get a dialogue box reminding them when they try and when they can't receive, the sender gets a messge.

    This does make for support calls like...

    "Why does my computer tell me that the email is full up and I can't send any more?"
    "Because your email is full up. You have a message explaining this to you."

    "X tried to send me an email and it bounced saying that my mailbox was full up. Why?"
    "Because your mailbox is full up."

    --
    I'll see your Constitution and raise you a Queen.
    1. Re:Saving to local drives? by Grell · · Score: 3, Informative

      3 warnings?

      Must be nice.

      I field 250-320 emails a week.

      All replys are "reply with history" often with screenshots as company policy and due to the complexity of the job. (3rd level insurance support w/ story problems galore)

      I have a personal storage quota of 75 megs,
      the mailbox I save to has a personal storage quota
      of 75 megs. (personal space? about 7% and holding, corporate box? 90-100% at all times)

      They cannot share, or transfer any storage quota from one user or resource to another.

      They will not buy *any* new drive space.

      They will not examine *any* redistribution of present drive shares. (like oh I dunno, *USAGE*)

      And the first warning we get that the drive is
      filling (about 3-4X a week) is the cannot write to
      drive warning.

      We delete somewhere in the neighborhood of 90 megs a week of pontentially subpeonable documentation and there is no plans in place, or even spoken of
      to correct this. (don't ask, don't tell)

      Save it to a local drive? No that would violate security protocols.

      Grell

      --
      ...when it gets down to fundamentals, do what you have to do and shed no tears. Dr. Matson in Tunnel in the Sky
  61. Shoudl use something like emailextender by Anonymous Coward · · Score: 0

    hm... perhaps they should use a product like Email Extender?

    http://www.legato.com/products/emailxtender/

  62. Information Lifecycle Management by bolix · · Score: 4, Insightful

    ILM is the next big thing. Its the logical extension to the ever increasing SAN/NAS Server/Workstation exponentially-increasing-data problem (go google for pretenders to the law).

    You can't oversee growing data storage without a parallel increase in administration costs. Instead, the idea is to build automatic archiving into your storage architecture.

    In practice this means you build tiers of storage/archive methods. Tier 1 is a high tkt Shark SAN etc, Tier 2 is lower priced SATA RAID and Tier 3 is a DAS Tape Library. Build retention guidelines into the storage management playform (Tivoli etc). Older items are automatically moved to the Tier corresponding to that retention/access policy. Really old items "live" on Tape. Frequently accessed data lives on the high speed boxes near to the users/application. You snapshot updates to a DR replica offsite or burn periodic Tape sets etc. Its a good idea to team this with storage virtualization (virtual LUNS/ Metadata directory servers) and you can add/rotate/modify the storage tiers when necessary without any downtime.

    From a user perspective, you click on the link and if applicable, get notified the item is being retrieved from media x (its mostly transparent). Worse case - access times are in the minutes.

    Of course, all this comes with a high price. Enterprise Storage systems are not cheap. Recent legislated policy (Sarbanes Oxley etc) enforces the retention of some media (e.g. email). You cannot rely on end users to enforce data retention. This lets you mandate tiers of protection and is highly configurable to support per application monitoring.

    Nothing is foolproof. Its still being finessed but if you can afford it - its truly a thing of beauty.

    1. Re:Information Lifecycle Management by dasdrewid · · Score: 1

      I agree completely. Mostly because I have no fscking clue what you just said.

      Really though, I think I understand the gist of what you said. But, just a thought, you might wanna pop in a few footnotes next time so that the rest of us 99% know more of what's going on.

      --
      No trespassing. Violators will be shot. Survivors will be shot again.
    2. Re:Information Lifecycle Management by bolix · · Score: 1

      All the big players are buying into this. For the Google-challenged:

      EMC's take from todays Cnet
      CA
      HP
      IBM(tho' they prefer the acronym HSM or Hierarchical Storage Management)
      Veritas
      Random googled pundit on the entire field.

  63. No, they should use Lotus Notes/Domino... by Nick+Driver · · Score: 1

    ...as their email system, where it is trivial to archive old email from your main mailbox off to an archive database file where it's easily moveable to some offline, dirt-cheap media such as CDR, etc.

  64. Re:Simple... (not) by Zen · · Score: 4, Informative

    There comes a point where that, too, gets very expensive. At my company (large US healthcare provider, with governmental and private contracts both HMO and PPO), after saying 3, 5, and 7 years, our lawyers have told us we have to archive all email potentially forever that the end user doesn't specifically delete. They may do an end-run around the deletion and archive those, too, but I don't know. Anyway, our email system (Lotus Notes, which is an extreme HOG) eats somewhere between 100GB - 1TB/week. I was told it was well over 1TB, but I don't believe them. This is of course due to older Notes versions inability to store attachments in public directories and simply sending a copy to each and every recipient (and the stupidity of no size limits on internal email). There is a point to how many drives you can add to a SAN, and then you have to get a whole extra chassis, which is where the expensive part comes in. To keep buying new SAN units every 6 months or so, as well as the harddrives to put in them (plus the maintenance contracts, 24/7 support, etc) could easily add up to $1million/year or more. Which is definitely more costly than 10 average low-mid level administrator's salaries.

  65. +1, Insightful! by Anonymous Coward · · Score: 0

    What he said.

  66. Re:Simple... (not) by BasilBrush · · Score: 2, Insightful

    The 10 secretaries in question were only using 1 GB each per year. 10GB per year in total. If your company is as large as you imply, the amount of work hours involved in sorting though old emails will be larger than that. Each person (or their PA) would need to do their own. That's a lot of hours.

  67. Screw the Lawyers by Detritus · · Score: 3, Interesting

    At one company that I worked for, they got the brilliant idea to delete all email older than 30 days. They also didn't want employees to make backups of their personal mailboxes. They intentionally wanted all traces of old email to disappear. While I'm sure that it made the lawyers happy, it caused a lot of grief for the people actually doing work for the customer. Many design decisions, bug reports and other important things were only documented in email messages. This is supposed to be the age of the paperless office, right? When you are involved in a multi-year project, you often need to refer to old messages. It also had the effect of making old policy memos disappear, whose existence had proved to be very inconvenient to management on several notable occasions.

    --
    Mea navis aericumbens anguillis abundat
    1. Re:Screw the Lawyers by geekoid · · Score: 2, Interesting

      I worked at a company with that policy.
      Then one day, in a meeting with VP's, a manager tried to put me on the spot, and use me as a scapegoat with some bold face lies.
      I'll never forget the look on his face when I produced hard copies of our email exchange...
      ahh, memories.
      I also got the VP to change the email policy.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    2. Re:Screw the Lawyers by DerekLyons · · Score: 1
      They intentionally wanted all traces of old email to disappear. While I'm sure that it made the lawyers happy, it caused a lot of grief for the people actually doing work for the customer. Many design decisions, bug reports and other important things were only documented in email messages.
      Which means the people actually doing work were actually complete idiots. If you are tracking bugs... Then use a bug tracking system, that way bugs don't get lost and forgotten because the guy who recieved the email is on vacation. Design decisions need to be document in a formal paper, submitted and maintained in a proper archive, not spread and buried inside and across multiple mailboxes. etc. etc.
      This is supposed to be the age of the paperless office, right?
      Yes. But 'paperless' != 'everything in one unorganized box'. In a paper system no one with any intelligence would tolerate doing the things that you describe as being done in email.
    3. Re:Screw the Lawyers by Detritus · · Score: 1

      Your response assumes a level of funding and organization that is often not present in the real world of incompetent management and shrinking budgets.

      --
      Mea navis aericumbens anguillis abundat
    4. Re:Screw the Lawyers by DerekLyons · · Score: 1
      Your response assumes a level of funding and organization that is often not present in the real world of incompetent management and shrinking budgets.
      I assumed the individuals in question were actually interested in doing their jobs. Creating a minimal bug tracking system (an indexed collection of printouts say) requires niether a budget nor manegerial competency. It does however require a certain level of competency and initiative on the part of the workers.

      Your comment on the paperless office and willingness to shift the blame shows both to be absent.

  68. Probably Running Exchange by billstewart · · Score: 1
    They're estimating that it would cost $250K/year in management and hardware to expand their system. Assuming that half of that's hiring a good sysadmin (:-), the only way I can see them spending that much money just for expansion is that they must be running some clumsy proprietary mail system - probably MS Exchange, or possibly some antique from IBM (worst case = PROFS.) Exchange has the advantage that it encourages bloatmail - sending attached Word documents instead of simply writing text, or sending Powerpoints instead of Word. I've probably got a gigabyte of mail from the last year, even though I delete lots of stuff. A better-behaved user (:-) might have a mere 100 MB, and it's hard to compress this stuff beyond 2:1 because of MS's binary mailbox formats, even if they're not using the encrypted stuff.

    The real way to do storage, though, is to let the users keep their mailboxes on their own PCs. My company's IT department pushed us in that direction many years ago, partly because it's the only way to really support laptop users, but partly because it gets rid of the central storage bottleneck and makes it the user's problem to not run out of disk.

    A more convenient mail system would make it possible to archive this stuff to DVDs, or CDs for users with smaller mailboxes. (CDs are more useful, because most PCs have CD readers, and most government-office PCs are unlikely to have DVD readers, and probably most don't have CD writers yet either - you'd have to do this in a centralized fashion.) So have a centralized group burn the stuff to CD, unless the users have their own CD burners.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
    1. Re:Probably Running Exchange by SlamMan · · Score: 1

      Keeping their mail on their own PCs doesn't lend it self to decent IMAP usage, or to letting them have a webmail interface to the mail server. It'd only show new things.

      --
      Mod point free since 2001
    2. Re:Probably Running Exchange by pqdave · · Score: 1

      Most webmail use doesn't need to have antique messages, just the most recent few days. My mail client has an option to delete mail from the server a configureble time after the message was downloaded locally.

  69. Is it so hard.... by Zenmonkeycat · · Score: 1
    to just move all the old messages to a few RAID arrays with some new 400-gig drives? Just copy the messages to the backup drives, then delete them from the servers. That's what most POP3 accounts do anyway. Then you can subpoena the backups, and everything will be right there.

    Of course, if the city /wants/ those messages to go away because of the threat of subpoenas, that would be a problem.

    --

    *****
    Dear Mary,
    I yearn for you tragically,
    A.T. Tappman, Chaplain, U.S. Army.

  70. Better than a 90-day maximum lifetime by billstewart · · Score: 1
    Sure, it's nice to get a guaranteed 10 year minimum lifetime. But these droids are talking about a guaranteed 90 day MAXimum lifetime. Unreliable disks aren't great, but they're much better.

    Also, there's the issue of centralized vs. distributed archiving. If you're centralizing, DVDs are obviously the better choice, because you can store 6 times are much data on each, and if you're doing one mailbox at a time, you're less likely to need multiple disks. For distributed use, though, CDs may win, because government bureaucrats are much more likely to have CD readers than DVD readers; some of them will also have CD writers. Probably the best choice is to have one archive copy and one copy for the user to keep, and bar-code-label the archive.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  71. Your math is way wrong for Exchange by billstewart · · Score: 1

    I just took my Exchange mailbox and WinZipped it. 1.4 GB raw became 760 MB compressed - about 50%. Not only is a typical Exchange user unlikely to have most of their mail messages fit into 10KB of text, they're stored in clunky formats. Probably most of the bytes in my mailbox are Powerpoint, and most of the rest are Word. 3/4 of my spam is probably 10 KB of text :-), but the rest has embedded pictures, and in any case it all gets deleted.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
    1. Re:Your math is way wrong for Exchange by yppiz · · Score: 1

      My 31MB inbox in mbox format became 10MB after gzip -9

      I have a fair number of photos and .doc files in there, but little spam.

      --Pat / zippy@cs.brandeis.edu

  72. Outsourcing garbage collection... by mikael · · Score: 3, Funny

    When I heard my city were outsourcing their garbage collection services, I imagined office blocks of staff in India sifting through online hex editors looking for spare memory blocks to delete.

    --
    Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
  73. Re:Simple... (not) by Zen · · Score: 3, Interesting

    Right. Our company originally tried to instate size limits when we went to Notes (only 3 years ago), but then the lawyers said we need to keep everything anyway (HIPAA requirements). So even with the exorbitant expense of the system, it is probably still cheaper to keep expanding every couple months rather than pay people to sit there and sort through their own email. Anything from an external party must be kept, and anything remotely regarding a customer must be kept as well. It's a huge pain, and they took the easy way out by archiving every single email. But neither option is very cost effective. There are four people that I know of in my department alone that have email boxes (extensively categorized with dozens or up to hundreds of folders) with up to 20GB each. It's crazy. But even without the ever looming threat of a lawsuit, they claim that they have been able to disprove what other people were badmouthing them about by being able to produce an email from that person stating the exact opposite a year or two previously. I've witnessed it once, and it is pretty funny watching somebody turn beet red in a room with 25 supervisor's and above.

  74. Re:Simple... (not) by Anonymous Coward · · Score: 0

    wtf?

    "our lawyers have told us we have to archive all email potentially forever that the end user doesn't specifically delete."

    I thought email was archived at the server level, not the end user level-- I thought *everything* coming in and going out gets archived. There is no logic that I can see, in letting the end users decide what gets logged.

  75. Easy solution if you have management's support... by Anonymous Coward · · Score: 1, Insightful

    There is an easy enough solution to this if you have management's support. (Assuming I understand the problem, which is apparently that the pop server is overburdened.)

    The first step is to solve the steady-state problem. This is easy enough: you make it very well known that they are not to leave messages older than 90 days in their mailbox. But because the messages may contain official stuff and can't be deleted, you don't delete old messages. Instead, you test every mailbox periodically to see if it contains old stuff, and if it does, you block delivery of new messages to the mailbox. You can leave them with POP access to it so they can clean it out. Of course, you make this policy well-known. And you put an automated message into their mailbox that notifies them they've been blocked too.

    By doing this, you've set up a give and take situation: as long as they do their part to keep their mailbox generally clean, you do your part to deliver messages. Presumably managers will encourage their employees to keep up on the maintenance because they don't want employees to be unable to be reached by e-mail.

    Second part is to solve the problem of too much data already on the server. To do this, you announce the policy above and put it into place. Send people advance notice (two weeks, one week, two days, one day, etc.) that their mailbox is going to be locked if they don't clean it. For those who don't clean it, go ahead and lock it. Leave it that way for a short while (until you get some complaints) and then announce a one-week extension.

    Then, for those who *still* don't do anything about it, take all the messages that are older than 60 days, remove them from the user's mailbox, then put them aside. Burn a CD of the mailbox and send it (interoffice mail, or whatever) to the user's manager. Then make your own archive of all such messages, and delete them from the server.

    Now the recalcitrant people will have to go see their manager to get their old messages, and the managers will know why and will know that they've been given several warnings and an extension and still didn't bother to do anything about it. Maybe the manager won't care, but I can't imagine they'll have a positive feeling about their employee having found a way to waste their time.

  76. Ten years by HermanAB · · Score: 1

    is what my lawyer wife requires. All incoming and outgoing mail is archived - though I draw the line at saving trash and only saves incoming clean mail after the spam and virus filters. She actually uses the mail archives from time to time. So, logrotate.conf is set to 120 months for e-mail.

    --
    Oh well, what the hell...
  77. I work for a government office... by Anonymous Coward · · Score: 2, Interesting

    Probably too late in the thread for this to catch much attention...

    However, I work for a local Government office, close to Charlotte, NC. It is our stated policy to remove all e-mail older than two weeks except for e-mail that is crucial to job performance. This is less to save space then it is to keep the news media from finding dirt. We really don't care that our e-mail is public record. We really have nothing to hide. However, the local newspaper (in Charlotte) is constantly asking for _ALL_ of the County Manager's e-mail. They aren't looking for anything specific, they are just on a fishing trip, trying to see what trouble they can stir up. They rely on the Freedom of Information act to, hopefully, generate some news, instead of doing some real investigative work. *sigh*

    Anonymity enabled for self-protection. Wouldn't want the powers that be see my e-mail...

  78. Government by HermanAB · · Score: 0

    Being a government organization, I would expect them to first print all the e-mail and make 3 photocopies for filing, before deleting it...

    --
    Oh well, what the hell...
  79. Big Step Forward! by Anonymous Coward · · Score: 0

    In a city that can scarcely get the corpses out of the inner harbor within 90 days, this is a huge advance in bureaucratic efficiency!

  80. Re:Sad news ... Ronald Reagan, dead at 93 by Anonymous Coward · · Score: 0

    PotUS only makes $200K/yr?

  81. Read my lips. No new Hard Drives. by Anonymous Coward · · Score: 0

    "I would invest in more hard drive space to hedge against lawsuits."

    Was that the sound of Raven42rac agreeing to higher taxes? You rock dude.

    1. Re:Read my lips. No new Hard Drives. by Raven42rac · · Score: 1

      No. They should have this crazy thing called a "budget", from which they can buy some hard drives. I also do not live in Baltimore, so my tax dollars have zero impact on that fair city. Think of the tax dollars that would have to be generated if someone decided to sue if something got deleted that should not have.

      --
      I hate sigs.
  82. Re:HTML in email? Forbidden! by Anonymous Coward · · Score: 0

    If you really want collaborative document editing with
    version control then emailing MS-Word documents around is NOT
    the way to do it. You end up wasting a lot of space and
    bandwidth and when it is 5 minutes to the critical meeting
    still everyone argues about who ended up with the final
    draft.

    Email is for text messages that take the place of verbal
    discussion. A blanket ban on all email larger than 1M solves
    the problem nicely.

  83. This really sucks by rodney+dill · · Score: 1

    I work for one of the Big Three (automotive) that has a policy of deleting email messages after 60 days. (They don't automatically do it, but it is supposed to be self policing). All this does is let the F**king liars win. Numerous times in "corporate" life I've had people tell me that "I didn't get that", "I wasn't informed", "I didn't get that message", usually on messages that are greater than 60 days old. When they and their management get a second copy they usually change their tune. I understand the ability to limit lawsuit liabilities but they sap the energy of their best workers when they let the liars win.

    --

    Use your head, can't you, use your head,
    You're on earth, there's no cure for that
    - S. Beckett
    1. Re:This really sucks by Anonymous Coward · · Score: 0

      That's why you always email with a CC list that includes
      people unlikely to cooperate with one another and you
      keep a copy yourself and if they don't reply to that
      email then you ring them up and ask why they haven't
      replied. Once they reply, they always top-post with
      a copy of the message and you have all the evidence
      you need.

  84. Wrong solutions to the right problem. by Anonymous Coward · · Score: 0

    "As an aside:
    If you are gov't employee, it is your responsibility to use email for official business only. All communication should also use proper English (these are not posts to Slashdot) and all the emails and memos should be self-containing (and not include 100k messages of pretty formated reminders of lunches). I don't work in the gov't, but I still follow these rules. This cuts down on most junk."

    Sounds to me like people need to learn what is what. Relatively transitional things like scheduling needs to go into a groupware solution.

    There's voice-mail, and SMS as well as cell-phones for that level.

    Maybe a DMS (Document Managment System) for contracts, and other important documents that have a long lifespan.

    In short, use the right tool for the right problem, instead of an all-purpose solution to every problem.

  85. Require mail from the public be encrypted-A Hole by Anonymous Coward · · Score: 0

    "PGP is worse. It isn't supported by *any* mailer widely uses mailer (installing an extra 'plugin' does not count - most of the people I talk to have absolutely no idea what a plugin is let alone how to install one)."

    Do what everyone else apparently does. "Click on this to install PGP plugin" in body of E-Mail. For once use OE's "problems" to your advantage.

  86. Well.. by Anonymous Coward · · Score: 0

    There's a reason why we call ourselves Baltimorons.

  87. That explains the size of their emails by RedLaggedTeut · · Score: 1
    "Every email has to be self-containing"

    This explains the size of their emails:

    make_email.php: <?php echo $message; include (basename($PHP_SELF) ); ?>

    --
    I'm still trying to figure out what people mean by 'social skills' here.
  88. Ephemeral Email by 4of12 · · Score: 1

    if it's not there, they can't subpoena it.

    Ever since emails at MS got exposed in court and Monica Lewinsky's emails to her friend about the insensitive clod not getting her flowers everyone has decided to have an Official Policy For Getting Rid of Old Potentially Incriminating Email.

    It's a double plus advantage: clear out space on the servers, increase the speed of searches through old email, and decrease legal liability.

    But it doesn't increase my trust in those companies or government agencies that have such policies.

    Verbal communications, hints and innuendo have provided vanishing trails of evidence for years while paper has reinforced accountability. With use of paper ebbing, accountability will decrease and, along with it, trust in other people and institutions. As if we needed less trust in powerful institutions, which operate under enough invisibility and leave-no-trace principles now that much greater abuses of trust are possible than before.

    --
    "Provided by the management for your protection."
    1. Re:Ephemeral Email by 40000 · · Score: 1

      Email is just another form of one to one communication like the telephone. Old email should be deleted, it was a private conversation.

    2. Re:Ephemeral Email by 4of12 · · Score: 1

      Old email should be deleted, it was a private conversation.

      If it really is nothing more than a private conversation, one individual to another.

      OTOH, if your money is invested in a company with unscrupulous employees who plan a fraud via email and those emails are deleted quickly due to some limited retention policy, then you might be out of luck recovering your money.

      Likewise, if your government, which your taxes supports, decides unilaterally to authorize a new policy such as data-mining your credit card information for terrorist connections and the only record of this new policy is in email form, then you might want those government records available.

      Not just personal communication, but a much greater fraction of business and government is conducted electronically. With paper constituting an ever-diminishing fraction of total records, and paper being the only enduring evidence, we can expect fewer successful investigations and prosecutions of fraud in business and in government.

      --
      "Provided by the management for your protection."
  89. Re:Simple... (not) by Anonymous Coward · · Score: 0

    dissallow external email for those that don't really need it - that should fix it

  90. Bitch? by Anonymous Coward · · Score: 1, Insightful

    She doesn't sound like a bitch. She sounds like someone who wanted to share the experiences of the company picnic with everyone. That doesn't sound like a bitch to me. She may not have known about the problems her post would cause. If she had done something like this in the past, and did it again, that might make her "stupid", but not a bitch. A bitch is someone who complains when another employee has a family picture on her desk, because personal decorations are against company policy. A bitch is someone who expects people to drop everything to help him/her, but won't lift a finger to help others. A bitch is, generally, a person who is unpleasant to be around, a person whom almost no one likes. A bitch is not someone who would pass around pictures of the company picnic. A person who calls a woman a "stupid bitch" because she made a simple mistake sounds like a sexist asshole to me, and not someone that I'd like to know.

    1. Re:Bitch? by Lord+Apathy · · Score: 1

      Sounds like? You obviously haven't read my other posts.

      --

      Supporting World Peace Through Nuclear Pacification

    2. Re:Bitch? by Dwonis · · Score: 1
      A person who calls a woman a "stupid bitch" because she made a simple mistake sounds like a sexist asshole to me,

      Asshole, I can understand, but sexist? What do you base that claim on?

  91. IMAP + CD Burner by corvi42 · · Score: 1

    Here's a simple solution, get everyone using central IMAP servers for their email, have a little scripty-poo that tars the mail boxes every month & burn them to cd. Then put the CDs in paper jackets into shoeboxes & stuff them in your city archives.

    --

    There are a thousand forms of subversion, but few can equal the convenience and immediacy of a cream pie -Noel Godin
  92. Re:Easy solution if you have management's support. by edgedmurasame · · Score: 1

    However, if you were to have a critical message come in that could not be missed, this might be used by the Enron types to deny entry. Also, you might want to do a delivery straight to the user - saves on delivery time and puts the BOFH in his place. Blocking delivery might seem a good idea at first, but you're going to have tons of regulations to dodge before you reach the manager.

    --
    "Forget the engineers." -Carly Fiorina, briber of MIT Technology Review.
  93. Re:Simple... (not) by Zen · · Score: 1

    I never said that the 'we' I was referring to meant the end user. I may have a tendency to switch back and forth between possessive's, but in this sentence, 'we' meant ITG services as a whole division. There are also many ways to setup an email backup solution, none of which I know of for sure are the one that we use. You could set it up so it backs up inboxes at a certain time each day - so anything that came in that day, and was deleted before then was not saved. You could trust people's judgement and remove from your archives any emails that the end user specifically deletes from their trash folder, or you could bite the bullet and archive everything every time.

  94. Re:Simple... (not) by Zen · · Score: 1

    Not gonna make any noticable difference at all. Nobody ever sends large attachments to an external email address, because chances are that that external server will not accept it. Also, internal email for a large company is orders of magnitude greater than external email. You've got all your meeting notices, meeting minutes, project notes, quick questions, blah blah blah. I might get 100 emails a day from internal employees, and 5 or so from external.

  95. Re:Simple... (not) by Hoi+Polloi · · Score: 1

    "by being able to produce an email from that person stating the exact opposite a year or two previously"

    It isn't just the amount of email that you store but the fact that you are able to search through it all to find a piece of text from a specific individual from 1-2 years ago! How the heck do you do it? It must take a good chunk of time.

    --
    It is by the juice of the coffee bean that thoughts acquire speed, the teeth acquire stains. The stains become a warning
  96. Olig. South Park Ref. by Anonymous Coward · · Score: 0

    Actually it is done by gnomes living in your memory controller.

    1. Clear unused memory blocks
    2. ?
    3. Profit!

  97. Dept. by TheJavaGuy · · Score: 1

    from the city-that-reads-remember dept. Very appropriate dept. Baltimore's motto, after all, is "The City That Reads".

    --
    Opera Watch - An Opera browser blog.