Slashdot Mirror


Ask Slashdot: Best Way To Archive and Access Ancient Emails?

An anonymous reader writes "I started using email in the early 90s and have lost most of that first decade due to ignorance, botched backups, and so on. But since about 2000, I've got most — if not all — of my email in some form or other. I run Linux, so this has mainly been in a mix of various programs: Kmail, Evolution, Thunderbird. The past 2-3 years are still on the IMAP servers. My problem is that I only rarely NEED to look back to email of 5 years ago. But sometimes it's nice. Or I just want to reminisce about something...or find an old attachment that I was sent. But I do not want to be clogging my current email client of choice with vast backups and even more, I don't know if it will even easily convert. The file structures are different, some are mbox, others maildir, etc., and I would ideally like a way to 1) store and archive these emails, 2) access them, and 3) search by Sender, Subject, Date, Attachments. Is there anything I can do or do I just have to keep legacy applications on hand for this? Should I keep trying to upgrade and pull old files into the new applications? Any help or suggestions about what YOU do would be great."

62 of 282 comments (clear)

  1. IMAP by sylvandb · · Score: 4, Informative

    Just IMAP it all.

    I went IMAP in 1997 and have never looked back.

    I've also used IMAP as a temporary conversion measure for people switching e-mail clients so even if you aren't sure, it makes a good first step.

    I don't understand the concern about too many e-mails. I can access my email back to 1992. With multiple folders it shouldn't be a problem and with modern indexing a search shouldn't be an issue.

    1. Re:IMAP by DNS-and-BIND · · Score: 4, Informative

      When I search for my brother's name I don't want to wait 30 seconds for a search to complete, nor do I want to see his emails from 10 years ago. I just want to see his last email that he sent about the trip we're taking next week. That's the concern about too many emails.

      --
      Shutting down free speech with violence isn't fighting fascism. It IS fascism!
    2. Re:IMAP by kwerle · · Score: 5, Insightful

      This.

      I fired up imap servers for all my old mail.
      I fired up a modern mail client (OSX Mail.app) and connected to all of 'em and also to gmail.
      I dragged all my old email into gmail. In a GUI. And it worked.

      Done.

      I no longer run mailservers. Too much of a headache. gmail is awesome (with imap access, even). Indexing, instant searching, etc.

      If you don't want/trust your email to the cloud, then this isn't for you. Unless you want to run your own imap server with whatever backend suits you - then you can dump it all there. I just can't be bothered to manage that after 15+ years of doing so.

    3. Re:IMAP by Anonymous Coward · · Score: 2, Interesting

      IMAP offers "server side" storage with "client side" viewing.

      Besides the privacy implications, are you seriously suggesting that the OP
      a) find or create an IMAP server,
      b) force feed that server all his archived emails (presuming that there is some way to bulk import email into the IMAP server), and
      c) change his current email setup so that, from now on, his email is sent to the mail server on which the IMAP server runs?

      How is that any easier to manage than his current predicament? ISTM that your suggestion forces him to go into a big "migration" phase, and change his email provider/provisioning. Not the simple solution that I would suggest.

    4. Re:IMAP by hairyfeet · · Score: 2

      Got a better idea? The guy has them in Lord knows how many programs splattered all over the place, he wants it all in one place and searchable and IMAP is good at that. I mean sure we would all prefer a "push button and its done" kinda deal but AFAIK no such thing exists that will let him store it locally. You could run it all into Gmail but some folks have privacy concerns and of course its not like Google hasn't lost stuff in the past, so given a complex situation like TFA I'd say IMAP is probably gonna be the least painful of the bunch.

      Hell he's already on Linux, not like adding an email server role in Linux is hard, millions of Linux boxes do that role everyday. But if you got a better call Hoss I'd like to hear it, given the requirements I'd say IMAP would fit the bill closest with the least amount of hassles. But lets face it when you are dealing with a decade plus worth of data in different formats? Its never gonna be clean and easy, it just don't work that way.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    5. Re:IMAP by arth1 · · Score: 3, Informative

      Pretty much everything not made by Microsoft will support export to good old mbox. It's a good format to store in, because you can always import from it into other formats.
      And you can run simple scripts against the mbox files.
      More than once, I've done a grep against my mail archive, and more than once I've moved it to a new machine and new mail software.

    6. Re:IMAP by Antique+Geekmeister · · Score: 4, Informative

      _NO_. Under no circumstances use "mbox" for mail storage, or anything other than a temporary stage on the way to transferring it to something contemporary and uable such as Maildir. If you lose that one mbox file, by file system corruption or by fat finger accident or overflowing a partition or in tht eprocess of merging new email with it, you've lost _all_ your mail in that mbox. And as you read, mark, or save mail, that file is constantly churning, making backup and replication of the mail spool far more dangerous and fragile, especially when the mail directory is bulky with years or decades of active mail threads or simply undeleted email.

      mbox was useful when the available inodes on a file system were limited programs benefited from using a single inode for transactions, and backups occurred on magtape, but there is simply no point to it in decasdes.

    7. Re:IMAP by icebraining · · Score: 3, Insightful

      I agree with that for new emails, but for an archive file, none of it really applies. File system corruption and fat fingers should be handled by just restoring from backup, and merging / marking as read / etc is not really applicable for old mail, which should be accessed by either viewing it readonly or making a disposable copy.

      mbox might have its problems, but I don't think there's any good reason to spend time converting old files to Maildir.

    8. Re:IMAP by hobarrera · · Score: 3, Insightful

      Archive old emails by year:


      Archives/2013
      Archives/2012
      Archives/2011
      Archives/2010
      Archives/2009 ...

      Only search in the appropiate ones. Easy, right?

    9. Re:IMAP by peragrin · · Score: 2

      That's why I do both.

      for every day I use gmail

      but once a year I fire up a current email client and download everything archived. I also purge the archive every couple of years of truly useless emails.

      I keep a a copy for safety and away I go.

      --
      i thought once I was found, but it was only a dream.
    10. Re:IMAP by arth1 · · Score: 5, Informative

      _NO_. Under no circumstances use "mbox" for mail storage, or anything other than a temporary stage on the way to transferring it to something contemporary and uable such as Maildir. If you lose that one mbox file, by file system corruption or by fat finger accident or overflowing a partition or in tht eprocess of merging new email with it, you've lost _all_ your mail in that mbox.

      Thus speaks ignorance. If you write corrupt data to a mbox file, nothing prior to the corruption is affected at all. Unlike most formats that don't store each mail in a separate file, you can also very easily run recovery against a mbox file. Heck, a one-liner perl script can retrieve anything from before and after a corruption.

      And "overflowing a partition"? Um, run that by us again. If you mean disk full, that doesn't truly affect a format that's made for appending. You won't be able to append. Any other format you can come up with will have the same problem.

      And for archival purposes, this also does not apply. You don't make changes to your archive. Period.
      And you back it up. Period.
      Which is a heck of a lot easier to do with mbox than most other formats.

      But again, the main strength is that it is so simple, which means that pretty much every mail program out there will support it, one way or another.
      Choosing a more modern format leaves you with fewer options, and less certainty that it will be supported in the future. 20 years down the road, mbox will still be supported. It has an RFC - http://tools.ietf.org/html/rfc4155

      Can you say the same about ANY other format? Maildir doesn't work on systems that doesn't allow colon in file names, and hashes the filename based on the hostname which both isn't portable, and crashes badly for many implementations if you have a non-ascii hostname. Not to mention that the format has balkanized, to the point that it's no longer compatible betweeen implementations.

      Again, for archival purposes, simplicity is the key.

    11. Re:IMAP by adolf · · Score: 3, Interesting

      What's wrong with MBOX? I've been using it with gigabyte-sized folders for over a decade and nothing bad has ---@@@ From MAILER-DAEMON Fri Jul 8 12:08:34 2011

      Seriously, though. Even when mbox gets trashed due to disk corruption, it is still every bit as useable as a trashed maildir: Messages get lost, not whole folders...and even then, that's what backups are for (right?).

      Or at least, that was my experience over the past many moons. As a static archive (Sent-mail-2012), I can't think of a single thing wrong with mbox. And it's easier to cat to tape than maildir.

    12. Re:IMAP by arth1 · · Score: 2

      Or at least, that was my experience over the past many moons. As a static archive (Sent-mail-2012), I can't think of a single thing wrong with mbox. And it's easier to cat to tape than maildir.

      You don't even have to create a new tape entry, just append to the old one.

      Not to mention that mboxes can b cat'ed together, as they get older. Like combining all dailys to a weekly, weeklies to a monthly, and monthlies to a yearly.
      What it doesn't have is indices, but nothing prevents you from importing and converting a mbox to an indexed format. Or creating an index for that matter.

      When archiving, the main point is that you want a format that is mature, long-term-supported, and easy to use tools again. What do you do with an Informix or Sybase binary archive today? Yet those admins who got laughed at for making humanly readable and script-parsable SQL dumps have no problem when they need to access 15 year old data.
      Same with e-mail. Go with simple, plain text, and well supported. I.e. mbox.

    13. Re:IMAP by Yosho · · Score: 3, Interesting

      For what it's worth, my personal mail server is an Athlon X2 3800+, still running Ubuntu 8.04.4 LTS. Pretty old by today's standards. I've got a dovecot server offering IMAP access and a Roundcube webmail server on it. My inbox has about 25,000 messages in it, and there are e-mails in there that go back to 2007.

      Doing a search by sender took maybe 1 or 2 seconds, and the most recent e-mails came up right away since they're sorted by date.

      --
      Karma: Terrifying (mostly affected by atrocities you've committed)
    14. Re:IMAP by kcbnac · · Score: 2

      Any web-based client *should* (unless they use a plugin or other weird config tool) save any filters, etc. to the web-based-profile (Like GMail does). Otherwise, if you choose Thunderbird (or any other sane client) you could just copy over the profile between installs. Even across OSes.

      I've successfully used a singular Thunderbird profile on both a Windows and Linux boot off the same machine; granted Linux access to the NTFS partition Windows sat on and it used that profile directory. Been copying it forward to new installs for a few years now.

      Migrating towards a VM (that gets backed up regularly) holding the 'core' stuff that doesn't sync well (Firefox and Chrome both do; so can run that on whatever) - then just use that VM for non-GMail email, and whatever else is worth consolidating down to one machine.

    15. Re:IMAP by WillKemp · · Score: 3, Interesting

      [......] are you seriously suggesting that the OP
      a) find or create an IMAP server,

      Ridiculously simple. They're already running Linux, they just have to install dovecot and they've got a fully functional IMAP server (no configuration required) - which has access to all their local mail boxes.

      b) force feed that server all his archived emails (presuming that there is some way to bulk import email into the IMAP server)

      Ridiculously simple. Fire up Thunderbird, configure it to access your local IMAP server, select all, drag and drop.

      c) change his current email setup so that, from now on, his email is sent to the mail server on which the IMAP server runs?

      Why would they need to do that? Thunderbird (or other mail reader of choice) can access multiple accounts.

    16. Re:IMAP by funkboy · · Score: 2

      Run your own IMAP server. For the past decade or so, Dovecot has by far & away become the best choice. If you've set up any other daemons before it's really not very complicated software.

    17. Re:IMAP by SomeKDEUser · · Score: 2

      Then use Kmail and behold the power of akonadi and nepomuk. searching in the later versions is fast -- and I have a very large number of mails.

      It turns out that for large enough collections you _do_ need a DB :)

    18. Re:IMAP by Antique+Geekmeister · · Score: 2

      Thank you, yes, I was speaking of file system corruption. And because any ongoing email management, such as deleting or even marking as read or unread the old messages, causes change in the content of the mbox file, from the message edited onward, the claim that "it's only risking messages after the point where you edit" is disingenuous. I'm afraid it's not "ignorance" speaking, it's lengthy and painful experience.

      _If_ the mbox files re absolutely static, then mbox can be considered reasonably stable. But if the messages are resorted into new folders, or even worse if the oldest, earliest entries are ever deleted, then the contents of the mbox file _from that message forward_ have to be rewritten. There is no graceful way with most filesystems to simply "snip this 3095 characters content out of the middle between the start and end of this particular message". The means used can be fascinatingly clever and complex but normally involve overwriting _everything after the beginning of the removal_ with the remaining, preserved old content. And we could explore further what happens on the disk when you actually try to delete content after a certain point in a file, and how that churns the underlying filesystem itself, but it's heavily filesytem dependent.

      This means that touching the early entries, and any accidents that occur, corrupts anything after those early entriees. It also means that touching those mbox files causes filesystem churn, becuase the files no longer match the old files and have distinct contents. Unless deletions or additions fo the entries somehow aligns with the old blocks, even most deduplication based filesystems will fail to optimize. And the tendency of some old mbox users to keep _everything_ in simple, large mbox "folders" which are actually single mbox files compounds the issue with backup problems tied to very, very large files, and tied to small edits of those very, very large files causing churn in the backup system.

      Having an RFC for an older, simpler protocol does not make it ideal for modern use. mbox was useful when filesystems were distinctly slowed by many hundreds or thousands of files in one directory, and when the number of inodes available for your home directory and the ability to monitor or mange a mailbox in a consistent format was critical. But Maildir and various tools based on it have, correctly, replaced it. The filesystem issues are one critical reason, and the other is what Dan Bernstein talked about when he wrote Maildir: safe locking or transaction handling for multiple simultaneous client access. (See http://cr.yp.to/proto/maildir.html)

      Maildir successfully follows one of the critical lessons of robust programming. If you make only small changes, you make only small mistakes, and the message handling is vastly safer from adding, deleting, or relocating small files than from merging or extracting individual messages stored in a necessarily vulnerable single archive.

    19. Re:IMAP by Cajun+Hell · · Score: 3, Interesting

      Besides the privacy implications

      There aren't any privacy implications. If you there were any, then you would have named one or mentioned an example. The situation prior to typing "sudo apt-get install dovecot" is that he had the data (so it's already subpoena-able or whatever you're trying to imply) and after that he'll also have the data. Nothing changes. Are you complaining that he's keeping the data rather than deleting it? I don't get your point at all.

      are you seriously suggesting that the OP a) find or create an IMAP server,

      Yes, because it's easy. This can be done in literally ten minutes. Maybe a little more if he doesn't already have good storage allocated for it (e.g. a Reiser formatted ~/Maildir, or whatever your own religion commands).

      b) force feed that server all his archived emails (presuming that there is some way to bulk import email into the IMAP server)

      Yes, because it's trivial. It's highly likely that whatever he is using to read each of his different mail archives, can also talk IMAP, because everything talks IMAP. You say "force feeding" as though literally selecting and dragging in a GUI, or picking "copy" or "import" off some menu, is hard. It's not.

      As for step c (changing how he receives email), I don't think that's being suggested but it may be a good idea. He can decide later, whether or not he wants his archive server to become his main/active server. That decision can wait and is not part of the scenario being discussed; it's an opportunity for the future.

      How is that any easier to manage than his current predicament?

      Because then he'll have his archive stored in a system that is specialized for handling the problem, accessible and searchable by any client he wishes to use, possibly even the very same tool he uses for his day-to-day non-archive mail reading. Or he can pick some other IMAP client if it handles mass/archive use case better than the routine use case. Everything Just Works, all together. All of his complexity and exceptions disappear. And at virtually no cost; there's no downside to counter any of the advantages.

      This is one of the easiest no-brainer Ask Slashdots, ever. There is one right objective best simple easy-and-fast-and-good(!) answer, and setting up an IMAP server is it. Probably because email storage is an old, very-solved problem.

      --
      "Believe me!" -- Donald Trump
  2. Use the IMAP by joel48 · · Score: 2

    Use the IMAP server - if you have control and/or space available.

    I just have a single large archive IMAP folder into which everything that isn't spam gets pushed. You could optionally create subfolders for time ranges (every 1-2 years, whatever works for you). Using dovecot with good indexing support on the backend quick searching has been great. If you do a sub-archive breakout on time the searches will be quicker, you could also then create a virtual mailbox combining them all for when search really needs to span time (and take a good chunk longer)

    There are scripts/utilities available to push mbox, etc. into an IMAP folder, push everything there and use it.

  3. Maildir by the+eric+conspiracy · · Score: 2

    I have all my personal email from 1998 in a Maildir directory with Dovecot as the server on a dual core Atom server running Centos. About 900 MB worth.

    Plenty fast.

     

    1. Re: maildir by shaiay · · Score: 2

      This! Also, for fast Maildir searches, have a look at mairix

    2. Re:Maildir by hedwards · · Score: 3, Interesting

      The main problem that I personally have is where I get emails from people that reference images and such from other servers. Most of the time it's commercial messages that I delete, but sometimes there's a newsletter that I want to save, and the images themselves turn out to be relatively important. Kind of annoys me to have to print them to PDF so that the formating gets preserved.

  4. Look to the past by jaak · · Score: 2

    Trying to figure out what formats will be available in the future is pretty hard, it's easier to see what formats have been around a long time and are still in use.

    As such, two formats come up readily:

    mbox http://en.wikipedia.org/wiki/Mbox and maildir http://en.wikipedia.org/wiki/Maildir

    1. Re:Look to the past by AK+Marc · · Score: 3, Interesting

      And PST.

  5. Just dump them by sk999 · · Score: 5, Interesting

    Had the same need 20 years ago when migrating from VAX/VMS to Unix. The old emails were saved in a not quite readable format, but I figured I could recover them if necessary. In the end, never bothered. Yes, there are a few (actually, only two) that I'd like to resurrect now, but life moves on.

    1. Re:Just dump them by tutufan · · Score: 2

      Yeah, that's about where I'm at, too. I think of those Buddhist monks making the sand paintings (which they then sweep away). It's an exercise in recognizing the impermanence of all things.

  6. Use a database! by cosm · · Score: 5, Interesting

    I'm a big fan of throwing together a DB when I want to store things categorically like that and want fast searches. If you are up to the task, hunt down some tools/roll your own so that you have a nice relational database and some stored procedures for getting what you want when you need it.

    You could export your emails to some parsable format, write an importer to extract the basics that you want to keep (from/to/subject/body,attachments/entire binary blob/etc) and then bulk insert that mess into on a mysql/sql server tucked away somewhere locally or "in the cloud" (EC2, Azure). Just another option as I'm sure you'll see here many here. At least with this route you are in full control of how you index, what you can search, encryption, performance, level of backups, etc. Maybe not the best way for some but I know if I had over 100000 emails that I wanted searchable very very quickly with advanced SQL like searching, this would be a cool way to do it (time permitting). Good luck! And to the pedantry to ensue...Yes. Good day.

    --
    'We are trying to prove ourselves wrong as quickly as possible, because only in that way can we find progress.' RPF
    1. Re:Use a database! by Anonymous Coward · · Score: 5, Interesting

      And you could make a doilie, and a hat, and a casserole, and wallpaper with the headers, and knit the .signatures into a fancy flying cape.

      Just use IMAP and Maildir. Modern systems are fast enough to allow you to search the content directly, and not vulnerable to the database support wackiness this sort of "I can pre-organize it now and make my life better by wasting it pre-programming my queries" approach.

  7. Gmail by lga · · Score: 5, Informative

    Best method of storing and searching old email? Gmail. It can import from pop and imap so you can point it at your other inboxes and let it get on with it.You can upload from other mail clients to Google's imap server. Obviously it's amazing at searching through the archives.

    Best method if you're concerned about Gmail's privacy? I'm still working on that one.

    1. Re:Gmail by zekele2 · · Score: 4, Informative

      Best method of storing and searching old email? Gmail. It can import from pop and imap so you can point it at your other inboxes and let it get on with it.You can upload from other mail clients to Google's imap server. Obviously it's amazing at searching through the archives.

      Best method if you're concerned about Gmail's privacy? I'm still working on that one.

      The solution is Google Apps for your own domain. $5 a month per user, 25Gb space, IMAP, no advertising (which is where most of the privacy issues arise), and most importantly, no lock-in as you can switch your email to a different provider at any time without changing email address. As you said, Gmail is by far the best for searching old email. I haven't run an email server for years.

  8. The obvious answer by 93+Escort+Wagon · · Score: 5, Funny

    Design a MySQL database for storing your mail messages, keying on sender, subject, date, and presence of attachments (bonus points for storing the attachments as blobs rather than as external files). Then write a perl script that'll automatically parse all your incoming email and convert it to database entries. I suppose if you're lazy the script could just monitor your mail spool, but it'd be better to just have it listen for incoming connections and handle the mail directly.

    Next, make copies of that script, modifying as necessary to process all your old mail archives.

    Oh, and you'll need to write another perl script to access all new mail - not from your mail spool, but from this database. You should probably name this system after some animal too. If you absolutely MUST have a graphical interface on it, don't use anything newer than TCL+Tk - but going with curses would be a better choice.

    Oh - it has to be GPLv3, or we'll hate you and probably mailbomb your machine.

    What - isn't that the Slashdot way?

    --
    #DeleteChrome
    1. Re:The obvious answer by the+eric+conspiracy · · Score: 4, Funny

      Holy wheel reinvention, Batman.

  9. Stop being a hoarder by realmolo · · Score: 4, Insightful

    You don't need all those e-mails. Keep the few you actually care about (copy and paste the text into a regular file, and save any attachments you want), and get on with your life.

    People that keep every e-mail are weird. Quit living in the past.

    1. Re:Stop being a hoarder by Ardyvee · · Score: 5, Interesting

      It's kind of like photos, you know? Or letters, and such. People like to store those things, because they serve as a memory aid for what the mind no longer holds. It is also quite useful for history reconstruction/when you are old and have nothing else to do but a box full of photos/letters/etc.

      Not to say that you are wrong on your point, except on the weird part. Unless you are okay with double standards, or you also consider anybody who keeps photos of parties/graduations/etc weird... Just saying.

      --
      I don't care if I'm wrong. I only care about everyone obtaining something from the discussion.
    2. Re:Stop being a hoarder by FuzzNugget · · Score: 3, Insightful

      Right, I don't need that client email from a few years ago to remind me about a detail on a project. It'd be better just to look like an idiot in front of them.

      Just because *you* don't need that archive doesn't mean everyone else doesn't need it.

      Why the hell *not* keep a conveniently categorized, organized, sorted, indexed and searchable database of all your important electronic communications? A few gigs is nothing these days.

    3. Re:Stop being a hoarder by icebraining · · Score: 4, Interesting

      But why would I waste time manually finding and copying individual emails, when I can just let the backup script archive them all for virtually no cost?

    4. Re:Stop being a hoarder by CAIMLAS · · Score: 2

      I recently had to go back 5 years to retrieve an email as evidence in my pending divorce. "I never said that!" - like hell you didn't, I've got it right here (and here, and here, and here). She wanted to play hardball, so hardball it was. :(

      I've had to go back 6+ months to retrieve important mail for myself for work and other personal matters as well. Every written correspondence with my ex-wife was in email - some damning, as above, but all of our "love letters" when we were courting. Wouldn't you want to show that to your kids one day, maybe?

      --
      ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
    5. Re:Stop being a hoarder by houghi · · Score: 2

      Not to say that you are wrong on your point, except on the weird part. Unless you are okay with double standards, or you also consider anybody who keeps photos of parties/graduations/etc weird... Just saying.

      I would. In the olden days you would have 24 photos of a whole week. I have been on holidays for 6 weeks and have a film of 36 photo's to prove it.
      Now you have 300 photo's of just packing your suitcase. So yes, if you keep all those thousands of photo's that are completely irrelevant, then you are weird.
      So limit yourself to a reasonable number of pictures per year that you are allowed to keep. And no, 3000 pictures of your baby is not reasonable. It is punishment for the baby when it gets older. It is as if you are saying that it used to be cute and now you hate it.

      Als when you are old and nothing else to do, then there is your memory. My great-aunt was 155 when she died and the last several years of her life when she did not have any visitors, she would just REMEMBER the events. With her brain.

      And keeping all your letters? Good luck when your wife finds your love letters. It does not matter if that was 10 years before you met her, you will sleep on the couch till they are burned.

      --
      Don't fight for your country, if your country does not fight for you.
    6. Re:Stop being a hoarder by Tom · · Score: 2

      It's not about living in the past, it is about not worrying what to keep and what to store. And the message you think most important today is likely to be completely worthless in five years, while you would love to still have that other message you thought unimportant back then.

      I keep everything so I can decide today which old message I consider important. It is very rarely that I venture into the archive, but I have needed messages two years old at times.

      --
      Assorted stuff I do sometimes: Lemuria.org
  10. Re:I use gmail by the+eric+conspiracy · · Score: 3, Insightful

    So can anyone with a subpoena. And you can bet Google would be running their advertising stuff on that.

    There is no way I would put my life on a public server like that.

  11. Thunderbird - for when you're done with GMail by Nexus7 · · Score: 3, Informative

    I need to archive emails that I can search later - but with a twist. These are employees who've left the company. I can't keep 'em on at Google Apps 'cause I have to pay for that by user. So I use IMAP (making sure to set Chats to be shown in the IMAP list), create an account in Thunderbird, and slurp it all on to the local machine. It keeps all the folders, although I doesn't seem to be smart enough to figure out multiple labels, so it looks like it downloads the same email multiple times, once for it's folder, and once for "All Mail." Then I delete the account at Google. You just have to be sure to click through all the folders in Thunderbird and make sure it is done downloading before you blow the Google account away.

  12. notmuch by Anonymous Coward · · Score: 3, Informative

    http://notmuchmail.org is Gmail for people that don't trust Google. Works great with your existing IMAP server using offlineimap.

  13. Gmail. by jtownatpunk.net · · Score: 3, Insightful

    As soon as gmail made IMAP available, everything went there. I used to get my stuff via POP and saved it all going back to the early 90s. When IMAP went live on gmail, I let it chug away for hours and hours until it was synced and all my archived stuff was stored on my gmail account. They've been bumping up the limit faster than my mail's built up so I'm now at 3.9 gigs used of 10.1 available, holding about twenty years of email. I have email clients on a desktop and couple laptops that I fire up every couple of months to sync with gmail and keep local stores in the event that google screws up and loses my data. (I like to think I'd be smart enough to disconnect from the internet before accessing the local clients if my gmail account ever went blank but I've got multiple copies just in case I forget.)

    I know that won't work for email fiends who pile up a gig a month but it works for me. I don't even bother sorting my email any more. It's faster to just search. Not like the old days when it would take my email client half an hour to slog through all the messages. :)

  14. Maildir by chipperdog · · Score: 2

    Set up a local courier IMAP server and copy mails there, and archive the Maildirs...each message will be a file and you can use tools like grep to search the Maildirs

  15. Re:I use gmail by wisnoskij · · Score: 2

    It is email. AKA over the web. AKA public.
    And someone with a subpoena can get your records off of your ISP, or just come into your house and take it off your computers.

    --
    Troll is not a replacement for I disagree.
  16. mbox + dovecot + mairix by Creosote · · Score: 2

    I wouldn't posit this as the best way, but it's what I do. I keep my archival mail on a local filesytem arranged in directories, stored in the old-school mbox format. I run Dovecot under OS X for IMAP access to those messages from anywhere; when I need to search through the whole collection, I use mairix (an indexing and retrieval system).

  17. You are kidding, right? by certain+death · · Score: 3

    Just delete some goddamn email.. hoarder!

    --
    "My immediate reaction is "WTF? What kind of moron doesn't make things 64-bit safe to begin with?" Linus
  18. Outlook Express! by Vegan+Cyclist · · Score: 2

    heh - i have all my email going back to '98 in Outlook Express. Best email program ever! It's nearly perfect for what i want. (Any way to get it to do inline spell checking, ie, underlines misspelled words as you type?) Still running it on an XP box. Been using Windows Live Essentials a bit for Win8, it's not horrific, but lacks some of the characteristics..hope MS injects some of the OE spirit into it..

  19. Eudora by saccade.com · · Score: 2

    Eudora still runs on my Win7 box. I have email going back to at least the early '90s. All plaintext and easily searchable.

  20. Re:The Email Mandala by Tumbleweed · · Score: 2

    Just sweep it all into the Trash Bin, breathe deep, and move on with your life confident in the impermanence of all things.

    Plus that Trash Bin program has _great_ compression!

  21. Re:maildir + mutt + maildir-utils by Anonymous Coward · · Score: 2, Interesting

    To those saying keeping email forever is hoarding: not if it's done right.

    That's like saying neck deep rooms of newspapers/magazines in a house isn't hording if you stack them neatly with little paths running through it.

  22. Same rules as any archiving: by jrronimo · · Score: 5, Insightful

    I'd say follow the same rules as any archiving of media:

    Pick one format and migrate all of your messages to that: In this case, I'd say mbox. Thunderbird and most other mail programs read it and you can get most of your mail into mbox format via IMAP/Thunderbird from whatever mail client can read your old ones. You can store your mbox files locally in Thunderbird and gain Thunderbird's searching (for instance) without the need for an actual back-end. I was able to read some mail stored in Netscape Mail because it was just mbox files and opening them in Thunderbird was a breeze.

    Most importantly: Every 5-10 years, re-evaluate your storage choice. Is Thunderbird still around? Is mbox still pretty well regarded? If you find you need to migrate again, do it! If both are still active / supported, then hold onto 'em. The only way to perpetually maintain media access is to make sure your choices are still valid on a regular basis. This is true for any media: As the old formats go obsolete (cassette tape, VHS), you need to migrate that data to the next readily accessible format (CDs, DVDs; FLACs, MPEG(?)).

    I think the biggest problem is that you have a mish-mash of stored files right now. You'll save yourself a headache in the future by tearing the band-aid off now and taking the time to get all of your mail into one format. Then, in the future, when you need to convert, it'll be many steps easier since you won't have to visit Slashdot and find out what to do about your mail again next time. :)

  23. maildir: qmail, courier-imapd, roundcube by czth · · Score: 2

    I run qmail for sending/receiving mail (on Gentoo; netqmail package), using maildir, of course. On top of that, I run the Courier IMAP server on my internal network (with TLS encryption). Until a few months ago I used Mutt as a client (console-based), but I've moved to using Roundcube (web-based email), which I initially installed for my wife, and have been happy with it. I also have some automatic filtering to folders via Maildrop (another Courier utility; it looks at a ~/.mailfilter file to route mail).

    Roundcube/the IMAP server's search is OK most of the time - I keep my inbox small and move older mail to sub-folders - when I want to do advanced searches or search large mailboxes I log in and grep through folders of interest; this works well with the maildir format with one file per message. Maildir was also quite resilient when I had a HD crash and needed to recover some lost mail (block scan for blocks that look like mail headers found most missing items, and I do better backups now - mail is under ~/.maildir and gets backed up automatically).

    I would move older messages to maildir (there are plenty of mbox converters, and almost anything non-proprietary should be convertible to mbox or maildir via existing programs or a short perl script) - even if at some point maildir dies off entirely, which seems unlikely, converting it to another format will always be trivial due to its simplicity and it has the advantages mentioned above of being able to search easily with grep etc.

  24. Clueless. Takes 10 minutes start to finish by raymorris · · Score: 3, Informative

    Force feed? WTF are you taking about? Dovecot can use any make mail format. Just set MAILDIR if it's in a non-standard directory. So the whole procedure is:

    yum install dovecot
    vim /etc/dovecot.conf (only if using a nonstandard mail location)
    service dovecot restart
    set username and password in GUI client

    I never will understand why some people feel the need to post on topics they don't have the slightest clue about.

    1. Re:Clueless. Takes 10 minutes start to finish by WillKemp · · Score: 5, Funny

      I never will understand why some people feel the need to post on topics they don't have the slightest clue about.

      Because it's a long standing Slashdot tradition!

  25. i like this for email archiving by chaos4u · · Score: 2

    http://www.mailstore.com/en/mailstore-home.aspx

    works well quick searches and its local .

    unfortunately its windows only but may work fine under wine.

    --
    Music the Paint dancefloor the canvas your body the brush
  26. Completely missing the point... by ZeroPly · · Score: 3, Insightful

    The problem is that a throwaway email might become critically important later on. There is no way to know in advance what is important and what is not.

    True story: while deployed in the Army, our communications guy could not find a piece of equipment which was very important and very pricey. He had been signing the monthly inventory forms saying he had it, assuming it was in a cabinet. He could not find any paperwork showing it was signed out - it had just disappeared sometime in the last 3 months and no one had seen it.

    On a long shot, I started searching my email - since I keep every last one. Sure enough, about 2 months prior, there was a throwaway email from him to the effect that he was going to turn in item X for repair since it was acting flaky. He checked at the contractor mentioned in that email, and it was sitting on the shelf waiting for pickup.

    --
    Support microSD: in a post 9/11 world, it is unwise to carry your data on media that you cannot comfortably swallow.
  27. Parchment by SpaghettiPattern · · Score: 3, Funny

    Parchment -no less- does it for my ancient emails.

    --

    I hadn't the slightest objection to his spending his time planning massacres for the bourgeoisie... (P.G. Wodehouse)
  28. What I did for all about 50,000 emails by gaspyy · · Score: 2

    I had to archive the emails since 1996. They were in multiple formats - Outlook Express dbx, Mailbox from Netscape Navigator and Thunderbird, Outlook.

    I converted all of them in .eml format. It's a simple, text format that can be read by the OS and easily parsed by any program and script. Much better than mbox or something else. Then I renamed all of them according to a rule - YYYYMMDDhhmm [From] [Subject]

    Now I can easily find any email. I can browse them using the file system, I can search them using the OS or via a script. Windows indexes them and extracts the metadata so any search is very quick.

  29. Move on by rich_salz · · Score: 2

    Ugh. Drop all that stuff. Who needs it? My gmail folder has 20 messages in it. Lighten your (psychic) load.