How should I convert a Thunderbird inbox into text files?
March 14, 2008 11:57 AM   RSS feed for this thread Subscribe

What's the best way to export all of the emails in a Thunderbird inbox to text files, one text file per email?

I'm planning on doing some linguistic analysis of emails and need to take a Thunderbird inbox and convert it into text files before I get to work. Google searches have revealed many mbox (which I think Thunderbird uses) parsers of, I am sure, varying quality. I'd love to skip the endless trial and error stage and start with the perfect tool. What's the best way to convert a Thunderbird inbox to text files?

ps - I'm comfortable using Perl and Python if you guys think a scripting solution is best.
posted by eisenkr to computers & internet (6 comments total) 8 users marked this as a favorite
mbox is an old and well-understood format, so I'd go with a solution that's been around for some time and trust it.

Perl's MessageParser class should do what you want.
posted by zippy at 12:07 PM on March 14


What OS? On XP, you can install a generic "print to file" printer, select all the messages in the Inbox and print them directly to individual text files using that printer.
posted by bizwank at 12:14 PM on March 14


Thunderbird stores mail as mbox, Sylpheed stores them as Maildir, which is essentially one file per message.

So, if you just install Sylpheed and use the file/import to import your mbox file, you will be done.
posted by gmarceau at 12:34 PM on March 14


I recently exported a few years' worth of email to .eml format from Thunderbird using the SmartSave extension. An .eml file is basically a text file that includes some header information at the beginning. I think from there you could just do a batch rename of all your newly-created files (*.eml -> *.txt).
posted by good in a vacuum at 1:28 PM on March 14 [1 favorite]


good in a vacuum FTW!!

I've been looking to do this for a while. I just installed SmartSave, and it looks like it does great job. You can even configure it to use the .txt extension automatically.
posted by ochenk at 1:40 PM on March 14


Thanks guys! Now I just have to strip out all the html people shove in their email and I'll be ready to go.
posted by eisenkr at 4:55 PM on March 14


« Older I'm looking into starting a co...   |   What are retorts for some comm... Newer »

You are not logged in, either login or create an account to post comments



Related Questions
How do I make simple dynamic web pages using perl? August 22, 2008
Shell scripting or something better? May 9, 2008
Webumake Nomakeweb...Anymore. January 3, 2008
Graphical images script November 12, 2004
How do I export email from Thunderbird to Outlook... February 4, 2004