<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

	<title>Comments on: How do I export or convert my Thunderbird inbox into the Unix mailbox format?</title>
	<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format/</link>
	<description>Comments on Ask MetaFilter post How do I export or convert my Thunderbird inbox into the Unix mailbox format?</description>
	<pubDate>Wed, 11 Apr 2007 13:06:04 -0800</pubDate>
	<lastBuildDate>Wed, 11 Apr 2007 13:06:04 -0800</lastBuildDate>
	<language>en-us</language>
	<docs>http://blogs.law.harvard.edu/tech/rss</docs>
	<ttl>60</ttl>

	<item>
		<title>Question: How do I export or convert my Thunderbird inbox into the Unix mailbox format?</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format</link>	
		<description>How do I export or convert my Thunderbird inbox into the Unix mailbox format? &lt;br /&gt;&lt;br /&gt;  I need to compile a spam corpus to test a Beysian spam filter.  Thunderbird mailbox looks like a Unix type mailbox, but it adds some extra headers: X-Account-Key, X-UIDL, X-Mozilla-Status, X-Mozilla-Status2.</description>
		<guid isPermaLink="false">post:ask.metafilter.com,2007:site.60413</guid>
		<pubDate>Wed, 11 Apr 2007 12:47:19 -0800</pubDate>
		<dc:creator>FakeOutdoorsman</dc:creator>
		
			<category>thunderbird</category>
		
			<category>inbox</category>
		
			<category>import</category>
		
			<category>export</category>
		
			<category>unix</category>
		
			<category>spam</category>
		
			<category>corpus</category>
		
			<category>mozilla</category>
		
			<category>convert</category>
		
	</item> <item>
		<title>By: twiggy</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#909460</link>	
		<description>My understanding is that Thunderbird mailboxes are already in unix mailbox format.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-909460</guid>
		<pubDate>Wed, 11 Apr 2007 13:06:04 -0800</pubDate>
		<dc:creator>twiggy</dc:creator>
	</item><item>
		<title>By: twiggy</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#909462</link>	
		<description>Sorry, I clicked submit on accident.  If you really don&apos;t want those headers there, you would have to write a script to remove them (via Perl or any other language)...&lt;br&gt;
&lt;br&gt;
There&apos;s really no need to, though.  Since they&apos;re present in all of your messages good and bad, the Bayesian filter shouldn&apos;t look at them as bad or anything.  It&apos;s already in valid unix mailbox format, it just has some extra headers.  That doesn&apos;t make it incompatible though.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-909462</guid>
		<pubDate>Wed, 11 Apr 2007 13:07:53 -0800</pubDate>
		<dc:creator>twiggy</dc:creator>
	</item><item>
		<title>By: cmiller</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#909495</link>	
		<description>Fwiw, instead of &quot;Unix format&quot;, you should describe it as &quot;mbox&quot;, assuming that&apos;s what you mean (instead of &quot;Maildir&quot;, e.g.).  You&apos;ll get more search hits that way.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-909495</guid>
		<pubDate>Wed, 11 Apr 2007 13:35:14 -0800</pubDate>
		<dc:creator>cmiller</dc:creator>
	</item><item>
		<title>By: sergeant sandwich</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#909725</link>	
		<description>to strip the unwanted headers:&lt;br&gt;
&lt;br&gt;
&lt;pre&gt;grep -v X-account-key &lt;i&gt;yourfile&lt;/i&gt; | grep -v X-UIDL | ... &lt;i&gt;etc&lt;/i&gt; ... &amp;gt; &lt;i&gt;yournewfile&lt;/i&gt;&lt;/pre&gt;&lt;br&gt;
assuming, of course, that you don&apos;t have those strings in any of the body of your messages.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-909725</guid>
		<pubDate>Wed, 11 Apr 2007 16:43:02 -0800</pubDate>
		<dc:creator>sergeant sandwich</dc:creator>
	</item><item>
		<title>By: sergeant sandwich</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#909728</link>	
		<description>there&apos;s probably a cleverer way to do that using fgrep though.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-909728</guid>
		<pubDate>Wed, 11 Apr 2007 16:44:44 -0800</pubDate>
		<dc:creator>sergeant sandwich</dc:creator>
	</item><item>
		<title>By: sergeant sandwich</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#909729</link>	
		<description>egrep i mean.  argh</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-909729</guid>
		<pubDate>Wed, 11 Apr 2007 16:45:06 -0800</pubDate>
		<dc:creator>sergeant sandwich</dc:creator>
	</item><item>
		<title>By: oats</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#909800</link>	
		<description>&lt;code&gt;egrep -v &apos;^(X-account-key|X-UIDL|X-Mozilla-Status)&apos; yourfile &amp;gt; yournewfile&lt;/code&gt;&lt;br&gt;
&lt;br&gt;
Forcing it to find only at the beginning of the line (&apos;&lt;code&gt;^&lt;/code&gt;&apos;) is both safer and faster.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-909800</guid>
		<pubDate>Wed, 11 Apr 2007 17:50:46 -0800</pubDate>
		<dc:creator>oats</dc:creator>
	</item><item>
		<title>By: cmiller</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#912426</link>	
		<description>*Ahem*, it&apos;s legal to have those in the body of the message.   Imagine, e.g., what happens when my email contains an encoded attachment with a line that starts with &quot;X-UIDL&quot;.&lt;br&gt;
&lt;br&gt;
You need a state machine.  You can do it with sed.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-912426</guid>
		<pubDate>Sat, 14 Apr 2007 11:21:15 -0800</pubDate>
		<dc:creator>cmiller</dc:creator>
	</item><item>
		<title>By: cmiller</title>
		<link>http://ask.metafilter.com/60413/How-do-I-export-or-convert-my-Thunderbird-inbox-into-the-Unix-mailbox-format#912430</link>	
		<description>Oh, and RFC822 (or is it 823?) headers can span several lines.&lt;br&gt;
&lt;br&gt;
Header: one two three next line starts with whitespace&lt;br&gt;
   four five six grep cannot help with this.&lt;br&gt;
&lt;br&gt;
I hate to be a spoil-sport here.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.60413-912430</guid>
		<pubDate>Sat, 14 Apr 2007 11:23:26 -0800</pubDate>
		<dc:creator>cmiller</dc:creator>
	</item>
	</channel>
</rss>
