<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

	<title>Comments on: Looking for mostly text .tif files</title>
	<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files/</link>
	<description>Comments on Ask MetaFilter post Looking for mostly text .tif files</description>
	<pubDate>Thu, 26 Apr 2012 11:41:40 -0800</pubDate>
	<lastBuildDate>Thu, 26 Apr 2012 11:58:34 -0800</lastBuildDate>
	<language>en-us</language>
	<docs>http://blogs.law.harvard.edu/tech/rss</docs>
	<ttl>60</ttl>

	<item>
		<title>Question: Looking for mostly text .tif files</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files</link>	
		<description>I&apos;m looking to download a large number, at least 500, of mostly text .tif files to use in developing a piece of software. &lt;br /&gt;&lt;br /&gt; Optimal would be a wikileaks email data set, but I haven&apos;t been able to find that as .tif files</description>
		<guid isPermaLink="false">post:ask.metafilter.com,2012:site.213904</guid>
		<pubDate>Thu, 26 Apr 2012 11:41:40 -0800</pubDate>
		<dc:creator>rakish_yet_centered</dc:creator>
		
			<category>tiff</category>
		
			<category>resolved</category>
		
	</item>
	<item>
		<title>By: AzraelBrown</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086461</link>	
		<description>Call your county recorder -- most have gone digital, they&apos;re public records available to anyone so there&apos;s no privacy issues about disclosure, so they may be able to just dump a bunch of a thumbdrive or CD for you.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086461</guid>
		<pubDate>Thu, 26 Apr 2012 11:58:34 -0800</pubDate>
		<dc:creator>AzraelBrown</dc:creator>
	</item><item>
		<title>By: AzraelBrown</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086467</link>	
		<description>Also, the free version of &lt;a href=&quot;http://www.bullzip.com/products/pdf/info.php&quot;&gt;Bullzip PDF printer&lt;/a&gt; will let you print almost anything to a TIF, with the option to save as Group4 and 1-bit scans, like a fax and like most OCR enjoys.   So, load a large Wikileaks file, print it to a TIF using bullzip, and you&apos;ll have your TIFF version.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086467</guid>
		<pubDate>Thu, 26 Apr 2012 12:01:40 -0800</pubDate>
		<dc:creator>AzraelBrown</dc:creator>
	</item><item>
		<title>By: supercres</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086468</link>	
		<description>What are the parameters of what you need?  Does it need to be scanned?  If not, there are tons of utilities that will export a PDF as a series of TIFFs, and PDFs abound.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086468</guid>
		<pubDate>Thu, 26 Apr 2012 12:01:54 -0800</pubDate>
		<dc:creator>supercres</dc:creator>
	</item><item>
		<title>By: rakish_yet_centered</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086485</link>	
		<description>Getting PDF&apos;s first?  Could do that, but it isn&apos;t optimal.  When I poked around wikileaks I only found emails embedded in HTML pages. I like the county recorder idea though&lt;br&gt;
&lt;br&gt;
Scanned? Yes. It&apos;s a document review program that converts tif to txt, creates a thumbnail, that sort of thing</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086485</guid>
		<pubDate>Thu, 26 Apr 2012 12:11:00 -0800</pubDate>
		<dc:creator>rakish_yet_centered</dc:creator>
	</item><item>
		<title>By: rakish_yet_centered</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086501</link>	
		<description>Scanned? I meant no...</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086501</guid>
		<pubDate>Thu, 26 Apr 2012 12:18:59 -0800</pubDate>
		<dc:creator>rakish_yet_centered</dc:creator>
	</item><item>
		<title>By: cmiller</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086545</link>	
		<description>If you will drop the &quot;TIFF&quot; requirement up front, you might get more sources.  ImageMagick will convert a batch of images to whatever format you want.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086545</guid>
		<pubDate>Thu, 26 Apr 2012 12:55:01 -0800</pubDate>
		<dc:creator>cmiller</dc:creator>
	</item><item>
		<title>By: rakish_yet_centered</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086560</link>	
		<description>The .tif is not really a requirement, I use imagemagick, or graphicsmagick, I forget which, to convert from image to text, and image,  But tiff is better, because that what the people I know actually used for document review projects.&lt;br&gt;
&lt;br&gt;
You&apos;re right though, in the end I&apos;ll probably be using PDF&apos;s</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086560</guid>
		<pubDate>Thu, 26 Apr 2012 13:07:40 -0800</pubDate>
		<dc:creator>rakish_yet_centered</dc:creator>
	</item><item>
		<title>By: demiurge</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086668</link>	
		<description>http://archive.org/details/opensource_English&lt;br&gt;
&lt;br&gt;
They are in pdf, but it should not be difficult to separate the pdfs into individual images.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086668</guid>
		<pubDate>Thu, 26 Apr 2012 14:14:57 -0800</pubDate>
		<dc:creator>demiurge</dc:creator>
	</item><item>
		<title>By: rakish_yet_centered</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086721</link>	
		<description>Internet Archive is a good idea, not exactly what I was looking for, but it might have to do</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086721</guid>
		<pubDate>Thu, 26 Apr 2012 15:03:40 -0800</pubDate>
		<dc:creator>rakish_yet_centered</dc:creator>
	</item><item>
		<title>By: wongcorgi</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3086862</link>	
		<description>Why dont you just take a PDF book / pamphlet, etc, and save it in to individual TIF files?</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3086862</guid>
		<pubDate>Thu, 26 Apr 2012 17:27:55 -0800</pubDate>
		<dc:creator>wongcorgi</dc:creator>
	</item><item>
		<title>By: kristi</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3088604</link>	
		<description>I have over 6000 pages of evidence from the investigation into Lisa McPherson&apos;s death at &lt;a href=&quot;http://www.lisafiles.com/&quot;&gt;The Lisa McPherson Files&lt;/a&gt; (formerly &lt;a href=&quot;http://projects.metafilter.com/3202/The-Lisa-McPherson-Files&quot;&gt;on MeFi Projects&lt;/a&gt;), all in TIF format.&lt;br&gt;
&lt;br&gt;
Most are typed, but some (especially the Scientology-produced documents) are handwritten. You&apos;d find mostly typed materials in the &lt;a href=&quot;http://www.lisafiles.com/police/index.html&quot;&gt;listing of police documents&lt;/a&gt;.&lt;br&gt;
&lt;br&gt;
For example, this &lt;a href=&quot;http://www.lisafiles.com/0708.html&quot;&gt;1-page Florida Department of Law Enforcement summary&lt;/a&gt; has a link to &lt;a href=&quot;http://www.lisafiles.com/lf.cgi?http://www.kristi-wachter.com/lisafiles/07/0708.tif&quot;&gt;its TIF file&lt;/a&gt;.&lt;br&gt;
&lt;br&gt;
If these would be useful to you, feel free to MeMail me if I can make them easier for you to download. If you have Dropbox or an FTP directory, I&apos;d be happy to send you over the whole collection of TIFs, or the subset that&apos;d be most useful to you.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3088604</guid>
		<pubDate>Sat, 28 Apr 2012 13:19:10 -0800</pubDate>
		<dc:creator>kristi</dc:creator>
	</item><item>
		<title>By: rakish_yet_centered</title>
		<link>http://ask.metafilter.com/213904/Looking-for-mostly-text-tif-files#3088868</link>	
		<description>Yes kristi, that is exactly what I&apos;m looking for, me-mailing</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2012:site.213904-3088868</guid>
		<pubDate>Sat, 28 Apr 2012 18:33:20 -0800</pubDate>
		<dc:creator>rakish_yet_centered</dc:creator>
	</item>
	</channel>
</rss>
