<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

	<title>Comments on: And Jesus did turn image into text</title>
	<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text/</link>
	<description>Comments on Ask MetaFilter post And Jesus did turn image into text</description>
	<pubDate>Sat, 13 Dec 2008 09:47:51 -0800</pubDate>
	<lastBuildDate>Sat, 13 Dec 2008 09:47:51 -0800</lastBuildDate>
	<language>en-us</language>
	<docs>http://blogs.law.harvard.edu/tech/rss</docs>
	<ttl>60</ttl>

	<item>
		<title>Question: And Jesus did turn image into text</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text</link>	
		<description>Is there a program I can download/buy/steal/borrow that can &quot;read&quot; scanned pictures of letters and transform them into digital text, like the letters I&apos;m typing right now? Ideally this would include the unique letters and diacritical marks in many other languages like Portuguese, French, Russian, and Chinese. I ask because I would like to start reading the strange and fanciful books I&apos;ve obtained overseas by scanning the pages and running them through the web 2.5 choiceness that is &lt;a href=&quot;http://www.google.com/language_tools?hl=en&quot;&gt;Google Fish&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt; I assume such a program is responsible for how I can &lt;a href=&quot;http://books.google.com/books?q=%D0%9C%D0%BD%D0%B5+%D0%BD%D1%80%D0%B0%D0%B2%D1%8F%D1%82%D1%81%D1%8F+%D0%B3%D0%BE%D0%BB%D1%8B%D0%B5+%D0%B6%D0%B5%D0%BD%D1%89%D0%B8%D0%BD%D1%8B&amp;btnG=Search+Books&quot;&gt;search the text&lt;/a&gt; &lt;a href=&quot;http://books.google.com/books?q=%E6%88%91%E5%96%9C%E6%AC%A2%E4%B8%AD%E5%9B%BD%E8%A3%B8%E4%BD%93%E5%A5%B3%E4%BA%BA&amp;btnG=Search+Books&quot;&gt;inside all those&lt;/a&gt; &lt;a href=&quot;http://books.google.com/books?q=%22it+was+a+dark+and+stormy+night%22&amp;btnG=Search+Books&quot;&gt;Google Books&lt;/a&gt; (Either that or Google hired one million ancient Sumerian scribes to hand type the text of every book ever written).&lt;br&gt;
&lt;br&gt;
&lt;small&gt;And if such a program exists you are all free to copy my brilliant idea.&lt;/small&gt;</description>
		<guid isPermaLink="false">post:ask.metafilter.com,2008:site.109130</guid>
		<pubDate>Sat, 13 Dec 2008 09:41:04 -0800</pubDate>
		<dc:creator>dgaicun</dc:creator>
		
			<category>languagetools</category>
		
			<category>language</category>
		
			<category>translation</category>
		
	</item> <item>
		<title>By: kindall</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1571503</link>	
		<description>Commercial, off-the-shelf OCR software has existed for, oh, 15 years. ReadIris, OmniPage, Textbridge are popular OCR products. I believe Acrobat Pro does it too if you have a PDF that is all image rather than text.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1571503</guid>
		<pubDate>Sat, 13 Dec 2008 09:47:51 -0800</pubDate>
		<dc:creator>kindall</dc:creator>
	</item><item>
		<title>By: 517</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1571507</link>	
		<description>&lt;a href=&quot;http://code.google.com/p/ocropus/&quot;&gt;Ocropus&lt;/a&gt;. I can&apos;t get it to work, but I am pretty new to Linux.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1571507</guid>
		<pubDate>Sat, 13 Dec 2008 09:54:19 -0800</pubDate>
		<dc:creator>517</dc:creator>
	</item><item>
		<title>By: dgaicun</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1571512</link>	
		<description>OK, &lt;a href=&quot;http://en.wikipedia.org/wiki/Optical_character_recognition&quot;&gt;&quot;optical character recognition&quot;&lt;/a&gt;, thanks. Holy crap, I have Acrobat Pro!&lt;br&gt;
&lt;br&gt;
I&apos;ll go check out what it can do; in the meantime, others with experience, please give me recommendations and tips that you think will help with my specific OCR needs.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1571512</guid>
		<pubDate>Sat, 13 Dec 2008 09:59:48 -0800</pubDate>
		<dc:creator>dgaicun</dc:creator>
	</item><item>
		<title>By: soma lkzx</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1571515</link>	
		<description>Chinese is going to be the difficult part of this picture, I think. I highly recommend ReadIris Pro + the Asian language pack - I&apos;ve used it for Japanese and Korean and it works great, so I don&apos;t doubt its Chinese abilities.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1571515</guid>
		<pubDate>Sat, 13 Dec 2008 10:06:13 -0800</pubDate>
		<dc:creator>soma lkzx</dc:creator>
	</item><item>
		<title>By: McGuillicuddy</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1571636</link>	
		<description>Many scanners (and scanner/printers) come with OCR software, so you may want to look at the complete set of software/drivers that came with your scanner. If you are in the market for a new scanner, you can definitely find a fairly cheap one with OCR support.&lt;br&gt;
&lt;br&gt;
Also, there is a slightly less busy version of &lt;a href=&quot;http://translate.google.com/translate_t?hl=en#&quot;&gt;Google Translate&lt;/a&gt;.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1571636</guid>
		<pubDate>Sat, 13 Dec 2008 12:57:47 -0800</pubDate>
		<dc:creator>McGuillicuddy</dc:creator>
	</item><item>
		<title>By: low affect</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1571789</link>	
		<description>Evernote does OCR. Not sure how well it will work with non-Latin alphabets, though.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1571789</guid>
		<pubDate>Sat, 13 Dec 2008 16:28:17 -0800</pubDate>
		<dc:creator>low affect</dc:creator>
	</item><item>
		<title>By: NailsTheCat</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1571823</link>	
		<description>If you have MS Office installed, Microsoft Office Document Imaging has OCR. You may have to add it to your installed programs if it wasn&apos;t installed initially. Don&apos;t know how its performance compares with other products.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1571823</guid>
		<pubDate>Sat, 13 Dec 2008 17:21:35 -0800</pubDate>
		<dc:creator>NailsTheCat</dc:creator>
	</item><item>
		<title>By: Andorinha</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1571860</link>	
		<description>Sorry to point out the obvious, but either your objective is the sheer lulz that reading things that have gone through an automatic translator brings, or there is a rather large flaw in your plan.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1571860</guid>
		<pubDate>Sat, 13 Dec 2008 18:42:32 -0800</pubDate>
		<dc:creator>Andorinha</dc:creator>
	</item><item>
		<title>By: jduckles</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1574079</link>	
		<description>For a FOSS (Free Open Source Software) solution, try &lt;a href=&quot;http://code.google.com/p/tesseract-ocr/&quot;&gt;tesseract&lt;/a&gt;. Here is a &lt;a href=&quot;http://jduck.net/2008/01/05/ocr-scanning/&quot;&gt;script&lt;/a&gt; to get you started with automating OCR in Linux with tesseract.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1574079</guid>
		<pubDate>Mon, 15 Dec 2008 19:05:00 -0800</pubDate>
		<dc:creator>jduckles</dc:creator>
	</item><item>
		<title>By: webwesen</title>
		<link>http://ask.metafilter.com/109130/And-Jesus-did-turn-image-into-text#1575979</link>	
		<description>I got much better OCR results from MS Office Doc Scanning than from the tool that comes with my Canon scanner. MS Office Doc Scanning is included in the Office package.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.109130-1575979</guid>
		<pubDate>Wed, 17 Dec 2008 11:10:09 -0800</pubDate>
		<dc:creator>webwesen</dc:creator>
	</item>
	</channel>
</rss>
