77 posts tagged with ocr.
Displaying 1 through 50 of 77.

small-scale document digitization services? document retention policies?

I work for a small non-profit that has had six linear feet of historical records on paper in somebody's basement for the last few decades. Can the hive mind (1) recommend a company that can scan/OCR these papers for us and (2) recommend resources for teaching me how to design and implement a document retention policy? [more inside]
posted by d. z. wang on Sep 6, 2016 - 4 answers

How can I nuke bad OCR text in a lot of PDFs and replace with good?

I've inherited a ton of PDFs of scanned documents that are somewhat readable. The sources were actual paper documents run through a scanner. Now I'm trying to make the OCR'd text readable by screen readers in one way or another. Looking for advice or even keywords to Google. Situational specs ahoy! [more inside]
posted by tilde on May 11, 2016 - 13 answers

Best way to tabulate grocery store receipts

I'd like to start putting the details of all my grocery receipts into a spreadsheet. What's the simplest and quickest way to do that? [more inside]
posted by alms on Mar 4, 2015 - 13 answers

Financial Record Digitization Services - do they exist?

I'm looking for a service like Shoeboxed or MakeSpace, where I can get several decades' worth of my parents' financial records scanned/indexed and then sent back to me to go into a regular storage unit. Extra snowflake information about sibling's information security paranoia. [more inside]
posted by gov_moonbeam on Feb 26, 2015 - 4 answers

Best way to scan and OCR a moderate number of pages

I have ~9,000 pages I'd like scanned and OCRed, spread across 20 books. What will this cost me? [more inside]
posted by dilaudid on Oct 9, 2014 - 7 answers

How to produce an editable PDF doc?

I have a scanned document with some text that I'd like to edit. [more inside]
posted by massofintuition on Sep 26, 2014 - 6 answers

What's the state-of-the-art in paperlessness?

I would really like to be paperless. As soon as I receive them, I'd like non-junk mail and documents to be digitized, then shredded. [more inside]
posted by ignignokt on Sep 21, 2014 - 14 answers

OCRing a non-standard font

I have a dead-tree book that I want to use OCR software on, but it has a strange font. My attempts so far have not been very successful. [more inside]
posted by Solomon on Aug 26, 2014 - 4 answers

Replacement for Scansnap S1500

What is a good substitute for the Fujitsu Scansnap S1500M? [more inside]
posted by Polyhymnia on Aug 15, 2014 - 2 answers

Strategy to proofread document from OCR

I have a document in English of about 1500 pages that was originally derived from OCR scans of varying quality. I've proofread manually and by spell-check. What is the best strategy to eliminate the remaining errors? [more inside]
posted by dances_with_sneetches on Aug 12, 2014 - 11 answers

Japanese OCR for Mac?

So what's the market look like for Japanese OCR software for Mac OS? I'm a bit bewildered and feeling out of my league upon googling around. It would be rad to be pointed in the right direction on this.
posted by DoctorFedora on Jul 18, 2014 - 2 answers

Name that late-'60s programming language or data representation format!

Yesterday I picked up a piece of ceramic bric-à-brac promoting RCA's Electronic Data Processing division (active mid-1950s to 1971). It features ten different ways to represent data or algorithms, of which I recognize many, but not all. Can you name the rest? Bonus: Can you decode the ones with actual data? [more inside]
posted by Lazlo Nibble on Jul 13, 2014 - 10 answers

OCR, emphasizing the R

Does anyone have any recommendations for OCR software that focuses on the "recognition" part? [more inside]
posted by dilaudid on May 6, 2014 - 6 answers

Corpus of printed letter images

I like playing around with text recognition algorithms, but am stymied by a lack of a good corpus to train and test my code against. I'm looking for a large number of images of individual printed letters, labelled with the correct letter. (With the letter in the file name, or each set of letters in a directory, or something equivalent like a metadata file.) Something like this, but more of it.
posted by Zarkonnen on Jan 16, 2014 - 3 answers

Free or Low Cost OCR Solutions?

Does anyone have any recommendations for a free or low cost solution that will let me scan to searchable PDF using OCR? [more inside]
posted by soy_renfield on May 28, 2013 - 10 answers

Batch job screen capture from database help?

Let's say there is a database that I can only access from a front end tool, and that database cannot provide any extract of any sort. I also have no access to run reports off that database. And let's say that the only way to preserve that data is to print to pdf or do screen shots. btw- all legit and you are not helping me do something malicious. [more inside]
posted by BrodieShadeTree on May 21, 2013 - 14 answers

But I don't *want* anyone to recognize it...

How do I prevent OCR on a document (typically a PDF but I could use another document format if necessary)? I know that when I scan it from a hard copy to a PDF I can disable/stop the OCR process, but Adobe allows it to happen on any PDF I scan in, whether OCR was eliminated at scanning or not, and I have to stop that (I have work product I'd like to distribute electronically, but my boss would like to make sure it's not searchable and it's as hard as I can make it to copy). I can use any software or process within reason.
posted by mrs. taters on Jan 25, 2013 - 27 answers

Scanning many old, thin pages

I have inherited 1,000+ pages of my grandmother's writings. I would like to scan them, OCR them and (after fixing OCR mistakes) share them with the rest of my family online. My question is this: what's the best way to scan so many pages? Also, I should point out that many of the pages are on thin typing paper. Maybe this is carbon paper? or onion skin paper? I'm not sure but I don't want to damage the originals. [more inside]
posted by dylan_k on Oct 5, 2012 - 15 answers

Solution to OCR many bank statements into excel

Need to get data from hundreds of pages of bank records into a spreadsheet. We have a scanner with a document feeder, but would love some recommendations on software/workflow ideas. [more inside]
posted by capnsue on Sep 29, 2012 - 6 answers

Paper management software with OCR

I've got a Scansnap printer and I'm ready to go paperless -- but I need the right Mac software to manage my scanned documents. None of the options I've found seem quite right. [more inside]
posted by yankeefog on Sep 4, 2012 - 6 answers

Performing OCR on a Fraktur/Blackletter PDF and getting a searchable image PDF

I have Adobe Acrobat X Pro on Windows 7. Is there any free or inexpensive way to use OCR to create searchable image PDFs from image-only PDFs of texts written in German in Fraktur/Blackletter script? [more inside]
posted by dhens on Apr 28, 2012 - 14 answers

Looking for an OCR program that will handle batch processing and columns automatically.

We're looking for an OCR program that will handle batch processing and columns automatically. [more inside]
posted by pahool on Nov 28, 2011 - 5 answers

OCR solutions?

Looking for hardware/software that will scan a physical form with handwritten fields and generate accurate delimited text from it. [more inside]
posted by jsturgill on Nov 15, 2011 - 2 answers

Digitize my Rolodex!

What are my best options for digitizing a large collection of business cards? Bonus points for being able to integrate into MS Outlook. [more inside]
posted by bonehead on Nov 3, 2011 - 4 answers

Paper, begone!

My (very) small office just got a new whiz-bang scanner that can scan stacks of paper to image-only PDFs. Ideally, I'd like to use this to do away with paper filing, but this is harder (for me) than it sounds. [more inside]
posted by 4ster on Oct 9, 2011 - 10 answers

Ease my Prezi pains

I'm trying to turn PDFs made from presentations on the Prezi website (prezi.com) into text documents and am looking for a program to OCR them with. Since Prezi's PDFs come out rather odd with some text trailing off the edge that isn't pertinent to the current slide I need something that will allow me to make a selection box around the text I wish to OCR as opposed to auto OCR'ing the entire page. What Windows program should I be looking at?
posted by MeatyBean on Aug 29, 2011 - 7 answers

Image PDF with OCR - how, without Acrobat?

The full version of Adobe Acrobat has a way to OCR scanned images, so that the image is still viewed in the PDF, but you can search for text in the document. How do you do that without Acrobat? [more inside]
posted by AzraelBrown on May 25, 2011 - 7 answers

Free Mac OCR/E-book software/toolchain

How can I get from paper to e-books for free on a Mac? [more inside]
posted by tellumo on May 22, 2011 - 7 answers

Help me choose a new A Level Psychology specification

Currently I teach A Level Psychology using the AQA A spec and want to change exam boards. I am thinking of Edexcel rather than OCR but wondered if anyone has opinions that they could share with me. Pros and cons for either and/or each if possible. [more inside]
posted by mairuzu on May 6, 2011 - 0 answers

How do I import the data from hundreds of documents into spreadsheets?

How do I use OCR to scan a standard document into an excel spreadsheet? [more inside]
posted by perpetualstroll on May 3, 2011 - 3 answers

Not just for sniffing

How do I use a highlighter? [more inside]
posted by hal_c_on on Mar 27, 2011 - 29 answers

"Pen scanner" recommendations? Is it even called a pen scanner?

"Pen scanner" recommendations? Is it even called a pen scanner? [more inside]
posted by ZenMasterThis on Dec 23, 2010 - 3 answers

Need 20 books scanned, OCRed and proofed. Robots need not apply.

I need to have about 20 novels scanned, OCRed and professionally proofed for conversion to ebooks. Destructive scanning is acceptable. Have you had recent experiences with a company that provides such a service? What did it cost, and how was their proofing? [more inside]
posted by mumkin on Sep 30, 2010 - 6 answers

What is a great OCR program?

Suggest to me a fantastic OCR program capable of preserving formatting. [more inside]
posted by maize on Sep 22, 2010 - 3 answers

Linux OCR

Linux script to parse all files in file tree and submit to a program? [more inside]
posted by yoyo_nyc on Aug 15, 2010 - 12 answers

Help me wrangle PDFs

I'm trying to go paperless, and have scanned and OCR'd huge stacks of paperwork into PDF documents. Can you recommend a tool to split, merge, delete pages etc from PDFs? [more inside]
posted by blacksky on Jul 2, 2010 - 9 answers

Handwriting OCR

I'm looking for a handwriting OCR program for OS X. [not evernote] [more inside]
posted by Brent Parker on Jun 12, 2010 - 4 answers

Wanted: "Programmatic Handwriting Recognition for Dummies".

How can I learn how to programmatically recognize Japanese handwriting? [more inside]
posted by Vorteks on Apr 13, 2010 - 9 answers

Soliciting scanner software suggestions.

Any recommendations for scanning / PDF management software for my Mac that is cheap or, better yet, free? [more inside]
posted by synecdoche on Mar 21, 2010 - 9 answers

A different kind of OCR

Image Data acquisition.... Is there software out there that can take an image and translate it to data? Kinda like a specialized OCR software. [more inside]
posted by jmsta on Jan 8, 2010 - 2 answers

Recommendations of software/hardware combo to scan magazines

I have 60 to 100 magazines. I want to scan them all completely and then use OCR software so that they would be searchable. I am looking for recommendations on the scanner I should use, as well as the OCR software. I have a flatbed scanner, so with good software, I could probably take the time to do it, but a scanner that would let me feed in several pages at a time would be best. WinXP is the OS of my computer. Ideally, I would want to spend no more than $300, with $500 as an absolute max.
posted by Mr.X on Dec 26, 2009 - 9 answers

Going Paperless and need a Document Imaging Service

What document imaging company do you recommend that is user-friendly, cheap, and secure? [more inside]
posted by Merlin144 on Dec 25, 2009 - 6 answers

Best digitizing options for dual-language documents?

Options besides PDF for digitizing dual-language books? [more inside]
posted by JoannaC on Dec 20, 2009 - 8 answers

Rental of Fujitsu ScanSnap (or Similar) in Chicago Area? Or General Advice?

At some point in the future, I'd like to follow the recommendations of some various sites I've seen online and scan most of my paper archives to PDF. It looks like this would be the best solution (I'm a Mac user) – or, at least, that's the device I've seen recommended a few million times. However, I really can't see having the discretionary $417+ to purchase this device, not for a very, very long time. Does anyone offer this device for rental? (I live in Chicago.) Is there a RipDigital equivalent for this kind of thing (a very long time ago, they did the initial move of my music from CDs to MP3s)? Are there cheaper alternatives that are just as good?
posted by WCityMike on Nov 21, 2009 - 10 answers

My (really not great) handwriting into pretty, pretty computer text?

My (really not great) handwriting into pretty, pretty computer text? Is Livescribe the answer? [more inside]
posted by eb98jdb on Sep 23, 2009 - 4 answers


ATM check deposits. OCR or just instantaneous offshore data entry? TIA.
posted by @troy on Aug 6, 2009 - 16 answers

What document scanner do you recommend for many filing cabinets' worth of scanning?

Where can I find recommendations for a document scanner to handle what I'd consider "medium - large volume" scanning of our office files? [more inside]
posted by dukes909 on Jul 27, 2009 - 7 answers

Document mis-management?

How can I convince a document management vendor to stop embracing 100 DPI / JPG as a universal format for scanned documents? [more inside]
posted by anonymous on May 20, 2009 - 13 answers

scanner pen recommendations

My husband has been wondering about getting an OCR scanner pen. There have been a few questions before on this topic, but the technology may have moved on, and he has some specific needs. [more inside]
posted by jb on Apr 17, 2009 - 13 answers

Image processing of scanned text

I have 18 copies of a 47-page document, scanned with handwriting on them. I want to extract the handwritten bits (i.e. compare, page-by-page, and eliminate the "constant" part), despite skewing, offset, and some noise in some copies. I want to use Perl or Python with e.g. ImageMagick or gd or something. Any pointers? I'm not talking about OCR -- just comparison, with one output being the graphical bits that don't match. [more inside]
posted by Michael Roberts on Apr 9, 2009 - 12 answers
