skip to main content
2 posts tagged with text by Mo Nickels.
Displaying 1 through 2 of 2.
I am in need of a server-side Linux or Unix-based software solution that will sort uploaded PDF files that can be PDF-native (that is, created in such a way that the text in the PDF is freely copyable), PDFs with embedded text over images (usually the result of a previous OCR job), and PDF-scanned, which are PDFs containing no text, only scanned images. The PDF-native files and PDFs with embedded text it will extract text from, the PDF-scanned files it will then OCR and export that text. [more inside]
posted by Mo Nickels
on Mar 17, 2008 -