New scanner, same settings, PDFs 4x bigger. WTF?
March 23, 2011 1:38 PM Subscribe
New network scanner/copier. Scanned files (PDF) are 4x larger than old model for same settings. What? Why?
We had a Canon c5185 printer/copier/scanner. We're a mostly paperless office; what paper we do handle gets run through the scanner and sent to a network share, or delivered to our desks via email from the copier as PDF files/attachments.
It was recently replaced with a Ricoh c6501. Nice unit, but there's an issue about the size of the PDF files that result from scanning paper documents. WAY too big.
45 pages in B&W at 300 dpi = 4.6MB (vs 1.3MB on the old unit)
Same pages scanned in color at 300= 16MB !!!
This is way too big - it's choke our email system and our storage needs are going to balloon. The copier vendor isn't giving me any help in explaining why the same pages scanned at the same resolution are coming out 4x larger in size on this new scanner.
I have discovered that the old Canon used to deliver its PDFs as already text-searchable, but the Ricoh does not. That means that the Canon had some onboard image processing that would run the PDf through OCR before delivering it. Contacting Ricoh, none of their units do OCR internally.
We already have the PDF compression settings turned up to the highest, but our scans are still 4x larger than they should be. And they're not even OCR'd - which I would think would make the file larger, no? I know we can shrink the file size and do OCR manually using Acrobat, but nobody wants to do extra steps on every piece of paper we handle, especially when the old scanner did it automatically.
Has anyone had this issue with Ricoh or other scanner / multifunction devices, where the scan files you get are ridiculously bloated?
posted by bartleby to technology (9 answers total)
If the old copier did OCR, than the PDF 'guts' were letters/symbols/glyphs, rather than graphical data.
ie the new copier is sending you a picture. The old copier sent you text [when it could] with embedded pictures.
(Sorta like typing a letter in a text editor and saving it versus using MS Paint to put the same text on a canvass. The text editor letter will be tiny, the paint bitmap huge, even though the content is the same)
posted by k5.user at 1:46 PM on March 23, 2011