Adobe and the inflexible acrobat
April 10, 2008 9:30 AM
Subscribe
I have a few pdfs of tabular data that I would like to use in a spreadsheet and/or database. Acrobat Reader is not cooperating; it converts obvious tabs into spaces. Is there some way to get this data, properly formatted?
I have 3 pdfs of 8 columns of data, roughly 100+ pages apiece. The data is clearly in a column format, but copy + paste converts the tabs into spaces, regardless of whether it comes from a browser plugin or from Acrobat Reader itself, and regardless of whichever program it goes into.
Exporting as txt also converts the tabs into spaces, in addition to getting the column order wrong on any row of text with line breaks in it. (something like "1 / 2, 2, 2 / 3" comes out as "2 / 1 / 2 / 3 / 2")
If I copy and paste, I can not do a global replace to convert spaces to tabs (and then a series of them to convert double-tabs back into single tabs) because many of the columns have spaces in the data inside.
I've looked online and found a number of shareware tools claiming to convert .pdfs to .xls, but I don't know how successful or trustworthy they are, and I'm hesitant to install anything unknown/untrusted onto workspace computers.
Tech Services here at work could not figure out how it's done.
I'm not convinced there isn't a workable solution. Do any of you have experience with this problem? Is it possible without buying new software?
posted by johnofjack to computers & internet (44 comments total)
2 users marked this as a favorite
posted by flabdablet at 9:45 AM on April 10, 2008