Before building my own... does this software exist? I need to search a collection of PDF & Word files for key phrases, and dump the surrounding text (from m characters or lines before the found phrase to n after it) into text files.
I occasionally need to search a few dozen files for a few dozen data items, which usually have some identifying text nearby. This needs to be automated.
Big bonus if it implements OCR, but that's not essential. Freeware or cheapware is obviously best. Windows-based is preferable, but I can do Linux.
posted by IAmBroom
on Feb 12, 2014 -
In my line of work (doctoring), I use a lot of PDF files: handouts for patients, copies of guidelines, referral forms for particular clinics, etc. I'd love to be able to organize them hierarchically but also search them by keyword or tag, with results returned quickly and dynamically and the ability to easily print the PDFs that come up. Are there any programs (for Windows) or cloud-based sites that might make this easy?
Ideally it would be great to have a little search bar that could just sit on my desktop, but that would just be icing on the cake.
Oh, and I've tried Mendeley - it works somewhat well for this, but it's a hassle to print files unless you open them up in Acrobat Reader or another external program.
posted by greatgefilte
on Nov 28, 2013 -
I have an interesting file syncing problem. As part of my responsibilities, I have to keep track of changes in the US export regulations. The Dept. of Commerce makes the regulations available in two formats, HTML and PDF. The HTML versions are oddly formatted so I prefer the PDF versions even if they are harder to search. They're available in PDF on this page. I have been regularly going out to the page and downloading relevant documents that have changed but I'd like to automate this if I can. But there are some gotchas. [more inside]
posted by tommasz
on Aug 14, 2013 -
I'd like to make certain documents at work (mostly PDFs) openable only by logged-in domain members, based on group membership. Watermarking with account credentials would be a plus. [more inside]
posted by benzenedream
on Apr 17, 2012 -
How can I organize PDFs on Windows so that they're easy to find offline using a standard browser? In a perfect world, it would be like Delicious, but for PDFs. Tags, sortable by tags, compact display. EndNote and WordPress are the available tools. [more inside]
posted by furious
on Dec 20, 2009 -
What's the best free or open source software (for Windows XP) which will scan multiple pages in a row and output a PDF with zero fuss? [more inside]
posted by cowbellemoo
on Nov 3, 2009 -
In light of the increasing number of PDF security exploits, what is the most secure PDF viewer for Windows? [more inside]
posted by benzenedream
on Apr 1, 2009 -
Mac users who write LaTeX documents in Emacs (with or without AUCTeX), please tell me about your workflow. [more inside]
posted by gleuschk
on Oct 4, 2008 -
How can I automate PDF creation with my automatic document feeder, ScanSoft PaperPort and OmniPage? I need to have PDFs that look like the original document and are searchable (not image only). [more inside]
posted by ideaguy
on Feb 1, 2007 -
What's the easiest way to implement a .PDF file that can be edited by multiple users (Mac and Windows)? [more inside]
posted by sharksandwich
on Aug 24, 2004 -