<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel>
	  <title>Ask MetaFilter questions tagged with automation and pdf</title>
      <link>http://ask.metafilter.com/tags/automation+pdf</link>
      <description>Questions tagged with 'automation' and 'pdf' at Ask MetaFilter.</description>
	  <pubDate>Mon, 17 Dec 2012 13:55:40 -0800</pubDate> <lastBuildDate>Mon, 17 Dec 2012 13:55:40 -0800</lastBuildDate>

      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>	  
	<item>
	<title>Automated PDF modification</title>
	<link>http://ask.metafilter.com/231327/Automated%2DPDF%2Dmodification</link>	
	<description>Is there an automated way of placing elements from one PDF file into another? Open to coding this via Python if a relevant module exists. I have two pdf&apos;s, Doc 1 and Doc 2. All pages in both documents are US Letter sized. Doc 1 contains large tables of values (generated in Excel with PDFCreator), one per page; this is the only content on the pages. Doc 2 contains pages with my company&apos;s border, and header info (title, page # etc.). Doc 1 tables go into Doc 2 bordered pages. I would like an automated way of taking the tables of Doc 1, and placing each into a separate page in Doc 2, without overlapping elements.&lt;br&gt;
&lt;br&gt;
I&apos;m decently proficient in Python and think I could code something if some pdf/vector graphics handling modules exist. I&apos;ve done nontrivial programming with the xlrd3 module for working with our Excel files. How I think the code for this might work:&lt;br&gt;
&lt;br&gt;
Open Doc 1, Doc 2 as vector images&lt;br&gt;
For each page in Doc 1, Doc 2:&lt;br&gt;
 - Get content bounds in Doc 1&lt;br&gt;
 - Scale content in Doc 1 to fit in Doc 2 borders&lt;br&gt;
 - Insert in Doc 2 at [coordinates]&lt;br&gt;
&lt;br&gt;
What Python modules would I need to do this? Or any other approaches would be welcome.&lt;br&gt;
&lt;br&gt;
Notes &lt;br&gt;
- This task recurs in my work every few weeks, and the pages can total over a hundred. I handle this currently by printing out Doc 2 (page borders), then refeeding the pages into the printer and printing Doc 1. This is inconvenient because the printer often fails to grab the pages, and because I have to run back to my computer to issue the next job (it&apos;s not actually 2 files each time, more like 14 separate pairs of such files, that must be printed separately), and occasionally I can get tripped up by coworkers printing over my pages.&lt;br&gt;
&lt;br&gt;
- I often paste Excel tables directly into Word docs. This fails here because the formatting gets mangled in Word from having lots of merged cells and landscape oriented tables in portrait orientation pages. I can sort of transpose the tables in Excel, and set the text orientation in Word to vertical, but each table requires a ton of cleanup, and there are many of them.&lt;br&gt;
&lt;br&gt;
- I can manually combine the docs by opening the PDFs in Inkscape (or other vector illustrating program), but again, 100+ tables. Inkscape opens up one page at a time.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2012:site.231327</guid>
	<pubDate>Mon, 17 Dec 2012 13:55:40 -0800</pubDate>
	<category>automation</category>
	<category>excel</category>
	<category>help</category>
	<category>office</category>
	<category>PDF</category>
	<category>programming</category>
	<category>python</category>
	<category>work</category>
	<dc:creator>mnemonic</dc:creator>
	</item>
	<item>
	<title>Emacs wizardry for the non-programmer.  </title>
	<link>http://ask.metafilter.com/110361/Emacs%2Dwizardry%2Dfor%2Dthe%2Dnonprogrammer</link>	
	<description>How can I automate emacs to open a file, execute a bunch of macros, then save the file in a new format? I need to extract bookmarks from large PDFs into text files importable into MS Access.  I have been using emacs to open the file, then used 5-6 macros to extract the bookmarks and format it properly to be imported into Access.  This gets extremely tedious, I have about 100 files to convert. &lt;br&gt;
&lt;br&gt;
Is there a more elegant or convenient way to do this?  I would love to be able to automate it in emacs.  I am not a programmer, and although I have tried to figure out how to do this with elisp, it just doesn&apos;t make sense to me.   I&apos;m running emacs 22.2.1 on Windows and Acrobat 6 and 7 standard.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.110361</guid>
	<pubDate>Tue, 30 Dec 2008 09:37:05 -0800</pubDate>
	<category>automation</category>
	<category>bookmark</category>
	<category>emacs</category>
	<category>pdf</category>
	<category>resolved</category>
	<dc:creator>Barry B. Palindromer</dc:creator>
	</item>
	<item>
	<title>Configured, Boiled Down Numbers Please</title>
	<link>http://ask.metafilter.com/91453/Configured%2DBoiled%2DDown%2DNumbers%2DPlease</link>	
	<description>A good, modern, local website traffic stats solution? &lt;i&gt;I used to know this stuff, but it&apos;s been many years and technology changes, yadda yadda, and this has some special odd requirements... so I prefer some hive mind opinions...&lt;/i&gt;&lt;br&gt;
&lt;br&gt;
I have a colleague looking for a good solution to traffic stats reporting. He is responsible for dozens of websites, with new ones being added/removed every couple of weeks -- promotions and projects from various departments. He has access to combined/extended Apache logs (regular + referrer) already, but since he controls the websites, he is also open to a javascript-triggered thingie on each page, and a custom server that logs things itself. Either is fine as long as it churns out the results.&lt;br&gt;
&lt;br&gt;
For results, he&apos;d need an assortment of the usual hits/views/referrers stuff (visitors from Chile, most popular pages this week, hits on this certain link), but boiled down nicely like so:&lt;br&gt;
&lt;br&gt;
1) takes raw logs and turns them into something human-readable.&lt;br&gt;
2) scheduled (weekly or monthly) reports automatically saved as fixed files (auditable, this not recreated each time it&apos;s run).&lt;br&gt;
3) reports showing just the info he wants* e-mailed to him as attachments (PDF, ideally, but text files would work) so he can present to management.&lt;br&gt;
4) the ability to request custom reports (the period from Sept 1 to Sept 14, users from China only) on demand, presumably from some kind of web interface -- an on-the-fly config run one-off.&lt;br&gt;
5) the ability to combine or separate data from diff sites/logs by configuration. (that is: these two sites are the same, but these four are all to be tracked separately)&lt;br&gt;
&lt;br&gt;
&lt;small&gt;* I don&apos;t have a list, but he says it will be one or two pages and NOT change over time. A template of certain reports?&lt;/small&gt;&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
He is considering hiring a human being part-time to do this with some offline program as a secretarial task every week... but that struck me as crazy since it&apos;s automatable, right? Right? Hm.&lt;br&gt;
&lt;br&gt;
It can&apos;t be Google Analytics, sadly for me (I know how to use that) and for privacy/security reasons he&apos;d prefer something running on his own Un*x server rather than a web service.... whether it&apos;s the same server as the websites or another doesn&apos;t matter, as long as he can control access and keep the data private.&lt;br&gt;
&lt;br&gt;
In the old days, I would say this all called for urchin or awstats, I think... plus some magic I don&apos;t know about to turn the results into pretty reports and mail them from a big old cron job.&lt;br&gt;
&lt;br&gt;
From my perspective (helping find/source/make/install this), I&apos;d be fine with something very complicated that I can tweak the configuration of to produce just what he&apos;s asking for -- and no more. &lt;br&gt;
&lt;br&gt;
Or if something does all the processing, will run reports on a schedule and produce some kind of manageable (parsable?) output like plain flat tables, I can probably manage turning that into pretty documents and mailing them out with additional scripty code that I can stumble through.&lt;br&gt;
&lt;br&gt;
Open source and hackable preferred, because of the configurability and tweakiness required... or guidance to existing things that can be cobbled together. If someone has built/integrated one of these before, or something similar, please mefi-mail me and maybe it&apos;s easy work, too. I may have overcommitted myself in trying to help.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.91453</guid>
	<pubDate>Wed, 14 May 2008 21:36:20 -0800</pubDate>
	<category>automation</category>
	<category>dev</category>
	<category>pdf</category>
	<category>reporting</category>
	<category>software</category>
	<category>stats</category>
	<category>traffic</category>
	<category>web</category>
	<dc:creator>rokusan</dc:creator>
	</item>
	<item>
	<title>Automatically PDF a website?</title>
	<link>http://ask.metafilter.com/35319/Automatically%2DPDF%2Da%2Dwebsite</link>	
	<description>How can I automate printing a complete website to PDF?
&lt;br /&gt;&lt;br /&gt;
I need to be able to show what the site looked like on any given date.  The site is a sales tool and therefore requires user input on every page, so simply spidering the site is not adequate. 
&lt;br /&gt;&lt;br /&gt;
Because the site requires user input all over the place, I&apos;d somehow need to supply the tool with data so it could be fully automated.
&lt;br /&gt;&lt;br /&gt;
Are there tools out there that could do this?
&lt;br /&gt;&lt;br /&gt;
Note: It doesn&apos;t necessarily have to be output to PDF, just something manageable and static.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2006:site.35319</guid>
	<pubDate>Wed, 29 Mar 2006 11:31:48 -0800</pubDate>
	<category>automation</category>
	<category>pdf</category>
	<category>script</category>
	<category>scripting</category>
	<category>web</category>
	<dc:creator>gfroese</dc:creator>
	</item>
	<item>
	<title>Can I automate the creation of PDFs from a large number of Word documents?</title>
	<link>http://ask.metafilter.com/10267/Can%2DI%2Dautomate%2Dthe%2Dcreation%2Dof%2DPDFs%2Dfrom%2Da%2Dlarge%2Dnumber%2Dof%2DWord%2Ddocuments</link>	
	<description>Acrobat Filter.  In Mac OS 9.2, I regularly create a large number of PDFs from Word files using Distiller 5.0.  The problem is, I do this for 30 large files every 10 days, and must &lt;b&gt;manually&lt;/b&gt; first create .ps files, and then distill them.  The process takes more than 1 hour, during which I must open Each Word file to .ps &quot;print&quot; it.  Short of buying a new copy of Acrobat for $300 and/or an OS X computer, is there any way to automate this process?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2004:site.10267</guid>
	<pubDate>Fri, 17 Sep 2004 10:35:23 -0800</pubDate>
	<category>acrobat</category>
	<category>adobe</category>
	<category>adobeacrobat</category>
	<category>automation</category>
	<category>distiller</category>
	<category>mac</category>
	<category>os9</category>
	<category>pdf</category>
	<dc:creator>ParisParamus</dc:creator>
	</item>
	
	</channel>
</rss>

