Is it possible to track where a PDF file goes once in the wild?
May 7, 2010 12:59 AM   Subscribe

How easy or possible is it to track when/where a PDF file is transmitted or opened once it goes out into the wild?

Specifically, I'm looking for DRM software that:

- integrates seamlessly into a PDF (i.e. the user doesn't need to download any additional software apart from a bog standard PDF reader)
- doesn't overtly flag to the reader that there is DRM built into the PDF file, or that usage is being tracked
- still lets the PDF be opened by whoever, whenever without a password or code; i.e. places no restrictions on use

Does such a beast exist? And if so, how does it work?
posted by MuffinMan to Computers & Internet (11 answers total)
 
What is your goal? Track or prevent piracy?

If you want to prevent it:
have password = credit card number
don't allow password to be changed

Of course they could print and scan, but atleast you made it harder for them.
posted by chinabound at 1:08 AM on May 7, 2010


Response by poster: My goal is just to find out if it exists at the moment - I've heard companies claim they use such a system. Yes, its use would be to prevent or track piracy. I know about DRM approaches that use a password.

I'm specifically looking for something that fits the criteria above - or alternatively for an explanation of why it couldn't exist.
posted by MuffinMan at 1:17 AM on May 7, 2010


Best answer: The only way to track is to somehow access the internet. Basic PDFs can't do that.

PDF could include JavaScript that could access the net. You could use a webbug to track usage (Eg, http://www.security-labs.org/fred/docs/pacsec08/samples/03-infoleak/webbug-js.pdf ) which would access a per-user URL (eg, http://yoursite/usage.php?id=user-identifier)

So technically it's possible but a minority of PDF readers don't support JavaScript (basically only Adobe Acrobat supports JavaScript but others like Evince don't). You can't really prevent distribution, and the most you can hope is to track PDFs some of the time.

Alternatively you could not offer a PDF for download and just show rasterised bitmaps at various resolutions for people to view when they have an internet connection.
posted by holloway at 1:25 AM on May 7, 2010


Response by poster: Thank you - really helpful. In your webbug example, does the user always see a message telling you that something is trying to load or access the internet or would the user experience be identical to opening up a normal PDF?
posted by MuffinMan at 1:55 AM on May 7, 2010


PDFs can't access the internet unless the user explicitly gives permission. (At least with Adobe Acrobat reader)

The best way I've seen is to provide a web page with a flash or JS reader render the PDF inline so you have full control and the user doesn't have access to the PDF file but can still read it.
posted by wongcorgi at 1:55 AM on May 7, 2010


would the user experience be identical to opening up a normal PDF?

I don't know how PDF readers handle such requests (i.e. if they notify the user) --- decent ones should, as such a system is explicitly built to compromise the users' privacy.

However, such a request could still trip a (properly configured) firewall. The originator of the PDF might have to field questions from users, asking why their PDF is sneakily trying to connect to the internet and what information is being transmitted.

Furthermore, such a system would be trivial to reverse engineer and neuter, by redirecting the phone-home URL so the outgoing request does not reach its destination.
posted by Dr Dracator at 2:43 AM on May 7, 2010


Best answer: PDFs can't access the internet unless the user explicitly gives permission. (At least with Adobe Acrobat reader)
This is correct. Essentially, the way to do this is to add an open action to the document that includes a URI action that pings a mothership, but URI actions are supposed to post a dialog requesting permission.
posted by plinth at 3:09 AM on May 7, 2010


www.pdfmarker.com will automatically stamp the PayPal address of a customer into the .pdf itself; this practice should, in theory, reduce the number of customers willing to blithely give away your work.
posted by darth_tedious at 10:12 AM on May 7, 2010


And here's another company that does the same thing-- but with PayPal and Clickbank customers:

http://www.digitalproductdelivery.com/support/knowledgebase/vendor/general-product-questions/pdf-buyer-tracking.html
posted by darth_tedious at 10:22 AM on May 7, 2010


If a PDF somehow had some sort of auto-executing phone-home code embedded in it, those distributing it illegally would simply strip it out first.

chinabound: "What is your goal? Track or prevent piracy?

If you want to prevent it:
have password = credit card number
don't allow password to be changed
"

A PDF password isn't nearly secure enough to hold CC information.
posted by turkeyphant at 10:33 AM on May 7, 2010


I think this is seen as a feature, not a bug with this scheme: The user will take extra care not to leak the PDF around, to protect the CC information. Somewhat perverse, but I guess it would work --- until one of the myriad gotchas comes up, and the PDF creator is sued for CC fraud damages.
posted by Dr Dracator at 2:08 AM on May 8, 2010


« Older Mystery molecule!   |   feeling guilty about a car purchase Newer »
This thread is closed to new comments.