Tools for a scientific publisher to provide usage statistics for its subscribers? Is this a good idea?
I am the webmaster for a small scientific publisher. Our content is presented online in a way that is similar to most other publishers. The abstracts are free to all, but the articles are restricted to subscribers by the use of IP addresses in htaccess files. Most of our subscribers are university libraries and research institutes.
Lately we have been getting many requests for usage statistics from our subscribers. I have a way of doing this using summary.net, a web log analysis program, but it is time consuming and not an elegant or simple solution requiring me to change the software's settings for each subscriber, render the log files, output an excel file, reformat excel file and then change the software's settings back for my needs. If we decided to offer this service, using this method would take up too much of my time.
Is there some sort of software that I can host on our server that the user can use to show them download statistics for certain files in certain directories based on a list of IP addresses or ranges that they enter?
I thought that a program like awstats might be able to get me in this direction but I don't have root access to our managed server in order to install it properly.
I have also looked at the
Project COUNTER as a possible solution for providing statistics, but it seems to be needlessly complicated for what we are trying to accomplish.
Also, is there a way for these usage statisitcs to be measured on the user's end? Possibly with a software solution on their router/proxy server/gateway that measures outgoing requests? What are some keywords or concepts I should search for so that if I receive a request for usage statistics, I can say, "We don't provide usage statistics for practical and logistical reasons but you can roll your own by......."
Aside from
how to do it, is providing this service even a good idea? I have heard from a few publishers, that they don't provide this service because it provides justification to those making spending decisions to cancel the subscription. This reasoning is described in better detail
here at the American Mathematical Society's website (.pdf). I have also heard from librarians that they hate to cancel subscriptions because they hate,
HATE, having gaps in serial publications, not to mention facing the wrath from the 1 or 2 people that do rely on that publication for their work/research.
I would be interested to here about experiences and solutions from the publisher side, as well as the librarian side of things.
As for whether it's a good idea, you won't know until you can compare your stats with other journals that release theirs. If it doesn't look good it may not be a good idea, but I would suggest collecting the statistics for your own benefit anyways.
posted by shmooly at 6:50 AM on September 5, 2008