<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel>
	  <title>Ask MetaFilter questions tagged with rssfilter</title>
      <link>http://ask.metafilter.com/tags/rssfilter</link>
      <description>Questions tagged with 'rssfilter' at Ask MetaFilter.</description>
	  <pubDate>Sat, 05 Apr 2008 09:21:43 -0800</pubDate> <lastBuildDate>Sat, 05 Apr 2008 09:21:43 -0800</lastBuildDate>

      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>	  
	<item>
	<title>My website&apos;s scripts eat up too much bandwidth. How do I resolve this?</title>
	<link>http://ask.metafilter.com/87977/My%2Dwebsites%2Dscripts%2Deat%2Dup%2Dtoo%2Dmuch%2Dbandwidth%2DHow%2Ddo%2DI%2Dresolve%2Dthis</link>	
	<description>My site&apos;s cron tasks eat up a &lt;em&gt;lot&lt;/em&gt; of bandwidth. Can I change the paths from http to local? Or can I move the tasks to my home server? One of my websites (the election one) runs a truckload of PHP to parse, mesh, filter, and cache an equally large number of external RSS feeds which are then stored and included in a reasonably lean (~10k) index.html. This creates a strange situation where the bandwidth consumed by the PHP (locally, but via http) far exceeds the actual bandwidth consumed by visitors. My web hosting provider has warned me about the excessive activity, but given the limited number of visitors, upgrading to a hosting deal with a higher bandwidth limit isn&apos;t really worthwhile for me.&lt;br&gt;
&lt;br&gt;
How do I resolve this?&lt;br&gt;
&lt;br&gt;
Current situation: a bunch of &lt;a href=&quot;http://feedforall.com/download.htm&quot;&gt;FeedForAll&lt;/a&gt; PHP scripts are executed using a Ruby-as-CGI script (which &lt;a href=&quot;http://www.metafilter.com/username/wackybrit&quot;&gt;wackybrit&lt;/a&gt; kindly helped me with) called via cron at regular intervals, varying by nature of content: none-time critical content would be updated hourly, whereas latest news etc. is updated every 10 minutes (was 2 minutes, but I took it down a notch for now to avoid further problems with my host). My monthly bandwidth limit is 8 GB; actual visitor usage is a modest fraction of that, but the scripting alone was responsible for 37 GB (!) in March.&lt;br&gt;
&lt;br&gt;
Several options:&lt;br&gt;
&lt;br&gt;
1) The bandwidth is measured by the host using HTTP responses. If I run the scripts using local UNIX paths, e.g. /usr/gnfti/www/ etc., the scripts generate no HTTP headers and thus don&apos;t count towards the quota. Of course they will still incur server load, but I have discussed this with the hosting company and it seems they&apos;re okay with it if I limit the use somewhat as a compromise. But then, not all of these scripts seem to be okay with local paths, most notably the keyword filter (rssFilter.php), which is responsible for 25% of the bandwidth on its own. &lt;br&gt;
&lt;br&gt;
Any ideas how to make it work this way anyway?&lt;br&gt;
&lt;br&gt;
2) I have a home server running &lt;a href=&quot;http://www.wampserver.com/en/&quot;&gt;WAMP&lt;/a&gt; on XP. Could I run the PHP on this box instead and have the output automatically upload to the remote server via FTP? Most of the bandwidth is to do with moving stuff around locally anyway; the actual output is in the order of kilobytes. I imagine this would involve running some sort of cron + FTP client/scheduler on the home box, but my knowledge in this area is limited. &lt;br&gt;
&lt;br&gt;
Could this be done, and if so, how?&lt;br&gt;
&lt;br&gt;
3) Lastly, if you have any suggestions on how to pull this off aside from the examples above, that too would be much appreciated.&lt;br&gt;
&lt;br&gt;
Thanks in advance for any suggestions or insight you might have to offer, guys.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.87977</guid>
	<pubDate>Sat, 05 Apr 2008 09:21:43 -0800</pubDate>
	<category>bandwidth</category>
	<category>bandwidthlimit</category>
	<category>cron</category>
	<category>crontab</category>
	<category>feedforall</category>
	<category>ftp</category>
	<category>homeserver</category>
	<category>internet</category>
	<category>php</category>
	<category>rss</category>
	<category>rss2html</category>
	<category>rssfilter</category>
	<category>rssmesh</category>
	<category>script</category>
	<category>scripting</category>
	<category>scripts</category>
	<category>webdesign</category>
	<category>website</category>
	<dc:creator>goodnewsfortheinsane</dc:creator>
	</item>
	
	</channel>
</rss>

