<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

	<title>Comments on: Making a stand-alone rsync appliance.</title>
	<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance/</link>
	<description>Comments on Ask MetaFilter post Making a stand-alone rsync appliance.</description>
	<pubDate>Tue, 11 Oct 2005 12:09:00 -0800</pubDate>
	<lastBuildDate>Tue, 11 Oct 2005 12:09:00 -0800</lastBuildDate>
	<language>en-us</language>
	<docs>http://blogs.law.harvard.edu/tech/rss</docs>
	<ttl>60</ttl>

	<item>
		<title>Question: Making a stand-alone rsync appliance.</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance</link>	
		<description>Is there a distribution of a linux-like OS which turns hardware into an easy-to-configure rsync appliance? &lt;br /&gt;&lt;br /&gt; I have a client (web dev agency) whose president wants &apos;on site backups of all of the websites we host&apos;. They had issues with less-than-clueful providers, I guess.  They&apos;re not interested in switching their hosting provider.&lt;br&gt;
&lt;br&gt;
She essentially wants to be able to point to a box in her office and say &apos;our backups of all of the sites we&apos;re responsible for live &lt;b&gt;here&lt;/b&gt;&apos;. I don&apos;t work at this office, so I&apos;d like to be able to control this &apos;remote backup appliance&apos; via a web-admin or VNC or some such. If I have no other choice but to be on-site, that&apos;s okay, but not optimal.&lt;br&gt;
&lt;br&gt;
I have a Shuttle-size PC available to me for this project (p3, 256MB, 80GB mirrored HDDs.) My original plan was to install a flavor of linux, then make some shell scripts to do simple wget requests on a regular (perhaps daily) basis. Some research revealed that the aggregate file size to transfer was ~1.5 TB range. This office has only business-class cable modem service (5Mb down).&lt;br&gt;
&lt;br&gt;
Now I&apos;m thinking about implementing rsync, which I need to learn more about. The client claims to have root on their server at the ISP, so if I need to install sw to support rsync, that shouldn&apos;t be a problem.&lt;br&gt;
&lt;br&gt;
My previous experience is with Mac OS X Server, where turning on services is as easy as clicking a button. I have enough unix experience to get around a shell, but not enough to know where all of the config files, etc. live for various services.&lt;br&gt;
&lt;br&gt;
I need to be able to ssh into this box and set up cron scripts to fire rsync events, but if there were a web interface for this kind of thing, all the better.&lt;br&gt;
&lt;br&gt;
Basically, I don&apos;t want to go through the effort of manually writing the scripts if there&apos;s a smarter way to achieve my goal.</description>
		<guid isPermaLink="false">post:ask.metafilter.com,2005:site.25335</guid>
		<pubDate>Tue, 11 Oct 2005 11:54:56 -0800</pubDate>
		<dc:creator>Wild_Eep</dc:creator>
		
			<category>rsync</category>
		
			<category>backups</category>
		
			<category>linux</category>
		
			<category>remoteaccess</category>
		
	</item> <item>
		<title>By: RustyBrooks</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400536</link>	
		<description>You won&apos;t actually need much in the way of scripts; rsync probably does everything you need already.  You don&apos;t need to set it up as a service; it just needs to be installed on both the client and the server.  I&apos;ve used rsync on Windows under cygwin.  There may be a native port also.  It almost certainly exists for OS X as well.&lt;br&gt;
&lt;br&gt;
Anyway, I back up all my clients&apos; websites this way.  I did write a script, but only because certain parts need to be backed up more often than others: it runs daily, checks when each one was last updated, and runs rsync only for the ones that have gone past their expiration window.&lt;br&gt;
&lt;br&gt;
The basic syntax you want is something like&lt;br&gt;
rsync -av remotehost:/path/to/remote/directory /path/to/local/directory&lt;br&gt;
that&apos;s it.  Put that in cron.  &lt;br&gt;
You might want to tweak it a little bit, like maybe add --delete to the options so that if a file is removed from the server it&apos;ll be removed from the client also.  I generally leave --delete off so that I can recover from accidental deletions.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400536</guid>
		<pubDate>Tue, 11 Oct 2005 12:09:00 -0800</pubDate>
		<dc:creator>RustyBrooks</dc:creator>
	</item><item>
		<title>By: RustyBrooks</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400537</link>	
		<description>Also, expect the initial sync to be quite time-consuming.  After that, it should be quick as long as there aren&apos;t a lot of changes to the customer&apos;s sites.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400537</guid>
		<pubDate>Tue, 11 Oct 2005 12:11:02 -0800</pubDate>
		<dc:creator>RustyBrooks</dc:creator>
	</item><item>
		<title>By: RustyBrooks</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400539</link>	
		<description>I&apos;m a little confused about one thing:&lt;br&gt;
&lt;br&gt;
&lt;i&gt;Some research revealed that the aggregate file size to transfer was ~1.5 TB range. This office has only business-class cable modem service (5Mb down).&lt;/i&gt;&lt;br&gt;
&lt;br&gt;
Huh?  Do you mean KB?  MB?</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400539</guid>
		<pubDate>Tue, 11 Oct 2005 12:12:57 -0800</pubDate>
		<dc:creator>RustyBrooks</dc:creator>
	</item><item>
		<title>By: RustyBrooks</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400550</link>	
		<description>Since you mentioned VNC, it works on linux also.  So you could use VNC to connect to the server.  Personally for what you&apos;re talking about, ssh would probably be easier.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400550</guid>
		<pubDate>Tue, 11 Oct 2005 12:19:37 -0800</pubDate>
		<dc:creator>RustyBrooks</dc:creator>
	</item><item>
		<title>By: RustyBrooks</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400551</link>	
		<description>Also, I am a POSTING MACHINE.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400551</guid>
		<pubDate>Tue, 11 Oct 2005 12:19:47 -0800</pubDate>
		<dc:creator>RustyBrooks</dc:creator>
	</item><item>
		<title>By: Wild_Eep</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400556</link>	
		<description>I meant TB as in Terabyte and Mb as in Megabit.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400556</guid>
		<pubDate>Tue, 11 Oct 2005 12:26:07 -0800</pubDate>
		<dc:creator>Wild_Eep</dc:creator>
	</item><item>
		<title>By: RustyBrooks</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400560</link>	
		<description>OK.  You&apos;re going to be storing a terabyte on an 80 gigabyte hard drive then?</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400560</guid>
		<pubDate>Tue, 11 Oct 2005 12:28:55 -0800</pubDate>
		<dc:creator>RustyBrooks</dc:creator>
	</item><item>
		<title>By: Wild_Eep</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400567</link>	
		<description>Whoops, I should have written GB, not TB.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400567</guid>
		<pubDate>Tue, 11 Oct 2005 12:42:19 -0800</pubDate>
		<dc:creator>Wild_Eep</dc:creator>
	</item><item>
		<title>By: RustyBrooks</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400602</link>	
		<description>OK, that&apos;s more like it.  &lt;br&gt;
&lt;br&gt;
Does what I wrote above make sense?  I don&apos;t think you need anything super-fancy here, or a special version of linux, or anything like that.  Pretty much any OS would do, windows, osx, or linux.  Connection to the machine via ssh or vnc would be fairly straightforward for all of these (except ssh under windows is not all that obvious).&lt;br&gt;
&lt;br&gt;
You&apos;ll probably want to experiment with the command line params for rsync a little bit to get exactly what you want, but I think we&apos;re talking about 10-20 minutes of labor there, tops.  In comparison, finding and installing some kind of web-thang that does exactly what you want is probably kind of a waste of time.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400602</guid>
		<pubDate>Tue, 11 Oct 2005 13:13:20 -0800</pubDate>
		<dc:creator>RustyBrooks</dc:creator>
	</item><item>
		<title>By: mendel</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400611</link>	
		<description>Ex-coworker of mine wrote &lt;a href=&quot;http://www.gormand.com.au/peters/tools/index.html&quot;&gt;rs&lt;/a&gt; as an rsync wrapper to do what you describe. It&apos;s designed to be supervised under daemontools but runs fine from cron in one-shot mode, too.&lt;br&gt;
&lt;br&gt;
Big advantages of that over just running rsync from cron are TTL management (so transfers don&apos;t overlap), configuration files instead of a command-line, better scheduling than cron, and clear success/fail summary notification messages.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400611</guid>
		<pubDate>Tue, 11 Oct 2005 13:24:35 -0800</pubDate>
		<dc:creator>mendel</dc:creator>
	</item><item>
		<title>By: mrbill</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400617</link>	
		<description>I don&apos;t know of any &quot;easy install&quot; distributions, but I use Fedora Core 4, the latest rsync, and &lt;a href=&quot;http://www.rsnapshot.org&quot;&gt;rsnapshot&lt;/a&gt; to have NetApp-style &quot;snapshots in time&quot; backups of my colocated machines on a box with large hard drives at home.&lt;br&gt;
&lt;br&gt;
Once the initial (huge) transfer is done, the only things that get transferred nightly are the files that change or are added/deleted.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400617</guid>
		<pubDate>Tue, 11 Oct 2005 13:35:35 -0800</pubDate>
		<dc:creator>mrbill</dc:creator>
	</item><item>
		<title>By: ldenneau</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#400966</link>	
		<description>If your web sites connect to database backends, neither rsync nor any other file-copying program can guarantee that the database files will be copied in a consistent state, so a backup may contain corrupt or missing data. The database files may be in the middle of a write operation at the time of copy, for example. Only the database knows for sure. I would recommend doing a periodic archive/dump/hot backup of your database using a tool supplied with your database and including the archive files in your nightly rsync backup. Then if the copied database files are corrupt you can restore from your archive.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-400966</guid>
		<pubDate>Tue, 11 Oct 2005 19:45:28 -0800</pubDate>
		<dc:creator>ldenneau</dc:creator>
	</item><item>
		<title>By: nicwolff</title>
		<link>http://ask.metafilter.com/25335/Making-a-standalone-rsync-appliance#401121</link>	
		<description>rsnapshot has an option to call out to database-dump scripts. I use it to save nightly snapshots from four servers in New York to a backup server at Easyspeedy in Denmark - it&apos;s cheap peace-of-mind and easy to set up. E-mail me questions if you want.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2005:site.25335-401121</guid>
		<pubDate>Wed, 12 Oct 2005 00:04:48 -0800</pubDate>
		<dc:creator>nicwolff</dc:creator>
	</item>
	</channel>
</rss>
