<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel>
	  <title>Ask MetaFilter questions tagged with webscraping</title>
      <link>http://ask.metafilter.com/tags/webscraping</link>
      <description>Questions tagged with 'webscraping' at Ask MetaFilter.</description>
	  <pubDate>Wed, 06 Aug 2008 20:47:07 -0800</pubDate>
	  <lastBuildDate>Wed, 06 Aug 2008 20:47:07 -0800</lastBuildDate>

      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>	  
	<item>
	<title>Reconstituting a wiki database from html?</title>
	<link>http://ask.metafilter.com/98546/Reconstituting%2Da%2Dwiki%2Ddatabase%2Dfrom%2Dhtml</link>	
	<description>I&apos;d like to reconstitute a years-defunct wiki I used to collaborate on.  I&apos;ve contacted the principals, &amp;amp; our searches for the database backup have come up empty so far.  Without having the original database, the simplest path appears to be taking the HTML &amp;amp; transforming it into, say, an SQL dump.  So - how do I do that?  Are there any MediaWiki, database, or Perl trails to follow?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.98546</guid>
	<pubDate>Wed, 06 Aug 2008 20:47:07 -0800</pubDate>
	<category>database</category>
	<category>mediawiki</category>
	<category>recovery</category>
	<category>sql</category>
	<category>webscraping</category>
	<category>wiki</category>
	<dc:creator>Pronoiac</dc:creator>
	</item>
	<item>
	<title>Create a simple database website (and gather the data)</title>
	<link>http://ask.metafilter.com/84064/Create%2Da%2Dsimple%2Ddatabase%2Dwebsite%2Dand%2Dgather%2Dthe%2Ddata</link>	
	<description>I&apos;m undertaking a personal web development and database building project, but I don&apos;t do web development or database building.  Please help me assess the best options to have someone else do the work for me.  I think the stuff I want is fairly easy, but please tell me what you think.
I want to do the following things (in priority order):&lt;br&gt;
&lt;br&gt;
1.  Have a simple form where users will input information and that will go into a database.  Input screens will be 2-3 pages total, and I want to easily play with the forms to test conversion, throughput, etc.  A+ answers will let me easily do split A/B testing on the front end.&lt;br&gt;
&lt;br&gt;
2.  Have a searchable database of contacts, all with the same set of data.  Basically contact information, a few links, and some commentary.&lt;br&gt;
&lt;br&gt;
3.  Have someone scrape a whole bunch of different websites to assemble the initial contact database.  (Perl scripting work here probably?)&lt;br&gt;
&lt;br&gt;
4.  Have a contact entry form that will insert data into the contact database.&lt;br&gt;
&lt;br&gt;
5.  All of this should be indexable / crawlable by Google, and of course have the ability to insert ads.&lt;br&gt;
&lt;br&gt;
6.  Have a non-ugly design.&lt;br&gt;
&lt;br&gt;
7.  (Optional, at outset) have user ability to comment and &quot;rate&quot; these contacts.&lt;br&gt;
&lt;br&gt;
8.  (Optional, at outset) create a mashup of contact information and Google Maps.&lt;br&gt;
&lt;br&gt;
How best to build this?  Hire a rent-a-coder, college student, etc? (If so, how do I go about finding the right person?) Use an off-the-shelf product and play with it myself?  How much should something like this cost?  (Would love to hire a Mefite - so bids accepted on Mefimail!)  What decisions should I make now that will give me something extensible?  What are the main technologies that I should look out for?  I&apos;m sure there are many other questions here that I&apos;ve not thought of -- so fire away!  Thanks in advance hive mind!</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.84064</guid>
	<pubDate>Tue, 19 Feb 2008 15:46:16 -0800</pubDate>
	<category>database</category>
	<category>outsourcing</category>
	<category>perl</category>
	<category>rentacoder</category>
	<category>technology</category>
	<category>webdesign</category>
	<category>webdevelopment</category>
	<category>webscraping</category>
	<dc:creator>mtstover</dc:creator>
	</item>
	
	</channel>
</rss>

