<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

      <title>Comments on: How do I find a site tree?</title>
      <link>http://ask.metafilter.com/33638/How-do-I-find-a-site-tree/</link>
      <description>Comments on Ask MetaFilter post How do I find a site tree?</description>
	  	  <pubDate>Thu, 02 Mar 2006 13:13:06 -0800</pubDate>
      <lastBuildDate>Thu, 02 Mar 2006 13:13:06 -0800</lastBuildDate>
      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>

<item>
  	<title>Question: How do I find a site tree?</title>
  	<link>http://ask.metafilter.com/33638/How-do-I-find-a-site-tree</link>	
  	<description>Is there any way to see all of the HTM or HTML files under one domain, to see a site map or tree? &lt;br /&gt;&lt;br /&gt; For example say www.example.com has two pages: main.htm and links.htm, but doesnt provide any links.  How would I find out these pages exsist&lt;br&gt;
&lt;br&gt;
Sorry this is so vague, but its the best I could do.</description>
  	<guid isPermaLink="false">post:ask.metafilter.com,2008:site.33638</guid>
  	<pubDate>Thu, 02 Mar 2006 13:11:52 -0800</pubDate>
  	<dc:creator>Blandanomics</dc:creator>
	
	<category>sitemap</category>
	
	<category>sitetree</category>
	
</item>
<item>
  	<title>By: odinsdream</title>
  	<link>http://ask.metafilter.com/33638/How-do-I-find-a-site-tree#524163</link>	
  	<description>Not really, no.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.33638-524163</guid>
  	<pubDate>Thu, 02 Mar 2006 13:13:06 -0800</pubDate>
  	<dc:creator>odinsdream</dc:creator>
</item>
<item>
  	<title>By: Sallysings</title>
  	<link>http://ask.metafilter.com/33638/How-do-I-find-a-site-tree#524169</link>	
  	<description>YOU COULD use a program like &lt;a href=&quot;http://www.httrack.com/&quot;&gt;HTTRACK&lt;/a&gt; or other &amp;quot;off site&amp;quot; browsing program to spider a site.&lt;br&gt;
&lt;br&gt;
Just set it up to ignore any file that isn&apos;t a .htm/.html, and set it up as &amp;quot;mirroring&amp;quot; the site to keep the directory structure.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.33638-524169</guid>
  	<pubDate>Thu, 02 Mar 2006 13:19:17 -0800</pubDate>
  	<dc:creator>Sallysings</dc:creator>
</item>
<item>
  	<title>By: cillit bang</title>
  	<link>http://ask.metafilter.com/33638/How-do-I-find-a-site-tree#524174</link>	
  	<description>Enter &lt;i&gt;site:www.example.com&lt;/i&gt; into Google.&lt;br&gt;
&lt;br&gt;
Otherwise, there&apos;s no way to get a list of what files are on a server from a server.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.33638-524174</guid>
  	<pubDate>Thu, 02 Mar 2006 13:20:35 -0800</pubDate>
  	<dc:creator>cillit bang</dc:creator>
</item>
<item>
  	<title>By: camcgee</title>
  	<link>http://ask.metafilter.com/33638/How-do-I-find-a-site-tree#524177</link>	
  	<description>&lt;a href=&quot;http://siteexplorer.search.yahoo.com/&quot;&gt;Yahoo site explorer&lt;/a&gt;</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.33638-524177</guid>
  	<pubDate>Thu, 02 Mar 2006 13:24:39 -0800</pubDate>
  	<dc:creator>camcgee</dc:creator>
</item>
<item>
  	<title>By: caution live frogs</title>
  	<link>http://ask.metafilter.com/33638/How-do-I-find-a-site-tree#524221</link>	
  	<description>Also, if the pages exist but are not linked by anything (which is what it sounds like you are asking), well, short of hacking into the server and visually inspecting the directory structure you won&apos;t find these pages. Although if for example hidden.html existed and you typed in that url, it would be dutifully presented by the webserver, there are no spiders that will run through the infinite possibilites for page names just to see if it missed anything.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.33638-524221</guid>
  	<pubDate>Thu, 02 Mar 2006 14:07:02 -0800</pubDate>
  	<dc:creator>caution live frogs</dc:creator>
</item>
<item>
  	<title>By: delmoi</title>
  	<link>http://ask.metafilter.com/33638/How-do-I-find-a-site-tree#524292</link>	
  	<description>No, not at all.&lt;br&gt;
&lt;br&gt;
What you can see are:&lt;br&gt;
&lt;br&gt;
1) URIs that someone else (like google) knows about via spidering&lt;br&gt;
&lt;br&gt;
2) URIs that you spidered yourself&lt;br&gt;
&lt;br&gt;
3) URIs that you guess, and turn out to actually exist.&lt;br&gt;
&lt;br&gt;
Also, keep in mind some sites have practically infinite pages, for example, the URL &amp;quot;http://ask.metafilter.com/tags/monkey&amp;quot; is valid, even if no posts have been tagged with the word &amp;quot;monkey&amp;quot;.  You can make up any tag, and a page exists for it. &lt;br&gt;
&lt;br&gt;
Yeah, you can find out what tags have been used, but other sites have similar features that&apos;ll screw ya if you tried to generalize.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.33638-524292</guid>
  	<pubDate>Thu, 02 Mar 2006 15:27:20 -0800</pubDate>
  	<dc:creator>delmoi</dc:creator>
</item>

    </channel>
</rss>
