<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

      <title>Comments on: Who hijacked my Google cache?</title>
      <link>http://ask.metafilter.com/63287/Who-hijacked-my-Google-cache/</link>
      <description>Comments on Ask MetaFilter post Who hijacked my Google cache?</description>
	  	  <pubDate>Wed, 23 May 2007 22:25:47 -0800</pubDate>
      <lastBuildDate>Wed, 23 May 2007 22:25:47 -0800</lastBuildDate>
      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>

<item>
  	<title>Question: Who hijacked my Google cache?</title>
  	<link>http://ask.metafilter.com/63287/Who-hijacked-my-Google-cache</link>	
  	<description>How was my Google cache hacked? I have a web page that is fine, but the Google cache of the same page is a &quot;spam&quot; page with keywords linking to other web sites, random text and images, but also some valid links on my  web site. How did that happen? &lt;br /&gt;&lt;br /&gt; This is a web page for a domain I have hosted by TotalChoice Hosting.&lt;br&gt;
&lt;br&gt;
I was checking out the stats for the domain and discovered some unexpected search terms (&quot;penis enlargment&quot;) led to the web page. The web page is 10 lines of simple hand-coded HTML (a title and two links and no mention of penises). Bringing up the web page and looking at its source indicates that the page has not been compromised.&lt;br&gt;
&lt;br&gt;
I did a Google search with some unusual keywords that my statistics indicated led to my web site and my web page was the only search result. But the search result includes text that has never been on my page. Clicking the regular link to my page brings it up as expected. Clicking the link to the Google cache brings up the spam page (cached in February).&lt;br&gt;
&lt;br&gt;
TotalChoice Hosting says they haven&apos;t been compromised recently (well, at least the support guy I chatted with said so).&lt;br&gt;
&lt;br&gt;
It&apos;s not a big deal. There&apos;s nothing important on the web site. But does anyone have a clue on what happened?</description>
  	<guid isPermaLink="false">post:ask.metafilter.com,2008:site.63287</guid>
  	<pubDate>Wed, 23 May 2007 21:48:28 -0800</pubDate>
  	<dc:creator>ShooBoo</dc:creator>
	
	<category>web</category>
	
	<category>hacking</category>
	
	<category>spam</category>
	
</item>
<item>
  	<title>By: panic</title>
  	<link>http://ask.metafilter.com/63287/Who-hijacked-my-Google-cache#952449</link>	
  	<description>Your hosting company could have configured their web server to serve the ad-filled page to the Google spider, although I&apos;m not sure why they would do that.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.63287-952449</guid>
  	<pubDate>Wed, 23 May 2007 22:25:47 -0800</pubDate>
  	<dc:creator>panic</dc:creator>
</item>
<item>
  	<title>By: nakedcodemonkey</title>
  	<link>http://ask.metafilter.com/63287/Who-hijacked-my-Google-cache#952455</link>	
  	<description>You say the cache is from Feb, and the hosting co hasn&apos;t been compromised &amp;quot;lately&amp;quot;.  The obvious question I&apos;d want answered is what was happening last winter, in the weeks before that cache was made.&lt;br&gt;
&lt;br&gt;
I wouldn&apos;t focus too much on Google showing your site as result for words that aren&apos;t on your page.  That&apos;s how PageRank is meant to work, and what makes things like googlebombing possible.  So there could be an explanation as simple as another site linked to yours using those terms.  If you want, use Advanced Search to search for sites linking to yours.  &lt;br&gt;
&lt;br&gt;
But if Google cache actually shows your site having mystery content, Occam&apos;s Razor does suggest that the site was compromised and defaced at least briefly back then.  A good hosting co catches that stuff fast &lt;small&gt;though a better company prevents it from happening in the first place, grumble grumble... &lt;br&gt;
&lt;/small&gt; and restores your site from a backup &lt;small&gt;a more transparent, responsible company would have also kept you informed of such things...&lt;/small&gt;.  What does archive.org show for the URL in question?  They stay 6 months behind, so you won&apos;t be able to see it&apos;s Feb condition yet but it&apos;d be interesting to see what it shows for Dec and beyond.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.63287-952455</guid>
  	<pubDate>Wed, 23 May 2007 22:36:23 -0800</pubDate>
  	<dc:creator>nakedcodemonkey</dc:creator>
</item>
<item>
  	<title>By: nakedcodemonkey</title>
  	<link>http://ask.metafilter.com/63287/Who-hijacked-my-Google-cache#952457</link>	
  	<description>&lt;i&gt;see &lt;del&gt;it&apos;s&lt;/del&gt; &lt;ins&gt;its&lt;/ins&gt; Feb condition&lt;/i&gt;&lt;br&gt;
&lt;br&gt;
grr.  argh.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.63287-952457</guid>
  	<pubDate>Wed, 23 May 2007 22:38:31 -0800</pubDate>
  	<dc:creator>nakedcodemonkey</dc:creator>
</item>
<item>
  	<title>By: toomuchpete</title>
  	<link>http://ask.metafilter.com/63287/Who-hijacked-my-Google-cache#952466</link>	
  	<description>If you know how to do this... try spoofing your UserAgent to look like googlebot and see if you&apos;re getting the spammy page. There are exploits out there for some CMSs where, if the CMS is improperly confugred, bad guys can muck with your files and add spam cloaking like this. See, e.g., &lt;a href=&quot;http://wordpress.org/support/topic/93990&quot;&gt;this thread&lt;/a&gt; from the wordpress codex.&lt;br&gt;
&lt;br&gt;
I would guess that this &lt;i&gt;isn&apos;t&lt;/i&gt; a circumstance of a site getting defaced at exactly the right time. This seems like it&apos;d be a pretty straight-forward problem to solve, but we&apos;d need a little more info.</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.63287-952466</guid>
  	<pubDate>Wed, 23 May 2007 22:46:07 -0800</pubDate>
  	<dc:creator>toomuchpete</dc:creator>
</item>
<item>
  	<title>By: malevolent</title>
  	<link>http://ask.metafilter.com/63287/Who-hijacked-my-Google-cache#952518</link>	
  	<description>Sounds like something was compromised at some point (either the sites themselves or the DNS). It&apos;s worth pretending to be a search engine spider though in case there&apos;s still something dodgy going on (e.g. some kind of proxy on the server modifying the content).&lt;br&gt;
&lt;br&gt;
Have you checked the &lt;a href=&quot;http://www.archive.org/web/web.php&quot;&gt;Wayback Machine&lt;/a&gt; in case it caught a change?</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.63287-952518</guid>
  	<pubDate>Wed, 23 May 2007 23:41:07 -0800</pubDate>
  	<dc:creator>malevolent</dc:creator>
</item>
<item>
  	<title>By: nakedcodemonkey</title>
  	<link>http://ask.metafilter.com/63287/Who-hijacked-my-Google-cache#952525</link>	
  	<description>&lt;a href=&quot;http://ask.metafilter.com/63202/KrisKross-Site-Scripting#951081&quot;&gt;another possibility&lt;/a&gt;</description>
  	<guid isPermaLink="false">comment:ask.metafilter.com,2008:site.63287-952525</guid>
  	<pubDate>Wed, 23 May 2007 23:53:58 -0800</pubDate>
  	<dc:creator>nakedcodemonkey</dc:creator>
</item>

    </channel>
</rss>
