I need to download Google!
June 14, 2006 8:04 PM
How do I download Google's entire cache of a website that has 137,000 hits?
OK, so em411.com got redesigned, and em/admin (the site administrator) dumped the database. There were some damn good conversations on that site over the last 6 years, and there is no way I am going to remember what each and every one of them was about, so I need to find a way to get the entire cache of this website. Hope me please.
If I do a site:em411.com search I get 137,000 hits, but only 1,000 of them are accessible via Google.
I'm thinking I could get this done with some combination of wget following only the cache links (how do I do that?) and varying the keyword in the search (not sure which ones are best to pick).
Oh, and the Internet Archive has basically nothing of this site.
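For what it's worth, the "vary the keyword, follow only the cache links" idea could be sketched roughly like this in Python. It only builds the search URLs and filters links out of a results page you have already saved; it does not fetch anything. The function names are made up for illustration, and the `q=cache:` pattern used to recognize cache links is an assumption about how the results page marks them up, so check it against a real page first.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode

SEARCH_BASE = "http://www.google.com/search"  # same endpoint used in this thread


def build_search_url(site, keyword="", start=0):
    """Build a site-restricted search URL; `start` pages through the results."""
    query = f"site:{site} {keyword}".strip()
    return SEARCH_BASE + "?" + urlencode({"hl": "en", "q": query, "start": start})


class CacheLinkParser(HTMLParser):
    """Collect hrefs of <a> tags that look like cache links."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        # ASSUMPTION: cache links carry "q=cache:" in their href.
        if "q=cache:" in href:
            self.links.append(href)


def extract_cache_links(html):
    """Return only the cache-style links found in a saved results page."""
    parser = CacheLinkParser()
    parser.feed(html)
    return parser.links
```

Since each query tops out at 1,000 results, the idea would be to call `build_search_url("em411.com", word, start)` for a list of different keywords and page offsets, then merge and de-duplicate whatever `extract_cache_links` returns for each page.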
posted by bigmusic to computers & internet (11 comments total)
Can't you do:
wget -r "http://www.google.com/search?hl=en&q=site%3Aem411.com&btnG=Google+Search"
posted by xmutex at 8:35 PM on June 14, 2006
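One practical note on that command: `&` is special to most shells, so the URL has to be quoted to reach wget in one piece. If you end up generating these commands from a script, a minimal sketch using Python's standard `shlex.quote` looks like this (the helper name `wget_command` is made up for illustration):

```python
import shlex


def wget_command(url):
    """Return a shell-safe 'wget -r' command line for the given URL."""
    # shlex.quote wraps the URL in single quotes when it contains
    # characters the shell would otherwise interpret, such as & and ?.
    return "wget -r " + shlex.quote(url)


url = "http://www.google.com/search?hl=en&q=site%3Aem411.com&btnG=Google+Search"
print(wget_command(url))
```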