How to track the sources a website pulls news from?
September 29, 2012 5:41 PM Subscribe
I'm wondering if anyone could suggest a tool or website that allows me to see a list of sites that a particular website is pulling news from. I know I can do this manually by looking at articles and noting the sites that are sourced, but I'm wondering if there is an easier way to do it.
Any ideas?
Thanks,
- Michael
posted by ISeemToBeAVerb to computers & internet (2 answers total)
wget -O http://particular_website.com | grep -cf file_with_root_website_addresses.txt > output.txt
where file_with_root_website_addresses.txt would look like: ...for each news agency you were looking for.
Since this is more of a hint than explicit instructions, here's the wget manual and the grep manual
posted by Orb2069 at 11:33 AM on September 30, 2012