Datamining the public web
July 30, 2007 1:52 AM
How do I build a data warehouse that scrapes data from public websites for my own use? Tools? Tips?
Hi. I would like to track apartments on a classifieds site and use the data to analyze the impact of different factors on price. What I need is a tool or scripting language that would make it easy for me to spider the website and put the data in a database. Preferably this would be an open source solution.
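As a rough illustration of the spider-and-store idea, here is a minimal sketch using only the Python standard library. The HTML snippet, the class names (`listing`, `title`, `price`), and the table name `apartments` are all made up for the example; a real classifieds site will have different markup, and the page would first have to be fetched (for instance with `urllib.request`), respecting the site's terms of use.

```python
import sqlite3
from html.parser import HTMLParser

# Hypothetical markup standing in for a downloaded listings page.
SAMPLE_HTML = """
<div class="listing"><span class="title">2 rooms, city centre</span><span class="price">8500</span></div>
<div class="listing"><span class="title">Studio near park</span><span class="price">5200</span></div>
"""


class ListingParser(HTMLParser):
    """Collects the title and price text from each <div class="listing">."""

    def __init__(self):
        super().__init__()
        self.listings = []    # finished listings, one dict each
        self._current = {}    # listing currently being read
        self._field = None    # "title" or "price" while inside that span

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class")
        if tag == "div" and cls == "listing":
            self._current = {}
        elif tag == "span" and cls in ("title", "price"):
            self._field = cls

    def handle_data(self, data):
        # Text may arrive in several chunks, so append rather than overwrite.
        if self._field:
            self._current[self._field] = self._current.get(self._field, "") + data

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None
        elif tag == "div" and self._current:
            self.listings.append(self._current)
            self._current = {}


def parse_listings(html):
    """Return a list of (title, price) tuples found in the HTML."""
    parser = ListingParser()
    parser.feed(html)
    return [(l["title"].strip(), int(l["price"].strip())) for l in parser.listings]


def store(conn, rows):
    """Insert the scraped rows into a SQLite table, creating it if needed."""
    conn.execute("CREATE TABLE IF NOT EXISTS apartments (title TEXT, price INTEGER)")
    conn.executemany("INSERT INTO apartments (title, price) VALUES (?, ?)", rows)
    conn.commit()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # use a file path to keep the data
    store(conn, parse_listings(SAMPLE_HTML))
    print(conn.execute("SELECT AVG(price) FROM apartments").fetchone()[0])
```

Once the rows are in a database, price analysis becomes ordinary SQL. SQLite is used here only because it needs no server; any open source database would serve the same purpose.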
I am also looking for good tools for extracting information from longer pieces of text. For example, on the site I want to mine, users can comment on every listing. I would like to be able to decide whether a comment is positive, negative, or neither. I have seen this done on an online art site whose name I can't remember right now. The artist used blog posts and inferred the mood of the writer from the words that were used.
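The word-based approach described above can be sketched very simply: count words from a positive list and a negative list and compare. The two word lists below are invented for illustration and are far too small for real use; this is not the art site's actual method, only the simplest version of the idea.

```python
import re

# Hypothetical word lists; a real lexicon would be much larger.
POSITIVE = {"great", "nice", "bright", "quiet", "clean", "good"}
NEGATIVE = {"noisy", "dirty", "small", "bad", "expensive", "dark"}


def classify_comment(text):
    """Label a comment 'positive', 'negative', or 'neither' by counting words."""
    words = re.findall(r"[a-z']+", text.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neither"


if __name__ == "__main__":
    for comment in ("Great location, very quiet and clean.",
                    "Noisy street and a dirty stairwell.",
                    "Two rooms on the third floor."):
        print(classify_comment(comment), "-", comment)
```

Plain counting like this ignores negation ("not nice") and sarcasm, so it is best treated as a starting point rather than a reliable classifier.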
posted by ilike to computers & internet (15 comments total)
However, I am sure there could be for a certain price.
If you post to places like scriptlance.com, rentacoder.com, guru.com, or any of the other freelance programming sites, you should be able to find somebody to build you something custom for a couple hundred dollars, or maybe even less.
posted by B(oYo)BIES at 2:06 AM on July 30, 2007 [1 favorite]