I'm building a small keyword-relevancy search engine. I'd like to move beyond simply keeping track of what words are in what documents -- at least, I assume there's a beyond having to do with keyword density and such. What should I read to find out more? Are there open source projects I should check out?
sed: How do I use it to do a general search/replace on a bunch of files? The docs seem to believe all I have to do is feed it s/regexp/replacetext/g and a list of files and I'm off to the races, but rather than getting the files changed... [exciting conclusion inside]
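The rest of the question is cut off, so this is only a guess at the symptom: by default sed prints the edited text to standard output and leaves the input files untouched. Under that assumption, a minimal sketch of a search/replace across several files is to run sed on each file, capture the output in a temporary file, and copy it back. The function name replace_in_files is made up for illustration, and mktemp is assumed to be available (it is common but not part of POSIX).

```shell
# Hypothetical helper: apply one sed script to each listed file, in place.
# Usage: replace_in_files 's/regexp/replacetext/g' file1 file2 ...
replace_in_files() {
    script=$1
    shift
    for f in "$@"; do
        tmp=$(mktemp) || return 1
        # sed writes the edited text to stdout; capture it in a temp file.
        if sed "$script" "$f" > "$tmp"; then
            # Copy the result back over the original; writing through the
            # existing file keeps its permissions and ownership.
            cat "$tmp" > "$f"
            rm -f "$tmp"
        else
            # Leave the original untouched if sed reports an error.
            rm -f "$tmp"
            return 1
        fi
    done
}
```

GNU sed can also do this directly with its -i option (sed -i 's/regexp/replacetext/g' file1 file2), while BSD and macOS sed need an explicit, possibly empty, backup suffix after -i (sed -i '' ...). The temp-file loop avoids depending on either variant.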