You say "validation," they say "verification"....
January 15, 2010 2:25 PM
How can I parse several largish (~6 MB) text documents to produce a common index of keywords and phrases? I need something that will recognize phrases as well as keywords, kind of like Amazon's Statistically Improbable Phrases.
posted by cross_impact to Computers & Internet (5 answers total)
I am trying to reconcile terminology across the user requirements documents of a dozen different user organizations, all stakeholders in the same large system under development.
I need to document that terminology and square it with the development team's understanding. There are about 18 documents, and I would like a nifty piece of software that would parse them (after a reasonable amount of preprocessing, if necessary) and spit out an index of keywords and phrases that are candidates for "jargon" needing definition and/or reconciliation, along with the documents and user organizations that use each one. Any help?
Oh yeah. And, of course, I have no budget for tools.
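To make the request concrete, here is a rough sketch of the kind of index described above, using only the Python standard library (so it costs nothing). It is not Amazon's Statistically Improbable Phrases method; it just counts one- to three-word phrases per document and records which documents use each one. The function names, the tiny stopword list, and the `min_count` threshold are all illustrative assumptions, and because punctuation is ignored, some phrases will straddle sentence boundaries.

```python
import re
from collections import Counter, defaultdict

# Tiny illustrative stopword list (an assumption); a real run would want a fuller one.
STOPWORDS = frozenset(
    "a an and are as at be by for from in is it of on or that the this to "
    "was were will with shall must may".split()
)

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z][a-z0-9'-]*", text.lower())

def candidate_terms(tokens, max_len=3):
    """Yield 1..max_len word phrases that do not start or end with a stopword."""
    for n in range(1, max_len + 1):
        for i in range(len(tokens) - n + 1):
            gram = tokens[i:i + n]
            if gram[0] in STOPWORDS or gram[-1] in STOPWORDS:
                continue
            yield " ".join(gram)

def build_index(docs, min_count=2, max_len=3):
    """Map each candidate term to {doc_name: count}.

    `docs` is a dict of {doc_name: full_text}. A term is listed under a
    document only if it occurs at least `min_count` times in that document.
    """
    index = defaultdict(dict)
    for name, text in docs.items():
        counts = Counter(candidate_terms(tokenize(text), max_len))
        for term, count in counts.items():
            if count >= min_count:
                index[term][name] = count
    return dict(index)

def uneven_terms(index, total_docs):
    """Terms used by some but not all documents: likely candidates to reconcile."""
    return sorted(t for t, used_by in index.items() if len(used_by) < total_docs)

# Two made-up snippets standing in for the real requirements documents.
docs = {
    "org_a": "The system shall perform data validation. Data validation is required.",
    "org_b": "The system shall perform data verification. Data verification is required.",
}
index = build_index(docs)
for term in uneven_terms(index, len(docs)):
    print(term, "->", sorted(index[term]))
```

With the real documents, `docs` would be filled by reading each file into a string keyed by the organization's name. Terms that every organization uses the same number of times are probably shared vocabulary; the terms that show up in only some documents are the ones worth a closer look.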