American English word frequencies?
April 9, 2007 9:38 PM
Subscribe
Where can I find a good word frequency list for American English?
Requirements: Not just sorted by frequency, but with specific frequency information for each word.
Not lemmatized.
These lists are almost perfect, except they're based on British English (and they have separate entries for "n't" and "'s").
Does a such a list even exist? Is this the kind of thing I can't get online? Google has revealed to me only other lists based on the British National Corpus and a lot of word lists for open source spell checkers and Scrabble programs. Am I missing something?
(Suggestions for other interesting corpora that I could use to generate my own list of this sort would also be appreciated—this is for a generative poetry project.)
posted by aparrish to writing & language (12 comments total)
3 users marked this as a favorite
For example, generate frequencies from http://www.infomotions.com/etexts/literature/english/1600-1699/shakespeare-sonnets-59.txt
posted by demiurge at 10:01 PM on April 9, 2007