What is the word frequency distribution in the NY Times?
January 22, 2009 11:22 AM
How many different words (excluding proper nouns) appear in the New York Times on average?
I remember hearing this factoid at one point, but I can't find it anywhere on Google or MeFi. It was something along the lines of "300 words make up 80% of the New York Times." Does anyone have the actual figure offhand?
posted by stevekinney to writing & language (9 comments total)
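By far the most common words are function words like the, of, by, that, is, and for. The frequency distribution of words is said to follow Zipf's law: a word's frequency is roughly inversely proportional to its rank, so a small set of very common words accounts for a large share of any text. If you're interested in word frequency, you might want to check out this searchable database of word frequency built from Time magazine data.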
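If you have a pile of article text on hand, a rough sketch like the one below would let you check a claim of the "N words make up 80%" kind yourself. The function name, the regex tokenizer, and the sample sentence are all made up for illustration. Note that it lowercases everything and does not try to exclude proper nouns, so it only approximates the question as asked.

```python
import re
from collections import Counter


def words_needed_for_coverage(text, coverage=0.8):
    """Count how many distinct words, taken from most to least frequent,
    are needed to account for `coverage` (a fraction) of all word tokens."""
    # Crude tokenizer (an assumption): runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0

    counts = Counter(tokens)
    target = coverage * len(tokens)

    running = 0
    for n, (_, count) in enumerate(counts.most_common(), start=1):
        running += count
        if running >= target:
            return n
    return len(counts)


# Toy input; a real check would use a large sample of newspaper text.
sample = "the cat sat on the mat and the dog sat on the rug"
print(words_needed_for_coverage(sample, 0.8))
```

On a real corpus the answer depends heavily on how you tokenize and on whether you fold together forms like "run" and "running", so treat any single number with some caution.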
posted by demiurge at 11:52 AM on January 22