Diversity/concentration measure for nonexclusive categories?
November 15, 2008 5:20 PM
Subscribe
Measures of diversity / fragmentation / concentration for nonexclusive categories?
So I have data on the financial interests of legislators, and one of the too many things I want to do with them is look at the diversity, fragmentation, or concentration of industries represented in the legislature.
This is pretty trivial for occupations, because if your occupation is "Lawyer," it isn't "Farmer." So for occupation, you can (and people have) just used the Herfindahl index from econ. For those playing the home game, this is the sum of the squares of the market shares.
But I have financial interests. And you can get income from a law firm and income from a farm at the same time. This means that I can't directly use the Herfindahl index, because now the sum of the market shares isn't 100%, it's 105 or 120 or 150%.
So does anyone know of a standard method or summary statistic used to measure diversity or concentration where any individual observation can be in more than one category at the same time? Like, something used to measure linguistic diversity that allows people to speak more than one language, or something else canned?
I can think of ways around this -- modifying the Herfindahl, or doing factor analysis and counting the number of recovered dimensions. But if there's something well-specified and simple from another discipline, I'd prefer to use that.
posted by ROU_Xenophobe to science & nature (10 comments total)
1 user marked this as a favorite
I am not sure concentration is meaningful if you are not doing it either of these ways, since the donors are not really in competition otherwise. You might want to look at simple correlated donations or use cluster analysis to tease out relations between various donor industries, however.
posted by blahblahblah at 6:44 PM on November 15, 2008