Parsing statistical data
May 12, 2014 3:06 PM
I have a spreadsheet with 120,000 or so rows & need to pull out some data.
Specifically, the format (columns separated by commas, with gaps between rows added for clarity) is --
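The first field is an identifier for a material; the second field is the category (A, B and/or C) that the material is associated with. A material associated with more than one category appears on more than one row.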
Desired output is the number of materials associated with:
* A alone, B alone, and C alone
* A & B; A & C; and B & C
* all three together (A, B & C)
So, given the above data:
* A alone = count of 1
* B & C = count of 1
* A & C = count of 1
* A, B & C = count of 2
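
For what it's worth, the counting described above could be sketched in Python roughly as follows. This is only a minimal sketch, assuming the spreadsheet is exported as a two-column CSV (identifier, category) with one row per association. The function name `count_combinations`, the identifiers 1001 to 1005, and the sample rows are all hypothetical, chosen only so that they reproduce the counts listed above.

```python
import csv
import io
from collections import Counter, defaultdict


def count_combinations(rows):
    """Count how many materials map to each exact set of categories."""
    categories_by_material = defaultdict(set)
    for material_id, category in rows:
        # Collect every category seen for this identifier.
        categories_by_material[material_id.strip()].add(category.strip())
    # frozenset makes each combination hashable so Counter can tally it.
    return Counter(frozenset(cats) for cats in categories_by_material.values())


# Hypothetical sample data (not the original spreadsheet).
SAMPLE = """\
1001,A
1002,B
1002,C
1003,A
1003,C
1004,A
1004,B
1004,C
1005,A
1005,B
1005,C
"""

# Skip any blank rows; for a real file, use open(path, newline="") instead.
rows = [row for row in csv.reader(io.StringIO(SAMPLE)) if row]
counts = count_combinations(rows)

# Print smallest combinations first, alphabetically within each size.
for combo, n in sorted(counts.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(" & ".join(sorted(combo)), "=", n)
```

Because each material is reduced to the exact set of categories it appears with, "A alone" and "A & C" are counted separately, and combinations that never occur simply come back as zero. Rerunning on new data would only mean pointing the reader at a different file.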
I can kludge my way through this, but certainly not in any kind of elegant way that would make repeating the analysis down the road with new data particularly easy or fun.
Anyone with database or Excel chops have any pointers that could get me headed in the right direction with this little project?