Comments on: ANOVA Modelling
http://ask.metafilter.com/8400/ANOVA-Modelling/
Comments on Ask MetaFilter post ANOVA ModellingThu, 01 Jul 2004 17:23:12 -0800Thu, 01 Jul 2004 17:23:12 -0800en-ushttp://blogs.law.harvard.edu/tech/rss60Question: ANOVA Modelling
http://ask.metafilter.com/8400/ANOVA-Modelling
What's a good rule of thumb for the maximum number of 'varieties' and 'factors' you can have in an ANOVA model? <br /><br /> That is, where does the computation get intractable? How much data needed for a given layout? Stuff like this. I realize the question is somewhat ill-posed, just curious if anyone has practical experience from real-world application they'd like to share.post:ask.metafilter.com,2004:site.8400Thu, 01 Jul 2004 16:36:31 -0800freebirdstatisticsanovamanovadependenceindependencevarietyfactornoiseBy: Voivod
http://ask.metafilter.com/8400/ANOVA-Modelling#162581
<a href="http://www.ananova.com/desktop/picture.html?startingAt=2&keywords=+&subsite=entertainment&_subsite=">ANANOVA models?</a><br>
/had to be donecomment:ask.metafilter.com,2004:site.8400-162581Thu, 01 Jul 2004 17:23:12 -0800VoivodBy: nixxon
http://ask.metafilter.com/8400/ANOVA-Modelling#162589
A common rule of thumb is that you need 10 data points for each predictor in a model. You can get away with less, particularly if your effect size is huge. But more data points is always better.comment:ask.metafilter.com,2004:site.8400-162589Thu, 01 Jul 2004 17:46:07 -0800nixxonBy: ajpresto
http://ask.metafilter.com/8400/ANOVA-Modelling#162713
I think the question was actually, how many variables can you have. The answer is: Not too many. I think anything above 3 starts to get unruly. Think of all the interactions you would need to interpret.<br>
<br>
I have two factors which leads to only one interaction factor (between the two factors) and I don't really know what it means. I'm faking it.comment:ask.metafilter.com,2004:site.8400-162713Fri, 02 Jul 2004 07:26:59 -0800ajprestoBy: bonehead
http://ask.metafilter.com/8400/ANOVA-Modelling#162723
With large numbers of variables, you're often better off looking at different methods of slicing your data, usually some sort of aggregate analysis. Which set of measurements is most like another, and such. Principle component analysis and/or hierarchical clustering become more interesting than individual variables. Even with four or five variables, PCA or HC can be useful. Doesn't really help directly with the question, I know, but still might be fruitful to pursue.comment:ask.metafilter.com,2004:site.8400-162723Fri, 02 Jul 2004 08:05:56 -0800boneheadBy: nixxon
http://ask.metafilter.com/8400/ANOVA-Modelling#162747
I guess my initial response was unclear. What I was trying to say is that the number of variables (or factors, or whatever you want to call them) you can include depends on the number of data points you have. From a purely mathematical perspective, you can have lots of variables if you have lots of data. <br>
<br>
But ajpresto is right -- interpreting the results of an ANOVA is a bitch if you have lots of factors. I've driven myself to the brink of madness trying to interpret 4-way interactions. For models with lots of independent variables (predictors), a linear regression would be easier to interpret -- and it does essentially the same thing as ANOVA.comment:ask.metafilter.com,2004:site.8400-162747Fri, 02 Jul 2004 08:48:06 -0800nixxonBy: freebird
http://ask.metafilter.com/8400/ANOVA-Modelling#162837
That was very helpful, thanks! <br>
<br>
I've used PCA/etc, not quite what I need here. The (M)ANOVA stuff I've done has been with microarrays, so (if I keep the terminology right, it's been a while) I had a HUGE number of "varieties" (genes/spots) but only a few "Factors" - "Gene","Dye","Chip","Sample".comment:ask.metafilter.com,2004:site.8400-162837Fri, 02 Jul 2004 12:01:28 -0800freebird