I have no statistics skills whatsoever, and I would love to have this figured out before I return to campus tomorrow morning. Please help!
Background: I have several sets of 70 numbers each (they represent the lengths of bacterial cells infected with different phages). I want to show that there is a significant difference between the two sets. While my averages look very good, adding error bars negates my findings. For example:
Set 1: Average 6.83, Standard deviation 1.67.
Set 2: Average 4.1, Standard deviation 1.00.
Possible explanation: Let's look at one set at a time. There is a chance that I infected my bacteria with less phages than I planned to, so that not all the bacteria were affected--which would effectively split the populations sampled in each set into "infected" and "uninfected", and presumably they would have different length distributions. Can I test for this before I repeat my experiment (I plan to do that anyway, but still want to know if my findings are significant at this point)?
Googling taught me that Hartigan's Dip Test is what I need. A stranger kindly posted
Matlab functions for the test.
Problem: I have Matlab installed on my computer, but I have never used it and I have no idea what to do at this point. I have a column of values in Excel, and even if I manage to enter it as a set in Matlab (although my attempts so far don't look good), I don't know how to run the test. If you can show me how to run it, how would I interpret the results so that they would be meaningful to me (provided I get Matlab to print them)? Please, please, can you help me with that?
Do you have any other suggestions for what to do to my data (remove outliers? how?) or how to look at it in order to see what's going on? (I can post the dataset somewhere if necessary.) Thanks!
histogram in excel
histogram in matlab
if you're struggling with the computational aspect of it, i can't imagine it would take you longer than 10 minutes to count 70 data points into some bins by hand.
posted by sergeant sandwich at 8:04 PM on September 23, 2008