Thanks very much. I’m still a little unclear on why this is appropriate though.
My query can also be extended to situations where the categories are not ordered in any way:
Consider a contingency table test where we are trying to see if the subject that a student chooses to study is independent of their sex.
The possible subjects are French, German, Biology, Chemistry and Physical Education.
Let us say we get an expected frequency of 3 for French and 4 for Biology so that these categories need to be combined.
We could combine them together to create a new category “French and Biology” and lose only 1 degree of freedom.
However, due to the similarities between types of subject, I might expect that the gender trends for French and German might be more similar than those for French and Biology. I wonder if it would therefore be more meaningful to combine French and German to create a new category "Languages" and combine Biology and Chemistry to create a new category "Sciences". These groupings seem to sit more naturally with each other but of course we would then lose 2 degrees of freedom instead of one.
So in summary I’m still a bit confused about what factors we should take into account when considering combining categories i.e. is it just degrees of freedom, or also does the appropriateness of the combinations matter? Also, does it make any difference whether we are doing a goodness of fit test or a contingency table test?
Thanks in advance for any help provided.
Last edited by a moderator: Oct 5, 2009