C
Cheng
Member
Hi again,
I have some question from this chapter and hopefully someone can help me about it. I'm pretty confused about the materials.
1) PDF for Tweedie distribution looks nasty. Are we expected to know how to show that this distribution is part of the exponential family? i.e. to find a(.), b(.), c(.) etc?
2) last para of page 34 says 'if this interaction term is insignificant in the model then we would conclude that the effect of policyholder age is the same for every level of randomgroup and that policyholder age is consistent throughout the whole data'
does that mean that 'age' is not a significant factor in the model? what does it mean when a factor is consistent throughout the whole data?
3) page 35 says that deviance residual measures the distance between the actual observation and the fitted value while raw residuals shows the difference between actual and GLM expected values.
It sounds like deviance residual and raw residuals are the same. What are the difference between the two?
4) for residual plots, can i conclude that
i) if the residual plot is symmetrical about the x-axis but is not constant across the width of the fitted values (like the second n third graph), then the distribution is appropriate (ie we have chosen the right exponential family) but just inappropriate parameters
ii) if the residual plot is not symmetrical about the x-axis, this indicates that I've chosen an inappropriate distribution in the exponential family
5) I don't really get the difference between complete and marginal interaction. Can you provide another example?
6) page 49, why is it the case that when one removes the factor with the most exposure, the standard error associated with other parameter estimates are minimised?
7) I don't get the example on aliasing in section 5.4. Can you point me to another example which shows how aliasing works?
Sorry for having so many questions in one go, but hopefully someone can help me with these.
Thanks in advance!
I have some question from this chapter and hopefully someone can help me about it. I'm pretty confused about the materials.
1) PDF for Tweedie distribution looks nasty. Are we expected to know how to show that this distribution is part of the exponential family? i.e. to find a(.), b(.), c(.) etc?
2) last para of page 34 says 'if this interaction term is insignificant in the model then we would conclude that the effect of policyholder age is the same for every level of randomgroup and that policyholder age is consistent throughout the whole data'
does that mean that 'age' is not a significant factor in the model? what does it mean when a factor is consistent throughout the whole data?
3) page 35 says that deviance residual measures the distance between the actual observation and the fitted value while raw residuals shows the difference between actual and GLM expected values.
It sounds like deviance residual and raw residuals are the same. What are the difference between the two?
4) for residual plots, can i conclude that
i) if the residual plot is symmetrical about the x-axis but is not constant across the width of the fitted values (like the second n third graph), then the distribution is appropriate (ie we have chosen the right exponential family) but just inappropriate parameters
ii) if the residual plot is not symmetrical about the x-axis, this indicates that I've chosen an inappropriate distribution in the exponential family
5) I don't really get the difference between complete and marginal interaction. Can you provide another example?
6) page 49, why is it the case that when one removes the factor with the most exposure, the standard error associated with other parameter estimates are minimised?
7) I don't get the example on aliasing in section 5.4. Can you point me to another example which shows how aliasing works?
Sorry for having so many questions in one go, but hopefully someone can help me with these.
Thanks in advance!