I am working in R with a response variable that is the letter grade the student received in a specific course. The response is ordinal, and, in my opinion, seems logically proportional. My understanding is that I need to test that it is proportional before I can use polr() instead of multinom().
For one of my courses of data, I "tested" the proportionality like this:
M1 <- logLik(polrModel) #'log Lik.' -1748.180691 (df=8)
M2 <- logLik(multinomModel) #'log Lik.' -1734.775727 (df=20)
G <- -2*(M1$1 - M2$2)) #I used a block bracket here in the real code
# 26.8099283
pchisq(G,12,lower.tail = FALSE) #DF is #of predictors
#0.008228890393 #THIS P-VAL TELLS ME TO REJECT PROPORTIONAL
For a second way of testing the proportional odds assumption, I also ran two vglm models, one with family=cumulative(parallel =TRUE)
the other with family=cumulative(parallel =FALSE)
. I then ran a pchisq()
test with the difference of the models' deviances and the differences of the residual degrees of freedom.
Is either of these way respectable? If not, I would love help with the actual coding for determining whether to accept or reject the proportional odds assumption!
In addition to the above two tests, I graphed my cumulative probabilities against each of the predictors, individually. I read that I want these lines to be parallel. What I don't understand is, with polr()
your output is a single slope for each independent variable (a coefficient) and then a specific intercept depending on which cumulative probability you are working with (ex: P(Y<=A), P(Y<=B), etc). So, if your slope coefficients are all the same for each of the equations, how could the lines not be parallel?
I picked up the basics of my knowledge on Chris Bilder's YouTube class; he talks about the parallel graphs here at minute 42.
Any help is appreciated! Thank you!