My great dread. I have a logistic regression There are 645 cases and data is missing in about 228. Only 6 did not answer the dependent variable the median missing for any predictor is 15 about 2 percent of the responses. The problem is there are 38 predictors and being missing on any causes you to get thrown out of the logistic regression (long ago I read about an alternative when you used all the cases even when they were missing some information on some question, but the problems raised in doing so convinced me this was too dangerous).
I am not sure what to do, I know of multiple imputation, but my understanding is that doing this with non-interval data is problematic (actually I stopped studying this because I was told that on this board years ago).
All my predictors are dummy variables, my DV has two levels.
We are doing this to determine which variables are relatively more important, the way we do that is see which are statistically significant (I have found no good way to address relative importance with logistic regression). I am not sure what to do with so many missing cases.
Is it reasonable when you see an unusually high number of cases missing to remove a question, because you think people did not understand it, or had no answer (in honesty I think this is true with the specific question even ignoring all the missing questions - no one asked me about it when it was created)?
I am not sure what to do, I know of multiple imputation, but my understanding is that doing this with non-interval data is problematic (actually I stopped studying this because I was told that on this board years ago).
We are doing this to determine which variables are relatively more important, the way we do that is see which are statistically significant (I have found no good way to address relative importance with logistic regression). I am not sure what to do with so many missing cases.
Is it reasonable when you see an unusually high number of cases missing to remove a question, because you think people did not understand it, or had no answer (in honesty I think this is true with the specific question even ignoring all the missing questions - no one asked me about it when it was created)?