Missingness C. Case Special Case Logistic Regression

I apologize if this question or question like it has been answered in the past on this forum. The search function doesn’t seem to be working on the site.

I have been studying the subject of multiple imputation methods (such as multiple imputation by chained equations MICE) and I’m currently reading “Flexible Imputation of Missing Data” by the author of MICE in R. All was somewhat fine in my understanding until I hit this passage:

Complete case analysis can perform quite badly under MAR and some MNAR cases (Schafer and Graham, 2002), but there are two special cases where it can outperform multiple imputation.

The second special case holds only if the complete data model is logistic regression. Suppose that the missing data are confined to either a dichotomous Y or to X, but not to both. Assuming that the model is correctly specified, the regression coefficients (except the intercept) are unbiased if only the complete cases are analyzed as long as the probability to be missing depends only on Y and not on X (Vach, 1994). This property provides the statistical basis of the estimation of the odds ratio from case-control studies in epidemiology. If missing data occur in both Y and X the property does not hold.
After doing further digging my understanding is that in the case of logistic regression if the “missingness” is related to just independent or to just dependent but not both then the model parameters remain unbiased under complete case analysis? I do understand that the intercept may still be biased.


Omega Contributor
Yeah, I think I recall reading something along these lines before. Though you may lose some precision given the sample size is smaller. Also, I think you can correct intercept when calculating estimates if you have a presumed value for the intercept (base case).