Next: How do I tell
Up: Questions for both versions
Previous: How does the EMis
  Contents
The following four conditions must all hold.
- The analysis model is conditional on the explanatory variables
(such as a regression model) and the functional form is known to
be correctly specified (so that listwise deletion is consistent and
robustness is not lost when applying listwise deletion to data with
slight problems of measurement error, endogeneity, nonlinearity,
etc.).
- There is nonignorable selection on
, so that multiple
imputation can give incorrect answers, and no
variables are
available that could be used in an imputation stage to fix the
problem.
- Missingness in
is not a function of
, and similarly
unobserved omitted variables
that affect
do not exist.
- The number of observations left after listwise deletion is large
so that the efficiency lost does not counter balance (in a mean
square error sense, for example) the biases induced by the other
conditions.
To prefer listwise deletion to EMis, you must then have enough
information about problems with the variables in your analysis that
you do not trust them to impute the missing values in your
's --
or you worry more about using available information to impute the
's than the existence of selection on
as a function of
in
3, which our approach would correct. Despite all this, you still
trust your data enough to want to use them in an analysis model. If
all these conditions hold, listwise deletion can outperform EMis, and
researchers should obviously consider whether they might hold in their
data. However, this situation -- where using more information is
worse and we know about it -- is likely to be rare.
Next: How do I tell
Up: Questions for both versions
Previous: How does the EMis
  Contents
Gary King
2003-07-25