-
Views
-
Cite
Cite
Yongyun Shin, Stephen W. Raudenbush, Just-Identified Versus Overidentified Two-Level Hierarchical Linear Models with Missing Data, Biometrics, Volume 63, Issue 4, December 2007, Pages 1262–1268, https://doi.org/10.1111/j.1541-0420.2007.00818.x
- Share Icon Share
Summary
The development of model-based methods for incomplete data has been a seminal contribution to statistical practice. Under the assumption of ignorable missingness, one estimates the joint distribution of the complete data for θ∈Θ from the incomplete or observed data yobs. Many interesting models involve one-to-one transformations of θ. For example, with yi ∼ N(μ, Σ) for i = 1, … , n and θ = (μ, Σ), an ordinary least squares (OLS) regression model is a one-to-one transformation of θ. Inferences based on such a transformation are equivalent to inferences based on OLS using data multiply imputed from f(ymis | yobs, θ) for missing ymis. Thus, identification of θ from yobs is equivalent to identification of the regression model. In this article, we consider a model for two-level data with continuous outcomes where the observations within each cluster are dependent. The parameters of the hierarchical linear model (HLM) of interest, however, lie in a subspace of Θ in general. This identification of the joint distribution overidentifies the HLM. We show how to characterize the joint distribution so that its parameters are a one-to-one transformation of the parameters of the HLM. This leads to efficient estimation of the HLM from incomplete data using either the transformation method or the method of multiple imputation. The approach allows outcomes and covariates to be missing at either of the two levels, and the HLM of interest can involve the regression of any subset of variables on a disjoint subset of variables conceived as covariates.