Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: multimple imputation in steps??
From
Maarten buis <[email protected]>
To
[email protected]
Subject
Re: st: multimple imputation in steps??
Date
Wed, 7 Apr 2010 13:06:37 -0700 (PDT)
--- On Wed, 7/4/10, Ploutz-Snyder, Robert (JSC-SK)[USRA] wrote:
> Consider a dataset where x1-x5 are separate but correlated
> variables that measure "factor1" and further that
> X6-x10 all tap into "factor2."
>
> Using mi impute in a very general sense, I could simply
> impute all missing items in one step by using
>
> mi impute x1-x10
>
> But that ignores the shared variance structure of the
> factors (nothing on the right side of the impute equation).
>
> Instead I would like to generate factor scores prior to
> imputations, which will result in factor1 and factor2 scores
> for subjects who are missing no data. Then the second
> step would be to use mi impute to impute the FACTOR scores
> from the items loading on the factors, so that the factor
> loading structure is preserved in the imputed data.
> This assumes that the factor structure for subjects with any
> missing data match the factor structure for complete
> subjects, but I'm willing to assume that.
>
> So following the factor analysis and predict statements, I
> could use
>
> mi impute mvn factor1=x1-x5
>
> ...and then separately
>
> mi impute mvn factor2=x6-x10
>
> But this results in different samples in the m>0
> imputations, and I can't use mi estimate commands.
>
>
> What is the solution here? How can we use mi impute
> that preserve the underlying covariance structure among the
> data, i.e. allows different modeling of missing data based
> on what we know about the data a-priori, but then also use
> all of the m>0 imputations in estimation commands
> following imputation?
I would use -ice- to simultaneously impute x1-x10 _without_
the factor variables. This will preserve the correlation
structure, and you can use -factor- afterwards to estimate
your factor variables, while including a maximum of the
observed information.
Hope this helps,
Maarten
--------------------------
Maarten L. Buis
Institut fuer Soziologie
Universitaet Tuebingen
Wilhelmstrasse 36
72074 Tuebingen
Germany
http://www.maartenbuis.nl
--------------------------
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/