Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: Centering variables for interactions and using mi regress
From
Richard Williams <[email protected]>
To
[email protected]
Subject
Re: st: Centering variables for interactions and using mi regress
Date
Sat, 15 Jan 2011 18:36:12 -0500
Interesting question, and I am not sure if there
is any official wisdom on this. One reason to
center is so that 0 is a meaningful value. For a
variable like education, though, it might make
sense to deduct 12 from each case, so that a
score of 0 on the new variable corresponded to
high school graduate. But in any event, if my
life depended on it I think I would feel better
doing the centering before imputation, rather
than, say, have the centering be done differently with each imputation.
Also, I am not sure if this is a good idea, but I
suppose you could run the analyses without
centering, use the -mean- command to get the
imputed mean for each variable, and then center using the imputed means.
As for the interactions, Paul Allison posted a
while back that you should go ahead and compute
interactions with the unimputed data and then let
imputation fill in the missing values. This may
seem counter-intuitive, in that, say, if X1 has
an imputed value of 3, and X2 has an imputed
value of 7, the imputed value for the interaction
X1X2 need not be 21! But Allison argued that you
get biased results if you compute the interaction
after X1 and X2 have been imputed.
I am just sort of guessing with all this though;
listen to the experts if they come forward!
At 04:12 PM 1/15/2011, Amy S wrote:
Hello everyone! This may be a very basic
question for most of you, but I am running
regressions using multiple imputation (mi
regress) in stata 11. Because my models have
interactions, I need to center the continuous
variables used in those interactions. However,
I am unsure how to do this for the variable I am
imputing. In the past, without multiple
imputation, I have just centered the variables
first, and then used the new centered variables
(and interaction terms with those center
variables) in the regression. However, because
the missing data on the variable is imputed
based on the regression model itself, I donât
think I can do that. Please correct me if Iâm
wrong, but I doubt itâs legit to simply begin
by centering the variable and then use mi
regress to impute with the centered form of the
variables, is it? Even if that is legit, how
would I create the interaction term before
putting it into the regression if all the data
has not been imputed yet and therefore there
are missing variables? Maybe there is a
simple way to do all of the centering and
creating interaction terms and runnign the
regression model with imputation all in one step - I'm hoping so!  Â
Â
Thanks in advance!
Â
-Amy
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME: (574)289-5227
EMAIL: [email protected]
WWW: http://www.nd.edu/~rwilliam
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/