Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: Standardized interaction terms - which p-values hold?
From
Elisabeth Bublitz <[email protected]>
To
[email protected]
Subject
st: Standardized interaction terms - which p-values hold?
Date
Tue, 15 Jan 2013 16:01:30 +0100
Hi Statalist,
when I compare the p-values of a baseline regression with those obtained
from a regression with standardized coefficients and interaction terms
the following problem comes up: The suggestions previously posted (see,
http://www.stata.com/statalist/archive/2009-04/msg00888.html) are that
the variables forming the interaction need to be standardized before
they are interacted, and a second time afterwards. This changes the
p-values and sometimes even coefficients change their signs. Intuitively
this suggests to me that something with the previous suggestion is not
correct.
Here is the example from the previous thread:
*-------------------Example 1--------------------------------
* This version standardizes the IA once and serves as an example of what
is "incorrect"
sysuse auto, clear
gen ia = head*length
reg mpg head length ia, beta
* This version standardizes the IA twice and is suggested to be "correct"
egen shead = std(headroom)
egen slength = std(length)
egen smpg = std(mpg)
gen ia2 = shead*slength
egen sia2 = std(ia2)
reg smpg shead slength sia2
*-----------------------------------------------------------------
In this example the changes are visible but do not yet cross important
levels, therefore significance levels stay the same. This is, however,
different for the data I use. I'd be curious to learn what you think
about this.
I found an example where the changes are more visible.
*-------------------Example 2--------------------------------
sysuse census, clear
* Standardizing coefficients
egen zdivorce = std(divorce)
egen zmarriage = std(marriage)
egen zdeath = std(death)
egen zmedage = std(medage)
* Interaction terms
gen ia= death*medage
egen zia_1= std(ia)
gen test = zdeath*zmedage
egen zia_2 = std(test)
* Regression
reg divorce marriage death medage ia, beta //(1) this follows the
simpler procedure
reg divorce marriage death medage test, beta //(2) this standardizes the
IA twice, note changes in significance levels and coefficient size
reg zdivorce zmarriage zdeath zmedage zia_2 // (3) for comparison
(identical with (2)): this is the same as suggested in the previous thread
*-----------------------------------------------------------------
Unfortunately, I need to compare the size of two interactions and, thus,
need standardized coefficients. If you have other suggestions, let me
know. I was wondering whether it would make sense to use logarithms
instead.
Many thanks!
Elisabeth
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/