Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: Strange Behaviour When Selecting Levels For Factor Variables In Regression With i#
From
Sarah Elizabeth Edgington <[email protected]>
To
[email protected]
Subject
Re: st: Strange Behaviour When Selecting Levels For Factor Variables In Regression With i#
Date
Fri, 18 Jan 2013 22:12:53 -0800
I share Daniel's confusion and this explanation doesn't make it any
clearer to me.
If sex were not a binary variable then the other coefficients
changing would make sense because the different regressions would
actually be changing how sex is coded. However, with a variable
that's coded 0/1 to begin with, isn't i.sex the equivalent of
i1.sex? That is, isn't an indicator for when sex=1 exactly the same
variable as the original sex variable? In which case you'd expect
the first two regression examples to be exactly the same. The fact
that they aren't suggests to me that there's something I don't
understand about the i1.sex factor variable syntax. What am I missing?
-Sarah
At 08:32 PM 1/18/2013, you wrote:
Daniel,
The definition of a coefficient in a multiple regression model
includes the list of all the other predictors in the model. Thus,
using a different predictor for sex changes the definitions of the
coefficients for patient and the categories of when.
If the other predictors were orthogonal to sex, the numerical values
of their coefficients would not differ among the models, even though
the definitions were not the same.
Your models may be equivalent, in the sense that they have the same
predicted values and the same residuals, but they are not all the same
model.
I hope this discussion helps.
David Hoaglin
On Fri, Jan 18, 2013 at 6:45 PM, <[email protected]> wrote:
> Hello,
>
> when i use indicator i with selecting level of a factor variable
like i1.varname to run a regression I get strange results.
>
> For example:
>
> sysuse blong,clear
> regress bp i.sex i.when c.patient i.when#c.patient
> regress bp i1.sex i.when c.patient i.when#c.patient
> regress bp i0.sex i.when c.patient i.when#c.patient
>
> This regression is wihout sense but theoretically it should
estimate the same model and should give same results except for
variable sex cause all I do is demand an indicator for a different
level of a 2-level variable sex.
> But if I run these lines I get three regressions with three
different coefficients for the variable "when" and "patient" even I
didnt change anything that should be related to these variables.
> Whats wrong here?
>
> regards
> Daniel
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/