From | Roger Newson <[email protected]> |
To | [email protected] |
Subject | Re: st: size of omitted/reference category |
Date | Fri, 09 May 2003 15:21:26 +0100 |
At 10:07 09/05/03 -0400, Robert Bozick wrote:
Hi Everyone --The only "estimation problem" here is that, other things being equal, differences compared to a smaller reference category will have larger standard errors (and therefore wider confidence limits) than differences from a large reference category. This is why, if the experimenter has any say in the matter, then s/he will often design the experiment so that the reference category is the largest. There is nothing "incorrect" about using a small reference category.
This is more of a stats question that a STATA question --- nonetheless, I hope that you will allow me to pick your brains. I am working on a regression analysis, where the key set of covariates are a series of dummy variables. The most theoretically logical category to omit for the hypotheses we are trying to test is also the smallest. To get handle on the data, the group sizes are shown below:
group 1 = 69
group 2 = 3,636
group 3 = 553
group 4 = 894
Group 1 is the key group that we wish to use as the omitted category and include dummy variables for groups 2 through 4. Does anyone know if there are estimation problems when you use a small reference category? Is this approach legitimate? Does anyone know any citations that I could use as a guide for this?
© Copyright 1996–2024 StataCorp LLC | Terms of use | Privacy | Contact us | What's new | Site index |