
Re: st: imputing continuous values when respondents select categories, e.g., income category

From	Richard Williams <[email protected]>
To	"[email protected]" <[email protected]>, "[email protected]" <[email protected]>
Subject	Re: st: imputing continuous values when respondents select categories, e.g., income category
Date	Sat, 25 Apr 2009 00:28:47 -0500

At 10:23 PM 4/24/2009, Alan Acock wrote:

Richard Williams asked if I want to impute missing values or to plug in values within each interval, as opposed to assigning everybody the midpoint of the interval they select. The latter is what I want to do, and it appears that the intreg command with the ystar(a,b) option in the postestimation commands is exactly what I should use. This treats income as the dependent variable, but once we estimate the value we can use that as an independent variable in other models. At least this is my understanding.
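
For readers following the thread, here is a minimal sketch of what "plugging in values within each interval" amounts to, written in Python rather than Stata so the arithmetic is visible. It assumes an interval regression has already produced a linear prediction mu and a residual standard deviation sigma for a respondent, and that the errors are normal. The function name interval_mean and all numbers are hypothetical and do not come from the thread.

```python
import math

def norm_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def norm_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def interval_mean(mu, sigma, lo=None, hi=None):
    """E[y | lo < y < hi] for y ~ Normal(mu, sigma^2).

    lo=None or hi=None marks an open-ended bracket (no lower or upper bound).
    """
    a = -math.inf if lo is None else (lo - mu) / sigma
    b = math.inf if hi is None else (hi - mu) / sigma
    # Probability mass inside the bracket; can underflow to 0 for brackets
    # very far from mu, in which case the ratio below is undefined.
    prob = norm_cdf(b) - norm_cdf(a)
    return mu + sigma * (norm_pdf(a) - norm_pdf(b)) / prob

# Two hypothetical respondents who ticked the same bracket (on a log scale)
# but have different covariates, hence different linear predictions mu.
lo, hi = math.log(40000), math.log(60000)
for mu in (10.4, 11.0):
    est = interval_mean(mu, 0.6, lo, hi)
    print(f"mu={mu:.1f}  estimated log income={est:.3f}  midpoint={(lo + hi) / 2:.3f}")
```

Unlike the midpoint, the estimate differs across respondents in the same bracket and always stays inside it. As far as I recall from the Stata documentation for predict after intreg, this conditional (truncated) mean is what the e(a,b) option reports, whereas ystar(a,b) reports the censored mean, so it is worth checking the manual before relying on either.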

Actually, I am not sure if that is the optimal strategy or not. At a minimum, it seems there should be some sort of penalty for using estimated income rather than real income. You'll also have multicollinearity problems if all the vars used to compute estimated income are also in your other models.

Maarten Buis did touch on these issues at the summer 2008 NASUG (but I don't remember what he concluded!). See the first paper listed at


http://www.stata.com/meeting/snasug08/abstracts.html

Also, Powers & Xie discuss this sort of thing in section 6.2 of their book ("Statistical Methods for Categorical Data Analysis"). They propose a "normal score transformation" which in turn comes from Clogg & Shihadeh 1994 ("Statistical Models for Ordinal Variables"). There is no discussion of how much better it actually works though.



-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
HOME:   (574)289-5227
EMAIL:  [email protected]
WWW:    http://www.nd.edu/~rwilliam
