[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

RE: st: Issue with multiple imputation -ICE-

From	"Nick Cox" <[email protected]>
To	<[email protected]>
Subject	RE: st: Issue with multiple imputation -ICE-
Date	Wed, 13 Feb 2008 17:32:47 -0000

A similar alternative might be to impute logged values and then exponentiate. 

Mark Lunt

Ren� Wevers wrote:

> I am working with an extensive dataset (10.000+ observations) and
> logically some values are missing. I decided to use the -ice- package to
> impute certain missing values, but the result simply makes no sense to me.
>
> For instance I am estimating missing variables for the full time
> equivalent (FTE) of employees of a company mainly based on the absolute
> number of employees. (Logically) the absolute number of employees is never
> below zero. Also when I run a regression between the FTE number and
> absolute number I get a highly significant relation with positive
> coefficient and a positive constant estimates. Nevertheless when I run
> -ice-, the imputed values are extremely often (far) below zero (!!!). Also
> worrying is that the imputed values are practically all completely
> different (over factor 100) from the absolute number of employees, where a
> closer relation is (logically) expected.
>
> Is there some explanation for this or are we making any mistakes.
>
>   
ICE assumes that continuous variables are normally distributed: if that 
is not the case, impossible values can appear. In particular, if you 
have lots of companies with a few employees and a few companies with 
lots of employees, ICE will impute negative numbers of employees. One 
possible solution is to use the "match" option of ICE. Alternatively, I 
have written some ado-files which convert variables to normal-scores and 
back: you can convert to normal scores (which are normally distributed), 
perform the imputation on these variables, then convert back to your 
original distribution. If you are interested in using these ado-files, type

net from http://personalpages.manchester.ac.uk/staff/mark.lunt

into stata, then click on the blue "nscores"

Hope that's of some use

*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: histogram height as avg of another variable
  - From: "Prashant Loyalka" <[email protected]>
- st: Issue with multiple imputation -ICE-
  - From: Ren� Wevers <[email protected]>
- Re: st: Issue with multiple imputation -ICE-
  - From: Mark Lunt <[email protected]>

Prev by Date: Re: st: Issue with multiple imputation -ICE-
Next by Date: st: R: Moran I and spatial correleation help needed
Previous by thread: Re: st: Issue with multiple imputation -ICE-
Next by thread: Re: st: Issue with multiple imputation -ICE-
Index(es):
- Date
- Thread