Joelle,
It seems that you have missings in the by-variables. Consider the
following sentence: hotdeck y, store by(x1 x2) keep(id). You will have
troubles if x1 or x2 have missing, we "solved" the problem using the -9.
Suppose x1 = {1, 2, 3} and x2={1, 2, 3, 4, 5}, then we created x1=-9 if
x1==. and x2=-9 if x2==., we hotdeck in that way (with the news x1 and
x2) and then we put as missings the cases where x1 or x2 ==-9. We did
that with a simple loop.
Rodrigo.
-----Mensaje original-----
De: [email protected]
[mailto:[email protected]] En nombre de Joelle M
Anderson
Enviado el: Domingo, 30 de Marzo de 2008 01:38 p.m.
Para: [email protected]
Asunto: st: Hotdeck problem
For my thesis, I am using the hotdeck program to impute values for
missing cases in my income variable. Currently, I am trying to hotdeck
my income variable (176 missing) using 3 variables (age=27 missing;
education=13 missing; gender=0 missing; although with 9 overlapping
missing values the combination of these three variables only has 31
missing cases total). Yet when Stata creates my new, hotdecked income
variable, there are an additional 19 missing cases that I can't account
for (missing=50). Does anyone know why this might be? Another strange
thing is that, when I try to rename my hotdecked income measure before
merging it with my full dataset, all 50 missing cases remain missing
after merging; when I do not rename my hotdecked income measure before
merging, only 42 missing cases remain missing after merging. I have
pasted my Stata output below. Any help would be greatly appreciated!
Joelle Anderson
Graduate Student, Sociology
University of Wisconsin-Milwaukee
[email protected]
//First hotdeck imputation, renaming the income variable BEFORE merging
with full dataset
. hotdeck incomeR using IncomeHD, store by(education sex ageR) keep(resp
incomeR)
DELETING all matrices....
Table of the Missing data patterns
* signifies missing and - is not missing
Varlist order: incomeR
pattern | Freq. Percent Cum.
------------+-----------------------------------
* | 176 11.72 11.72
- | 1,326 88.28 100.00
------------+-----------------------------------
Total | 1,502 100.00
WARNING: When the <command> option is not selected then no analysis is
performed on the imputed datasets
. use "C:\data\IncomeHD1.dta", clear
. tab incomeR
RECODE of |
income |
(income. |
last year, |
that is in |
2004, what |
was your |
total famil | Freq. Percent Cum.
------------+-----------------------------------
1 | 103 7.09 7.09
2 | 164 11.29 18.39
3 | 222 15.29 33.68
4 | 148 10.19 43.87
5 | 162 11.16 55.03
6 | 265 18.25 73.28
7 | 178 12.26 85.54
8 | 124 8.54 94.08
9 | 86 5.92 100.00
------------+-----------------------------------
Total | 1,452 100.00
. rename incomeR incomez
. merge resp using "C:\Documents and Settings\anders35\My
Documents\Thesis_3_29.dta", unique sort
. tab incomez
RECODE of |
income |
(income. |
last year, |
that is in |
2004, what |
was your |
total famil | Freq. Percent Cum.
------------+-----------------------------------
1 | 103 7.09 7.09
2 | 164 11.29 18.39
3 | 222 15.29 33.68
4 | 148 10.19 43.87
5 | 162 11.16 55.03
6 | 265 18.25 73.28
7 | 178 12.26 85.54
8 | 124 8.54 94.08
9 | 86 5.92 100.00
------------+-----------------------------------
Total | 1,452 100.00
//Second hotdeck imputation, renaming the hotdecked income variable
AFTER merging with full dataset
. hotdeck incomeR using IncomeHotD, store by(education sex ageR)
keep(resp incomeR) DELETING all matrices....
Table of the Missing data patterns
* signifies missing and - is not missing
Varlist order: incomeR
pattern | Freq. Percent Cum.
------------+-----------------------------------
* | 176 11.72 11.72
- | 1,326 88.28 100.00
------------+-----------------------------------
Total | 1,502 100.00
WARNING: When the <command> option is not selected then no analysis is
performed on the imputed datasets
. clear
. use "C:\data\IncomeHotD1.dta", clear
. tab incomeR
RECODE of |
income |
(income. |
last year, |
that is in |
2004, what |
was your |
total famil | Freq. Percent Cum.
------------+-----------------------------------
1 | 98 6.75 6.75
2 | 162 11.16 17.91
3 | 220 15.15 33.06
4 | 153 10.54 43.60
5 | 159 10.95 54.55
6 | 267 18.39 72.93
7 | 178 12.26 85.19
8 | 126 8.68 93.87
9 | 89 6.13 100.00
------------+-----------------------------------
Total | 1,452 100.00
. merge resp using "C:\Documents and Settings\anders35\My
Documents\Thesis_3_29.dta", unique sort
. rename incomeR incomey
. tab incomey
RECODE of |
income |
(income. |
last year, |
that is in |
2004, what |
was your |
total famil | Freq. Percent Cum.
------------+-----------------------------------
1 | 98 6.71 6.71
2 | 164 11.23 17.95
3 | 222 15.21 33.15
4 | 153 10.48 43.63
5 | 159 10.89 54.52
6 | 270 18.49 73.01
7 | 178 12.19 85.21
8 | 126 8.63 93.84
9 | 90 6.16 100.00
------------+-----------------------------------
Total | 1,460 100.00
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
********************************************************************************
ADVERTENCIA: La informaci�n contenida en esta transmisi�n, y en cualquier archivo adjunto, est� sujeta a reserva legal conforme a la normativa aplicable al Banco Central de Chile, y no puede ser usada o difundida por personas distintas de su o sus destinatarios. Si usted ha recibido esta transmisi�n por error, por favor notifique inmediatamente al remitente respondiendo por este mismo medio y elim�nela de su sistema.
El Banco Central de Chile no se har� responsable de la exactitud y veracidad de la informaci�n contenida en este mensaje, as� como de su modificaci�n, copia, divulgaci�n o reenv�o, total o parcial. Su uso no autorizado puede ser sancionado de conformidad con las leyes chilenas.
El Banco Central de Chile transmite sus decisiones a trav�s de comunicados oficiales, los que pone a disposici�n del p�blico en su p�gina de Internet: www.bcentral.cl
DISCLAIMER: The information contained in this email or any attached file, is subject to legal privilege pursuant to the laws and regulations applicable to the Central Bank of Chile , and may not be used or disseminated by any person other than its intended recipients. If you have received this transmission in error, please notify the sender immediately by reply to this email address and delete it from your system.
The Central Bank of Chile shall not be liable for the accuracy or authenticity of the contents of this message, whether amended, copied, forwarded or disclosed in any form, in whole or in part. Please note that unauthorized use may be penalized in conformity with the Chilean law.
The Central Bank of Chile communicates its decisions by official releases, and
makes them available to the public in its WebPages: www.bcentral.cl
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/