
st: RE: creating variable with randomly missing values

From	"Nick Cox" <[email protected]>
To	<[email protected]>
Subject	st: RE: creating variable with randomly missing values
Date	Thu, 6 Dec 2007 23:59:57 -0000

I will ignore "dropped" and focus on "recoded as missing value". 

Whichever solution you use, first set the seed for reproducibility:

. set seed 280352

Any other number will do as well.

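This will do it approximately: each value is recoded independently with
probability 0.5, so the number recoded will be near 2500 but rarely
exactly 2500.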

. replace x = . if uniform() < 0.5 
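
The question asks for a new variable xmiss rather than a change to x
itself. A minimal sketch of the same approximate approach applied to a
copy, assuming x is numeric and has no missing values to begin with
(the seed is arbitrary, and xmiss is the name used in the question):

```stata
* Sketch only: assumes x is numeric with no missing values beforehand
set seed 280352

* Copy x, keeping its storage type and labels, so x itself is untouched
clonevar xmiss = x

* Each observation is recoded independently with probability 0.5
* (later Stata releases call this function runiform())
replace xmiss = . if uniform() < 0.5

* Check how many values were recoded; expect a number close to 2500
count if missing(xmiss)
```

Neither x nor y is modified, since only the copy is changed.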

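This will do it exactly, recoding 2500 of the 5000 observations chosen
at random and then restoring the original sort order: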

. gen long id = _n 
. gen random = uniform()
. sort random 
. replace x = . in 1/2500
. sort id 
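
The exact approach can likewise be applied to a copy, so that x stays
intact and the result is the xmiss asked for. This is a sketch under
the same assumptions (x numeric, no missing values beforehand), to be
used in place of the approximate method rather than after it. The
local macro half is an illustrative addition that avoids hard-coding
2500:

```stata
* Sketch only: assumes x is numeric with no missing values beforehand
set seed 280352

* Work on a copy so that x itself is untouched
clonevar xmiss = x

* Remember the original order, then shuffle on a random key
generate long id = _n
generate double random = uniform()
sort random

* Recode exactly half of the observations (2500 when _N is 5000)
local half = floor(_N/2)
replace xmiss = . in 1/`half'

* Restore the original order and remove the helper variables
sort id
drop id random
```

Sorting moves whole observations, so each y stays paired with its own x
throughout.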

Ahmed Arif wrote:

I have a dataset with variables x and y. The dataset has 5000
observations. I want to generate another variable, xmiss, such that 50%
of the x values are dropped (recoded as missing) at random from the
dataset without affecting the values of y. Is there an easy way to do this?
