[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: utility to create fake dataset?

From	Jeph Herrin <[email protected]>
To	[email protected]
Subject	st: utility to create fake dataset?
Date	Sun, 08 Nov 2009 12:20:24 -0500


I sometimes need to create a "fake" dataset that "looks?
like an existing dataset. For example, a dataset that
must, for health privacy reasons, remain on a remote server,
and I would like to develop code locally to run on it.
Or, I need to make mock tables to share with colleagues
who need to remain blinded for now to actual study data.

Usually, I just do something that seems "good enough", like
sample 5%, expand 20, replace values with random values, etc.
Or, in an extreme case, set obs to be twice the existing obs
and keep the ones with missing data. But the first is not
very satisfying when I need to reassure higher powers that
I have a "dummy" dataset, and the second is not very helpful
for writing final useable code.

So, I'm thinking I'll write a utility to create a 'dummy'
dataset from an existing dataset, but wondered if there was
something out there already. Perhaps there is even a well
established name for this process? My searches for "dummy"
and "fake" dataset have not been fruitful.

thanks,
Jeph


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- st: AW: utility to create fake dataset?
  - From: "Martin Weiss" <[email protected]>
- st: RE: utility to create fake dataset?
  - From: "Nick Cox" <[email protected]>

Prev by Date: st: RE: AW: RE: AW: RE: RE: RE: RE: RE: Possible bug
Next by Date: st: RE: utility to create fake dataset?
Previous by thread: st: re: referencing eresult matrices
Next by thread: st: RE: utility to create fake dataset?
Index(es):
- Date
- Thread