From | Richard Williams <[email protected]> |
To | [email protected] |
Subject | Re: st: Simulate and corr2data |
Date | Tue, 20 Jan 2004 19:27:09 -0500 |
At 09:48 PM 1/20/2004 +0000, Allan Reese wrote:
> Followed example for simulate except to use corr2data rather than gen to create the dataset. Imagine my chagrin when each repetition gave the same answer! So tried corr2data from the command prompt and found it gives the same sample time after time, regardless of any setting of seed. Does anyone have a fix please? Is this an unintended bug? Looks counterintuitive and ought to happen only if "set seed" used to restart the sequence.

Given corr2data's intended purpose, I don't think this is really a bug. corr2data is meant to generate data where only the means, correlations, sds and N are required for the analysis -- if any other feature of the data is required, corr2data will not handle it. Hence, it doesn't matter what the data is, so long as it produces the desired correlations, etc. As the online docs say,
> If corr2data cannot produce independent samples, can I have comments please on whether it is equivalent to use bootstrap with a large N and relatively small sample size? I am interested in sample statistics that depend on the overall distribution of points, and the default use of bootstrap to select multiple samples of size N from N points must rely upon random points being used once, twice or more - or have I completely misunderstood?

Again, not clear on the goal -- but if corr2data with the seed option does not give you what you want (and I suspect it doesn't), you could create a large data set or data sets with corr2data and then draw random samples from them. That might be easier than figuring out the gen commands that would be required to create a data set drawn from a population with certain desired characteristics.
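
The suggestion of building one large data set with corr2data and then drawing random samples from it might be sketched roughly as follows. This is only an illustration under stated assumptions: the correlation matrix C, the variable names x and y, the population size of 10000, the sample size of 50, the seed, and the program name onesample are all made up for the example, and the simulate call uses the prefix syntax of more recent Stata releases rather than the syntax current when this thread was written.

```stata
* Sketch only: all names and numbers below are hypothetical.
clear all
set seed 12345                       // governs the random draws made by -sample-

* 1. Build a large artificial "population" whose correlation matches C.
matrix C = (1, .5 \ .5, 1)           // assumed target correlation matrix
corr2data x y, n(10000) corr(C)
tempfile pop
save `pop'

* 2. A small program that draws one random subsample and returns its correlation.
program define onesample, rclass
    args popfile
    use "`popfile'", clear
    sample 50, count                 // keep 50 randomly chosen observations
    correlate x y
    return scalar rho = r(rho)
end

* 3. Repeat the draw many times and look at how the statistic varies.
simulate rho = r(rho), reps(200) nodots: onesample "`pop'"
summarize rho
```

Because each repetition takes a different random subsample, the correlations collected by simulate should vary around the population value instead of repeating the same answer. Note that sample draws without replacement; bsample would be the analogous command for draws with replacement, which is closer to what bootstrap does.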