Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: Resampling and compare full sample with subsamples
From
Maarten Buis <[email protected]>
To
[email protected]
Subject
Re: st: Resampling and compare full sample with subsamples
Date
Fri, 7 Mar 2014 11:31:12 +0100
That is in essence a simulation study to get an idea of what missing
observations does to your data. I will start with the bad news: the
problem is well studied so I doubt it will give you insights beyond
what is already known, so I would not expect to gain anything
"publishable" from such an excercise.
Having said that, creating simulations is still a fine strategy to get
a grip on such a problem, and can thus be a good supplement (but not a
replacement) to reading on this subject. There is a difference between
reading that purely random missingness will (on average) not impact
your estimates other than having less data and seeing it happen in
your own simulation. There are a number of tools available for running
and displaying simulations in stata:
http://www.maartenbuis.nl/presentations/chicago08.html
http://www.maartenbuis.nl/presentations/gsug13.pdf
http://www.stata-journal.com/article.html?article=st0200
Hope this helps,
Maarten
On Fri, Mar 7, 2014 at 11:10 AM, Johannes Thrul <[email protected]> wrote:
> Steve,
> Thank you for your suggestions - I will look into them. The logic behind the approach is that there are several waves in said cross sectional survey and the response rates among schools have been dropping. So I am going to use a wave where the response rate was still good, reduce the sample size randomly and based on certain school characteristics and examine the resulting prevalence rates. This should give me an idea of what loosing certain kinds of schools means for the reliability of prevalence figures in other survey waves.
> So what I have come up with is a loop that draws random samples of a certain size from the original sample, I then calculate a mean over these random samples and compare it to the value of the original sample using a one sample test of proportions.
> Does this sound like a reasonable approach to you?
> Thanks and kind regards, Johannes
>
> -----Ursprüngliche Nachricht-----
> Von: [email protected] [mailto:[email protected]] Im Auftrag von Steve Samuels
> Gesendet: Dienstag, 4. März 2014 21:38
> An: [email protected]
> Betreff: Re: st: Resampling and compare full sample with subsamples
>
> Johannes-
>
> I don't get the logic of your approach. Assuming that prevalence rates of responders and non-responders differ, i.e. show "response bias", a comparison of random samples of responders to all responders will provide *no* information on the degree of bias.
>
> There are accepted reweighting techniques for evaluating and reducing response bias. See the downloadable references below.
>
> If, in fact, you know descriptive statistics for the entire population of schools and for the responding schools, you can make the responding schools more closely resemble not just the sample, but the population.
> See Stas Kolenikov's -ipfraking- (-findit-) and John D'Souza's
> -calibrate- (SSC).
>
>
> References:
>
> Burns, Shelley, Xiaolei Wang, and Alexandra Henning. 2011. NCES Handbook of Survey Methods. NCES 2011-609. National Center for Education Statistics http://eric.ed.gov/?id=ED521154
>
> Carlson, BL, and Williams, S. 2001. A comparison of two methods to adjust weights for non-response: propensity modeling and weighting class adjustments. Proceedings of the Annual Meeting of the American Statistical Association http://www.amstat.org/sections/SRMS/proceedings/y2001/Proceed/00111.pdf
>
> Kreuter, Frauke, Kristen Olson, James Wagner, Ting Yan, Trena M Ezzati-Rice, Carolina Casas-Cordero, Michael Lemay, Andy Peytchev, Robert M Groves, and Trivellore E Raghunathan. 2010. Using proxy measures and other correlates of survey outcomes to adjust for
> non,Äêresponse: examples from multiple surveys. Journal of the Royal Statistical Society: Series A (Statistics in Society) 173, no. 2:
> 389-407.
> http://digitalcommons.unl.edu/cgi/viewcontent.cgi?article=1139&context=
> sociologyfacpub
>
>
> Little, RJ, and S Vartivarian. 2003. On weighting the rates in non-response weights. Stat Med 22, no. 9: 1589-1599, available at http://deepblue.lib.umich.edu/bitstream/2027.42/34860/1/1513_ftp.pdf
>
>
> Wun, L-M, and Ezzati-Rice, T. 2007. Assessment of the impact of health variables on nonresponse adjustment in the medical expenditure panel survey (MEPS). Proc. Surv. Res. Meth. Sect. Am. Statist. Ass 2857-2864.
> http://www.amstat.org/sections/SRMS/Proceedings/y2007/Files/JSM2007-000336.pdf
>
> Steve
>
>
> Steve Samuels
> Consultant in Statistics
> 18 Cantine's Island
> Saugerties NY 12477 USA
> 845-246-0774
>
>
>> On Mar 3, 2014, at 10:58 AM, Johannes Thrul <[email protected]> wrote:
>>
>> Dear list,
>> I am working on a large survey dataset (12,000 individuals clustered in 600 schools) and want to examine, how non-response/non-participation of schools affects prevalence estimates (e.g., alcohol use). My plan is to reduce the sample of schools randomly and systematically (e.g., only exclude large schools) and compare the resulting estimates from the subsamples with the estimates from the full sample. I thought of an approach like this: Reduce the sample size in 10% increments, draw a number of subsamples at every step and compare the estimates. However, I have 2 questions about how to best approach this in Stata:
>> 1. Drawing subsamples: Should I use a jackknife, bootstrap, or even something entirely different for drawing the subsamples?
>> 2. Testing: How should I go about testing the results from the subsamples against the full sample?
>> Any help is greatly appreciated!
>> Thanks and kind regards, Johannes
>>
>> --
>> Dr. Johannes Thrul, Dipl.-Psych.
>>
>> Wissenschaftlicher Mitarbeiter / Researcher Präventionsforschung /
>> Prevention Research
>>
>> IFT Institut für Therapieforschung / Parzivalstr. 25, D-80804 München
>> / www.ift.de phone +49 (0) 89 360804 86 / fax +49 (0) 89 360804 69 /
>> e-mail [email protected]
>>
>> IFT Institut für Therapieforschung gem. Gesellschaft mbH /
>> Registergericht München HRB 46395 Geschäftsführung: Prof. Dr. Gerhard
>> Bühringer
>>
>>
> *
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
--
---------------------------------
Maarten L. Buis
WZB
Reichpietschufer 50
10785 Berlin
Germany
http://www.maartenbuis.nl
---------------------------------
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/