Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

AW: st: Resampling and compare full sample with subsamples

From	Johannes Thrul <Thrul@ift.de>
To	"statalist@hsphsun2.harvard.edu" <statalist@hsphsun2.harvard.edu>
Subject	AW: st: Resampling and compare full sample with subsamples
Date	Fri, 7 Mar 2014 10:10:32 +0000

Steve,
Thank you for your suggestions - I will look into them. The logic behind the approach is that there are several waves in said cross sectional survey and the response rates among schools have been dropping. So I am going to use a wave where the response rate was still good, reduce the sample size randomly and based on certain school characteristics and examine the resulting prevalence rates. This should give me an idea of what loosing certain kinds of schools means for the reliability of prevalence figures in other survey waves.
So what I have come up with is a loop that draws random samples of a certain size from the original sample, I then calculate a mean over these random samples and compare it to the value of the original sample using a one sample test of proportions.
Does this sound like a reasonable approach to you?
Thanks and kind regards, Johannes

-----Ursprüngliche Nachricht-----
Von: owner-statalist@hsphsun2.harvard.edu [mailto:owner-statalist@hsphsun2.harvard.edu] Im Auftrag von Steve Samuels
Gesendet: Dienstag, 4. März 2014 21:38
An: statalist@hsphsun2.harvard.edu
Betreff: Re: st: Resampling and compare full sample with subsamples

Johannes-

I don't get the logic of your approach. Assuming that prevalence rates of responders and non-responders differ, i.e. show "response bias", a comparison of random samples of responders to all responders will provide *no* information on the degree of bias.

There are accepted reweighting techniques for evaluating and reducing response bias. See the  downloadable references below.

If, in fact, you know descriptive statistics for the entire population of schools and for the responding schools, you can make the responding schools more closely resemble not just the sample, but the population.
See Stas Kolenikov's -ipfraking- (-findit-) and John D'Souza's
-calibrate- (SSC).

References:

Burns, Shelley, Xiaolei Wang, and Alexandra Henning. 2011. NCES Handbook of Survey Methods. NCES 2011-609. National Center for Education Statistics http://eric.ed.gov/?id=ED521154

Carlson, BL, and Williams, S. 2001. A comparison of two methods to adjust weights for non-response: propensity modeling and weighting class adjustments. Proceedings of the Annual Meeting of the American Statistical Association http://www.amstat.org/sections/SRMS/proceedings/y2001/Proceed/00111.pdf

Kreuter, Frauke, Kristen Olson, James Wagner, Ting Yan, Trena M Ezzati-Rice, Carolina Casas-Cordero, Michael Lemay, Andy Peytchev, Robert M Groves, and Trivellore E Raghunathan. 2010. Using proxy measures and other correlates of survey outcomes to adjust for
non,Äêresponse: examples from multiple surveys. Journal of the Royal Statistical Society: Series A (Statistics in Society) 173, no. 2:
389-407.
http://digitalcommons.unl.edu/cgi/viewcontent.cgi?article=1139&context=
sociologyfacpub

Little, RJ, and S Vartivarian. 2003. On weighting the rates in non-response weights. Stat Med 22, no. 9: 1589-1599, available at http://deepblue.lib.umich.edu/bitstream/2027.42/34860/1/1513_ftp.pdf

Wun, L-M, and Ezzati-Rice, T. 2007. Assessment of the impact of health variables on nonresponse adjustment in the medical expenditure panel survey (MEPS). Proc. Surv. Res. Meth. Sect. Am. Statist. Ass 2857-2864.
http://www.amstat.org/sections/SRMS/Proceedings/y2007/Files/JSM2007-000336.pdf

Steve

Steve Samuels
Consultant in Statistics
18 Cantine's Island
Saugerties NY 12477 USA
845-246-0774

> On Mar 3, 2014, at 10:58 AM, Johannes Thrul <Thrul@ift.de> wrote:
> 
> Dear list,
> I am working on a large survey dataset (12,000 individuals clustered in 600 schools) and want to examine, how non-response/non-participation of schools affects prevalence estimates (e.g., alcohol use). My plan is to reduce the sample of schools randomly and systematically (e.g., only exclude large schools) and compare the resulting estimates from the subsamples with the estimates from the full sample. I thought of an approach like this: Reduce the sample size in 10% increments, draw a number of subsamples at every step and compare the estimates. However, I have 2 questions about how to best approach this in Stata: 
> 1. Drawing subsamples: Should I use a jackknife, bootstrap, or even something entirely different for drawing the subsamples?
> 2. Testing: How should I go about testing the results from the subsamples against the full sample?
> Any help is greatly appreciated!
> Thanks and kind regards, Johannes
> 
> --
> Dr. Johannes Thrul, Dipl.-Psych.
> 
> Wissenschaftlicher Mitarbeiter / Researcher Präventionsforschung / 
> Prevention Research
> 
> IFT Institut für Therapieforschung / Parzivalstr. 25, D-80804 München 
> / www.ift.de phone +49 (0) 89 360804 86 / fax +49 (0) 89 360804 69 / 
> e-mail thrul@ift.de
> 
> IFT Institut für Therapieforschung gem. Gesellschaft mbH / 
> Registergericht München HRB 46395 Geschäftsführung: Prof. Dr. Gerhard 
> Bühringer
> 
> 
*
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: Resampling and compare full sample with subsamples
  - From: Steve Samuels <sjsamuels@gmail.com>
- Re: st: Resampling and compare full sample with subsamples
  - From: Maarten Buis <maartenlbuis@gmail.com>

References:
- st: Resampling and compare full sample with subsamples
  - From: Johannes Thrul <Thrul@ift.de>
- Re: st: Resampling and compare full sample with subsamples
  - From: Steve Samuels <sjsamuels@gmail.com>

Prev by Date: Re: st: Plot predictions of multiple variables in one plot (logit model)
Next by Date: Re: st: graph pie with variable label as title
Previous by thread: Re: st: Resampling and compare full sample with subsamples
Next by thread: Re: st: Resampling and compare full sample with subsamples
Index(es):
- Date
- Thread