Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: svyset with DHS subsample
From
Sara Head <[email protected]>
To
[email protected]
Subject
st: svyset with DHS subsample
Date
Tue, 17 May 2011 13:47:45 -0400
Hi there,
I am setting up Demographic and Health Survey data (from Bangladesh
2007, women's survey) for analysis in Stata 11.1.
The survey is based on a two-stage stratified sample of households.
Additionally, households were preselected for domestic violence
questions (outcome variables in my analysis; if there was more than
one eligible female per household, a respondent was randomly selected
through a simple selection procedure based on the Kish Grid).
I've written the svyset commands as:
gen psu = v021
gen strata = v023
gen sampwt = (v005/1000000) //per DHS instruction//
gen dvsampwt = d005 //no DHS instruction to adjust//
svyset psu [pweight=dvsampwt], strata(strata)
where :
summ psu strata sampwt dvsampwt
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
psu | 10146 180.7909 104.1327 1 361
strata | 10146 10.66499 6.26573 1 22
sampwt | 10146 1.004513 .5912652 .13565 3.592687
dvsampwt | 4195 996578.1 764690.7 110423 1.08e+07
I am unsure if this code is correct.
1) Since this is a two-stage stratified sample with further selection
for violence questions, it seems the svyset command should be more
along the lines of : svyset su1 [pweight=pw], strata(strata) || _n,
fpc(fpc2) ?
2) I used dvsampwt instead of the sampwt variable; I can't find
information in the survey report / recode map about how the dv weight
was calculated. I'd like to assume it took the larger sampling design
into account.
Any thoughts greatly appreciated,
Sara
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/