|
[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]
Re: st: GEE or svy:logit
From your description, the household survey is not a formal sample
of a target population. (The original case and control groups might
have been.) So, off-hand, you do not need any -svy- commands at all,
ordinary -logit- with a cluster option or -glm- (which does GEE),
also with a cluster option, will be sufficient and will give
equivalent inference.
A more important question is: will the presence of a large percentage
of case-households distort your analysis? For example, if one of
your goals is to predict an outcome and that outcome is related to
the original case-variable, then it will be over-represented in the
sample. To get around this, you might want to down-weight the case-
households. If case households constitute, say, 1% of the population
(uncommon disease), then consider giving them a weight of '1' and
control households a weight of '99'. You can do this in either -glm-
or -logit-. Again no -svy- version is needed.
However this is extreme,, and it will not work if your controls were
pair-matched to cases. If your outcome is unrelated to the outcome
of the case-control study, then go ahead and use the unweighted
data. A convenience sample is a convenience sample.
Steven
For this study, the individuals are a convenience sample from
households that participated in a case-control "parent" study in which
the case households had contaminated water and control households did
not.
This cross-sectional study includes all of the adults within the case
and control households who consented to participate by completing the
individual questionnaire.
Thanks,
Brenda
Brenda, Please give some detail about the survey design: 1. What was
the target population; 2. How did you select the sample-please give
all steps.
Steven
On Feb 2, 2008, at 12:49 PM, [email protected] wrote:
Greetings from a new Statalister.
I am in need of advice, including references, if you have any.
We have done a household-level cross-sectional survey including
all consenting adults within the household (1 to 4 per cluster).
There are both household-level and personal-level variables.
The dependent variable is nominal at the personal level (ill/not
ill).
The focal independent variable is nomial and at the household
level (water contaminated/not).
Other variables of interest (explanatory, in relation to focal
independent variable) are at the personal and household level.
My question is this...do I need to use GEE to adequately account
for clustering within households OR would the svy:logit in Stata
do this? (The ICC for illness & household is 0.08, SE 0.08)
Regards,
Brenda Coleman, PhD candidate
----- End forwarded message -----
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/