Looking at your example data, your statement "only one variable (SNP)
is different in each person" appears to be inconsistent with the
presented data (check rows for id = 2).
I would help a lot to understand how your data is generated. Multiple
treatments/measurements for a number of subjects? Is this perhaps
panel data? ...
regards
sebastian
On 3/12/07, meena khan <[email protected]> wrote:
Hello,
I have a dataset where all variables are replicated twice for each subject,
only one variable (SNP) is different in each person:
ID SNP Event X1 X2
1 0 0 a a
1 1 0 a a
2 1 1 b b
2 1 1 b b
I would like to know if it is correct to cluster by ID in a normal logistic
regression model? Conditional logistic regression does not work as when I
group it by ID as there is no variation within the groups. The logistic
cluster option seems correct to me when I read the Stata help – it adjusts
the SEs for intragroup correlation which is what I want. I just want to
check if it is right. Thanks
Meena
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/