Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: Estimating the (possibly negative) intracluster correlation
From
Robert A Yaffee <[email protected]>
To
[email protected]
Subject
Re: st: Estimating the (possibly negative) intracluster correlation
Date
Mon, 06 Sep 2010 22:09:02 -0400
analysis. As such, it has a range of (0,1). If one computes
this as an analog of an R^2, then a negative ICC makes little
sense.
If, however, you compute the ICC numerator as the between
groups variance - the within groups variance, then
a negative ICC can emerge when the within groups variance
exceeds the between groups variance.
Also, if the ICC is computed with an interaction term, the inter-
action may induce such a negative effect, if it is has a negative
coefficient.
Regards,
Bob
Robert A. Yaffee, Ph.D.
Research Professor
Silver School of Social Work
New York University
Biosketch: http://homepages.nyu.edu/~ray1/Biosketch2009.pdf
CV: http://homepages.nyu.edu/~ray1/vita.pdf
----- Original Message -----
From: Bert Jung <[email protected]>
Date: Monday, September 6, 2010 4:54 pm
Subject: Re: st: Estimating the (possibly negative) intracluster correlation
To: [email protected]
> Bob, Steve, Scott and Joseph: many thanks, your comments are very
> helpful indeed.
>
> I have a limited set of covariates and may be unable to sufficiently
> improve the model, so now I am wondering how to address this issue
> analytically. The standard recommendation is to simply report the
> more conservative (larger) unclustered standard errors. For binary
> outcomes (my case) Ten Have and co-authors seem to suggest a modified
> mixed model to directly account for the correlation. Unfortunately I
> don't have access to this paper and the Hanley piece indicates
> reservations in particular circumstances. I would be grateful for any
> pointers to related work and how to implement these procedures in
> Stata.
>
> Thanks again!
> Bert
>
> PS I found the negative ICC counter-intuitive at first. One helpful
> example is competition for resources among multiple offspring from the
> same mother (e.g. animal litter). In this context "nature, faced with
> limited space or nutrition, in an attempt to maximize survival of
> fewer offspring, allows considerable inequality among the individual
> `competitors'" (Hanley et al page 720).
>
>
> Hanley et al "GEE Analysis of negatively correlated binary responses:
> a caution" Statistics in Medicine 2000; 19: 715-722,
> http://www.ncbi.nlm.nih.gov/pubmed/10700741
>
> Ten Have et al "Accommodating negative intracluster correlation with a
> mixed effects logistic model for bivariate binary data" J Biopharm
> Stat. 1998; 8:131-49, http://www.ncbi.nlm.nih.gov/pubmed/9547432
>
>
>
>
> On Mon, Sep 6, 2010 at 1:17 PM, Joseph Coveney
> <[email protected]> wrote:
> > Scott Baldwin wrote:
> >
> > One option is to use the residuals option with an exchangeable
> > correlation structure in xtmixed. This allows you to look at the
> > correlation among observations within a cluster rather than the
> > variance among the cluster means (as would be the case if you fit a
> > random intercept model). [remainder omitted]
> >
> > --------------------------------------------------------------------------------
> >
> > That is neat. I'll really have to start getting familiar with what
> -xtmixed-
> > and its new -residuals()- option can do. The ovary dataset doesn't
> have a
> > negative ICC, but the artificial dataset below does have a negative
> ICC to
> > illustrate Scott's -xtmixed- approach.
> >
> > I'd known that you can do it with -xtgee- (so long as it's a linear
> model),
> > and with the old method-of-moments technique with -anova- (for a balanced
> > dataset).
> >
> > For some reason, I'd always thought that an ML (REML) method
> couldn't deal with
> > negative ICCs, and that you had to resort to ANOVA and method-of-moments,
> > because they admit negative variance components estimates, or to GEE.
> >
> > Joseph Coveney
> >
> > version 11.1
> > clear *
> > set more off
> > set seed `=date("2010-09-07", "YMD")'
> > matrix input C = (1 -0.7 \ -0.7 1)
> > drawnorm mu0 mu1, corr(C) n(200) clear
> > generate int pid = _n
> > quietly reshape long mu, i(pid) j(tim)
> >
> > xtmixed mu i.tim || pid:, nocons residuals(exchangeable) ///
> > nolrtest nolog
> >
> > xtgee mu i.tim, i(pid)
> > estat wcor
> >
> > anova mu pid tim
> > scalar define sigma2_e = e(rss) / e(df_r)
> > scalar define sigma2_u = ///
> > (e(ss_1) / e(df_1) - sigma2_e) / (e(df_2) + 1)
> > scalar define ICC = sigma2_u / (sigma2_u + sigma2_e)
> > display in smcl as text ICC
> >
> > exit
> >
> >
> > *
> > * For searches and help try:
> > * http://www.stata.com/help.cgi?search
> > * http://www.stata.com/support/statalist/faq
> > * http://www.ats.ucla.edu/stat/stata/
> >
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/statalist/faq
> * http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/