Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: st: Is it valid to use the individual ratios (i.e. Xi/Yi) in the dependent or independent part of a regression model?
From
Cameron McIntosh <[email protected]>
To
STATA LIST <[email protected]>
Subject
RE: st: Is it valid to use the individual ratios (i.e. Xi/Yi) in the dependent or independent part of a regression model?
Date
Mon, 28 May 2012 20:37:29 -0400
Jinn-Yuh,
I would suggest you have a look at the following as well:
Bradshaw, Y., & Radbill, L. (1987). Method and Substance in the Use of Ratio Variables. American Sociological Review,52(1), 132-135.
Firebaugh, G. (1988). The Ratio Variables Hoax in Political Science. American Journal of Political Science, 32(2), 523-535.
Liu, Y., & Schutz, R.W. (2003). Statistical validity of using ratio variables in human kinetics research. Research Quarterly in Exercise and Sport, 74(3), 226-235.
Liermann, M., Steel, A., Rosing, M., & Guttorp, P. (2004). Random denominators and the analysis of ratio data. Environmental and Ecological Statistics, 11(1), 55-71.
Cam
> From: [email protected]
> Date: Mon, 28 May 2012 22:47:24 +0800
> Subject: Re: st: Is it valid to use the individual ratios (i.e. Xi/Yi) in the dependent or independent part of a regression model?
> To: [email protected]
>
> 1. Why is the ACR of a group of patients not a ratio distribution?
> 2. The coefficient of variation is always higher for the ratio (X/Y)
> than for either X or Y (http://www.ncbi.nlm.nih.gov/pubmed/17434158).
> 3. In the PREVEND (Prevention of Renal and Vascular End-stage Disease)
> study (http://www.ncbi.nlm.nih.gov/pubmed/22383750):
> 1) The hazard ratio (95% CI) for predicting CV events were 1.41 (1.25,
> 1.58) for spot urine ACR, 1.26 (1.1, 1.43) for spot urine urinary
> albumin concentration, and 1.16 (1.01, 1.32) for the reciprocal of
> spot urine creatinine, respectively. The 95% CI of the three HR
> overlapped (i.e. the three HR were similar) although the author
> claimed that ACR predicts better than either urinary albumin or
> urinary creatinine.
> 2) Body weight, 24 hour-urinary creatinine excretion, age and gender
> predict ACR independent of 24 hour-urinary albumin excretion. In other
> words, 24 hour-urinary albumin excretion is not the only determinant
> of ACR.
> 4. In "Ratio index variables or ANCOVA? Fisher's cats revisited
> (http://www.ncbi.nlm.nih.gov/pubmed/19337988), Tu YK cautioned about
> the use of ratios wherever the underlying biological relationships
> among epidemiological variables are unclear (urinary concentration is
> not the only determinant of urinary creatinine in this case), and
> hence the choice of statistical model is also unclear.
>
> Jinn-Yuh
>
>
> 2012/5/28 David Hoaglin <[email protected]>:
> > Dear Jinn-Yuh,
> >
> > Your analysis is not actually concerned with the ratio distribution as such.
> >
> > Your first message, at the start of this thread, assumed that X and Y
> > were normally distributed. Real data, however, are never normal, so
> > results for a ratio of normal variables are mainly of theoretical
> > interest.
> >
> > In particular, from Phil Clayton's comments, it seems that urinary
> > albumin and urinary creatinine are not close to having normal
> > distributions. You could investigate the distribution shape of
> > urinary albumin, urinary creatinine, and ACR empirically in your
> > patients, though they are probably not a random sample from any
> > well-defined population.
> >
> > In your analyses, if ACR is the dependent variable, a more important
> > consideration is the distribution of the variation of the part of ACR
> > that remains after the contributions of the predictor variables have
> > been removed. In other words, the residual variation (or the
> > so-called "error term"). You can investigate that also, by using the
> > residuals from the fitted regression model.
> >
> > If ACR is an explanatory variable (I avoid the term "independent
> > variable" because "independent variables" are seldom "independent" in
> > any reasonable sense), the underlying theoretical distribution is much
> > less important than the way ACR is distributed in your patients.
> >
> > David Hoaglin
> >
> > On Sun, May 27, 2012 at 9:23 AM, <[email protected]> wrote:
> >> Dear David:
> >> Thank you for your detailed explanations.
> >> It helps me a lot.
> >> P.S.: The motivation for my question was the fact that the ratio
> >> distribution is very complicated
> >> (http://en.wikipedia.org/wiki/Ratio_distribution) with no means and
> >> variances (http://www.mathpages.com/home/kmath042/kmath042.htm).
> > *
> > * For searches and help try:
> > * http://www.stata.com/help.cgi?search
> > * http://www.stata.com/support/statalist/faq
> > * http://www.ats.ucla.edu/stat/stata/
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/statalist/faq
> * http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/