Re: st: RE: RE: two sample test under generalized Behrens-Fisher conditions
From: Steven Samuels <[email protected]>
To: [email protected]
Subject: Re: st: RE: RE: two sample test under generalized Behrens-Fisher conditions
Date: Tue, 14 Dec 2010 12:09:34 -0500
I don't think that highly of t-tests. To quote Hampel et al. (Robust Statistics: The Approach Based on Influence Functions, Wiley, NY, 1986), p. 405:
"Many statisticians are proud of the so-called robustness of the t-test and more generally of the test in fixed-effects models in the analysis of variance. But this robustness is only a rather moderate and limited robustness of level ("robustness of validity"); the power ("robustness of efficiency") and hence also the length of confidence intervals and the size of standard errors is very nonrobust. Consequently, a significant result can be believed, but non-significance may just be due to the inefficiency of least squares."
Perhaps the easiest alternative to teach would be one based on trimmed means, which are not only easy to understand (as opposed to, say, M-estimators and robust regression) but, unlike the median, have an easy standard error formula.
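Since the thread doesn't spell that formula out, here is a minimal Stata sketch of one common version, based on the Winsorized variance; the 20% trim and the auto data are illustrative choices, not anything specified in the post:

* sketch, not from the thread: 20% trimmed mean of mpg and its standard
* error from the Winsorized SD, se = sd_w / ((1 - 2*g) * sqrt(n))
sysuse auto, clear
local g = 0.2
sort mpg
quietly count if !missing(mpg)
local n = r(N)
local k = floor(`g' * `n')                        // cases trimmed from each tail
quietly summarize mpg if inrange(_n, `k' + 1, `n' - `k')
local tmean = r(mean)                             // the trimmed mean
tempvar w
generate double `w' = mpg
quietly replace `w' = mpg[`k' + 1] if _n <= `k'          // Winsorize the low tail
quietly replace `w' = mpg[`n' - `k'] if _n > `n' - `k'   // Winsorize the high tail
quietly summarize `w'
local se = r(sd) / ((1 - 2*`g') * sqrt(`n'))
display "20% trimmed mean = " %5.2f `tmean' "   SE = " %5.3f `se'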
Steve
[email protected]
On Dec 14, 2010, at 10:16 AM, Nick Cox wrote:
I see the problem. I couldn't (wouldn't) fit -glm- in an introductory  
course either.
In similar circumstances I usually assert that t tests work well even  
if the assumptions are not well satisfied. This is an idea that goes  
back at least to G.E.P. Box in Biometrika 1953:
Box, G.E.P. 1953. Non-normality and tests on variances. Biometrika 40:  
318-35.
Nick
[email protected]
Airey, David C
I was looking for "stark cookbooky" solutions for a (too) short intro  
course that will not address GLM. But transformations they will be  
told about, and the last time I taught this course, your help file  
about transformations was required reading. Thanks for that citation.  
Looks like a good book.
Nick Cox
In this kind of territory, I would always
1. Check out what is said in Rupert G. Miller, Beyond ANOVA. See the CRC Press reissue at
<http://www.crcpress.com/utility_search/search_results.jsf?conversationId=250169>
Your library may hold a copy of the Wiley original.
2. Be wary of the stark cookbooky alternative: data if normal, ranks otherwise. What happened to the idea of transformations or link functions? How do you decide when the data are approximately normal anyway?
Here is an example of a different approach. In the auto data, -mpg- given -foreign- is neither normal nor homoscedastic. But these are secondary issues. Consider this set of results. In each, -family(gaussian)- (the default) is implied.
sysuse auto, clear
* z statistic for -foreign- from a Gaussian GLM of -mpg- under each link
foreach v in "power 1" "power 0.5" "log" "power -0.5" "power -1" {
	qui glm mpg foreign, link(`v')
	mat b = e(b)
	mat V = e(V)
	di "`v'" "{col 20}" %3.2f b[1,1] / sqrt(V[1,1])
}
power 1            3.63
power 0.5          3.70
log                3.75
power -0.5         -3.78
power -1           -3.80
The change of sign of what -glm- calls the z statistic is an  
expected side-effect of changing to inverse transformations. More  
importantly, z changes only very slowly and the collective set of  
results points to the idea that 1/mpg is a more appropriate scale  
than mpg on which to test for differences. This of course matches  
basic science.
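As a quick cross-check, not part of the original post, one could also test the difference on the reciprocal scale directly, for example with an unequal-variance t test on gallons per mile:

* not from the post: compare groups on the 1/mpg (gallons-per-mile) scale
sysuse auto, clear
generate gpm = 1/mpg
ttest gpm, by(foreign) unequal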
Generalized linear models are nearly 40 years old as a family. When  
are they going to receive the recognition they deserve?
Airey, David C
I was reading a little about what to do when you have both unequal variance and non-normality. Neither the equal-variance t-test nor the Mann-Whitney U test is best when you want to interpret the difference in means or medians.
I had found the Stata command -fprank-, but it turns out this robust rank test does not escape a symmetry assumption if the location difference is to be interpreted.
I found that some recommend using Welch's t-test on the ranked data  
(Zimmerman and Zumbo (1993) Rank transformations and the power of  
the Student's t test and the Welch t' test for non-normal  
populations with unequal variances. Canadian Journal of  
Experimental Psychology 47:3, 523-539).
This appears to be an easy and satisfying solution to teach: always use the unequal-variances t-test, and use ranks if the data are also not normal.
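A minimal sketch of that recipe in Stata, with the auto data standing in as an illustrative two-sample example (my choice of data and variable names, not something from the post):

* Welch-on-ranks sketch: rank the pooled outcome, then run the
* unequal-variance (Welch) t test on the ranks
sysuse auto, clear
egen rank_mpg = rank(mpg)
ttest rank_mpg, by(foreign) unequal welch

Without -welch-, -ttest- with -unequal- uses Satterthwaite's degrees-of-freedom approximation rather than Welch's.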
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/
*