Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: R: Interpreting Shapiro-Wilk-Test (swilk)

From	"Carlo Lazzaro" <[email protected]>
To	<[email protected]>
Subject	st: R: Interpreting Shapiro-Wilk-Test (swilk)
Date	Tue, 10 Sep 2013 12:57:59 +0200

With 2830 observations Christian would be probably better off with a visual
inspection of regression residuals instead of relying on Shapiro-Wilk-Test
(or the like).

Kind regards,
Carlo
-----Messaggio originale-----
Da: [email protected]
[mailto:[email protected]] Per conto di Christian
Schroetel
Inviato: martedì 10 settembre 2013 12:18
A: [email protected]
Oggetto: st: Interpreting Shapiro-Wilk-Test (swilk)

Dear statistics and/or Stata experts,

actually, I'm used to find out or clear up things myself using FAQs, helps
or such. But this time, I'm quite confused so I decided to use the chance to
get help via Statalist. It's my first time, though.

For my master thesis, I'm performing several regressions on panel data with
up to 13 independent variables (xtreg). To check for normality of the
residuals, I did the following:

- xtreg sgrowth l.sgrowth l.slnsales slnage scfratio  srdintensity sleverage
spersonalpremium sintangibles sinternationalsales sroa stobinsq sclr
scurrentratio, fe -
- predict r, ue -
- kdensity r, normal -
- iqr r -
- swilk r -

The kdensity graphics for the combined resiudal indicate quite some
normality, iqr doesn't show any severe outliers and only less than 0.5% mild
outliers indicating a quite symmetric distribution of the residuals as well.
Also, pnorm and qnorm deliver decent graphics indicating normality. But now
I get the following result with swilk:


    Variable |    Obs       W           V         z       Prob>z
-------------+--------------------------------------------------
           r |   2830    0.99739      4.245     3.723    0.00010

As far as I know, a high value of W should indicate normality, so, again,
that would say I've got normally distributed residuals.
Nevertheless, the p-value indicates rejection of the null hypothesis of
normality.
So, now I wonder whether I'm doing something wrong or whether I should just
not pay too much attention to the p-value. It's just that I'd like to say in
my thesis that the Shapiro-Wilk test indicates normality, which I probably
couldn't the way i is now. Respectively, what could be the reason for the
low p-value combined with a high value for W?

I'd appreciate any thoughts, comments or help on that issue and thank you in
advance for your efforts.

Best regards

Christian

Btw: I tried all the same with - predict r, e - (as I ddin't exactly know
which way to test for normality of the residuals) and - sfrancia r -
delivering similar results.
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: Interpreting Shapiro-Wilk-Test (swilk)
  - From: Christian Schroetel <[email protected]>

Prev by Date: RE: st: graph bar with 2 y-axes
Next by Date: Re: st: Interpreting Shapiro-Wilk-Test (swilk)
Previous by thread: st: Interpreting Shapiro-Wilk-Test (swilk)
Next by thread: Re: st: Interpreting Shapiro-Wilk-Test (swilk)
Index(es):
- Date
- Thread