I presume that Martin is referring to the rank biserial correlation coefficient of Cureton (1956). This has an alternative name, namely Somers' D of the ordinal variable with respect to the dichotomous variable, or D(Y|X), where Y is the ordinal variable and X is the dichotomous variable. The identity of the 2 parameters (and of their corresponding sample statistics) is proved rigorously in Newson (2008).
Confidence intervals for Somers' D in all its forms can be computed in Stata using the -somersd- package, which you can download from SSC. In Stata, type
ssc desc somersd
to describe it, and
ssc inst somersd, replace
to install it. Note that you need to have Stata Version 10 or above to use the latest version. Earlier versions are downloadable from my website by typing, in Stata,
net from http://www.imperial.ac.uk/nhli/r.newson/
and selecting the version for your Stata.
Once you have installed -somersd-, it may be a good idea to exit Stata and then to start Stata again, because some versions of Stata have a problem with newly-installed packages that contain Mata libraries (as -somersd- does). However, to use -somersd-, get your data into the memory, and type
somersd x y, transf(z) tdist
where x is the dichotomous variable and y is the ordinal variable. You should then get an asymmetric confidence interval for Somers' D, aka the rank biserial correlation coefficient. The -somersd- package comes with extensive on-line help, and also a set of .pdf manuals with methods, formulas and examples.
I hope this helps.
Best wishes
Roger
References
Cureton EE. Rank-biserial correlation. Psychometrika 1956; 21: 287{290.
Newson R. Identity of Somers' D and the rank biserial correlation coeffi±cient. 21 February, 2008. Unrefereed document downloadable from
http://www.imperial.ac.uk/nhli/r.newson/papers.htm#miscellaneous_documents
as of today.
Roger B Newson BSc MSc DPhil
Lecturer in Medical Statistics
Respiratory Epidemiology and Public Health Group
National Heart and Lung Institute
Imperial College London
Royal Brompton Campus
Room 33, Emmanuel Kaye Building
1B Manresa Road
London SW3 6LR
UNITED KINGDOM
Tel: +44 (0)20 7352 8121 ext 3381
Fax: +44 (0)20 7351 8322
Email: [email protected]
Web page: http://www.imperial.ac.uk/nhli/r.newson/
Departmental Web page:
http://www1.imperial.ac.uk/medicine/about/divisions/nhli/respiration/popgenetics/reph/
Opinions expressed are those of the author, not of the institution.
-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Berger, Martin
Sent: 04 September 2009 10:06
To: '[email protected]'
Subject: st: rank biserial correlation
dear all,
I would like to calculate a rank biserial correlation coefficient between dichotomous variables (e.g. application of a specific method in the aanalysis; yes/no) and ordinal variables (satisfaction with results of the analysis; five-point likert scale).
to my knowledge, rank biserial should do the job and can be fairly easy calculated in excel.
however, since I'd like to analyse the correlation between many, many different variables I would prefer an 'automatic' solution in stata rather than doing it manually in excel.
any idea?
thanks - any help it is highly appreciated!
best
martin
-----------------------------------------------------------------------------
Dr. Martin Berger
Institut für Technologie- und Regionalpolitik
Joanneum Research Forschungsgesellschaft mbH
Sensengasse 1
A-1090 Wien
Tel.: +43 1 581 7520-2827
Fax.: +43 1 581 7520-2820
www.intereg.at
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/