Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: AW: RE: AW: RE: AW: AW: RE: -ciplot- -eclplot- combine with box-options?
From
"Kaulisch, Marc" <[email protected]>
To
<[email protected]>
Subject
st: AW: RE: AW: RE: AW: AW: RE: -ciplot- -eclplot- combine with box-options?
Date
Mon, 10 May 2010 14:04:10 +0200
Nick, you are right. I was not not clear enough ( I hope the e-mail-programs show the tabs clear enough):
I would like to see the confidence-intervals (boxes) for each of the categories and combination of categories in vertical order. I think this more readable as if the plots for each first category (Cat1) are separately plotted. Especially, if I would like to spot not only differences between one category (Cat2) but also to see whether items in category 1 are different. This is less a problem with "sex" as a by-variable but if I have eight items in a by-var it is very hard to tell differnces...
I hope the examples in my earlier post show this...
My example 1 shows the categories like this:
cat2_1 box
Cat1_1
cat2_2 box
cat2_1 box
Cat1_2
cat2_2 box
...
My example 2 shows the categories like:
CompCat_1_1 box
CompCat_1_2 box
CompCat_2_1 box
CompCat_2_2 box
With By() I get
Cat1_1 Cat1_2
Cat2_1 box box
Cat2_2 box box
Marc
-----Ursprüngliche Nachricht-----
Von: [email protected] [mailto:[email protected]] Im Auftrag von Nick Cox
Gesendet: Montag, 10. Mai 2010 12:31
An: [email protected]
Betreff: st: RE: AW: RE: AW: AW: RE: -ciplot- -eclplot- combine with box-options?
My comment was a lament, not a reproach. I'll look at the help for
-stripplot- to make it clearer that -by()- is an easy alternative to -over()-.
In return, I am not clear what you mean by better interpretable graphs than those generated with -by()-.
Nick
[email protected]
Kaulisch, Marc
Dear Nick,
Thanks for the follow up clarifing opportunities with -stripplot-. I am not as much into Stata that I can adhoc find solutions for two nice-to-haves I would like. 1) Avoid a composite variable for including two categorical variables (comparing sex by agegroup). 2) Better interpretable graphs than those generated with by().
I might miss features of -stripplot- or Stata again.
. sysuse bpwide, clear
1) Example with boxplot
. graph hbox bp_before bp_after , over(sex, label(labs(vsmall))) over(agegrp, label(labs(vsmall)) sort(1) descending)
2) Example with Confidence-intervals:
. tempfile tf1 tf2
. statsby mean1=r(mean) ub1=r(ub) lb1=r(lb) N1=r(N), by(agegrp)
saving(`tf1'): ci bp_before
. statsby mean2=r(mean) ub2=r(ub) lb2=r(lb) N2=r(N), by(agegrp)
saving(`tf2'): ci bp_after
. dsconcat `tf1' `tf2'
. generate axis = 6 if agegrp == 3 & sex == 0 . replace axis = 5 if agegrp == 3 & sex == 1 . replace axis = 4 if agegrp == 2 & sex == 0 . replace axis = 3 if agegrp == 2 & sex == 1 . replace axis = 2 if agegrp == 1 & sex == 0 . replace axis = 1 if agegrp == 1 & sex == 1 . la de laxis 6 "60+ Male" 5 "Female" 4 "46-59 Male" 3 "Female" 2 "30-45 Male" 1 "Female"
. la val axis laxis
. twoway scatter axis mean1 || rcap ub1 lb1 axis, hori || scatter axis
mean2 || rcap ub2 lb2 axis, hori , ///
ylabel(1(1)6, labs(vsmall) nogrid val
angle(hori)) ///
ytitle("")
///
legend(label(1 "Mean bp_before") label(2 "CI
bp_before") ///
label(3 "Mean bp_after") label(4 "CI bp_after")
size(vsmall) rows(2) span)
3) Stripplot (!?) with Confidence intervals:
* Generate axis as in solution (2)
. stripplot bp_before bp_after , by(axis, compact) vertical bar
boffset(0) ms(none) xsc(r(0.8 2.2))
Nick Cox
It's a pity that Marc missed a (simpler) solution with -stripplot- that requires no dataset manipulation at all.
sysuse bpwide
stripplot bp_before bp_after , by(sex, compact) stack vertical bar
height(0.2)
gets you there directly.
If you really want only the c.i. bars, so that the data points are suppressed, that's perverse, but you can do it too:
stripplot bp_before bp_after , by(sex, compact) vertical bar boffset(0)
ms(none) xsc(r(0.8 2.2))
In short, -stripplot- for two or more variables and two or more groups requires -by()- not -over()-. As program writer, I saw no point trying to re-create a complicated set-up when -by()- does it for you for free.
Nick
[email protected]
Kaulisch, Marc
Okay, it seems that -statsby- does the trick together with -dsconcat- (ssc).
Preliminary without labels etc:
sysuse bpwide
tempfile tf1 tf2
statsby mean1=r(mean) ub1=r(ub) lb1=r(lb) N1=r(N), by(agegrp)
saving(`tf1') subsets total: ci bp_before statsby mean2=r(mean)
ub2=r(ub) lb2=r(lb) N2=r(N), by(agegrp)
saving(`tf2') subsets total: ci bp_after dsconcat `tf1' `tf2'
twoway rcap ub1 lb1 agegrp || scatter mean1 agegrp || rcap ub2 lb2 agegrp || scatter mean2 agegrp
Kaulisch, Marc
Nick,
Thanks a lot for this extensive answer. I learn a lot. I haven't had time yet to go deeper in the opportunities to work with statsby (will do this later). But experimenting with -stripplot- I come very early to an end...
I would like to create a graph such as
. sysuse bpwide
. graph hbox bp_before bp_after, over(sex) over(agegrp), sort(1)
descending)
But I do not want a boxplot but a ci-bar as in stripplot...
-stripplot- does not allow two vars with over.
. ciplot bp_before bp_after, by(agegrp)
looks closer to my interest...
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/