[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

RE: st: programming question: obtaining statistics from clustered data

From	"Nick Winter" <[email protected]>
To	<[email protected]>
Subject	RE: st: programming question: obtaining statistics from clustered data
Date	Wed, 26 Jun 2002 09:40:01 -0400

-----Original Message-----

For the standard deviation the answer seems to be more difficult. At the

moment I only can think about a solution with a loop over the
observations 
within each cluster. There must be a better solution and I am sure that
I 
have overlooked somethink obvious. But anyway, you may use the following
as a 
starting point: 

>>>>>>>>>
You can use the alternate expression for variance to do the standard
deviation relatively straightforwardly:


gen X2=X^2
sort cluster
by cluster: gen sumX=sum(X)
by cluster: replace sumX=sumX[_N] - X
by cluster: gen sumX2=sum(X2)
by cluster: replace sumX2=sumX2[_N] - X2
by cluster: gen sd1 = sqrt(((_N-1)*sumX2 - sumX^2)/((_N-1)*(_N-2)))

--Nick Winter

<<winmail.dat>>

Prev by Date: st: Programming question
Next by Date: st: RE: Programming question
Previous by thread: st: RE: programming question: obtaining statistics from clustered data
Next by thread: st: Programming question
Index(es):
- Date
- Thread