Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: RE: creating variables using 'by' for subsets of records
From
"Martin Weiss" <[email protected]>
To
<[email protected]>
Subject
st: RE: creating variables using 'by' for subsets of records
Date
Tue, 23 Feb 2010 21:19:49 +0100
<>
In the absence of example data, it is hard to give you advice. Look at this
calculation of regional unemployment rates:
*******
clear*
//10 regions
set obs 10
gen byte region=_n
//50 indiv per region
expand 50
bys region: gen byte id=_n
gen byte unemployed=runiform()>.9
bys region: gen number=_N
by region: egen numofunempl=total(unemployed)
gen unemprate=numofunempl/number
*******
HTH
Martin
-----Original Message-----
From: [email protected]
[mailto:[email protected]] On Behalf Of John Westbury
Sent: Dienstag, 23. Februar 2010 20:55
To: [email protected]
Subject: st: creating variables using 'by' for subsets of records
Hello,
I have records for individuals by geographic region and wish to aggregate
the records for individuals to records for geographic regions. I believe I
should create variables for those regions using 'by'. Ex: by Region gen x =
argument for variable. I am having difficulty with arguments for variable
x. For example I wish to create a region variable that expresses a ratio of
count of indicator values for individuals in a region to a count of
individuals in the region and am unsure how to code this.
thanks
John
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/