Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: RE: creating variables using 'by' for subsets of records

From	"Martin Weiss" <[email protected]>
To	<[email protected]>
Subject	st: RE: creating variables using 'by' for subsets of records
Date	Tue, 23 Feb 2010 21:29:58 +0100

<>

In the absence of example data, it is hard to give you advice. Look at this
calculation of regional unemployment rates:


*******
clear*

//10 regions
set obs 10
gen byte region=_n

//50 indiv per region
expand 50
bys region: gen byte id=_n
gen byte unemployed=runiform()>.9

bys region: gen number=_N
by region: egen numofunempl=total(unemployed)

gen unemprate=numofunempl/number
*******

HTH
Martin


-----Original Message-----
From: [email protected]
[mailto:[email protected]] On Behalf Of John Westbury
Sent: Dienstag, 23. Februar 2010 20:55
To: [email protected]
Subject: st: creating variables using 'by' for subsets of records

Hello,

I have records for individuals by geographic region and wish to aggregate
the records for individuals to records for geographic regions.  I believe I
should create variables for those regions using 'by'.  Ex: by Region gen x =
argument for variable.  I am having difficulty with arguments for variable
x.  For example I wish to create a region variable that expresses a ratio of
count of indicator values for individuals in a region to a count of
individuals in the region and am unsure how to code this.

thanks

John

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: creating variables using 'by' for subsets of records
  - From: John Westbury <[email protected]>

Prev by Date: st: Listcoef option for logit
Next by Date: st: regression r(103): too many variables
Previous by thread: st: RE: creating variables using 'by' for subsets of records
Next by thread: Re: st: RE: creating variables using 'by' for subsets of records
Index(es):
- Date
- Thread