The tag trick worked wonders... Thx..
Regards,
M
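
The message that suggested the tag trick is not quoted in this thread, so what follows is only a sketch of what it presumably looked like, run on the example data posted earlier in the thread. It assumes the trick is egen's tag() function, which flags one observation per group; the variable name onedistr is made up for illustration. Multiplying the flag by distr_pop means each district's population enters the regional total once, no matter how many cases the district has.

```stata
clear
input pid distr_pop district region yr
221 440674  3 1 1953
869 440674  3 1 1953
.   139172  8 . 1953
497 440674  3 1 1953
684 440674  3 1 1953
574 158681  6 1 1953
770 440674  3 1 1953
454 440674  3 1 1953
767 227937 18 4 1954
802 112982 13 2 1954
790 444041  3 1 1954
end

* flag exactly one observation per district within each region and year
* (hypothetical variable name; observations with a missing region are
* left untagged unless tag()'s missing option is specified)
egen byte onedistr = tag(region yr district)

* number of cases per region and year; count() skips missing pid
bysort region yr: egen number = count(pid)

* regional population: only the flagged row of each district contributes
bysort region yr: egen double totpop = total(onedistr * distr_pop)

list, noobs sepby(region yr)
```

For region 1 in 1953 this gives totpop = 440674 + 158681 = 599355 instead of 2802725, which counted district 3 six times.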
On 11. jan. 2010, at 20.56, Martin Weiss wrote:
>
> Nick seems to understand your intentions, I honestly do not. That could be
> entirely my problem. Anyway, could you provide the number that you want to
> come out of the calculation in your example?
>
>
> HTH
> Martin
>
> -----Original Message-----
> From: [email protected]
> [mailto:[email protected]] On Behalf Of moleps
> Sent: Monday, 11 January 2010 20:32
> To: [email protected]
> Subject: Re: AW: st: AW: generating count and sum variable over two
> different categorical variables
>
> Thx, Martin. However, I need the total number of people in each region, and
> each region comprises 3 to 10 districts. Using
>
>> by region yr: egen totpop=total(distr_pop)
>
> adds each district's population once per case in that district, so it is
> counted as many times as there are cases. What I want is to add up the
> distinct districts within each region and year to get the regional population.
>
>
>
> +--------------------------------------------------------------+
> | pid   distr_~p   district   region     yr   number    totpop |
> |--------------------------------------------------------------|
> | 221     440674          3        1   1953        7   2802725 |
> | 684     440674          3        1   1953        7   2802725 |
> | 574     158681          6        1   1953        7   2802725 |
> | 770     440674          3        1   1953        7   2802725 |
> | 869     440674          3        1   1953        7   2802725 |
> |--------------------------------------------------------------|
> | 454     440674          3        1   1953        7   2802725 |
> | 497     440674          3        1   1953        7   2802725 |
> | 790     444041          3        1   1954        1    444041 |
> | 802     112982         13        2   1954        1    112982 |
> | 767     227937         18        4   1954        1    227937 |
> |--------------------------------------------------------------|
> |   .     139172          8        .   1953        0    139172 |
> +--------------------------------------------------------------+
>
>
> //M
>
>
>
> On 11. jan. 2010, at 17.24, Martin Weiss wrote:
>
>>
>> *************
>> clear*
>>
>> input pid distr_pop district region yr
>> 221 440674 3 1 1953
>> 869 440674 3 1 1953
>> . 139172 8 . 1953
>> 497 440674 3 1 1953
>> 684 440674 3 1 1953
>> 574 158681 6 1 1953
>> 770 440674 3 1 1953
>> 454 440674 3 1 1953
>> 767 227937 18 4 1954
>> 802 112982 13 2 1954
>> 790 444041 3 1 1954
>> end
>>
>> compress
>>
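>> * number of non-missing pid values (cases) per region and year
>> bysort region yr: egen number = count(pid)
>> * sum of distr_pop over all observations in each region-year cell
>> by region yr: egen totpop = total(distr_pop)
>>
>> list, noobs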
>> *************
>>
>>
>>
>> HTH
>> Martin
>>
>>
>> -----Original Message-----
>> From: [email protected]
>> [mailto:[email protected]] On Behalf Of moleps
>> Sent: Monday, 11 January 2010 17:19
>> To: [email protected]
>> Subject: Re: st: AW: generating count and sum variable over two different
>> categorical variables
>>
>> Sorry...
>>
>>
>> . list pid distr_pop district region yr in 50/60
>>      +--------------------------------------------+
>>      | pid   distr_pop   district   region     yr |
>>      |--------------------------------------------|
>>  50. | 221      440674          3        1   1953 |
>>  51. | 869      440674          3        1   1953 |
>>  52. |   .      139172          8        .   1953 |
>>  53. | 497      440674          3        1   1953 |
>>  54. | 684      440674          3        1   1953 |
>>      |--------------------------------------------|
>>  55. | 574      158681          6        1   1953 |
>>  56. | 770      440674          3        1   1953 |
>>  57. | 454      440674          3        1   1953 |
>>  58. | 767      227937         18        4   1954 |
>>  59. | 802      112982         13        2   1954 |
>>      |--------------------------------------------|
>>  60. | 790      444041          3        1   1954 |
>>      +--------------------------------------------+
>>
>> So what I need to do is generate a variable counting the number of cases
>> from each region for each year and also a variable containing the sum of
>> the population for each region for each year. There are between 3 and 10
>> districts in each region.
>>
>> Any idea or do I have to program it from scratch?
>>
>> Regards,
>> M
>>
>>
>>
>>
>>
>> On 11. jan. 2010, at 17.06, Martin Weiss wrote:
>>
>>>
>>> As always: Show an excerpt of your data!
>>>
>>>
>>>
>>> HTH
>>> Martin
>>>
>>>
>>> -----Original Message-----
>>> From: [email protected]
>>> [mailto:[email protected]] On Behalf Of moleps
>>> Sent: Monday, 11 January 2010 16:58
>>> To: [email protected]
>>> Subject: st: generating count and sum variable over two different
>>> categorical variables
>>>
>>> Dear listers,
>>> I'm doing a survival analysis, but I also need to present some graphs at the
>>> regional level. My data are set up at the individual level, with
>>> categorical variables for year and region. What I need is a count variable
>>> for the cases counted over year AND region. Using -egen total- I'm only
>>> able to sum over either year or region, not both, as far as I understand.
>>> More generally, is there a way to sum over two categories rather than just
>>> one?
>>>
>>> That is, my data are set up as
>>>
>>> case year region
>>>
>>> and I'd like them set up as
>>>
>>> case year region #cases/year/region
>>>
>>>
>>> Regards,
>>> M
>>>
>>>
>>> *
>>> * For searches and help try:
>>> * http://www.stata.com/help.cgi?search
>>> * http://www.stata.com/support/statalist/faq
>>> * http://www.ats.ucla.edu/stat/stata/