Maybe I am missing something obvious here, but I can't follow what you
are trying to do either. This criterion:
> 4 .nongroup_f = 1 when both firm_id and nation_id are different for
> two or more observations with the same contract id
does not seem to be consistent with this line listing:
> 10 5 US 1 0 0 1
> 10 5 US 1 0 0 1
here are two observations with the same firm_id and nation_id yet
nongroup_f is 1. However, you may want to try looking at some
combinations of -duplicates, tag- and levelsof, this might help as an
alternative approach.
Tim
On Tue, Nov 10, 2009 at 12:08 PM, joe j <[email protected]> wrote:
> Thanks. The last 4 columns (group_d; group_f; nongroup_d; nongroup_f)
> are the final output variables. Their definitions are below the table.
>
> ******
> contract_id; firm_id; nation_id; group_d; group_f; nongroup_d; nongroup_f
> 1 2 US 1 0 0 0
> 1 2 US 1 0 0 0
> 4 3 UK 0 1 0 0
> 4 3 US 0 1 0 0
> 8 3 US 0 0 1 1
> 8 4 UK 0 1 0 1
> 8 4 US 0 1 1 1
> 9 3 US 0 0 1 1
> 9 4 UK 0 0 0 1
> 9 5 US 0 0 1 1
> 10 4 CH 0 1 0 1
> 10 4 UK 0 1 0 1
> 10 5 US 1 0 0 1
> 10 5 US 1 0 0 1
> 10 6 NL 0 0 1 1
> 10 7 NL 0 0 1 1
> ******
> 1. group_d = 1 when both firm_id and nation_id are same for two or
> more observations with the same contract id
> 2. group_f = 1 when firm_id is same but nation_id is different for
> two or more observations with the same contract id
> 3. nongroup_d = 1 when firm_id is different but nation_id is same for
> two or more observations with the same contract id
> 4 .nongroup_f = 1 when both firm_id and nation_id are different for
> two or more observations with the same contract id
>
>
> On Tue, Nov 10, 2009 at 5:47 PM, Martin Weiss <[email protected]> wrote:
>>
>> <>
>>
>>
>> For clarification, you could provide the solution, i.e. the dummies that you
>> actually want to see as your final output, for your chosen example. Makes it
>> considerably easier to work towards code for you...
>>
>>
>>
>> HTH
>> Martin
>>
>>
>> -----Ursprüngliche Nachricht-----
>> Von: [email protected]
>> [mailto:[email protected]] Im Auftrag von joe j
>> Gesendet: Dienstag, 10. November 2009 17:39
>> An: [email protected]
>> Betreff: Re: st: AW: forvalues & replace not working under two 'not equal
>> to' conditions
>>
>> Thanks Martin. I think I wasn't clear enough in the last mail. I was
>> not looking at various combinations of firm_id, nation_id and
>> contract_id 'for each observation'. Rather I was looking at the
>> similarity or difference of firm_id/nation_id 'between two or more
>> observations' under each contract_id.
>>
>> Based on Martin's suggestion I could derive group_d (see below). But I
>> still can't get right nongroup_f, which equals 1 (for all
>> observations) if firm_id and nation_id are different for two or more
>> observations under each contract_id (but it takes a value 1, wrongly,
>> for all observations in the data)
>>
>> *deriving group_d (this works)
>> egen groups=group(firm_id nation_id)
>>
>> bys contract_id (groups): /*
>> */ gen byte distinctcount_group_d= /*
>> */ (groups[_n]==groups[_n+1])
>>
>> bys contract_id (groups): /*
>> */ replace distinctcount_group_d=1 /*
>> */ if (groups[_n]==groups[_n-1])
>>
>> *2 deriving nongroup_f doesnt work (e.g. it should be 0 for contract_id=1)
>> bys contract_id (groups): /*
>> */ gen byte distinctcount_nongroup_f= /*
>> */ (groups[_n]~=groups[_n+1]) & (nation_id[_n]~=nation_id[_n+1])
>>
>> bys contract_id (groups): /*
>> */ replace distinctcount_nongroup_f=1 /*
>> */ if (groups[_n]~=groups[_n-1]) & (nation_id[_n]~=nation_id[_n-1])
>>
>> On Tue, Nov 10, 2009 at 4:14 PM, Martin Weiss <[email protected]> wrote:
>>>
>>> <>
>>>
>>> I think a variable denoting the combinations between the three ids is a
>> good
>>> place to start for you:
>>>
>>>
>>>
>>> *************
>>> clear*
>>> inp byte(contract_id firm_id) nation_id:mylabel, auto
>>> 1 2 "US"
>>> 1 2 "US"
>>> 4 3 "UK"
>>> 4 3 "US"
>>> 8 4 "US"
>>> 8 4 "UK"
>>> 8 3 "US"
>>> 9 5 "US"
>>> 9 4 "UK"
>>> 9 3 "US"
>>> 10 5 "US"
>>> 10 5 "US"
>>> 10 6 "NL"
>>> 10 7 "NL"
>>> 10 4 "UK"
>>> 10 4 "CH"
>>> end
>>>
>>> egen groups=group(contract_id firm_id nation_id)
>>>
>>> l, sepby(con) noobs
>>> *************
>>>
>>>
>>>
>>> HTH
>>> Martin
>>>
>>>
>>> -----Ursprüngliche Nachricht-----
>>> Von: [email protected]
>>> [mailto:[email protected]] Im Auftrag von joe j
>>> Gesendet: Dienstag, 10. November 2009 16:04
>>> An: [email protected]
>>> Betreff: st: forvalues & replace not working under two 'not equal to'
>>> conditions
>>>
>>> My dataset has three variables 1. contract_id, 2. firm_id and 3.
>>> nation_id. I want to create 4 variables, each of which gets a value of
>>> 1 if certain conditions are met. The variables I want to create are
>>> specific to the contract id, and are:
>>>
>>> 1. group_d = 1 when both firm_id and nation_id are same for two or
>>> more firms with the same contract id
>>> 2. group_f = 1 when firm_id is same but nation_id is different for
>>> two or more firms with the same contract id
>>> 3. nongroup_d = 1 when firm_id is different but nation_id is same for
>>> two or more firms with the same contract id
>>> 4 .nongroup_f = 1 when both firm_id and nation_id are different for
>>> two or more firms with the same contract id
>>>
>>> The following code works well for the first three variables, but not
>>> for the last, nongroup_f; the value is 1 for all observations. I can't
>>> figure out why.
>>>
>>> This is a sample code:
>>>
>>> clear
>>> inp str10(contract_id firm_id nation_id)
>>> 1 2 "US"
>>> 1 2 "US"
>>> 4 3 "UK"
>>> 4 3 "US"
>>> 8 4 "US"
>>> 8 4 "UK"
>>> 8 3 "US"
>>> 9 5 "US"
>>> 9 4 "UK"
>>> 9 3 "US"
>>> 10 5 "US"
>>> 10 5 "US"
>>> 10 6 "NL"
>>> 10 7 "NL"
>>> 10 4 "UK"
>>> 10 4 "CH"
>>> end
>>>
>>>
>>> *1.group_d . WORKS!
>>> gen group_d=.
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace group_d=1 if firm_id==firm_id[_n-`i'] &
>>> nation_id==nation_id[_n-`i']
>>> }
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace group_d=1 if firm_id==firm_id[_n+`i'] &
>>> nation_id==nation_id[_n+`i']
>>> }
>>>
>>> *2.group_f WORKS!
>>> gen group_f=.
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace group_f=1 if firm_id==firm_id[_n-`i'] &
>>> nation_id!=nation_id[_n-`i']
>>> }
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace group_f=1 if firm_id==firm_id[_n+`i'] &
>>> nation_id!=nation_id[_n+`i']
>>> }
>>>
>>> *3. nongroup_d WORKS!
>>> gen nongroup_d=.
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace nongroup_d=1 if firm_id!=firm_id[_n-`i'] &
>>> nation_id==nation_id[_n-`i']
>>> }
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace nongroup_d=1 if firm_id!=firm_id[_n+`i'] &
>>> nation_id==nation_id[_n+`i']
>>> }
>>>
>>> *4.nongroup_f DOESN'T WORK!!
>>> gen nongroup_f=.
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace nongroup_f=1 if (firm_id~=firm_id[_n-`i']) &
>>> (nation_id~=nation_id[_n-`i'])
>>> }
>>> forvalues i=1/`=_N'{
>>> bys contract_id: replace nongroup_f=1 if (firm_id~=firm_id[_n+`i']) &
>>> (nation_id~=nation_id[_n+`i'])
>>> }
>>> *
>>> * For searches and help try:
>>> * http://www.stata.com/help.cgi?search
>>> * http://www.stata.com/support/statalist/faq
>>> * http://www.ats.ucla.edu/stat/stata/
>>>
>>>
>>> *
>>> * For searches and help try:
>>> * http://www.stata.com/help.cgi?search
>>> * http://www.stata.com/support/statalist/faq
>>> * http://www.ats.ucla.edu/stat/stata/
>>>
>>
>> *
>> * For searches and help try:
>> * http://www.stata.com/help.cgi?search
>> * http://www.stata.com/support/statalist/faq
>> * http://www.ats.ucla.edu/stat/stata/
>>
>>
>> *
>> * For searches and help try:
>> * http://www.stata.com/help.cgi?search
>> * http://www.stata.com/support/statalist/faq
>> * http://www.ats.ucla.edu/stat/stata/
>>
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/statalist/faq
> * http://www.ats.ucla.edu/stat/stata/
>
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/