Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
Re: st: Count observations
From: Sergiy Radyakin <[email protected]>
To: "[email protected]" <[email protected]>
Subject: Re: st: Count observations
Date: Tue, 23 Jul 2013 12:40:27 -0400
See if any of this is closer to what you want. If not, please show the
manually created result for your example. Best, Sergiy
clear
input byte id str1 feetype
1 "A"
1 "B"
1 "C"
2 "C"
2 "A"
3 "A"
4 "B"
4 "Z"
4 "A"
end
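list, sepby(id)
* mark every observation with 1 so each (id, feetype) pair can be flagged
gen c=1
sort id feetype
* collapse takes the mean of c by default: one row per (id, feetype), c==1
collapse c, by(id feetype)
list, sepby(id)
* one row per id; cA, cB, ... are 1 if the firm has that fee type, else missing
reshape wide c, i(id) j(feetype) string
list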
     +------------------+
     | id   feetype   c |
     |------------------|
  1. |  1         A   1 |
  2. |  1         B   1 |
  3. |  1         C   1 |
     |------------------|
  4. |  2         A   1 |
  5. |  2         C   1 |
     |------------------|
  6. |  3         A   1 |
     |------------------|
  7. |  4         A   1 |
  8. |  4         B   1 |
  9. |  4         Z   1 |
     +------------------+

     +------------------------+
     | id   cA   cB   cC   cZ |
     |------------------------|
  1. |  1    1    1    1    . |
  2. |  2    1    .    1    . |
  3. |  3    1    .    .    . |
  4. |  4    1    1    .    1 |
     +------------------------+
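
If the goal is the number of firms that have either A or B (treating the two as the same fee), a minimal sketch along the following lines may help. It assumes the variable names id and fee from the question; hasAB and firmtag are hypothetical names introduced only for this illustration.

```stata
clear
input byte id str1 fee
1 "A"
1 "B"
1 "C"
2 "C"
2 "A"
3 "A"
4 "B"
4 ""
4 "A"
end

* hasAB (hypothetical name): 1 on every row of a firm that has at least
* one A or B fee, 0 on every row of a firm that has neither
egen byte hasAB = max(inlist(fee, "A", "B")), by(id)

* firmtag (hypothetical name): 1 on exactly one observation per firm
egen byte firmtag = tag(id)

* count each firm once, and only if it has A or B
count if firmtag & hasAB
```

For the example data this should report 4, since every firm has at least one A or B. On the reshaped (wide) data above, -count if !missing(cA) | !missing(cB)- should give the same number.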
On Tue, Jul 23, 2013 at 12:06 PM, Miguel Angel Duran Munoz
<[email protected]> wrote:
> Thank you very much for your help. Let me explain a bit more why -count-
> did not work. There is something in my variables that I did not make
> explicit in my first message (I thought I could solve it on my own after
> being helped, but that is not the case).
>
> As I told you, the variable fee describes the type of fee (eg, A B C).
> Nevertheless, the dataset is constructed in a way that A and B, for
> instance, are the same (specifically, I have "commitment fee" and
> "commitment regular fee", but both types are the same). But, although A
> and B are the same, they both might be included for the same firm.
>
> Therefore, given this illustrative dataset,
>
> Id Type-of-fee
>
> 1 A
> 1 B
> 1 C
> 2 C
> 2 A
> 3 A
> 4 B
> 4 .
> 4 A
>
> there are 4 firms that have either A or B. I was trying to use this,
> -bysort id: count if fee=="A" | fee=="B"-, but what I get is (obviously)
> split by firms.
>
> I am sorry for the initial confusion.
>
> Miguel.
>
> Unclear why it does not work. It works with the following:
>>
>> ******* start example
>> clear all
>> input id
>> 1
>> 1
>> 1
>> 2
>> 2
>> 3
>> 4
>> 4
>> 4
>> end
>> input str2 fee
>> A
>> B
>> C
>> C
>> A
>> A
>> B
>> ""
>> A
>> end
>> count if fee=="A"
>> ******* end example
>>
>> Notice that another alternative is -tab fee-
>>
>> -----Original Message-----
>> From: [email protected]
>> [mailto:[email protected]] On Behalf Of Miguel Angel
>> Duran Munoz
>> Sent: Tuesday, July 23, 2013 10:51 AM
>> To: [email protected]
>> Subject: Re: st: Count observations
>>
>> Hi, Statalisters. I have the following doubt. My dataset is arranged in
>> the following way. I have a variable that identifies firms (say id).
>> Another variable describes whether different types of fees (eg, A B C)
>> applies to a firm. Accordingly, the dataset looks similar to
>>
>> Id Type-of-fee
>>
>> 1 A
>> 1 B
>> 1 C
>> 2 C
>> 2 A
>> 3 A
>> 4 B
>> 4 .
>> 4 A
>>
>> I would like to know, for instance, the number of A fees that there are. I
>> have used -count- but I am not able to get what I want. Will you please
>> help me?
>>
>> Thanks in advance.
>>
>> Miguel.
>>
>>
>> *
>> * For searches and help try:
>> * http://www.stata.com/help.cgi?search
>> * http://www.stata.com/support/faqs/resources/statalist-faq/
>> * http://www.ats.ucla.edu/stat/stata/