At 12:09 AM 10/4/2004 -0400, mnitkin wrote:
> I have a data set with 131,000 firm observations over 14 years.
> Individual firms may be in the data set between 1 and 14 times. I want
> to keep only those firms that have observations for the entire 14 year
> period.
> I've tried all the tricks I know as well as a number of suggestions on
> the stata website, but I haven't had any luck.
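Something like this might do it. Let's suppose you want to keep those
cases where the same id number occurs 14 times (i.e. there is a first
occurrence and then 13 "duplicates"). Let's further assume each firm has
a maximum of 1 record per year. Since -duplicates tag- stores, for each
observation, the number of other observations with the same id, a firm
present in all 14 years gets the value 13. Then,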
duplicates tag id, gen(nyears)
keep if nyears == 13
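
An alternative sketch, not tested on your data, counts the records per
firm directly with -bysort- and _N. It assumes the firm identifier is
called id, as above, and that the year variable is called year (that
name is my guess; substitute your own). The -isid- line is only there to
check the one-record-per-year assumption before anything is dropped.

```stata
* Assumption: id identifies the firm, year identifies the year
* (year is a hypothetical variable name; replace it with yours).

* Stop with an error unless id and year together uniquely
* identify the observations, i.e. at most 1 record per firm-year.
isid id year

* Within each firm, _N is the number of records for that firm.
* Keep only firms that appear in all 14 years.
bysort id: keep if _N == 14
```

If -isid- fails, some firm has more than one record in a year (or a
missing id or year), and a simple count of 14 would no longer guarantee
full coverage of the period.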
-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
OFFICE: (574)631-6668, (574)631-6463
FAX: (574)288-4373
HOME: (574)289-5227
EMAIL: [email protected]
WWW (personal): http://www.nd.edu/~rwilliam
WWW (department): http://www.nd.edu/~soc