[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: RE: RE: RE: is ordering with -bysort- unique?

From	"Nick Cox" <[email protected]>
To	<[email protected]>
Subject	st: RE: RE: RE: is ordering with -bysort- unique?
Date	Tue, 10 Jun 2003 23:38:41 +0100

Jose Luis Negrin Mu�oz

> > I have used the following small program to get rid of duplicates
> >
> > 	sort ref;
> > 	by ref: gen dup=_n;
> > 	gsort ref -mesini dup;
> > 	drop if dup>1;
> >
> > where ref is a reference code and mesini is a variable
> > related to time
> > (1, 2...j); these are the variables I am sorting my data
> > by. So I will
> > keep only the oldest data with each particular reference code
>
> Two comments:
>
> 1. This code appears to keep the most recent
> observation within each group, not the oldest.
> Another way to do it is
>
> bysort ref (mesini) : keep if _n == _N
>

Jose's code needs more comment. He had

 	sort ref;
 	by ref: gen dup=_n;
 	gsort ref -mesini dup;
 	drop if dup>1;

The -gsort- makes no difference to what is dropped, so
in terms of what is kept (although not in what order)
Jose's code is just

 	sort ref;
 	by ref: gen dup=_n;
 	drop if dup>1;

which is equivalent to

	bysort ref: drop if _n > 1

or

	bysort ref : keep if _n == 1

The problem here is what is first (_n == 1) within
each group of -ref- is not guaranteed to be
the same observation, let alone the time desired.
This is what was spelled out in the examples in my
original reply to Radu.

Nick
[email protected]

*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: RE: RE: is ordering with -bysort- unique?
  - From: "Nick Cox" <[email protected]>

Prev by Date: st: RE: RE: is ordering with -bysort- unique?
Next by Date: Re: st: Fixed effects and time dummy variables
Previous by thread: st: RE: RE: is ordering with -bysort- unique?
Next by thread: st: RE: is ordering with -bysort- unique?
Index(es):
- Date
- Thread