st: RE: duplicates in timevar in panels
Dear Mahesh,
One command that I often use to find duplicates in panel data is:
sort panel_id timevar, stable
br panel_id timevar if (panel_id==panel_id[_n-1] & timevar==timevar[_n-1]) | (panel_id==panel_id[_n+1] & timevar==timevar[_n+1])
This identifies the duplicates.
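The identification step can also be sketched by counting how many observations share each panel_id/timevar pair. This is only an illustration: the toy values, the variable x, and the counter n_same are assumptions and do not come from the original data.

```stata
* Toy data (made up for illustration): panel 1 has timevar 2 twice
clear
input panel_id timevar x
1 1 10
1 2 20
1 2 20
2 1 30
2 2 40
end

* n_same = number of observations sharing this panel_id/timevar pair
bysort panel_id timevar: generate n_same = _N

* Show only the repeated pairs, grouped by panel
list panel_id timevar x if n_same > 1, sepby(panel_id)
```

Because the comparison is made within each panel_id/timevar group, a row is never flagged just because it sits next to a different panel that happens to have the same time value.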
If the duplicate rows are identical in every variable, you can drop either one (with the data still sorted as above):
drop if panel_id==panel_id[_n-1] & timevar==timevar[_n-1]
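The built-in duplicates command may be a convenient alternative here: it does not need the by prefix, because the identifying variables are passed as a varlist. A minimal sketch on made-up data (the values and the variable x are assumptions for illustration only):

```stata
* Toy data (made up for illustration): one fully identical repeated row
clear
input panel_id timevar x
1 1 10
1 2 20
1 2 20
2 1 30
2 2 40
end

* Report how many panel_id/timevar pairs occur more than once
duplicates report panel_id timevar

* Drop only observations that are identical on every variable
duplicates drop

* Exits with an error if panel_id/timevar still does not identify rows uniquely
isid panel_id timevar

* With unique time values within each panel, tsset should now succeed
tsset panel_id timevar
```

If two rows share panel_id and timevar but differ in other variables, duplicates drop leaves both in place and isid will fail; those cases need to be inspected before deciding which row to keep. Dropping them regardless would require duplicates drop panel_id timevar, force.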
BW,
mukesh
-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Mahesh Ramachandran
Sent: 26 June 2008 19:37
To: [email protected]
Subject: st: duplicates in timevar in panels
Hello all,
I am working on a panel data set: 2000 households over 10 years (120 months).
It has a household identifier as the panel id and a year/month variable as timevar.
When I tried to tsset it, I got the following result:
. tsset panel_id timevar
repeated time values within panel
r(451);
My goal is to identify and isolate the panels that have repeated time values and remove the duplicates.
So I tried:
by panel_id: duplicates re timevar,
and the response was that duplicates cannot be combined with the "by" prefix. Can you suggest a way to remove
the duplicates?
Thanks,
MR
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/