Re: st: Comparing two data set
From: Nick Cox <[email protected]>
To: [email protected]
Subject: Re: st: Comparing two data set
Date: Thu, 3 Mar 2011 09:10:23 +0000
I don't think there is any logical difficulty here. If the same
mistake was made twice, a check of whether observations are identical
will not also be a check of whether the data are correct. The point of
double data entry is that repeated mistakes are less likely than a
mistake made once, but nothing makes mistakes impossible. Also, there
are usually other logic checks or range checks that can be made about
data that may catch some kinds of repeated mistakes.
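A minimal sketch of such checks follows. It is only an illustration: the ids, the variables age and sex, and all values are made up, and the second entry is held in a temporary file. Since cf compares two files observation by observation, it may help to confirm first that the same ids appear in both entries.

```stata
* Sketch only: ids, variables (age, sex) and values are hypothetical.
* Second entry, saved to a temporary file
clear
input id age sex
1 34 1
2 51 2
4 27 2
end
tempfile second
save `second'

* First entry, left in memory
clear
input id age sex
1 34 1
2 51 2
3 27 2
end

* id must be nonmissing and unique in the data in memory
isid id

* Range and logic checks: assert stops with an error if any observation fails
assert inrange(age, 0, 110)
assert inlist(sex, 1, 2)

* ids found in only one entry get _merge == 1 (first) or 2 (second)
merge 1:1 id using `second'
list id _merge if _merge != 3
```

With these made-up data, ids 3 and 4 would be listed as unmatched, which points to an id entered wrongly in one of the two files. An id mistyped identically in both entries would still pass unnoticed.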
A century and more ago Charles Darwin emphasised that false facts are
much more difficult to eliminate than false ideas.
Am I missing something in your question?
Nick
On Thu, Mar 3, 2011 at 4:00 AM, Rajaram Subramanian Potty
<[email protected]> wrote:
> Thank you very much for the information on different ways to compare
> two files. I have generated error lists using both cf and cf3. The
> error lists generated by these two methods are identical. We
> also manually checked the data for a few observations from these two
> error lists and found them to be correct. However, I am not sure whether the
> error list will be correct if the ID variable is not entered correctly
> in the two data sets.
>