Statistics New Zealand uses a commercial product called Integrity, made by
Ascential Software, to match administrative databases with similar issues.
More information is available at
http://www.ascential.com/products/qs_features.html. I have no idea whether
this approach is practical for smaller-scale merges or whether the product
itself is any good.
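
For a smaller-scale merge, a rough do-it-yourself alternative might look
like the sketch below. It is in Python rather than Stata, uses only the
standard library, and is not a description of how Integrity works. The
field names (id, last, dob), the suffix list, and the 0.85 threshold are
all assumptions made up for illustration. The idea is to compare last names
only within records that share a birth date, and to keep pairs whose names
are nearly, but not necessarily exactly, the same.

```python
import re
from collections import defaultdict
from difflib import SequenceMatcher

# Assumed list of name extensions to strip; extend as the data requires.
SUFFIXES = {"JR", "SR", "II", "III", "IV"}


def normalize(name):
    """Upper-case, replace punctuation with spaces, drop trailing suffixes."""
    tokens = re.sub(r"[^A-Z ]", " ", name.upper()).split()
    while tokens and tokens[-1] in SUFFIXES:
        tokens.pop()
    return " ".join(tokens)


def similarity(a, b):
    """Similarity ratio in [0, 1] from difflib; 1.0 means identical."""
    return SequenceMatcher(None, a, b).ratio()


def candidate_matches(file_a, file_b, threshold=0.85):
    """Pair records that share a birth date and have similar last names.

    Each record is a dict with the hypothetical keys 'id', 'last', 'dob'.
    Grouping file_b by birth date avoids comparing every record in one
    file against every record in the other.
    """
    by_dob = defaultdict(list)
    for rec in file_b:
        by_dob[rec["dob"]].append(rec)

    pairs = []
    for rec_a in file_a:
        name_a = normalize(rec_a["last"])
        for rec_b in by_dob.get(rec_a["dob"], []):
            score = similarity(name_a, normalize(rec_b["last"]))
            if score >= threshold:
                pairs.append((rec_a["id"], rec_b["id"], round(score, 2)))
    return pairs
```

Pairs scoring below 1.0 are only candidates and would still need checking
against first name, middle initial, gender and race before being accepted.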
Steve
> -----Original Message-----
> From: Scott Talkington [SMTP:[email protected]]
> Sent: Wednesday, August 20, 2003 9:26 AM
> To: Statalist
> Subject: st: Question about match merge on name
>
> I wonder if anyone on the list has advice about matching two databases on
> first, last and middle name and birth date (as well as gender and race).
> The size of the two files is around 400,000 observations (although they
> aren't matched 1 to 1) so correction of misspellings, transpositions, etc.
> by inspection is sort of ruled out. I have done some editing by searching
> for breaks or spaces in the name field that might be name extensions like
> "JR." etc, and have also transformed the name fields in both files to
> upper case. I then did a match on the 3 initials plus birth date, and have
> identified the subset of matches where at least the last name and birth
> date are identical.
>
> However, I suspect there are quite a few more matches to be gleaned. Are
> there utilities that can facilitate this process? Any tricks and tips?
>
> -Scott
> [email protected]
>
> *
> * For searches and help try:
> * http://www.stata.com/support/faqs/res/findit.html
> * http://www.stata.com/support/statalist/faq
> * http://www.ats.ucla.edu/stat/stata/