Pedro Martins
> I would like to match a large list of names available in
> two data sets.
> One problem is that the names are not always exactly the
> same because of
> typos. I can also use other characteristics of each
> observation and do a
> -joinby- but this still provides a large number of
> possible matches
> for each observation. Is there any procedure in Stata for
> one to match
> string variables that may not be exactly equal?
>
There is a -match()- function. Also,
Michael Blasnik's -egen, soundex()-
function may help.
On the whole, however, no such
function will be as smart as you
in deciding what is really the
same and what is different.
Nick
[email protected]
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/