Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
Re: st: Extract a letter between numbers
From
"Dimitriy V. Masterov" <[email protected]>
To
[email protected]
Subject
Re: st: Extract a letter between numbers
Date
Mon, 22 Nov 2010 15:41:53 -0500
I think you may want to fuzzy merge your dirty address data and your
clean data using nearmrg, which you can get from SSC.
An alternative would be to append your two data sets and then use
strgroup on the variable that is the stacked version of your clean and
dirty addresses. That will give you the closest match.
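A minimal sketch of the append-then-strgroup approach follows. The file
names (clean.dta, dirty.dta), the variable names (address, source,
addr_group, mixed), and the 0.25 threshold are all hypothetical
placeholders, and it assumes neither file already has a variable called
source. Treat it as a starting point rather than a tested recipe.

```stata
* Hypothetical names throughout: clean.dta, dirty.dta, address
ssc install strgroup                      // one-time install from SSC

use clean, clear
generate byte source = 0                  // 0 = record from the clean file
append using dirty
replace source = 1 if missing(source)     // 1 = record from the dirty file

* Put similar strings in the same group; 0.25 is an arbitrary
* illustrative threshold, not a recommended value
strgroup address, generate(addr_group) threshold(0.25)

* Flag groups that contain both a clean and a dirty address
bysort addr_group (source): generate byte mixed = source[1] != source[_N]
list addr_group source address if mixed, sepby(addr_group)
```

A lower threshold gives fewer, tighter groups, and a higher one gives
more false matches, so it is worth checking the listed groups by eye
before using them to merge.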
Neither approach will be perfect, and either may take a long time or
fail if you have too much data. The latter approach has some operating
system restrictions as well.
DVM