Re: st: Stata equivalent of SPEDIS function in SAS
From: Nick Cox <[email protected]>
To: [email protected]
Subject: Re: st: Stata equivalent of SPEDIS function in SAS
Date: Wed, 20 Jul 2011 09:00:59 -0500
You might get more answers by defining "spelling distance" precisely.
Many members of this list, probably most, don't use SAS or are not
familiar with its detailed commands and functions. But look at
-strgroup- from SSC:
. ssc desc strgroup
------------------------------------------------------------------------------------------------------------------------------------
package strgroup from http://fmwww.bc.edu/repec/bocode/s
------------------------------------------------------------------------------------------------------------------------------------
TITLE
'STRGROUP': module to match strings based on their Levenshtein
edit distance
DESCRIPTION/AUTHOR(S)
strgroup matches similar strings together. This can be useful
when merging data that contain typos. For example, "widgets" will
not merge with "widgetts" because the strings are not identical.
strgroup provides a way to match strings in an objective and
automated manner.
KW: data management
KW: string match
KW: string merge
KW: string group
KW: levenshtein
Requires: Stata version 9.2
Distribution-Date: 20100814
Author: Julian Reif, University of Chicago
Support: email [email protected]
INSTALLATION FILES (type net install strgroup)
strgroup.ado
strgroup.hlp
../l/levenshtein.ado
../l/levenshtein.hlp
strgroup.plugin
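For readers unfamiliar with the term in the package title, here is a minimal sketch of the Levenshtein edit distance: the smallest number of single-character insertions, deletions, and substitutions needed to turn one string into another. It is written in Python purely for illustration. The function name `levenshtein` below is a hypothetical helper, not the code shipped with -strgroup-, and the result is not claimed to match what SPEDIS returns in SAS.

```python
def levenshtein(a: str, b: str) -> int:
    """Illustrative Levenshtein distance (not the strgroup implementation)."""
    # Keep the shorter string in the inner loop so the working rows stay small.
    if len(a) < len(b):
        a, b = b, a

    # previous[j] = distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))

    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, ch_b in enumerate(b, start=1):
            substitution_cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,                      # delete ch_a
                    current[j - 1] + 1,                   # insert ch_b
                    previous[j - 1] + substitution_cost,  # substitute or match
                )
            )
        previous = current

    return previous[-1]


# The example from the package description: one extra "t".
print(levenshtein("widgets", "widgetts"))  # 1
```

A distance of 1 for "widgets" and "widgetts" is why a near-match rule based on edit distance can pair the two strings even though an exact merge would not.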
On Wed, Jul 20, 2011 at 8:46 AM, Zou, Joe Hong <[email protected]> wrote:
>
> Does anybody know whether there is a Stata routine that calculates the
> spelling distance between two names as what SPEDIS function in SAS does?