[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: RE: Question about match merge on name

From	"Daniel R Sabath" <[email protected]>
To	<[email protected]>
Subject	st: RE: Question about match merge on name
Date	Tue, 19 Aug 2003 14:53:23 -0700

I have had a similar problem and have been researching answers. My current
solution is not pretty and works only because I have a very small data set
which I need to match to a much larger set. 

While I was looking for solutions I ran across two projects which may help.
The first is FEBRL.
"This third release of prototype software for probabilistic record linkage
written in the Python programming language contains routines for data
cleaning and standardisation, and probabilistic record linkage and
deduplication."

http://datamining.anu.edu.au/projects/linkage.html
While it is still beta software, it seems to do a fairly good job.


The other is a german project utilizing Perl and Java but reads Stata files.

Prev by Date: st: RE: Question about match merge on name
Next by Date: [no subject]
Previous by thread: st: RE: Re: RE: Question about match merge on name (bug in extrname)
Next by thread: st: RE: Question about match merge on name
Index(es):
- Date
- Thread