Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: Frequency weighted cluster analysis
From
Nick Cox <[email protected]>
To
[email protected]
Subject
Re: st: Frequency weighted cluster analysis
Date
Wed, 11 Jan 2012 00:15:16 +0000
Why would a cluster analysis change because some observations are
duplicated? The similarity or dissimilarity of objects is not affected
by their frequency. What does this SAS statement do that should be
replicated by Stata?
Nick
On Tue, Jan 10, 2012 at 11:25 PM, Brendan Halpin <[email protected]> wrote:
> Is it possible to use frequency weighted data with cluster (and in
> particular clustermat)?
>
> From the manual I see that it is not intended to be possible -- no
> weight term in the syntax, for instance. However, for datasets with
> significant rates of duplicates, it could be a way of reducing the
> computational burden very significantly.
>
> SAS documentation suggests that PROC CLUSTER has a FREQ statement that
> does this.
>
> The Stata manual also suggests that programmers might implement their
> own clustering algorithms, but there are no examples of how this might
> be done.
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/