Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
st: cluster analysis validation
From
"Dasinger, Lisa" <[email protected]>
To
[email protected]
Subject
st: cluster analysis validation
Date
Mon, 16 Apr 2012 15:57:30 -0700
I've run a cluster analysis in Stata 11.2 based on three continuous
variables using -cluster wardslinkage- with the default
similarity/dissimilarity measure to generate 6 groups. I'd like to know
if there is a way to apply the same cluster analysis to a new set of
data. In other words, is there a way to run a new dataset through the
old cluster analysis and see how new observations are classified, akin
to running a regression equation and then taking a new dataset to obtain
out of sample predictions?
If so, is there a way to evaluate how well the "old" analysis fits the
new data, e.g., by determining how similar/dissimilar each new
observation is to the observations in the cluster in which each is
placed, and whether the new observation is placed in the "best" cluster,
meaning the one that minimizes the distance between the observation and
the "center" of the existing cluster?
I am new to cluster analysis and am looking for a way to validate the
"old" cluster analysis.
Lisa
Lisa Dasinger, Ph.D.
Data Reporting Manager
Claims Analytics
Zenith Insurance Company
Pleasanton Regional Office
4309 Hacienda Drive, Suite 200
Pleasanton, CA 94588
[email protected]
www.TheZenith.com
***********************************************************
NOTICE:
This e-mail, including attachments, contains information
that may be confidential, protected by the attorney/client
or other privileges, or exempt from disclosure under
applicable law. Further, this e-mail may contain
information that is proprietary and/or constitutes a trade
secret. This e-mail, including attachments, constitutes
non-public information intended to be conveyed only to the
designated recipient of this communication, please be
advised that any disclosure, dissemination, distribution,
copying, or other use of this communication or any attached
document is strictly prohibited. If you have received this
communication in error, please notify the sender
immediately by reply e-mail and promptly destroy all
electronic and printed copies of this communication and
attached documents.
***********************************************************
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/