One approach might be to -collapse- the data, thus ignoring all the fine
structure of missingness. A simple default -collapse- would produce
means for the 152 "statements". With that, a formal cluster analysis
would seem pointless, as you would have a distribution of 152 means that
could be tabulated and plotted to see all the variability.
A variant on that would be to -collapse- to two or more summary
statistics and then combine the resulting datasets to get a composite
dataset with structure
summary statistic list * statement list
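The -collapse- idea above can be sketched outside Stata as well. The following Python/pandas fragment is only an illustration under assumed names and sizes: simulated wide data with one row per consumer, hypothetical columns s1-s152, and NaN where a statement was not shown.

```python
import numpy as np
import pandas as pd

# Hypothetical wide layout: one row per consumer, columns s1..s152,
# NaN where a statement was not shown to that consumer.
rng = np.random.default_rng(0)
data = pd.DataFrame(rng.integers(1, 8, size=(300, 152)).astype(float),
                    columns=[f"s{i}" for i in range(1, 153)])
for i in range(len(data)):
    # each consumer "sees" only 25 of the 152 statements
    data.iloc[i, rng.choice(152, size=127, replace=False)] = np.nan

# Analogue of a default -collapse (mean) s1-s152-: one mean per statement
means = data.mean()                          # Series of length 152

# The variant: several summary statistics per statement, giving a
# composite dataset with structure summary statistic x statement
composite = data.agg(["mean", "std", "count"])
```

The 152 means then form a single distribution that can be tabulated and plotted directly, as suggested; the composite version keeps one row per summary statistic.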
The real motivation behind cluster analysis is often unclear to me and
it's certainly not clear to me in this case, so better advice could
depend on a finer specification of the scientific problem here.
Nick
Walter R. Paczkowski, Ph.D. wrote on Wed 29 Jul 2009 11:08:34 -0400
A client has a dataset from a survey in which consumers were shown a
randomly selected set of 25 needs statements from a total of 152
statements. Each consumer saw only 25. The client want to cluster the
152 needs statements (i.e., 152 variables). Since the 25 were selected
at random, this should be a Missing Completely at Random problem. But
with each consumer responding to only 25, each record will have 127
missing values. I assume that Stata's clustering routines will do
listwise deletion, so there should be no data available for clustering.
Does anyone have any ideas how to handle this? Any suggestions? Can a
similarity matrix still be created (how?) with so many missing data points?
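On the last question: a similarity matrix can still be built by using pairwise rather than listwise deletion, i.e. computing each entry from only the consumers who happened to see both statements. A minimal sketch in Python (simulated data; the 5-cluster count, the min_periods threshold, and the 1 - correlation distance are all arbitrary assumptions, not a recommendation):

```python
import numpy as np
import pandas as pd
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import squareform

# Simulated data: rows = consumers, columns = the 152 statements,
# NaN where a statement was not in a consumer's random set of 25.
rng = np.random.default_rng(1)
data = pd.DataFrame(rng.integers(1, 8, size=(1000, 152)).astype(float),
                    columns=[f"s{i}" for i in range(1, 153)])
for i in range(len(data)):
    data.iloc[i, rng.choice(152, size=127, replace=False)] = np.nan

# Pairwise-complete correlations: each entry uses only the consumers
# who saw both statements, so nothing is deleted listwise.
corr = data.corr(min_periods=5)      # pandas deletes pairwise by default
dissim = (1 - corr).fillna(1.0)      # guard the rare too-sparse pair
np.fill_diagonal(dissim.values, 0.0)

# Hierarchical clustering on the condensed dissimilarity matrix
Z = linkage(squareform(dissim.values, checks=False), method="average")
labels = fcluster(Z, t=5, criterion="maxclust")  # 5 clusters is arbitrary
```

Within Stata, the same idea would be to build the 152 x 152 pairwise-complete similarity (or dissimilarity) matrix and feed it to -clustermat-.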
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/