[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: RE: Cluster analysis on survey data

From	"Nick Cox" <[email protected]>
To	<[email protected]>
Subject	st: RE: Cluster analysis on survey data
Date	Thu, 28 Aug 2008 16:55:42 +0100

What is BRFSS? 

On the main question, it is evident that -cluster- does not support any
kind of weights, so that is one short answer. 

I am unclear on how in principle any kind of weights could inform
cluster analysis. Although there are different recipes, cluster analysis
as implemented in Stata is in essence a more or less elaborate way of
quantifying information on similarity or differences between
observations in a multivariate space. 

Suppose for example that I am in a survey, you are too, and several
other people are as well. Cluster analysis offers methods for plotting
me, you and the others in a space. How are those differences affected by
the sampling design behind who is and who isn't in the dataset,
particularly as no parameter estimation or hypothesis testing is
involved? 

Nick 
[email protected] 

Jessica M. Tullar, PhD

I am using a BRFSS dataset and therefore it has a complex sampling
design. I would like to describe who are the kinds of people that report
medical debt and medical bankruptcy and therefore thought cluster
analysis might be appropriate. 

I've looked through all the manuals and even searched survey analysis
and on cluster analyses and can't find the answer. Is there a way to
perform a cluster analysis and account for the survey weights? 

A second possibility would involve creating a new representative dataset
on which to perform the standard cluster analysis. How best would one
create a new dataset using the survey weighting which would approximate
the population?

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: RE: Cluster analysis on survey data
  - From: Steven Samuels <[email protected]>

References:
- st: Cluster analysis on survey data
  - From: "Tullar, Jessica M" <[email protected]>

Prev by Date: st: RE: RE: preserving leading zeros in destring
Next by Date: st: How do I create a new dataset with just one variable but 500,000 obs?
Previous by thread: st: Cluster analysis on survey data
Next by thread: Re: st: RE: Cluster analysis on survey data
Index(es):
- Date
- Thread