Hello All,
I understand that clustering specifies that observations be independent
across groups while allowing for changes in variance within a group. My
question is how is this different than controlling for a group with an
indicator (0/1) variables.
For example, If my data contains patient data for 12 hospitals and my LHS
variable is (0/1) for recovery and my RHS variables include characteristic
variables, treatment type, etc. What is the difference between clustering
on hospital or creating an indicator variable for each hospital?
Thanks in advance for any commentary,
David J. Bernstein, Ph.D.
[email protected]
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/