Dear Statalister,
I'm working with data from a household panel (GSOEP). When doing a fixed effects regression ( "areg, absorb(id)") the question is whether to use the "clustered data" option (vce(cluster id)) if I want to analyze data "only" on the level of individuals.
Lahiri/Li (2009), 3430 say: "In health studies, observations are often collected on the same subject over time leading to correlated observations.", while textbooks like Wooldridge
(2002) give the example of, e. g. the grouping of individuals
in households. Therefore I'm quite certain that the "cluster" option would be necessary if I included the problem that normally more than one individual is a member of the same household.
But what about the problem that observations of one individual at different points of time form one cluster, so intraclass correlation is to be expected? Is this already accounted for by the use of xtreg, fe or areg or is the additional use of the cluster-option sensible?
Lahiri, P./Li, Yan (2009), A new alternative to the standard F test for clustered data, Journal of Statistical Planning and Inference 139, 3430 - 3441.
Wooldridge, Jeffrey M. (2002), Econometric Analysis of Cross Section And Panel Data, MIT Press, Cambridge, MA/London.
Thanks for your consideration!
Best wishes,
Björn Bünger
PhD-Student
Institute of Public Economics
Münster
Germany
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/