Anirban and Mark,
I concur that -reg, robust - and -xtreg, fe- will not yield the same
coefficients, at least in my experience. And I'd be shocked if
the results of - xtreg, re- were the same as well ...
Thanks for all of your insights on this!
John
anirban basu wrote:
> On Tue, 25 Jun 2002, Mark Schaffer wrote:
>
> >
> > Not quite sure what you mean, so apologies if I'm off target. The
> > coefficient estimates with -regress- won't be the same as with
> > -xtreg, fe- (unless the former is estimating the same model by
> > explicitly including the fixed effects as dummy vars). Both sets of
> > coefficients will be different again from those produced by
> > -xtreg, re-.
> >
> > --Mark
>
> You may be right about this. However, I did get the same coeff from
> -regress-, xtreg, fe and xtreg, re using a simulated dataset with
> exchangeable corr and no dummy vars. ALso, I got different coeffs by
> running an exponential corr model as expected.. Here are the details. I
> may have missed something here..Thanks for your input,
>
> Anirban
>
> . mat C= (1, 0.6, 0.6, 0.6 \ 0.6, 1, 0.6, 0.6 \ 0.6, 0.6, 1, 0.6 \ 0.6,
> 0.6, 0.6, 1)
>
> .
> . drawnorm y1 y2 y3 y4, n(1000) means(1 3 4 7) corr(C)
> (obs 1000)
>
> . gen id=_n
>
> . reshape long y , i(id) j(time)
> (note: j = 1 2 3 4)
>
> Data wide -> long
> -----------------------------------------------------------------------------
> Number of obs. 1000 -> 4000
> Number of variables 5 -> 3
> j variable (4 values) -> time
> xij variables:
> y1 y2 ... y4 -> y
> -----------------------------------------------------------------------------
>
> .
> . reg y time, cluster(id)
>
> Regression with robust standard errors Number of obs =4000
> F( 1,999) =42518.72
> Prob > F =0.0000
> R-squared =0.7897
> Number of clusters (id) = 1000 Root MSE =1.0946
>
> ------------------------------------------------------------------------------
> | Robust
> y | Coef. Std. Err. t P>|t| [95% Conf. Interval]
> -------------+----------------------------------------------------------------
> time | 1.896646 .0091981 206.20 0.000 1.878596 1.914696
> _cons | -.9866896 .0358667 -27.51 0.000 -1.057072 -.916307
> ------------------------------------------------------------------------------
>
> .
> . tsset id time
> panel variable: id, 1 to 1000
> time variable: time, 1 to 4
> . iis id
> . tis time
>
> . xtreg y time, fe
>
> Fixed-effects (within) regression Number of obs =4000
> Group variable (i) : id Number of groups =1000
>
> R-sq: within = 0.9027 Obs per group: min =4
> between = 0.0000 avg =4.0
> overall = 0.7897 max =4
>
> F(1,2999) =27835.94
> corr(u_i, Xb) = -0.0000 Prob > F =0.0000
>
> ------------------------------------------------------------------------------
> y | Coef. Std. Err. t P>|t| [95%
> Conf. Interval]
> -------------+----------------------------------------------------------------
> time | 1.896646 .011368 166.84 0.000 1.874356 1.918936
> _cons | -.9866896 .0311325 -31.69 0.000 -1.047733 -.9256464
> -------------+----------------------------------------------------------------
> sigma_u | .84490704
> sigma_e | .80383774
> rho | .52489398 (fraction of variance due to u_i)
> ------------------------------------------------------------------------------
> F test that all u_i=0: F(999, 2999) = 4.42 Prob > F =0.0000
>
> . xtreg y time, re
>
> Random-effects GLS regression Number of obs =4000
> Group variable (i) : id Number of groups =1000
>
> R-sq: within = 0.9027 Obs per group: min =4
> between = 0.0000 avg =4.0
> overall = 0.7897 max =4
>
> Random effects u_i ~ Gaussian Wald chi2(1) =27835.94
> corr(u_i, X) = 0 (assumed) Prob > chi2 =0.0000
>
> ------------------------------------------------------------------------------
> y | Coef. Std. Err. z P>|z| [95% Conf. Interval]
> -------------+----------------------------------------------------------------
> time | 1.896646 .011368 166.84 0.000 1.874365 1.918927
> _cons | -.9866896 .0390072 -25.30 0.000 -1.063142 -.9102369
> -------------+----------------------------------------------------------------
> sigma_u | .74318848
> sigma_e | .80383774
> rho | .46085639 (fraction of variance due to u_i)
> ------------------------------------------------------------------------------
>
> . prais y time
>
> Number of gaps in sample: 999 (gap count includes panel changes)
> (note: computations for rho restarted at each gap)
>
> Iteration 0: rho = 0.0000
> Iteration 1: rho = 0.4034
> Iteration 2: rho = 0.4136
> Iteration 3: rho = 0.4140
> Iteration 4: rho = 0.4140
> Iteration 5: rho = 0.4140
>
> Prais-Winsten AR(1) regression -- iterated estimates
>
> Source | SS df MS Number of obs =4000
> -------------+------------------------------ F( 1, 3998) =8914.61
> Model | 8913.21448 1 8913.21448 Prob > F =0.0000
> Residual | 3997.37575 3998 .999843859 R-squared =0.6904
> -------------+------------------------------ Adj R-squared =0.6903
> Total | 12910.5902 3999 3.22845467 Root MSE =.99992
>
> ------------------------------------------------------------------------------
> y | Coef. Std. Err. t P>|t| [95%Conf. Interval]
> -------------+----------------------------------------------------------------
> time | 1.953367 .0157109 124.33 0.000 1.922565 1.984169
> _cons | -1.061051 .0456138 -23.26 0.000 -1.15048 -.9716229
> -------------+----------------------------------------------------------------
> rho | .4140074
> ------------------------------------------------------------------------------
> Durbin-Watson statistic (original) 0.925711
> Durbin-Watson statistic (transformed) 1.672284
>
> >
> > >
> > >
> > > Anirban
> > >
> > > ______________________________________
> > > ANIRBAN BASU
> > > Doctoral Student
> > > Harris School of Public Policy Studies
> > > University of Chicago
> > > (312) 563 0907 (H)
> > > ________________________________________________________________
> > >
> > >
> > > On Tue, 25 Jun 2002, Mark Schaffer wrote:
> > >
> > > > Hi everybody.
> > > >
> > > > Just a couple of clarifying details on -cluster- vs. -xtreg-
> > > and
> > > > Anirban's response to John.
> > > >
> > > > The -cluster- option for -regress- doesn't really impose a
> > > particular
> > > > within-cluster correlation structure on the data. If I
> > > understand it
> > > > correctly, what -cluster- does instead is loosen the usual
> > > assumption
> > > > of independence of observations to independence of clusters.
> > > The
> > > > correlation between observations within clusters can be
> > > arbitrary.
> > > > The way this works is basically by treating all the
> > > observations in a
> > > > cluster as a kind of "super-observation" and then applying
> > > the robust
> > > > ("sandwich") formula to these super-observations in order to
> > >
> > > > calculate the standard errors of the coefficients produced
> > > by -
> > > > regress-. See the manual entry for -regress-, p. 87.
> > > >
> > > > The estimated coefficients (the betas) produced by -regress-
> > > are the
> > > > same whether or not the -cluster- option is used; the only
> > > thing that
> > > > is different is the standard errors.
> > > >
> > > > With fixed effects, you _do_ impose a particular correlation
> > >
> > > > structure, namely all the observations within a cluster
> > > share U(k) in
> > > > Anirban's notation. If you use -xtreg- with -fe- to
> > > estimate, Stata
> > > > does not, however, use a first-difference estimator - it
> > > uses a fixed
> > > > effects estimator. In other words, it doesn't
> > > first-difference to
> > > > get rid of the fixed effects, it uses the mean-deviation
> > > > transformation to get rid of them.
> > > >
> > > > Hope this helps.
> > > >
> > > > --Mark
> > > >
> > > > Quoting anirban basu <[email protected]>:
> > > >
> > > > > Hi John,
> > > > >
> > > > >
> > > > > With reg command and cluster option, one basically imposes
> > > an
> > > > > exchangeable
> > > > > correlation structure on the data. i.e assume corr (y(i),
> > > > > y(j)) = rho,
> > > > > where i ne j and i,j are any two observation from the
> > > same
> > > > > cluster. Rho
> > > > > is constant for every pair of observation within a
> > > cluster.
> > > > > So, one can
> > > > > visuaize it in terms of a random effects model where :
> > > > >
> > > > > Y(k) = Xb + U(k) + e, where k represents clusters and U(k)
> > > is
> > > > > a
> > > > > cluster-specific random effect that is common to all
> > > > > observation in that
> > > > > cluster. However, -reg- does not give estimates of this
> > > random
> > > > > effect. It
> > > > > just estimates -betas- assuming this structure.
> > > > >
> > > > > However, this estimation is correct only if U(k) are
> > > > > uncorrelated with
> > > > > Xs. i.e. the unobserved characteristics of a cluster over
> > > time
> > > > > is
> > > > > uncorrelated with the X over time. If not then fixed
> > > effects
> > > > > is useful.
> > > > >
> > > > >
> > > > > With fixed effects, one evades the correlation problem by
> > > > > taking
> > > > > differences. i.e for any cluster k:
> > > > >
> > > > > Y(ik) - Y(1k) = [X(ik) - X(1k)]b + [e(ik) - e(1k)]
> > > > >
> > > > > Note that by taking the difference, the unobserved U(k) is
> > > > > eliminated.
> > > > > However, fixed effects assume the U(k) is fixed over time
> > > for
> > > > > any cluster
> > > > > k. i.e. the unobserved characteristics of a cluster is not
> > > > > changing over
> > > > > time. Also, since we are taking a difference, fixed
> > > effects
> > > > > model cannot
> > > > > estimate the betas for baseline covariates since they
> > > cancel
> > > > > out in the
> > > > > difference.
> > > > >
> > > > > Hope this helps,
> > > > >
> > > > > Anirban
> > > > >
> > > > >
> > > > >
> > > > > ______________________________________
> > > > > ANIRBAN BASU
> > > > > Doctoral Student
> > > > > Harris School of Public Policy Studies
> > > > > University of Chicago
> > > > > (312) 563 0907 (H)
> > > > >
> > > ________________________________________________________________
> > > > >
> > > > >
> > > > > On Tue, 25 Jun 2002, John Neumann wrote:
> > > > >
> > > > > > Hello all,
> > > > > >
> > > > > > Since I frequently see panel data questions flying
> > > around
> > > > > the
> > > > > > list, I'm thinking that some of you can provide me with
> > > a
> > > > > > very succinct answer to the following question, and in
> > > so
> > > > > > doing clarify conceptually for me the data-related
> > > issue:
> > > > > >
> > > > > > I have data on investment products, by year. Not all
> > > > > > products have data in each year. The dependent
> > > > > > variable is scaled in such a way as to make time series
> > > > > > variation in its levels of no concern. Here's the
> > > question:
> > > > > >
> > > > > > What is the difference between using the reg command,
> > > > > > with the robust and cluster option, vs. the xtreg
> > > command
> > > > > > fixed effects model? The cluster variable using reg
> > > would
> > > > > > naturally be the i( ) parameter for xtreg ...
> > > > > >
> > > > > > Thanks!
> > > > > >
> > > > > > John Neumann
> > > > > > Boston University
> >
> > Prof. Mark Schaffer
> > Director, CERT
> > Department of Economics, School of Management
> > Heriot-Watt University, Edinburgh EH14 4AS
> > tel +44-131-451-3494 / fax +44-131-451-3008
> > email: [email protected]
> > web: http://www.som.hw.ac.uk/ecomes
> > ________________________________________________________________
> >
> > DISCLAIMER:
> >
> > This e-mail and any files transmitted with it are confidential
> > and intended solely for the use of the individual or entity to
> > whom it is addressed. If you are not the intended recipient
> > you are prohibited from using any of the information contained
> > in this e-mail. In such a case, please destroy all copies in
> > your possession and notify the sender by reply e-mail. Heriot
> > Watt University does not accept liability or responsibility
> > for changes made to this e-mail after it was sent, or for
> > viruses transmitted through this e-mail. Opinions, comments,
> > conclusions and other information in this e-mail that do not
> > relate to the official business of Heriot Watt University are
> > not endorsed by it.
> > ________________________________________________________________
> >
>
> *
> * For searches and help try:
> * http://www.stata.com/support/faqs/res/findit.html
> * http://www.stata.com/support/statalist/faq
> * http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/