On Oct 25, 2004, at 10:05 PM, Clive Nicholas wrote:
This is such a basic problem, I don't understand why I can't find the
solution, but here goes.
	[snip -- just the relevant equations for now]
. reg growthpc lgrowth trade lowwage fdi spend left
. predict e, resid
(70 missing values generated)
	OK:  by construction -- by the very definition of an OLS residual -- e  
will be orthogonal (that is, uncorrelated) with ALL of the RHS  
variables above.  Stata neither knows nor cares what those RHS  
variables are or mean;  the vector that represents whatever is on the  
RHS *will* be uncorrelated with e.
. pwcorr e lgrowth trade lowwage fdi spend left, sig
           |        e  lgrowth    trade  lowwage      fdi    spend      
left
----------- 
+---------------------------------------------------------------
         e |   1.0000
           |
   lgrowth |  -0.0000   1.0000
           |   1.0000
           |
     trade |  -0.0000  -0.0787   1.0000
           |   1.0000   0.0553
           |
   lowwage |   0.0000   0.1663  -0.6208   1.0000
           |   1.0000   0.0000   0.0000
           |
       fdi |  -0.0000  -0.1124   0.3373  -0.2562   1.0000
           |   1.0000   0.0087   0.0000   0.0000
           |
     spend |   0.0000  -0.3736   0.5386  -0.4120   0.3258   1.0000
           |   1.0000   0.0000   0.0000   0.0000   0.0000
           |
      left |   0.0000   0.0088   0.1360  -0.1353   0.0552   0.1822    
1.0000
           |   1.0000   0.8314   0.0007   0.0008   0.1988   0.0000
	And, as expected -- nay, by definition -- e is uncorrelated with each  
of the RHS variables.   Sounds like a success!
No matter how small I make the model, I keep finding that the error is
perfectly uncorrelated with the LDV (-lgrowth- in this case).
	Sure, as long as lgrowth is contained in the list of regressors, it  
*must* be uncorrelated with the residual from that regression.
Indeed, it's perfectly uncorrelated with _everything!_
	Everything that is included in the regression, that is.  If a variable  
were excluded, that variable at least has a chance to be correlated  
with the residual.  (Albeit as a reflection of omitted variable bias,  
possibly....)
Exactly the same happens if I:  (1) restrict the model to just the  
first two explanatory variables;
	For the same reasons as put forth above:  as long as lgrowth is one of  
the regressors, OLS will return a residual series that is by  
construction uncorrelated with lgrowth.
(2) estimate it with, say, -areg- and then -predict e, resid-,
	Isn't areg just OLS with a bunch of dummy (i.e. categorical)  
variables?  Then the same explanation as above applies.
or; (3) if I change the -predict, resid- option to, say, -rstandard-  
(which barely changes the values in the correlation matrix).
	Sorry, couldn't find anything mentioned about -rstandard- in the  
on-line help.  But so long as -predict- is giving you OLS residuals,  
then this option doesn't contradict the above explanation.
I don't know about you, but I think all this is odd.
	I don't find the results odd, but I am a little uncertain what you are  
trying to do in the first place.  Your initial statement:
In preparing to muck around with some -ivreg2- test code, I've been
running some basic lagged-DV regressions in order to introduce an
instrument into an IV regression.
	...is unclear to me.  Perhaps I am not understanding your intentions.   
How do you see these regressions helping you to "introduce an  
instrument"?
                                        -- Mike
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/