On 2 Feabh 2010, at 15:09, muhammed abdul khalid wrote:
I have a household income survey data ( 38,000 observations), and my
problem is doing a multiple regression on saving ( independent var) to
ethnicity/strata/employment
etc( dependent var).
The problem is this : 70% of my observation for the value of saving is
zero.
Then this sounds like a two-stage model. The first model predicts
whether the family saves or not, the second predicts how much they
save, conditional on their saving at all. I would be wary of
predicting everything in a single model - the determinants of how much
may not be the same as the determinants of whether.
Ronan Conroy
=================================
[email protected]
Royal College of Surgeons in Ireland
Epidemiology Department,
Beaux Lane House, Dublin 2, Ireland
+353 (0)1 402 2431
+353 (0)87 799 97 95
+353 (0)1 402 2764 (Fax - remember them?)
http://rcsi.academia.edu/RonanConroy
P Before printing, think about the environment
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/