Dear statalisters,
My dependent variable is a proportion (the percentage of 16-year-olds
enrolled in a particular subject) which ranges from 0 to 86 percent. I
am not sure that a linear specification is appropriate. The dependent
variable is 0 in only 3,980 of the 112,412 observations in my sample.
These zeros are structural: they occur because the school does not
offer history (which is an optional subject).
Could somebody suggest whether it would be better to apply a logit
transformation to the dependent variable, or to estimate -glm- with
family(gaussian) and link(logit)? Any suggestion would be greatly
appreciated!
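
To make the two options concrete, here is a rough sketch of what I have
in mind, run on simulated data rather than my real sample. The variable
names (pct_hist, p, logit_p, x1) and the simulated values are
placeholders I made up for illustration only.

```stata
* Simulated stand-in data; all names and values here are hypothetical
clear
set seed 12345
set obs 1000
generate double x1 = rnormal()
generate double pct_hist = 100 * invlogit(-1 + 0.5*x1 + rnormal(0, 0.5))
replace pct_hist = 0 in 1/35          // mimic the structural zeros

* Rescale the percentage to a proportion in [0, 1)
generate double p = pct_hist / 100

* Option 1: logit-transform the outcome, then fit by OLS.
* logit(0) is missing, so the zero observations are dropped.
generate double logit_p = logit(p)
regress logit_p x1

* Option 2: model the mean of p directly with a logit link.
* The zeros stay in the estimation sample here.
glm p x1, family(gaussian) link(logit)
```

As far as I can tell, the practical difference is that option 1 loses
the zero observations (the logit of 0 is undefined) and models the mean
of the transformed outcome, whereas option 2 keeps them and models the
mean of the proportion itself.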
Thank you in advance!
Shqiponja