
Re: st: suggested references about the variables to include in zero-inflated portion of zinb?

From	Steven Samuels <[email protected]>
To	[email protected]
Subject	Re: st: suggested references about the variables to include in zero-inflated portion of zinb?
Date	Sun, 26 Oct 2008 11:05:43 -0400

Tim--the Subject of your last post was completely uninformative (st:Re: statalist-digest V4 #3224). If you receive the Digest, do not use the "Reply" button to respond.



I have a few thoughts:

1. The reviewer's original opinion is not correct. If your target parameter is the mean score, then OLS may give a consistent estimate, even if the data are skewed and non-normal. The proviso is that you have a good prediction model for the mean. However, the default OLS standard errors will be incorrect. The fix is easy: -reg- with the -robust- option will give standard errors that remain valid when the errors are non-normal and heteroskedastic.
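
A minimal sketch of point 1. The variable names (distress, age, female, hours_online) are hypothetical stand-ins for your own variables; only the command and the variance option come from the advice above.

```stata
* Hypothetical variables: distress is the 0-24 score, the rest are predictors.
* OLS for the mean score with heteroskedasticity-robust (sandwich) standard errors.
regress distress age female hours_online, vce(robust)
```

The coefficients are identical to plain -regress-; only the standard errors, test statistics, and confidence intervals change.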

2. Did you compare observed and expected values by eye and with a chi-square test? If the -zinb- fit is not good, there is little justification for using it.
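
One way to make the comparison in point 2, sketched under some assumptions: the variable names are hypothetical, the -inflate()- variables are placeholders and not a recommendation, and your Stata release must support the -pr(n)- statistic of -predict- after -zinb-.

```stata
* Hypothetical variables: distress (integer score 0-24), age, female, hours_online.
zinb distress age female hours_online, inflate(age female)

* One row per possible score 0..24 (needs at least 25 observations in memory)
generate byte score = _n - 1 in 1/25
generate double observed = .
generate double expected = .

forvalues k = 0/24 {
    local row = `k' + 1
    * Observed frequency of score k in the estimation sample
    quietly count if distress == `k' & e(sample)
    quietly replace observed = r(N) in `row'
    * Expected frequency: sum of fitted Pr(y = k) over the estimation sample
    quietly predict double pk if e(sample), pr(`k')
    quietly summarize pk, meanonly
    quietly replace expected = r(sum) in `row'
    drop pk
}

* Compare by eye
list score observed expected in 1/25, noobs

* Informal Pearson-type discrepancy between observed and expected frequencies
generate double contrib = (observed - expected)^2 / expected in 1/25
quietly summarize contrib, meanonly
display "Pearson-type statistic: " r(sum)
```

The fitted model puts some probability on counts above 24, so the expected frequencies will not sum exactly to the sample size. Treat the statistic as a rough guide; its reference distribution is only approximate when parameters are estimated and some cells are sparse.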

3. If, by chance, -zinb- happens to give a good fit, standard errors based on the ZINB model will still be wrong. You should use the -robust- option or a bootstrap, as Carlo suggested.
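
Both alternatives in point 3 can be requested through the -vce()- option. This is a sketch only: the variable names are hypothetical, the -inflate()- variables are placeholders, and the number of replications and the seed are arbitrary choices.

```stata
* Hypothetical variables, as in a typical -zinb- call.
* Robust (sandwich) standard errors
zinb distress age female hours_online, inflate(age female) vce(robust)

* Bootstrap standard errors; reps() and seed() values are illustrative
zinb distress age female hours_online, inflate(age female) ///
    vce(bootstrap, reps(500) seed(12345))
```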

4. Published analyses of CESD with the zero-inflated negative binomial are not, in themselves, justification for using -zinb- in your problem. Did the published distributions fit the data? I've done analyses with full and reduced versions of the CESD. In one data set and in national data the distribution was quite symmetric. In another data set the distribution was bimodal. (I think this was an interviewer problem.) In neither case was there a lump at the minimum (or maximum) value. In fact, the extreme responses were the rarest ones.

5. If you do see lumps at the extremes, consider that they may be dishonest responses. Why? With count data, a separate model for responding at all is plausible. With questionnaire scales, a minimum or maximum score is the result of a respondent checking the same value for every item. (I use the word "lumps", but in the statistical literature, isolated higher density regions are usually called "bumps".)

6. If you want to fit the distribution of scores, as opposed to predicting means, the beta distribution may provide a good approximation. Divide the scores by the maximum possible, so that the results are proportions. Then download -betafit- from SSC. You will need to add a small constant to the zeros and subtract it from the ones before you do your regressions.
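
The steps in point 6 might look like this. The variable names are hypothetical, the maximum of 24 is taken from the score range in the original question, and the constant 0.001 is an arbitrary illustrative value; results may be sensitive to it.

```stata
* Install the user-written command once
ssc install betafit

* Hypothetical variable: distress is the 0-24 score. Rescale to a proportion.
generate double prop = distress / 24

* The beta density is defined on the open interval (0,1), so move the
* boundary values inward by a small constant (0.001 is an assumption).
replace prop = prop + 0.001 if distress == 0
replace prop = prop - 0.001 if distress == 24

* Beta regression with covariates on the mean (mu/phi parameterization)
betafit prop, muvar(age female hours_online)
```

It is worth refitting with a different constant to see how much the estimates move.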





-Steve

I am using zinb to estimate level of psychological distress (scores range from 0-24) using various demographic variables and measures of use of the Internet. I've used -countfit- to compare various count models and the results support zinb as the best fitting model.
I am uncertain, however, about how to justify the variables that I include in the zero-inflated part of the model. I've read journal articles that have used zinb, read the book by Freese and Long, and searched the Internet and Statalist but I have not been able to find any detailed recommendations or procedures. Can anyone suggest any other sources (books or journals) that provide an explanation or a good example of this process?
Ideally I would like to find a good source that I can cite in the paper -- but I appreciate any suggestions about this you might have.
Thanks for your help,
Tim

-----------------------------------------------------
Timothy M. Hale, MA
Graduate Assistant
University of Alabama at Birmingham
Department of Sociology
email:  [email protected]

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



Follow-Ups:
- Re: st: suggested references about the variables to include in zero-inflated portion of zinb?
  - From: Maarten buis <[email protected]>

References:
- st: suggested references about the variables to include in zero-inflated portion of zinb?
  - From: Tim Hale <[email protected]>
