1998 UK Stata Users Group meeting

Home / Resources & support / Users Group meetings / 1998 UK Stata Users Group meeting / Abstract

Random effects Poisson regression and recurrent events data

Speaker: David Clayton, Institute of Public Health

With recurrent event data the total number of events for a subject will usually show dispersion between subjects which is greater than that expected from a Poisson distribution of events. One way to deal with this is to fit a Poisson regression model using the glm procedure, but to allow the scale factor to depart from 1.0. An alternative approach is to fit the Poisson model but to use a "robust" information-sandwich estimate of the variance-covariance matrix of estimates. These are provided by the author’s hglm function or by Roger Newson's more recent rglm function.

However, both of these approaches are only efficient if the variance of the event count is proportional to its expectation, and most realistic models for overdispersion do not make this prediction. It is natural to assume that overdispersion is the result of the effects of unmeasured covariates. This leads to a mixed model in which random and fixed effects are additive on the same scale. For the log link function (so that the random effects act as random multipliers or "frailties") and with Poisson errors this leads to the variance function

where is the frailty variance. When the frailties are distributed according to a Gamma distribution, the counts are negative binomial and the model can be fitted using nbreg. Alternatively, fixed effects can be estimated in glm using a negative binomial error structure with specified . These calculations can be alternated with estimation of using a program nbpar which can estimate either using ML, maximum pseudo-likelihood, or a pseudo-REML method. Alternatively this double iteration has been implemented in the single program rpoisson. The command rpoisson is introduced to fit this model, and has the option to estimate either by full maximum-likelihood, based on the assumption that frailties have a Gamma distribution, or by using a pseudo-likelihood for and quasi-likelihood for the regression parameters. With the maximum-likelihood option rpoisson is equivalent to the Stata command nbreg.

When the follow-up period is split into several bands either because we need to allow for time-varying covariates or for the variation of rates with time itself, subject heterogeneity introduces not only overdispersion but correlation between the counts for different time periods on the same subject. This can be dealt with in three ways:

Analyze the counts with hlgm or rglm, using the cluster option to ensure that the information-sandwich estimate is robust not only to misspecification of the variance function but also to correlations within clusters (here identified with subjects).
Analyze the marginal counts with xtpois and use the exchangeable correlation structure. This is an interface to the xtgee function. Note that approach (Clayton 1994) is also a special case of this, with independence correlation structure.
Model the counts at an individual level, using a random frailty effect for each record, and allowing records from the same subject to share the same frailty parameter. This leads to a multivariate extension of the variance-mean relationship which includes covariances (Clayton 1994). The model can also be fitted using rpoisson with a cluster option, which ensures that the same frailty is shared between all units within the same cluster.

Note that the first two approaches are again only efficient if the variance is proportional to the mean. rpoisson on the other hand uses variancee–covariance structures suggested by additivity of fixed and random effects on the log scale; it too allows robust estimation of standard errors using the information sandwich.

These techniques are compared using the epilepsy data (Thall and Vail 1990).

References

Clayton, D. 1994.: Some approaches to the analysis of recurrent event data. Statistical Methods in Medical Research 3: 244–262.

Thall, P. F. and S. C. Vail. 1990.: Some covariance models for longitudinal count data with overdispersion. Biometrics 46: 657–71.

We use cookies

We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.

Cookie Settings

Last updated: 16 November 2022

StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.

Advertising and performance cookies

This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.

Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.

Random effects Poisson regression and recurrent events data

References

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies

Stata/MP4 Annual License (download)

Random effects Poisson regression and recurrent events data

References

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies