|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: Survival analysis: repeated spells
Thank you very much for your reply!
My question was, in a sense, much more basic. When I say "repeated
spells", I simply mean multiple records per unit (under no subject
event dependence).
Therefore, putting it differently, my question is the following:
Compare
stset t
stcox X
with
stset t, id(var)
stcox X
Does the introduction of the option id() change the likelihood
function in stcox? Or it simply allows stata to cluster the standard
error by "var"?
Concerning the way I wrote the likelihood, maybe I'm wrong, but I
still think it is correct (notice that h is the hazard, not the
failure function)
Many thanks,
Vinicio
Il giorno 25/set/2009, alle ore 19.25, E. Paul Wileyto ha scritto:
You have some choices to make for modeling recurrent events. Stata
has many utilities for structuring the risk-set for survival
modeling, especially for multiple record data. Best thing is to go
to the survival manual for Stata, and look up the methods and
formulas section in STREG. Here is how you stated the log-
likelihood...
ln L(i)= c log h(t)+log S(t)
This should actually be
ln L(i)= c log h(t)+ (1-c) log S(t)
The manual adds one more term to make likelihood conditional on time
of entry into the risk set.
ln L(i)= c log h(t)+ (1-c) log S(t) - S(t0)
You will have to make a choice about how to represent time. For
models like Anderson-Gill, the only time=0 is when you enter the
cohort at the beginning. The time for your first event becomes the
entry time for the second event, and so on. Gap time models reset
the clock back to zero after each event, so t for later events
becomes (ti-t0), and the S(t0) term in the likelihood goes away.
Beyond that, you will have to adjust at least the standard errors,
either by some sort of clustering approach (vce(cluster) or
bootstrap), or using shared frailty.
I will say that STSET is unfriendly to certain types of multiple
record data. I have had bad luck trying to STSET data for gap time,
and usually have to do something heroic that side-steps STSET's
"idiot proofing." That is, I either have to create my multiple
records with an Anderson-Gill type structure, and then subtract out
the t0 values, or I have to treat it as single record data... STSET
only generates the risk-set representation of the data, and
clustering or shared frailty will work without giving the IDs to
STSET.
Paul
V. Martini wrote:
Hello,
this is maybe a trivial question for some of you:
how does stata account for the presence of repeated spells?
I know that, in order to account for different spells that refers
to the same unit, stata requires to specify the id( ) option when
the data are stset-ted. But my question is about how stata deals
with this problem in the estimation.
For instance, suppose the contribution to the loglikelihood (with
right censoring) is given by:
L(i)= c log h(t)+log S(t)
where c is a censoring variable (c=1 if spell is complete), h is
the hazard function and S the survival function.
How the contribution to the likelihood changes when unit i has
repeated spells? In my case this is relevant because I have to
program the likelihood function by myself.
Many thanks,
Vinicio
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
--
E. Paul Wileyto, Ph.D.
Assistant Professor of Biostatistics
Tobacco Use Research Center
School of Medicine, U. of Pennsylvania
3535 Market Street, Suite 4100
Philadelphia, PA 19104-3309
215-746-7147
Fax: 215-746-7140
[email protected]
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/