Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: time-series data identified by three variables
From
YANNAN SHEN <[email protected]>
To
"[email protected]" <[email protected]>
Subject
Re: st: time-series data identified by three variables
Date
Wed, 28 Nov 2012 04:44:00 -0500
Dear Nick,
Thank you for your reply!
For the first code do you mean
> bysort illness_id date of visit : egen meansev = mean(severity)
Because all I care is on a certain date, what is the average severity across all patient visited for a certain disease.
And how exactly does "bysort" work? Does it treat the argument variables as a group?
For instance if date=June5; day=5, month=june
Is bysort month day equivalent to bysort date?
Sent from my iPhone
On Nov 28, 2012, at 4:21 AM, Nick Cox <[email protected]> wrote:
> You want commands like
>
> bysort patient_id illness_id date of visit : egen meansev = mean(severity)
> by patient_id illness_id : gen repeat = _n - 1
>
> as you want to number 0 upwards.
>
>
> Nick
>
> On Wed, Nov 28, 2012 at 6:28 AM, yannan shen <[email protected]> wrote:
>
>> I am working some panel data of hospital visits and I want to learn
>> the severity of various disease.
>> The variables I have in the dataset are: patient_id, illness_id,
>> date_of_visit, severity
>> each observation contains: patient_id, illness_id, date_of_visit, severity.
>>
>> For each patient (identified by patient_id), I want to know how many
>> of times he has visited for the same illness (illness_id ).
>> I use the duple command to to label the observation of patients who
>> have visited hospital more than once.
>>
>>> duplicates tag patient_id illness_id , generate(duple)
>>
>> However, duple does not give information for any time series
>> information. If a patient has 5 visiting records, I want to be able to
>> know which is the 0th repeat, 1st repeat, 2nd repeat, 3rd repeat, and
>> 4th repeat...I have a vague feeling that I can order those variables
>> via date_of_visit but I am still not sure how exactly that can be
>> done.
>>
>> Furthermore, I want to create two new variables: one variable equals
>> to the average severity of each disease (disease_id) being treated on
>> the same date_of_visit. The other variable equals the highest severity
>> of a certain disease being treated on that day. (Ideally, I want to
>> create additional variables for each observation)
>>
>> I have used “bysort” in the past but since now the type is a
>> combination of illness_id and date_of_visit, I am a little confused.
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/