Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Coding overlap events in sequence data

From	Nick Cox <[email protected]>
To	"[email protected]" <[email protected]>
Subject	Re: st: Coding overlap events in sequence data
Date	Wed, 13 Nov 2013 08:56:27 +0000

This problem would be easier after a -reshape long-. Your present data
structure makes it really difficult.

After that, check out

-spellutil- (SSC)

-disjoint- (SSC)

which may help.
Nick
[email protected]


On 12 November 2013 22:26, Cheng, Hsu-Chih <[email protected]> wrote:
> Dear All:
>
> I am coding sequence data for 5013 respondents’ sexual relationship histories.  For each respondent, I have 16 time positions (ages 18~18.25 [sext1], 18.25~18.50 [sext2],…, 21.75~22 [sext16]) and the respondent’s beginning and end ages of up to 48 relationships (most respondents have fewer than 5 relationships; so the beginning and end ages of the other 40+ relationships have missing values). Right now, respondents with multiple relationships in a given time position are coded as 4 (so, for example, sext5 = 4).  I can manually go through all respondents to determine whether the multiple relationships in a given time overlap or not and recode them into different categories, but this is very tedious and time consuming.  Is there a faster way to do this?
>
> Here are four examples with multiple relationships in Time 2 (ages 18.25~18.50).  sexBag1 and sexEag1 indicate the beginning and end ages of relationship 1; sexBag2 and sexEag2 indicate the beginning and end ages of relationship 1;…, and so on.  I want to recode [sext2] for Cases 1 and 2 as 1 to indicate that their relationships in Time 2 do not overlap, and Cases 3 and 4 as 2 to indicate their relationships in Time 2 overlap.
>
> id  sext2   sexBag1  sexEag1  sexBag2  sexEag2  sexBag3 sexEag3  sexBag4  sexEag4  sexBag5    sexEag5
> 1       4      18.5    18.75   18.333   18.416   21.416    21.5    19.25   24.083        .          .
> 2       4         .        .   18.250   18.333   18.416  21.666   21.583   22.833     22.5   22.50004
> 3       4    17.249   18.999   18.499   21.999   22.166  23.249   (missing values after this)
> 4       4    16.750   22.750   18.416   18.666   (missing values after this)
>
> I really appreciate if anyone can give me some suggestions. I can provide more information about the data if needed. Thanks again.
>
> Best,
>
> Simon
>
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/faqs/resources/statalist-faq/
> *   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: Coding overlap events in sequence data
  - From: Nick Cox <[email protected]>

References:
- st: Coding overlap events in sequence data
  - From: "Cheng, Hsu-Chih" <[email protected]>

Prev by Date: Re: st: xtivreg, re: Autocorrelation & heteroscedasticity
Next by Date: st: estimating treatment effect of a binary endogenous regressor on binary outcome
Previous by thread: st: Coding overlap events in sequence data
Next by thread: Re: st: Coding overlap events in sequence data
Index(es):
- Date
- Thread