Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: creating panel of household surveyed in different year
From
Chamara Anuranga <[email protected]>
To
[email protected]
Subject
Re: st: creating panel of household surveyed in different year
Date
Wed, 22 May 2013 11:16:25 +0530
Dear Prarkash
There are different way you can do same thing on Stata. If your
dataset organized like this I suggest to do the following
bys hhid: gen numsurvey=_N
bys hhid: egen minyear=min(survey_year)
gen entry=survey_year==minyear
bys hhid: egen maxyear=max(survey_year)
gen exit=survey_year==maxyear
numsurvey gives the number of times each household appear on the survey
entry is 1 if the household appear on the survey at first time and otherwise 0
exit is 1 if the household surveyed at the last time
do the label defne for entry and exit variables.
label define yesno 1 "Yes" 2 "No"
label val entry yesno
label val exit yesno
if household only survey once entry and exit both get 1 (yes). However
you can change the variable way you prefer base on numsurvey
variable.
Hope this help.
Thanks,
Chamara
On Wed, May 22, 2013 at 10:19 AM, Prakash Singh <[email protected]> wrote:
> Thanks Chamara and Nick
>
> Nick, I am providing the id of first ten household surveyed in 1997
> and 2002 below.
>
> hhid survey_year entry exit
> 181004 1997
> 181007 1997
> 181113 1997
> 181801 1997
> 182003 1997
> 182601 1997
> 182615 1997
> 182711 1997
> 182716 1997 yes
> 182803 1997 yes
> 181001 2002 yes
> 181004 2002
> 181007 2002
> 181113 2002
> 181801 2002
> 182003 2002
> 182201 2002 yes
> 182601 2002
> 182615 2002
> 182711 2002
>
> Now if you look at the id, household no 181001 and 182201 were not
> part of 1997 survey household no 182716 and 182803 did not
> participated in the 2002 survey.
>
> My interest is first to generate one variable which identifies
> households participated in all the survey; second variable identifying
> new household in the survey and finally third variable identifying
> household not participated in survey.
> There are two more rounds of data which I am extracting still.
>
> I hope I have made progress in expressing my query.
>
>
>
> Prakash
>
> On Tue, May 21, 2013 at 5:30 PM, Nick Cox <[email protected]> wrote:
>> My own guess is that Prakash's previous post was ignored because it
>> was too vague about precise data structure and the revised post
>> doesn't add much. At least that is why I deleted it. A specific
>> example showing what you have is usually preferable to a long verbal
>> discussion.
>>
>> The solution below seems unnecessarily complicated. Splitting the
>> dataset into three and then -merge-ing them back again is only
>> possible if there is some identifier in the dataset that tells you
>> which survey round is being referred to. Why not just do it in place?
>>
>> At its simplest the number of rounds in which each household
>> participated may just be the number of times the household appears in
>> the dataset. Otherwise there should be some round identifier. There
>> seems little point in speculating about variables, as Prakash can
>> (please) give concrete details.
>>
>> Same applies to entry and exit: show us how the data are held, and
>> specific suggestions are then much easier.
>>
>> Nick
>> [email protected]
>>
>>
>> On 21 May 2013 11:42, Chamara Anuranga <[email protected]> wrote:
>>> Dear Prakash,
>>>
>>> keep id variable in each survey and create new variable to identify each survey.
>>> for the dataset 1
>>> gen svyname1="survey1"
>>> for dataset 2
>>> gen svyname2="survey2"
>>>
>>> etc.
>>> now you have 3 datasets. Merge them base on id. check the missing for svynames
>>>
>>> egen totmiss=rowmiss(svyname*)
>>>
>>> if rowmiss if 0 it mean those household appears in 3 rounds.
>>>
>>>
>>> Thanks
>>> Chamara
>>>
>>>
>>>
>>>
>>> On Tue, May 21, 2013 at 3:57 PM, Prakash Singh <[email protected]> wrote:
>>>> Hello every one
>>>> I had sent mail earlier also but may the subject was not appropriate
>>>> to draw attention so I am sending this again with revised subject.
>>>>
>>>> I am working with survey dataset of more than three rounds. The
>>>> identification code for each household is similar in all the rounds.
>>>>
>>>> Now there are some households which are surveyed in all the years,
>>>> there are some households surveyed in some year but not in other years
>>>> (did not participated in the survey). Now I want to map the households so that
>>>> I can know which household is surveyed in more than two rounds and
>>>> also want create panel of household which are surveyed in all the
>>>> years.
>>>>
>>>> I am also interested in entry and exit of households, where entry
>>>> means new household coming in the subsequent round of survey and exit
>>>> means leaving the survey in the subsequent round of survey.
>>>>
>>>> Please suggest how should I workout this problem.
>> *
>> * For searches and help try:
>> * http://www.stata.com/help.cgi?search
>> * http://www.stata.com/support/faqs/resources/statalist-faq/
>> * http://www.ats.ucla.edu/stat/stata/
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/