Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
From | tashi lama <ltashi32@hotmail.com> |
To | <statalist@hsphsun2.harvard.edu> |
Subject | RE: st: Arranging variables across rows |
Date | Wed, 27 Jun 2012 17:38:59 +0000 |
I am just curious.....the macros are declared to substitute "_n"(obs no). I would assume that we write replace D[`i'], A[`j'] and B[`k'] instead of D`i', A`j' etc. Am I missing sth here? Thanx ---------------------------------------- > Date: Wed, 27 Jun 2012 18:26:18 +0100 > Subject: Re: st: Arranging variables across rows > From: njcoxstata@gmail.com > To: statalist@hsphsun2.harvard.edu > > Given your determination to keep your present data structure, I can > also suggest a clumsy way to do it. The precise code is untested. > > forval i = 1/19 { > gen D`i' = 0 > forval j = 1/19 { > forval k = 1/8 { > replace D`i' = 1 if A`j' == B`k' & B`k' < . > } > } > } > > Nick > > On Wed, Jun 27, 2012 at 5:10 PM, samuel gyetvay <sam.gyetvay@gmail.com> wrote: > > Thank you for all your help, I feel I am already making some progress. > > > > I realize, however, that I was not perfectly clear or explicit about > > the form or structure of my data. Your responses so far are still very > > useful and appropriate. > > > > A1, ... A19, B1, ... B8 are names of (column) variables, each of which > > contain ~12,000 entries. There exists a variable that identifies > > families, call it fam, although it does not always identify family > > uniquely. There does exist a variable that identifies family > > respondent, however, call it famr. fam and famr are also column > > variables, each of which contain ~12,000 entries. > > > > For the moment, my analysis is quite simple; it only requires some row > > and column summation, and the current form of the data is quite nice > > for what I have to do (for the most part). Nearly all of the variables > > that I need to use are just like A1, ... A19. For example, a set of > > variables that identify "sex of child," call them C1, ... C19 overlap > > perfectly with A1, ..., A19, and you can easily control for sex of > > child when doing row and column summation, because their positions > > coincide. For example, the third value of C13 will identify the sex of > > the same child as the third value of A13. > > > > B1, ... B8 is the only problematic variable, because it does not > > overlap: it merely indicates which child identification number > > received treatment. So you cannot use a simple command such as > > > > . count if A`j' == B`j' > > > > Which I can do for other variables. What I would like to do is to > > create dummy variables D1, ..., D19 equal to 1 when the child has > > received treatment, and 0 otherwise. That way I can use it just like I > > used C1, ..., C19. > > > > I apologize for not being more clear from the start, and appreciate > > everything you have suggested thus far. > * > * For searches and help try: > * http://www.stata.com/help.cgi?search > * http://www.stata.com/support/statalist/faq > * http://www.ats.ucla.edu/stat/stata/ * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/