Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: AW: st: RE: AW: Re:
From
[email protected]
To
[email protected]
Subject
Re: AW: st: RE: AW: Re:
Date
Wed, 24 Feb 2010 14:12:13 +0100
Another reason for splitting up the dataset is that it´s simply huge - it has
got about 250,000 observations, whereas the sub-datasets have only about 3,000.
Would there also have been a possibility for processing such a file, for example
by assigning more memory to Stata?
I suppose that I have to learn all that the hard way...
And thanks for the other hints, I´ve implemented a double loop now and thus
simplified my do-file considerably!
Best,
Andi
Zitat von Martin Weiss <[email protected]>:
>
> <>
>
>
> " The reason why I have so many different datasets is that I read somewhere
> that
> Stata cannot filter observations properly, i.e. without deleting the
> remaining
> dataset."
>
> Not so, as you correctly guessed. Use the -if- qualifier!
>
>
> " With regard to the centile(5 50 95) command, it has only been a bit of
> laziness
> that I didn´t try to omit the 5 and 50-centiles - as soon as it worked, I
> was
> happy :-)"
>
> Omit 5 and 50, and then use - post internal (r(lb_1)) (r(c_1)) (r(ub_1))-
> for the -postfile-
>
>
>
> HTH
> Martin
>
>
> -----Ursprüngliche Nachricht-----
> Von: [email protected]
> [mailto:[email protected]] Im Auftrag von
> [email protected]
> Gesendet: Mittwoch, 24. Februar 2010 01:07
> An: [email protected]
> Betreff: RE: st: RE: AW: Re:
>
> As I told in my first post, I´m an absolute Stata rookie.
> The reason why I have so many different datasets is that I read somewhere
> that
> Stata cannot filter observations properly, i.e. without deleting the
> remaining
> dataset. That´s probably nonsense... So in order to deduce conditional
> statements, I split the original (huge) dataset into numerous sub-datasets.
>
> With regard to the centile(5 50 95) command, it has only been a bit of
> laziness
> that I didn´t try to omit the 5 and 50-centiles - as soon as it worked, I
> was
> happy :-)
>
> And by the way: Thanks a lot for the hint with the loop - I´m gonna try it
> right
> away! Here too, someone told me that looping within a filename wasn´t
> possible...
>
>
> Best,
> Andi
>
>
> Zitat von Martin Weiss <[email protected]>:
>
> >
> > <>
> >
> > It is good to hear you are satisfied with the solution, but I am not quite
> > sure what your data structure looks like. You request three centiles from
> > Stata with your call -centile PF_norm, centile(5 50 95)-, but you store
> only
> > one of them, the 95% one, via -postfile-. This is your prerogative, but it
> > seems inefficient to me. What are the 5% and 50% quantile requests good
> for
> > in your call?
> >
> >
> > You said in an earlier post that you have to repeat this process
> frequently,
> > so you may want to know that you can employ a loop to call your datasets:
> >
> >
> > *******
> > forv i=1/2{
> > use "C:\...\ALL_complete_1.`i'.dta"
> > //... your other commands
> > }
> > *******
> >
> > The endpoint for the index here is 2, you can enter the appropriate number
> > yourself.
> >
> > Does each of those datasets really only contain one interesting variable?
> > Why are they dispersed across many datasets (just being curious)?
> >
> >
> > HTH
> > Martin
> >
> > -----Original Message-----
> > From: [email protected]
> > [mailto:[email protected]] On Behalf Of
> > [email protected]
> > Sent: Dienstag, 23. Februar 2010 19:03
> > To: [email protected]
> > Subject: Re: st: RE: AW: Re:
> >
> > Hey guys,
> >
> >
> > sorry for having bothered you again, I found the solution in the meantime!
> > In case someone is interested, here´s the syntax:
> >
> >
> > log using test, replace
> > use "C:\...\ALL_complete_1.1.dta"
> > centile PF_norm, centile(5 50 95)
> > postfile internal lower_ci95 centile_95 upper_ci95 using "filename",
> replace
> > post internal (r(lb_3)) (r(c_3)) (r(ub_3))
> > clear
> > use "C:\...\ALL_complete_1.2.dta"
> > centile PF_norm, centile(5 50 95)
> > post internal (r(lb_3)) (r(c_3)) (r(ub_3))
> > postclose internal
> > clear
> > log close
> >
> >
> > Still thanks for the help you gave me so far!
> > Andreas
> > *
> > * For searches and help try:
> > * http://www.stata.com/help.cgi?search
> > * http://www.stata.com/support/statalist/faq
> > * http://www.ats.ucla.edu/stat/stata/
> >
> >
> > *
> > * For searches and help try:
> > * http://www.stata.com/help.cgi?search
> > * http://www.stata.com/support/statalist/faq
> > * http://www.ats.ucla.edu/stat/stata/
> >
>
>
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/statalist/faq
> * http://www.ats.ucla.edu/stat/stata/
>
>
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/statalist/faq
> * http://www.ats.ucla.edu/stat/stata/
>
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/