Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: AW: st: RE: AW: Re:
From
GORLE Steven <[email protected]>
To
[email protected], [email protected]
Subject
Re: AW: st: RE: AW: Re:
Date
Wed, 24 Feb 2010 14:27:42 +0100
Dear,
You can allocate more memory to Stata. The max memory that you can
allocate will depend on the RAM available on your machine.
Of course, you also need a min of RAM for windows otherwise it will
start increasing the paging file.
In Linux this is less problematic.
**
eg.
********
set memory 1500m
********
**
see FAQ
http://www.stata.com/support/faqs/data/setmemory.html
The new Stata MP version will use the power of multiple cores in a more
efficient way.
Kind regards,
Steven Gorlé
Op 24/02/2010 14:12, [email protected] schreef:
Another reason for splitting up the dataset is that it´s simply huge - it has
got about 250,000 observations, whereas the sub-datasets have only about 3,000.
Would there also have been a possibility for processing such a file, for example
by assigning more memory to Stata?
I suppose that I have to learn all that the hard way...
And thanks for the other hints, I´ve implemented a double loop now and thus
simplified my do-file considerably!
Best,
Andi
Zitat von Martin Weiss<[email protected]>:
<>
" The reason why I have so many different datasets is that I read somewhere
that
Stata cannot filter observations properly, i.e. without deleting the
remaining
dataset."
Not so, as you correctly guessed. Use the -if- qualifier!
" With regard to the centile(5 50 95) command, it has only been a bit of
laziness
that I didn´t try to omit the 5 and 50-centiles - as soon as it worked, I
was
happy :-)"
Omit 5 and 50, and then use - post internal (r(lb_1)) (r(c_1)) (r(ub_1))-
for the -postfile-
HTH
Martin
-----Ursprüngliche Nachricht-----
Von: [email protected]
[mailto:[email protected]] Im Auftrag von
[email protected]
Gesendet: Mittwoch, 24. Februar 2010 01:07
An: [email protected]
Betreff: RE: st: RE: AW: Re:
As I told in my first post, I´m an absolute Stata rookie.
The reason why I have so many different datasets is that I read somewhere
that
Stata cannot filter observations properly, i.e. without deleting the
remaining
dataset. That´s probably nonsense... So in order to deduce conditional
statements, I split the original (huge) dataset into numerous sub-datasets.
With regard to the centile(5 50 95) command, it has only been a bit of
laziness
that I didn´t try to omit the 5 and 50-centiles - as soon as it worked, I
was
happy :-)
And by the way: Thanks a lot for the hint with the loop - I´m gonna try it
right
away! Here too, someone told me that looping within a filename wasn´t
possible...
Best,
Andi
Zitat von Martin Weiss<[email protected]>:
<>
It is good to hear you are satisfied with the solution, but I am not quite
sure what your data structure looks like. You request three centiles from
Stata with your call -centile PF_norm, centile(5 50 95)-, but you store
only
one of them, the 95% one, via -postfile-. This is your prerogative, but it
seems inefficient to me. What are the 5% and 50% quantile requests good
for
in your call?
You said in an earlier post that you have to repeat this process
frequently,
so you may want to know that you can employ a loop to call your datasets:
*******
forv i=1/2{
use "C:\...\ALL_complete_1.`i'.dta"
//... your other commands
}
*******
The endpoint for the index here is 2, you can enter the appropriate number
yourself.
Does each of those datasets really only contain one interesting variable?
Why are they dispersed across many datasets (just being curious)?
HTH
Martin
-----Original Message-----
From: [email protected]
[mailto:[email protected]] On Behalf Of
[email protected]
Sent: Dienstag, 23. Februar 2010 19:03
To: [email protected]
Subject: Re: st: RE: AW: Re:
Hey guys,
sorry for having bothered you again, I found the solution in the meantime!
In case someone is interested, here´s the syntax:
log using test, replace
use "C:\...\ALL_complete_1.1.dta"
centile PF_norm, centile(5 50 95)
postfile internal lower_ci95 centile_95 upper_ci95 using "filename",
replace
post internal (r(lb_3)) (r(c_3)) (r(ub_3))
clear
use "C:\...\ALL_complete_1.2.dta"
centile PF_norm, centile(5 50 95)
post internal (r(lb_3)) (r(c_3)) (r(ub_3))
postclose internal
clear
log close
Still thanks for the help you gave me so far!
Andreas
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/