Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
RE: st: Store datafile at minimum possible file size
From
"Nick Cox" <[email protected]>
To
<[email protected]>
Subject
RE: st: Store datafile at minimum possible file size
Date
Fri, 16 Apr 2010 12:28:51 +0100
Note that Stata 11 has an embedded -zipfile- command.
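As a rough illustration of that command, the sketch below builds a small dataset, zips it from within Stata, and restores it. The file name zip_demo is hypothetical and chosen only for this example; -zipfile- and -unzipfile- are assumed to be available (Stata 11 or later).

```stata
* Minimal sketch: compress a saved dataset with -zipfile- and restore it.
* "zip_demo" is a hypothetical file name used only for illustration.
clear
set obs 1000
gen str12 var1 = "abcdefghijkl"
gen var3 = rnormal()
save zip_demo, replace

* Write zip_demo.dta into a zip archive in the working directory
zipfile zip_demo.dta, saving(zip_demo.zip, replace)

* Delete the uncompressed copy, then restore it from the archive
erase zip_demo.dta
unzipfile zip_demo.zip, replace
use zip_demo, clear
```

The data must be unzipped again before -use- can read it, so this suits archiving or transfer rather than day-to-day work.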
Nick
[email protected]
-----Original Message-----
From: [email protected]
[mailto:[email protected]] On Behalf Of Abdel Rahmen El Lahga
Sent: 16 April 2010 12:16
To: [email protected]
Subject: Re: st: Store datafile at minimum possible file size
In my experience the best solution is to zip the Stata file directly,
without converting it to ASCII format. If we try
************
clear all
* allocate memory (Stata 11 and earlier)
set mem 2g
set obs 10000000
gen str12 var1 = "abcdefghijkl"
gen str14 var2 = "abcdefghijklmn"
gen var3 = rnormal()
gen var4 = runiform()
* store each variable in the smallest type that loses no information
compress
qui save data_boehm, replace
* write the same data as an ASCII file for comparison
outfile using "ascii_boehm.raw", replace
ls
-rw-r--r-- 1 abdelrahmenlahga staff 550000000 Apr 16 12:05 ascii_boehm.raw
-rw-r--r-- 1 abdelrahmenlahga staff 340000912 Apr 16 12:00 data_boehm.dta
*********
As you can see, the ASCII file is much bigger than the Stata one
(550 MB versus 340 MB). You can zip the Stata file to reduce its size
to about 85 MB.
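The two sizes in the listing can be checked with a little arithmetic. The sketch below assumes the usual storage widths (strN takes N bytes, and a float, the default type created by -generate-, takes 4 bytes); the local macro names are only illustrative.

```stata
* Back-of-the-envelope check of the file sizes reported by -ls-.
* Assumed widths: str12 = 12 bytes, str14 = 14 bytes, float = 4 bytes.
local n     = 10000000
local width = 12 + 14 + 4 + 4

display "bytes per observation in the .dta file: " `width'
display "data bytes in the .dta file:            " %12.0f `n' * `width'
display "remaining bytes in the .dta file:       " %12.0f 340000912 - `n' * `width'
display "bytes per line in the ASCII file:       " 550000000 / `n'
```

Under these assumptions the .dta file needs 34 bytes per observation, which accounts for all but 912 bytes of its size (presumably the header and variable metadata), while the ASCII file uses 55 bytes per line.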
HTH
AbdelRahmen
2010/4/16 Michael Boehm <[email protected]>:
> Dear statalisters,
>
> I have a big datafile (2 string variables, 2 numeric one, 10million
> observations) created in Stata and want to save it in the smallest
> version possible. I have tried out the "outfile" command to write it
> as an ASCII, but as far as I understand this doesn't give me the
> smallest ASCII file possible. Can anyone help?