I am confused with the size of my data files. Here's why:
I have two files: <b.dta> with 6,771,434 bytes on disk and <b1.asc>
3,385,856 bytes (size according to Win Commander).
Stata however says:
. uc b
. d
Contains data from b.dta
obs: 1,692,789
vars: 1 5 Jul 2002 16:40
size: 13,542,312 (87.1% of memory free)
----------------------------------------------------------------------------
---
storage display value
variable name type format label variable label
----------------------------------------------------------------------------
---
b float %9.0g
----------------------------------------------------------------------------
---
Sorted by:
. uc b1
. d
Contains data from b1.dta
obs: 1,692,789
vars: 1 5 Jul 2002 16:41
size: 10,156,734 (90.3% of memory free)
----------------------------------------------------------------------------
---
storage display value
variable name type format label variable label
----------------------------------------------------------------------------
---
b int %9.0g
----------------------------------------------------------------------------
---
I read the FAQ http://www.stata.com/support/faqs/data/howbig.html and I also
tried with the interactive dataset calculator
(http://www.stata.com/support/faqs/data/datacalc.cgi). It returns for
<b.dta> with a width in bytes of *4* -Approximate dataset size is 12.91
megabytes-. With a width in bytes of *2* for <b1.dta>, I get -Approximate
dataset size is 9.69 megabytes-. Still quite a difference compared to the
sizes returned by -describe-.
Where do the differences come from?
How big are the files now?
Thanks and best wishes,
Daniel
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/