Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: sum issue (wrong values)
From
Nick Cox <[email protected]>
To
[email protected]
Subject
Re: st: sum issue (wrong values)
Date
Wed, 24 Oct 2012 11:43:30 +0100
That does look puzzling. Three of us suggested that value labels might
be getting in the way, but there are none.
I don't have any further suggestions, beyond wondering whether your
executable is corrupted. Can you get the same results from
. list in 1/10
immediately before or after the -summarize-?
Nick
On Wed, Oct 24, 2012 at 11:33 AM, Christian Bärtsch
<[email protected]> wrote:
> Thanks Nick - sorry I am only getting used to the correct terms.
>
> Yes it is correct, that I am looking at the -summarize- command in stata.
>
>
> . sum latency_int
>
> Variable | Obs Mean Std. Dev. Min Max
> -------------+--------------------------------------------------------
> latency_int | 5760 1102.242 700.1589 2 1999
>
> . describe latency_int
>
> storage display value
> variable name type format label variable label
> -------------------------------------------------------------------------------
> latency_int int %8.0g latency_int
>
>
> .
> . count
> 5760
>
> .
> . summarize latency_int, detail
>
> latency_int
> -------------------------------------------------------------
> Percentiles Smallest
> 1% 14 2
> 5% 57 2
> 10% 122.5 2 Obs 5760
> 25% 377.5 2 Sum of Wgt. 5760
>
> 50% 1221.5 Mean 1102.242
> Largest Std. Dev. 700.1589
> 75% 1779 1998
> 90% 1902 1998 Variance 490222.5
> 95% 1948 1998 Skewness -.2200922
> 99% 1988 1999 Kurtosis 1.417508
>
> And here also an extract from list
>
> . list latency_int
>
> +----------+
> | latenc~t |
> |----------|
> 1. | 4720 |
> 2. | 3923 |
> 3. | 1844 |
> 4. | 1435 |
> 5. | 2955 |
> |----------|
> 6. | 1483 |
> 7. | 3459 |
> 8. | 1004 |
> 9. | 1716 |
> 10. | 1372 |
> |----------|
>
> Thanks.
>
>
> 2012/10/24 Nick Cox <[email protected]>:
>> This is ambiguous as between the -summarize- command (which can be
>> abbreviated -sum-) and the -sum()- function, which gives cumulative or
>> running sums, although it seems you mean the first. In Stata (not
>> "STATA") commands and functions are quite different families.
>>
>> Even then, you must show us exactly what you typed and exactly what
>> Stata did by copying output. Otherwise it is difficult to guess what
>> is going on. Does -latency_int- have value labels, which are what you
>> see when you -list-, but not what are -summarize-d? You should show us
>> the results of
>>
>> describe latency_int
>> count
>> summarize latency_int, detail
>>
>> Nick
>>
>> On Wed, Oct 24, 2012 at 11:13 AM, Christian Bärtsch
>> <[email protected]> wrote:
>>
>>> I have a issue using the sum function of STATA. I have a data set,
>>> where I have a variable called latency_int (type: int; and something
>>> over 5700 values). I use the command sum(latency_int). There I get the
>>> minimum of 2 and the maximum of 1999 even though the data set contains
>>> values from 44 to 81000 (those values are shown when I use
>>> list(latency_int). It must be a pretty basic mistake, however I have
>>> not been able to come up with a solution for days.
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/