Re: st: Elimination of outliers

From	Nick Cox <njcoxstata@gmail.com>
To	"statalist@hsphsun2.harvard.edu" <statalist@hsphsun2.harvard.edu>
Subject	Re: st: Elimination of outliers
Date	Mon, 6 Jun 2011 13:39:20 +0100

In general, a very bad idea. Consider transforming your response or predictors, or using a non-identity link function in a generalized linear model, or some flavour of robust regression, as more measured tactics.

Nick
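A minimal sketch of the first suggestion above, not taken from the thread: it assumes a natural-log transformation (the reply does not name one) and uses the `at` values from the sample data quoted below. The helper `max_z_score` is a hypothetical name introduced for illustration. It compares how far the largest value sits from the mean, in standard deviations, before and after transforming.

```python
import math
import statistics

# `at` values copied from the sample data in the original question.
at = [1120, 1230, 57, 67, 789, 650, 1230, 21, 234, 256, 267, 4335]


def max_z_score(values):
    """Distance of the largest value from the mean, in sample standard deviations."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return (max(values) - mean) / sd


# Raw scale: the largest value is far from the bulk of the data.
z_raw = max_z_score(at)

# Log scale (an assumed choice of transformation): the same observation
# is kept, but it is much less extreme relative to the rest.
z_log = max_z_score([math.log(v) for v in at])

print(f"largest value, raw scale: {z_raw:.2f} SDs above the mean")
print(f"largest value, log scale: {z_log:.2f} SDs above the mean")
```

No observation is dropped here; the transformation only changes the scale on which the large values are judged. Whether a log scale suits these variables is a modelling decision that this sketch does not settle.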

On 6 Jun 2011, at 12:46, "Achmed Aldai" <Hauptseminar@gmx.de> wrote:

Hi
I am currently working on a do-file in which I want to eliminate outliers, that is, observations with the highest and lowest values of certain variables, e.g. at and lt. In total I have 150000 observations, and I want to delete 25 observations from each of the upper and lower ends. It might also be better to do this in relative terms, meaning that I don't take the highest and lowest 25 but the lower and upper 1% of the corresponding variables.
gvkey           at           lt
1001            1120         231
1001            1230         312
1210            57           32
1210            67           25
1354            789          560
1368            650          500
1481            1230         900
2930            21           30
3201            234          213
3201            256          220
3210            267          320
4510            4335         3214

I hope this became clear.
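For reference, the two selection rules described in the question (a fixed count per tail, or a percentage per tail) can be sketched as below. This is an illustration only, not code from the thread: `tail_indices` and `tail_size` are hypothetical helpers, the count is assumed to apply to each tail separately, and the sketch flags observations rather than deleting them, given the caution in the reply above.

```python
# `at` values copied from the sample data in the question.
at = [1120, 1230, 57, 67, 789, 650, 1230, 21, 234, 256, 267, 4335]


def tail_size(n_obs, percent):
    """Number of observations in one tail for a whole-number percentage."""
    return n_obs * percent // 100


def tail_indices(values, k):
    """Return (low, high): index sets of the k smallest and k largest values."""
    if k < 0 or 2 * k > len(values):
        raise ValueError("k must satisfy 0 <= 2*k <= len(values)")
    # Indices ordered by value; ties keep their original order.
    order = sorted(range(len(values)), key=lambda i: values[i])
    low = set(order[:k])
    high = set(order[len(order) - k:])
    return low, high


# Fixed-count rule: one observation per tail, since the sample has only 12 rows.
low, high = tail_indices(at, 1)

# Mark the extremes instead of dropping them, so they can be inspected first.
flagged = [i in low or i in high for i in range(len(at))]

# Relative rule: with 150000 observations, 1% per tail is 1500 observations.
per_tail = tail_size(150000, 1)

print("lowest:", [at[i] for i in sorted(low)])
print("highest:", [at[i] for i in sorted(high)])
print("observations per tail at 1% of 150000:", per_tail)
```

Each variable (at, lt) would need its own pass, and an observation extreme on one variable need not be extreme on the other.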
