Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: st: Quantile Regression
From
Robert Davidson <[email protected]>
To
[email protected]
Subject
Re: st: Quantile Regression
Date
Tue, 2 Oct 2012 23:15:37 -0400
Thank you for the responses.
I am performing the same number of bootstrap replicates in each model
and I did not receive any messages that any of the quantiles failed to
converge. I will check into setting the same random seed before each
command; I did not set anything before running either command so
perhaps that is the issue. One thing I should mention is that the
standard errors in some cases were dramatically different, not just a
little bit off and that is what surprised me given the number of
observations I have.
I will take a look at Koenker's book as well. It would help a great
deal to find a faster way to estimate these models.
In case anyone has suggestions based on the model itself, I have a
large sample of firm-level stock returns and a series of variables
related to the firm's CEO. I am trying to estimate whether certain
'types' of CEOs are over or under represented at different quantiles
of stock returns. I can see a pattern by plotting a simple histogram,
but want a stronger indicator of the relation and magnitude.
Thanks again,
Rob
On Tue, Oct 2, 2012 at 10:12 PM, JVerkuilen (Gmail)
<[email protected]> wrote:
> On Tue, Oct 2, 2012 at 7:31 PM, Steve Samuels <[email protected]> wrote:
>>
>> Without details (see FAQ 3.3 first sentence), we can only guess. This
>> could happen if 1) you did not set the same random seed before each
>> -sqreg- and -bsqreg- command; 2) the number of bootstrap replicates
>> differed between -sqreg- and -bsqreg- runs; or 3) -sqreg- does not
>> rejects replicates in which convergence failed for any quantile.
>
> If the standard errors are different it's no great surprise if you're
> running bootstrap. All the stuff said makes sense. Check on a known
> dataset (such as auto) and fix the seed.
>
>
>
>> By the way, the manual states that -sqreg- is faster than -bsqreg-.
>
> I believe that computationally there are some speedups due to the fact
> that the linear program can be solved for one and simply updated to
> get the rest of the quantiles, but I could be mistaken. Roger
> Koenker's book (Quantile Regression, Oxford University Press, 2006)
> discusses computation in detail. Also there are analytic options to
> bootstrapping that might be much faster. -qreg- generates standard
> errors analytically using a weighting matrix and density estimator of
> the residuals.
>
> . sysuse auto
> . qreg price mpg
>
> Median regression Number of obs = 74
> Raw sum of deviations 142205 (about 4934)
> Min sum of deviations 129521.7 Pseudo R2 = 0.0892
>
> ------------------------------------------------------------------------------
> price | Coef. Std. Err. t P>|t| [95% Conf. Interval]
> -------------+----------------------------------------------------------------
> mpg | -135.6667 67.26576 -2.02 0.047 -269.7585 -1.574816
> _cons | 8088.667 1483.808 5.45 0.000 5130.749 11046.58
> ------------------------------------------------------------------------------
>
>
>
>
>
> . bsqreg price mpg, reps(999) *note that bsqreg defaults
> to 20!?!?!?!
>
> Median regression, bootstrap(999) SEs Number of obs = 74
> Raw sum of deviations 142205 (about 4934)
> Min sum of deviations 129521.7 Pseudo R2 = 0.0892
>
> ------------------------------------------------------------------------------
> price | Coef. Std. Err. t P>|t| [95% Conf. Interval]
> -------------+----------------------------------------------------------------
> mpg | -135.6667 35.63527 -3.81 0.000 -206.7043 -64.62906
> _cons | 8088.667 889.0486 9.10 0.000 6316.381 9860.953
> ------------------------------------------------------------------------------
>
>
> In this case the standard errors are markedly different and playing
> with the different methods in -qreg- gives quite different values, but
> I don't really know enough to be able to comment on why. I am inclined
> to trust the bootstrapped ones because this problem has a rather small
> N.
>
> I suspect that it is very slow on a huge problem though, given that it
> needs to sort the residuals. Koenker did a good deal of work on
> alternatives such as inverting a test of some sort; I think the R
> implementation of quantile regression has this. Again see his book.
>
>
>
>> I've never had the luxury of having so many observations to analyze. I
>> imagine that almost every simple model can be rejected, so that model
>> building and validation are real challenges.
>
> Randomly subsample and do a real cross validation?
>
> Jay
> --
> JVVerkuilen, PhD
> [email protected]
>
> "Out beyond ideas of wrong-doing and right-doing there is a field.
> I'll meet you there. When the soul lies down in that grass the world
> is too full to talk about." ---Rumi
> *
> * For searches and help try:
> * http://www.stata.com/help.cgi?search
> * http://www.stata.com/support/faqs/resources/statalist-faq/
> * http://www.ats.ucla.edu/stat/stata/
*
* For searches and help try:
* http://www.stata.com/help.cgi?search
* http://www.stata.com/support/faqs/resources/statalist-faq/
* http://www.ats.ucla.edu/stat/stata/