Quantile regression averaging explained

Quantile Regression Averaging (QRA) is a forecast combination approach to the computation of prediction intervals. It involves applying quantile regression to the point forecasts of a small number of individual forecasting models or experts. It has been introduced in 2014 by Jakub Nowotarski and Rafał Weron[1] and originally used for probabilistic forecasting of electricity prices[2] [3] and loads.[4] [5] Despite its simplicity it has been found to perform extremely well in practice - the top two performing teams in the price track of the Global Energy Forecasting Competition (GEFCom2014) used variants of QRA.[6] [7]

Introduction

The individual point forecasts are used as independent variables and the corresponding observed target variable as the dependent variable in a standard quantile regression setting.[8] The Quantile Regression Averaging method yields an interval forecast of the target variable, but does not use the prediction intervals of the individual methods. One of the reasons for using point forecasts (and not interval forecasts) is their availability. For years, forecasters have focused on obtaining accurate point predictions. Computing probabilistic forecasts, on the other hand, is generally a much more complex task and has not been discussed in the literature nor developed by practitioners so extensively. Therefore, QRA may be found particularly attractive from a practical point of view as it allows to leverage existing development of point forecasting.

Computation

The quantile regression problem can be written as follows:

Qy(q|Xt)=Xt\betaq

,

where

Qy(q|)

is the conditional q-th quantile of the dependent variable (

yt

),

Xt=[1,\hat{y}1,t,...,\hat{y}m,t]

is a vector of point forecasts of

m

individual models (i.e. independent variables) and βq is a vector of parameters (for quantile q). The parameters are estimated by minimizing the loss function for a particular q-th quantile:
min\limits
\betaq

\left[

\sum\limits
\{t:yt\geqXt\betaq\
} q |y_t - X_t\beta_q | +\sum\limits_ (1-q)|y_t - X_t\beta_q |\right] = \min\limits_ \left[\sum\limits_{t}(q - \mathbf{1}_{y_t < X_t\beta_q }) (y_t - X_t\beta_q) \right]

QRA assigns weights to individual forecasting methods and combines them to yield forecasts of chosen quantiles. Although the QRA method is based on quantile regression, not least squares, it still suffers from the same problems: the exogenous variables should not be correlated strongly and the number of variables included in the model has to be relatively small in order for the method to be computationally efficient.

Factor Quantile Regression Averaging (FQRA)

The main difficulty associated with applying QRA comes from the fact that only individual models that perform well and (preferably) are distinct should be used. However, there may be many well performing models or many different specifications of each model (with or without exogenous variables, with all or only selected lags, etc.) and it may not be optimal to include all of them in Quantile Regression Averaging.

In Factor Quantile Regression Averaging (FQRA), instead of selecting individual models a priori, the relevant information contained in all forecasting models at hand is extracted using principal component analysis (PCA). The prediction intervals are then constructed on the basis of the common factors (

ft

) obtained from the panel of point forecasts, as independent variables in a quantile regression. More precisely, in the FQRA method

Xt=[1,\hat{f}1,t,...,\hat{f}k,t]

is a vector of

k<m

factors extracted from a panel of point forecasts of

m

individual models, not a vector of point forecasts of the individual models themselves. A similar principal component-type approach was proposed in the context of obtaining point forecasts from the Survey of Professional Forecasters data.[9]

Instead of considering a (large) panel of forecasts of the individual models, FQRA concentrates on a small number of common factors, which - by construction - are orthogonal to each other, and hence are contemporaneously uncorrelated. FQRA can be also interpreted as a forecast averaging approach. The factors estimated within PCA are linear combinations of individual vectors of the panel and FQRA can therefore be used to assign weights to the forecasting models directly.

QRA and LAD regression

QRA may be viewed as an extension of combining point forecasts. The well-known ordinary least squares (OLS) averaging[10] uses linear regression to estimate weights of the point forecasts of individual models. Replacing the quadratic loss function with the absolute loss function leads to quantile regression for the median, or in other words, least absolute deviation (LAD) regression.[11]

See also

Implementations

References

  1. Computing electricity spot price prediction intervals using quantile regression and forecast averaging. Computational Statistics. 2015. 0943-4062. 791–803. 30. 3. 10.1007/s00180-014-0523-0. Jakub. Nowotarski. Rafał. Weron. [Open Access]. free.
  2. Electricity price forecasting: A review of the state-of-the-art with a look into the future. International Journal of Forecasting. 2014. 1030–1081. 30. 4. 10.1016/j.ijforecast.2014.08.008. Rafał. Weron. [Open Access]. free.
  3. Maciejowska. Katarzyna. Nowotarski. Jakub. Weron. Rafał. 2016. Probabilistic forecasting of electricity spot prices using Factor Quantile Regression Averaging. International Journal of Forecasting. 32. 3. 957–965. 10.1016/j.ijforecast.2014.12.004.
  4. Probabilistic Load Forecasting via Quantile Regression Averaging on Sister Forecasts. IEEE Transactions on Smart Grid. 2015. 1949-3053. 1. PP. 99. 10.1109/TSG.2015.2437877. B.. Liu. J.. Nowotarski. T.. Hong. R.. Weron.
  5. Web site: Probabilistic Electric Load Forecasting: A Tutorial Review. blog.drhongtao.com. 2015-11-28. Hong. Tao. Fan. Shu.
  6. Gaillard. Pierre. Goude. Yannig. Nedellec. Raphaël. 2016. Additive models and robust aggregation for GEFCom2014 probabilistic electric load and electricity price forecasting. International Journal of Forecasting. 32. 3. 1038–1050. 10.1016/j.ijforecast.2015.12.001.
  7. Maciejowska. Katarzyna. Nowotarski. Jakub. 2016. A hybrid model for GEFCom2014 probabilistic electricity price forecasting. International Journal of Forecasting. 32. 3. 1051–1056. 10.1016/j.ijforecast.2015.11.008.
  8. Book: Koenker, Roger. Quantile Regression. John Wiley & Sons, Ltd. 2005. 9780470057339. 10.1002/9780470057339.vnn091. Quantile Regression This article has been prepared for the Statistical Theory and Methods section of the Encyclopedia of Environmetrics edited by Abdel El-Shaarawi and Walter Piegorsch. The research was partially supported by NSF grant SES-0850060.
  9. Forecast combination through dimension reduction techniques. International Journal of Forecasting. 2011. 224–237. 27. 2. 10.1016/j.ijforecast.2010.01.012. Pilar. Poncela. Julio. Rodríguez. Rocío. Sánchez-Mangas. Eva. Senra.
  10. Improved methods of combining forecasts. Journal of Forecasting. 1984. 1099-131X. 197–204. 3. 2. 10.1002/for.3980030207. Clive W. J.. Granger. Ramu. Ramanathan.
  11. An empirical comparison of alternative schemes for combining electricity spot price forecasts. Energy Economics. 2014. 395–412. 46. 10.1016/j.eneco.2014.07.014. Jakub. Nowotarski. Eran. Raviv. Stefan. Trück. Rafał. Weron. 2014EneEc..46..395N .