Mean signed deviation explained

In statistics, the mean signed difference (MSD), also known as mean signed deviation and mean signed error, is a sample statistic that summarises how well a set of estimates

\hat{\theta}i

match the quantities

\thetai

that they are supposed to estimate. It is one of a number of statistics that can be used to assess an estimation procedure, and it would often be used in conjunction with a sample version of the mean square error.

For example, suppose a linear regression model has been estimated over a sample of data, and is then used to extrapolate predictions of the dependent variable out of sample after the out-of-sample data points have become available. Then

\thetai

would be the i-th out-of-sample value of the dependent variable, and

\hat{\theta}i

would be its predicted value. The mean signed deviation is the average value of

\hat{\theta}i-\thetai.

Definition

The mean signed difference is derived from a set of n pairs,

(\hat{\theta}i,\thetai)

, where

\hat{\theta}i

is an estimate of the parameter

\theta

in a case where it is known that

\theta=\thetai

. In many applications, all the quantities

\thetai

will share a common value. When applied to forecasting in a time series analysis context, a forecasting procedure might be evaluated using the mean signed difference, with

\hat{\theta}i

being the predicted value of a series at a given lead time and

\thetai

being the value of the series eventually observed for that time-point. The mean signed difference is defined to be

\operatorname{MSD}(\hat{\theta})=

1
n
n
\sum
i=1

\hat{\thetai

} - \theta_ .

Use Cases

The mean signed difference is often useful when the estimations

\hat{\thetai}

are biased from the true values

\thetai

in a certain direction. If the estimator that produces the

\hat{\thetai}

values is unbiased, then

\operatorname{MSD}(\hat{\thetai})=0

. However, if the estimations

\hat{\thetai}

are produced by a biased estimator, then the mean signed difference is a useful tool to understand the direction of the estimator's bias.

See also