In probability theory and statistics, the split normal distribution also known as the two-piece normal distribution results from joining at the mode the corresponding halves of two normal distributions with the same mode but different variances. It is claimed by Johnson et al. that this distribution was introduced by Gibbons and Mylroie and by John. But these are two of several independent rediscoveries of the Zweiseitige Gauss'sche Gesetz introduced in the posthumously published Kollektivmasslehre (1897)[1] of Gustav Theodor Fechner (1801-1887), see Wallis (2014).[2] Another rediscovery has appeared more recently in a finance journal.[3]
The split normal distribution arises from merging two opposite halves of two probability density functions (PDFs) of normal distributions in their common mode.
The PDF of the split normal distribution is given by
f(x;\mu,\sigma1,\sigma2)=A\exp\left(-
(x-\mu)2 | ||||||||
|
\right) ifx<\mu
f(x;\mu,\sigma1,\sigma2)=A\exp\left(-
(x-\mu)2 | ||||||||
|
\right) otherwise
A=\sqrt{2/\pi}(\sigma1+\sigma
-1 | |
2) |
.
The split normal distribution results from merging two halves of normal distributions. In a general case the 'parent' normal distributions can have different variances which implies that the joined PDF would not be continuous. To ensure that the resulting PDF integrates to 1, the normalizing constant A is used.
In a special case when
2 | |
\sigma | |
* |
2 | |
\sigma | |
* |
When σ2≠σ1 the constant A is different from the constant of normal distribution. However, when
2 | |
\sigma | |
* |
The sign of its third central moment is determined by the difference (σ2-σ1). If this difference is positive, the distribution is skewed to the right and if negative, then it is skewed to the left.
Other properties of the split normal density were discussed by Johnson et al. and Julio.
The formulation discussed above originates from John. The literature offers two mathematically equivalent alternative parameterizations . Britton, Fisher and Whitley offer a parameterization if terms of mode, dispersion and normed skewness, denoted with
l{SN}(\mu,\sigma2,\gamma)
The second alternative parameterization is used in the Bank of England's communication and is written in terms of mode, dispersion and unnormed skewness and is denoted with
l{SN}(\mu,\sigma2,\xi)
The three parameterizations are mathematically equivalent, meaning that there is a strict relationship between the parameters and that it is possible to go from one parameterization to another. The following relationships hold:
\begin{align} \sigma2&=
2(1+\gamma)= | |
\sigma | |
1 |
2(1-\gamma) | |
\sigma | |
2 |
\\ \gamma&=
| |||||||
|
\\ \xi&=\sqrt{2/\pi}(\sigma2-\sigma1)\\ \gamma&=\operatorname{sgn}(\xi)\sqrt{1-\left(
\sqrt{1+2\beta | |
-1}{\beta} |
\right)2}, where \beta=
\pi\xi2 | |
2\sigma2 |
. \end{align}
The multivariate generalization of the split normal distribution was proposed by Villani and Larsson. They assume that each of the principal components has univariate split normal distribution with a different set of parameters μ, σ2 and σ1.
John proposes to estimate the parameters using maximum likelihood method. He shows that the likelihood function can be expressed in an intensive form, in which the scale parameters σ1 and σ2 are a function of the location parameter μ. The likelihood in its intensive form is:
L(\mu)=
-\left[\sum | |
xi:xi<\mu |
2 | |
(x | |
i-\mu) |
\right]1/3-
\left[\sum | |
xi:xi>\mu |
2 | |
(x | |
i-\mu) |
\right]1/3
Given the maximum likelihood estimator
\hat{\mu}
2 | |
\hat{\sigma} | |
1 |
=
-L(\mu) | |
N |
\left[\sum | |
xi:xi<\mu |
2 | |
(x | |
i-\mu) |
\right]2/3,
2 | |
\hat{\sigma} | |
2 |
=
-L(\mu) | |
N |
\left[\sum | |
xi:xi>\mu |
2 | |
(x | |
i-\mu) |
\right]2/3,
Villani and Larsson propose to use either maximum likelihood method or bayesian estimation and provide some analytical results for either univariate and multivariate case.
The split normal distribution has been used mainly in econometrics and time series. A remarkable area of application is the construction of the fan chart, a representation of the inflation forecast distribution reported by inflation targeting central banks around the globe.