Marchenko–Pastur distribution explained

In the mathematical theory of random matrices, the Marchenko–Pastur distribution, or Marchenko–Pastur law, describes the asymptotic behavior of singular values of large rectangular random matrices. The theorem is named after soviet mathematicians Volodymyr Marchenko and Leonid Pastur who proved this result in 1967.

denotes a

m x n

random matrix whose entries are independent identically distributed random variables with mean 0 and variance

\sigma²<infty

, let

Y_n=

	1
	n

XX^T

and let

λ_1,λ_2,...,λ_m

be the eigenvalues of

Y_n

(viewed as random variables). Finally, consider the random measure

\mu_m(A)=

	1
	m

\#\left\{λ_j\inA\right\}, A\subsetR.

counting the number of eigenvalues in the subset

included in

Theorem. Assume that

m,n\toinfty

so that the ratio

m/n\toλ\in(0,+infty)

. Then

\mu_m\to\mu

(in weak* topology in distribution), where

\mu(A)=\begin{cases}(1-

	1
	λ

)1_0\in+\nu(A),&ifλ>1\\ \nu(A),&if0\leqλ\leq1, \end{cases}

and

d\nu(x)=

	1
	2\pi\sigma²

	\sqrt{(λ₊-x)(x-λ_-)

} \,\mathbf_\, dxwith

λ_\pm=\sigma²⁽¹\pm\sqrt{λ})^2.

The Marchenko–Pastur law also arises as the free Poisson law in free probability theory, having rate

1/λ

and jump size

\sigma²

Moments

For each

k\geq1

, its

-th moment is

$\sum_^\frac\binom\binom \lambda^ = \frac\sum_^\binom\binom \lambda^$

Some transforms of this law

The Stieltjes transform is given by

s(z)=	\sigma²(1-λ)-z-\sqrt{(z-\sigma^2(λ+1))^2-4λ\sigma⁴

}for complex numbers of positive imaginary part, where the complex square root is also taken to have positive imaginary part. The Stieltjes transform can be repackaged in the form of the R-transform, which is given by

R(z)=	\sigma²
	1-\sigma²λz

The S-transform is given by

S(z)=	1
	\sigma²(1+λz)

For the case of

\sigma=1

, the

-transform is given by

E	1
	1+\gammaX

where

satisfies the Marchenko-Pastur law.

η(\gamma)=1-

	l{F
	(\gamma,λ)}{4\gammaλ}

where

l{F}(x,z)=\left(\sqrt{x(1+\sqrt{z})^{2+1}-\sqrt{x(1-\sqrt{z})}^2+1}\right)²

For exact analyis of high dimensional regression in the proportional asymptotic regime, a convenient form is often

T(u):=η\left(\tfrac1u\right)

which simplifies to

T(u)=

	-1+λ-u+\sqrt{(1+u-λ)^2+4uλ

}

The following functions

B(u):=E\left(	u
	X+u

\right)²

and

V(u):=	X
	(X+u)²

, where

satisfies the Marchenko-Pastur law, show up in the limiting Bias and Variance respectively, of ridge regression and other regularized linear regression problems. One can show that

B(u)=T(u)-u ⋅ T'(u)

and

V(u)=T'(u)

Application to correlation matrices

For the special case of correlation matrices, we know that

\sigma²⁼¹

and

λ=m/n

. This bounds the probability mass over the interval defined by

λ_\pm=\left(1\pm\sqrt{

	m
	n}\right)

^2.

Since this distribution describes the spectrum of random matrices with mean 0, the eigenvalues of correlation matrices that fall inside of the aforementioned interval could be considered spurious or noise. For instance, obtaining a correlation matrix of 10 stock returns calculated over a 252 trading days period would render

+=\left(1+\sqrt{	10
	252

}\right)^2\approx 1.43. Thus, out of 10 eigenvalues of said correlation matrix, only the values higher than 1.43 would be considered significantly different from random.

References

Book: 2567175. Bai. Zhidong. Silverstein. Jack W.. Spectral analysis of large dimensional random matrices. Second edition of 2006 original. 2010. Springer. New York. Springer Series in Statistics. 978-1-4419-0660-1. 10.1007/978-1-4419-0661-8. 1301.60002.
Epps . Brenden . Krivitzky . Eric M. . 2019 . Singular value decomposition of noisy data: mode corruption . Experiments in Fluids . 60 . 8 . 1–30 . 10.1007/s00348-019-2761-y . 2019ExFl...60..121E . 198436243 .
Götze . F. . Tikhomirov . A. . 2004 . Rate of convergence in probability to the Marchenko–Pastur law . Bernoulli . 10 . 3 . 503–548 . 10.3150/bj/1089206408 . free .
Marchenko . V. A. . Pastur . L. A. . 1967 . Распределение собственных значений в некоторых ансамблях случайных матриц . Distribution of eigenvalues for some sets of random matrices . ru . . N.S. . 72 . 114:4 . 507 - 536 . 10.1070/SM1967v001n04ABEH001994 . 1967SbMat...1..457M . Link to free-access pdf of Russian version
Book: Nica . A. . Roland Speicher . Speicher . R. . 2006 . Lectures on the Combinatorics of Free probability theory . limited . Cambridge Univ. Press . 0-521-85852-6 . 204, 368 . Link to free download Another free access site
Tulino. Antonia M.. Antonia Tulino. Verdú. Sergio. Random matrix theory and wireless communications. Foundations and Trends in Communications and Information Theory. 1. 1. 1–182. 10.1561/0100000001. 2004. 1143.94303. Sergio Verdú.
Zhang . W. . Abreu . G. . Inamori . M. . Sanada . Y. . 2011 . Spectrum sensing algorithms via finite random matrices . IEEE Transactions on Communications . 60 . 1 . 164–175 . 10.1109/TCOMM.2011.112311.100721 . 206642535 .

Marchenko–Pastur distribution explained

Moments

Some transforms of this law

Application to correlation matrices

See also

References