Samuelson's inequality explained

In statistics, Samuelson's inequality, named after the economist Paul Samuelson,^[1] also called the Laguerre - Samuelson inequality,^[2] ^[3] after the mathematician Edmond Laguerre, states that every one of any collection x₁, ..., x_n, is within uncorrected sample standard deviations of their sample mean.

Statement of the inequality

If we let

\overline{x}=

	x_{1+ … +x}_n
	n

be the sample mean and

s=\sqrt{

	1
	n

	n
\sum
	i=1

(x_i-\overline{x})²}

be the standard deviation of the sample, then

\overline{x}-s\sqrt{n-1}\lex_j\le\overline{x}+s\sqrt{n-1} forj=1,...,n.

^[4]

Equality holds on the left (or right) for

x_j

if and only if all the n - 1

x_i

s other than

x_j

are equal to each other and greater (smaller) than

x_j.

^[2]

If you instead define

s=\sqrt{

	1
	n-1

	n
\sum
	i=1

(x_i-\overline{x})²}

then the inequality

\overline{x}-s\sqrt{n-1}\lex_j\le\overline{x}+s\sqrt{n-1}

still applies and can be slightly tightened to

\overline{x}-s\tfrac{n-1}{\sqrt{n}}\lex_j\le\overline{x}+s\tfrac{n-1}{\sqrt{n}}.

Comparison to Chebyshev's inequality

Chebyshev's inequality locates a certain fraction of the data within certain bounds, while Samuelson's inequality locates all the data points within certain bounds.

The bounds given by Chebyshev's inequality are unaffected by the number of data points, while for Samuelson's inequality the bounds loosen as the sample size increases. Thus for large enough data sets, Chebyshev's inequality is more useful.

Applications

Samuelson's inequality may be considered a reason why studentization of residuals should be done externally.

Relationship to polynomials

Samuelson was not the first to describe this relationship: the first was probably Laguerre in 1880 while investigating the roots (zeros) of polynomials.^[2] ^[5] Consider a polynomial with all roots real:

	n
a
	0x

	n-1
a
	1x

+ … +a_n-1x+a_n=0

Without loss of generality let

a₀=1

and let

t₁=\sumx_i

and

t₂=\sum

	2
x
	i

Then

a₁=-\sumx_i=-t₁

and

a₂=\sumx_ix_j=

	2
t		-t₂
	1

wherei<j

In terms of the coefficients

t₂=

	2
a
	1

-2a₂

Laguerre showed that the roots of this polynomial were bounded by

-a₁/n\pmb\sqrt{n-1}

where

\sqrt{nt

	2
t
	1

} = \frac

Inspection shows that

-\tfrac{a_1}{n}

is the mean of the roots and that b is the standard deviation of the roots.

Laguerre failed to notice this relationship with the means and standard deviations of the roots, being more interested in the bounds themselves. This relationship permits a rapid estimate of the bounds of the roots and may be of use in their location.

When the coefficients

a₁

and

a₂

are both zero no information can be obtained about the location of the roots, because not all roots are real (as can be seen from Descartes' rule of signs) unless the constant term is also zero.

Notes and References

Paul . Samuelson . How Deviant Can You Be? . . 63 . 324 . 1968 . 1522 - 1525 . 2285901 . 10.2307/2285901 .
MSc . Jensen . Shane Tyler . 1999 . The Laguerre - Samuelson Inequality with Extensions and Applications in Statistics and Matrix Theory . .
Book: Shane T. . Jensen . George P. H. . Styan . 1999 . Some Comments and a Bibliography on the Laguerre-Samuelson Inequality with Extensions and Applications in Statistics and Matrix Theory . Analytic and Geometric Inequalities and Applications . 151–181 . 10.1007/978-94-011-4577-0_10 . 978-94-010-5938-1 .
Book: Advances in Inequalities from Probability Theory and Statistics . Neil S. . Barnett . Sever Silvestru . Dragomir . Nova Publishers . 2008 . 978-1-60021-943-6 . 164 .
Laguerre E. (1880) Mémoire pour obtenir par approximation les racines d'une équation algébrique qui a toutes les racines réelles. Nouv Ann Math 2^e série, 19, 161–172, 193–202