Majorization Explained

In mathematics, majorization is a preorder on vectors of real numbers. For two such vectors,

x,y\inRn

, we say that

x

weakly majorizes (or dominates)

y

from below, commonly denoted

x\succwy,

when
k
\sum
i=1
\downarrow
x
i

\geq

k
\sum
i=1
\downarrow
y
i
for all

k=1,...,n

,where
\downarrow
x
i
denotes

i

th largest entry of

x

. If

x,y

further satisfy
n
\sum
i=1

xi=

n
\sum
i=1

yi

, we say that

x

majorizes (or dominates)

y

, commonly denoted

x\succy

. Majorization is a partial order for vectors whose entries are non-decreasing, but only a preorder for general vectors, since majorization is agnostic to the ordering of the entries in vectors, e.g., the statement

(1,2)\prec(0,3)

is simply equivalent to

(2,1)\prec(3,0)

.

Majorizing also sometimes refers to entrywise ordering, e.g. the real-valued function f majorizes the real-valued function g when

f(x)\geqg(x)

for all

x

in the domain, or other technical definitions, such as majorizing measures in probability theory.[1]

Equivalent conditions

Geometric definition

For

x,y\inRn,

we have

x\precy

if and only if

x

is in the convex hull of all vectors obtained by permuting the coordinates of

y

. This is equivalent to saying that

x=Dy

for some doubly stochastic matrix

D

.[2] In particular,

x

can be written as a convex combination of

n

permutations of

y

.[3]

Figure 1 displays the convex hull in 2D for the vector

y=(3,1)

. Notice that the center of the convex hull, which is an interval in this case, is the vector

x=(2,2)

. This is the "smallest" vector satisfying

x\precy

for this given vector

y

.Figure 2 shows the convex hull in 3D. The center of the convex hull, which is a 2D polygon in this case, is the "smallest" vector

x

satisfying

x\precy

for this given vector

y

.

Other definitions

Each of the following statements is true if and only if

x\succy

.

x

we can produce

y

by a finite sequence of "Robin Hood operations" where we replace two elements

xi

and

xj<xi

with

xi-\varepsilon

and

xj+\varepsilon

, respectively, for some

\varepsilon\in(0,xi-xj)

.

h:R\toR

,
d
\sum
i=1

h(xi)\geq

d
\sum
i=1

h(yi)

.

\sumi{xi}=\sumi{yi}

and, for every,
d
\sum
i=1

max(0,xi-t)

d
\geq\sum
i=1

max(0,yi-t)

.[4]

t\inR

,
d
\sum
j=1

|xj-t|\geq

d
\sum
j=1

|yj-t|

.[5]

Examples

Among non-negative vectors with three components,

(1,0,0)

and permutations of it majorize all other vectors

(p1,p2,p3)

such that

p1+p2+p3=1

. For example,

(1,0,0)\succ(1/2,0,1/2)

. Similarly,

(1/3,1/3,1/3)

is majorized by all other such vectors, so

(1/2,0,1/2)\succ(1/3,1/3,1/3)

.

This behavior extends to general-length probability vectors: the singleton vector majorizes all other probability vectors, and the uniform distribution is majorized by all probability vectors.

Schur convexity

See main article: Schur-convex function. A function

f:Rn\toR

is said to be Schur convex when

x\succy

implies

f(x)\geqf(y)

. Hence, Schur-convex functions translate the ordering of vectors to a standard ordering in

R

. Similarly,

f(x)

is Schur concave when

x\succy

implies

f(x)\leqf(y).

An example of a Schur-convex function is the max function,

\downarrow
max(x)=x
1
. Schur convex functions are necessarily symmetric that the entries of it argument can be switched without modifying the value of the function. Therefore, linear functions, which are convex, are not Schur-convex unless they are symmetric. If a function is symmetric and convex, then it is Schur-convex.

Generalizations

Majorization can be generalized to the Lorenz ordering, a partial order on distribution functions. For example, a wealth distribution is Lorenz-greater than another if its Lorenz curve lies below the other. As such, a Lorenz-greater wealth distribution has a higher Gini coefficient, and has more income disparity.[6]

The majorization preorder can be naturally extended to density matrices in the context of quantum information.[7] In particular,

\rho\succ\rho'

exactly when

spec[\rho]\succspec[\rho']

(where

spec

denotes the state's spectrum).

Similarly, one can say a Hermitian operator,

H

, majorizes another,

M

, if the set of eigenvalues of

H

majorizes that of

M

.

See also

Notes

  1. Talagrand . Michel . 1996-07-01 . Majorizing measures: the generic chaining . The Annals of Probability . 24 . 3 . 10.1214/aop/1065725175 . 0091-1798. free .
  2. Barry C. Arnold. "Majorization and the Lorenz Order: A Brief Introduction". Springer-Verlag Lecture Notes in Statistics, vol. 43, 1987.
  3. Xingzhi. Zhan. The sharp Rado theorem for majorizations. The American Mathematical Monthly. 2003. 110. 2. 152–153. 10.2307/3647776. 3647776.
  4. July 3, 2005 post by fleeting_guest on "The Karamata Inequality" thread, AoPS community forums. Archived 11 November 2020.
  5. Book: Nielsen. Michael A.. Michael Nielsen. Chuang. Isaac L.. Isaac Chuang. Quantum Computation and Quantum Information. Cambridge University Press. Cambridge. 2010. 2nd. 844974180. 978-1-107-00217-3.
  6. Book: Marshall, Albert W. . Inequalities : theory of majorization and its applications . 2011 . Springer Science+Business Media, LLC . Ingram Olkin, Barry C. Arnold . 978-0-387-68276-1 . 2nd . New York . 694574026. 14, 15.
  7. Alfred. Wehrl. General properties of entropy. Reviews of Modern Physics. 1 April 1978. 221–260. 50. 2. 10.1103/RevModPhys.50.221. 1978RvMP...50..221W .

References

External links

Software