Trace inequality explained

In mathematics, there are many kinds of inequalities involving matrices and linear operators on Hilbert spaces. This article covers some important operator inequalities connected with traces of matrices.^[1] ^[2] ^[3] ^[4]

Basic definitions

Let

H_n

denote the space of Hermitian

n x n

matrices,

	+
H
	n

denote the set consisting of positive semi-definite

n x n

Hermitian matrices and

	++
H
	n

denote the set of positive definite Hermitian matrices. For operators on an infinite dimensional Hilbert space we require that they be trace class and self-adjoint, in which case similar definitions apply, but we discuss only matrices, for simplicity.

For any real-valued function

on an interval

I\subseteq\Reals,

one may define a matrix function

f(A)

for any operator

A\inH_n

with eigenvalues

by defining it on the eigenvalues and corresponding projectors

f(A) \equiv \sum_j f(\lambda_j)P_j ~,

given the spectral decomposition

A=\sum_jλ_jP_j.

Operator monotone

See main article: Operator monotone function.

A function

f:I\to\Reals

defined on an interval

I\subseteq\Reals

is said to be operator monotone if for all

and all

A,B\inH_n

with eigenvalues in

the following holds,

A \geq B \implies f(A) \geq f(B),

where the inequality

A\geqB

means that the operator

A-B\geq0

is positive semi-definite. One may check that

f(A)=A²

is, in fact, not operator monotone!

Operator convex

A function

f:I\to\Reals

is said to be operator convex if for all

and all

A,B\inH_n

with eigenvalues in

and

0<λ<1

, the following holds

f(\lambda A + (1-\lambda)B) \leq \lambda f(A) + (1 -\lambda)f(B).

Note that the operator

λA+(1-λ)B

has eigenvalues in

since

and

have eigenvalues in

A function

is if

-f

is operator convex;=, that is, the inequality above for

is reversed.

Joint convexity

A function

g:I x J\to\Reals,

defined on intervals

I,J\subseteq\Reals

is said to be if for all

and all

A_1,A₂\inH_n

with eigenvalues in

and all

B_1,B₂\inH_n

with eigenvalues in

and any

0\leqλ\leq1

the following holds

g(\lambda A_1 + (1-\lambda) A_2, \lambda B_1 + (1-\lambda) B_2) ~\leq~ \lambda g(A_1, B_1) + (1 -\lambda) g(A_2, B_2).

A function

is if −

is jointly convex, i.e. the inequality above for

is reversed.

Trace function

Given a function

f:\Reals\to\Reals,

the associated trace function on

H_n

is given by

A \mapsto \operatorname f(A) = \sum_j f(\lambda_j),

where

has eigenvalues

and

\operatorname{Tr}

stands for a trace of the operator.

Convexity and monotonicity of the trace function

Let

f:R\rarrR

be continuous, and let be any integer. Then, if

t\mapstof(t)

is monotone increasing, so is

A\mapsto\operatorname{Tr}f(A)

on H_n.

Likewise, if

t\mapstof(t)

is convex, so is

A\mapsto\operatorname{Tr}f(A)

on H_n, andit is strictly convex if is strictly convex.

See proof and discussion in, for example.

Löwner–Heinz theorem

For

-1\leqp\leq0

, the function

f(t)=-t^p

is operator monotone and operator concave.

For

0\leqp\leq1

, the function

f(t)=t^p

is operator monotone and operator concave.

For

1\leqp\leq2

, the function

f(t)=t^p

is operator convex. Furthermore,

f(t)=log(t)

is operator concave and operator monotone, while

f(t)=tlog(t)

is operator convex.

The original proof of this theorem is due to K. Löwner who gave a necessary and sufficient condition for to be operator monotone.^[5] An elementary proof of the theorem is discussed in and a more general version of it in.^[6]

Klein's inequality

For all Hermitian × matrices and and all differentiable convex functions

f:R\rarrR

with derivative, or for all positive-definite Hermitian × matrices and, and all differentiableconvex functions :(0,∞) →

, the following inequality holds, In either case, if is strictly convex, equality holds if and only if = .A popular choice in applications is, see below.

Proof

Let

C=A-B

so that, for

t\in(0,1)

B+tC=(1-t)B+tA

,varies from

Define

F(t)=\operatorname{Tr}[f(B+tC)]

.By convexity and monotonicity of trace functions,

F(t)

is convex, and so for all

t\in(0,1)

F(0)+t(F(1)-F(0))\geqF(t)

,which is,

F(1)-F(0)\geq

	F(t)-F(0)
	t

,and, in fact, the right hand side is monotone decreasing in

Taking the limit

t\to0

yields,

F(1)-F(0)\geqF'(0)

,which with rearrangement and substitution is Klein's inequality:

tr[f(A)-f(B)-(A-B)f'(B)]\geq0

Note that if

f(t)

is strictly convex and

C ≠ 0

, then

F(t)

is strictly convex. The final assertion follows from this and the fact that

\tfrac{F(t)-F(0)}{t}

is monotone decreasing in

Golden–Thompson inequality

See main article: Golden–Thompson inequality.

In 1965, S. Golden ^[7] and C.J. Thompson ^[8] independently discovered that

For any matrices

A,B\inH_n

\operatorname{Tr}e^A+B\leq\operatorname{Tr}e^Ae^B.

This inequality can be generalized for three operators:^[9] for non-negative operators

A,B,

	+
C\inH
	n

\operatorname{Tr}e^ln\leq

	infty
\int
	0

\operatorname{Tr}A(B+t)^-1C(B+t)^-1\operatorname{d}t.

Peierls–Bogoliubov inequality

Let

R,F\inH_n

be such that Tr e^R = 1.Defining, we have

\operatorname{Tr}e^Fe^R\geq\operatorname{Tr}e^F+R\geqe^g.

The proof of this inequality follows from the above combined with Klein's inequality. Take .^[10]

Gibbs variational principle

Let

be a self-adjoint operator such that

e^-H

is trace class. Then for any

\gamma\geq0

with

\operatorname{Tr}\gamma=1,

\operatorname{Tr}\gammaH+\operatorname{Tr}\gammaln\gamma\geq-ln\operatorname{Tr}e^-H,

with equality if and only if

\gamma=\exp(-H)/\operatorname{Tr}\exp(-H).

Lieb's concavity theorem

The following theorem was proved by E. H. Lieb in. It proves and generalizes a conjecture of E. P. Wigner, M. M. Yanase, and Freeman Dyson.^[11] Six years later other proofs were given by T. Ando ^[12] and B. Simon, and several more have been given since then.

For all

m x n

matrices

, and all

and

such that

0\leqq\leq1

and

0\leqr\leq1

, with

q+r\leq1

the real valued map on

	+
H
	m

	+
H
	n

given by

F(A,B,K)=\operatorname{Tr}(K^*A^qKB^r)

is jointly concave in

(A,B)

is convex in

Here

K^*

stands for the adjoint operator of

Lieb's theorem

For a fixed Hermitian matrix

L\inH_n

, the function

f(A)=\operatorname{Tr}\exp\{L+lnA\}

is concave on

	++
H
	n

The theorem and proof are due to E. H. Lieb, Thm 6, where he obtains this theorem as a corollary of Lieb's concavity Theorem.The most direct proof is due to H. Epstein;^[13] see M.B. Ruskai papers,^[14] ^[15] for a review of this argument.

Ando's convexity theorem

T. Ando's proof of Lieb's concavity theorem led to the following significant complement to it:

For all

m x n

matrices

, and all

1\leqq\leq2

and

0\leqr\leq1

with

q-r\geq1

, the real valued map on

	++
H
	m

	++
H
	n

given by

(A,B)\mapsto\operatorname{Tr}(K^*A^qKB^-r)

is convex.

Joint convexity of relative entropy

For two operators

	++
B\inH
	n

define the following map

R(A\parallelB):=\operatorname{Tr}(AlogA)-\operatorname{Tr}(AlogB).

\rho

and

\sigma

, the map

R(\rho\parallel\sigma)=S(\rho\parallel\sigma)

is the Umegaki's quantum relative entropy.

Note that the non-negativity of

R(A\parallelB)

follows from Klein's inequality with

f(t)=tlogt

Statement

The map

R(A\parallelB):

	++
H
	n

	++
H
	n

→ R

is jointly convex.

Proof

For all

0<p<1

(A,B)\mapsto\operatorname{Tr}(B^1-pA^p)

is jointly concave, by Lieb's concavity theorem, and thus

(A,B)\mapsto

	1
	p-1

(\operatorname{Tr}(B^1-pA^{p)-\operatorname{Tr}A)}

is convex. But

\lim_{p →}

	1
	p-1

(\operatorname{Tr}(B^1-pA^{p)-\operatorname{Tr}A)=R(A\parallel}B),

and convexity is preserved in the limit.

The proof is due to G. Lindblad.^[16]

Jensen's operator and trace inequalities

The operator version of Jensen's inequality is due to C. Davis.^[17]

A continuous, real function

on an interval

satisfies Jensen's Operator Inequality if the following holds

f\left(\sum_kA

	*X

	kA

_{k\right)\leq\sum}_k

	*f(X
A
	k)A

_k,

for operators

\{A_k\}_k

with

\sum_k

	*
A
	kA

_k=1

and for self-adjoint operators

\{X_k\}_k

with spectrum on

See,^[18] for the proof of the following two theorems.

Jensen's trace inequality

Let be a continuous function defined on an interval and let and be natural numbers. If is convex, we then have the inequality

	*X
\operatorname{Tr}l(fl(\sum
	kA

_kr)r)\leq

	n
\operatorname{Tr}l(\sum
	k=1

	*f(X
A
	k)A

_kr),

for all (₁, ..., _n) self-adjoint × matrices with spectra contained in andall (₁, ..., _n) of × matrices with

	*A
\sum
	k=1.

Conversely, if the above inequality is satisfied for some and, where > 1, then is convex.

Jensen's operator inequality

For a continuous function

defined on an interval

the following conditions are equivalent:

is operator convex.

For each natural number

we have the inequality

	*X
fl(\sum
	kA

_kr)\leq\sum

	n

	k=1

	*f(X
A
	k)A

_k,

for all

(X_1,\ldots,X_n)

bounded, self-adjoint operators on an arbitrary Hilbert space

l{H}

withspectra contained in

and all

(A_1,\ldots,A_n)

l{H}

with

	n
\sum
	k=1

	*
A
	kA

_k=1.

f(V^*XV)\leqV^*f(X)V

for each isometry

on an infinite-dimensional Hilbert space

l{H}

andevery self-adjoint operator

with spectrum in

Pf(PXP+λ(1-P))P\leqPf(X)P

for each projection

on an infinite-dimensional Hilbert space

l{H}

, every self-adjoint operator

with spectrum in

and every

Araki–Lieb–Thirring inequality

E. H. Lieb and W. E. Thirring proved the following inequality in ^[19] 1976: For any

A\geq0,

B\geq0

and

r\geq1,

\operatorname ((BAB)^r) ~\leq~ \operatorname (B^r A^r B^r).

In 1990 ^[20] H. Araki generalized the above inequality to the following one: For any

A\geq0,

B\geq0

and

q\geq0,

\operatorname((BAB)^) ~\leq~ \operatorname((B^r A^r B^r)^q),

for

r\geq1,

and

\operatorname((B^r A^r B^r)^q) ~\leq~ \operatorname((BAB)^),

for

0\leqr\leq1.

There are several other inequalities close to the Lieb–Thirring inequality, such as the following:^[21] for any

A\geq0,

B\geq0

and

\alpha\in[0,1],

\operatorname (B A^\alpha B B A^ B) ~\leq~ \operatorname (B^2 A B^2),

and even more generally:^[22] for any

A\geq0,

B\geq0,

r\geq1/2

and

c\geq0,

\operatorname((B A B^ A B)^r) ~\leq~ \operatorname((B^ A^2 B^)^r).

The above inequality generalizes the previous one, as can be seen by exchanging

B²

and

A^(1-\alpha)/2

with

\alpha=2c/(2c+2)

and using the cyclicity of the trace, leading to

\operatorname((B A^\alpha B B A^ B)^r) ~\leq~ \operatorname((B^2 A B^2)^r).

Additionally, building upon the Lieb-Thirring inequality the following inequality was derived: ^[23] For any

A,B\inH_n,T\inC^{n x}

and all

1\leqp,q\leqinfty

with

1/p+1/q=1

, it holds that

|\operatorname(TAT^*B)| ~\leq~ \operatorname(T^*T|A|^p)^\frac\operatorname(TT^*|B|^q)^\frac.

Effros's theorem and its extension

E. Effros in ^[24] proved the following theorem.

f(x)

is an operator convex function, and

and

are commuting bounded linear operators, i.e. the commutator

[L,R]=LR-RL=0

, the perspective

g(L,R):=f(LR^-1)R

is jointly convex, i.e. if

L=λL_1+(1-λ)L₂

and

R=λR_1+(1-λ)R₂

with

[L_i,R_i]=0

(i=1,2),

0\leqλ\leq1

g(L,R)\leqλg(L_1,R_1)+(1-λ)g(L_2,R_2).

Ebadian et al. later extended the inequality to the case where

and

do not commute .^[25]

Von Neumann's trace inequality and related results

, named after its originator John von Neumann, states that for any

n x n

complex matrices

and

with singular values

\alpha₁\geq\alpha₂\geq … \geq\alpha_n

and

\beta₁\geq\beta₂\geq … \geq\beta_n

respectively,^[26]

|\operatorname(A B)| ~\leq~ \sum_^n \alpha_i \beta_i\,,

with equality if and only if

and

B^\dagger

share singular vectors.^[27]

n x n

positive semi-definite complex matrices

and

where now the eigenvalues are sorted decreasingly (

a₁\geqa₂\geq … \geqa_n

and

b₁\geqb₂\geq … \geqb_n,

respectively),

\sum_^n a_i b_ ~\leq~ \operatorname(A B) ~\leq~ \sum_^n a_i b_i\,.

References

Scholarpedia primary source.

Notes and References

E. Carlen, Trace Inequalities and Quantum Entropy: An Introductory Course, Contemp. Math. 529 (2010) 73–140
R. Bhatia, Matrix Analysis, Springer, (1997).
B. Simon, Trace Ideals and their Applications, Cambridge Univ. Press, (1979); Second edition. Amer. Math. Soc., Providence, RI, (2005).
M. Ohya, D. Petz, Quantum Entropy and Its Use, Springer, (1993).
Löwner . Karl . Über monotone Matrixfunktionen . Mathematische Zeitschrift . Springer Science and Business Media LLC . 38 . 1 . 1934 . 0025-5874 . 10.1007/bf01170633 . 177–216 . 121439134 . de.
[William F. Donoghue Jr.|W.F. Donoghue, Jr.]
Golden . Sidney . Lower Bounds for the Helmholtz Function . Physical Review . American Physical Society (APS) . 137 . 4B . 1965-02-22 . 0031-899X . 10.1103/physrev.137.b1127 . B1127–B1128. 1965PhRv..137.1127G .
Thompson . Colin J. . Inequality with Applications in Statistical Mechanics . Journal of Mathematical Physics . AIP Publishing . 6 . 11 . 1965 . 0022-2488 . 10.1063/1.1704727 . 1812–1813. 1965JMP.....6.1812T .
Lieb . Elliott H . Convex trace functions and the Wigner-Yanase-Dyson conjecture . . 11 . 3 . 1973 . 0001-8708 . 10.1016/0001-8708(73)90011-x . free . 267–288.
D. Ruelle, Statistical Mechanics: Rigorous Results, World Scient. (1969).
Wigner . Eugene P. . Yanase . Mutsuo M. . On the Positive Semidefinite Nature of a Certain Matrix Expression . Canadian Journal of Mathematics . Canadian Mathematical Society . 16 . 1964 . 0008-414X . 10.4153/cjm-1964-041-x . 397–406. 124032721 .
Ando . T. . Concavity of certain maps on positive definite matrices and applications to Hadamard products . Linear Algebra and Its Applications . Elsevier BV . 26 . 1979 . 0024-3795 . 10.1016/0024-3795(79)90179-4 . 203–241. free .
Epstein . H. . Remarks on two theorems of E. Lieb . Communications in Mathematical Physics . Springer Science and Business Media LLC . 31 . 4 . 1973 . 0010-3616 . 10.1007/bf01646492 . 317–325. 1973CMaPh..31..317E . 120096681 .
Ruskai . Mary Beth . Inequalities for quantum entropy: A review with conditions for equality . Journal of Mathematical Physics . AIP Publishing . 43 . 9 . 2002 . 0022-2488 . 10.1063/1.1497701 . 4358–4375. quant-ph/0205064 . 2002JMP....43.4358R . 3051292 .
Ruskai . Mary Beth . Another short and elementary proof of strong subadditivity of quantum entropy . Reports on Mathematical Physics . Elsevier BV . 60 . 1 . 2007 . 0034-4877 . 10.1016/s0034-4877(07)00019-5 . 1–12. quant-ph/0604206 . 2007RpMP...60....1R . 1432137 .
Lindblad . Göran . Expectations and entropy inequalities for finite quantum systems . Communications in Mathematical Physics . Springer Science and Business Media LLC . 39 . 2 . 1974 . 0010-3616 . 10.1007/bf01608390 . 111–119. 1974CMaPh..39..111L . 120760667 .
C. Davis, A Schwarz inequality for convex operator functions, Proc. Amer. Math. Soc. 8, 42–44, (1957).
Hansen . Frank . Pedersen . Gert K. . Jensen's Operator Inequality . Bulletin of the London Mathematical Society . 35 . 4 . 2003-06-09 . 0024-6093 . 10.1112/s0024609303002200 . 553–564. math/0204049. 16581168 .
E. H. Lieb, W. E. Thirring, Inequalities for the Moments of the Eigenvalues of the Schrödinger Hamiltonian and Their Relation to Sobolev Inequalities, in Studies in Mathematical Physics, edited E. Lieb, B. Simon, and A. Wightman, Princeton University Press, 269–303 (1976).
Araki . Huzihiro . On an inequality of Lieb and Thirring . Letters in Mathematical Physics . Springer Science and Business Media LLC . 19 . 2 . 1990 . 0377-9017 . 10.1007/bf01045887 . 167–170. 1990LMaPh..19..167A . 119649822 .
Z. Allen-Zhu, Y. Lee, L. Orecchia, Using Optimization to Obtain a Width-Independent, Parallel, Simpler, and Faster Positive SDP Solver, in ACM-SIAM Symposium on Discrete Algorithms, 1824–1831 (2016).
L. Lafleche, C. Saffirio, Strong Semiclassical Limit from Hartree andHartree-Fock to Vlasov-Poisson Equation, arXiv:2003.02926 [math-ph].
V. Bosboom, M. Schlottbom, F. L. Schwenninger, On the unique solvability of radiative transfer equations with polarization, in Journal of Differential Equations, (2024).
Effros . E. G. . A matrix convexity approach to some celebrated quantum inequalities . Proceedings of the National Academy of Sciences USA. Proceedings of the National Academy of Sciences . 106 . 4 . 2009-01-21 . 0027-8424 . 10.1073/pnas.0807965106 . 1006–1008. 19164582 . 2633548 . 0802.1234. 2009PNAS..106.1006E . free.
Ebadian . A. . Nikoufar . I. . Eshaghi Gordji . M. . Perspectives of matrix convex functions . Proceedings of the National Academy of Sciences . Proceedings of the National Academy of Sciences USA. 108 . 18 . 2011-04-18 . 0027-8424 . 10.1073/pnas.1102518108. 3088602 . 7313–7314. 2011PNAS..108.7313E . free .
Mirsky. L.. A trace inequality of John von Neumann. Monatshefte für Mathematik. December 1975. 79. 4. 303–306. 10.1007/BF01647331. 122252038.
Carlsson. Marcus. von Neumann's trace inequality for Hilbert-Schmidt operators. Expositiones Mathematicae. 2021. 39. 1. 149–157. 10.1016/j.exmath.2020.05.001.
Book: Marshall. Albert W.. Olkin. Ingram. Arnold. Barry. Inequalities: Theory of Majorization and Its Applications. limited. 2011. 2nd. New York . Springer. 340-341. 978-0-387-68276-1.