Trace inequality explained
In mathematics, there are many kinds of inequalities involving matrices and linear operators on Hilbert spaces. This article covers some important operator inequalities connected with traces of matrices.[1] [2] [3] [4]
Basic definitions
Let
denote the space of
Hermitian
matrices,
denote the set consisting of positive semi-definite
Hermitian matrices and
denote the set of positive definite Hermitian matrices. For operators on an infinite dimensional Hilbert space we require that they be
trace class and
self-adjoint, in which case similar definitions apply, but we discuss only matrices, for simplicity.
For any real-valued function
on an interval
one may define a
matrix function
for any operator
with
eigenvalues
in
by defining it on the eigenvalues and corresponding
projectors
as
given the
spectral decomposition
Operator monotone
See main article: Operator monotone function.
A function
defined on an interval
is said to be
operator monotone if for all
and all
with eigenvalues in
the following holds,
where the inequality
means that the operator
is positive semi-definite. One may check that
is, in fact,
not operator monotone!
Operator convex
A function
is said to be
operator convex if for all
and all
with eigenvalues in
and
, the following holds
Note that the operator
has eigenvalues in
since
and
have eigenvalues in
A function
is
if
is operator convex;=, that is, the inequality above for
is reversed.
Joint convexity
A function
defined on intervals
is said to be
if for all
and all
with eigenvalues in
and all
with eigenvalues in
and any
the following holds
A function
is
if −
is jointly convex, i.e. the inequality above for
is reversed.
Trace function
Given a function
the associated
trace function on
is given by
where
has eigenvalues
and
stands for a
trace of the operator.
Convexity and monotonicity of the trace function
Let
be continuous, and let be any integer. Then, if
is monotone increasing, so is
A\mapsto\operatorname{Tr}f(A)
on
Hn.
Likewise, if
is
convex, so is
A\mapsto\operatorname{Tr}f(A)
on
Hn, andit is strictly convex if is strictly convex.
See proof and discussion in, for example.
Löwner–Heinz theorem
For
, the function
is operator monotone and operator concave.
For
, the function
is operator monotone and operator concave.
For
, the function
is operator convex. Furthermore,
is operator concave and operator monotone, while
is operator convex.
The original proof of this theorem is due to K. Löwner who gave a necessary and sufficient condition for to be operator monotone.[5] An elementary proof of the theorem is discussed in and a more general version of it in.[6]
Klein's inequality
For all Hermitian × matrices and and all differentiable convex functions
with
derivative, or for all positive-definite Hermitian × matrices and, and all differentiableconvex functions :(0,∞) →
, the following inequality holds, In either case, if is strictly convex, equality holds if and only if = .A popular choice in applications is, see below.
Proof
Let
so that, for
,
,varies from
to
.
Define
F(t)=\operatorname{Tr}[f(B+tC)]
.By convexity and monotonicity of trace functions,
is convex, and so for all
,
F(0)+t(F(1)-F(0))\geqF(t)
,which is,
,and, in fact, the right hand side is monotone decreasing in
.
Taking the limit
yields,
,which with rearrangement and substitution is Klein's inequality:
tr[f(A)-f(B)-(A-B)f'(B)]\geq0
Note that if
is strictly convex and
, then
is strictly convex. The final assertion follows from this and the fact that
is monotone decreasing in
.
Golden–Thompson inequality
See main article: Golden–Thompson inequality.
In 1965, S. Golden [7] and C.J. Thompson [8] independently discovered that
For any matrices
,
\operatorname{Tr}eA+B\leq\operatorname{Tr}eAeB.
This inequality can be generalized for three operators:[9] for non-negative operators
,
\operatorname{Tr}eln\leq
\operatorname{Tr}A(B+t)-1C(B+t)-1\operatorname{d}t.
Peierls–Bogoliubov inequality
Let
be such that Tr e
R = 1.Defining, we have
\operatorname{Tr}eFeR\geq\operatorname{Tr}eF+R\geqeg.
The proof of this inequality follows from the above combined with Klein's inequality. Take .[10]
Gibbs variational principle
Let
be a self-adjoint operator such that
is
trace class. Then for any
with
\operatorname{Tr}\gamma=1,
\operatorname{Tr}\gammaH+\operatorname{Tr}\gammaln\gamma\geq-ln\operatorname{Tr}e-H,
with equality if and only if
\gamma=\exp(-H)/\operatorname{Tr}\exp(-H).
Lieb's concavity theorem
The following theorem was proved by E. H. Lieb in. It proves and generalizes a conjecture of E. P. Wigner, M. M. Yanase, and Freeman Dyson.[11] Six years later other proofs were given by T. Ando [12] and B. Simon, and several more have been given since then.
For all
matrices
, and all
and
such that
and
, with
the real valued map on
given by
F(A,B,K)=\operatorname{Tr}(K*AqKBr)
.
Here
stands for the
adjoint operator of
Lieb's theorem
For a fixed Hermitian matrix
, the function
f(A)=\operatorname{Tr}\exp\{L+lnA\}
is concave on
.
The theorem and proof are due to E. H. Lieb, Thm 6, where he obtains this theorem as a corollary of Lieb's concavity Theorem.The most direct proof is due to H. Epstein;[13] see M.B. Ruskai papers,[14] [15] for a review of this argument.
Ando's convexity theorem
T. Ando's proof of Lieb's concavity theorem led to the following significant complement to it:
For all
matrices
, and all
and
with
, the real valued map on
given by
(A,B)\mapsto\operatorname{Tr}(K*AqKB-r)
is convex.
Joint convexity of relative entropy
For two operators
define the following map
R(A\parallelB):=\operatorname{Tr}(AlogA)-\operatorname{Tr}(AlogB).
and
, the map
R(\rho\parallel\sigma)=S(\rho\parallel\sigma)
is the Umegaki's
quantum relative entropy.
Note that the non-negativity of
follows from Klein's inequality with
.
Statement
The map
is jointly convex.
Proof
For all
,
(A,B)\mapsto\operatorname{Tr}(B1-pAp)
is jointly concave, by Lieb's concavity theorem, and thus
(A,B)\mapsto
(\operatorname{Tr}(B1-pAp)-\operatorname{Tr}A)
is convex. But
\limp →
(\operatorname{Tr}(B1-pAp)-\operatorname{Tr}A)=R(A\parallelB),
and convexity is preserved in the limit.
The proof is due to G. Lindblad.[16]
Jensen's operator and trace inequalities
The operator version of Jensen's inequality is due to C. Davis.[17]
A continuous, real function
on an interval
satisfies
Jensen's Operator Inequality if the following holds
f\left(\sumkA
k\right)\leq\sumk
k,
for operators
with
and for
self-adjoint operators
with
spectrum on
.
See,[18] for the proof of the following two theorems.
Jensen's trace inequality
Let be a continuous function defined on an interval and let and be natural numbers. If is convex, we then have the inequality
| *X |
\operatorname{Tr}l(fl(\sum | |
| kA |
kr)r)\leq
| n |
\operatorname{Tr}l(\sum | |
| k=1 |
kr),
for all (
1, ...,
n) self-adjoint × matrices with spectra contained in andall (
1, ...,
n) of × matrices with
Conversely, if the above inequality is satisfied for some and, where > 1, then is convex.
Jensen's operator inequality
For a continuous function
defined on an interval
the following conditions are equivalent:
is operator convex.
we have the inequality
for all
bounded, self-adjoint operators on an arbitrary
Hilbert space
withspectra contained in
and all
on
with
for each isometry
on an infinite-dimensional Hilbert space
andevery self-adjoint operator
with spectrum in
.
Pf(PXP+λ(1-P))P\leqPf(X)P
for each projection
on an infinite-dimensional Hilbert space
, every self-adjoint operator
with spectrum in
and every
in
.
Araki–Lieb–Thirring inequality
E. H. Lieb and W. E. Thirring proved the following inequality in [19] 1976: For any
and
In 1990 [20] H. Araki generalized the above inequality to the following one: For any
and
for
and
for
There are several other inequalities close to the Lieb–Thirring inequality, such as the following:[21] for any
and
and even more generally:
[22] for any
and
The above inequality generalizes the previous one, as can be seen by exchanging
by
and
by
with
and using the cyclicity of the trace, leading to
Additionally, building upon the Lieb-Thirring inequality the following inequality was derived: [23] For any
and all
with
, it holds that
Effros's theorem and its extension
E. Effros in [24] proved the following theorem.
If
is an operator convex function, and
and
are commuting bounded linear operators, i.e. the commutator
, the
perspective
is jointly convex, i.e. if
and
with
(i=1,2),
,
g(L,R)\leqλg(L1,R1)+(1-λ)g(L2,R2).
Ebadian et al. later extended the inequality to the case where
and
do not commute .
[25] Von Neumann's trace inequality and related results
, named after its originator John von Neumann, states that for any
complex matrices
and
with
singular values
\alpha1\geq\alpha2\geq … \geq\alphan
and
\beta1\geq\beta2\geq … \geq\betan
respectively,
[26] with equality if and only if
and
share singular vectors.
[27]
positive semi-definite complex matrices
and
where now the
eigenvalues are sorted decreasingly (
and
respectively),
References
Notes and References
- E. Carlen, Trace Inequalities and Quantum Entropy: An Introductory Course, Contemp. Math. 529 (2010) 73–140
- R. Bhatia, Matrix Analysis, Springer, (1997).
- B. Simon, Trace Ideals and their Applications, Cambridge Univ. Press, (1979); Second edition. Amer. Math. Soc., Providence, RI, (2005).
- M. Ohya, D. Petz, Quantum Entropy and Its Use, Springer, (1993).
- Löwner . Karl . Über monotone Matrixfunktionen . Mathematische Zeitschrift . Springer Science and Business Media LLC . 38 . 1 . 1934 . 0025-5874 . 10.1007/bf01170633 . 177–216 . 121439134 . de.
- [William F. Donoghue Jr.|W.F. Donoghue, Jr.]
- Golden . Sidney . Lower Bounds for the Helmholtz Function . Physical Review . American Physical Society (APS) . 137 . 4B . 1965-02-22 . 0031-899X . 10.1103/physrev.137.b1127 . B1127–B1128. 1965PhRv..137.1127G .
- Thompson . Colin J. . Inequality with Applications in Statistical Mechanics . Journal of Mathematical Physics . AIP Publishing . 6 . 11 . 1965 . 0022-2488 . 10.1063/1.1704727 . 1812–1813. 1965JMP.....6.1812T .
- Lieb . Elliott H . Convex trace functions and the Wigner-Yanase-Dyson conjecture . . 11 . 3 . 1973 . 0001-8708 . 10.1016/0001-8708(73)90011-x . free . 267–288.
- D. Ruelle, Statistical Mechanics: Rigorous Results, World Scient. (1969).
- Wigner . Eugene P. . Yanase . Mutsuo M. . On the Positive Semidefinite Nature of a Certain Matrix Expression . Canadian Journal of Mathematics . Canadian Mathematical Society . 16 . 1964 . 0008-414X . 10.4153/cjm-1964-041-x . 397–406. 124032721 .
- Ando . T. . Concavity of certain maps on positive definite matrices and applications to Hadamard products . Linear Algebra and Its Applications . Elsevier BV . 26 . 1979 . 0024-3795 . 10.1016/0024-3795(79)90179-4 . 203–241. free .
- Epstein . H. . Remarks on two theorems of E. Lieb . Communications in Mathematical Physics . Springer Science and Business Media LLC . 31 . 4 . 1973 . 0010-3616 . 10.1007/bf01646492 . 317–325. 1973CMaPh..31..317E . 120096681 .
- Ruskai . Mary Beth . Inequalities for quantum entropy: A review with conditions for equality . Journal of Mathematical Physics . AIP Publishing . 43 . 9 . 2002 . 0022-2488 . 10.1063/1.1497701 . 4358–4375. quant-ph/0205064 . 2002JMP....43.4358R . 3051292 .
- Ruskai . Mary Beth . Another short and elementary proof of strong subadditivity of quantum entropy . Reports on Mathematical Physics . Elsevier BV . 60 . 1 . 2007 . 0034-4877 . 10.1016/s0034-4877(07)00019-5 . 1–12. quant-ph/0604206 . 2007RpMP...60....1R . 1432137 .
- Lindblad . Göran . Expectations and entropy inequalities for finite quantum systems . Communications in Mathematical Physics . Springer Science and Business Media LLC . 39 . 2 . 1974 . 0010-3616 . 10.1007/bf01608390 . 111–119. 1974CMaPh..39..111L . 120760667 .
- C. Davis, A Schwarz inequality for convex operator functions, Proc. Amer. Math. Soc. 8, 42–44, (1957).
- Hansen . Frank . Pedersen . Gert K. . Jensen's Operator Inequality . Bulletin of the London Mathematical Society . 35 . 4 . 2003-06-09 . 0024-6093 . 10.1112/s0024609303002200 . 553–564. math/0204049. 16581168 .
- E. H. Lieb, W. E. Thirring, Inequalities for the Moments of the Eigenvalues of the Schrödinger Hamiltonian and Their Relation to Sobolev Inequalities, in Studies in Mathematical Physics, edited E. Lieb, B. Simon, and A. Wightman, Princeton University Press, 269–303 (1976).
- Araki . Huzihiro . On an inequality of Lieb and Thirring . Letters in Mathematical Physics . Springer Science and Business Media LLC . 19 . 2 . 1990 . 0377-9017 . 10.1007/bf01045887 . 167–170. 1990LMaPh..19..167A . 119649822 .
- Z. Allen-Zhu, Y. Lee, L. Orecchia, Using Optimization to Obtain a Width-Independent, Parallel, Simpler, and Faster Positive SDP Solver, in ACM-SIAM Symposium on Discrete Algorithms, 1824–1831 (2016).
- L. Lafleche, C. Saffirio, Strong Semiclassical Limit from Hartree andHartree-Fock to Vlasov-Poisson Equation, arXiv:2003.02926 [math-ph].
- V. Bosboom, M. Schlottbom, F. L. Schwenninger, On the unique solvability of radiative transfer equations with polarization, in Journal of Differential Equations, (2024).
- Effros . E. G. . A matrix convexity approach to some celebrated quantum inequalities . Proceedings of the National Academy of Sciences USA. Proceedings of the National Academy of Sciences . 106 . 4 . 2009-01-21 . 0027-8424 . 10.1073/pnas.0807965106 . 1006–1008. 19164582 . 2633548 . 0802.1234. 2009PNAS..106.1006E . free.
- Ebadian . A. . Nikoufar . I. . Eshaghi Gordji . M. . Perspectives of matrix convex functions . Proceedings of the National Academy of Sciences . Proceedings of the National Academy of Sciences USA. 108 . 18 . 2011-04-18 . 0027-8424 . 10.1073/pnas.1102518108. 3088602 . 7313–7314. 2011PNAS..108.7313E . free .
- Mirsky. L.. A trace inequality of John von Neumann. Monatshefte für Mathematik. December 1975. 79. 4. 303–306. 10.1007/BF01647331. 122252038.
- Carlsson. Marcus. von Neumann's trace inequality for Hilbert-Schmidt operators. Expositiones Mathematicae. 2021. 39. 1. 149-157. 10.1016/j.exmath.2020.05.001.
- Book: Marshall. Albert W.. Olkin. Ingram. Arnold. Barry. Inequalities: Theory of Majorization and Its Applications. limited. 2011. 2nd. New York . Springer. 340-341. 978-0-387-68276-1.