Sylvester's criterion explained

In mathematics, Sylvester’s criterion is a necessary and sufficient criterion to determine whether a Hermitian matrix is positive-definite.

Sylvester's criterion states that a n × n Hermitian matrix M is positive-definite if and only if all the following matrices have a positive determinant:

{}\vdots

In other words, all of the leading principal minors must be positive. By using appropriate permutations of rows and columns of M, it can also be shown that the positivity of any nested sequence of n principal minors of M is equivalent to M being positive-definite.

An analogous theorem holds for characterizing positive-semidefinite Hermitian matrices, except that it is no longer sufficient to consider only the leading principal minors as illustrated by the Hermitian matrix

\begin{pmatrix} 0&0&-1\\ 0&-1&0\\ -1&0&0 \end{pmatrix}witheigenvectors \begin{pmatrix} 0\\1\\0 \end{pmatrix}, \begin{pmatrix} 1\\0\\1 \end{pmatrix}and \begin{pmatrix} 1\\0\\-1 \end{pmatrix}.

A Hermitian matrix M is positive-semidefinite if and only if all principal minors of M are nonnegative.

Proof for the case of positive definite matrices

Suppose

Mn

is

n x n

Hermitian matrix
\dagger
M
n

=Mn

. Let

Mk,k=1,\ldotsn

be the principal minor matrices, i.e. the

k x k

upper left corner matrices. It will be shown that if

Mn

is positive definite, then the principal minors are positive; that is,

\detMk>0

for all

k

.

Mk

is positive definite. Indeed, choosing

x=\left(\begin{array}{c}x1\\vdots\xk\ 0\\vdots\\0\end{array} \right)=\left(\begin{array}{c} \vec{x}\\ 0\\ \vdots\\ 0 \end{array} \right)

we can notice that

0<x\daggerMnx=\vec{x}\daggerMk\vec{x}.

Equivalently, the eigenvalues of

Mk

are positive, and this implies that

\detMk>0

since the determinant is the product of the eigenvalues.

To prove the reverse implication, we use induction. The general form of an

(n+1) x (n+1)

Hermitian matrix is

Mn+1=\left(

\dagger
\begin{array}{cc}M
n&\vec{v}\\vec{v}

&d\end{array}\right)    (*)

,

where

Mn

is an

n x n

Hermitian matrix,

\vec{v}

is a vector and

d

is a real constant.

Suppose the criterion holds for

Mn

. Assuming that all the principal minors of

Mn+1

are positive implies that

\detMn+1>0

,

\detMn>0

, and that

Mn

is positive definite by the inductive hypothesis. Denote

x=\left(\begin{array}{c}\vec{x}\xn+1\end{array}\right)

then

x\daggerMn+1x=\vec{x}\daggerMn\vec{x}+xn+1\vec{x}\dagger\vec{v}+\bar{x}n+1\vec{v}\dagger\vec{x}+d|xn+1|2

By completing the squares, this last expression is equal to

(\vec{x}\dagger+\vec{v}\dagger

-1
M
n

\bar{x}n+1)Mn(\vec{x}+xn+1

-1
M
n

\vec{v})-|xn+1|2\vec{v}\dagger

-1
M
n

\vec{v}+d|xn+1|2

=(\vec{x}+\vec{c})\daggerMn(\vec{x}+\vec{c})+|xn+1|2(d-\vec{v}\dagger

-1
M
n

\vec{v})

where

\vec{c}=xn+1

-1
M
n

\vec{v}

(note that
-1
M
n
exists because the eigenvalues of

Mn

are all positive.) The first term is positive by the inductive hypothesis. We now examine the sign of the second term. By using the block matrix determinant formula

\det\left(\begin{array}{cc}A&B\\C&D\end{array}\right)=\detA\det(D-CA-1B)

on

(*)

we obtain

\detMn+1=\det

\dagger
M
n(d-\vec{v}
-1
M
n

\vec{v})>0

, which implies

d-\vec{v}\dagger

-1
M
n

\vec{v}>0

. Consequently,

x\daggerMn+1x>0.

Proof for the case of positive semidefinite matrices

Let

Mn

be an n x n Hermitian matrix. Suppose

Mn

is semidefinite. Essentially the same proof as for the case that

Mn

is strictly positive definite shows that all principal minors (not necessarily the leading principal minors) are non-negative.

For the reverse implication, it suffices to show that if

Mn

has all non-negative principal minors, then for all t>0, all leading principal minors of the Hermitian matrix

Mn+tIn

are strictly positive, where

In

is the nxn identity matrix. Indeed, from the positive definite case, we would know that the matrices

Mn+tIn

are strictly positive definite. Since the limit of positive definite matrices is always positive semidefinite, we can take

t\to0

to conclude.

To show this, let

Mk

be the kth leading principal submatrix of

Mn.

We know that

qk(t)=\det(Mk+tIk)

is a polynomial in t, related to the characteristic polynomial
p
Mk
viaq_k(t) = (-1)^kp_(-t). We use the identity in Characteristic polynomial#Properties to writeq_k(t) = \sum_^k t^ \operatorname\left(\textstyle\bigwedge^j M_k\right),where \operatorname\left(\bigwedge^j M_k\right) is the trace of the jth exterior power of

Mk.

From Minor_(linear_algebra)#Multilinear_algebra_approach, we know that the entries in the matrix expansion of

wedgejMk

(for j > 0) are just the minors of

Mk.

In particular, the diagonal entries are the principal minors of

Mk

, which of course are also principal minors of

Mn

, and are thus non-negative. Since the trace of a matrix is the sum of the diagonal entries, it follows that \operatorname\left(\textstyle\bigwedge^j M_k\right) \geq 0. Thus the coefficient of

tk-j

in

qk(t)

is non-negative for all j > 0. For j = 0, it is clear that the coefficient is 1. In particular,

qk(t)>0

for all t > 0, which is what was required to show.

References