Sylvester's formula explained

In matrix theory, Sylvester's formula or Sylvester's matrix theorem (named after J. J. Sylvester) or Lagrange-Sylvester interpolation expresses an analytic function f(A) of a matrix A as a polynomial in A, in terms of the eigenvalues and eigenvectors of A.^[1]^[2] It states that^[3]

f(A) = \sum_{i=1}^{k} f(\lambda_i) A_i ,

where the λ_i are the eigenvalues of A, and the matrices

A_i \equiv \prod_{j=1 \atop j \ne i}^{k} \frac{1}{\lambda_i - \lambda_j} \left(A - \lambda_j I\right)

are the corresponding Frobenius covariants of A, which are (projection) matrix Lagrange polynomials of A.
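The sum and the product above translate almost directly into code. The following is a minimal sketch in plain Python with exact rational arithmetic, assuming the distinct eigenvalues are already known and supplied by the caller. The helper names (`mat_mul`, `frobenius_covariants`, `sylvester`) and the small test matrix are illustrative choices, not part of the theorem's statement.

```python
from fractions import Fraction


def mat_mul(X, Y):
    """Multiply two square matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def frobenius_covariants(A, eigenvalues):
    """Return [A_1, ..., A_k], assuming A is diagonalizable and
    `eigenvalues` lists its distinct eigenvalues."""
    n = len(A)
    identity = [[Fraction(int(r == c)) for c in range(n)] for r in range(n)]
    covariants = []
    for i, lam_i in enumerate(eigenvalues):
        product = identity
        for j, lam_j in enumerate(eigenvalues):
            if j == i:
                continue
            # One factor (A - lam_j I) / (lam_i - lam_j) of the product.
            factor = [[(A[r][c] - lam_j * identity[r][c]) / (lam_i - lam_j)
                       for c in range(n)] for r in range(n)]
            product = mat_mul(product, factor)
        covariants.append(product)
    return covariants


def sylvester(A, eigenvalues, f):
    """Evaluate f(A) as the sum of f(lam_i) * A_i."""
    n = len(A)
    result = [[0] * n for _ in range(n)]
    for lam, A_i in zip(eigenvalues, frobenius_covariants(A, eigenvalues)):
        weight = f(lam)
        for r in range(n):
            for c in range(n):
                result[r][c] += weight * A_i[r][c]
    return result


A = [[2, 1], [0, 3]]  # upper triangular, so the eigenvalues are 2 and 3
A_squared = sylvester(A, [2, 3], lambda x: x * x)
# A_squared equals [[4, 5], [0, 9]], the same as multiplying A by itself.
```

Passing the eigenvalues in explicitly keeps the sketch free of any numerical eigenvalue solver; in practice they would come from a linear algebra library.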

Conditions

Sylvester's formula applies for any diagonalizable matrix A with k distinct eigenvalues, λ_1, ..., λ_k, and any function f defined on some subset of the complex numbers such that f(A) is well defined. The last condition means that every eigenvalue λ_i is in the domain of f, and that every eigenvalue λ_i with multiplicity m_i > 1 is in the interior of the domain, with f being (m_i − 1) times differentiable at λ_i.^[1]

Example

Consider the two-by-two matrix:

A = \begin{bmatrix} 1 & 3 \\ 4 & 2 \end{bmatrix}.

This matrix has two eigenvalues, 5 and −2. Its Frobenius covariants are

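\begin{align}
A_1 &= c_1 r_1 = \begin{bmatrix} 3 \\ 4 \end{bmatrix} \begin{bmatrix} \frac{1}{7} & \frac{1}{7} \end{bmatrix} = \begin{bmatrix} \frac{3}{7} & \frac{3}{7} \\ \frac{4}{7} & \frac{4}{7} \end{bmatrix} = \frac{A + 2I}{5 - (-2)} \\
A_2 &= c_2 r_2 = \begin{bmatrix} \frac{1}{7} \\ -\frac{1}{7} \end{bmatrix} \begin{bmatrix} 4 & -3 \end{bmatrix} = \begin{bmatrix} \frac{4}{7} & -\frac{3}{7} \\ -\frac{4}{7} & \frac{3}{7} \end{bmatrix} = \frac{A - 5I}{-2 - 5} .
\end{align}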
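The factorization of each covariant as a column times a row, and the projection properties this implies, can be checked with exact rational arithmetic. This is a small sketch using only the vectors quoted above; the helper names `outer` and `mat_mul` are illustrative.

```python
from fractions import Fraction as F


def outer(col, row):
    """Outer product of a column vector and a row vector."""
    return [[c * r for r in row] for c in col]


def mat_mul(X, Y):
    """Multiply two square matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


A = [[1, 3], [4, 2]]
A1 = outer([3, 4], [F(1, 7), F(1, 7)])      # covariant for eigenvalue 5
A2 = outer([F(1, 7), F(-1, 7)], [4, -3])    # covariant for eigenvalue -2

identity = [[1, 0], [0, 1]]
zero = [[0, 0], [0, 0]]

# The covariants are idempotent, annihilate each other, and sum to I.
is_projection = mat_mul(A1, A1) == A1 and mat_mul(A2, A2) == A2
is_orthogonal = mat_mul(A1, A2) == zero and mat_mul(A2, A1) == zero
sums_to_identity = [[A1[r][c] + A2[r][c] for c in range(2)]
                    for r in range(2)] == identity

# Spectral decomposition: A = 5 * A1 + (-2) * A2.
rebuilt = [[5 * A1[r][c] - 2 * A2[r][c] for c in range(2)] for r in range(2)]
```

All three flags evaluate to `True`, and `rebuilt` reproduces A, which is Sylvester's formula for the identity function f(x) = x.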

Sylvester's formula then amounts to

f(A) = f(5) A_1 + f(-2) A_2 .

For instance, if f is defined by f(x) = x^{-1}, then Sylvester's formula expresses the matrix inverse f(A) = A^{-1} as

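\frac{1}{5} \begin{bmatrix} \frac{3}{7} & \frac{3}{7} \\ \frac{4}{7} & \frac{4}{7} \end{bmatrix} - \frac{1}{2} \begin{bmatrix} \frac{4}{7} & -\frac{3}{7} \\ -\frac{4}{7} & \frac{3}{7} \end{bmatrix} = \begin{bmatrix} -0.2 & 0.3 \\ 0.4 & -0.1 \end{bmatrix}.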
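The inverse obtained this way can be confirmed by multiplying it back against A. The sketch below builds the two covariants from the product formula with eigenvalues 5 and −2 and applies f(x) = 1/x; the helper name `apply` is an illustrative choice.

```python
from fractions import Fraction as F

A = [[1, 3], [4, 2]]

# Frobenius covariants: (A + 2I) / (5 - (-2)) and (A - 5I) / (-2 - 5).
A1 = [[F(A[r][c] + 2 * (r == c), 5 - (-2)) for c in range(2)] for r in range(2)]
A2 = [[F(A[r][c] - 5 * (r == c), -2 - 5) for c in range(2)] for r in range(2)]


def apply(f):
    """Evaluate f(A) = f(5) A1 + f(-2) A2 for this particular matrix."""
    return [[f(F(5)) * A1[r][c] + f(F(-2)) * A2[r][c] for c in range(2)]
            for r in range(2)]


A_inv = apply(lambda x: 1 / x)

# Multiplying A by the result should give the identity matrix.
product = [[sum(A[r][k] * A_inv[k][c] for k in range(2)) for c in range(2)]
           for r in range(2)]
```

Here `A_inv` holds the exact fractions −1/5, 3/10, 2/5 and −1/10, which agree with the decimal entries above, and `product` is the identity.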

Generalization

Sylvester's formula is only valid for diagonalizable matrices; an extension due to Arthur Buchheim, based on Hermite interpolating polynomials, covers the general case:^[4]

f(A) = \sum_{i=1}^{s} \left[ \sum_{j=0}^{n_i - 1} \frac{1}{j!} \phi_i^{(j)}(\lambda_i) \left(A - \lambda_i I\right)^j \prod_{l=1, l \ne i}^{s} \left(A - \lambda_l I\right)^{n_l} \right],

where

\phi_i(t) := f(t) / \prod_{l \ne i} \left(t - \lambda_l\right)^{n_l} .

A concise form is further given by Hans Schwerdtfeger,^[5]

f(A) = \sum_{i=1}^{s} A_i \sum_{j=0}^{n_i - 1} \frac{f^{(j)}(\lambda_i)}{j!} \left(A - \lambda_i I\right)^j ,

where A_i are the corresponding Frobenius covariants of A.
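The simplest non-diagonalizable case has a single distinct eigenvalue (s = 1). The Frobenius covariants sum to the identity, so the only covariant is then I, and the concise form reduces to a truncated Taylor expansion in (A − λI). The sketch below covers only that case. It assumes n_1 denotes the size of the largest Jordan block, a reading the text leaves implicit; using the algebraic multiplicity instead gives the same result, because the extra powers of (A − λI) vanish. The function name `f_single_eigenvalue` is hypothetical.

```python
from fractions import Fraction
from math import factorial


def mat_mul(X, Y):
    """Multiply two square matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def f_single_eigenvalue(A, lam, derivatives):
    """Evaluate sum_j f^(j)(lam) / j! * (A - lam I)^j, where
    derivatives[j] = f^(j)(lam) for j = 0, ..., n_1 - 1.

    Assumes lam is the only eigenvalue of A, so the sole covariant is I.
    """
    n = len(A)
    identity = [[Fraction(int(r == c)) for c in range(n)] for r in range(n)]
    shifted = [[A[r][c] - lam * identity[r][c] for c in range(n)]
               for r in range(n)]
    power = identity                      # (A - lam I)^0
    result = [[Fraction(0)] * n for _ in range(n)]
    for j, d in enumerate(derivatives):
        coeff = Fraction(d) / factorial(j)
        result = [[result[r][c] + coeff * power[r][c] for c in range(n)]
                  for r in range(n)]
        power = mat_mul(power, shifted)   # next power of (A - lam I)
    return result


A = [[2, 1], [0, 2]]  # a 2x2 Jordan block with eigenvalue 2, so n_1 = 2
# For f(x) = x**3: f(2) = 8 and f'(2) = 12.
A_cubed = f_single_eigenvalue(A, 2, [8, 12])
# A_cubed equals [[8, 12], [0, 8]], the same as A multiplied out three times.
```

The derivative f'(2) enters through the off-diagonal entry, which is exactly the information the diagonalizable formula cannot supply.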

Special case

See also: Euler's formula.

If a matrix A is both Hermitian and unitary, then its only possible eigenvalues are ±1, and therefore A = A_+ − A_-, where A_+ is the projector onto the subspace with eigenvalue +1, and A_- is the projector onto the subspace with eigenvalue −1. By the completeness of the eigenbasis, A_+ + A_- = I. Therefore, for any analytic function f,

\begin{align}
f(\theta A) &= f(\theta) A_+ + f(-\theta) A_- \\
&= f(\theta) \frac{I + A}{2} + f(-\theta) \frac{I - A}{2} \\
&= \frac{f(\theta) + f(-\theta)}{2} I + \frac{f(\theta) - f(-\theta)}{2} A .
\end{align}

In particular,

e^{i\theta A} = (\cos\theta) I + (i\sin\theta) A

and

A = e^{i\frac{\pi}{2}(I - A)} = e^{-i\frac{\pi}{2}(I - A)} .
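Both identities can be checked numerically against a truncated power series for the matrix exponential. The sketch below uses the Pauli X matrix as an illustrative Hermitian and unitary matrix; that choice, the angle 0.7, and the helper names `exp_series` and `max_abs_diff` are assumptions made for the example. The truncated series is adequate only because the matrices here are tiny and well scaled.

```python
import math


def mat_mul(X, Y):
    """Multiply two square matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def exp_series(M, terms=40):
    """Truncated power series for exp(M): sum of M^k / k! for k < terms."""
    n = len(M)
    term = [[complex(r == c) for c in range(n)] for r in range(n)]  # M^0 / 0!
    total = [row[:] for row in term]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, M)]
        total = [[total[r][c] + term[r][c] for c in range(n)]
                 for r in range(n)]
    return total


def max_abs_diff(X, Y):
    """Largest entrywise absolute difference between two matrices."""
    return max(abs(x - y) for row_x, row_y in zip(X, Y)
               for x, y in zip(row_x, row_y))


A = [[0, 1], [1, 0]]  # Hermitian and unitary (the Pauli X matrix)
theta = 0.7

# Closed form from the special case: (cos theta) I + (i sin theta) A.
closed = [[math.cos(theta) * (r == c) + 1j * math.sin(theta) * A[r][c]
           for c in range(2)] for r in range(2)]
series = exp_series([[1j * theta * a for a in row] for row in A])
err_euler = max_abs_diff(closed, series)

# Second identity: exp(i * pi/2 * (I - A)) should reproduce A itself.
M = [[1j * math.pi / 2 * ((r == c) - A[r][c]) for c in range(2)]
     for r in range(2)]
err_log = max_abs_diff(exp_series(M), A)
```

Both `err_euler` and `err_log` come out at the level of floating-point rounding error.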

References

Gantmacher, F. R. (1960). The Theory of Matrices, vol. I. New York: Chelsea Publishing. pp. 101–103.
Higham, Nicholas J. (2008). Functions of Matrices: Theory and Computation. Philadelphia: Society for Industrial and Applied Mathematics (SIAM). ISBN 9780898717778. OCLC 693957820.
Merzbacher, E. (1968). Matrix methods in quantum mechanics. Am. J. Phys. 36 (9): 814–821. doi:10.1119/1.1975154. Bibcode:1968AmJPh..36..814M.

Notes and References

Horn, Roger A.; Johnson, Charles R. (1991). Topics in Matrix Analysis. Cambridge University Press.
Jon F. Claerbout
Sylvester, J. J. (1883). XXXIX. On the equation to the secular inequalities in the planetary theory. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science. 16 (100): 267–269. doi:10.1080/14786448308627430. ISSN 1941-5982.
Buchheim, Arthur (1884). On the Theory of Matrices. Proceedings of the London Mathematical Society. s1-16 (1): 63–82. doi:10.1112/plms/s1-16.1.63. ISSN 0024-6115.
Schwerdtfeger, Hans (1938). Les fonctions de matrices: Les fonctions univalentes. I, Volume 1. Paris, France: Hermann.