Eigenvalues and eigenvectors explained

In linear algebra, an eigenvector or characteristic vector is a vector that has its direction unchanged by a given linear transformation. More precisely, an eigenvector,

v

, of a linear transformation,

T

, is scaled by a constant factor,

λ

, when the linear transformation is applied to it:

Tv=λv

. It is often important to know these vectors in linear algebra. The corresponding eigenvalue, characteristic value, or characteristic root is the multiplying factor

λ

.

Geometrically, vectors are multi-dimensional quantities with magnitude and direction, often pictured as arrows. A linear transformation rotates, stretches, or shears the vectors upon which it acts. Its eigenvectors are those vectors that are only stretched, with neither rotation nor shear. The corresponding eigenvalue is the factor by which an eigenvector is stretched or squished. If the eigenvalue is negative, the eigenvector's direction is reversed.

The eigenvectors and eigenvalues of a linear transformation serve to characterize it, and so they play important roles in all the areas where linear algebra is applied, from geology to quantum mechanics. In particular, it is often the case that a system is represented by a linear transformation whose outputs are fed as inputs to the same transformation (feedback). In such an application, the largest eigenvalue is of particular importance, because it governs the long-term behavior of the system after many applications of the linear transformation, and the associated eigenvector is the steady state of the system.

Definition

Consider a matrix,, and a nonzero vector,

v

. If applying to

v

(denoted by

Av

) simply scales

v

by a factor of, where is a scalar, then

v

is an eigenvector of, and is the corresponding eigenvalue. This relationship can be expressed as:

Av=λv

.[1]

There is a direct correspondence between n-by-n square matrices and linear transformations from an n-dimensional vector space into itself, given any basis of the vector space. Hence, in a finite-dimensional vector space, it is equivalent to define eigenvalues and eigenvectors using either the language of matrices, or the language of linear transformations.

If is finite-dimensional, the above equation is equivalent toA\mathbf = \lambda \mathbf,

where is the matrix representation of and is the coordinate vector of .

Overview

Eigenvalues and eigenvectors feature prominently in the analysis of linear transformations. The prefix eigen- is adopted from the German word eigen (cognate with the English word own) for 'proper', 'characteristic', 'own'.[2] Originally used to study principal axes of the rotational motion of rigid bodies, eigenvalues and eigenvectors have a wide range of applications, for example in stability analysis, vibration analysis, atomic orbitals, facial recognition, and matrix diagonalization.

In essence, an eigenvector v of a linear transformation T is a nonzero vector that, when T is applied to it, does not change direction. Applying T to the eigenvector only scales the eigenvector by the scalar value λ, called an eigenvalue. This condition can be written as the equationT(\mathbf) = \lambda \mathbf,referred to as the eigenvalue equation or eigenequation. In general, λ may be any scalar. For example, λ may be negative, in which case the eigenvector reverses direction as part of the scaling, or it may be zero or complex.

The example here, based on the Mona Lisa, provides a simple illustration. Each point on the painting can be represented as a vector pointing from the center of the painting to that point. The linear transformation in this example is called a shear mapping. Points in the top half are moved to the right, and points in the bottom half are moved to the left, proportional to how far they are from the horizontal axis that goes through the middle of the painting. The vectors pointing to each point in the original image are therefore tilted right or left, and made longer or shorter by the transformation. Points along the horizontal axis do not move at all when this transformation is applied. Therefore, any vector that points directly to the right or left with no vertical component is an eigenvector of this transformation, because the mapping does not change its direction. Moreover, these eigenvectors all have an eigenvalue equal to one, because the mapping does not change their length either.

Linear transformations can take many different forms, mapping vectors in a variety of vector spaces, so the eigenvectors can also take many forms. For example, the linear transformation could be a differential operator like

\tfrac{d}{dx}

, in which case the eigenvectors are functions called eigenfunctions that are scaled by that differential operator, such as\frace^ = \lambda e^.Alternatively, the linear transformation could take the form of an n by n matrix, in which case the eigenvectors are n by 1 matrices. If the linear transformation is expressed in the form of an n by n matrix A, then the eigenvalue equation for a linear transformation above can be rewritten as the matrix multiplicationA\mathbf v = \lambda \mathbf v,where the eigenvector v is an n by 1 matrix. For a matrix, eigenvalues and eigenvectors can be used to decompose the matrix—for example by diagonalizing it.

Eigenvalues and eigenvectors give rise to many closely related mathematical concepts, and the prefix eigen- is applied liberally when naming them:

History

Eigenvalues are often introduced in the context of linear algebra or matrix theory. Historically, however, they arose in the study of quadratic forms and differential equations.

In the 18th century, Leonhard Euler studied the rotational motion of a rigid body, and discovered the importance of the principal axes. Joseph-Louis Lagrange realized that the principal axes are the eigenvectors of the inertia matrix.

In the early 19th century, Augustin-Louis Cauchy saw how their work could be used to classify the quadric surfaces, and generalized it to arbitrary dimensions. Cauchy also coined the term racine caractéristique (characteristic root), for what is now called eigenvalue; his term survives in characteristic equation.

Later, Joseph Fourier used the work of Lagrange and Pierre-Simon Laplace to solve the heat equation by separation of variables in his famous 1822 book Théorie analytique de la chaleur. Charles-François Sturm developed Fourier's ideas further, and brought them to the attention of Cauchy, who combined them with his own ideas and arrived at the fact that real symmetric matrices have real eigenvalues. This was extended by Charles Hermite in 1855 to what are now called Hermitian matrices.

Around the same time, Francesco Brioschi proved that the eigenvalues of orthogonal matrices lie on the unit circle, and Alfred Clebsch found the corresponding result for skew-symmetric matrices. Finally, Karl Weierstrass clarified an important aspect in the stability theory started by Laplace, by realizing that defective matrices can cause instability.

In the meantime, Joseph Liouville studied eigenvalue problems similar to those of Sturm; the discipline that grew out of their work is now called Sturm–Liouville theory. Schwarz studied the first eigenvalue of Laplace's equation on general domains towards the end of the 19th century, while Poincaré studied Poisson's equation a few years later.

At the start of the 20th century, David Hilbert studied the eigenvalues of integral operators by viewing the operators as infinite matrices. He was the first to use the German word eigen, which means "own", to denote eigenvalues and eigenvectors in 1904, though he may have been following a related usage by Hermann von Helmholtz. For some time, the standard term in English was "proper value", but the more distinctive term "eigenvalue" is the standard today.

The first numerical algorithm for computing eigenvalues and eigenvectors appeared in 1929, when Richard von Mises published the power method. One of the most popular methods today, the QR algorithm, was proposed independently by John G. F. Francis and Vera Kublanovskaya in 1961.

Eigenvalues and eigenvectors of matrices

See also: Euclidean vector and Matrix (mathematics).

Eigenvalues and eigenvectors are often introduced to students in the context of linear algebra courses focused on matrices.[3] [4] Furthermore, linear transformations over a finite-dimensional vector space can be represented using matrices, which is especially common in numerical and computational applications.

Consider -dimensional vectors that are formed as a list of scalars, such as the three-dimensional vectors\mathbf x = \begin1\\-3\\4\end\quad\mbox\quad \mathbf y = \begin-20\\60\\-80\end.

These vectors are said to be scalar multiples of each other, or parallel or collinear, if there is a scalar such that\mathbf x = \lambda \mathbf y.

In this case,

λ=-

1
20

.

Now consider the linear transformation of -dimensional vectors defined by an by matrix,A \mathbf v = \mathbf w,or\begin A_ & A_ & \cdots & A_ \\ A_ & A_ & \cdots & A_ \\ \vdots & \vdots & \ddots & \vdots \\ A_ & A_ & \cdots & A_ \\ \end\begin v_1 \\ v_2 \\ \vdots \\ v_n \end = \begin w_1 \\ w_2 \\ \vdots \\ w_n \endwhere, for each row,w_i = A_ v_1 + A_ v_2 + \cdots + A_ v_n = \sum_^n A_ v_j.

If it occurs that and are scalar multiples, that is ifthen is an eigenvector of the linear transformation and the scale factor is the eigenvalue corresponding to that eigenvector. Equation is the eigenvalue equation for the matrix .

Equation can be stated equivalently aswhere is the by identity matrix and 0 is the zero vector.

Eigenvalues and the characteristic polynomial

See main article: Characteristic polynomial.

Equation has a nonzero solution v if and only if the determinant of the matrix is zero. Therefore, the eigenvalues of A are values of λ that satisfy the equation

Using the Leibniz formula for determinants, the left-hand side of equation is a polynomial function of the variable λ and the degree of this polynomial is n, the order of the matrix A. Its coefficients depend on the entries of A, except that its term of degree n is always (−1)nλn. This polynomial is called the characteristic polynomial of A. Equation is called the characteristic equation or the secular equation of A.

The fundamental theorem of algebra implies that the characteristic polynomial of an n-by-n matrix A, being a polynomial of degree n, can be factored into the product of n linear terms,where each λi may be real but in general is a complex number. The numbers λ1, λ2, ..., λn, which may not all have distinct values, are roots of the polynomial and are the eigenvalues of A.

As a brief example, which is described in more detail in the examples section later, consider the matrixA = \begin 2 & 1\\ 1 & 2\end.

Taking the determinant of, the characteristic polynomial of A is\det(A - \lambda I) = \begin 2 - \lambda & 1 \\ 1 & 2 - \lambda \end = 3 - 4\lambda + \lambda^2.

Setting the characteristic polynomial equal to zero, it has roots at and, which are the two eigenvalues of A. The eigenvectors corresponding to each eigenvalue can be found by solving for the components of v in the equation In this example, the eigenvectors are any nonzero scalar multiples of\mathbf v_ = \begin 1 \\ -1 \end, \quad \mathbf v_ = \begin 1 \\ 1 \end.

If the entries of the matrix A are all real numbers, then the coefficients of the characteristic polynomial will also be real numbers, but the eigenvalues may still have nonzero imaginary parts. The entries of the corresponding eigenvectors therefore may also have nonzero imaginary parts. Similarly, the eigenvalues may be irrational numbers even if all the entries of A are rational numbers or even if they are all integers. However, if the entries of A are all algebraic numbers, which include the rationals, the eigenvalues must also be algebraic numbers.

The non-real roots of a real polynomial with real coefficients can be grouped into pairs of complex conjugates, namely with the two members of each pair having imaginary parts that differ only in sign and the same real part. If the degree is odd, then by the intermediate value theorem at least one of the roots is real. Therefore, any real matrix with odd order has at least one real eigenvalue, whereas a real matrix with even order may not have any real eigenvalues. The eigenvectors associated with these complex eigenvalues are also complex and also appear in complex conjugate pairs.

Spectrum of a matrix

The spectrum of a matrix is the list of eigenvalues, repeated according to multiplicity; in an alternative notation the set of eigenvalues with their multiplicities.

An important quantity associated with the spectrum is the maximum absolute value of any eigenvalue. This is known as the spectral radius of the matrix.

Algebraic multiplicity

Let λi be an eigenvalue of an n by n matrix A. The algebraic multiplicity μA(λi) of the eigenvalue is its multiplicity as a root of the characteristic polynomial, that is, the largest integer k such that (λλi)k divides evenly that polynomial.

Suppose a matrix A has dimension n and dn distinct eigenvalues. Whereas equation factors the characteristic polynomial of A into the product of n linear terms with some terms potentially repeating, the characteristic polynomial can also be written as the product of d terms each corresponding to a distinct eigenvalue and raised to the power of the algebraic multiplicity,\det(A - \lambda I) = (\lambda_1 - \lambda)^(\lambda_2 - \lambda)^ \cdots (\lambda_d - \lambda)^.

If d = n then the right-hand side is the product of n linear terms and this is the same as equation . The size of each eigenvalue's algebraic multiplicity is related to the dimension n as\begin 1 &\leq \mu_A(\lambda_i) \leq n, \\ \mu_A &= \sum_^d \mu_A\left(\lambda_i\right) = n.\end

If μA(λi) = 1, then λi is said to be a simple eigenvalue. If μA(λi) equals the geometric multiplicity of λi, γA(λi), defined in the next section, then λi is said to be a semisimple eigenvalue.

Eigenspaces, geometric multiplicity, and the eigenbasis for matrices

Given a particular eigenvalue λ of the n by n matrix A, define the set E to be all vectors v that satisfy equation,E = \left\.

On one hand, this set is precisely the kernel or nullspace of the matrix (AλI). On the other hand, by definition, any nonzero vector that satisfies this condition is an eigenvector of A associated with λ. So, the set E is the union of the zero vector with the set of all eigenvectors of A associated with λ, and E equals the nullspace of (AλI). E is called the eigenspace or characteristic space of A associated with λ. In general λ is a complex number and the eigenvectors are complex n by 1 matrices. A property of the nullspace is that it is a linear subspace, so E is a linear subspace of

Cn

.

Because the eigenspace E is a linear subspace, it is closed under addition. That is, if two vectors u and v belong to the set E, written, then or equivalently . This can be checked using the distributive property of matrix multiplication. Similarly, because E is a linear subspace, it is closed under scalar multiplication. That is, if and α is a complex number, or equivalently . This can be checked by noting that multiplication of complex matrices by complex numbers is commutative. As long as u + v and αv are not zero, they are also eigenvectors of A associated with λ.

The dimension of the eigenspace E associated with λ, or equivalently the maximum number of linearly independent eigenvectors associated with λ, is referred to as the eigenvalue's geometric multiplicity

\gammaA(λ)

. Because E is also the nullspace of (AλI), the geometric multiplicity of λ is the dimension of the nullspace of (AλI), also called the nullity of (AλI), which relates to the dimension and rank of (AλI) as\gamma_A(\lambda) = n - \operatorname(A - \lambda I).

Because of the definition of eigenvalues and eigenvectors, an eigenvalue's geometric multiplicity must be at least one, that is, each eigenvalue has at least one associated eigenvector. Furthermore, an eigenvalue's geometric multiplicity cannot exceed its algebraic multiplicity. Additionally, recall that an eigenvalue's algebraic multiplicity cannot exceed n. 1 \le \gamma_A(\lambda) \le \mu_A(\lambda) \le n

To prove the inequality

\gammaA(λ)\le\muA(λ)

, consider how the definition of geometric multiplicity implies the existence of

\gammaA(λ)

orthonormal eigenvectors

\boldsymbol{v}1,\ldots,

\boldsymbol{v}
\gammaA(λ)
, such that

A\boldsymbol{v}k=λ\boldsymbol{v}k

. We can therefore find a (unitary) matrix

V

whose first

\gammaA(λ)

columns are these eigenvectors, and whose remaining columns can be any orthonormal set of

n-\gammaA(λ)

vectors orthogonal to these eigenvectors of

A

. Then

V

has full rank and is therefore invertible. Evaluating

D:=VTAV

, we get a matrix whose top left block is the diagonal matrix

λ

I
\gammaA(λ)
. This can be seen by evaluating what the left-hand side does to the first column basis vectors. By reorganizing and adding

-\xiV

on both sides, we get

(A-\xiI)V=V(D-\xiI)

since

I

commutes with

V

. In other words,

A-\xiI

is similar to

D-\xiI

, and

\det(A-\xiI)=\det(D-\xiI)

. But from the definition of

D

, we know that

\det(D-\xiI)

contains a factor

(\xi-

\gammaA(λ)
λ)
, which means that the algebraic multiplicity of

λ

must satisfy

\muA(λ)\ge\gammaA(λ)

.

Suppose

A

has

d\leqn

distinct eigenvalues

λ1,\ldots,λd

, where the geometric multiplicity of

λi

is

\gammaA(λi)

. The total geometric multiplicity of

A

,\begin \gamma_A &= \sum_^d \gamma_A(\lambda_i), \\ d &\le \gamma_A \le n,\endis the dimension of the sum of all the eigenspaces of

A

's eigenvalues, or equivalently the maximum number of linearly independent eigenvectors of

A

. If

\gammaA=n

, then

A

's eigenvalues is the entire vector space

Cn

.

Cn

can be formed from

n

linearly independent eigenvectors of

A

; such a basis is called an eigenbasis

Cn

can be written as a linear combination of eigenvectors of

A

.

Additional properties of eigenvalues

Let

A

be an arbitrary

n x n

matrix of complex numbers with eigenvalues

λ1,\ldots,λn

. Each eigenvalue appears

\muA(λi)

times in this list, where

\muA(λi)

is the eigenvalue's algebraic multiplicity. The following are properties of this matrix and its eigenvalues:

A

, defined as the sum of its diagonal elements, is also the sum of all eigenvalues,

\operatorname{tr}(A)=

n
\sum
i=1

aii=

n
\sum
i=1

λi=λ1+λ2++λn.

A

is the product of all its eigenvalues,

\det(A)=

n
\prod
i=1

λi=λ2λn.

k

th power of

A

; i.e., the eigenvalues of

Ak

, for any positive integer

k

, are
k,
λ
1

\ldots,

k
λ
n
.

A

is invertible if and only if every eigenvalue is nonzero.

A

is invertible, then the eigenvalues of

A-1

are \frac, \ldots, \frac and each eigenvalue's geometric multiplicity coincides. Moreover, since the characteristic polynomial of the inverse is the reciprocal polynomial of the original, the eigenvalues share the same algebraic multiplicity.

A

is equal to its conjugate transpose

A*

, or equivalently if

A

is Hermitian, then every eigenvalue is real. The same is true of any symmetric real matrix.

A

is not only Hermitian but also positive-definite, positive-semidefinite, negative-definite, or negative-semidefinite, then every eigenvalue is positive, non-negative, negative, or non-positive, respectively.

A

is unitary, every eigenvalue has absolute value

|λi|=1

.

A

is a

n x n

matrix and

\{λ1,\ldots,λk\}

are its eigenvalues, then the eigenvalues of matrix

I+A

(where

I

is the identity matrix) are

\{λ1+1,\ldots,λk+1\}

. Moreover, if

\alpha\inC

, the eigenvalues of

\alphaI+A

are

\{λ1+\alpha,\ldots,λk+\alpha\}

. More generally, for a polynomial

P

the eigenvalues of matrix

P(A)

are

\{P(λ1),\ldots,P(λk)\}

.

Left and right eigenvectors

See also: left and right (algebra).

Many disciplines traditionally represent vectors as matrices with a single column rather than as matrices with a single row. For that reason, the word "eigenvector" in the context of matrices almost always refers to a right eigenvector, namely a column vector that right multiplies the

n x n

matrix

A

in the defining equation, equation,A \mathbf v = \lambda \mathbf v.

The eigenvalue and eigenvector problem can also be defined for row vectors that left multiply matrix

A

. In this formulation, the defining equation is\mathbf u A = \kappa \mathbf u,

where

\kappa

is a scalar and

u

is a

1 x n

matrix. Any row vector

u

satisfying this equation is called a left eigenvector of

A

and

\kappa

is its associated eigenvalue. Taking the transpose of this equation,A^\textsf \mathbf u^\textsf = \kappa \mathbf u^\textsf.

Comparing this equation to equation, it follows immediately that a left eigenvector of

A

is the same as the transpose of a right eigenvector of

Asf{T}

, with the same eigenvalue. Furthermore, since the characteristic polynomial of

Asf{T}

is the same as the characteristic polynomial of

A

, the left and right eigenvectors of

A

are associated with the same eigenvalues.

Diagonalization and the eigendecomposition

See main article: Eigendecomposition of a matrix.

Suppose the eigenvectors of A form a basis, or equivalently A has n linearly independent eigenvectors v1, v2, ..., vn with associated eigenvalues λ1, λ2, ..., λn. The eigenvalues need not be distinct. Define a square matrix Q whose columns are the n linearly independent eigenvectors of A,

Q=\begin{bmatrix}v1&v2&&vn\end{bmatrix}.

Since each column of Q is an eigenvector of A, right multiplying A by Q scales each column of Q by its associated eigenvalue,

AQ=\begin{bmatrix}λ1v1&λ2v2&&λnvn\end{bmatrix}.

With this in mind, define a diagonal matrix Λ where each diagonal element Λii is the eigenvalue associated with the ith column of Q. Then

AQ=QΛ.

Because the columns of Q are linearly independent, Q is invertible. Right multiplying both sides of the equation by Q−1,

A=QΛQ-1,

or by instead left multiplying both sides by Q−1,

Q-1AQ=Λ.

A can therefore be decomposed into a matrix composed of its eigenvectors, a diagonal matrix with its eigenvalues along the diagonal, and the inverse of the matrix of eigenvectors. This is called the eigendecomposition and it is a similarity transformation. Such a matrix A is said to be similar to the diagonal matrix Λ or diagonalizable. The matrix Q is the change of basis matrix of the similarity transformation. Essentially, the matrices A and Λ represent the same linear transformation expressed in two different bases. The eigenvectors are used as the basis when representing the linear transformation as Λ.

Conversely, suppose a matrix A is diagonalizable. Let P be a non-singular square matrix such that P−1AP is some diagonal matrix D. Left multiplying both by P, . Each column of P must therefore be an eigenvector of A whose eigenvalue is the corresponding diagonal element of D. Since the columns of P must be linearly independent for P to be invertible, there exist n linearly independent eigenvectors of A. It then follows that the eigenvectors of A form a basis if and only if A is diagonalizable.

A matrix that is not diagonalizable is said to be defective. For defective matrices, the notion of eigenvectors generalizes to generalized eigenvectors and the diagonal matrix of eigenvalues generalizes to the Jordan normal form. Over an algebraically closed field, any matrix A has a Jordan normal form and therefore admits a basis of generalized eigenvectors and a decomposition into generalized eigenspaces.

Variational characterization

See main article: Min-max theorem.

In the Hermitian case, eigenvalues can be given a variational characterization. The largest eigenvalue of

H

is the maximum value of the quadratic form

xsf{T}Hx/xsf{T}x

. A value of

x

that realizes that maximum is an eigenvector.

Matrix examples

Two-dimensional matrix example

Consider the matrixA = \begin 2 & 1\\ 1 & 2\end.

The figure on the right shows the effect of this transformation on point coordinates in the plane. The eigenvectors v of this transformation satisfy equation, and the values of λ for which the determinant of the matrix (A − λI) equals zero are the eigenvalues.

Taking the determinant to find characteristic polynomial of A,\begin \det(A - \lambda I) &= \left|\begin 2 & 1 \\ 1 & 2 \end - \lambda\begin 1 & 0 \\ 0 & 1 \end\right| = \begin 2 - \lambda & 1 \\ 1 & 2 - \lambda \end \\[6pt] &= 3 - 4\lambda + \lambda^2 \\[6pt] &= (\lambda - 3)(\lambda - 1).\end

Setting the characteristic polynomial equal to zero, it has roots at and, which are the two eigenvalues of A.

For, equation becomes,(A - I)\mathbf_ = \begin 1 & 1\\ 1 & 1\end\beginv_1 \\ v_2\end = \begin0 \\ 0\end1v_1 + 1v_2 = 0

Any nonzero vector with v1 = −v2 solves this equation. Therefore,\mathbf_ = \begin v_1 \\ -v_1 \end = \begin 1 \\ -1 \endis an eigenvector of A corresponding to λ = 1, as is any scalar multiple of this vector.

For, equation becomes\begin (A - 3I)\mathbf_ &= \begin -1 & 1\\ 1 & -1 \end \begin v_1 \\ v_2 \end = \begin 0 \\ 0 \end \\ -1v_1 + 1v_2 &= 0;\\ 1v_1 - 1v_2 &= 0\end

Any nonzero vector with v1 = v2 solves this equation. Therefore,\mathbf v_ = \begin v_1 \\ v_1 \end = \begin 1 \\ 1 \end

is an eigenvector of A corresponding to λ = 3, as is any scalar multiple of this vector.

Thus, the vectors vλ=1 and vλ=3 are eigenvectors of A associated with the eigenvalues and, respectively.

Three-dimensional matrix example

Consider the matrixA = \begin 2 & 0 & 0 \\ 0 & 3 & 4 \\ 0 & 4 & 9\end.

The characteristic polynomial of A is\begin \det(A - \lambda I) &= \left|\begin 2 & 0 & 0 \\ 0 & 3 & 4 \\ 0 & 4 & 9 \end - \lambda\begin 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end\right| = \begin 2 - \lambda & 0 & 0 \\ 0 & 3 - \lambda & 4 \\ 0 & 4 & 9 - \lambda \end, \\[6pt] &= (2 - \lambda)\bigl[(3 - \lambda)(9 - \lambda) - 16\bigr] = -\lambda^3 + 14\lambda^2 - 35\lambda + 22.\end

The roots of the characteristic polynomial are 2, 1, and 11, which are the only three eigenvalues of A. These eigenvalues correspond to the eigenvectors and or any nonzero multiple thereof.

Three-dimensional matrix example with complex eigenvalues

Consider the cyclic permutation matrixA = \begin 0 & 1 & 0\\ 0 & 0 & 1\\ 1 & 0 & 0\end.

This matrix shifts the coordinates of the vector up by one position and moves the first coordinate to the bottom. Its characteristic polynomial is 1 − λ3, whose roots are\begin \lambda_1 &= 1 \\ \lambda_2 &= -\frac + i \frac \\ \lambda_3 &= \lambda_2^* = -\frac - i \frac\endwhere

i

is an imaginary unit with

For the real eigenvalue λ1 = 1, any vector with three equal nonzero entries is an eigenvector. For example, A \begin 5\\ 5\\ 5 \end = \begin 5\\ 5\\ 5 \end = 1 \cdot \begin 5\\ 5\\ 5 \end.

For the complex conjugate pair of imaginary eigenvalues,\lambda_2\lambda_3 = 1, \quad \lambda_2^2 = \lambda_3, \quad \lambda_3^2 = \lambda_2.

Then A \begin 1 \\ \lambda_2 \\ \lambda_3 \end = \begin \lambda_2 \\ \lambda_3 \\ 1 \end = \lambda_2 \cdot \begin 1 \\ \lambda_2 \\ \lambda_3 \end,and A \begin 1 \\ \lambda_3 \\ \lambda_2 \end = \begin \lambda_3 \\ \lambda_2 \\ 1 \end = \lambda_3 \cdot \begin 1 \\ \lambda_3 \\ \lambda_2 \end.

Therefore, the other two eigenvectors of A are complex and are

v
λ2

=\begin{bmatrix}1&λ2&

sf{T}
λ
3\end{bmatrix}
and
v
λ3

=\begin{bmatrix}1&λ3&

sf{T}
λ
2\end{bmatrix}
with eigenvalues λ2 and λ3, respectively. The two complex eigenvectors also appear in a complex conjugate pair,\mathbf v_ = \mathbf v_^*.

Diagonal matrix example

Matrices with entries only along the main diagonal are called diagonal matrices. The eigenvalues of a diagonal matrix are the diagonal elements themselves. Consider the matrixA = \begin 1 & 0 & 0\\ 0 & 2 & 0\\ 0 & 0 & 3\end.

The characteristic polynomial of A is\det(A - \lambda I) = (1 - \lambda)(2 - \lambda)(3 - \lambda),

which has the roots,, and . These roots are the diagonal elements as well as the eigenvalues of A.

Each diagonal element corresponds to an eigenvector whose only nonzero component is in the same row as that diagonal element. In the example, the eigenvalues correspond to the eigenvectors, \mathbf v_ = \begin 1\\ 0\\ 0 \end,\quad \mathbf v_ = \begin 0\\ 1\\ 0 \end,\quad \mathbf v_ = \begin 0\\ 0\\ 1 \end,

respectively, as well as scalar multiples of these vectors.

Triangular matrix example

A matrix whose elements above the main diagonal are all zero is called a lower triangular matrix, while a matrix whose elements below the main diagonal are all zero is called an upper triangular matrix. As with diagonal matrices, the eigenvalues of triangular matrices are the elements of the main diagonal.

Consider the lower triangular matrix,A = \begin 1 & 0 & 0\\ 1 & 2 & 0\\ 2 & 3 & 3\end.

The characteristic polynomial of A is\det(A - \lambda I) = (1 - \lambda)(2 - \lambda)(3 - \lambda),

which has the roots,, and . These roots are the diagonal elements as well as the eigenvalues of A.

These eigenvalues correspond to the eigenvectors, \mathbf v_ = \begin 1\\ -1\\ \frac\end,\quad \mathbf v_ = \begin 0\\ 1\\ -3\end,\quad \mathbf v_ = \begin 0\\ 0\\ 1\end,

respectively, as well as scalar multiples of these vectors.

Matrix with repeated eigenvalues example

As in the previous example, the lower triangular matrixA = \begin 2 & 0 & 0 & 0 \\ 1 & 2 & 0 & 0 \\ 0 & 1 & 3 & 0 \\ 0 & 0 & 1 & 3\end,has a characteristic polynomial that is the product of its diagonal elements,\det(A - \lambda I) = \begin 2 - \lambda & 0 & 0 & 0 \\ 1 & 2- \lambda & 0 & 0 \\ 0 & 1 & 3- \lambda & 0 \\ 0 & 0 & 1 & 3- \lambda \end = (2 - \lambda)^2(3 - \lambda)^2.

The roots of this polynomial, and hence the eigenvalues, are 2 and 3. The algebraic multiplicity of each eigenvalue is 2; in other words they are both double roots. The sum of the algebraic multiplicities of all distinct eigenvalues is μA = 4 = n, the order of the characteristic polynomial and the dimension of A.

On the other hand, the geometric multiplicity of the eigenvalue 2 is only 1, because its eigenspace is spanned by just one vector

\begin{bmatrix}0&1&-1&1\end{bmatrix}sf{T}

and is therefore 1-dimensional. Similarly, the geometric multiplicity of the eigenvalue 3 is 1 because its eigenspace is spanned by just one vector

\begin{bmatrix}0&0&0&1\end{bmatrix}sf{T}

. The total geometric multiplicity γA is 2, which is the smallest it could be for a matrix with two distinct eigenvalues. Geometric multiplicities are defined in a later section.

Eigenvector-eigenvalue identity

For a Hermitian matrix, the norm squared of the jth component of a normalized eigenvector can be calculated using only the matrix eigenvalues and the eigenvalues of the corresponding minor matrix,|v_|^2 = \frac,where M_j is the submatrix formed by removing the jth row and column from the original matrix. This identity also extends to diagonalizable matrices, and has been rediscovered many times in the literature.

Eigenvalues and eigenfunctions of differential operators

See main article: Eigenfunction.

The definitions of eigenvalue and eigenvectors of a linear transformation T remains valid even if the underlying vector space is an infinite-dimensional Hilbert or Banach space. A widely used class of linear transformations acting on infinite-dimensional spaces are the differential operators on function spaces. Let D be a linear differential operator on the space C of infinitely differentiable real functions of a real argument t. The eigenvalue equation for D is the differential equationD f(t) = \lambda f(t)

The functions that satisfy this equation are eigenvectors of D and are commonly called eigenfunctions.

Derivative operator example

Consider the derivative operator

\tfrac{d}{dt}

with eigenvalue equation\fracf(t) = \lambda f(t).

This differential equation can be solved by multiplying both sides by dt/f(t) and integrating. Its solution, the exponential functionf(t) = f(0)e^,is the eigenfunction of the derivative operator. In this case the eigenfunction is itself a function of its associated eigenvalue. In particular, for λ = 0 the eigenfunction f(t) is a constant.

The main eigenfunction article gives other examples.

General definition

The concept of eigenvalues and eigenvectors extends naturally to arbitrary linear transformations on arbitrary vector spaces. Let V be any vector space over some field K of scalars, and let T be a linear transformation mapping V into V,T:V \to V.

We say that a nonzero vector vV is an eigenvector of T if and only if there exists a scalar λK such that

This equation is called the eigenvalue equation for T, and the scalar λ is the eigenvalue of T corresponding to the eigenvector v. T(v) is the result of applying the transformation T to the vector v, while λv is the product of the scalar λ with v.

Eigenspaces, geometric multiplicity, and the eigenbasis

Given an eigenvalue λ, consider the setE = \left\,

which is the union of the zero vector with the set of all eigenvectors associated with λ. E is called the eigenspace or characteristic space of T associated with λ.

By definition of a linear transformation,\begin T(\mathbf + \mathbf) &= T(\mathbf) + T(\mathbf),\\ T(\alpha \mathbf) &= \alpha T(\mathbf),\end

for xy ∈ V and α ∈ K. Therefore, if u and v are eigenvectors of T associated with eigenvalue λ, namely uv ∈ E, then\begin T(\mathbf + \mathbf) &= \lambda (\mathbf + \mathbf),\\ T(\alpha \mathbf) &= \lambda (\alpha \mathbf).\end

So, both u + v and αv are either zero or eigenvectors of T associated with λ, namely u + v, αvE, and E is closed under addition and scalar multiplication. The eigenspace E associated with λ is therefore a linear subspace of V.[5] If that subspace has dimension 1, it is sometimes called an eigenline.

The geometric multiplicity γT(λ) of an eigenvalue λ is the dimension of the eigenspace associated with λ, i.e., the maximum number of linearly independent eigenvectors associated with that eigenvalue. By the definition of eigenvalues and eigenvectors, γT(λ) ≥ 1 because every eigenvalue has at least one eigenvector.

The eigenspaces of T always form a direct sum. As a consequence, eigenvectors of different eigenvalues are always linearly independent. Therefore, the sum of the dimensions of the eigenspaces cannot exceed the dimension n of the vector space on which T operates, and there cannot be more than n distinct eigenvalues.

Any subspace spanned by eigenvectors of T is an invariant subspace of T, and the restriction of T to such a subspace is diagonalizable. Moreover, if the entire vector space V can be spanned by the eigenvectors of T, or equivalently if the direct sum of the eigenspaces associated with all the eigenvalues of T is the entire vector space V, then a basis of V called an eigenbasis can be formed from linearly independent eigenvectors of T. When T admits an eigenbasis, T is diagonalizable.

Spectral theory

See main article: Spectral theory.

If λ is an eigenvalue of T, then the operator (TλI) is not one-to-one, and therefore its inverse (TλI)−1 does not exist. The converse is true for finite-dimensional vector spaces, but not for infinite-dimensional vector spaces. In general, the operator (TλI) may not have an inverse even if λ is not an eigenvalue.

For this reason, in functional analysis eigenvalues can be generalized to the spectrum of a linear operator T as the set of all scalars λ for which the operator (TλI) has no bounded inverse. The spectrum of an operator always contains all its eigenvalues but is not limited to them.

Associative algebras and representation theory

See main article: Weight (representation theory).

One can generalize the algebraic object that is acting on the vector space, replacing a single operator acting on a vector space with an algebra representation – an associative algebra acting on a module. The study of such actions is the field of representation theory.

The representation-theoretical concept of weight is an analog of eigenvalues, while weight vectors and weight spaces are the analogs of eigenvectors and eigenspaces, respectively.

Hecke eigensheaf is a tensor-multiple of itself and is considered in Langlands correspondence.

Dynamic equations

The simplest difference equations have the form

xt=a1xt-1+a2xt-2++akxt-k.

The solution of this equation for x in terms of t is found by using its characteristic equation

λk-

k-1
a

-

k-2
a

--ak-1λ-ak=0,

which can be found by stacking into matrix form a set of equations consisting of the above difference equation and the k – 1 equations

xt-1=xt-1,...,xt-k+1=xt-k+1,

giving a k-dimensional system of the first order in the stacked variable vector

\begin{bmatrix}xt&&xt-k+1\end{bmatrix}

in terms of its once-lagged value, and taking the characteristic equation of this system's matrix. This equation gives k characteristic roots

λ1,\ldots,λk,

for use in the solution equation

xt=c

t
1

++ckλ

t.
k

A similar procedure is used for solving a differential equation of the form

dkx
dtk

+ak-1

dk-1x
dtk-1

++

a
1dx
dt

+a0x=0.

Calculation

See main article: Eigenvalue algorithm. The calculation of eigenvalues and eigenvectors is a topic where theory, as presented in elementary linear algebra textbooks, is often very far from practice.

Classical method

The classical method is to first find the eigenvalues, and then calculate the eigenvectors for each eigenvalue. It is in several ways poorly suited for non-exact arithmetics such as floating-point.

Eigenvalues

The eigenvalues of a matrix

A

can be determined by finding the roots of the characteristic polynomial. This is easy for

2 x 2

matrices, but the difficulty increases rapidly with the size of the matrix.

In theory, the coefficients of the characteristic polynomial can be computed exactly, since they are sums of products of matrix elements; and there are algorithms that can find all the roots of a polynomial of arbitrary degree to any required accuracy. However, this approach is not viable in practice because the coefficients would be contaminated by unavoidable round-off errors, and the roots of a polynomial can be an extremely sensitive function of the coefficients (as exemplified by Wilkinson's polynomial). Even for matrices whose elements are integers the calculation becomes nontrivial, because the sums are very long; the constant term is the determinant, which for an

n x n

matrix is a sum of

n!

different products.

Explicit algebraic formulas for the roots of a polynomial exist only if the degree

n

is 4 or less. According to the Abel–Ruffini theorem there is no general, explicit and exact algebraic formula for the roots of a polynomial with degree 5 or more. (Generality matters because any polynomial with degree

n

is the characteristic polynomial of some companion matrix of order

n

.) Therefore, for matrices of order 5 or more, the eigenvalues and eigenvectors cannot be obtained by an explicit algebraic formula, and must therefore be computed by approximate numerical methods. Even the exact formula for the roots of a degree 3 polynomial is numerically impractical.

Eigenvectors

Once the (exact) value of an eigenvalue is known, the corresponding eigenvectors can be found by finding nonzero solutions of the eigenvalue equation, that becomes a system of linear equations with known coefficients. For example, once it is known that 6 is an eigenvalue of the matrixA = \begin 4 & 1\\ 6 & 3\end

we can find its eigenvectors by solving the equation

Av=6v

, that is\begin 4 & 1\\ 6 & 3\end\beginx \\y\end = 6 \cdot \beginx \\y\end

This matrix equation is equivalent to two linear equations \left\

Notes and References

  1. Book: Introduction to Linear Algebra . Gilbert Strang . 5 . 6: Eigenvalues and Eigenvectors . Wellesley-Cambridge Press.
  2. Web site: Eigenvector and Eigenvalue. 2020-08-19. www.mathsisfun.com.
  3. Cornell University Department of Mathematics (2016) Lower-Level Courses for Freshmen and Sophomores. Accessed on 2016-03-27.
  4. University of Michigan Mathematics (2016) Math Course Catalogue . Accessed on 2016-03-27.
  5. Lemma for the eigenspace