Rank–nullity theorem explained

The rank–nullity theorem is a theorem in linear algebra, which asserts:

the number of columns of a matrix is the sum of the rank of and the nullity of ; and
the dimension of the domain of a linear transformation is the sum of the rank of (the dimension of the image of) and the nullity of (the dimension of the kernel of).^[1] ^[2] ^[3] ^[4]

It follows that for linear transformations of vector spaces of equal finite dimension, either injectivity or surjectivity implies bijectivity.

Stating the theorem

Linear transformations

Let

T:V\toW

be a linear transformation between two vector spaces where

's domain

is finite dimensional. Then

\operatorname(T) ~+~ \operatorname(T) ~=~ \dim V,

where

\operatorname(T)

is the rank of

(the dimension of its image) and

\operatorname{nullity}(T)

is the nullity of

(the dimension of its kernel). In other words,

\dim (\operatorname T) + \dim (\operatorname T) = \dim (\operatorname(T)).

This theorem can be refined via the splitting lemma to be a statement about an isomorphism of spaces, not just dimensions. Explicitly, since

induces an isomorphism from

V/\operatorname{Ker}(T)

\operatorname{Im}(T),

the existence of a basis for

that extends any given basis of

\operatorname{Ker}(T)

implies, via the splitting lemma, that

\operatorname{Im}(T) ⊕ \operatorname{Ker}(T)\congV.

Taking dimensions, the rank–nullity theorem follows.

Matrices

Linear maps can be represented with matrices. More precisely, an

m x n

matrix represents a linear map

f:F^n\toF^m,

where

is the underlying field.^[5] So, the dimension of the domain of

is, the number of columns of, and the rank–nullity theorem for an

m x n

matrix is

\operatorname(M) + \operatorname(M) = n.

Proofs

Here we provide two proofs. The first operates in the general case, using linear maps. The second proof looks at the homogeneous system

Ax=0,

where

is a

m x n

with rank

and shows explicitly that there exists a set of

n-r

linearly independent solutions that span the null space of

While the theorem requires that the domain of the linear map be finite-dimensional, there is no such assumption on the codomain. This means that there are linear maps not given by matrices for which the theorem applies. Despite this, the first proof is not actually more general than the second: since the image of the linear map is finite-dimensional, we can represent the map from its domain to its image by a matrix, prove the theorem for that matrix, then compose with the inclusion of the image into the full codomain.

First proof

Let

V,W

be vector spaces over some field

and

defined as in the statement of the theorem with

\dimV=n

\operatorname{Ker}T\subsetV

is a subspace, there exists a basis for it. Suppose

\dim\operatorname{Ker}T=k

and let

\mathcal := \ \subset \operatorname(T)

be such a basis.

We may now, by the Steinitz exchange lemma, extend

l{K}

with

n-k

linearly independent vectors

w_1,\ldots,w_n-k

to form a full basis of

Let $\mathcal := \ \subset V \setminus \operatorname(T)$ such that $\mathcal := \mathcal \cup \mathcal = \ \subset V$ is a basis for

.From this, we know that

\operatorname T = \operatornameT(\mathcal) = \operatorname\

=\operatorname{Span}\{T(w_1),\ldots,T(w_n-k)\}=\operatorname{Span}T(l{S}).

We now claim that

T(l{S})

is a basis for

\operatorname{Im}T

.The above equality already states that

T(l{S})

is a generating set for

\operatorname{Im}T

; it remains to be shown that it is also linearly independent to conclude that it is a basis.

Suppose

T(l{S})

is not linearly independent, and let

\sum_^ \alpha _j T(w_j) = 0_W

for some

\alpha_j\inF

Thus, owing to the linearity of

, it follows that

T \left(\sum_^ \alpha _j w_j \right) = 0_W \implies \left(\sum_^ \alpha _j w_j \right) \in \operatorname T = \operatorname \mathcal \subset V .

This is a contradiction to

l{B}

being a basis, unless all

\alpha_j

are equal to zero. This shows that

T(l{S})

is linearly independent, and more specifically that it is a basis for

\operatorname{Im}T

To summarize, we have

l{K}

, a basis for

\operatorname{Ker}T

, and

T(l{S})

, a basis for

\operatorname{Im}T

Finally we may state that $\operatorname(T) + \operatorname(T) = \dim \operatorname T + \dim \operatornameT$

=|T(l{S})|+|l{K}|=(n-k)+k=n=\dimV.

This concludes our proof.

Second proof

Let

be an

m x n

matrix with

linearly independent columns (i.e.

\operatorname{Rank}(A)=r

). We will show that:

To do this, we will produce an

n x (n-r)

matrix

whose columns form a basis of the null space of

Without loss of generality, assume that the first

columns of

are linearly independent. So, we can write

\mathbf = \begin \mathbf_1 & \mathbf_2\end,

where

A₁

is an

m x r

matrix with

linearly independent column vectors, and

A₂

is an

m x (n-r)

matrix such that each of its

n-r

columns is linear combinations of the columns of

A₁

This means that

A₂=A_1B

for some

r x (n-r)

matrix

(see rank factorization) and, hence,

\mathbf = \begin \mathbf_1 & \mathbf_1\mathbf\end .

Let $\mathbf = \begin -\mathbf \\ \mathbf_ \end,$ where

I_n-r

is the

(n-r) x (n-r)

identity matrix. So,

is an

n x (n-r)

matrix such that

\mathbf\mathbf = \begin\mathbf_1 & \mathbf_1\mathbf \end\begin -\mathbf \\ \mathbf_ \end = -\mathbf_1\mathbf + \mathbf_1\mathbf = \mathbf_.

Therefore, each of the

n-r

columns of

are particular solutions of

Ax=

	m
{0}
	{F

Furthermore, the

n-r

columns of

are linearly independent because

Xu=

	n
0
	{F

} will imply

	n-r
0
	{F

} for

u\in{F}^n-r

\mathbf\mathbf = \mathbf_ \implies \begin-\mathbf \\ \mathbf_\end\mathbf = \mathbf_ \implies \begin-\mathbf\mathbf \\ \mathbf\end = \begin\mathbf_ \\ \mathbf_\end \implies \mathbf = \mathbf_.

Therefore, the column vectors of

constitute a set of

n-r

linearly independent solutions for

Ax=

0
	F^m

We next prove that any solution of

Ax=

	m
0
	{F

} must be a linear combination of the columns of

For this, let $\mathbf = \begin \mathbf_1 \\ \mathbf_2\end \in ^$

be any vector such that

Au=

	m
0
	{F

}. Since the columns of

A₁

are linearly independent,

A_1x=

	m
0
	{F

} implies

	r
0
	{F

Therefore, $\begin\mathbf\mathbf & = & \mathbf_ \\\implies \begin\mathbf_1 & \mathbf_1\mathbf\end \begin \mathbf_1 \\ \mathbf_2 \end & = & \mathbf_1\mathbf_1 + \mathbf_1\mathbf\mathbf_2 & = & \mathbf_1(\mathbf_1 + \mathbf\mathbf_2) & = & \mathbf_ \\\implies \mathbf_1 + \mathbf\mathbf_2 & = & \mathbf_ \\\implies \mathbf_1 & = & -\mathbf\mathbf_2\end$ $\implies \mathbf = \begin \mathbf_1 \\ \mathbf_2 \end= \begin -\mathbf \\ \mathbf_ \end\mathbf_2= \mathbf\mathbf_2.$

This proves that any vector

that is a solution of

Ax=0

must be a linear combination of the

n-r

special solutions given by the columns of

. And we have already seen that the columns of

are linearly independent. Hence, the columns of

constitute a basis for the null space of

. Therefore, the nullity of

n-r

. Since

equals rank of

, it follows that

\operatorname{Rank}(A)+\operatorname{Nullity}(A)=n

. This concludes our proof.

A third fundamental subspace

When

T:V\toW

is a linear transformation between two finite-dimensional subspaces, with

n=\dim(V)

and

m=\dim(W)

(so can be represented by an

m x n

matrix

), the rank–nullity theorem asserts that if

has rank

, then

n-r

is the dimension of the null space of

, which represents the kernel of

. In some texts, a third fundamental subspace associated to

is considered alongside its image and kernel: the cokernel of

is the quotient space

W/\operatorname{Im}(T)

, and its dimension is

m-r

. This dimension formula (which might also be rendered

\dim\operatorname{Im}(T)+\dim\operatorname{Coker}(T)=\dim(W)

) together with the rank–nullity theorem is sometimes called the fundamental theorem of linear algebra.^[6]

Reformulations and generalizations

This theorem is a statement of the first isomorphism theorem of algebra for the case of vector spaces; it generalizes to the splitting lemma.

In more modern language, the theorem can also be phrased as saying that each short exact sequence of vector spaces splits. Explicitly, given that $0 \rightarrow U \rightarrow V \mathbin R \rightarrow 0$ is a short exact sequence of vector spaces, then

U ⊕ R\congV

, hence

\dim(U) + \dim(R) = \dim(V) .

Here

plays the role of

\operatorname{Im}T

and

\operatorname{Ker}T

, i.e.

0 \rightarrow \ker T \mathbin V \mathbin \operatorname T \rightarrow 0

In the finite-dimensional case, this formulation is susceptible to a generalization: if $0 \rightarrow V_1 \rightarrow V_2 \rightarrow \cdots V_r \rightarrow 0$ is an exact sequence of finite-dimensional vector spaces, then^[7] $\sum_^r (-1)^i\dim(V_i) = 0.$ The rank–nullity theorem for finite-dimensional vector spaces may also be formulated in terms of the index of a linear map. The index of a linear map

T\in\operatorname{Hom}(V,W)

, where

and

are finite-dimensional, is defined by

\operatorname T = \dim \operatorname(T) - \dim \operatorname T .

Intuitively,

\dim\operatorname{Ker}T

is the number of independent solutions

of the equation

Tv=0

, and

\dim\operatorname{Coker}T

is the number of independent restrictions that have to be put on

to make

Tv=w

solvable. The rank–nullity theorem for finite-dimensional vector spaces is equivalent to the statement

\operatorname T = \dim V - \dim W .

We see that we can easily read off the index of the linear map

from the involved spaces, without any need to analyze

in detail. This effect also occurs in a much deeper result: the Atiyah–Singer index theorem states that the index of certain differential operators can be read off the geometry of the involved spaces.

References

Book: Axler, Sheldon. Linear Algebra Done Right. Springer. 2015. 978-3-319-11079-0. 3rd. Undergraduate Texts in Mathematics. Sheldon Axler.
- Book: Friedberg. Stephen H.. Linear Algebra. Insel. Arnold J.. Spence. Lawrence E.. Pearson Education. 2014. 978-0130084514. 4th.
.
Book: Katznelson. Yitzhak. A (Terse) Introduction to Linear Algebra. Katznelson. Yonatan R.. 2008. American Mathematical Society. 978-0-8218-4419-9. 2008. Yitzhak Katznelson.
Book: Valenza, Robert J.. Linear Algebra: An Introduction to Abstract Mathematics. Springer. 1993. 3-540-94099-5. 3rd. Undergraduate Texts in Mathematics. 1951.

External links

, MIT Linear Algebra Lecture on the Four Fundamental Subspaces, from MIT OpenCourseWare

Notes and References

p. 63, §3.22
p. 70, §2.1, Theorem 2.3
p. 52, §2.5.1
p. 71, §4.3
pp. 103-104, §2.4, Theorem 2.20
- Strang, Gilbert. Linear Algebra and Its Applications. 3rd ed. Orlando: Saunders, 1988.
Web site: Zaman. Ragib. Dimensions of vector spaces in an exact sequence. 27 October 2015. Mathematics Stack Exchange. DimVS.