Vector space explained

In mathematics and physics, a vector space (also called a linear space) is a set whose elements, often called vectors, can be added together and multiplied ("scaled") by numbers called scalars. Scalars are often real numbers, but can be complex numbers or, more generally, elements of any field. The operations of vector addition and scalar multiplication must satisfy certain requirements, called vector axioms. Real vector spaces and complex vector spaces are kinds of vector spaces based on different kinds of scalars: real numbers and complex numbers.

Vector spaces generalize Euclidean vectors, which allow modeling of physical quantities, such as forces and velocity, that have not only a magnitude, but also a direction. The concept of vector spaces is fundamental for linear algebra, together with the concept of matrices, which allows computing in vector spaces. This provides a concise and synthetic way for manipulating and studying systems of linear equations.

Vector spaces are characterized by their dimension, which, roughly speaking, specifies the number of independent directions in the space. This means that, for two vector spaces over a given field and with the same dimension, the properties that depend only on the vector-space structure are exactly the same (technically the vector spaces are isomorphic). A vector space is finite-dimensional if its dimension is a natural number. Otherwise, it is infinite-dimensional, and its dimension is an infinite cardinal. Finite-dimensional vector spaces occur naturally in geometry and related areas. Infinite-dimensional vector spaces occur in many areas of mathematics. For example, polynomial rings are countably infinite-dimensional vector spaces, and many function spaces have the cardinality of the continuum as a dimension.

Many vector spaces that are considered in mathematics are also endowed with other structures. This is the case of algebras, which include field extensions, polynomial rings, associative algebras and Lie algebras. This is also the case of topological vector spaces, which include function spaces, inner product spaces, normed spaces, Hilbert spaces and Banach spaces.

Definition and basic properties

In this article, vectors are represented in boldface to distinguish them from scalars.^[1]

A vector space over a field is a non-empty set together with a binary operation and a binary function that satisfy the eight axioms listed below. In this context, the elements of are commonly called vectors, and the elements of are called scalars.

The binary operation, called vector addition or simply addition assigns to any two vectors and in a third vector in which is commonly written as, and called the sum of these two vectors.
The binary function, called scalar multiplication, assigns to any scalar in and any vector in another vector in, which is denoted .^[2]

To have a vector space, the eight following axioms must be satisfied for every, and in, and and in .

Axiom	Statement
Associativity of vector addition
Commutativity of vector addition
Identity element of vector addition	There exists an element, called the zero vector, such that for all .
Inverse elements of vector addition	For every, there exists an element, called the additive inverse of, such that .
Compatibility of scalar multiplication with field multiplication	^[3]
Identity element of scalar multiplication	, where denotes the multiplicative identity in .
Distributivity of scalar multiplication with respect to vector addition
Distributivity of scalar multiplication with respect to field addition

When the scalar field is the real numbers, the vector space is called a real vector space, and when the scalar field is the complex numbers, the vector space is called a complex vector space. These two cases are the most common ones, but vector spaces with scalars in an arbitrary field are also commonly considered. Such a vector space is called an vector space or a vector space over .

An equivalent definition of a vector space can be given, which is much more concise but less elementary: the first four axioms (related to vector addition) say that a vector space is an abelian group under addition, and the four remaining axioms (related to the scalar multiplication) say that this operation defines a ring homomorphism from the field into the endomorphism ring of this group.

Subtraction of two vectors can be defined as $\mathbf - \mathbf = \mathbf + (-\mathbf).$

Direct consequences of the axioms include that, for every

s\inF

and

v\inV,

one has

0v=0,

s0=0,

(-1)v=-v,

sv=0

implies

s=0

v=0.

Even more concisely, a vector space is a module over a field.

Bases, vector coordinates, and subspaces

Linear combination

Given a set of elements of a -vector space, a linear combination of elements of is an element of of the form

a_1 \mathbf_1 + a_2 \mathbf_2 + \cdots + a_k \mathbf_k,

where

a_1,\ldots,a_k\inF

and

g_1,\ldots,g_k\inG.

The scalars

a_1,\ldots,a_k

are called the coefficients of the linear combination.

Linear independence

The elements of a subset of a -vector space are said to be linearly independent if no element of can be written as a linear combination of the other elements of . Equivalently, they are linearly independent if two linear combinations of elements of define the same element of if and only if they have the same coefficients. Also equivalently, they are linearly independent if a linear combination results in the zero vector if and only if all its coefficients are zero.

Linear subspace

A linear subspace or vector subspace of a vector space is a non-empty subset of that is closed under vector addition and scalar multiplication; that is, the sum of two elements of and the product of an element of by a scalar belong to . This implies that every linear combination of elements of belongs to . A linear subspace is a vector space for the induced addition and scalar multiplication; this means that the closure property implies that the axioms of a vector space are satisfied.
The closure property also implies that every intersection of linear subspaces is a linear subspace.

Linear span

Given a subset of a vector space, the linear span or simply the span of is the smallest linear subspace of that contains, in the sense that it is the intersection of all linear subspaces that contain . The span of is also the set of all linear combinations of elements of .
If is the span of, one says that spans or generates, and that is a spanning set or a generating set of .

Basis and dimension

A subset of a vector space is a basis if its elements are linearly independent and span the vector space. Every vector space has at least one basis, or many in general (see). Moreover, all bases of a vector space have the same cardinality, which is called the dimension of the vector space (see Dimension theorem for vector spaces). This is a fundamental property of vector spaces, which is detailed in the remainder of the section.Bases are a fundamental tool for the study of vector spaces, especially when the dimension is finite. In the infinite-dimensional case, the existence of infinite bases, often called Hamel bases, depends on the axiom of choice. It follows that, in general, no base can be explicitly described. For example, the real numbers form an infinite-dimensional vector space over the rational numbers, for which no specific basis is known.

Consider a basis

(b_1,b₂,\ldots,b_n)

of a vector space of dimension over a field . The definition of a basis implies that every

v\inV

may be written

\mathbf v = a_1 \mathbf b_1 + \cdots + a_n \mathbf b_n,

with

a_1,...,a_n

in, and that this decomposition is unique. The scalars

a_1,\ldots,a_n

are called the coordinates of on the basis. They are also said to be the coefficients of the decomposition of on the basis. One also says that the -tuple of the coordinates is the coordinate vector of on the basis, since the set

Fⁿ

of the -tuples of elements of is a vector space for componentwise addition and scalar multiplication, whose dimension is .

The one-to-one correspondence between vectors and their coordinate vectors maps vector addition to vector addition and scalar multiplication to scalar multiplication. It is thus a vector space isomorphism, which allows translating reasonings and computations on vectors into reasonings and computations on their coordinates.

History

Vector spaces stem from affine geometry, via the introduction of coordinates in the plane or three-dimensional space. Around 1636, French mathematicians René Descartes and Pierre de Fermat founded analytic geometry by identifying solutions to an equation of two variables with points on a plane curve. To achieve geometric solutions without using coordinates, Bolzano introduced, in 1804, certain operations on points, lines, and planes, which are predecessors of vectors. introduced the notion of barycentric coordinates. introduced an equivalence relation on directed line segments that share the same length and direction which he called equipollence. A Euclidean vector is then an equivalence class of that relation.

Vectors were reconsidered with the presentation of complex numbers by Argand and Hamilton and the inception of quaternions by the latter. They are elements in R² and R⁴; treating them using linear combinations goes back to Laguerre in 1867, who also defined systems of linear equations.

In 1857, Cayley introduced the matrix notation which allows for harmonization and simplification of linear maps. Around the same time, Grassmann studied the barycentric calculus initiated by Möbius. He envisaged sets of abstract objects endowed with operations. In his work, the concepts of linear independence and dimension, as well as scalar products are present. Grassmann's 1844 work exceeds the framework of vector spaces as well since his considering multiplication led him to what are today called algebras. Italian mathematician Peano was the first to give the modern definition of vector spaces and linear maps in 1888, although he called them "linear systems". Peano's axiomatization allowed for vector spaces with infinite dimension, but Peano did not develop that theory further. In 1897, Salvatore Pincherle adopted Peano's axioms and made initial inroads into the theory of infinite-dimensional vector spaces.

An important development of vector spaces is due to the construction of function spaces by Henri Lebesgue. This was later formalized by Banach and Hilbert, around 1920. At that time, algebra and the new field of functional analysis began to interact, notably with key concepts such as spaces of p-integrable functions and Hilbert spaces.

Examples

See main article: Examples of vector spaces.

Arrows in the plane

The first example of a vector space consists of arrows in a fixed plane, starting at one fixed point. This is used in physics to describe forces or velocities. Given any two such arrows, and, the parallelogram spanned by these two arrows contains one diagonal arrow that starts at the origin, too. This new arrow is called the sum of the two arrows, and is denoted . In the special case of two arrows on the same line, their sum is the arrow on this line whose length is the sum or the difference of the lengths, depending on whether the arrows have the same direction. Another operation that can be done with arrows is scaling: given any positive real number, the arrow that has the same direction as, but is dilated or shrunk by multiplying its length by, is called multiplication of by . It is denoted . When is negative, is defined as the arrow pointing in the opposite direction instead.

The following shows a few examples: if, the resulting vector has the same direction as, but is stretched to the double length of (the second image). Equivalently, is the sum . Moreover, has the opposite direction and the same length as (blue vector pointing down in the second image).

Ordered pairs of numbers

A second key example of a vector space is provided by pairs of real numbers and . The order of the components and is significant, so such a pair is also called an ordered pair. Such a pair is written as . The sum of two such pairs and the multiplication of a pair with a number is defined as follows: $\begin (x_1, y_1) + (x_2, y_2) &= (x_1 + x_2, y_1 + y_2), \\ a(x, y) &= (ax, ay).\end$

The first example above reduces to this example if an arrow is represented by a pair of Cartesian coordinates of its endpoint.

Coordinate space

The simplest example of a vector space over a field is the field itself with its addition viewed as vector addition and its multiplication viewed as scalar multiplication. More generally, all -tuples (sequences of length) $(a_1, a_2, \dots, a_n)$ of elements of form a vector space that is usually denoted and called a coordinate space. The case is the above-mentioned simplest example, in which the field is also regarded as a vector space over itself. The case and (so R²) reduces to the previous example.

Complex numbers and other field extensions

The set of complex numbers, numbers that can be written in the form for real numbers and where is the imaginary unit, form a vector space over the reals with the usual addition and multiplication: and for real numbers,,, and . The various axioms of a vector space follow from the fact that the same rules hold for complex number arithmetic. The example of complex numbers is essentially the same as (that is, it is isomorphic to) the vector space of ordered pairs of real numbers mentioned above: if we think of the complex number as representing the ordered pair in the complex plane then we see that the rules for addition and scalar multiplication correspond exactly to those in the earlier example.

More generally, field extensions provide another class of examples of vector spaces, particularly in algebra and algebraic number theory: a field containing a smaller field is an -vector space, by the given multiplication and addition operations of . For example, the complex numbers are a vector space over, and the field extension

Q(i\sqrt{5})

is a vector space over .

Function spaces

See main article: Function space.

Functions from any fixed set to a field also form vector spaces, by performing addition and scalar multiplication pointwise. That is, the sum of two functions and is the function

(f+g)

given by

(f + g)(w) = f(w) + g(w),

and similarly for multiplication. Such function spaces occur in many geometric situations, when is the real line or an interval, or other subsets of . Many notions in topology and analysis, such as continuity, integrability or differentiability are well-behaved with respect to linearity: sums and scalar multiples of functions possessing such a property still have that property. Therefore, the set of such functions are vector spaces, whose study belongs to functional analysis.

Linear equations

See main article: Linear equation, Linear differential equation and Systems of linear equations. Systems of homogeneous linear equations are closely tied to vector spaces. For example, the solutions of $\begin && a \,&&+\, 3 b \,&\, + &\, & c & \,= 0 \\4 && a \,&&+\, 2 b \,&\, + &\, 2 & c & \,= 0 \\\end$ are given by triples with arbitrary

b=a/2,

and

c=-5a/2.

They form a vector space: sums and scalar multiples of such triples still satisfy the same ratios of the three variables; thus they are solutions, too. Matrices can be used to condense multiple linear equations as above into one vector equation, namely

$A \mathbf = \mathbf,$

where

A=\begin{bmatrix} 1&3&1\\ 4&2&2\end{bmatrix}

is the matrix containing the coefficients of the given equations,

is the vector

(a,b,c),

denotes the matrix product, and

0=(0,0)

is the zero vector. In a similar vein, the solutions of homogeneous linear differential equations form vector spaces. For example,

$f^(x) + 2 f^\prime(x) + f(x) = 0$

yields

f(x)=ae^-x+bxe^-x,

where

and

are arbitrary constants, and

e^x

is the natural exponential function.

Linear maps and matrices

See main article: Linear map. The relation of two vector spaces can be expressed by linear map or linear transformation. They are functions that reflect the vector space structure, that is, they preserve sums and scalar multiplication: $\begin f(\mathbf + \mathbf) &= f(\mathbf) + f(\mathbf), \\ f(a \cdot \mathbf) &= a \cdot f(\mathbf)\end$ for all

and

all

An isomorphism is a linear map such that there exists an inverse map, which is a map such that the two possible compositions and are identity maps. Equivalently, is both one-to-one (injective) and onto (surjective). If there exists an isomorphism between and, the two spaces are said to be isomorphic; they are then essentially identical as vector spaces, since all identities holding in are, via, transported to similar ones in, and vice versa via .

For example, the arrows in the plane and the ordered pairs of numbers vector spaces in the introduction above (see) are isomorphic: a planar arrow departing at the origin of some (fixed) coordinate system can be expressed as an ordered pair by considering the - and -component of the arrow, as shown in the image at the right. Conversely, given a pair, the arrow going by to the right (or to the left, if is negative), and up (down, if is negative) turns back the arrow .

Linear maps between two vector spaces form a vector space, also denoted, or . The space of linear maps from to is called the dual vector space, denoted . Via the injective natural map, any vector space can be embedded into its bidual; the map is an isomorphism if and only if the space is finite-dimensional.

Once a basis of is chosen, linear maps are completely determined by specifying the images of the basis vectors, because any element of is expressed uniquely as a linear combination of them. If, a 1-to-1 correspondence between fixed bases of and gives rise to a linear map that maps any basis element of to the corresponding basis element of . It is an isomorphism, by its very definition. Therefore, two vector spaces over a given field are isomorphic if their dimensions agree and vice versa. Another way to express this is that any vector space over a given field is completely classified (up to isomorphism) by its dimension, a single number. In particular, any n-dimensional -vector space is isomorphic to . However, there is no "canonical" or preferred isomorphism; an isomorphism is equivalent to the choice of a basis of, by mapping the standard basis of to, via .

Matrices

See main article: Matrix and Determinant. Matrices are a useful notion to encode linear maps. They are written as a rectangular array of scalars as in the image at the right. Any -by- matrix

gives rise to a linear map from to, by the following

\mathbf x = (x_1, x_2, \ldots, x_n) \mapsto \left(\sum_^n a_x_j, \sum_^n a_x_j, \ldots, \sum_^n a_x_j \right),

where

\sum

denotes summation, or by using the matrix multiplication of the matrix

with the coordinate vector

$\mathbf \mapsto A \mathbf.$

Moreover, after choosing bases of and, any linear map is uniquely represented by a matrix via this assignment.

The determinant of a square matrix is a scalar that tells whether the associated map is an isomorphism or not: to be so it is sufficient and necessary that the determinant is nonzero. The linear transformation of corresponding to a real n-by-n matrix is orientation preserving if and only if its determinant is positive.

Eigenvalues and eigenvectors

See main article: Eigenvalues and eigenvectors. Endomorphisms, linear maps, are particularly important since in this case vectors can be compared with their image under, . Any nonzero vector satisfying, where is a scalar, is called an eigenvector of with eigenvalue . Equivalently, is an element of the kernel of the difference (where Id is the identity map . If is finite-dimensional, this can be rephrased using determinants: having eigenvalue is equivalent to $\det(f - \lambda \cdot \operatorname) = 0.$ By spelling out the definition of the determinant, the expression on the left hand side can be seen to be a polynomial function in, called the characteristic polynomial of . If the field is large enough to contain a zero of this polynomial (which automatically happens for algebraically closed, such as) any linear map has at least one eigenvector. The vector space may or may not possess an eigenbasis, a basis consisting of eigenvectors. This phenomenon is governed by the Jordan canonical form of the map. The set of all eigenvectors corresponding to a particular eigenvalue of forms a vector space known as the eigenspace corresponding to the eigenvalue (and) in question.

Basic constructions

In addition to the above concrete examples, there are a number of standard linear algebraic constructions that yield vector spaces related to given ones.

Subspaces and quotient spaces

See main article: Linear subspace and Quotient vector space.

of a vector space

that is closed under addition and scalar multiplication (and therefore contains the

-vector of

) is called a linear subspace of

, or simply a subspace of

, when the ambient space is unambiguously a vector space.^[4] Subspaces of

are vector spaces (over the same field) in their own right. The intersection of all subspaces containing a given set

of vectors is called its span, and it is the smallest subspace of

containing the set

. Expressed in terms of elements, the span is the subspace consisting of all the linear combinations of elements of

Linear subspace of dimension 1 and 2 are referred to as a line (also vector line), and a plane respectively. If W is an n-dimensional vector space, any subspace of dimension 1 less, i.e., of dimension

n-1

is called a hyperplane.

The counterpart to subspaces are quotient vector spaces. Given any subspace

W\subseteqV

, the quotient space

V/W

modulo

") is defined as follows: as a set, it consists of

\mathbf + W = \,

where

is an arbitrary vector in

. The sum of two such elements

v₁+W

and

v₂+W

\left(v₁+v_2\right)+W

, and scalar multiplication is given by

a ⋅ (v+W)=(a ⋅ v)+W

. The key point in this definition is that

v₁+W=v₂+W

if and only if the difference of

v₁

and

v₂

lies in

.^[5] This way, the quotient space "forgets" information that is contained in the subspace

\ker(f)

of a linear map

f:V\toW

consists of vectors

that are mapped to

. The kernel and the image

\operatorname{im}(f)=\{f(v):v\inV\}

are subspaces of

and

, respectively.

An important example is the kernel of a linear map

x\mapstoAx

for some fixed matrix

. The kernel of this map is the subspace of vectors

such that

Ax=0

, which is precisely the set of solutions to the system of homogeneous linear equations belonging to

. This concept also extends to linear differential equations

a_0 f + a_1 \frac + a_2 \frac + \cdots + a_n \frac = 0,

where the coefficients

a_i

are functions in

too.In the corresponding map

f \mapsto D(f) = \sum_^n a_i \frac,

the derivatives of the function

appear linearly (as opposed to

f^\prime\prime(x)²

, for example). Since differentiation is a linear procedure (that is,

(f+g)^\prime=f^\prime+g^\prime

and

(c ⋅ f)^\prime=c ⋅ f^\prime

for a constant

) this assignment is linear, called a linear differential operator. In particular, the solutions to the differential equation

D(f)=0

form a vector space (over or).

The existence of kernels and images is part of the statement that the category of vector spaces (over a fixed field

) is an abelian category, that is, a corpus of mathematical objects and structure-preserving maps between them (a category) that behaves much like the category of abelian groups. Because of this, many statements such as the first isomorphism theorem (also called rank–nullity theorem in matrix-related terms)

V / \ker(f) \; \equiv \; \operatorname(f)

and the second and third isomorphism theorem can be formulated and proven in a way very similar to the corresponding statements for groups.

Direct product and direct sum

See main article: Direct product and Direct sum of modules. The direct product of vector spaces and the direct sum of vector spaces are two ways of combining an indexed family of vector spaces into a new vector space.

The direct product

style{\prod_iV_i}

of a family of vector spaces

V_i

consists of the set of all tuples

\left(v_i\right)_i

, which specify for each index

in some index set

an element

v_i

V_i

. Addition and scalar multiplication is performed componentwise. A variant of this construction is the direct sum

\bigoplus_ V_i

(also called coproduct and denoted

\coprod_V_i

), where only tuples with finitely many nonzero vectors are allowed. If the index set

is finite, the two constructions agree, but in general they are different.

Tensor product

The tensor product

V ⊗ _FW,

or simply

V ⊗ W,

of two vector spaces

and

is one of the central notions of multilinear algebra which deals with extending notions such as linear maps to several variables. A map

g:V x W\toX

from the Cartesian product

V x W

is called bilinear if

is linear in both variables

and

That is to say, for fixed

the map

v\mapstog(v,w)

is linear in the sense above and likewise for fixed

The tensor product is a particular vector space that is a universal recipient of bilinear maps

as follows. It is defined as the vector space consisting of finite (formal) sums of symbols called tensors

\mathbf_1 \otimes \mathbf_1 + \mathbf_2 \otimes \mathbf_2 + \cdots + \mathbf_n \otimes \mathbf_n,

subject to the rules

\begina \cdot (\mathbf \otimes \mathbf) ~&=~ (a \cdot \mathbf) \otimes \mathbf ~=~ \mathbf \otimes (a \cdot \mathbf), && ~~\text a \text \\(\mathbf_1 + \mathbf_2) \otimes \mathbf ~&=~ \mathbf_1 \otimes \mathbf + \mathbf_2 \otimes \mathbf && \\\mathbf \otimes (\mathbf_1 + \mathbf_2) ~&=~ \mathbf \otimes \mathbf_1 + \mathbf \otimes \mathbf_2. && \\\end

These rules ensure that the map

from the

V x W

V ⊗ W

that maps a tuple

(v,w)

v ⊗ w

is bilinear. The universality states that given any vector space

and any bilinear map

g:V x W\toX,

there exists a unique map

shown in the diagram with a dotted arrow, whose composition with

equals

u(v ⊗ w)=g(v,w).

^[6] This is called the universal property of the tensor product, an instance of the method—much used in advanced abstract algebra—to indirectly define objects by specifying maps from or to this object.

Vector spaces with additional structure

From the point of view of linear algebra, vector spaces are completely understood insofar as any vector space over a given field is characterized, up to isomorphism, by its dimension. However, vector spaces per se do not offer a framework to deal with the question—crucial to analysis—whether a sequence of functions converges to another function. Likewise, linear algebra is not adapted to deal with infinite series, since the addition operation allows only finitely many terms to be added.

Therefore, the needs of functional analysis require considering additional structures.

\leq,

under which some vectors can be compared. For example,

-dimensional real space

Rⁿ

can be ordered by comparing its vectors componentwise. Ordered vector spaces, for example Riesz spaces, are fundamental to Lebesgue integration, which relies on the ability to express a function as a difference of two positive functions

f = f^+ - f^-.

where

f⁺

denotes the positive part of

and

f^-

the negative part.

Normed vector spaces and inner product spaces

See main article: Normed vector space and Inner product space. "Measuring" vectors is done by specifying a norm, a datum which measures lengths of vectors, or by an inner product, which measures angles between vectors. Norms and inner products are denoted

|v|

and respectively. The datum of an inner product entails that lengths of vectors can be defined too, by defining the associated norm Vector spaces endowed with such data are known as normed vector spaces and inner product spaces, respectively.

Coordinate space

Fⁿ

can be equipped with the standard dot product:

\lang \mathbf x, \mathbf y \rang = \mathbf x \cdot \mathbf y = x_1 y_1 + \cdots + x_n y_n.

R^2,

this reflects the common notion of the angle between two vectors

and

by the law of cosines:

\mathbf x \cdot \mathbf y = \cos\left(\angle (\mathbf x, \mathbf y)\right) \cdot |\mathbf x| \cdot |\mathbf y|.

Because of this, two vectors satisfying

\langx,y\rang=0

are called orthogonal. An important variant of the standard dot product is used in Minkowski space:

R⁴

endowed with the Lorentz product

\lang \mathbf x | \mathbf y \rang = x_1 y_1 + x_2 y_2 + x_3 y_3 - x_4 y_4.

In contrast to the standard dot product, it is not positive definite:

\langx|x\rang

also takes negative values, for example, for

x=(0,0,0,1).

Singling out the fourth coordinate—corresponding to time, as opposed to three space-dimensions—makes it useful for the mathematical treatment of special relativity. Note that in other conventions time is often written as the first, or "zeroeth" component so that the Lorentz product is written

\lang \mathbf x | \mathbf y \rang = - x_0 y_0+x_1 y_1 + x_2 y_2 + x_3 y_3.

Topological vector spaces

See main article: Topological vector space. Convergence questions are treated by considering vector spaces

carrying a compatible topology, a structure that allows one to talk about elements being close to each other. Compatible here means that addition and scalar multiplication have to be continuous maps. Roughly, if

and

, and

vary by a bounded amount, then so do

x+y

and

ax.

^[7] To make sense of specifying the amount a scalar changes, the field

also has to carry a topology in this context; a common choice is the reals or the complex numbers.

In such topological vector spaces one can consider series of vectors. The infinite sum $\sum_^\infty f_i ~=~ \lim_ f_1 + \cdots + f_n$ denotes the limit of the corresponding finite partial sums of the sequence

f_1,f_2,\ldots

of elements of

For example, the

f_i

could be (real or complex) functions belonging to some function space

in which case the series is a function series. The mode of convergence of the series depends on the topology imposed on the function space. In such cases, pointwise convergence and uniform convergence are two prominent examples.

A way to ensure the existence of limits of certain infinite series is to restrict attention to spaces where any Cauchy sequence has a limit; such a vector space is called complete. Roughly, a vector space is complete provided that it contains all necessary limits. For example, the vector space of polynomials on the unit interval

[0,1],

equipped with the topology of uniform convergence is not complete because any continuous function on

[0,1]

can be uniformly approximated by a sequence of polynomials, by the Weierstrass approximation theorem. In contrast, the space of all continuous functions on

[0,1]

with the same topology is complete. A norm gives rise to a topology by defining that a sequence of vectors

v_n

converges to

if and only if

\lim_ |\mathbf v_n - \mathbf v| = 0.

Banach and Hilbert spaces are complete topological vector spaces whose topologies are given, respectively, by a norm and an inner product. Their study—a key piece of functional analysis—focuses on infinite-dimensional vector spaces, since all norms on finite-dimensional topological vector spaces give rise to the same notion of convergence. The image at the right shows the equivalence of the

-norm and

infty

-norm on

R^2:

as the unit "balls" enclose each other, a sequence converges to zero in one norm if and only if it so does in the other norm. In the infinite-dimensional case, however, there will generally be inequivalent topologies, which makes the study of topological vector spaces richer than that of vector spaces without additional data.

From a conceptual point of view, all notions related to topological vector spaces should match the topology. For example, instead of considering all linear maps (also called functionals)

V\toW,

maps between topological vector spaces are required to be continuous. In particular, the (topological) dual space

V^*

consists of continuous functionals

V\toR

(or to

). The fundamental Hahn–Banach theorem is concerned with separating subspaces of appropriate topological vector spaces by continuous functionals.

Banach spaces

See main article: Banach space. Banach spaces, introduced by Stefan Banach, are complete normed vector spaces.

A first example is the vector space

\ell^p

consisting of infinite vectors with real entries

x=\left(x_1,x_2,\ldots,x_n,\ldots\right)

whose

-norm

(1\leqp\leqinfty)

given by

\|\mathbf\|_\infty := \sup_i |x_i| \qquad \text p = \infty, \text

\|\mathbf\|_p := \left(\sum_i |x_i|^p\right)^\frac \qquad \text p < \infty.

The topologies on the infinite-dimensional space

\ell^p

are inequivalent for different

For example, the sequence of vectors

x_n=\left(2^-n,2^-n,\ldots,2^-n,0,0,\ldots\right),

in which the first

2ⁿ

components are

2^-n

and the following ones are

converges to the zero vector for

p=infty,

but does not for

p=1:

\|\mathbf_n\|_\infty = \sup (2^, 0) = 2^ \to 0,

but

\|\mathbf_n\|_1 = \sum_^ 2^ = 2^n \cdot 2^ = 1.

More generally than sequences of real numbers, functions

f:\Omega\to\Reals

are endowed with a norm that replaces the above sum by the Lebesgue integral

\|f\|_p := \left(\int_ |f(x)|^p \, \right)^\frac.

\Omega

(for example an interval) satisfying

\|f\|_p<infty,

and equipped with this norm are called Lebesgue spaces, denoted

L^p(\Omega).

^[8]

These spaces are complete. (If one uses the Riemann integral instead, the space is complete, which may be seen as a justification for Lebesgue's integration theory.^[9]) Concretely this means that for any sequence of Lebesgue-integrable functions

f_1,f_2,\ldots,f_n,\ldots

with

\|f_n\|_p<infty,

satisfying the condition

\lim_ \int_ \left|f_k(x) - f_n(x)\right|^p \, = 0

there exists a function

f(x)

belonging to the vector space

L^p(\Omega)

such that

\lim_ \int_ \left|f(x) - f_k(x)\right|^p \, = 0.

Imposing boundedness conditions not only on the function, but also on its derivatives leads to Sobolev spaces.

Hilbert spaces

See main article: Hilbert space. Complete inner product spaces are known as Hilbert spaces, in honor of David Hilbert. The Hilbert space

L^2(\Omega),

with inner product given by

\langle f\, \ g \rangle = \int_\Omega f(x) \overline \, dx,

where

\overline{g(x)}

denotes the complex conjugate of

g(x),

^[10] is a key case.

By definition, in a Hilbert space, any Cauchy sequence converges to a limit. Conversely, finding a sequence of functions

f_n

with desirable properties that approximate a given limit function is equally crucial. Early analysis, in the guise of the Taylor approximation, established an approximation of differentiable functions

by polynomials. By the Stone–Weierstrass theorem, every continuous function on

[a,b]

can be approximated as closely as desired by a polynomial. A similar approximation technique by trigonometric functions is commonly called Fourier expansion, and is much applied in engineering. More generally, and more conceptually, the theorem yields a simple description of what "basic functions", or, in abstract Hilbert spaces, what basic vectors suffice to generate a Hilbert space

in the sense that the closure of their span (that is, finite linear combinations and limits of those) is the whole space. Such a set of functions is called a basis of

its cardinality is known as the Hilbert space dimension.^[11] Not only does the theorem exhibit suitable basis functions as sufficient for approximation purposes, but also together with the Gram–Schmidt process, it enables one to construct a basis of orthogonal vectors. Such orthogonal bases are the Hilbert space generalization of the coordinate axes in finite-dimensional Euclidean space.

The solutions to various differential equations can be interpreted in terms of Hilbert spaces. For example, a great many fields in physics and engineering lead to such equations, and frequently solutions with particular physical properties are used as basis functions, often orthogonal. As an example from physics, the time-dependent Schrödinger equation in quantum mechanics describes the change of physical properties in time by means of a partial differential equation, whose solutions are called wavefunctions. Definite values for physical properties such as energy, or momentum, correspond to eigenvalues of a certain (linear) differential operator and the associated wavefunctions are called eigenstates. The

spectral theorem decomposes a linear compact operator acting on functions in terms of these eigenfunctions and their eigenvalues.

Algebras over fields

See main article: Algebra over a field and Lie algebra. General vector spaces do not possess a multiplication between vectors. A vector space equipped with an additional bilinear operator defining the multiplication of two vectors is an algebra over a field (or F-algebra if the field F is specified).

For example, the set of all polynomials

p(t)

forms an algebra known as the polynomial ring: using that the sum of two polynomials is a polynomial, they form a vector space; they form an algebra since the product of two polynomials is again a polynomial. Rings of polynomials (in several variables) and their quotients form the basis of algebraic geometry, because they are rings of functions of algebraic geometric objects.

Another crucial example are Lie algebras, which are neither commutative nor associative, but the failure to be so is limited by the constraints (

[x,y]

denotes the product of

and

[x,y]=-[y,x]

(anticommutativity), and

[x,[y,z]]+[y,[z,x]]+[z,[x,y]]=0

(Jacobi identity).Examples include the vector space of

-by-

matrices, with

[x,y]=xy-yx,

the commutator of two matrices, and

R^3,

endowed with the cross product.

\operatorname{T}(V)

is a formal way of adding products to any vector space

to obtain an algebra. As a vector space, it is spanned by symbols, called simple tensors

\mathbf_1 \otimes \mathbf_2 \otimes \cdots \otimes \mathbf_n,

where the degree

varies.The multiplication is given by concatenating such symbols, imposing the distributive law under addition, and requiring that scalar multiplication commute with the tensor product ⊗, much the same way as with the tensor product of two vector spaces introduced in the above section on tensor products. In general, there are no relations between

v₁ ⊗ v₂

and

v₂ ⊗ v_1.

Forcing two such elements to be equal leads to the symmetric algebra, whereas forcing

v₁ ⊗ v₂=-v₂ ⊗ v₁

yields the exterior algebra.

Related structures

Vector bundles

See main article: Vector bundle and Tangent bundle. A vector bundle is a family of vector spaces parametrized continuously by a topological space X. More precisely, a vector bundle over X is a topological space E equipped with a continuous map $\pi : E \to X$ such that for every x in X, the fiber π⁻¹(x) is a vector space. The case dim is called a line bundle. For any vector space V, the projection makes the product into a "trivial" vector bundle. Vector bundles over X are required to be locally a product of X and some (fixed) vector space V: for every x in X, there is a neighborhood U of x such that the restriction of π to π⁻¹(U) is isomorphic^[12] to the trivial bundle . Despite their locally trivial character, vector bundles may (depending on the shape of the underlying space X) be "twisted" in the large (that is, the bundle need not be (globally isomorphic to) the trivial bundle). For example, the Möbius strip can be seen as a line bundle over the circle S¹ (by identifying open intervals with the real line). It is, however, different from the cylinder, because the latter is orientable whereas the former is not.

Properties of certain vector bundles provide information about the underlying topological space. For example, the tangent bundle consists of the collection of tangent spaces parametrized by the points of a differentiable manifold. The tangent bundle of the circle S¹ is globally isomorphic to, since there is a global nonzero vector field on S¹.^[13] In contrast, by the hairy ball theorem, there is no (tangent) vector field on the 2-sphere S² which is everywhere nonzero. K-theory studies the isomorphism classes of all vector bundles over some topological space. In addition to deepening topological and geometrical insight, it has purely algebraic consequences, such as the classification of finite-dimensional real division algebras: R, C, the quaternions H and the octonions O.

The cotangent bundle of a differentiable manifold consists, at every point of the manifold, of the dual of the tangent space, the cotangent space. Sections of that bundle are known as differential one-forms.

Modules

See main article: Module. Modules are to rings what vector spaces are to fields: the same axioms, applied to a ring R instead of a field F, yield modules. The theory of modules, compared to that of vector spaces, is complicated by the presence of ring elements that do not have multiplicative inverses. For example, modules need not have bases, as the Z-module (that is, abelian group) Z/2Z shows; those modules that do (including all vector spaces) are known as free modules. Nevertheless, a vector space can be compactly defined as a module over a ring which is a field, with the elements being called vectors. Some authors use the term vector space to mean modules over a division ring. The algebro-geometric interpretation of commutative rings via their spectrum allows the development of concepts such as locally free modules, the algebraic counterpart to vector bundles.

Affine and projective spaces

See main article: Affine space and Projective space. Roughly, affine spaces are vector spaces whose origins are not specified. More precisely, an affine space is a set with a free transitive vector space action. In particular, a vector space is an affine space over itself, by the map $V \times V \to W, \; (\mathbf, \mathbf) \mapsto \mathbf + \mathbf.$ If W is a vector space, then an affine subspace is a subset of W obtained by translating a linear subspace V by a fixed vector ; this space is denoted by (it is a coset of V in W) and consists of all vectors of the form for An important example is the space of solutions of a system of inhomogeneous linear equations $A \mathbf = \mathbf$ generalizing the homogeneous case discussed in the above section on linear equations, which can be found by setting

b=0

in this equation. The space of solutions is the affine subspace where x is a particular solution of the equation, and V is the space of solutions of the homogeneous equation (the nullspace of A).

The set of one-dimensional subspaces of a fixed finite-dimensional vector space V is known as projective space; it may be used to formalize the idea of parallel lines intersecting at infinity. Grassmannians and flag manifolds generalize this by parametrizing linear subspaces of fixed dimension k and flags of subspaces, respectively.

References

Historical references

- - .
    - , reprint:
      - Peano, G. (1901) Formulario mathematico: vct axioms via Internet Archive

Further references

Notes and References

It is also common, especially in physics, to denote vectors with an arrow on top:
\vecv.

It is also common, especially in higher mathematics, to not use any typographical method for distinguishing vectors from other mathematical objects.
Scalar multiplication is not to be confused with the scalar product, which is an additional operation on some specific vector spaces, called inner product spaces. Scalar multiplication is the multiplication of a vector by a scalar that produces a vector, while the scalar product is a multiplication of two vectors that produces a scalar.
This axiom is not an associative property, since it refers to two different operations, scalar multiplication and field multiplication. So, it is independent from the associativity of field multiplication, which is assumed by field axioms.
This is typically the case when a vector space is also considered as an affine space. In this case, a linear subspace contains the zero vector, while an affine subspace does not necessarily contain it.
Some authors, such as, choose to start with this equivalence relation and derive the concrete shape of
V/W

from this.
, Th. 14.3. See also Yoneda lemma.
This requirement implies that the topology gives rise to a uniform structure,, loc = ch. II.
The triangle inequality for
\|f+g\|_p\leq\|f\|_p+\|g\|_p

is provided by the Minkowski inequality. For technical reasons, in the context of functions one has to identify functions that agree almost everywhere to get a norm, and not only a seminorm.
"Many functions in
L²

of Lebesgue measure, being unbounded, cannot be integrated with the classical Riemann integral. So spaces of Riemann integrable functions would not be complete in the
L²

norm, and the orthogonal decomposition would not apply to them. This shows one of the advantages of Lebesgue integration.",, §5.3, p. 125.
For
p ≠ 2,

L^p(\Omega)

is not a Hilbert space.
A basis of a Hilbert space is not the same thing as a basis of a linear algebra. For distinction, a linear algebra basis for a Hilbert space is called a Hamel basis.
That is, there is a homeomorphism from π⁻¹(U) to which restricts to linear isomorphisms between fibers.
A line bundle, such as the tangent bundle of S¹ is trivial if and only if there is a section that vanishes nowhere, see, Corollary 8.3. The sections of the tangent bundle are just vector fields.