Norm (mathematics) explained

In mathematics, a norm is a function from a real or complex vector space to the non-negative real numbers that behaves in certain ways like the distance from the origin: it commutes with scaling, obeys a form of the triangle inequality, and is zero only at the origin. In particular, the Euclidean distance in a Euclidean space is defined by a norm on the associated Euclidean vector space, called the Euclidean norm, the 2-norm, or, sometimes, the magnitude or length of the vector. This norm can be defined as the square root of the inner product of a vector with itself.

A seminorm satisfies the first two properties of a norm, but may be zero for vectors other than the origin.^[1] A vector space with a specified norm is called a normed vector space. In a similar manner, a vector space with a seminorm is called a seminormed vector space.

The term pseudonorm has been used for several related meanings. It may be a synonym of "seminorm".^[2] A pseudonorm may satisfy the same axioms as a norm, with the equality replaced by an inequality "

\leq

" in the homogeneity axiom.^[3] It can also refer to a norm that can take infinite values,^[4] or to certain functions parametrised by a directed set.^[5]

Definition

over a subfield

of the complex numbers

\Complex,

a norm on

is a real-valued function

p:X\to\Reals

with the following properties, where

|s|

denotes the usual absolute value of a scalar

:^[6]

Subadditivity/Triangle inequality:

p(x+y)\leqp(x)+p(y)

for all

x,y\inX.

Absolute homogeneity:

p(sx)=|s|p(x)

for all

x\inX

and all scalars

Positive definiteness/positiveness/: for all

x\inX,

p(x)=0

then

x=0.

- Because property (2.) implies

p(0)=0,

some authors replace property (3.) with the equivalent condition: for every

x\inX,

p(x)=0

if and only if

x=0.

A seminorm on

is a function

p:X\to\Reals

that has properties (1.) and (2.)^[7] so that in particular, every norm is also a seminorm (and thus also a sublinear functional). However, there exist seminorms that are not norms. Properties (1.) and (2.) imply that if

is a norm (or more generally, a seminorm) then

p(0)=0

and that

also has the following property:

Non-negativity:

p(x)\geq0

for all

x\inX.

Some authors include non-negativity as part of the definition of "norm", although this is not necessary.Although this article defined "" to be a synonym of "positive definite", some authors instead define "" to be a synonym of "non-negative"; these definitions are not equivalent.

Equivalent norms

Suppose that

and

are two norms (or seminorms) on a vector space

Then

and

are called equivalent, if there exist two positive real constants

and

such that for every vector

x\inX,

c q(x) \leq p(x) \leq C q(x).

The relation "

is equivalent to

" is reflexive, symmetric (

cq\leqp\leqCq

implies

\tfrac{1}{C}p\leqq\leq\tfrac{1}{c}p

), and transitive and thus defines an equivalence relation on the set of all norms on

The norms

and

are equivalent if and only if they induce the same topology on

^[8] Any two norms on a finite-dimensional space are equivalent but this does not extend to infinite-dimensional spaces.^[8]

Notation

If a norm

p:X\to\R

is given on a vector space

then the norm of a vector

z\inX

is usually denoted by enclosing it within double vertical lines:

\|z\|=p(z).

Such notation is also sometimes used if

is only a seminorm. For the length of a vector in Euclidean space (which is an example of a norm, as explained below), the notation

|x|

with single vertical lines is also widespread.

Examples

Every (real or complex) vector space admits a norm: If

x_\bull=\left(x_i\right)_i

is a Hamel basis for a vector space

then the real-valued map that sends

x=\sum_is_ix_i\inX

(where all but finitely many of the scalars

s_i

are

) to

\sum_i\left|s_i\right|

is a norm on

There are also a large number of norms that exhibit additional properties that make them useful for specific problems.

Absolute-value norm

The absolute value

|x|

is a norm on the vector space formed by the real or complex numbers. The complex numbers form a one-dimensional vector space over themselves and a two-dimensional vector space over the reals; the absolute value is a norm for these two structures.

Any norm

on a one-dimensional vector space

is equivalent (up to scaling) to the absolute value norm, meaning that there is a norm-preserving isomorphism of vector spaces

f:F\toX,

where

is either

\Complex,

and norm-preserving means that

|x|=p(f(x)).

This isomorphism is given by sending

1\isinF

to a vector of norm

which exists since such a vector is obtained by multiplying any non-zero vector by the inverse of its norm.

Euclidean norm

On the

-dimensional Euclidean space

\R^n,

the intuitive notion of length of the vector

\boldsymbol{x}=\left(x_1,x_2,\ldots,x_n\right)

is captured by the formula^[9]

\|\boldsymbol\|_2 := \sqrt.

This is the Euclidean norm, which gives the ordinary distance from the origin to the point X—a consequence of the Pythagorean theorem.This operation may also be referred to as "SRSS", which is an acronym for the square root of the sum of squares.^[10]

The Euclidean norm is by far the most commonly used norm on

\R^n,

but there are other norms on this vector space as will be shown below.However, all these norms are equivalent in the sense that they all define the same topology on finite-dimensional spaces.

The inner product of two vectors of a Euclidean vector space is the dot product of their coordinate vectors over an orthonormal basis.Hence, the Euclidean norm can be written in a coordinate-free way as $\|\boldsymbol\| := \sqrt.$

The Euclidean norm is also called the quadratic norm,

L²

norm,^[11]

\ell²

norm, 2-norm, or square norm; see
L^p

space.It defines a distance function called the Euclidean length,

L²

distance, or

\ell²

distance.

The set of vectors in

\Rⁿ⁺¹

whose Euclidean norm is a given positive constant forms an

-sphere.

Euclidean norm of complex numbers

\R^2.

This identification of the complex number

x+iy

as a vector in the Euclidean plane, makes the quantity

\sqrt

(as first suggested by Euler) the Euclidean norm associated with the complex number. For

z=x+iy

, the norm can also be written as

\sqrt{\barzz}

where

\barz

is the complex conjugate of

Quaternions and octonions

Finite-dimensional complex normed spaces

On an

-dimensional complex space

\Complex^n,

the most common norm is

\|\boldsymbol\| := \sqrt = \sqrt.

In this case, the norm can be expressed as the square root of the inner product of the vector and itself: $\|\boldsymbol\| := \sqrt,$ where

\boldsymbol{x}

is represented as a column vector

\begin{bmatrix}x₁ x₂ ... x_n\end{bmatrix}^\rm

and

\boldsymbol{x}^H

denotes its conjugate transpose.

This formula is valid for any inner product space, including Euclidean and complex spaces. For complex spaces, the inner product is equivalent to the complex dot product. Hence the formula in this case can also be written using the following notation: $\|\boldsymbol\| := \sqrt.$

Taxicab norm or Manhattan norm

See main article: Taxicab geometry.

$\|\boldsymbol\|_1 := \sum_^n \left|x_i\right|.$ The name relates to the distance a taxi has to drive in a rectangular street grid (like that of the New York borough of Manhattan) to get from the origin to the point

The set of vectors whose 1-norm is a given constant forms the surface of a cross polytope, which has dimension equal to the dimension of the vector space minus 1.The Taxicab norm is also called the

\ell¹

norm. The distance derived from this norm is called the Manhattan distance or

\ell¹

distance.

The 1-norm is simply the sum of the absolute values of the columns.

In contrast, $\sum_^n x_i$ is not a norm because it may yield negative results.

p-norm

See main article: L<sup>p</sup> space.

Let

p\geq1

be a real number.The

-norm (also called

\ell^p

-norm) of vector

x=(x_1,\ldots,x_n)

\|\mathbf\|_p := \left(\sum_^n \left|x_i\right|^p\right)^.

For

p=1,

we get the taxicab norm, for

p=2

we get the Euclidean norm, and as

approaches

infty

the

-norm approaches the infinity norm or maximum norm:

\|\mathbf\|_\infty := \max_i \left|x_i\right|.

The

-norm is related to the generalized mean or power mean.

For

p=2,

the

\| ⋅ \|₂

-norm is even induced by a canonical inner product

\langle ⋅ , ⋅ \rangle,

meaning that

\|\mathbf\|_2 = \sqrt

for all vectors

This inner product can be expressed in terms of the norm by using the polarization identity.On

\ell^2,

this inner product is the defined by

\langle \left(x_n\right)_, \left(y_n\right)_ \rangle_ ~=~ \sum_n \overline y_n

while for the space

L^2(X,\mu)

associated with a measure space

(X,\Sigma,\mu),

which consists of all square-integrable functions, this inner product is

\langle f, g \rangle_ = \int_X \overline g(x)\, \mathrm dx.

This definition is still of some interest for

0<p<1,

but the resulting function does not define a norm,^[12] because it violates the triangle inequality.What is true for this case of

0<p<1,

even in the measurable analog, is that the corresponding

L^p

class is a vector space, and it is also true that the function

\int_X |f(x) - g(x)|^p ~ \mathrm d \mu

(without

th root) defines a distance that makes

L^p(X)

into a complete metric topological vector space. These spaces are of great interest in functional analysis, probability theory and harmonic analysis.However, aside from trivial cases, this topological vector space is not locally convex, and has no continuous non-zero linear forms. Thus the topological dual space contains only the zero functional.

The partial derivative of the

-norm is given by

\frac \|\mathbf\|_p = \frac .

The derivative with respect to

therefore, is

\frac =\frac .

where

\circ

denotes Hadamard product and

| ⋅ |

is used for absolute value of each component of the vector.

For the special case of

p=2,

this becomes

\frac \|\mathbf\|_2 = \frac,

\frac \|\mathbf\|_2 = \frac.

Maximum norm (special case of: infinity norm, uniform norm, or supremum norm)

is some vector such that

x=(x_1,x_2,\ldots,x_n),

then:

\|\mathbf\|_\infty := \max \left(\left|x_1\right|, \ldots, \left|x_n\right|\right).

The set of vectors whose infinity norm is a given constant,

forms the surface of a hypercube with edge length

2c.

Energy norm

The energy norm of a vector

\boldsymbol{x}=\left(x_1,x_2,\ldots,x_n\right)\in\Rⁿ

is defined in terms of a symmetric positive definite matrix

A\in\Rⁿ

$_ := \sqrt.$

It is clear that if

is the identity matrix, this norm corresponds to the Euclidean norm. If

is diagonal, this norm is also called a weighted norm. The energy norm is induced by the inner product given by

\langle\boldsymbol{x},\boldsymbol{y}\rangle_A:=\boldsymbol{x}^T ⋅ A ⋅ \boldsymbol{x}

for

\boldsymbol{x},\boldsymbol{y}\in\Rⁿ

In general, the value of the norm is dependent on the spectrum of

: For a vector

\boldsymbol{x}

with a Euclidean norm of one, the value of

{\|\boldsymbol{x}\|}_A

is bounded from below and above by the smallest and largest absolute eigenvalues of

respectively, where the bounds are achieved if

\boldsymbol{x}

coincides with the corresponding (normalized) eigenvectors. Based on the symmetric matrix square root

A^1/2

, the energy norm of a vector can be written in terms of the standard Euclidean norm as

$_ = _.$

Zero norm

In probability and functional analysis, the zero norm induces a complete metric topology for the space of measurable functions and for the F-space of sequences with F–norm $(x_n) \mapsto \sum_n.$ Here we mean by F-norm some real-valued function

\lVert ⋅ \rVert

on an F-space with distance

such that

\lVertx\rVert=d(x,0).

The F-norm described above is not a norm in the usual sense because it lacks the required homogeneity property.

Hamming distance of a vector from zero

Infinite dimensions

The generalization of the above norms to an infinite number of components leads to

\ell^p

and

L^p

spaces for

p\ge1,

with norms

$\|x\|_p = \bigg(\sum_ \left|x_i\right|^p\bigg)^ \text\ \|f\|_ = \bigg(\int_X |f(x)|^p ~ \mathrm d x\bigg)^$

for complex-valued sequences and functions on

X\sube\Rⁿ

respectively, which can be further generalized (see Haar measure). These norms are also valid in the limit as

p → +infty

, giving a supremum norm, and are called

\ell^infty

and

L^infty.

Any inner product induces in a natural way the norm $\|x\| := \sqrt.$

Other examples of infinite-dimensional normed vector spaces can be found in the Banach space article.

Generally, these norms do not give the same topologies. For example, an infinite-dimensional

\ell^p

space gives a strictly finer topology than an infinite-dimensional

\ell^q

space when

p<q.

Composite norms

Other norms on

\Rⁿ

can be constructed by combining the above; for example

\|x\| := 2 \left|x_1\right| + \sqrt

is a norm on

\R^4.

we can define a new norm of

equal to

\|A x\|.

In 2D, with

a rotation by 45° and a suitable scaling, this changes the taxicab norm into the maximum norm. Each

applied to the taxicab norm, up to inversion and interchanging of axes, gives a different unit ball: a parallelogram of a particular shape, size, and orientation.

In 3D, this is similar but different for the 1-norm (octahedrons) and the maximum norm (prisms with parallelogram base).

There are examples of norms that are not defined by "entrywise" formulas. For instance, the Minkowski functional of a centrally-symmetric convex body in

\Rⁿ

(centered at zero) defines a norm on

\Rⁿ

(see below).

All the above formulas also yield norms on

\Complexⁿ

without modification.

There are also norms on spaces of matrices (with real or complex entries), the so-called matrix norms.

In abstract algebra

See main article: Field norm.

Let

be a finite extension of a field

of inseparable degree

p^\mu,

and let

have algebraic closure

If the distinct embeddings of

are

\left\{\sigma_j\right\}_j,

then the Galois-theoretic norm of an element

\alpha\inE

is the value

\left(\prod_j \right)^.

As that function is homogeneous of degree

[E:k]

, the Galois-theoretic norm is not a norm in the sense of this article. However, the

[E:k]

-th root of the norm (assuming that concept makes sense) is a norm.^[13]

Composition algebras

The concept of norm

N(z)

in composition algebras does share the usual properties of a norm since null vectors are allowed. A composition algebra

(A,{}^*,N)

consists of an algebra over a field

an involution

{}^*,

and a quadratic form

N(z)=zz^*

called the "norm".

The characteristic feature of composition algebras is the homomorphism property of

: for the product

of two elements

and

of the composition algebra, its norm satisfies

N(wz)=N(w)N(z).

In the case of division algebras

\R,

\Complex,

and

the composition algebra norm is the square of the norm discussed above. In those cases the norm is a definite quadratic form. In the split algebras the norm is an isotropic quadratic form.

Properties

For any norm

p:X\to\R

on a vector space

the reverse triangle inequality holds:

p(x \pm y) \geq |p(x) - p(y)| \text x, y \in X.

u:X\toY

is a continuous linear map between normed spaces, then the norm of

and the norm of the transpose of

are equal.

For the

L^p

norms, we have Hölder's inequality^[14]

|\langle x, y \rangle| \leq \|x\|_p \|y\|_q \qquad \frac + \frac = 1.

A special case of this is the Cauchy–Schwarz inequality:

\left|\langle x, y \rangle\right| \leq \|x\|_2 \|y\|_2.

Every norm is a seminorm and thus satisfies all properties of the latter. In turn, every seminorm is a sublinear function and thus satisfies all properties of the latter. In particular, every norm is a convex function.

Equivalence

The concept of unit circle (the set of all vectors of norm 1) is different in different norms: for the 1-norm, the unit circle is a square oriented as a diamond; for the 2-norm (Euclidean norm), it is the well-known unit circle; while for the infinity norm, it is an axis-aligned square. For any

-norm, it is a superellipse with congruent axes (see the accompanying illustration). Due to the definition of the norm, the unit circle must be convex and centrally symmetric (therefore, for example, the unit ball may be a rectangle but cannot be a triangle, and

p\geq1

for a

-norm).

In terms of the vector space, the seminorm defines a topology on the space, and this is a Hausdorff topology precisely when the seminorm can distinguish between distinct vectors, which is again equivalent to the seminorm being a norm. The topology thus defined (by either a norm or a seminorm) can be understood either in terms of sequences or open sets. A sequence of vectors

\{v_n\}

is said to converge in norm to

\left\|v_n-v\right\|\to0

n\toinfty.

Equivalently, the topology consists of all sets that can be represented as a union of open balls. If

(X,\| ⋅ \|)

is a normed space then

\|x-y\|=\|x-z\|+\|z-y\|forallx,y\inXandz\in[x,y].

Two norms

\| ⋅ \|_\alpha

and

\| ⋅ \|_\beta

on a vector space

are called if they induce the same topology,^[15] which happens if and only if there exist positive real numbers

and

such that for all

x\inX

C \|x\|_\alpha \leq \|x\|_\beta \leq D \|x\|_\alpha.

For instance, if

p>r\geq1

\Complex^n,

then^[16]

\|x\|_p \leq \|x\|_r \leq n^ \|x\|_p.

In particular, $\|x\|_2 \leq \|x\|_1 \leq \sqrt \|x\|_2$ $\|x\|_\infty \leq \|x\|_2 \leq \sqrt \|x\|_\infty$ $\|x\|_\infty \leq \|x\|_1 \leq n \|x\|_\infty,$ That is, $\|x\|_\infty \leq \|x\|_2 \leq \|x\|_1 \leq \sqrt \|x\|_2 \leq n \|x\|_\infty.$ If the vector space is a finite-dimensional real or complex one, all norms are equivalent. On the other hand, in the case of infinite-dimensional vector spaces, not all norms are equivalent.

Equivalent norms define the same notions of continuity and convergence and for many purposes do not need to be distinguished. To be more precise the uniform structure defined by equivalent norms on the vector space is uniformly isomorphic.

Classification of seminorms: absolutely convex absorbing sets

See main article: Seminorm.

All seminorms on a vector space

can be classified in terms of absolutely convex absorbing subsets

To each such subset corresponds a seminorm

p_A

called the gauge of

defined as

inf

The converse is due to Andrey Kolmogorov: any locally convex and locally bounded topological vector space is normable. Precisely:

Norm (mathematics) explained

Definition

Equivalent norms

Notation

Examples

Absolute-value norm

Euclidean norm

Euclidean norm of complex numbers

Quaternions and octonions

Finite-dimensional complex normed spaces

Taxicab norm or Manhattan norm

p-norm

Maximum norm (special case of: infinity norm, uniform norm, or supremum norm)

Energy norm

Zero norm

Hamming distance of a vector from zero

Infinite dimensions

Composite norms

In abstract algebra

Composition algebras

Properties

Equivalence

Classification of seminorms: absolutely convex absorbing sets

Notes and References