Lp space explained

In mathematics, the spaces are function spaces defined using a natural generalization of the -norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue, although according to the Bourbaki group they were first introduced by Frigyes Riesz .

spaces form an important class of Banach spaces in functional analysis, and of topological vector spaces. Because of their key role in the mathematical analysis of measure and probability spaces, Lebesgue spaces are used also in the theoretical discussion of problems in physics, statistics, economics, finance, engineering, and other disciplines.

Applications

Statistics

In statistics, measures of central tendency and statistical dispersion, such as the mean, median, and standard deviation, can be defined in terms of

Lp

metrics, and measures of central tendency can be characterized as solutions to variational problems.

In penalized regression, "L1 penalty" and "L2 penalty" refer to penalizing either the

L1

norm of a solution's vector of parameter values (i.e. the sum of its absolute values), or its squared

L2

norm (its Euclidean length). Techniques which use an L1 penalty, like LASSO, encourage sparse solutions (where the many parameters are zero).[1] Elastic net regularization uses a penalty term that is a combination of the

L1

norm and the squared

L2

norm of the parameter vector.

Hausdorff–Young inequality

The Fourier transform for the real line (or, for periodic functions, see Fourier series), maps

Lp(\Reals)

to

Lq(\Reals)

(or

Lp(T)

to

\ellq

) respectively, where

1\leqp\leq2

and

\tfrac{1}{p}+\tfrac{1}{q}=1.

This is a consequence of the Riesz–Thorin interpolation theorem, and is made precise with the Hausdorff–Young inequality.

By contrast, if

p>2,

the Fourier transform does not map into

Lq.

Hilbert spaces

See also: Square-integrable function.

Hilbert spaces are central to many applications, from quantum mechanics to stochastic calculus. The spaces

L2

and

\ell2

are both Hilbert spaces. In fact, by choosing a Hilbert basis

E,

i.e., a maximal orthonormal subset of

L2

or any Hilbert space, one sees that every Hilbert space is isometrically isomorphic to

\ell2(E)

(same

E

as above), i.e., a Hilbert space of type

\ell2.

The -norm in finite dimensions

The Euclidean length of a vector

x=(x1,x2,...,xn)

in the

n

-dimensional real vector space

\Realsn

is given by the Euclidean norm:\|x\|_2 = \left(^2 + ^2 + \dotsb + ^2\right)^.

The Euclidean distance between two points

x

and

y

is the length

\|x-y\|2

of the straight line between the two points. In many situations, the Euclidean distance is appropriate for capturing the actual distances in a given space. In contrast, consider taxi drivers in a grid street plan who should measure distance not in terms of the length of the straight line to their destination, but in terms of the rectilinear distance, which takes into account that streets are either orthogonal or parallel to each other. The class of

p

-norms generalizes these two examples and has an abundance of applications in many parts of mathematics, physics, and computer science.

Definition

p\geq1,

the

p

-norm
or

Lp

-norm
of

x

is defined by\|x\|_p = \left(|x_1|^p + |x_2|^p + \dotsb + |x_n|^p\right)^.The absolute value bars can be dropped when

p

is a rational number with an even numerator in its reduced form, and

x

is drawn from the set of real numbers, or one of its subsets.

The Euclidean norm from above falls into this class and is the

2

-norm, and the

1

-norm is the norm that corresponds to the rectilinear distance.

The

Linfty

-norm or maximum norm (or uniform norm) is the limit of the

Lp

-norms for

p\toinfty.

It turns out that this limit is equivalent to the following definition:\|x\|_\infty = \max \left\
, x_2, \dotsc, x_n\right\

See -infinity.

For all

p\geq1,

the

p

-norms and maximum norm as defined above indeed satisfy the properties of a "length function" (or norm), which are that:

Abstractly speaking, this means that

\Realsn

together with the

p

-norm is a normed vector space. Moreover, it turns out that this space is complete, thus making it a Banach space. This Banach space is the

Lp

-space over

\{1,2,\ldots,n\}.

Relations between -norms

The grid distance or rectilinear distance (sometimes called the "Manhattan distance") between two points is never shorter than the length of the line segment between them (the Euclidean or "as the crow flies" distance). Formally, this means that the Euclidean norm of any vector is bounded by its 1-norm:\|x\|_2 \leq \|x\|_1 .

This fact generalizes to

p

-norms in that the

p

-norm

\|x\|p

of any given vector

x

does not grow with

p

:

For the opposite direction, the following relation between the

1

-norm and the

2

-norm is known:\|x\|_1 \leq \sqrt \|x\|_2 ~.

This inequality depends on the dimension

n

of the underlying vector space and follows directly from the Cauchy–Schwarz inequality.

In general, for vectors in

\Complexn

where

0<r<p:

\|x\|_p \leq \|x\|_r \leq n^ \|x\|_p ~.

This is a consequence of Hölder's inequality.

When

In

\Realsn

for

n>1,

the formula\|x\|_p = \left(|x_1|^p + |x_2| ^p + \cdots + |x_n|^p\right)^defines an absolutely homogeneous function for

0<p<1;

however, the resulting function does not define a norm, because it is not subadditive. On the other hand, the formula|x_1|^p + |x_2|^p + \dotsb + |x_n|^pdefines a subadditive function at the cost of losing absolute homogeneity. It does define an F-norm, though, which is homogeneous of degree

p.

(\Realsn,dp)

is denoted by
p.
\ell
n

Although the

p

-unit ball
p
B
n
around the origin in this metric is "concave", the topology defined on

\Realsn

by the metric

Bp

is the usual vector space topology of

\Realsn,

hence
p
\ell
n
is a locally convex topological vector space. Beyond this qualitative statement, a quantitative way to measure the lack of convexity of
p
\ell
n
is to denote by

Cp(n)

the smallest constant

C

such that the scalar multiple

C

p
B
n
of the

p

-unit ball contains the convex hull of
p,
B
n
which is equal to
1.
B
n
The fact that for fixed

p<1

we haveC_p(n) = n^ \to \infty, \quad \text n \to \inftyshows that the infinite-dimensional sequence space

\ellp

defined below, is no longer locally convex.

When

There is one

\ell0

norm and another function called the

\ell0

"norm" (with quotation marks).

The mathematical definition of the

\ell0

norm was established by Banach's Theory of Linear Operations. The space of sequences has a complete metric topology provided by the F-norm(x_n) \mapsto \sum_n 2^ \frac
,which is discussed by Stefan Rolewicz in Metric Linear Spaces. The

\ell0

-normed space is studied in functional analysis, probability theory, and harmonic analysis.

Another function was called the

\ell0

"norm" by David Donoho—whose quotation marks warn that this function is not a proper norm—is the number of non-zero entries of the vector

x.

Many authors abuse terminology by omitting the quotation marks. Defining

00=0,

the zero "norm" of

x

is equal to|x_1|^0 + |x_2|^0 + \cdots + |x_n|^0 .

This is not a norm because it is not homogeneous. For example, scaling the vector

x

by a positive constant does not change the "norm". Despite these defects as a mathematical norm, the non-zero counting "norm" has uses in scientific computing, information theory, and statistics–notably in compressed sensing in signal processing and computational harmonic analysis. Despite not being a norm, the associated metric, known as Hamming distance, is a valid distance, since homogeneity is not required for distances.

The -norm in infinite dimensions and spaces

The sequence space

The

p

-norm can be extended to vectors that have an infinite number of components (sequences), which yields the space

\ellp.

This contains as special cases:

\ell1,

the space of sequences whose series are absolutely convergent,

\ell2,

the space of square-summable sequences, which is a Hilbert space, and

\ellinfty,

the space of bounded sequences.

The space of sequences has a natural vector space structure by applying addition and scalar multiplication coordinate by coordinate. Explicitly, the vector sum and the scalar action for infinite sequences of real (or complex) numbers are given by:\begin& (x_1, x_2, \ldots, x_n, x_,\ldots)+(y_1, y_2, \ldots, y_n, y_,\ldots) \\= & (x_1+y_1, x_2+y_2, \ldots, x_n+y_n, x_+y_,\ldots), \\[6pt]& \lambda \cdot \left (x_1, x_2, \ldots, x_n, x_,\ldots \right) \\= & (\lambda x_1, \lambda x_2, \ldots, \lambda x_n, \lambda x_,\ldots).\end

Define the

p

-norm:\|x\|_p = \left(|x_1|^p + |x_2|^p + \cdots +|x_n|^p + |x_|^p + \cdots\right)^

Here, a complication arises, namely that the series on the right is not always convergent, so for example, the sequence made up of only ones,

(1,1,1,\ldots),

will have an infinite

p

-norm for

1\leqp<infty.

The space

\ellp

is then defined as the set of all infinite sequences of real (or complex) numbers such that the

p

-norm is finite.

One can check that as

p

increases, the set

\ellp

grows larger. For example, the sequence\left(1, \frac, \ldots, \frac, \frac, \ldots\right)is not in

\ell1,

but it is in

\ellp

for

p>1,

as the series1^p + \frac + \cdots + \frac + \frac + \cdots,diverges for

p=1

(the harmonic series), but is convergent for

p>1.

One also defines the

infty

-norm using the supremum:\|x\|_\infty = \sup(|x_1|, |x_2|, \dotsc, |x_n|,|x_|, \ldots)and the corresponding space

\ellinfty

of all bounded sequences. It turns out that[2] \|x\|_\infty = \lim_ \|x\|_pif the right-hand side is finite, or the left-hand side is infinite. Thus, we will consider

\ellp

spaces for

1\leqp\leqinfty.

The

p

-norm thus defined on

\ellp

is indeed a norm, and

\ellp

together with this norm is a Banach space. The fully general

Lp

space is obtained—as seen below—by considering vectors, not only with finitely or countably-infinitely many components, but with "arbitrarily many components"; in other words, functions. An integral instead of a sum is used to define the

p

-norm.

General ℓp-space

In complete analogy to the preceding definition one can define the space

\ellp(I)

over a general index set

I

(and

1\leqp<infty

) as\ell^p(I) = \left\,where convergence on the right means that only countably many summands are nonzero (see also Unconditional convergence).With the norm\|x\|_p = \left(\sum_ |x_i|^p\right)^the space

\ellp(I)

becomes a Banach space.In the case where

I

is finite with

n

elements, this construction yields

\Realsn

with the

p

-norm defined above.If

I

is countably infinite, this is exactly the sequence space

\ellp

defined above.For uncountable sets

I

this is a non-separable Banach space which can be seen as the locally convex direct limit of

\ellp

-sequence spaces.[3]

For

p=2,

the

\|\|2

-norm is even induced by a canonical inner product

\langle,\rangle,

called the , which means that

\|x\|2=\sqrt{\langlex,x\rangle}

holds for all vectors

x.

This inner product can expressed in terms of the norm by using the polarization identity. On

\ell2,

it can be defined by\langle \left(x_i\right)_, \left(y_n\right)_ \rangle_ ~=~ \sum_i x_i \overlinewhile for the space

L2(X,\mu)

associated with a measure space

(X,\Sigma,\mu),

which consists of all square-integrable functions, it is \langle f, g \rangle_ = \int_X f(x) \overline\, \mathrm dx.

Now consider the case

p=infty.

Define\ell^\infty(I)=\,where for all

x

[4] \|x\|_\infty\equiv\inf\ = \begin\sup\operatorname|x|&\text X\neq\varnothing,\\0&\text X=\varnothing.\end

The index set

I

can be turned into a measure space by giving it the discrete σ-algebra and the counting measure. Then the space

\ellp(I)

is just a special case of the more general

Lp

-space (defined below).

Lp spaces and Lebesgue integrals

An

Lp

space may be defined as a space of measurable functions for which the

p

-th power of the absolute value is Lebesgue integrable, where functions which agree almost everywhere are identified. More generally, let

(S,\Sigma,\mu)

be a measure space and

1\leqp\leqinfty.

[5] When

pinfty

, consider the set

l{L}p(S,\mu)

of all measurable functions

f

from

S

to

\Complex

or

\Reals

whose absolute value raised to the

p

-th power has a finite integral, or in symbols:\|f\|_p ~\stackrel~ \left(\int_S |f|^p\;\mathrm\mu\right)^ < \infty.

To define the set for

p=infty,

recall that two functions

f

and

g

defined on

S

are said to be, written, if the set

\{s\inS:f(s)g(s)\}

is measurable and has measure zero. Similarly, a measurable function

f

(and its absolute value) is (or) by a real number

C,

written, if the (necessarily) measurable set

\{s\inS:|f(s)|>C\}

has measure zero. The space

l{L}infty(S,\mu)

is the set of all measurable functions

f

that are bounded almost everywhere (by some real

C

) and

\|f\|infty

is defined as the infimum of these bounds:\|f\|_\infty ~\stackrel~ \inf \. When

\mu(S)0

then this is the same as the essential supremum of the absolute value of

f

:\|f\|_\infty ~=~ \begin\operatorname|f| & \text \mu(S) > 0,\\ 0 & \text \mu(S) = 0.\end

For example, if

f

is a measurable function that is equal to

0

almost everywhere[6] then

\|f\|p=0

for every

p

and thus

f\inl{L}p(S,\mu)

for all

p.

For every positive

p,

the value under

\|\|p

of a measurable function

f

and its absolute value

|f|:S\to[0,infty]

are always the same (that is,

\|f\|p=\||f|\|p

for all

p

) and so a measurable function belongs to

l{L}p(S,\mu)

if and only if its absolute value does. Because of this, many formulas involving

p

-norms are stated only for non-negative real-valued functions. Consider for example the identity
r
\|f\|
p

=

r\|
\|f
p/r

,

which holds whenever

f\geq0

is measurable,

r>0

is real, and

0<p\leqinfty

(here

infty/r\stackrel{\scriptscriptstyledef

}\; \infty when

p=infty

). The non-negativity requirement

f\geq0

can be removed by substituting

|f|

in for

f,

which gives
r
\||f|\|
p

=

r\|
\||f|
p/r

.

Note in particular that when

p=r

is finite then the formula
p
\|f\|
p

=

p\|
\||f|
1
relates the

p

-norm to the

1

-norm.

Seminormed space of

p

-th power integrable functions

Each set of functions

l{L}p(S,\mu)

forms a vector space when addition and scalar multiplication are defined pointwise.[7] That the sum of two

p

-th power integrable functions

f

and

g

is again

p

-th power integrable follows from \|f + g\|_p^p \leq 2^ \left(\|f\|_p^p + \|g\|_p^p\right),[8]

Notes and References

  1. Book: Hastie, T. J. . Trevor Hastie

    . Trevor Hastie . Tibshirani . R. . Robert Tibshirani . Wainwright . M. J. . 2015 . Statistical Learning with Sparsity: The Lasso and Generalizations . CRC Press . 978-1-4987-1216-3 .

  2. , page 16
  3. Rafael Dahmen, Gábor Lukács: Long colimits of topological groups I: Continuous maps and homeomorphisms. in: Topology and its Applications Nr. 270, 2020. Example 2.14
  4. Book: Garling. D. J. H.. Inequalities: A Journey into Linear Analysis. 2007. Cambridge University Press. 978-0-521-87624-7. 54.
  5. The definitions of

    \|\|p,

    l{L}p(S,\mu),

    and

    Lp(S,\mu)

    can be extended to all

    0<p\leqinfty

    (rather than just

    1\leqp\leqinfty

    ), but it is only when

    1\leqp\leqinfty

    that

    \|\|p

    is guaranteed to be a norm (although

    \|\|p

    is a quasi-seminorm for all

    0<p\leqinfty,

    ).
  6. For example, if a non-empty measurable set

    N\varnothing

    of measure

    \mu(N)=0

    exists then its indicator function

    1N

    satisfies

    \|1N\|p=0

    although

    1N0.

  7. Explicitly, the vector space operations are defined by:\begin(f+g)(x) &= f(x)+g(x), \\(s f)(x) &= s f(x)\endfor all

    f,g\inl{L}p(S,\mu)

    and all scalars

    s.

    These operations make

    l{L}p(S,\mu)

    into a vector space because if

    s

    is any scalar and

    f,g\inl{L}p(S,\mu)

    then both

    sf

    and

    f+g

    also belong to

    l{L}p(S,\mu).

  8. , Theorem 6.16
  9. See Sections 14.77 and 27.44–47
  10. When

    1\leqp<infty,

    the inequality

    \|f+

    p
    g\|
    p

    \leq2p-1

    p
    \left(\|f\|
    p

    +

    p\right)
    \|g\|
    p
    can be deduced from the fact that the function

    F:[0,infty)\to\Reals

    defined by

    F(t)=tp

    is convex, which by definition means that

    F(tx+(1-t)y)\leqtF(x)+(1-t)F(y)

    for all

    0\leqt\leq1

    and all

    x,y

    in the domain of

    F.

    Substituting

    |f|,|g|,

    and

    \tfrac{1}{2}

    in for

    x,y,

    and

    t

    gives

    \left(\tfrac{1}{2}|f|+\tfrac{1}{2}|g|\right)p\leq\tfrac{1}{2}|f|p+\tfrac{1}{2}|g|p,

    which proves that

    (|f|+|g|)p\leq2p-1(|f|p+|g|p).

    The triangle inequality

    |f+g|\leq|f|+|g|

    now implies

    |f+g|p\leq2p-1(|f|p+|g|p).

    The desired inequality follows by integrating both sides.

    \blacksquare

    although it is also a consequence of Minkowski's inequality \|f + g\|_p \leq \|f\|_p + \|g\|_p which establishes that

    \|\|p

    satisfies the triangle inequality for

    1\leqp\leqinfty

    (the triangle inequality does not hold for

    0<p<1

    ). That

    l{L}p(S,\mu)

    is closed under scalar multiplication is due to

    \|\|p

    being absolutely homogeneous, which means that

    \|sf\|p=|s|\|f\|p

    for every scalar

    s

    and every function

    f.

    Absolute homogeneity, the triangle inequality, and non-negativity are the defining properties of a seminorm. Thus

    \|\|p

    is a seminorm and the set

    l{L}p(S,\mu)

    of

    p

    -th power integrable functions together with the function

    \|\|p

    defines a seminormed vector space. In general, the seminorm

    \|\|p

    is not a norm because there might exist measurable functions

    f

    that satisfy

    \|f\|p=0

    but are not equal to

    0

    (

    \|\|p

    is a norm if and only if no such

    f

    exists).

    Zero sets of

    p

    -seminorms

    If

    f

    is measurable and equals

    0

    a.e. then

    \|f\|p=0

    for all positive

    p\leqinfty.

    On the other hand, if

    f

    is a measurable function for which there exists some

    0<p\leqinfty

    such that

    \|f\|p=0

    then

    f=0

    almost everywhere. When

    p

    is finite then this follows from the

    p=1

    case and the formula
    p
    \|f\|
    p

    =

    p\|
    \||f|
    1
    mentioned above.

    Thus if

    p\leqinfty

    is positive and

    f

    is any measurable function, then

    \|f\|p=0

    if and only if

    f=0

    almost everywhere. Since the right hand side (

    f=0

    a.e.) does not mention

    p,

    it follows that all

    \|\|p

    have the same zero set (it does not depend on

    p

    ). So denote this common set by\mathcal \;\stackrel\; \ = \ \qquad \forall \ p.This set is a vector subspace of

    l{L}p(S,\mu)

    for every positive

    p\leqinfty.

    Quotient vector space

    Like every seminorm, the seminorm

    \|\|p

    induces a norm (defined shortly) on the canonical quotient vector space of

    l{L}p(S,\mu)

    by its vector subspace \mathcal = \.This normed quotient space is called and it is the subject of this article. We begin by defining the quotient vector space.

    Given any

    f\inl{L}p(S,\mu),

    the coset

    f+l{N}\stackrel{\scriptscriptstyledef

    }\; \ consists of all measurable functions

    g

    that are equal to

    f

    almost everywhere. The set of all cosets, typically denoted by \mathcal^p(S, \mu) / \mathcal ~~\stackrel~~ \,forms a vector space with origin

    0+l{N}=l{N}

    when vector addition and scalar multiplication are defined by

    (f+l{N})+(g+l{N})\stackrel{\scriptscriptstyledef

    }\; (f + g) + \mathcal and

    s(f+l{N})\stackrel{\scriptscriptstyledef

    }\; (s f) + \mathcal. This particular quotient vector space will be denoted by

    Lp(S,\mu)~\stackrel{\scriptscriptstyledef

    }~ \mathcal^p(S, \mu) / \mathcal.

    Two cosets are equal

    f+l{N}=g+l{N}

    if and only if

    g\inf+l{N}

    (or equivalently,

    f-g\inl{N}

    ), which happens if and only if

    f=g

    almost everywhere; if this is the case then

    f

    and

    g

    are identified in the quotient space.

    The

    p

    -norm on the quotient vector space

    Given any

    f\inl{L}p(S,\mu),

    the value of the seminorm

    \|\|p

    on the coset

    f+l{N}=\{f+h:h\inl{N}\}

    is constant and equal to

    \|f\|p;

    denote this unique value by

    \|f+l{N}\|p,

    so that:\|f + \mathcal\|_p \;\stackrel\; \|f\|_p. This assignment

    f+l{N}\mapsto\|f+l{N}\|p

    defines a map, which will also be denoted by

    \|\|p,

    on the quotient vector space L^p(S, \mu) ~~\stackrel~~ \mathcal^p(S, \mu) / \mathcal ~=~ \.This map is a norm on

    Lp(S,\mu)

    called the . The value

    \|f+l{N}\|p

    of a coset

    f+l{N}

    is independent of the particular function

    f

    that was chosen to represent the coset, meaning that if

    l{C}\inLp(S,\mu)

    is any coset then

    \|l{C}\|p=\|f\|p

    for every

    f\inl{C}

    (since

    l{C}=f+l{N}

    for every

    f\inl{C}

    ).

    The Lebesgue

    Lp

    space

    \left(Lp(S,\mu),\|\|p\right)

    is called or the of

    p

    -th power integrable functions and it is a Banach space for every

    1\leqp\leqinfty

    (meaning that it is a complete metric space, a result that is sometimes called the Riesz–Fischer theorem). When the underlying measure space

    S

    is understood then

    Lp(S,\mu)

    is often abbreviated

    Lp(\mu),

    or even just

    Lp.

    Depending on the author, the subscript notation

    Lp

    might denote either

    Lp(S,\mu)

    or

    L1/p(S,\mu).

    If the seminorm

    \|\|p

    on

    l{L}p(S,\mu)

    happens to be a norm (which happens if and only if

    l{N}=\{0\}

    ) then the normed space

    \left(l{L}p(S,\mu),\|\|p\right)

    will be linearly isometrically isomorphic to the normed quotient space

    \left(Lp(S,\mu),\|\|p\right)

    via the canonical map

    g\inl{L}p(S,\mu)\mapsto\{g\}

    (since

    g+l{N}=\{g\}

    ); in other words, they will be, up to a linear isometry, the same normed space and so they may both be called "

    Lp

    space".

    The above definitions generalize to Bochner spaces.

    In general, this process cannot be reversed: there is no consistent way to define a "canonical" representative of each coset of

    l{N}

    in

    Lp.

    For

    Linfty,

    however, there is a theory of lifts enabling such recovery.

    Special cases

    Similar to the

    \ellp

    spaces,

    L2

    is the only Hilbert space among

    Lp

    spaces. In the complex case, the inner product on

    L2

    is defined by\langle f, g \rangle = \int_S f(x) \overline \, \mathrm\mu(x)

    The additional inner product structure allows for a richer theory, with applications to, for instance, Fourier series and quantum mechanics. Functions in

    L2

    are sometimes called square-integrable functions, quadratically integrable functions or square-summable functions, but sometimes these terms are reserved for functions that are square-integrable in some other sense, such as in the sense of a Riemann integral .

    If we use complex-valued functions, the space

    Linfty

    is a commutative C*-algebra with pointwise multiplication and conjugation. For many measure spaces, including all sigma-finite ones, it is in fact a commutative von Neumann algebra. An element of

    Linfty

    defines a bounded operator on any

    Lp

    space by multiplication.

    For

    1\leqp\leqinfty

    the

    \ellp

    spaces are a special case of

    Lp

    spaces, when

    S=N)

    consists of the natural numbers and

    \mu

    is the counting measure on

    N.

    More generally, if one considers any set

    S

    with the counting measure, the resulting

    Lp

    space is denoted

    \ellp(S).

    For example, the space

    \ellp(Z)

    is the space of all sequences indexed by the integers, and when defining the

    p

    -norm on such a space, one sums over all the integers. The space

    \ellp(n),

    where

    n

    is the set with

    n

    elements, is

    \Realsn

    with its

    p

    -norm as defined above. As any Hilbert space, every space

    L2

    is linearly isometric to a suitable

    \ell2(I),

    where the cardinality of the set

    I

    is the cardinality of an arbitrary Hilbertian basis for this particular

    L2.

    Properties of Lp spaces

    As in the discrete case, if there exists

    q<infty

    such that

    f\inLinfty(S,\mu)\capLq(S,\mu),

    then\|f\|_\infty = \lim_\|f\|_p.

    Hölder's inequality

    Suppose

    p,q,r\in[1,infty]

    satisfy

    \tfrac{1}{p}+\tfrac{1}{q}=\tfrac{1}{r}

    (where

    \tfrac{1}{infty}~\stackrel{\scriptscriptstyledef

    }~ 0). If

    f\inLp(S,\mu)

    and

    g\inLq(S,\mu)

    then

    fg\inLr(S,\mu)

    and \|f g\|_r ~\leq~ \|f\|_p \, \|g\|_q.

    This inequality, called Hölder's inequality, is in some sense optimal since if

    r=1

    (so

    \tfrac{1}{p}+\tfrac{1}{q}=1

    ) and

    f

    is a measurable function such that\sup_ \, \int_S |f g| \, \mathrm \mu ~<~ \infty where the supremum is taken over the closed unit ball of

    Lq(S,\mu),

    then

    f\inLp(S,\mu)

    and\|f\|_p ~=~ \sup_ \, \int_S f g \, \mathrm \mu.

    Minkowski inequality

    Minkowski inequality, which states that

    \|\|p

    satisfies the triangle inequality, can be generalized: If the measurable function

    F:M x N\to\Reals

    is non-negative (where

    (M,\mu)

    and

    (N,\nu)

    are measure spaces) then for all

    1\leqp\leqq\leqinfty,

    \left\|\left\|F(\,\cdot, n)\right\|_\right\|_~\leq~ \left\|\left\|F(m, \cdot)\right\|_\right\|_ \ .

    Atomic decomposition

    If

    1\leqp<infty

    then every non-negative

    f\inLp(\mu)

    has an, meaning that there exist a sequence

    (rn)n

    of non-negative real numbers and a sequence of non-negative functions

    (fn)n,

    called, whose supports

    \left(\operatorname{supp}fn\right)n

    are pairwise disjoint sets of measure

    \mu\left(\operatorname{supp}fn\right)\leq2n+1,

    such thatf ~=~ \sum_ r_n \, f_n \,,and for every integer

    n\in\Z,

    \|f_n\|_\infty ~\leq~ 2^ \,, and\tfrac \|f\|_p^p ~\leq~ \sum_ r_n^p ~\leq~ 2 \|f\|^p_p \,,and where moreover, the sequence of functions

    (rnfn)n

    depends only on

    f

    (it is independent of

    p

    ). These inequalities guarantee that

    \|fn\|

    p
    p

    \leq2

    for all integers

    n

    while the supports of

    (fn)n

    being pairwise disjoint implies \|f\|_p^p ~=~ \sum_ r_n^p \, \|f_n\|^p_p \, .

    An atomic decomposition can be explicitly given by first defining for every integer

    n\in\Z,

    t_n = \inf \(this infimum is attained by

    tn;

    that is,

    \mu(f>tn)<2n

    holds) and then lettingr_n ~=~ 2^ \, t_n ~ \text \quad f_n ~=~ \frac \, \mathbf_where

    \mu(f>t)=\mu(\{s:f(s)>t\})

    denotes the measure of the set

    (f>t):=\{s\inS:f(s)>t\}

    and
    1
    (tn+1<f\leqtn)
    denotes the indicator function of the set

    (tn+1<f\leqtn):=\{s\inS:tn+1<f(s)\leqtn\}.

    The sequence

    (tn)n

    is decreasing and converges to

    0

    as

    n\toinfty.

    Consequently, if

    tn=0

    then

    tn+1=0

    and

    (tn+1<f\leqtn)=\varnothing

    so that

    fn=

    1
    rn

    f

    1
    (tn+1<f\leqtn)
    is identically equal to

    0

    (in particular, the division

    \tfrac{1}{rn}

    by

    rn=0

    causes no issues).

    t\in\Reals\mapsto\mu(|f|>t)

    of

    |f|=f

    that was used to define the

    tn

    also appears in the definition of the weak

    Lp

    -norm (given below) and can be used to express the

    p

    -norm

    \|\|p

    (for

    1\leqp<infty

    ) of

    f\inLp(S,\mu)

    as the integral\|f\|_p^p ~=~ p \, \int_0^\infty t^ \mu(|f| > t) \, \mathrm t \,,where the integration is with respect to the usual Lebesgue measure on

    (0,infty).

    Dual spaces

    The dual space (the Banach space of all continuous linear functionals) of

    Lp(\mu)

    for

    1<p<infty

    has a natural isomorphism with

    Lq(\mu),

    where

    q

    is such that

    \tfrac{1}{p}+\tfrac{1}{q}=1

    (i.e.

    q=\tfrac{p}{p-1}

    ). This isomorphism associates

    g\inLq(\mu)

    with the functional

    \kappap(g)\inLp(\mu)*

    defined byf \mapsto \kappa_p(g)(f) = \int f g \, \mathrm\mu for every

    f\inLp(\mu).

    The fact that

    \kappap(g)

    is well defined and continuous follows from Hölder's inequality.

    \kappap:Lq(\mu)\toLp(\mu)*

    is a linear mapping which is an isometry by the extremal case of Hölder's inequality. It is also possible to show (for example with the Radon–Nikodym theorem, see[8]) that any

    G\inLp(\mu)*

    can be expressed this way: i.e., that

    \kappap

    is onto. Since

    \kappap

    is onto and isometric, it is an isomorphism of Banach spaces. With this (isometric) isomorphism in mind, it is usual to say simply that

    Lq(\mu)

    is the continuous dual space of

    Lp(\mu).

    For

    1<p<infty,

    the space

    Lp(\mu)

    is reflexive. Let

    \kappap

    be as above and let

    \kappaq:Lp(\mu)\toLq(\mu)*

    be the corresponding linear isometry. Consider the map from

    Lp(\mu)

    to

    Lp(\mu)**,

    obtained by composing

    \kappaq

    with the transpose (or adjoint) of the inverse of

    \kappap:

    j_p : L^p(\mu) \mathrel L^q(\mu)^* \mathrel L^p(\mu)^

    J

    of

    Lp(\mu)

    into its bidual. Moreover, the map

    jp

    is onto, as composition of two onto isometries, and this proves reflexivity.

    If the measure

    \mu

    on

    S

    is sigma-finite, then the dual of

    L1(\mu)

    is isometrically isomorphic to

    Linfty(\mu)

    (more precisely, the map

    \kappa1

    corresponding to

    p=1

    is an isometry from

    Linfty(\mu)

    onto

    L1(\mu)*.

    The dual of

    Linfty(\mu)

    is subtler. Elements of

    Linfty(\mu)*

    can be identified with bounded signed finitely additive measures on

    S

    that are absolutely continuous with respect to

    \mu.

    See ba space for more details. If we assume the axiom of choice, this space is much bigger than

    L1(\mu)

    except in some trivial cases. However, Saharon Shelah proved that there are relatively consistent extensions of Zermelo–Fraenkel set theory (ZF + DC + "Every subset of the real numbers has the Baire property") in which the dual of

    \ellinfty

    is

    \ell1.

    [9]

    Embeddings

    Colloquially, if

    1\leqp<q\leqinfty,

    then

    Lp(S,\mu)

    contains functions that are more locally singular, while elements of

    Lq(S,\mu)

    can be more spread out. Consider the Lebesgue measure on the half line

    (0,infty).

    A continuous function in

    L1

    might blow up near

    0

    but must decay sufficiently fast toward infinity. On the other hand, continuous functions in

    Linfty

    need not decay at all but no blow-up is allowed. The precise technical result is the following. Suppose that

    0<p<q\leqinfty.

    Then:

    Lq(S,\mu)\subseteqLp(S,\mu)

    if and only if

    S

    does not contain sets of finite but arbitrarily large measure (any finite measure, for example).

    Lp(S,\mu)\subseteqLq(S,\mu)

    if and only if

    S

    does not contain sets of non-zero but arbitrarily small measure (the counting measure, for example).

    Neither condition holds for the real line with the Lebesgue measure while both conditions holds for the counting measure on any finite set. In both cases the embedding is continuous, in that the identity operator is a bounded linear map from

    Lq

    to

    Lp

    in the first case, and

    Lp

    to

    Lq

    in the second.(This is a consequence of the closed graph theorem and properties of

    Lp

    spaces.) Indeed, if the domain

    S

    has finite measure, one can make the following explicit calculation using Hölder's inequality\ \|\mathbff^p\|_1 \leq \|\mathbf\|_ \|f^p\|_leading to\ \|f\|_p \leq \mu(S)^ \|f\|_q .

    The constant appearing in the above inequality is optimal, in the sense that the operator norm of the identity

    I:Lq(S,\mu)\toLp(S,\mu)

    is precisely\|I\|_ = \mu(S)^the case of equality being achieved exactly when

    f=1

    \mu

    -almost-everywhere.

    Dense subspaces

    Throughout this section we assume that

    1\leqp<infty.

    Let

    (S,\Sigma,\mu)

    be a measure space. An integrable simple function

    f

    on

    S

    is one of the formf = \sum_^n a_j \mathbf_where

    aj

    are scalars,

    Aj\in\Sigma

    has finite measure and
    {1}
    Aj
    is the indicator function of the set

    Aj,

    for

    j=1,...,n.

    By construction of the integral, the vector space of integrable simple functions is dense in

    Lp(S,\Sigma,\mu).

    More can be said when

    S

    is a normal topological space and

    \Sigma

    its Borel  - algebra, i.e., the smallest  - algebra of subsets of

    S

    containing the open sets.

    Suppose

    V\subseteqS

    is an open set with

    \mu(V)<infty.

    It can be proved that for every Borel set

    A\in\Sigma

    contained in

    V,

    and for every

    \varepsilon>0,

    there exist a closed set

    F

    and an open set

    U

    such thatF \subseteq A \subseteq U \subseteq V \quad \text \quad \mu(U) - \mu(F) = \mu(U \setminus F) < \varepsilon

    0\leq\varphi\leq1

    on

    S

    that is

    1

    on

    F

    and

    0

    on

    S\setminusU,

    with\int_S |\mathbf_A - \varphi| \, \mathrm\mu < \varepsilon \, .

    If

    S

    can be covered by an increasing sequence

    (Vn)

    of open sets that have finite measure, then the space of

    p

     - integrable continuous functions is dense in

    Lp(S,\Sigma,\mu).

    More precisely, one can use bounded continuous functions that vanish outside one of the open sets

    Vn.

    This applies in particular when

    S=\Realsd

    and when

    \mu

    is the Lebesgue measure. The space of continuous and compactly supported functions is dense in

    Lp(\Realsd).

    Similarly, the space of integrable step functions is dense in

    Lp(\Realsd);

    this space is the linear span of indicator functions of bounded intervals when

    d=1,

    of bounded rectangles when

    d=2

    and more generally of products of bounded intervals.

    Several properties of general functions in

    Lp(\Realsd)

    are first proved for continuous and compactly supported functions (sometimes for step functions), then extended by density to all functions. For example, it is proved this way that translations are continuous on

    Lp(\Realsd),

    in the following sense:\forall f \in L^p \left(\Reals^d\right) : \quad \left\|\tau_t f - f \right\|_p \to 0,\quad \text \Reals^d \ni t \to 0,where(\tau_t f)(x) = f(x - t).

    Closed subspaces

    If

    0<p<infty

    is any positive real number,

    \mu

    is a probability measure on a measurable space

    (S,\Sigma)

    (so that

    Linfty(\mu)\subseteqLp(\mu)

    ), and

    V\subseteqLinfty(\mu)

    is a vector subspace, then

    V

    is a closed subspace of

    Lp(\mu)

    if and only if

    V

    is finite-dimensional (

    V

    was chosen independent of

    p

    ). In this theorem, which is due to Alexander Grothendieck, it is crucial that the vector space

    V

    be a subset of

    Linfty

    since it is possible to construct an infinite-dimensional closed vector subspace of

    L1\left(S1,\tfrac{1}{2\pi}λ\right)

    (that is even a subset of

    L4

    ), where

    λ

    is Lebesgue measure on the unit circle

    S1

    and

    \tfrac{1}{2\pi}λ

    is the probability measure that results from dividing it by its mass

    λ(S1)=2\pi.

    Let

    (S,\Sigma,\mu)

    be a measure space. If

    0<p<1,

    then

    Lp(\mu)

    can be defined as above: it is the quotient vector space of those measurable functions

    f

    such thatN_p(f) = \int_S |f|^p\, d\mu < \infty.

    As before, we may introduce the

    p

    -norm

    \|f\|p=

    1/p
    N
    p(f)

    ,

    but

    \|\|p

    does not satisfy the triangle inequality in this case, and defines only a quasi-norm. The inequality

    (a+b)p\leqap+bp,

    valid for

    a,b\geq0,

    implies that N_p(f + g) \leq N_p(f) + N_p(g)and so the functiond_p(f,g) = N_p(f - g) = \|f - g\|_p^pis a metric on

    Lp(\mu).

    The resulting metric space is complete; the verification is similar to the familiar case when

    p\geq1.

    The balls B_r = \form a local base at the origin for this topology, as

    r>0

    ranges over the positive reals. These balls satisfy

    Br=r1/pB1

    for all real

    r>0,

    which in particular shows that

    B1

    is a bounded neighborhood of the origin; in other words, this space is locally bounded, just like every normed space, despite

    \|\|p

    not being a norm.

    In this setting

    Lp

    satisfies a reverse Minkowski inequality, that is for

    u,v\inLp

    \Big\||u| + |v|\Big\|_p \geq \|u\|_p + \|v\|_p

    This result may be used to prove Clarkson's inequalities, which are in turn used to establish the uniform convexity of the spaces

    Lp

    for

    1<p<infty

    .

    The space

    Lp

    for

    0<p<1

    is an F-space: it admits a complete translation-invariant metric with respect to which the vector space operations are continuous. It is the prototypical example of an F-space that, for most reasonable measure spaces, is not locally convex: in

    \ellp

    or

    Lp([0,1]),

    every open convex set containing the

    0

    function is unbounded for the

    p

    -quasi-norm; therefore, the

    0

    vector does not possess a fundamental system of convex neighborhoods. Specifically, this is true if the measure space

    S

    contains an infinite family of disjoint measurable sets of finite positive measure.

    The only nonempty convex open set in

    Lp([0,1])

    is the entire space . As a particular consequence, there are no nonzero continuous linear functionals on

    Lp([0,1]);

    the continuous dual space is the zero space. In the case of the counting measure on the natural numbers (producing the sequence space

    Lp(\mu)=\ellp

    ), the bounded linear functionals on

    \ellp

    are exactly those that are bounded on

    \ell1,

    namely those given by sequences in

    \ellinfty.

    Although

    \ellp

    does contain non-trivial convex open sets, it fails to have enough of them to give a base for the topology.

    The situation of having no linear functionals is highly undesirable for the purposes of doing analysis. In the case of the Lebesgue measure on

    \Realsn,

    rather than work with

    Lp

    for

    0<p<1,

    it is common to work with the Hardy space whenever possible, as this has quite a few linear functionals: enough to distinguish points from one another. However, the Hahn–Banach theorem still fails in for

    p<1

    .

    , the space of measurable functions

    The vector space of (equivalence classes of) measurable functions on

    (S,\Sigma,\mu)

    is denoted

    L0(S,\Sigma,\mu)

    . By definition, it contains all the

    Lp,

    and is equipped with the topology of convergence in measure. When

    \mu

    is a probability measure (i.e.,

    \mu(S)=1

    ), this mode of convergence is named convergence in probability. The space

    L0

    is always a topological abelian group but is only a topological vector space if

    \mu(S)<infty.

    This is because scalar multiplication is continuous if and only if

    \mu(S)<infty.

    If

    (S,\Sigma,\mu)

    is

    \sigma

    -finite then the weaker topology of local convergence in measure is an F-space, i.e. a completely metrizable topological vector space. Moreover, this topology is isometric to global convergence in measure

    (S,\Sigma,\nu)

    for a suitable choice of probability measure

    \nu.

    The description is easier when

    \mu

    is finite. If

    \mu

    is a finite measure on

    (S,\Sigma),

    the

    0

    function admits for the convergence in measure the following fundamental system of neighborhoodsV_\varepsilon = \Bigl\, \qquad \varepsilon > 0.

    The topology can be defined by any metric

    d

    of the formd(f, g) = \int_S \varphi \bigl(|f(x) - g(x)|\bigr)\, \mathrm\mu(x)where

    \varphi

    is bounded continuous concave and non-decreasing on

    [0,infty),

    with

    \varphi(0)=0

    and

    \varphi(t)>0

    when

    t>0

    (for example,

    \varphi(t)=min(t,1).

    Such a metric is called Lévy-metric for

    L0.

    Under this metric the space

    L0

    is complete. However, as mentioned above, scalar multiplication is continuous with respect to this metric only if

    \mu(S)<infty

    . To see this, consider the Lebesgue measurable function

    f:R → R

    defined by

    f(x)=x

    . Then clearly

    \limcd(cf,0)=infty

    . The space

    L0

    is in general not locally bounded, and not locally convex.

    For the infinite Lebesgue measure

    λ

    on

    \Realsn,

    the definition of the fundamental system of neighborhoods could be modified as followsW_\varepsilon = \left\

    The resulting space

    L0(\Realsn,λ)

    , with the topology of local convergence in measure, is isomorphic to the space

    L0(\Realsn,gλ),

    for any positive

    λ

     - integrable density

    g.

    Generalizations and extensions

    Weak

    Let

    (S,\Sigma,\mu)

    be a measure space, and

    f

    a measurable function with real or complex values on

    S.

    The distribution function of

    f

    is defined for

    t\geq0

    by\lambda_f(t) = \mu\.

    If

    f

    is in

    Lp(S,\mu)

    for some

    p

    with

    1\leqp<infty,

    then by Markov's inequality,\lambda_f(t) \leq \frac

    A function

    f

    is said to be in the space weak

    Lp(S,\mu)

    , or

    Lp,w(S,\mu),

    if there is a constant

    C>0

    such that, for all

    t>0,

    \lambda_f(t) \leq \frac

    The best constant

    C

    for this inequality is the

    Lp,w

    -norm of

    f,

    and is denoted by\|f\|_ = \sup_ ~ t \lambda_f^(t).

    The weak

    Lp

    coincide with the Lorentz spaces

    Lp,infty,

    so this notation is also used to denote them.

    The

    Lp,w

    -norm is not a true norm, since the triangle inequality fails to hold. Nevertheless, for

    f

    in

    Lp(S,\mu),

    \|f\|_ \leq \|f\|_pand in particular

    Lp(S,\mu)\subsetLp,w(S,\mu).

    In fact, one has\|f\|^p_ = \int |f(x)|^p d\mu(x) \geq \int_ t^p + \int_ |f|^p \geq t^p \mu(\

    > t \
    ),and raising to power

    1/p

    and taking the supremum in

    t

    one has\|f\|_ \geq \sup_ t \; \mu(\
    > t \
    )^ = \|f\|_.

    Under the convention that two functions are equal if they are equal

    \mu

    almost everywhere, then the spaces

    Lp,w

    are complete .

    For any

    0<r<p

    the expression\|| f |\|_ = \sup_ \mu(E)^ \left(\int_E |f|^r\, d\mu\right)^is comparable to the

    Lp,w

    -norm. Further in the case

    p>1,

    this expression defines a norm if

    r=1.

    Hence for

    p>1

    the weak

    Lp

    spaces are Banach spaces .

    A major result that uses the

    Lp,w

    -spaces is the Marcinkiewicz interpolation theorem, which has broad applications to harmonic analysis and the study of singular integrals.

    Weighted spaces

    (S,\Sigma,\mu).

    Let

    w:S\to[a,infty),a>0

    be a measurable function. The

    w

    -weighted

    Lp

    space
    is defined as

    Lp(S,wd\mu),

    where

    wd\mu

    means the measure

    \nu

    defined by\nu(A) \equiv \int_A w(x) \, \mathrm \mu (x), \qquad A \in \Sigma,

    or, in terms of the Radon–Nikodym derivative,

    w=\tfrac{d\nu}{d\mu}

    the norm for

    Lp(S,wd\mu)

    is explicitly\|u\|_ \equiv \left(\int_S w(x) |u(x)|^p \, \mathrm \mu(x)\right)^

    As

    Lp

    -spaces, the weighted spaces have nothing special, since

    Lp(S,wd\mu)

    is equal to

    Lp(S,d\nu).

    But they are the natural framework for several results in harmonic analysis ; they appear for example in the Muckenhoupt theorem: for

    1<p<infty,

    the classical Hilbert transform is defined on

    Lp(T,λ)

    where

    T

    denotes the unit circle and

    λ

    the Lebesgue measure; the (nonlinear) Hardy–Littlewood maximal operator is bounded on

    Lp(\Realsn,λ).

    Muckenhoupt's theorem describes weights

    w

    such that the Hilbert transform remains bounded on

    Lp(T,wdλ)

    and the maximal operator on

    Lp(\Realsn,wdλ).

    spaces on manifolds

    One may also define spaces

    Lp(M)

    on a manifold, called the intrinsic

    Lp

    spaces
    of the manifold, using densities.

    Vector-valued spaces

    Given a measure space

    (\Omega,\Sigma,\mu)

    and a locally convex space

    E

    (here assumed to be complete), it is possible to define spaces of

    p

    -integrable

    E

    -valued functions on

    \Omega

    in a number of ways. One way is to define the spaces of Bochner integrable and Pettis integrable functions, and then endow them with locally convex TVS-topologies that are (each in their own way) a natural generalization of the usual

    Lp

    topology. Another way involves topological tensor products of

    Lp(\Omega,\Sigma,\mu)

    with

    E.

    Element of the vector space

    Lp(\Omega,\Sigma,\mu)E

    are finite sums of simple tensors

    f1e1++fnen,

    where each simple tensor

    f x e

    may be identified with the function

    \Omega\toE

    that sends

    x\mapstoef(x).

    This tensor product

    Lp(\Omega,\Sigma,\mu)E

    is then endowed with a locally convex topology that turns it into a topological tensor product, the most common of which are the projective tensor product, denoted by

    Lp(\Omega,\Sigma,\mu)\piE,

    and the injective tensor product, denoted by

    Lp(\Omega,\Sigma,\mu)\varepsilonE.

    In general, neither of these space are complete so their completions are constructed, which are respectively denoted by

    Lp(\Omega,\Sigma,\mu)\widehat{}\piE

    and

    Lp(\Omega,\Sigma,\mu)\widehat{}\varepsilonE

    (this is analogous to how the space of scalar-valued simple functions on

    \Omega,

    when seminormed by any

    \|\|p,

    is not complete so a completion is constructed which, after being quotiented by

    \ker\|\|p,

    is isometrically isomorphic to the Banach space

    Lp(\Omega,\mu)

    ). Alexander Grothendieck showed that when

    E

    is a nuclear space (a concept he introduced), then these two constructions are, respectively, canonically TVS-isomorphic with the spaces of Bochner and Pettis integral functions mentioned earlier; in short, they are indistinguishable.

    See also

    \left(

    1
    L
    loc

    \right)

    References

    • .
      • .
    • .
    • .
      • .
    • .

    External links