Calculus on Euclidean space explained

In mathematics, calculus on Euclidean space is a generalization of calculus of functions in one or several variables to calculus of functions on Euclidean space $\mathbb{R}^n$ as well as a finite-dimensional real vector space. This calculus is also known as advanced calculus, especially in the United States. It is similar to multivariable calculus but is somewhat more sophisticated in that it uses linear algebra (or some functional analysis) more extensively and covers some concepts from differential geometry such as differential forms and Stokes' formula in terms of differential forms. This extensive use of linear algebra also allows a natural generalization of multivariable calculus to calculus on Banach spaces or topological vector spaces.

Calculus on Euclidean space is also a local model of calculus on manifolds, a theory of functions on manifolds.

Basic notions

Functions in one real variable

This section is a brief review of function theory in one-variable calculus.

A real-valued function $f:\mathbb{R}\to\mathbb{R}$ is continuous at $a$ if it is approximately constant near $a$; i.e.,

$$\lim_{h\to0}(f(a+h)-f(a))=0.$$

In contrast, the function $f$ is differentiable at $a$ if it is approximately linear near $a$; i.e., there is some real number $\lambda$ such that

$$\lim_{h\to0}\frac{f(a+h)-f(a)-\lambda h}{h}=0.$$

(For simplicity, suppose $f(a)=0$. Then the above means that $f(a+h)=\lambda h+g(a,h)$ where $g(a,h)$ goes to 0 faster than $h$ going to 0 and, in that sense, $f(a+h)$ behaves like $\lambda h$.)

The number $\lambda$ depends on $a$ and thus is denoted as $f'(a)$. If $f$ is differentiable on an open interval $U$ and if $f'$ is a continuous function on $U$, then $f$ is called a $C^1$ function. More generally, $f$ is called a $C^k$ function if its derivative $f'$ is a $C^{k-1}$ function. Taylor's theorem states that a $C^k$ function is precisely a function that can be approximated by a polynomial of degree $k$.

If $f:\mathbb{R}\to\mathbb{R}$ is a $C^1$ function and $f'(a)\ne0$ for some $a$, then either $f'(a)>0$ or $f'(a)<0$; i.e., either $f$ is strictly increasing or strictly decreasing in some open interval containing $a$. In particular, $f:f^{-1}(U)\to U$ is bijective for some open interval $U$ containing $f(a)$. The inverse function theorem then says that the inverse function $f^{-1}$ is differentiable on $U$ with the derivative: for $y\in U$,

$$(f^{-1})'(y)=\frac{1}{f'(f^{-1}(y))}.$$
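The derivative formula for the inverse function can be checked numerically. The following is a minimal sketch, assuming the illustrative choice $f(x)=x^3+x$ (not taken from the text) and a bisection-based inverse; the names `f_inverse`, `inverse_derivative_numeric` and `inverse_derivative_formula` are hypothetical helpers.

```python
def f(x):
    # Illustrative strictly increasing C^1 function (an assumption for this sketch).
    return x**3 + x

def f_prime(x):
    return 3 * x**2 + 1

def f_inverse(y, lo=-10.0, hi=10.0, iters=200):
    """Invert the strictly increasing f by bisection on [lo, hi]."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if f(mid) < y:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def inverse_derivative_numeric(y, eps=1e-5):
    """Central difference quotient of the inverse function at y."""
    return (f_inverse(y + eps) - f_inverse(y - eps)) / (2 * eps)

def inverse_derivative_formula(y):
    """The value 1 / f'(f^{-1}(y)) predicted by the inverse function theorem."""
    return 1.0 / f_prime(f_inverse(y))

y0 = 2.0  # f(1) = 2, so the inverse sends 2 to 1
print(f_inverse(y0), inverse_derivative_numeric(y0), inverse_derivative_formula(y0))
```

Both derivative values should agree closely, since $f'(1)=4$ gives $(f^{-1})'(2)=1/4$.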

Derivative of a map and chain rule

For functions $f$ defined in the plane or more generally on a Euclidean space $\mathbb{R}^n$, it is necessary to consider functions that are vector-valued or matrix-valued. It is also conceptually helpful to do this in an invariant manner (i.e., a coordinate-free way). Derivatives of such maps at a point are then vectors or linear maps, not real numbers.

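Let $f:X\to Y$ be a map from an open subset $X$ of $\mathbb{R}^n$ to an open subset $Y$ of $\mathbb{R}^m$. Then the map $f$ is said to be differentiable at a point $x$ in $X$ if there exists a (necessarily unique) linear transformation $f'(x):\mathbb{R}^n\to\mathbb{R}^m$, called the derivative of $f$ at $x$, such that

$$\lim_{h\to0}\frac{1}{|h|}|f(x+h)-f(x)-f'(x)h|=0$$

where $f'(x)h$ is the application of the linear transformation $f'(x)$ to $h$. If $f$ is differentiable at $x$, then it is continuous at $x$ since

$$|f(x+h)-f(x)|\le(|h|^{-1}|f(x+h)-f(x)-f'(x)h|)|h|+|f'(x)h|\to0$$

as $h\to0$.

As in the one-variable case, there is the chain rule: let $f$ be as above and $g:Y\to Z$ a map for some open subset $Z$ of $\mathbb{R}^l$. If $f$ is differentiable at $x$ and $g$ is differentiable at $y=f(x)$, then the composition $g\circ f$ is differentiable at $x$ with the derivative

$$(g\circ f)'(x)=g'(y)\circ f'(x).$$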

This is proved exactly as for functions in one variable. Indeed, with the notation $\widetilde{h}=f(x+h)-f(x)$, we have:

$$\begin{align} &\frac{1}{|h|}|g(f(x+h))-g(y)-g'(y)f'(x)h|\\ &\le\frac{1}{|h|}|g(y+\widetilde{h})-g(y)-g'(y)\widetilde{h}|+\frac{1}{|h|}|g'(y)(f(x+h)-f(x)-f'(x)h)|. \end{align}$$

Here, since $f$ is differentiable at $x$, the second term on the right goes to zero as $h\to0$. As for the first term, it can be written as:

$$\begin{cases}\dfrac{|\widetilde{h}|}{|h|}\,|g(y+\widetilde{h})-g(y)-g'(y)\widetilde{h}|/|\widetilde{h}|,&\widetilde{h}\ne0,\\ 0,&\widetilde{h}=0. \end{cases}$$

Now, by the argument showing the continuity of $f$ at $x$, we see $\frac{|\widetilde{h}|}{|h|}$ is bounded. Also, $\widetilde{h}\to0$ as $h\to0$ since $f$ is continuous at $x$. Hence, the first term also goes to zero as $h\to0$ by the differentiability of $g$ at $y$. $\square$

The map $f$ as above is called continuously differentiable or $C^1$ if it is differentiable on the domain and also the derivatives vary continuously; i.e., $x\mapsto f'(x)$ is continuous.

As a linear transformation, $f'(x)$ is represented by an $m\times n$-matrix, called the Jacobian matrix $Jf(x)$ of $f$ at $x$, and we write it as:

$$(Jf)(x)=\begin{bmatrix}\dfrac{\partial f_1}{\partial x_1}(x)&\cdots&\dfrac{\partial f_1}{\partial x_n}(x)\\ \vdots&\ddots&\vdots\\ \dfrac{\partial f_m}{\partial x_1}(x)&\cdots&\dfrac{\partial f_m}{\partial x_n}(x)\end{bmatrix}.$$

Taking the increment in the definition to be $he_j$, with $h$ a real number and $e_j=(0,\dots,1,\dots,0)$ the $j$-th standard basis element, we see that the differentiability of $f$ at $x$ implies:

$$\lim_{h\to0}\frac{f_i(x+he_j)-f_i(x)}{h}=\frac{\partial f_i}{\partial x_j}(x)$$

where $f_i$ denotes the $i$-th component of $f$. That is, each component of $f$ is differentiable at $x$ in each variable with the derivative $\frac{\partial f_i}{\partial x_j}(x)$. In terms of Jacobian matrices, the chain rule says $J(g\circ f)(x)=Jg(y)Jf(x)$; i.e., as $(g\circ f)_i=g_i\circ f$,

$$\frac{\partial(g_i\circ f)}{\partial x_j}(x)=\frac{\partial g_i}{\partial y_1}(y)\frac{\partial f_1}{\partial x_j}(x)+\cdots+\frac{\partial g_i}{\partial y_m}(y)\frac{\partial f_m}{\partial x_j}(x),$$

which is the form of the chain rule that is often stated.
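The matrix form of the chain rule, $J(g\circ f)(x)=Jg(y)Jf(x)$, can be illustrated with finite differences. This is only a sketch under stated assumptions: the maps `f` and `g` are arbitrary smooth examples chosen for illustration, and `jacobian` and `matmul` are hypothetical helpers using central differences rather than exact derivatives.

```python
import math

def f(x):
    # Illustrative map f: R^2 -> R^3 (an assumption, not from the text).
    x1, x2 = x
    return [x1 * x2, math.sin(x1), x1 + x2**2]

def g(y):
    # Illustrative map g: R^3 -> R^2.
    y1, y2, y3 = y
    return [y1 + y2 * y3, math.exp(y1) - y3]

def jacobian(func, x, eps=1e-6):
    """Central-difference approximation of the Jacobian matrix of func at x."""
    fx = func(x)
    m, n = len(fx), len(x)
    J = [[0.0] * n for _ in range(m)]
    for j in range(n):
        xp, xm = list(x), list(x)
        xp[j] += eps
        xm[j] -= eps
        fp, fm = func(xp), func(xm)
        for i in range(m):
            J[i][j] = (fp[i] - fm[i]) / (2 * eps)
    return J

def matmul(A, B):
    """Plain matrix product of two lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

x = [0.7, -0.3]
y = f(x)
lhs = jacobian(lambda t: g(f(t)), x)          # J(g o f)(x)
rhs = matmul(jacobian(g, y), jacobian(f, x))  # Jg(y) Jf(x)
max_gap = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
print(max_gap)
```

The printed gap should be small, limited only by the finite-difference error.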

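A partial converse to the above holds. Namely, if the partial derivatives $\partial f_i/\partial x_j$ are all defined and continuous, then $f$ is continuously differentiable. This is a consequence of the mean value inequality: given the map $f:X\to Y$ as above and points $x,x+y$ in $X$ such that the line segment between them lies in $X$, if $t\mapsto f(x+ty)$ is continuous on $[0,1]$ and differentiable on $(0,1)$, then, for any vector $v\in\mathbb{R}^m$,

$$|\Delta_yf(x)-v|\le\sup_{0<t<1}\left|\frac{d}{dt}f(x+ty)-v\right|$$

where $\Delta_yf(x)=f(x+y)-f(x)$. (When $f$ is differentiable along the segment, $\frac{d}{dt}f(x+ty)=f'(x+ty)y$.)

(This version of the mean value inequality follows from the mean value inequality for vector-valued functions of one variable, applied to the function $[0,1]\to\mathbb{R}^m$, $t\mapsto f(x+ty)-tv$.)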

Indeed, let $g(x)=(Jf)(x)$. We note that, if $y=y_ie_i$, then

$$\frac{d}{dt}f(x+ty)=\frac{\partial f}{\partial x_i}(x+ty)\,y_i=g(x+ty)(y_ie_i).$$

For simplicity, assume $n=2$ (the argument for the general case is similar). Then, by the mean value inequality, with the operator norm $\|\cdot\|$,

$$\begin{align} &|\Delta_yf(x)-g(x)y|\\ &\le|\Delta_{y_1e_1}f(x_1,x_2+y_2)-g(x)(y_1e_1)|+|\Delta_{y_2e_2}f(x_1,x_2)-g(x)(y_2e_2)|\\ &\le|y_1|\sup_{0<t<1}\|g(x_1+ty_1,x_2+y_2)-g(x)\|+|y_2|\sup_{0<t<1}\|g(x_1,x_2+ty_2)-g(x)\|, \end{align}$$

which, by the continuity of $g$, implies $|\Delta_yf(x)-g(x)y|/|y|\to0$ as $y\to0$, as required. $\square$

Example: Let $U$ be the set of all invertible real square matrices of size $n$. Note $U$ can be identified as an open subset of $\mathbb{R}^{n^2}$ with coordinates $x_{ij},1\le i,j\le n$. Consider the function $f(g)=g^{-1}$ = the inverse matrix of $g$ defined on $U$. To guess its derivatives, assume $f$ is differentiable and consider the curve $c(t)=ge^{tg^{-1}h}$ where $e^A$ means the matrix exponential of $A$. By the chain rule applied to $f(c(t))=e^{-tg^{-1}h}g^{-1}$, we have:

$$f'(c(t))\circ c'(t)=-g^{-1}he^{-tg^{-1}h}g^{-1}.$$

Taking $t=0$, we get:

$$f'(g)h=-g^{-1}hg^{-1}.$$

Now, we then have:

$$\|(g+h)^{-1}-g^{-1}+g^{-1}hg^{-1}\|\le\|(g+h)^{-1}\|\|h\|\|g^{-1}hg^{-1}\|.$$

Since the operator norm is equivalent to the Euclidean norm on $\mathbb{R}^{n^2}$ (any norms are equivalent to each other), this implies $f$ is differentiable. Finally, from the formula for $f'$, we see the partial derivatives of $f$ are smooth (infinitely differentiable); whence, $f$ is smooth too.
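The formula $f'(g)h=-g^{-1}hg^{-1}$ can be sanity-checked numerically for $2\times2$ matrices. The sketch below is an illustration only: the matrices `g` and `h` are arbitrary choices, the Frobenius norm stands in for the operator norm (all norms being equivalent here), and the helpers `inv2`, `mul`, `add` and `norm` are hypothetical.

```python
def inv2(g):
    """Inverse of an invertible 2x2 matrix given as a list of rows."""
    (a, b), (c, d) = g
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[A[i][j] + s * B[i][j] for j in range(2)] for i in range(2)]

def norm(A):
    """Frobenius norm, used here in place of the operator norm."""
    return sum(A[i][j] ** 2 for i in range(2) for j in range(2)) ** 0.5

g = [[2.0, 1.0], [0.5, 3.0]]   # arbitrary invertible matrix
h = [[0.3, -0.2], [0.1, 0.4]]  # arbitrary direction
g_inv = inv2(g)
linear = [[-v for v in row] for row in mul(mul(g_inv, h), g_inv)]  # -g^{-1} h g^{-1}

ratios = []
for t in (1e-1, 1e-2, 1e-3):
    th = [[t * v for v in row] for row in h]
    # remainder = (g + th)^{-1} - g^{-1} - f'(g)(th)
    remainder = add(add(inv2(add(g, th)), g_inv, -1.0), linear, -t)
    ratios.append(norm(remainder) / (t * norm(h)))
print(ratios)
```

The ratios of the remainder to the size of the increment should shrink roughly in proportion to `t`, which is what differentiability with this derivative requires.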

Higher derivatives and Taylor formula

If $f:X\to\mathbb{R}^m$ is differentiable where $X\subset\mathbb{R}^n$ is an open subset, then the derivatives determine the map $f':X\to\operatorname{Hom}(\mathbb{R}^n,\mathbb{R}^m)$, where $\operatorname{Hom}$ stands for homomorphisms between vector spaces; i.e., linear maps. If $f'$ is differentiable, then $f'':X\to\operatorname{Hom}(\mathbb{R}^n,\operatorname{Hom}(\mathbb{R}^n,\mathbb{R}^m))$. Here, the codomain of $f''$ can be identified with the space of bilinear maps by:

$$\operatorname{Hom}(\mathbb{R}^n,\operatorname{Hom}(\mathbb{R}^n,\mathbb{R}^m))\overset{\varphi}{\underset{\sim}{\to}}\{(\mathbb{R}^n)^2\to\mathbb{R}^m\text{ bilinear}\}$$

where $\varphi(g)(x,y)=g(x)y$ and $\varphi$ is bijective with the inverse $\psi$ given by $(\psi(g)x)y=g(x,y)$. In general, $f^{(k)}=(f^{(k-1)})'$ is a map from $X$ to the space of $k$-multilinear maps $(\mathbb{R}^n)^k\to\mathbb{R}^m$.

Just as $f'(x)$ is represented by a matrix (Jacobian matrix), when $m=1$ (a bilinear map is a bilinear form), the bilinear form $f''(x)$ is represented by a matrix called the Hessian matrix of $f$ at $x$; namely, the square matrix $H$ of size $n$ such that $f''(x)(y,z)=(Hy,z)$, where the pairing refers to an inner product of $\mathbb{R}^n$, and $H$ is none other than the Jacobian matrix of $f':X\to(\mathbb{R}^n)^*\simeq\mathbb{R}^n$. The $(i,j)$-th entry of $H$ is thus given explicitly as

$$H_{ij}=\frac{\partial^2f}{\partial x_i\partial x_j}(x).$$

Moreover, if $f''$ exists and is continuous, then the matrix $H$ is symmetric, the fact known as the symmetry of second derivatives. This is seen using the mean value inequality. For vectors $u,v$ in $\mathbb{R}^n$, using the mean value inequality twice, we have:

$$|\Delta_v\Delta_uf(x)-f''(x)(u,v)|\le\sup_{0<t_1,t_2<1}|f''(x+t_1u+t_2v)(u,v)-f''(x)(u,v)|,$$

which says

$$f''(x)(u,v)=\lim_{s,t\to0}\Delta_{tv}\Delta_{su}f(x)/(st).$$

Since the right-hand side is symmetric in $u,v$, so is the left-hand side: $f''(x)(u,v)=f''(x)(v,u)$. By induction, if $f$ is $C^k$, then the $k$-multilinear map $f^{(k)}(x)$ is symmetric; i.e., the order of taking partial derivatives does not matter.
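The limit formula for $f''(x)(u,v)$ in terms of iterated differences can be tried out numerically. The following sketch assumes an arbitrary smooth test function `f` of two variables (not from the text) and compares the second difference quotient with the bilinear form built from its hand-computed second partials; `delta`, `second_difference_quotient` and `hessian_form` are hypothetical names.

```python
import math

def f(x, y):
    # Illustrative smooth function (an assumption for this sketch).
    return math.exp(x * y) + x**3 * math.sin(y)

def delta(func, step):
    """Difference operator: (Delta_step func)(p) = func(p + step) - func(p)."""
    return lambda x, y: func(x + step[0], y + step[1]) - func(x, y)

def second_difference_quotient(u, v, p, s=1e-4, t=1e-4):
    """Approximates f''(p)(u, v) by Delta_{tv} Delta_{su} f(p) / (s t)."""
    su = (s * u[0], s * u[1])
    tv = (t * v[0], t * v[1])
    return delta(delta(f, su), tv)(*p) / (s * t)

def hessian_form(u, v, p):
    """f''(p)(u, v) from the second partials of this particular f."""
    x, y = p
    fxx = y * y * math.exp(x * y) + 6 * x * math.sin(y)
    fxy = (1 + x * y) * math.exp(x * y) + 3 * x**2 * math.cos(y)
    fyy = x * x * math.exp(x * y) - x**3 * math.sin(y)
    return fxx * u[0] * v[0] + fxy * (u[0] * v[1] + u[1] * v[0]) + fyy * u[1] * v[1]

p, u, v = (0.5, 0.8), (1.0, 2.0), (-1.0, 0.5)
quot_uv = second_difference_quotient(u, v, p)
quot_vu = second_difference_quotient(v, u, p)
exact = hessian_form(u, v, p)
print(quot_uv, quot_vu, exact)
```

Swapping `u` and `v` should leave the quotient essentially unchanged, in line with the symmetry of $f''(x)$.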

As in the case of one variable, the Taylor series expansion can then be proved by integration by parts; for a function $f$ of two variables $(x,y)$ it reads:

$$f(z+(h,k))=\sum_{a+b<n}\partial_x^a\partial_y^bf(z)\frac{h^ak^b}{a!\,b!}+n\int_0^1(1-t)^{n-1}\sum_{a+b=n}\partial_x^a\partial_y^bf(z+t(h,k))\frac{h^ak^b}{a!\,b!}\,dt.$$

Taylor's formula has an effect of dividing a function by variables, which can be illustrated by the next typical theoretical use of the formula.

Example: Let $T:\mathcal{S}\to\mathcal{S}$ be a linear map between the vector space $\mathcal{S}$ of smooth functions on $\mathbb{R}^n$ with rapidly decreasing derivatives; i.e., $\sup|x^\beta\partial^\alpha\varphi|<\infty$ for any multi-indices $\alpha,\beta$. (The space $\mathcal{S}$ is called a Schwartz space.) For each $\varphi$ in $\mathcal{S}$, Taylor's formula implies we can write:

$$\varphi-\psi\varphi(y)=\sum_{j=1}^n(x_j-y_j)\varphi_j$$

with $\varphi_j\in\mathcal{S}$, where $\psi$ is a smooth function with compact support and $\psi(y)=1$. Now, assume $T$ commutes with coordinates; i.e., $T(x_j\varphi)=x_jT\varphi$. Then

$$T\varphi-\varphi(y)T\psi=\sum_{j=1}^n(x_j-y_j)T\varphi_j.$$

Evaluating the above at $y$, we get $T\varphi(y)=\varphi(y)T\psi(y)$. In other words, $T$ is a multiplication by some function $m$; i.e., $T\varphi=m\varphi$. Now, assume further that $T$ commutes with partial differentiations. We then easily see that $m$ is a constant; $T$ is a multiplication by a constant.

(Aside: the above discussion almost proves the Fourier inversion formula. Indeed, let $F,R:\mathcal{S}\to\mathcal{S}$ be the Fourier transform and the reflection; i.e., $(R\varphi)(x)=\varphi(-x)$. Then, dealing directly with the integral that is involved, one can see $T=RF^2$ commutes with coordinates and partial differentiations; hence, $T$ is a multiplication by a constant. This is almost a proof since one still has to compute this constant.)

A partial converse to the Taylor formula also holds; see Borel's lemma and Whitney extension theorem.

Inverse function theorem and submersion theorem

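The inverse function theorem states: let $f:X\to Y$ be a $C^k$ map ($k\ge1$) between open subsets $X,Y$ of $\mathbb{R}^n$. If $f'(x)$ is bijective at a point $x$, then there exist neighborhoods $U,V$ of $x,f(x)$ such that $f:U\to V$ is bijective and the inverse $f^{-1}:V\to U$ is also $C^k$.

A $C^k$-map with a $C^k$-inverse is called a $C^k$-diffeomorphism. Thus, the theorem says that, for a map $f$ satisfying the hypothesis at a point $x$, $f$ is a diffeomorphism near $x,f(x)$.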

The implicit function theorem says: given a map $f:\mathbb{R}^n\times\mathbb{R}^m\to\mathbb{R}^m$, if $f(a,b)=0$, $f$ is $C^k$ in a neighborhood of $(a,b)$ and the derivative of $y\mapsto f(a,y)$ at $b$ is invertible, then there exists a differentiable map $g:U\to V$ for some neighborhoods $U,V$ of $a,b$ such that $f(x,g(x))=0$. The theorem follows from the inverse function theorem.
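The implicit function $g$ can be computed in simple cases. As a hedged illustration, take $f(x,y)=x^2+y^2-1$ near $(a,b)=(0,1)$ (an example chosen here, not from the text), where $\partial f/\partial y=2y$ is invertible; the sketch solves $f(x,y)=0$ for $y$ by Newton's method, with `implicit_g` a hypothetical helper.

```python
import math

def f(x, y):
    # Illustrative constraint: the unit circle (an assumption for this sketch).
    return x * x + y * y - 1.0

def df_dy(x, y):
    # Derivative of y -> f(x, y); nonzero near y = 1.
    return 2.0 * y

def implicit_g(x, y_start=1.0, iters=50):
    """Solve f(x, y) = 0 for y near y_start by Newton's method in y."""
    y = y_start
    for _ in range(iters):
        y -= f(x, y) / df_dy(x, y)
    return y

xs = [-0.5, -0.1, 0.0, 0.3, 0.6]
values = [implicit_g(x) for x in xs]
for x, y in zip(xs, values):
    # Compare with the explicit branch y = sqrt(1 - x^2) through (0, 1).
    print(x, y, math.sqrt(1.0 - x * x))
```

Here the explicit formula $g(x)=\sqrt{1-x^2}$ is available for comparison; the theorem guarantees such a $g$ exists even when no closed form does.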

Another consequence is the submersion theorem.

Integrable functions on Euclidean spaces

A partition of an interval $[a,b]$ is a finite sequence $a=t_0\le t_1\le\cdots\le t_k=b$. A partition $P$ of a rectangle $D$ (product of intervals) in $\mathbb{R}^n$ then consists of partitions of the sides of $D$; i.e., if $D=\prod_1^n[a_i,b_i]$, then $P$ consists of $P_1,\dots,P_n$ such that $P_i$ is a partition of $[a_i,b_i]$.

Given a function $f$ on $D$, we then define the upper Riemann sum of it as:

$$U(f,P)=\sum_Q(\sup_Qf)\operatorname{vol}(Q)$$

where $Q$ is a partition element of $P$; i.e., $Q=\prod_{i=1}^n[t_{i,j_i},t_{i,j_i+1}]$ when $P_i:a_i=t_{i,0}\le\cdots\le t_{i,k_i}=b_i$ is a partition of $[a_i,b_i]$. The volume $\operatorname{vol}(Q)$ is the usual Euclidean volume; i.e., $\operatorname{vol}(Q)=\prod_1^n(t_{i,j_i+1}-t_{i,j_i})$.

The lower Riemann sum $L(f,P)$ is then defined by replacing $\sup$ by $\inf$. Finally, the function $f$ is called integrable if it is bounded and $\sup\{L(f,P)\mid P\}=\inf\{U(f,P)\mid P\}$. In that case, the common value is denoted as $\int_Df\,dx$.
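The upper and lower Riemann sums can be computed directly in simple cases. The sketch below makes a simplifying assumption that is not part of the definition: the function is nondecreasing in each variable, so the supremum and infimum over each partition element are attained at its corners. The function `riemann_sums` and the test function $f(x,y)=xy$ are illustrative choices.

```python
def riemann_sums(f, a, b, k):
    """Upper and lower sums of f on [a[0], b[0]] x [a[1], b[1]] using k equal
    subintervals per side. Assumes f is nondecreasing in each variable, so
    sup/inf over a cell are the values at its upper/lower corner."""
    hx = (b[0] - a[0]) / k
    hy = (b[1] - a[1]) / k
    upper = lower = 0.0
    for i in range(k):
        for j in range(k):
            x0, y0 = a[0] + i * hx, a[1] + j * hy
            vol = hx * hy
            lower += f(x0, y0) * vol            # inf over the cell
            upper += f(x0 + hx, y0 + hy) * vol  # sup over the cell
    return upper, lower

f = lambda x, y: x * y
for k in (4, 16, 64):
    print(k, riemann_sums(f, (0.0, 0.0), (1.0, 1.0), k))
```

As the partition is refined, the two sums squeeze the integral $\int_D xy\,dx=1/4$ from above and below.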

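A subset of $\mathbb{R}^n$ is said to have measure zero if for each $\epsilon>0$, there are some possibly infinitely many rectangles $D_1,D_2,\dots,$ whose union contains the set and $\sum_i\operatorname{vol}(D_i)<\epsilon.$

A key theorem is that a bounded function $f$ on a closed rectangle is integrable if and only if the set $\{x\mid f\text{ is not continuous at }x\}$ has measure zero.

The next theorem (Fubini's theorem) allows us to compute the integral of a function as the iteration of the integrals of the function in one variable: if $f$ is a continuous function on a closed rectangle $D=\prod[a_i,b_i]$, then

$$\int_Df\,dx=\int_{a_n}^{b_n}\left(\cdots\left(\int_{a_1}^{b_1}f(x_1,\dots,x_n)\,dx_1\right)\cdots\right)dx_n.$$

In particular, the order of integrations can be changed.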

Finally, if $M\subset\mathbb{R}^n$ is a bounded open subset and $f$ a function on $M$, then we define $\int_Mf\,dx:=\int_D\chi_Mf\,dx$ where $D$ is a closed rectangle containing $M$ and $\chi_M$ is the characteristic function on $M$; i.e., $\chi_M(x)=1$ if $x\in M$ and $\chi_M(x)=0$ if $x\not\in M$, provided $\chi_Mf$ is integrable.

Surface integral

If a bounded surface $M$ in $\mathbb{R}^3$ is parametrized by $\mathbf{r}=\mathbf{r}(u,v)$ with domain $D$, then the surface integral of a measurable function $F$ on $M$ is defined and denoted as:

$$\int_MF\,dS:=\iint_D(F\circ\mathbf{r})|\mathbf{r}_u\times\mathbf{r}_v|\,du\,dv.$$

If $F:M\to\mathbb{R}^3$ is vector-valued, then we define

$$\int_MF\cdot dS:=\int_M(F\cdot\mathbf{n})\,dS$$

where $\mathbf{n}$ is an outward unit normal vector to $M$. Since $\mathbf{n}=\frac{\mathbf{r}_u\times\mathbf{r}_v}{|\mathbf{r}_u\times\mathbf{r}_v|}$, we have:

$$\int_MF\cdot dS=\iint_D(F\circ\mathbf{r})\cdot(\mathbf{r}_u\times\mathbf{r}_v)\,du\,dv=\iint_D\det(F\circ\mathbf{r},\mathbf{r}_u,\mathbf{r}_v)\,du\,dv.$$
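The determinant form of the flux integral lends itself to direct numerical evaluation. As a sketch under stated assumptions, take $M$ to be the unit sphere with the spherical parametrization below and $F(x,y,z)=(x,y,z)$ (both illustrative choices, not from the text); the double integral is approximated by the midpoint rule, and `det3` and `flux` are hypothetical helpers.

```python
import math

def r(u, v):
    """Spherical parametrization of the unit sphere, 0 <= u <= pi, 0 <= v <= 2 pi."""
    return (math.sin(u) * math.cos(v), math.sin(u) * math.sin(v), math.cos(u))

def r_u(u, v):
    return (math.cos(u) * math.cos(v), math.cos(u) * math.sin(v), -math.sin(u))

def r_v(u, v):
    return (-math.sin(u) * math.sin(v), math.sin(u) * math.cos(v), 0.0)

def det3(a, b, c):
    """Determinant of the 3x3 matrix with columns a, b, c (equals a . (b x c))."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

def flux(F, n=200):
    """Midpoint-rule approximation of the integral of det(F o r, r_u, r_v) du dv."""
    du, dv = math.pi / n, 2 * math.pi / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            u, v = (i + 0.5) * du, (j + 0.5) * dv
            total += det3(F(*r(u, v)), r_u(u, v), r_v(u, v)) * du * dv
    return total

F = lambda x, y, z: (x, y, z)
print(flux(F), 4 * math.pi)
```

For this parametrization $\mathbf{r}_u\times\mathbf{r}_v=\sin u\,\mathbf{r}$ points outward, so the flux of the radial field equals the surface area $4\pi$.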

Vector analysis

Tangent vectors and vector fields

Let $c:[0,1]\to\mathbb{R}^n$ be a differentiable curve. Then the tangent vector to the curve $c$ at $t$ is a vector $v$ at the point $c(t)$ whose components are given as:

$$v=(c_1'(t),\dots,c_n'(t)).$$

For example, if $c(t)=(a\cos(t),a\sin(t),bt),a>0,b>0$ is a helix, then the tangent vector at $t$ is:

$$c'(t)=(-a\sin(t),a\cos(t),b).$$

It corresponds to the intuition that a point on the helix moves up at a constant speed.

If $M\subset\mathbb{R}^n$ is a differentiable curve or surface, then the tangent space to $M$ at a point $p$ is the set of all tangent vectors to the differentiable curves $c:[0,1]\to M$ with $c(0)=p$.

A vector field $X$ is an assignment to each point $p$ in $M$ a tangent vector $X_p$ to $M$ at $p$ such that the assignment varies smoothly.

Differential forms

The dual notion of a vector field is a differential form. Given an open subset $M$ in $\mathbb{R}^n$, by definition, a differential 1-form (often just 1-form) $\omega$ is an assignment to a point $p$ in $M$ a linear functional $\omega_p$ on the tangent space $T_pM$ to $M$ at $p$ such that the assignment varies smoothly. For a (real or complex-valued) smooth function $f$, define the 1-form $df$ by: for a tangent vector $v$ at $p$,

$$df_p(v)=v(f)$$

where $v(f)$ denotes the directional derivative of $f$ in the direction $v$ at $p$. For example, if $x_i$ is the $i$-th coordinate function, then $dx_{i,p}(v)=v_i$; i.e., $dx_{i,p}$ are the dual basis to the standard basis on $T_pM$. Then every differential 1-form $\omega$ can be written uniquely as

$$\omega=f_1\,dx_1+\cdots+f_n\,dx_n$$

for some smooth functions $f_1,\dots,f_n$ on $M$ (since, for every point $p$, the linear functional $\omega_p$ is a unique linear combination of $dx_i$ over real numbers). More generally, a differential $k$-form is an assignment to a point $p$ in $M$ a vector $\omega_p$ in the $k$-th exterior power $\bigwedge^kT_p^*M$ of the dual space $T_p^*M$ of $T_pM$ such that the assignment varies smoothly. In particular, a 0-form is the same as a smooth function. Also, any $k$-form $\omega$ can be written uniquely as:

$$\omega=\sum_{i_1<\cdots<i_k}f_{i_1\dots i_k}\,dx_{i_1}\wedge\cdots\wedge dx_{i_k}$$

for some smooth functions $f_{i_1\dots i_k}$.
Like a smooth function, we can differentiate and integrate differential forms. If $f$ is a smooth function, then $df$ can be written as:

$$df=\sum_{i=1}^n\frac{\partial f}{\partial x_i}\,dx_i$$

since, for $v=\partial/\partial x_j|_p$, we have:

$$df_p(v)=\frac{\partial f}{\partial x_j}(p)=\sum_{i=1}^n\frac{\partial f}{\partial x_i}(p)\,dx_i(v).$$

Note that, in the above expression, the left-hand side (whence the right-hand side) is independent of coordinates $x_1,\dots,x_n$; this property is called the invariance of differential.

The operation $d$ is called the exterior derivative and it extends to any differential forms inductively by the requirement (Leibniz rule)

$$d(\alpha\wedge\beta)=d\alpha\wedge\beta+(-1)^p\alpha\wedge d\beta,$$

where $\alpha,\beta$ are a $p$-form and a $q$-form.

The exterior derivative has the important property that $d\circ d=0$; that is, the exterior derivative $d$ of a differential form $d\omega$ is zero. This property is a consequence of the symmetry of second derivatives (mixed partials are equal).

Boundary and orientation

A circle can be oriented clockwise or counterclockwise. Mathematically, we say that a subset $M$ of $\mathbb{R}^n$ is oriented if there is a consistent choice of normal vectors to $M$ that varies continuously. For example, a circle or, more generally, an $n$-sphere can be oriented; i.e., is orientable. On the other hand, a Möbius strip (a surface obtained by identifying two opposite sides of a rectangle in a twisted way) cannot be oriented: if we start with a normal vector and travel around the strip, the normal vector at the end will point in the opposite direction.

A related proposition states that a bounded differentiable region $M$ in $\mathbb{R}^n$ of dimension $k$ is oriented if and only if there exists a nowhere-vanishing $k$-form on $M$ (called a volume form). The proposition is useful because it allows us to give an orientation by giving a volume form.

Integration of differential forms

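If $\omega=f\,dx_1\wedge\cdots\wedge dx_n$ is a differential $n$-form on an open subset $M$ in $\mathbb{R}^n$ (any $n$-form is of that form), then the integration of it over $M$ with the standard orientation is defined as:

$$\int_M\omega=\int_Mf\,dx_1\cdots dx_n.$$

If $M$ is given the orientation opposite to the standard one, then $\int_M\omega$ is defined as the negative of the right-hand side.

Then we have the fundamental formula relating exterior derivative and integration (Stokes' formula): for a bounded oriented region $M$ in $\mathbb{R}^n$ of dimension $k$ whose boundary $\partial M$ is piecewise $C^1$, and a differential $(k-1)$-form $\omega$,

$$\int_{\partial M}\omega=\int_Md\omega.$$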

Here is a sketch of proof of the formula. If $f$ is a smooth function on $\mathbb{R}^n$ with compact support, then we have:

$$\int d(f\omega)=0$$

(since, by the fundamental theorem of calculus, the above can be evaluated on boundaries of the set containing the support). On the other hand,

$$\int d(f\omega)=\int df\wedge\omega+\int f\,d\omega.$$

Let $f$ approach the characteristic function on $M$. Then the second term on the right goes to $\int_Md\omega$ while the first goes to $-\int_{\partial M}\omega$, by the argument similar to proving the fundamental theorem of calculus. $\square$

The formula generalizes the fundamental theorem of calculus as well as Stokes' theorem in multivariable calculus. Indeed, if $M=[a,b]$ is an interval and $\omega=f$, then $d\omega=f'\,dx$ and the formula says:

$$\int_Mf'\,dx=f(b)-f(a).$$

Similarly, if $M$ is an oriented bounded surface in $\mathbb{R}^3$ and $\omega=f\,dx+g\,dy+h\,dz$, then

$$d(f\,dx)=df\wedge dx=\frac{\partial f}{\partial y}\,dy\wedge dx+\frac{\partial f}{\partial z}\,dz\wedge dx$$

and similarly for $d(g\,dy)$ and $d(h\,dz)$. Collecting the terms, we thus get:

$$d\omega=\left(\frac{\partial h}{\partial y}-\frac{\partial g}{\partial z}\right)dy\wedge dz+\left(\frac{\partial f}{\partial z}-\frac{\partial h}{\partial x}\right)dz\wedge dx+\left(\frac{\partial g}{\partial x}-\frac{\partial f}{\partial y}\right)dx\wedge dy.$$

Then, from the definition of the integration of $d\omega$, we have

$$\int_Md\omega=\int_M(\nabla\times F)\cdot dS$$

where $F=(f,g,h)$ is the vector-valued function and $\nabla=\left(\frac{\partial}{\partial x},\frac{\partial}{\partial y},\frac{\partial}{\partial z}\right)$. Hence, Stokes' formula becomes

$$\int_M(\nabla\times F)\cdot dS=\int_{\partial M}(f\,dx+g\,dy+h\,dz),$$

which is the usual form of Stokes' theorem on surfaces. Green's theorem is also a special case of Stokes' formula.

Stokes' formula also yields a general version of Cauchy's integral formula. To state and prove it, for the complex variable $z=x+iy$ and the conjugate $\bar{z}$, let us introduce the operators

$$\frac{\partial}{\partial z}=\frac{1}{2}\left(\frac{\partial}{\partial x}-i\frac{\partial}{\partial y}\right),\quad\frac{\partial}{\partial\bar{z}}=\frac{1}{2}\left(\frac{\partial}{\partial x}+i\frac{\partial}{\partial y}\right).$$

In these notations, a function $f$ is holomorphic (complex-analytic) if and only if $\frac{\partial f}{\partial\bar{z}}=0$ (the Cauchy–Riemann equations). Also, we have:

$$df=\frac{\partial f}{\partial z}\,dz+\frac{\partial f}{\partial\bar{z}}\,d\bar{z}.$$

Let $D_\epsilon=\{z\in\mathbb{C}\mid\epsilon<|z-z_0|<r\}$ be a punctured disk with center $z_0$. Since $1/(z-z_0)$ is holomorphic on $D_\epsilon$, we have:

$$d\left(\frac{f}{z-z_0}\,dz\right)=\frac{\partial f}{\partial\bar{z}}\frac{d\bar{z}\wedge dz}{z-z_0}.$$

By Stokes' formula,

$$\int_{D_\epsilon}\frac{\partial f}{\partial\bar{z}}\frac{d\bar{z}\wedge dz}{z-z_0}=\left(\int_{|z-z_0|=r}-\int_{|z-z_0|=\epsilon}\right)\frac{f}{z-z_0}\,dz.$$

Letting $\epsilon\to0$ we then get:^[1]

$$2\pi i\,f(z_0)=\int_{|z-z_0|=r}\frac{f}{z-z_0}\,dz+\int_{|z-z_0|\le r}\frac{\partial f}{\partial\bar{z}}\frac{dz\wedge d\bar{z}}{z-z_0}.$$

Winding numbers and Poincaré lemma

A differential form $\omega$ is called closed if $d\omega=0$ and is called exact if $\omega=d\eta$ for some differential form $\eta$ (often called a potential). Since $d\circ d=0$, an exact form is closed. But the converse does not hold in general; there might be a non-exact closed form. A classic example of such a form is:

$$\omega=\frac{-y}{x^2+y^2}\,dx+\frac{x}{x^2+y^2}\,dy,$$

which is a differential form on $\mathbb{R}^2-0$. Suppose we switch to polar coordinates: $x=r\cos\theta,y=r\sin\theta$ where $r=\sqrt{x^2+y^2}$. Then

$$\omega=r^{-2}(-r\sin\theta\,dx+r\cos\theta\,dy)=d\theta.$$

This does not show that $\omega$ is exact: the trouble is that $\theta$ is not a well-defined continuous function on $\mathbb{R}^2-0$. Since any function $f$ on $\mathbb{R}^2-0$ with $df=\omega$ differs from $\theta$ by a constant, this means that $\omega$ is not exact. The calculation, however, shows that $\omega$ is exact, for example, on $\mathbb{R}^2-\{x=0\}$ since we can take $\theta=\arctan(y/x)$ there.
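The failure of exactness can be seen numerically: the integral of an exact form around a closed loop vanishes, whereas the integral of $\omega$ around a circle enclosing the origin does not. The following sketch approximates both line integrals by the midpoint rule; `omega` and `loop_integral` are hypothetical helper names, and the two circles are illustrative choices.

```python
import math

def omega(x, y, dx, dy):
    """Evaluate omega = (-y dx + x dy) / (x^2 + y^2) on the tangent vector (dx, dy)."""
    r2 = x * x + y * y
    return (-y * dx + x * dy) / r2

def loop_integral(cx, cy, radius, n=2000):
    """Midpoint-rule integral of omega over the circle centred at (cx, cy)."""
    total = 0.0
    dt = 2 * math.pi / n
    for k in range(n):
        t = (k + 0.5) * dt
        x, y = cx + radius * math.cos(t), cy + radius * math.sin(t)
        dx, dy = -radius * math.sin(t) * dt, radius * math.cos(t) * dt
        total += omega(x, y, dx, dy)
    return total

around_origin = loop_integral(0.0, 0.0, 1.0)     # loop winds once around 0
away_from_origin = loop_integral(3.0, 0.0, 1.0)  # loop does not enclose 0
print(around_origin, away_from_origin)
```

The first value is $2\pi$ (the total change of $\theta$ around the origin), while the second is zero, consistent with $\omega$ being exact on regions where a continuous $\theta$ exists.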

There is a result (Poincaré lemma) that gives a condition that guarantees closed forms are exact. To state it, we need some notions from topology. Given two continuous maps $f,g:X\to Y$ between subsets of $\mathbb{R}^m,\mathbb{R}^n$ (or more generally topological spaces), a homotopy from $f$ to $g$ is a continuous function $H:X\times[0,1]\to Y$ such that $f(x)=H(x,0)$ and $g(x)=H(x,1)$. Intuitively, a homotopy is a continuous variation of one function to another. A loop in a set $X$ is a curve whose starting point coincides with the end point; i.e., $c:[0,1]\to X$ such that $c(0)=c(1)$. Then a subset of $\mathbb{R}^n$ is called simply connected if every loop is homotopic to a constant function. A typical example of a simply connected set is a disk $D=\{(x,y)\mid\sqrt{x^2+y^2}\le r\}\subset\mathbb{R}^2$. Indeed, given a loop $c:[0,1]\to D$, we have the homotopy $H:[0,1]^2\to D,H(x,t)=(1-t)c(x)+tc(0)$ from $c$ to the constant function $c(0)$. A punctured disk, on the other hand, is not simply connected.

The Poincaré lemma then says that, if $M$ is a simply connected open subset of $\mathbb{R}^n$, each closed 1-form on $M$ is exact.

Geometry of curves and surfaces

Moving frame

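Vector fields $E_1,E_2,E_3$ on $\mathbb{R}^3$ are called a frame field if they are orthonormal at each point; i.e., $E_i\cdot E_j=\delta_{ij}$ at each point. The basic example is the standard frame $U_i$; i.e., $U_i(x)$ is a standard basis for each point $x$ in $\mathbb{R}^3$. Another example is the cylindrical frame

$$E_1=\cos\theta\,U_1+\sin\theta\,U_2,\quad E_2=-\sin\theta\,U_1+\cos\theta\,U_2,\quad E_3=U_3.$$

For the study of the geometry of a curve, the important frame to use is a Frenet frame $T,N,B$ on a unit-speed curve $\beta:I\to\mathbb{R}^3$, given (where $\beta''\ne0$) as:

$$T=\beta',\quad N=T'/|T'|,\quad B=T\times N.$$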

The Gauss–Bonnet theorem

The Gauss–Bonnet theorem relates the topology of a surface and its geometry.

Calculus of variations

Method of Lagrange multiplier

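The method of Lagrange multipliers states: let $g:U\to\mathbb{R}^r$ be a differentiable function from an open subset $U$ of $\mathbb{R}^n$ such that $g'$ has rank $r$ at every point in $g^{-1}(0)$. For a differentiable function $f:\mathbb{R}^n\to\mathbb{R}$, if $f$ attains either a maximum or minimum on $g^{-1}(0)$ at a point $p$, then there exist real numbers $\lambda_1,\dots,\lambda_r$ such that $\nabla f(p)=\sum_{i=1}^r\lambda_i\nabla g_i(p)$. The set $g^{-1}(0)$ is usually called a constraint.

Example: Suppose we want to find the minimum distance between the circle $x^2+y^2=1$ and the line $x+y=4$. That means that we want to minimize the function $f(x,y,u,v)=(x-u)^2+(y-v)^2$, the square distance between a point $(x,y)$ on the circle and a point $(u,v)$ on the line, under the constraint $g=(x^2+y^2-1,u+v-4)$. We have:

$$\nabla f=(2(x-u),2(y-v),-2(x-u),-2(y-v)),$$

$$\nabla g_1=(2x,2y,0,0),\quad\nabla g_2=(0,0,1,1).$$

Since the Jacobian matrix of $g$ has rank 2 everywhere on $g^{-1}(0)$, the Lagrange multiplier gives:

$$x-u=\lambda_1x,\quad y-v=\lambda_1y,\quad2(x-u)=-\lambda_2,\quad2(y-v)=-\lambda_2.$$

If $\lambda_1=0$, then $x=u,y=v$, not possible. Thus, $\lambda_1\ne0$ and

$$x=\frac{x-u}{\lambda_1},\quad y=\frac{y-v}{\lambda_1}.$$

From this, it easily follows that $x=y=1/\sqrt{2}$ and $u=v=2$. Hence, the minimum distance is $2\sqrt{2}-1$ (as a minimum distance clearly exists).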
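The value $2\sqrt{2}-1$ can be cross-checked by brute force. The sketch below parametrizes the circle by an angle and the line by $u=s$, $v=4-s$, then searches a grid; the grid sizes and the restriction to $0\le s\le4$ are assumptions made for this illustration, and `squared_distance` is a hypothetical helper.

```python
import math

def squared_distance(theta, s):
    x, y = math.cos(theta), math.sin(theta)  # point on the circle x^2 + y^2 = 1
    u, v = s, 4.0 - s                        # point on the line x + y = 4
    return (x - u) ** 2 + (y - v) ** 2

# Grid search over 720 angles and 401 line parameters in [0, 4].
best = min(
    (squared_distance(2 * math.pi * i / 720, 4.0 * j / 400), i, j)
    for i in range(720) for j in range(401)
)
min_distance = math.sqrt(best[0])
print(min_distance, 2 * math.sqrt(2) - 1)
```

The grid happens to contain the optimal pair $\theta=\pi/4$, $s=2$, so the search reproduces the value found with Lagrange multipliers up to rounding.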

Here is an application to linear algebra. Let $V$ be a finite-dimensional real vector space with an inner product and $T:V\to V$ a self-adjoint operator. We shall show $V$ has a basis consisting of eigenvectors of $T$ (i.e., $T$ is diagonalizable) by induction on the dimension of $V$. Choosing an orthonormal basis on $V$ we can identify $V=\mathbb{R}^n$ and $T$ is represented by the matrix $[a_{ij}]$. Consider the function $f(x)=(Tx,x)$, where the bracket means the inner product. Then $\nabla f=2(\sum a_{1i}x_i,\dots,\sum a_{ni}x_i)$. On the other hand, for $g=\sum x_i^2-1$, since $g^{-1}(0)$ is compact, $f$ attains a maximum or minimum at a point $u$ in $g^{-1}(0)$. Since $\nabla g=2(x_1,\dots,x_n)$, by Lagrange multiplier, we find a real number $\lambda$ such that $2\sum_ia_{ji}u_i=2\lambda u_j,1\le j\le n.$ But that means $Tu=\lambda u$. By inductive hypothesis, the self-adjoint operator $T:W\to W$, $W$ the orthogonal complement to $u$, has a basis consisting of eigenvectors. Hence, we are done. $\square$

Weak derivatives

Up to measure-zero sets, two functions can be determined to be equal or not by means of integration against other functions (called test functions). Namely, the following is sometimes called the fundamental lemma of calculus of variations: if $f,g$ are locally integrable functions on an open subset $M\subset\mathbb{R}^n$ such that $\int(f-g)\varphi\,dx=0$ for every $\varphi\in C_c^\infty(M)$, then $f=g$ almost everywhere; if, in addition, $f,g$ are continuous, then $f=g$.

Given a continuous function $f$, by the lemma, a continuously differentiable function $u$ is such that $\frac{\partial u}{\partial x_i}=f$ if and only if

$$\int\frac{\partial u}{\partial x_i}\varphi\,dx=\int f\varphi\,dx$$

for every $\varphi\in C_c^\infty(M)$. But, by integration by parts, the partial derivative on the left-hand side of $u$ can be moved to that of $\varphi$; i.e.,

$$-\int u\frac{\partial\varphi}{\partial x_i}\,dx=\int f\varphi\,dx$$

where there is no boundary term since $\varphi$ has compact support. Now the key point is that this expression makes sense even if $u$ is not necessarily differentiable and thus can be used to give sense to a derivative of such a function.

Note each locally integrable function $u$ defines the linear functional $\varphi\mapsto\int u\varphi\,dx$ on $C_c^\infty(M)$ and, moreover, each locally integrable function can be identified with such a linear functional, because of the earlier lemma. Hence, quite generally, if $u$ is a linear functional on $C_c^\infty(M)$, then we define $\frac{\partial u}{\partial x_i}$ to be the linear functional $\varphi\mapsto-\left\langle u,\frac{\partial\varphi}{\partial x_i}\right\rangle$ where the bracket means $\langle\alpha,\varphi\rangle=\alpha(\varphi)$. It is then called the weak derivative of $u$ with respect to $x_i$. If $u$ is continuously differentiable, then the weak derivative of it coincides with the usual one; i.e., the linear functional $\frac{\partial u}{\partial x_i}$ is the same as the linear functional determined by the usual partial derivative of $u$ with respect to $x_i$. A usual derivative is often then called a classical derivative. When a linear functional on $C_c^\infty(M)$ is continuous with respect to a certain topology on $C_c^\infty(M)$, such a linear functional is called a distribution, an example of a generalized function.

A classic example of a weak derivative is that of the Heaviside function $H$, the characteristic function on the interval $(0,\infty)$. For every test function $\varphi$, we have:

$$\langle H',\varphi\rangle=-\int_0^\infty\varphi'\,dx=\varphi(0).$$

Let $\delta_a$ denote the linear functional $\varphi\mapsto\varphi(a)$, called the Dirac delta function (although not exactly a function). Then the above can be written as:

$$H'=\delta_0.$$
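The identity $\langle H',\varphi\rangle=\varphi(0)$ can be checked numerically for a concrete test function. The sketch below assumes the standard bump function $\varphi(x)=e^{-1/(1-x^2)}$ on $(-1,1)$ (an illustrative choice, extended by zero) and approximates the pairing by the midpoint rule; `pairing_H_prime` is a hypothetical helper.

```python
import math

def phi(x):
    """A standard bump function: smooth, supported in [-1, 1]."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def phi_prime(x):
    """Derivative of phi, computed by hand for this particular bump."""
    if abs(x) >= 1.0:
        return 0.0
    return phi(x) * (-2.0 * x) / (1.0 - x * x) ** 2

def pairing_H_prime(n=20000):
    """<H', phi> = -integral over (0, infinity) of phi'(x) dx.
    phi vanishes beyond 1, so the midpoint rule on [0, 1] suffices."""
    dx = 1.0 / n
    return -sum(phi_prime((k + 0.5) * dx) for k in range(n)) * dx

print(pairing_H_prime(), phi(0.0))
```

Both numbers should be close to $e^{-1}$, the value of this bump function at the origin.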

Cauchy's integral formula has a similar interpretation in terms of weak derivatives. For the complex variable $z=x+iy$, let $E_{z_0}(z)=\frac{1}{\pi(z-z_0)}$. For a test function $\varphi$, if the disk $|z-z_0|\le r$ contains the support of $\varphi$, by Cauchy's integral formula, we have:

$$\varphi(z_0)=\frac{1}{2\pi i}\int\frac{\partial\varphi}{\partial\bar{z}}\frac{dz\wedge d\bar{z}}{z-z_0}.$$

Since $dz\wedge d\bar{z}=-2i\,dx\wedge dy$, this means:

$$\varphi(z_0)=-\int E_{z_0}\frac{\partial\varphi}{\partial\bar{z}}\,dx\,dy=\left\langle\frac{\partial E_{z_0}}{\partial\bar{z}},\varphi\right\rangle,$$

or

$$\frac{\partial E_{z_0}}{\partial\bar{z}}=\delta_{z_0}.$$

In general, a generalized function is called a fundamental solution for a linear partial differential operator if the application of the operator to it is the Dirac delta. Hence, the above says $E_{z_0}$ is the fundamental solution for the differential operator $\partial/\partial\bar{z}$.

Hamilton–Jacobi theory

See main article: Hamilton–Jacobi equation.

Calculus on manifolds

Definition of a manifold

This section requires some background in general topology.

A manifold is a Hausdorff topological space that is locally modeled by a Euclidean space. By definition, an atlas of a topological space $M$ is a set of maps $\varphi_i:U_i\to\mathbb{R}^n$, called charts, such that

$U_i$ are an open cover of $M$; i.e., each $U_i$ is open and $M=\cup_iU_i$,
$\varphi_i:U_i\to\varphi_i(U_i)$ is a homeomorphism and
$\varphi_j\circ\varphi_i^{-1}:\varphi_i(U_i\cap U_j)\to\varphi_j(U_i\cap U_j)$ is smooth; thus a diffeomorphism.

By definition, a manifold is a second-countable Hausdorff topological space with a maximal atlas (called a differentiable structure); "maximal" means that it is not contained in a strictly larger atlas. The dimension of the manifold $M$ is the dimension of the model Euclidean space $\mathbb{R}^n$; namely, $n$, and a manifold is called an $n$-manifold when it has dimension $n$. A function $f$ on a manifold $M$ is said to be smooth if $f|_U\circ\varphi^{-1}$ is smooth on $\varphi(U)$ for each chart $\varphi:U\to\mathbb{R}^n$ in the differentiable structure.

A manifold is paracompact; this has an implication that it admits a partition of unity subordinate to a given open cover.

If $\mathbb{R}^n$ is replaced by an upper half-space $\mathbb{H}^n$, then we get the notion of a manifold-with-boundary. The set of points that map to the boundary of $\mathbb{H}^n$ under charts is denoted by $\partial M$ and is called the boundary of $M$. This boundary may not be the topological boundary of $M$. Since the interior of $\mathbb{H}^n$ is diffeomorphic to $\mathbb{R}^n$, a manifold is a manifold-with-boundary with empty boundary.

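The next theorem furnishes many examples of manifolds: let $g:U\to\mathbb{R}^r$ be a differentiable map from an open subset $U\subset\mathbb{R}^n$ such that $g'(p)$ has rank $r$ for every point $p$ in $g^{-1}(0)$. Then the zero set $g^{-1}(0)$ is an $(n-r)$-manifold.

For example, for $g(x)=x_1^2+\cdots+x_{n+1}^2-1$, the derivative $g'(x)=\begin{bmatrix}2x_1&2x_2&\cdots&2x_{n+1}\end{bmatrix}$ has rank one at every point $p$ in $g^{-1}(0)$. Hence, the $n$-sphere $g^{-1}(0)$ is an $n$-manifold.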

The theorem is proved as a corollary of the inverse function theorem.

Many familiar manifolds are subsets of $\mathbb{R}^n$. The next theoretically important result says that there is no other kind of manifolds. An immersion is a smooth map whose differential is injective. An embedding is an immersion that is homeomorphic (thus diffeomorphic) to the image. Whitney's embedding theorem states that each $k$-manifold can be embedded into $\mathbb{R}^{2k}$.

The proof that a manifold can be embedded into $\mathbb{R}^N$ for some $N$ is considerably easier and can be readily given here. It is known that a manifold $M$ of dimension $k$ has a finite atlas $\{\varphi_i:U_i\to\mathbb{R}^k\mid1\le i\le r\}$. Let $\lambda_i$ be smooth functions such that $\operatorname{Supp}(\lambda_i)\subset U_i$ and $\{\lambda_i=1\}$ cover $M$ (e.g., a partition of unity). Consider the map

$$f=(\lambda_1\varphi_1,\dots,\lambda_r\varphi_r,\lambda_1,\dots,\lambda_r):M\to\mathbb{R}^{(k+1)r}.$$

It is easy to see that $f$ is an injective immersion. It may not be an embedding. To fix that, we shall use:

$$(f,g):M\to\mathbb{R}^{(k+1)r+1}$$

where $g$ is a smooth proper map. The existence of a smooth proper map is a consequence of a partition of unity. See http://math.uchicago.edu/~may/REU2019/REUPapers/Smith,Zoe.pdf for the rest of the proof in the case of an immersion. $\square$

Nash's embedding theorem says that, if $M$ is equipped with a Riemannian metric, then the embedding can be taken to be isometric at the expense of increasing the dimension of the ambient Euclidean space; for this, see T. Tao's blog.

Tubular neighborhood and transversality

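A technically important result is the tubular neighborhood theorem: let $M$ be a manifold and $N\subset M$ a compact closed submanifold. Then there exists a neighborhood of $N$ in $M$ that is diffeomorphic to the normal bundle $\nu_N=TM|_N/TN$ of $N$ in $M$, with $N$ corresponding to the zero section of $\nu_N$ under the diffeomorphism.

This can be proved by putting a Riemannian metric on the manifold $M$. Indeed, the choice of metric makes the normal bundle $\nu_N$ a complementary bundle to $TN$; i.e., $TM|_N$ is the direct sum of $TN$ and $\nu_N$. Then, using the metric, we have the exponential map $\exp:U\to V$ for some neighborhood $U$ of $N$ in the normal bundle $\nu_N$ to some neighborhood $V$ of $N$ in $M$. The exponential map here may not be injective but it is possible to make it injective (thus diffeomorphic) by shrinking $U$ (for now, see https://amathew.wordpress.com/2009/11/05/the-tubular-neighborhood-theorem/#more-636).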

Integration on manifolds and distribution densities

The starting point for the topic of integration on manifolds is that there is no invariant way to integrate functions on manifolds. This may be obvious if we asked: what is an integration of functions on a finite-dimensional real vector space? (In contrast, there is an invariant way to do differentiation since, by definition, a manifold comes with a differentiable structure). There are several ways to introduce integration theory to manifolds:

Integrate differential forms.
Do integration against some measure.
Equip a manifold with a Riemannian metric and do integration against such a metric.

For example, if a manifold is embedded into a Euclidean space $\mathbb{R}^n$, then it acquires the Lebesgue measure restricting from the ambient Euclidean space and then the second approach works. The first approach is fine in many situations but it requires the manifold to be oriented (and there is a non-orientable manifold that is not pathological). The third approach generalizes and that gives rise to the notion of a density.

Generalizations

Extensions to infinite-dimensional normed spaces

The notions like differentiability extend to normed spaces.

References

- (revised 1990, Jones and Bartlett; reprinted 2014, World Scientific) [this text in particular discusses density]
- Spivak, Michael (1965). Calculus on Manifolds: A Modern Approach to Classical Theorems of Advanced Calculus. San Francisco: Benjamin Cummings. ISBN 0-8053-9021-9.

Notes and References

[1] Theorem 1.2.1 in Hörmander, Lars (1990). An Introduction to Complex Analysis in Several Variables (Third ed.). North Holland.
