In mathematics, specifically differential calculus, the inverse function theorem gives a sufficient condition for a function to be invertible in a neighborhood of a point in its domain: namely, that its derivative is continuous and non-zero at the point. The theorem also gives a formula for the derivative of the inverse function. In multivariable calculus, this theorem can be generalized to any continuously differentiable, vector-valued function whose Jacobian determinant is nonzero at a point in its domain, giving a formula for the Jacobian matrix of the inverse. There are also versions of the inverse function theorem for holomorphic functions, for differentiable maps between manifolds, for differentiable functions between Banach spaces, and so forth.
The theorem was first established by Picard and Goursat using an iterative scheme: the basic idea is to prove a fixed point theorem using the contraction mapping theorem.
For functions of a single variable, the theorem states that if $f$ is a continuously differentiable function with nonzero derivative at the point $a$, then $f$ is invertible in a neighborhood of $a$, the inverse is continuously differentiable near $b = f(a)$, and the derivative of the inverse function at $b$ is the reciprocal of the derivative of $f$ at $a$:
$$\bigl(f^{-1}\bigr)'(b) = \frac{1}{f'(a)}, \qquad b = f(a).$$
It can happen that a function $f$ is injective near a point $a$ while $f'(a) = 0$; an example is $f(x) = (x - a)^3$. In fact, for such a function, the inverse cannot be differentiable at $b = f(a)$, since if $f^{-1}$ were differentiable at $b$, then, by the chain rule, $1 = (f^{-1} \circ f)'(a) = (f^{-1})'(b)\, f'(a)$, implying $f'(a) \ne 0$.
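As a numerical illustration of the reciprocal formula, one can check it for a hypothetical choice of function not taken from the text above, say $f(x) = x^3 + x$, which is globally invertible since $f'(x) = 3x^2 + 1 > 0$; a minimal sketch:

```python
# Check (f^{-1})'(b) = 1/f'(a) numerically for the hypothetical f(x) = x^3 + x.
def f(x):
    return x**3 + x

def f_prime(x):
    return 3 * x**2 + 1

def f_inverse(y, tol=1e-12):
    # f is strictly increasing, so invert it by bisection on a bracket.
    lo, hi = -abs(y) - 1, abs(y) + 1
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid) < y:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

a = 1.0
b = f(a)  # b = 2.0
h = 1e-6
# Central-difference estimate of (f^{-1})'(b):
inv_deriv = (f_inverse(b + h) - f_inverse(b - h)) / (2 * h)
print(inv_deriv, 1 / f_prime(a))  # both close to 1/f'(1) = 0.25
```

The finite-difference estimate agrees with $1/f'(a)$ up to discretization error.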
For functions of more than one variable, the theorem states that if $f$ is a continuously differentiable function from an open subset $A$ of $\R^n$ into $\R^n$, and the derivative $f'(a)$ is invertible at a point $a$ (that is, the determinant of the Jacobian matrix of $f$ at $a$ is nonzero), then there exist neighborhoods $U$ of $a$ in $A$ and $V$ of $b = f(a)$ such that $f(U) \subset V$ and $f : U \to V$ is bijective. Writing $f = (f_1, \ldots, f_n)$, this means that the system of $n$ equations $y_i = f_i(x_1, \ldots, x_n)$ has a unique solution for $x_1, \ldots, x_n$ in terms of $y_1, \ldots, y_n$ when $x \in U$, $y \in V$. Note that the theorem does not say that $f$ is injective wherever $f'$ is invertible: $f$ may fail to be injective globally even if $f'$ is invertible at every point (see the example below).

Moreover, the theorem says that the inverse function
$$f^{-1} : V \to U$$
is continuously differentiable, and its derivative at $b = f(a)$ is the inverse of $f'(a)$:
$$\bigl(f^{-1}\bigr)'(b) = f'(a)^{-1}.$$
In terms of the Jacobian matrices $J_{f^{-1}}(b)$ and $J_f(a)$ representing the linear maps $(f^{-1})'(b)$ and $f'(a)$, this reads
$$J_{f^{-1}}(b) = J_f(a)^{-1}.$$
The hard part of the theorem is the existence and differentiability of $f^{-1}$. Assuming these, the inverse-derivative formula follows from the chain rule applied to $f^{-1} \circ f = I$:
$$I'(a) = (f^{-1} \circ f)'(a) = (f^{-1})'(b) \circ f'(a).$$
If $f$ is of class $C^k$ for some $k \ge 1$ (or real-analytic, or smooth, i.e., $C^\infty$), then so is its inverse.
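The matrix identity $J_{f^{-1}}(b) = J_f(a)^{-1}$ can be verified numerically. The sketch below uses a hypothetical map $f(x,y) = (x + y^3,\; x^3 - y)$ (not from the text), computes $f^{-1}$ near $b = f(a)$ by Newton's method, and compares a finite-difference Jacobian of $f^{-1}$ at $b$ with the matrix inverse of $J_f(a)$:

```python
import numpy as np

# Hypothetical C^1 map on R^2 with invertible Jacobian at a = (1, 1).
def f(p):
    x, y = p
    return np.array([x + y**3, x**3 - y])

def Jf(p):
    x, y = p
    return np.array([[1.0, 3 * y**2],
                     [3 * x**2, -1.0]])

def f_inverse(q, start):
    # Newton's method: solve f(p) = q starting near the known preimage.
    p = np.array(start, dtype=float)
    for _ in range(50):
        p = p - np.linalg.solve(Jf(p), f(p) - q)
    return p

a = np.array([1.0, 1.0])
b = f(a)
h = 1e-6
# Finite-difference Jacobian of f^{-1} at b, built column by column.
cols = [(f_inverse(b + h * e, a) - f_inverse(b - h * e, a)) / (2 * h)
        for e in np.eye(2)]
J_inv_numeric = np.column_stack(cols)
print(np.allclose(J_inv_numeric, np.linalg.inv(Jf(a)), atol=1e-4))
```

Here $J_f(a) = \begin{bmatrix}1 & 3\\ 3 & -1\end{bmatrix}$ has determinant $-10 \ne 0$, so the theorem applies at $a$.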
There are two variants of the inverse function theorem. Given a continuously differentiable map $f : U \to \R^m$, the first says: if the derivative $f'(a)$ is surjective (i.e., the Jacobian matrix of $f$ at $a$ has rank $m$), then there exists a continuously differentiable function $g$ defined on a neighborhood $V$ of $b = f(a)$ such that $f \circ g = I$ near $b$. The second says: if the derivative $f'(a)$ is injective, then there exists a continuously differentiable function $g$ defined on a neighborhood $V$ of $b = f(a)$ such that $g \circ f = I$ near $a$.

In the first case (when $f'(a)$ is surjective), the point $b = f(a)$ is called a regular value. Since
$$m = \dim\ker(f'(a)) + \dim\operatorname{im}(f'(a)),$$
the first case requires the dimension of the domain of $f$ to be at least $m$, while the second requires it to be at most $m$. In the first case, $f$ admits a local right inverse (a local section) near $b = f(a)$; in the second, a local left inverse (a local retraction) near $a$; in both cases, $f'(a)$ has full rank.

These variants are restatements of the inverse function theorem. Indeed, in the first case, when $f'(a)$ is surjective, we can find an (injective) linear map $T$ such that $f'(a) \circ T = I$. Define $h(x) = a + Tx$, so that
$$(f \circ h)'(0) = f'(a) \circ T = I.$$
By the inverse function theorem, $f \circ h$ has an inverse near $0$; i.e., $f \circ h \circ (f \circ h)^{-1} = I$ near $b$, which gives the first variant with $g = h \circ (f \circ h)^{-1}$. The second case, when $f'(a)$ is injective, is handled similarly.
Consider the vector-valued function $F : \R^2 \to \R^2$ defined by
$$F(x,y) = \begin{bmatrix} e^x \cos y \\ e^x \sin y \end{bmatrix}.$$
Its Jacobian matrix at $(x, y)$ is
$$J_F(x,y) = \begin{bmatrix} e^x \cos y & -e^x \sin y \\ e^x \sin y & e^x \cos y \end{bmatrix}$$
with determinant
$$\det J_F(x,y) = e^{2x} \cos^2 y + e^{2x} \sin^2 y = e^{2x}.$$
The determinant $e^{2x}$ is nonzero everywhere, so the theorem guarantees that for every point of $\R^2$ there is a neighborhood on which $F$ is invertible. Nevertheless, $F$ is not invertible globally, because it is not injective: $F(x,y) = F(x, y + 2\pi)$.
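Both claims about this example, the determinant formula and the failure of global injectivity, are easy to confirm numerically; a minimal sketch:

```python
import math

# The example map F(x, y) = (e^x cos y, e^x sin y) and its Jacobian determinant.
def F(x, y):
    return (math.exp(x) * math.cos(y), math.exp(x) * math.sin(y))

def det_JF(x, y):
    ex = math.exp(x)
    a11, a12 = ex * math.cos(y), -ex * math.sin(y)
    a21, a22 = ex * math.sin(y), ex * math.cos(y)
    return a11 * a22 - a12 * a21  # simplifies to e^{2x}

x, y = 0.3, 1.2
print(abs(det_JF(x, y) - math.exp(2 * x)) < 1e-12)   # det J_F = e^{2x} > 0
# F is not injective: shifting y by 2*pi gives the same value.
print(all(abs(u - v) < 1e-12 for u, v in zip(F(x, y), F(x, y + 2 * math.pi))))
```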
If one drops the assumption that the derivative is continuous, the function need no longer be invertible. For example, $f(x) = x + 2x^2 \sin(\tfrac{1}{x})$ for $x \ne 0$ and $f(0) = 0$ has derivative
$$f'(x) = 1 - 2\cos(\tfrac{1}{x}) + 4x\sin(\tfrac{1}{x}) \quad (x \ne 0), \qquad f'(0) = 1,$$
which is discontinuous at $x = 0$ and changes sign arbitrarily close to $x = 0$. Hence $f$ is not monotone on any interval around $x = 0$, and so $f$ is not invertible on any neighborhood of $x = 0$, despite $f'(0) = 1$.
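The sign changes of $f'$ near $0$ can be exhibited directly: at the points $x_n = 1/(2\pi n)$ the cosine term equals $1$, so $f'(x_n) \approx -1$ even though $f'(0) = 1$. A minimal sketch:

```python
import math

# The counterexample f(x) = x + 2 x^2 sin(1/x), f(0) = 0.
def f(x):
    return x + 2 * x**2 * math.sin(1 / x) if x != 0 else 0.0

def f_prime(x):
    # For x != 0: f'(x) = 1 - 2 cos(1/x) + 4 x sin(1/x); and f'(0) = 1.
    return 1 - 2 * math.cos(1 / x) + 4 * x * math.sin(1 / x) if x != 0 else 1.0

# At x_n = 1/(2*pi*n), cos(1/x_n) = 1 and sin(1/x_n) = 0, so f'(x_n) ~ -1:
# the derivative is negative arbitrarily close to 0, so f is not monotone
# on any neighborhood of 0.
for n in (10, 100, 1000):
    x = 1 / (2 * math.pi * n)
    print(x, f_prime(x))  # f'(x) close to -1
```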
As an important result, the inverse function theorem has been given numerous proofs. The proof most commonly seen in textbooks relies on the contraction mapping principle, also known as the Banach fixed-point theorem (which can also be used as the key step in the proof of existence and uniqueness of solutions to ordinary differential equations).[2] [3]
Since the fixed point theorem applies in infinite-dimensional (Banach space) settings, this proof generalizes immediately to the infinite-dimensional version of the inverse function theorem[4] (see Generalizations below).
An alternate proof in finite dimensions hinges on the extreme value theorem for functions on a compact set. This approach has the advantage that the proof generalizes to settings without Cauchy completeness (see the section on real closed fields below).
Yet another proof uses Newton's method, which has the advantage of providing an effective version of the theorem: bounds on the derivative of the function imply an estimate of the size of the neighborhood on which the function is invertible.[5]
We want to prove the following: Let $D \subseteq \R$ be an open set with $x_0 \in D$, let $f : D \to \R$ be a continuously differentiable function, and suppose that $f'(x_0) \ne 0$. Then there exists an open interval $I$ with $x_0 \in I$ such that $f$ maps $I$ bijectively onto the open interval $J = f(I)$, and the inverse function $f^{-1} : J \to I$ is continuously differentiable; moreover, for any $y \in J$, if $x \in I$ is such that $f(x) = y$, then
$$(f^{-1})'(y) = \dfrac{1}{f'(x)}.$$

We may without loss of generality assume that $f'(x_0) > 0$. Since $D$ is open and $f'$ is continuous at $x_0$, there exists $r > 0$ such that $(x_0 - r, x_0 + r) \subseteq D$ and
$$|f'(x) - f'(x_0)| < \tfrac{1}{2} f'(x_0) \quad \text{for all } |x - x_0| < r.$$
In particular, $f'(x) > \tfrac{1}{2} f'(x_0) > 0$ for $|x - x_0| < r$. This shows that $f$ is strictly increasing for $|x - x_0| < r$. Choose $\delta > 0$ with $\delta < r$, so that $[x_0 - \delta, x_0 + \delta] \subseteq (x_0 - r, x_0 + r)$. Then $f$, being continuous and strictly increasing, maps $[x_0 - \delta, x_0 + \delta]$ bijectively onto $[f(x_0 - \delta), f(x_0 + \delta)]$. Set $I = (x_0 - \delta, x_0 + \delta)$ and $J = (f(x_0 - \delta), f(x_0 + \delta))$; then $f : I \to J$ is a bijection, and the inverse $f^{-1} : J \to I$ exists and is continuous. That $f^{-1} : J \to I$ is continuously differentiable follows from the differentiability of $f$: if $f : I \to \R$ is differentiable at $x_0 \in I$ with $f'(x_0) \ne 0$ and $f^{-1}$ is continuous at $y_0 = f(x_0)$, then $f^{-1} : f(I) \to \R$ is differentiable at $y_0$ with
$$(f^{-1})'(y_0) = \dfrac{1}{f'(x_0)}.$$
To prove existence, it can be assumed after an affine transformation that $f(0) = 0$ and $f'(0) = I$, so that $a = b = 0$.

By the mean value theorem for vector-valued functions, a differentiable function $u : [0,1] \to \R^m$ satisfies $\|u(1) - u(0)\| \le \sup_{0 \le t \le 1} \|u'(t)\|$. Applying this to $u(t) = f(x + t(x' - x)) - x - t(x' - x)$ gives
$$\|f(x) - f(x') - x + x'\| \le \|x - x'\| \sup_{0 \le t \le 1} \|f'(x + t(x' - x)) - I\|.$$
Now choose $\delta > 0$ so that $\|f'(x) - I\| < \tfrac{1}{2}$ for $\|x\| < \delta$. For $y$ with $\|y\| < \delta/2$, define the sequence $(x_n)$ inductively by $x_0 = 0$ and $x_{n+1} = x_n + y - f(x_n)$. The inequality above shows that if $\|x\|, \|x'\| < \delta$, then
$$\|f(x) - f(x') - x + x'\| \le \|x - x'\|/2.$$
In particular, $f(x) = f(x')$ implies $x = x'$. By induction, $\|x_n\| < \delta$ and $\|x_{n+1} - x_n\| < \delta/2^n$. Thus $(x_n)$ is a Cauchy sequence converging to some $x$ with $\|x\| \le \delta$ and $f(x) = y$.
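The successive-approximation scheme $x_{n+1} = x_n + y - f(x_n)$ from this proof is directly computable. A minimal sketch for a hypothetical one-dimensional example $f(x) = x + x^2/4$ (so $f(0) = 0$, $f'(0) = 1$, and $|f'(x) - 1| \le 1/2$ for $|x| \le 1$, i.e., $\delta = 1$ works):

```python
# The iteration x_{n+1} = x_n + y - f(x_n) from the proof, applied to the
# hypothetical map f(x) = x + x^2/4 with f(0) = 0 and f'(0) = 1.
def f(x):
    return x + x**2 / 4

y = 0.3   # |y| < delta/2 = 1/2
x = 0.0   # x_0 = 0
for n in range(60):
    x = x + y - f(x)  # contraction step; increments shrink geometrically
print(x, f(x))        # f(x) is numerically equal to y
```

The iterates converge to the unique preimage $x = 2(\sqrt{1.3} - 1) \approx 0.2804$ of $y = 0.3$.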
To check that $g = f^{-1}$ is differentiable, write $g(y + k) = x + h$, so that $f(x + h) = f(x) + k$. By the inequality above, $\|h - k\| < \|h\|/2$, and hence $\|h\|/2 < \|k\| < 2\|h\|$. Let $A = f'(x)$. Since $\|A - I\| < \tfrac{1}{2}$, the operator $A$ is invertible (apply the geometric series to $B = I - A$) with $\|A^{-1}\| < 2$. Then
$${\|g(y + k) - g(y) - f'(x)^{-1} k\| \over \|k\|} = {\|h - f'(x)^{-1}[f(x + h) - f(x)]\| \over \|k\|} \le 4 \, {\|f(x + h) - f(x) - f'(x)h\| \over \|h\|},$$
which tends to $0$ as $k \to 0$ (and hence $h \to 0$), by the differentiability of $f$ at $x$. This proves that $g$ is differentiable with $g'(y) = f'(g(y))^{-1}$, and the continuity of $g'$ follows from that of $f'$ and $g$.
The proof above is presented for a finite-dimensional space, but applies equally well to Banach spaces. If an invertible function $f$ is $C^k$ with $k > 1$, then so is its inverse. This follows by induction, using the fact that the map $F(A) = A^{-1}$ on invertible operators is $C^k$ for any $k$ (in the finite-dimensional case this is an elementary fact, because the inverse of a matrix is given by the adjugate matrix divided by the determinant).
Here is a proof based on the contraction mapping theorem. Specifically, following T. Tao,[7] it uses the following consequence of the contraction mapping theorem.

Lemma — Let $B(0, r)$ denote the open ball of radius $r$ centered at $0$ in $\R^n$, and let $g : B(0, r) \to \R^n$ be a map with $g(0) = 0$ for which there is a constant $0 < c < 1$ such that
$$|g(y) - g(x)| \le c|y - x|$$
for all $x, y$ in $B(0, r)$. Then $f = I + g$ is injective on $B(0, r)$ and
$$B(0, (1 - c)r) \subset f(B(0, r)) \subset B(0, (1 + c)r).$$

Basically, the lemma says that a small perturbation of the identity map by a contraction map is injective and preserves a ball in some sense. Assuming the lemma for a moment, we prove the theorem first. As in the above proof, it is enough to prove the special case when $a = 0$, $b = f(a) = 0$ and $f'(0) = I$. Let $g = f - I$. The mean value inequality applied to $t \mapsto g(x + t(y - x))$ gives
$$|g(y) - g(x)| \le |y - x| \sup_{0 \le t \le 1} |g'(x + t(y - x))|.$$
Since $g'(0) = I - I = 0$ and $g'$ is continuous, we can find $r > 0$ such that
$$|g(y) - g(x)| \le 2^{-1}|y - x|$$
for all $x, y$ in $B(0, r)$. Then the lemma says that $f = g + I$ is injective on $B(0, r)$ and $B(0, r/2) \subset f(B(0, r))$. Hence
$$f : U = B(0, r) \cap f^{-1}(B(0, r/2)) \to V = B(0, r/2)$$
is bijective, and thus $f^{-1} : V \to U$ exists.
Next we show that $g = f^{-1}$ is continuously differentiable (this part of the argument is the same as in the previous proof). This time, let $g$ denote the inverse of $f$ and let $A = f'(x)$ with $x = g(y)$. For such $y$ and small $k$, write $g(y + k) = x + h$, i.e., $y + k = f(x + h)$. By the estimate above,
$$|h - k| = |f(x + h) - f(x) - h| \le |h|/2,$$
and so $|h|/2 \le |k|$. Writing $\|\cdot\|$ for the operator norm,
$$|g(y + k) - g(y) - A^{-1}k| = |h - A^{-1}(f(x + h) - f(x))| \le \|A^{-1}\| \, |Ah - f(x + h) + f(x)|.$$
As $k \to 0$, we have $h \to 0$, and $|h|/|k|$ is bounded. Hence $g$ is differentiable at $y$ with $g'(y) = f'(g(y))^{-1}$. Moreover, $g'$ is the composition $\iota \circ f' \circ g$, where $\iota : T \mapsto T^{-1}$; since inversion of operators is continuous, $g'$ is continuous.
It remains to show the lemma. First, since $x - y = (f(x) - f(y)) - (g(x) - g(y))$, the triangle inequality gives
$$|x - y| - |f(x) - f(y)| \le |g(x) - g(y)| \le c|x - y|,$$
i.e.,
$$(1 - c)|x - y| \le |f(x) - f(y)|.$$
In particular, $f$ is injective on $B(0, r)$. Next we show $f(B(0, r)) \supset B(0, (1 - c)r)$. Given a point $y$ in $B(0, (1 - c)r)$, consider the map
$$F : \overline{B}(0, r') \to \overline{B}(0, r'), \quad x \mapsto y - g(x),$$
where $0 < r' < r$ is chosen so that $|y| \le (1 - c)r'$; note $F$ indeed maps the closed ball to itself, since $|F(x)| \le |y| + c|x| \le (1 - c)r' + cr' = r'$. Since $F$ is a contraction map, it has a unique fixed point $x$: that is, $x = y - g(x)$, i.e., $f(x) = x + g(x) = y$. Finally, $f(B(0, r)) \subset B(0, (1 + c)r)$, since
$$|f(x)| = |x + g(x) - g(0)| \le (1 + c)|x|. \quad \square$$

As might be clear, this proof is not substantially different from the previous one, since the proof of the contraction mapping theorem itself proceeds by successive approximation.
The inverse function theorem can be used to solve a system of equations
$$\begin{align} &f_1(x) = y_1\\ & \ \ \vdots\\ &f_n(x) = y_n,\end{align}$$
i.e., to solve for $x = (x_1, \ldots, x_n)$ in terms of $y_1, \ldots, y_n$, provided the Jacobian matrix is invertible. The implicit function theorem allows us to solve the more general system
$$\begin{align} &f_1(x, y) = 0\\ & \ \ \vdots\\ &f_n(x, y) = 0\end{align}$$
for $y$ in terms of $x$. Though more general, the implicit function theorem is in fact a consequence of the inverse function theorem. First, the precise statement:

Implicit function theorem — Given a map $f : \R^n \times \R^m \to \R^m$, if $f(a, b) = 0$, $f$ is continuously differentiable in a neighborhood of $(a, b)$, and the derivative of $y \mapsto f(a, y)$ at $b$ is invertible, then there exists a differentiable map $g : U \to V$ for some neighborhoods $U$, $V$ of $a$, $b$ such that $f(x, g(x)) = 0$ for $x \in U$. Moreover, if $f(x, y) = 0$ with $x \in U$, $y \in V$, then $y = g(x)$; i.e., $g(x)$ is the unique solution.

To see this, consider the map $F(x, y) = (x, f(x, y))$. Since the derivative of $F$ at $(a, b)$ is invertible, the inverse function theorem gives an inverse $G = (G_1, G_2)$ of $F : U \times V \to W$ for some neighborhoods $U$, $V$, $W$ of $a$, $b$, $(a, 0)$. We then have
$$(x, y) = F(G_1(x, y), G_2(x, y)) = (G_1(x, y), f(G_1(x, y), G_2(x, y))),$$
implying
$$x = G_1(x, y) \quad \text{and} \quad y = f(x, G_2(x, y)).$$
Thus $g(x) = G_2(x, 0)$ has the required property. $\square$
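The implicit function theorem can be illustrated numerically for a hypothetical equation not taken from the text above, $f(x, y) = x^2 + y^2 - 1 = 0$ near $(a, b) = (0, 1)$, where $\partial f/\partial y(0, 1) = 2 \ne 0$ and the local solution is $g(x) = \sqrt{1 - x^2}$; a minimal sketch:

```python
import math

# Implicit function theorem sketch for f(x, y) = x^2 + y^2 - 1 near (0, 1):
# df/dy(0, 1) = 2 != 0, so y = g(x) exists locally; here g(x) = sqrt(1 - x^2).
def f(x, y):
    return x**2 + y**2 - 1

def g(x, y0=1.0):
    # Solve f(x, y) = 0 for y by Newton's method in y, starting near b = 1.
    y = y0
    for _ in range(50):
        y = y - f(x, y) / (2 * y)  # df/dy = 2y
    return y

x = 0.3
print(abs(g(x) - math.sqrt(1 - x**2)) < 1e-12)  # matches the explicit solution
print(abs(f(x, g(x))) < 1e-12)                  # f(x, g(x)) = 0
```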
In differential geometry, the inverse function theorem is used to show that the pre-image of a regular value under a smooth map is a manifold. Indeed, let $f : U \to \R^r$ be a smooth map from an open subset $U$ of $\R^n$ (since the result is local, there is no loss of generality in considering such a map). Fix a point $a$ in $f^{-1}(b)$, where $b$ is a regular value, and, permuting the coordinates if necessary, assume the matrix
$$\left[\frac{\partial f_i}{\partial x_j}(a)\right]_{1 \le i, j \le r}$$
is invertible. Then the map
$$F : U \to \R^r \times \R^{n-r} = \R^n, \quad x \mapsto (f(x), x_{r+1}, \ldots, x_n),$$
is such that $F'(a)$ is an invertible $n \times n$ matrix. By the inverse function theorem, we find the inverse $G$ of $F$, defined in a neighborhood $V \times W$ of $F(a) = (b, a_{r+1}, \ldots, a_n)$. Then
$$x = (F \circ G)(x) = (f(G(x)), G_{r+1}(x), \ldots, G_n(x)),$$
implying
$$(f \circ G)(x_1, \ldots, x_n) = (x_1, \ldots, x_r).$$
That is, after the change of coordinates given by $G$, the map $f$ becomes the coordinate projection. Since $G : V \times W \to U' = G(V \times W)$ is a bijection, the map
$$g = G(b, \cdot) : W \to f^{-1}(b) \cap U', \quad (x_{r+1}, \ldots, x_n) \mapsto G(b, x_{r+1}, \ldots, x_n),$$
is a bijection; i.e., $g$ gives a local parametrization of $f^{-1}(b)$ around $a$. Hence, $f^{-1}(b)$ is a manifold. $\square$
More generally, the theorem shows that if a smooth map $f : P \to E$ is transversal to a submanifold $M \subset E$, then the pre-image $f^{-1}(M) \hookrightarrow P$ is a submanifold.
The inverse function theorem is a local result; it applies to each point. A priori, the theorem thus shows only that the function $f$ is locally bijective (or a local diffeomorphism of some class). The following topological lemma can be used to upgrade local injectivity to injectivity that is global to some extent.

Lemma — If $A$ is a closed subset of a (second-countable) topological manifold $X$ (or, more generally, a topological space admitting an exhaustion by compact subsets) and $f : X \to Z$, with $Z$ a topological space, is a local homeomorphism that is injective on $A$, then $f$ is injective on some neighborhood of $A$.
Proof:[9] First assume $X$ is compact. If the conclusion were false, we could find, in ever-smaller neighborhoods of $A$, pairs of points $x_i \ne y_i$ with $f(x_i) = f(y_i)$; by compactness, after passing to subsequences, $x_i$ and $y_i$ converge to points $x, y$ in $A$ with $f(x) = f(y)$. Since $f$ is injective on $A$, we get $x = y$. Now, for $i$ large enough, $x_i$ and $y_i$ lie in a neighborhood of $x = y$ on which $f$ is injective; thus, $x_i = y_i$, a contradiction.
In general, consider the set
$$E = \{(x, y) \in X^2 \mid x \ne y, \, f(x) = f(y)\}.$$
It is disjoint from $S \times S$ for any subset $S \subset X$ on which $f$ is injective. Let $X_1 \subset X_2 \subset \cdots$ be an increasing sequence of compact subsets with union $X$ and with each $X_i$ contained in the interior of $X_{i+1}$. Then, by the first part of the proof, for each $i$ we can find a neighborhood $U_i$ of $A \cap X_i$ such that $U_i^2 \subset X^2 - E$. Then $U = \bigcup_i U_i$ has the required property. $\square$
The lemma implies the following (a sort of) global version of the inverse function theorem: if a continuously differentiable map has invertible derivative at each point of a closed subset $A$ of its domain and is injective on $A$, then it restricts to a diffeomorphism from some open neighborhood of $A$ onto its image. Note that if $A$ is a single point, then this is the usual inverse function theorem.
There is a version of the inverse function theorem for holomorphic maps: if $f = (f_1, \ldots, f_n)$ is a holomorphic map from an open subset of $\C^n$ into $\C^n$ whose complex Jacobian matrix is invertible at a point, then $f$ is invertible near that point with holomorphic inverse.

The theorem follows from the usual inverse function theorem. Indeed, let $J_{\R}(f)$ denote the Jacobian matrix of $f$ viewed as a map in the underlying real coordinates $x_i, y_i$, and let $J(f) = [\partial f_i/\partial z_j]$ denote the Jacobian matrix with respect to the complex variables $z_j$. Then one has
$$\det J_{\R}(f) = |\det J(f)|^2,$$
which is nonzero precisely when $\det J(f)$ is nonzero. Hence, by the usual inverse function theorem, $f$ is injective near the point, say $0$, with continuously differentiable inverse near $w = f(z)$. It remains to show that the inverse is holomorphic. By the chain rule, writing $w = f(z)$,
$$\frac{\partial}{\partial \overline{z}_j}\bigl(f_j^{-1} \circ f\bigr)(z) = \sum_k \frac{\partial f_j^{-1}}{\partial w_k}(w)\, \frac{\partial f_k}{\partial \overline{z}_j}(z) + \sum_k \frac{\partial f_j^{-1}}{\partial \overline{w}_k}(w)\, \frac{\partial \overline{f}_k}{\partial \overline{z}_j}(z),$$
where $f_j^{-1}$ denotes the $j$-th component of $f^{-1}$. Since $f_j^{-1} \circ f$ and each $f_k$ are holomorphic, the left-hand side and the first sum vanish, leaving
$$0 = \sum_k \frac{\partial f_j^{-1}}{\partial \overline{w}_k}(w)\, \frac{\partial \overline{f}_k}{\partial \overline{z}_j}(z).$$
Since the matrix $\bigl[\partial \overline{f}_k/\partial \overline{z}_j\bigr] = \overline{J(f)}$ is invertible, it follows that
$$\frac{\partial f_j^{-1}}{\partial \overline{w}_k}(w) = 0$$
for each $k$; i.e., $f^{-1}$ is holomorphic. $\square$
Similarly, there is the implicit function theorem for holomorphic functions. As already noted earlier, it can happen that an injective smooth function has an inverse that is not smooth (e.g., $f(x) = x^3$ as a map $\R \to \R$, whose inverse is not differentiable at $0$).
The inverse function theorem can be rephrased in terms of differentiable maps between differentiable manifolds. In this context the theorem states that for a differentiable map $F : M \to N$ (of class $C^1$), if the differential of $F$,
$$dF_p : T_p M \to T_{F(p)} N,$$
is a linear isomorphism at a point $p$ in $M$, then there exists an open neighborhood $U$ of $p$ such that
$$F|_U : U \to F(U)$$
is a diffeomorphism. Note that this implies that the connected components of $M$ and $N$ containing $p$ and $F(p)$ have the same dimension, as is already directly implied by the assumption that $dF_p$ is an isomorphism. If the derivative of $F$ is an isomorphism at all points $p$ in $M$, then the map $F$ is a local diffeomorphism.
The inverse function theorem can also be generalized to differentiable maps between Banach spaces $X$ and $Y$.[11] Let $U$ be an open neighbourhood of the origin in $X$ and let
$$F : U \to Y$$
be a continuously differentiable function, and assume that the Fréchet derivative $dF_0 : X \to Y$ of $F$ at $0$ is a bounded linear isomorphism of $X$ onto $Y$. Then there exists an open neighbourhood $V$ of $F(0)$ in $Y$ and a continuously differentiable map $G : V \to X$ such that $F(G(y)) = y$ for all $y$ in $V$. Moreover, $G(y)$ is the only sufficiently small solution $x$ of the equation $F(x) = y$.

There is also the inverse function theorem for Banach manifolds.[12]
The inverse function theorem (and the implicit function theorem) can be seen as a special case of the constant rank theorem, which states that a smooth map with constant rank near a point can be put in a particular normal form near that point.[13] Specifically, if $F : M \to N$ has constant rank near a point $p \in M$, then there are open neighborhoods $U$ of $p$ and $V$ of $F(p)$, and there are diffeomorphisms $u : T_p M \to U$ and $v : T_{F(p)} N \to V$, such that $F(U) \subseteq V$ and such that the derivative
$$dF_p : T_p M \to T_{F(p)} N$$
is equal to $v^{-1} \circ F \circ u$. That is, $F$ "looks like" its own derivative near $p$. The set of points $p \in M$ at which the rank is constant in a neighborhood of $p$ is an open dense subset of $M$; this is a consequence of the semicontinuity of the rank function. Thus the constant rank theorem applies to a generic point of the domain.

When the derivative of $F$ is injective (resp. surjective) at a point $p$, it is also injective (resp. surjective) in a neighborhood of $p$, and hence the rank of $F$ is constant on that neighborhood, and the constant rank theorem applies.
If true, the Jacobian conjecture would be a variant of the inverse function theorem for polynomials. It states that if a vector-valued polynomial function has a Jacobian determinant that is an invertible polynomial (that is, a nonzero constant), then it has an inverse that is also a polynomial function. It is unknown whether this is true or false, even in the case of two variables. This is a major open problem in the theory of polynomials.
When $f : \R^n \to \R^m$ with $m \le n$, $f$ is $k$ times continuously differentiable, and the Jacobian $A = \nabla f(\overline{x})$ at a point $\overline{x}$ is of rank $m$, the inverse of $f$ may fail to be unique. However, there exists a local selection function $s$ such that $f(s(y)) = y$ for all $y$ in a neighborhood of $\overline{y} = f(\overline{x})$, $s(\overline{y}) = \overline{x}$, $s$ is $k$ times continuously differentiable in this neighborhood, and
$$\nabla s(\overline{y}) = A^{\mathsf T}(A A^{\mathsf T})^{-1}$$
(i.e., $\nabla s(\overline{y})$ is the Moore–Penrose pseudoinverse of $A$).
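A selection of this kind can be computed in practice with a Gauss–Newton iteration built from the Moore–Penrose pseudoinverse $A^{\mathsf T}(AA^{\mathsf T})^{-1}$. The sketch below uses a hypothetical map $f : \R^3 \to \R^2$ (not from the text) whose Jacobian has full row rank at $\overline{x} = (1,1,1)$:

```python
import numpy as np

# Hypothetical underdetermined map f: R^3 -> R^2 with full-rank Jacobian
# at x_bar = (1, 1, 1); a local selection s is found by Gauss-Newton steps
# using the pseudoinverse A^T (A A^T)^{-1} of the Jacobian A.
def f(x):
    return np.array([x[0] + x[2]**2, x[1] - x[0] * x[2]])

def grad_f(x):
    return np.array([[1.0, 0.0, 2 * x[2]],
                     [-x[2], 1.0, -x[0]]])

def s(y, x_start):
    x = np.array(x_start, dtype=float)
    for _ in range(50):
        A = grad_f(x)
        x = x - A.T @ np.linalg.inv(A @ A.T) @ (f(x) - y)  # pseudoinverse step
    return x

x_bar = np.array([1.0, 1.0, 1.0])
y_bar = f(x_bar)
y = y_bar + np.array([0.01, -0.02])  # a nearby target value
x = s(y, x_bar)
print(np.allclose(f(x), y))  # s(y) is a preimage of y near x_bar
```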
The inverse function theorem also holds over a real closed field $k$ (or an O-minimal structure).[15] Precisely, the theorem holds for a semialgebraic (or definable) map between open subsets of $k^n$ that is continuously differentiable.

The usual proof of the inverse function theorem uses Banach's fixed point theorem, which relies on Cauchy completeness. That part of the argument is replaced by the use of the extreme value theorem, which does not need completeness. Explicitly, in the contraction-mapping proof above, Cauchy completeness is used only to establish the inclusion $B(0, r/2) \subset f(B(0, r))$. Here, we shall directly show $B(0, r/4) \subset f(B(0, r))$ instead (which is enough). Given a point $y$ in $B(0, r/4)$, consider the function
$$P(x) = |f(x) - y|^2$$
on the closed ball $\overline{B}(0, r)$. If $P'(x) = 0$, then
$$0 = P'(x) = 2\left[f_1(x) - y_1 \ \cdots \ f_n(x) - y_n\right] f'(x),$$
and so $f(x) = y$, since $f'(x)$ is invertible. By the extreme value theorem, $P$ attains a minimum at some point $x_0$ on the closed ball $\overline{B}(0, r)$, and $x_0$ in fact lies in the open ball $B(0, r)$: indeed, since $2^{-1}|x| \le |f(x)|$ on $\overline{B}(0, r)$, any boundary point $|x| = r$ satisfies
$$|f(x) - y| \ge |f(x)| - |y| > r/2 - r/4 = r/4 > |y| = |f(0) - y|,$$
so the minimum is not attained on the boundary. Hence $x_0$ is an interior minimum, so $P'(x_0) = 0$, and therefore $f(x_0) = y$. $\square$
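The completeness-free argument, finding a preimage of $y$ by minimizing $P(x) = |f(x) - y|^2$ over a closed ball, can be sketched numerically. The example below is hypothetical (not from the text): $f(x) = x + x^2/4$ on $[-1, 1]$, with $|y| < 1/4$, minimized by ternary search (valid here since $P$ is unimodal on the interval):

```python
# Completeness-free existence argument: minimize P(x) = (f(x) - y)^2 over a
# closed ball; the minimizer is interior and satisfies f(x) = y.
def f(x):
    return x + x**2 / 4   # hypothetical 1-D map, f(0) = 0, f'(0) = 1

def P(x, y):
    return (f(x) - y) ** 2

y = 0.2                   # |y| < r/4 with r = 1... here simply |y| < 1/4
lo, hi = -1.0, 1.0        # the closed ball [-r, r]
for _ in range(200):      # ternary search for the minimum of the unimodal P
    m1 = lo + (hi - lo) / 3
    m2 = hi - (hi - lo) / 3
    if P(m1, y) < P(m2, y):
        hi = m2
    else:
        lo = m1
x_min = (lo + hi) / 2
print(x_min, f(x_min))    # interior minimizer with f(x_min) = y
```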
van den Dries, L. P. D. (1998). Tame Topology and O-minimal Structures. London Mathematical Society Lecture Note Series, no. 248. Cambridge, New York, and Oakleigh, Victoria: Cambridge University Press. ISBN 9780521598385. doi:10.1017/CBO9780511525919.