Implicit function theorem explained
In multivariable calculus, the implicit function theorem is a tool that allows relations to be converted to functions of several real variables. It does so by representing the relation as the graph of a function. There may not be a single function whose graph can represent the entire relation, but there may be such a function on a restriction of the domain of the relation. The implicit function theorem gives a sufficient condition to ensure that there is such a function.
More precisely, given a system of equations (often abbreviated into), the theorem states that, under a mild condition on the partial derivatives (with respect to each) at a point, the variables are differentiable functions of the in some neighborhood of the point. As these functions can generally not be expressed in closed form, they are implicitly defined by the equations, and this motivated the name of the theorem.[1]
In other words, under a mild condition on the partial derivatives, the set of zeros of a system of equations is locally the graph of a function.
History
Augustin-Louis Cauchy (1789–1857) is credited with the first rigorous form of the implicit function theorem. Ulisse Dini (1845–1918) generalized the real-variable version of the implicit function theorem to the context of functions of any number of real variables.[2]
First example
If we define the function, then the equation cuts out the unit circle as the level set . There is no way to represent the unit circle as the graph of a function of one variable because for each choice of, there are two choices of y, namely
.
However, it is possible to represent part of the circle as the graph of a function of one variable. If we let
for, then the graph of provides the upper half of the circle. Similarly, if
, then the graph of gives the lower half of the circle.
The purpose of the implicit function theorem is to tell us that functions like and almost always exist, even in situations where we cannot write down explicit formulas. It guarantees that and are differentiable, and it even works in situations where we do not have a formula for .
Definitions
Let
be a continuously differentiable function. We think of
as the
Cartesian product
and we write a point of this product as
(x,y)=(x1,\ldots,xn,y1,\ldotsym).
Starting from the given function
, our goal is to construct a function
whose graph
is precisely the set of all
such that
.
As noted above, this may not always be possible. We will therefore fix a point
(bf{a},bf{b})=(a1,...,an,b1,...,bm)
which satisfies
, and we will ask for a
that works near the point
. In other words, we want an
open set
containing
, an open set
containing
, and a function
such that the graph of
satisfies the relation
on
, and that no other points within
do so. In symbols,
To state the implicit function theorem, we need the Jacobian matrix of
, which is the matrix of the
partial derivatives of
. Abbreviating
to
, the Jacobian matrix is
where
is the matrix of partial derivatives in the variables
and
is the matrix of partial derivatives in the variables
. The implicit function theorem says that if
is an invertible matrix, then there are
,
, and
as desired. Writing all the hypotheses together gives the following statement.
Statement of the theorem
Let
be a continuously differentiable function, and let
have coordinates
. Fix a point
(bf{a},bf{b})=(a1,...,an,b1,...,bm)
with
, where
is the zero vector. If the
Jacobian matrix (this is the right-hand panel of the Jacobian matrix shown in the previous section):
is
invertible, then there exists an open set
containing
such that there exists a unique function
such that and Moreover,
is continuously differentiable and, denoting the left-hand panel of the Jacobian matrix shown in the previous section as:
the Jacobian matrix of partial derivatives of
in
is given by the matrix product:
[3] Higher derivatives
If, moreover,
is
analytic or continuously differentiable
times in a neighborhood of
, then one may choose
in order that the same holds true for
inside
.
[4] In the analytic case, this is called the
analytic implicit function theorem.
Proof for 2D case
Suppose
is a continuously differentiable function defining a curve
. Let
be a point on the curve. The statement of the theorem above can be rewritten for this simple case as follows:
Proof. Since F is differentiable we write the differential of F through partial derivatives:
Since we are restricted to movement on the curve
and by assumption
\tfrac{\partialF}{\partialy} ≠ 0
around the point
(since
\tfrac{\partialF}{\partialy}
is continuous at
and
\left.\tfrac{\partialF}{\partialy}
≠ 0
). Therefore we have a
first-order ordinary differential equation:
Now we are looking for a solution to this ODE in an open interval around the point
for which, at every point in it,
. Since
F is continuously differentiable and from the assumption we have
From this we know that
\tfrac{\partialxF}{\partialyF}
is continuous and bounded on both ends. From here we know that
-\tfrac{\partialxF}{\partialyF}
is Lipschitz continuous in both
x and
y. Therefore, by
Cauchy-Lipschitz theorem, there exists unique
y(
x) that is the solution to the given ODE with the initial conditions.
Q.E.D.The circle example
Let us go back to the example of the unit circle. In this case n = m = 1 and
. The matrix of partial derivatives is just a 1 × 2 matrix, given by
Thus, here, the in the statement of the theorem is just the number ; the linear map defined by it is invertible if and only if . By the implicit function theorem we see that we can locally write the circle in the form for all points where . For we run into trouble, as noted before. The implicit function theorem may still be applied to these two points, by writing as a function of, that is,
; now the graph of the function will be
, since where we have, and the conditions to locally express the function in this form are satisfied.
The implicit derivative of y with respect to x, and that of x with respect to y, can be found by totally differentiating the implicit function
and equating to 0:
giving
and
Application: change of coordinates
Suppose we have an -dimensional space, parametrised by a set of coordinates
. We can introduce a new coordinate system
by supplying m functions
each being continuously differentiable. These functions allow us to calculate the new coordinates
of a point, given the point's old coordinates
using
x'1=h1(x1,\ldots,xm),\ldots,x'm=hm(x1,\ldots,xm)
. One might want to verify if the opposite is possible: given coordinates
, can we 'go back' and calculate the same point's original coordinates
? The implicit function theorem will provide an answer to this question. The (new and old) coordinates
(x'1,\ldots,x'm,x1,\ldots,xm)
are related by
f = 0, with
Now the Jacobian matrix of
f at a certain point (
a,
b) [where <math>a=(x'_1,\ldots,x'_m), b=(x_1,\ldots,x_m)</math> ] is given by
where I
m denotes the
m ×
m identity matrix, and is the matrix of partial derivatives, evaluated at (
a,
b). (In the above, these blocks were denoted by X and Y. As it happens, in this particular application of the theorem, neither matrix depends on
a.) The implicit function theorem now states that we can locally express
as a function of
if
J is invertible. Demanding
J is invertible is equivalent to det
J ≠ 0, thus we see that we can go back from the primed to the unprimed coordinates if the determinant of the Jacobian
J is non-zero. This statement is also known as the
inverse function theorem.
Example: polar coordinates
As a simple application of the above, consider the plane, parametrised by polar coordinates . We can go to a new coordinate system (cartesian coordinates) by defining functions and . This makes it possible given any point to find corresponding Cartesian coordinates . When can we go back and convert Cartesian into polar coordinates? By the previous example, it is sufficient to have, withSince, conversion back to polar coordinates is possible if . So it remains to check the case . It is easy to see that in case, our coordinate transformation is not invertible: at the origin, the value of θ is not well-defined.
Generalizations
Banach space version
Based on the inverse function theorem in Banach spaces, it is possible to extend the implicit function theorem to Banach space valued mappings.[5] [6]
Let X, Y, Z be Banach spaces. Let the mapping be continuously Fréchet differentiable. If
,
, and
is a Banach space isomorphism from
Y onto
Z, then there exist neighbourhoods
U of
x0 and
V of
y0 and a Fréchet differentiable function
g :
U →
V such that
f(
x,
g(
x)) = 0 and
f(
x,
y) = 0 if and only if
y =
g(
x), for all
.
Implicit functions from non-differentiable functions
Various forms of the implicit function theorem exist for the case when the function f is not differentiable. It is standard that local strict monotonicity suffices in one dimension. The following more general form was proven by Kumagai based on an observation by Jittorntrum.[7] [8]
Consider a continuous function
such that
. If there exist open neighbourhoods
and
of
x0 and
y0, respectively, such that, for all
y in
B,
is locally one-to-one, then there exist open neighbourhoods
and
of
x0 and
y0, such that, for all
, the equation
f(
x,
y) = 0 has a unique solution
where
g is a continuous function from
B0 into
A0.
Collapsing manifolds
Perelman’s collapsing theorem for 3-manifolds, the capstone of his proof of Thurston's geometrization conjecture, can be understood as an extension of the implicit function theorem.[9]
See also
Both the implicit function theorem and the inverse function theorem can be seen as special cases of the constant rank theorem.
Further reading
- Book: Allendoerfer, Carl B. . Carl B. Allendoerfer . Calculus of Several Variables and Differentiable Manifolds . New York . Macmillan . 1974 . Theorems about Differentiable Functions . 54–88 . 0-02-301840-2 .
- Book: Binmore, K. G. . Kenneth Binmore . Implicit Functions . Calculus . New York . Cambridge University Press . 1983 . 0-521-28952-1 . 198–211 . https://books.google.com/books?id=K8RfQgAACAAJ&pg=PA198 .
- Book: Lynn H. . Loomis . Lynn Harold Loomis . Shlomo . Sternberg . Shlomo Sternberg . Advanced Calculus . registration . Boston . Jones and Bartlett . Revised . 1990 . 164–171 . 0-86720-122-3 .
- Book: Murray H. . Protter . Murray H. Protter . Charles B. Jr. . Morrey . Charles B. Morrey Jr. . Implicit Function Theorems. Jacobians . Intermediate Calculus . New York . Springer . 2nd . 1985 . 0-387-96058-9 . 390–420 . https://books.google.com/books?id=3lTmBwAAQBAJ&pg=PA390 .
Notes and References
- Book: Chiang, Alpha C. . Alpha Chiang . Fundamental Methods of Mathematical Economics . McGraw-Hill . 3rd . 1984 . 204–206 . 0-07-010813-7 .
- Book: Steven . Krantz . Harold . Parks . The Implicit Function Theorem . Modern Birkhauser Classics . Birkhauser . 2003 . 0-8176-4285-4 . registration .
- Oswaldo . de Oliveira . The Implicit and Inverse Function Theorems: Easy Proofs . Real Anal. Exchange . 39 . 1 . 2013 . 10.14321/realanalexch.39.1.0207 . 214–216 . 118792515 . 1212.2066 .
- Book: K. . Fritzsche . H. . Grauert . 2002 . From Holomorphic Functions to Complex Manifolds . Springer . 34 . 9780387953953 .
- Book: Lang, Serge . Serge Lang . Fundamentals of Differential Geometry . limited . 1999 . Springer . New York . Graduate Texts in Mathematics . 0-387-98593-X . 15–21 .
- Book: Edwards, Charles Henry . Advanced Calculus of Several Variables . Dover Publications . Mineola, New York . 1994 . 1973 . 0-486-68336-2 . 417–418 .
- K. . Jittorntrum . An Implicit Function Theorem . Journal of Optimization Theory and Applications . 25 . 4 . 1978 . 10.1007/BF00933522 . 575–577 . 121647783 .
- S. . Kumagai . An implicit function theorem: Comment . Journal of Optimization Theory and Applications . 31 . 2 . 1980 . 10.1007/BF00934117 . 285–288 . 119867925 .
- Cao . Jianguo . Ge . Jian . A simple proof of Perelman's collapsing theorem for 3-manifolds . J. Geom. Anal. . 2011 . 21 . 4 . 807–869. 10.1007/s12220-010-9169-5 . 1003.2215 . 514106 .