Kantorovich theorem explained
The Kantorovich theorem, or Newton–Kantorovich theorem, is a mathematical statement on the semi-local convergence of Newton's method. It was first stated by Leonid Kantorovich in 1948.[1] [2] It is similar to the form of the Banach fixed-point theorem, although it states existence and uniqueness of a zero rather than a fixed point.[3]
Newton's method constructs a sequence of points that under certain conditions will converge to a solution
of an equation
or a vector solution of a system of equation
. The Kantorovich theorem gives conditions on the initial point of this sequence. If those conditions are satisfied then a solution exists close to the initial point and the sequence converges to that point.
[1] [2] Assumptions
Let
be an open subset and
a
differentiable function with a
Jacobian
that is locally
Lipschitz continuous (for instance if
is twice differentiable). That is, it is assumed that for any
there is an open subset
such that
and there exists a constant
such that for any
\|F'(x)-F'(y)\|\leL \|x-y\|
holds. The norm on the left is the operator norm. In other words, for any vector
the inequality
\|F'(x)(v)-F'(y)(v)\|\leL \|x-y\|\|v\|
must hold.
Now choose any initial point
. Assume that
is invertible and construct the Newton step
The next assumption is that not only the next point
but the entire ball
is contained inside the set
. Let
be the Lipschitz constant for the Jacobian over this ball (assuming it exists).
As a last preparation, construct recursively, as long as it is possible, the sequences
,
,
according to
\begin{alignat}{2}
hk&=-F'(x
F(xk)\\[0.4em]
\alphak&=M\|F'(x
\|\|hk\|\\[0.4em]
xk+1&=xk+hk.
\end{alignat}
Statement
Now if
then
- a solution
of
exists inside the closed ball
and
- the Newton iteration starting in
converges to
with at least linear order of convergence.
A statement that is more precise but slightly more difficult to prove uses the roots
of the quadratic polynomial
p(t)
| -1 |
=\left(\tfrac12L\|F'(x | |
| 0) |
\|-1\right)t2
-t+\|h0\|
,
t\ast/**=
| 2\|h0\| |
1\pm\sqrt{1-2\alpha0 |
}and their ratio
}.Then
- a solution
exists inside the closed ball
\barB(x1,\theta\|h0\|)\subset\bar
- it is unique inside the bigger ball
- and the convergence to the solution of
is dominated by the convergence of the Newton iteration of the quadratic polynomial
towards its smallest root
,
[4] if
t0=0,tk+1=tk-\tfrac{p(tk)}{p'(tk)}
, then
- The quadratic convergence is obtained from the error estimate[5]
\|xn+1-x*\|
\le
\|xn+1-xn\|
\le
\|h0\|.
Corollary
In 1986, Yamamoto proved that the error evaluations of the Newton method such as Doring (1969), Ostrowski (1971, 1973),[6] [7] Gragg-Tapia (1974), Potra-Ptak (1980),[8] Miel (1981),[9] Potra (1984),[10] can be derived from the Kantorovich theorem.[11]
Generalizations
There is a q-analog for the Kantorovich theorem.[12] [13] For other generalizations/variations, see Ortega & Rheinboldt (1970).[14]
Applications
Oishi and Tanabe claimed that the Kantorovich theorem can be applied to obtain reliable solutions of linear programming.[15]
Further reading
Notes and References
- Book: Deuflhard, P. . Newton Methods for Nonlinear Problems. Affine Invariance and Adaptive Algorithms . Springer . Berlin . 2004 . 3-540-21099-7 . Springer Series in Computational Mathematics . 35 .
- Book: Zeidler, E. . 1985 . Nonlinear Functional Analysis and its Applications: Part 1: Fixed-Point Theorems . New York . Springer . 0-387-96499-1 .
- Book: John E. . Dennis . John E. Dennis . Robert B. . Schnabel . Robert B. Schnabel . The Kantorovich and Contractive Mapping Theorems . Numerical Methods for Unconstrained Optimization and Nonlinear Equations . Englewood Cliffs . Prentice-Hall . 1983 . 92–94 . 0-13-627216-9 . https://books.google.com/books?id=ksvJTtJCx9cC&pg=PA92 .
- J. M. . Ortega . The Newton-Kantorovich Theorem . Amer. Math. Monthly . 75 . 1968 . 6 . 658–660 . 2313800 . 10.2307/2313800.
- W. B. . Gragg . R. A. . Tapia . 1974 . Optimal Error Bounds for the Newton-Kantorovich Theorem . SIAM Journal on Numerical Analysis . 11 . 1 . 10–13 . 2156425 . 10.1137/0711002. 1974SJNA...11...10G .
- A. M. . Ostrowski . La method de Newton dans les espaces de Banach . C. R. Acad. Sci. Paris . 27 . A . 1971 . 1251–1253 .
- Book: Ostrowski, A. M. . Solution of Equations in Euclidean and Banach Spaces . Academic Press . New York . 1973 . 0-12-530260-6 .
- F. A. . Potra . V. . Ptak . Sharp error bounds for Newton's process . Numer. Math. . 34 . 1980 . 63–72 . 10.1007/BF01463998 .
- G. J. . Miel . An updated version of the Kantorovich theorem for Newton's method . Computing . 27 . 3 . 1981 . 237–244 . 10.1007/BF02237981 .
- F. A. . Potra . On the a posteriori error estimates for Newton's method . Beiträge zur Numerische Mathematik . 12 . 1984 . 125–138 .
- Yamamoto . T. . 1986 . A method for finding sharp error bounds for Newton's method under the Kantorovich assumptions . . 49 . 2–3 . 203–220 . 10.1007/BF01389624 .
- Rajkovic . P. M. . Stankovic . M. S. . Marinkovic . S. D. . 2003 . On q-iterative methods for solving equations and systems . Novi Sad J. Math . 33 . 2 . 127–137 .
- Rajković . P. M. . Marinković . S. D. . Stanković . M. S. . 2005 . On q-Newton–Kantorovich method for solving systems of equations . Applied Mathematics and Computation . 168 . 2 . 1432–1448 . 10.1016/j.amc.2004.10.035 .
- Book: Ortega . J. M. . Rheinboldt . W. C. . 1970 . Iterative Solution of Nonlinear Equations in Several Variables . SIAM . 95021 .
- Oishi . S. . Tanabe . K. . 2009 . Numerical Inclusion of Optimum Point for Linear Programming . JSIAM Letters . 1 . 5–8 . 10.14495/jsiaml.1.5 . free .