Intermediate value theorem explained

In mathematical analysis, the intermediate value theorem states that if

f

is a continuous function whose domain contains the interval, then it takes on any given value between

f(a)

and

f(b)

at some point within the interval.

This has two important corollaries:

  1. If a continuous function has values of opposite sign inside an interval, then it has a root in that interval (Bolzano's theorem). [1]
  2. The image of a continuous function over an interval is itself an interval.

Motivation

This captures an intuitive property of continuous functions over the real numbers: given

f

continuous on

[1,2]

with the known values

f(1)=3

and

f(2)=5

, then the graph of

y=f(x)

must pass through the horizontal line

y=4

while

x

moves from

1

to

2

. It represents the idea that the graph of a continuous function on a closed interval can be drawn without lifting a pencil from the paper.

Theorem

The intermediate value theorem states the following:

Consider an interval

I=[a,b]

of real numbers

\R

and a continuous function

f\colonI\to\R

. Then

u

is a number between

f(a)

and

f(b)

, that is, \min(f(a),f(b)) then there is a

c\in(a,b)

such that

f(c)=u

.

f(I)

is also a closed interval, and it contains

l[min(f(a),f(b)),max(f(a),f(b))r]

.

Remark: Version II states that the set of function values has no gap. For any two function values

c,d\inf(I)

with

c<d

, even if they are outside the interval between

f(a)

and

f(b)

, all points in the interval

l[c,dr]

are also function values, \bigl[c,d\bigr]\subseteq f(I).A subset of the real numbers with no internal gap is an interval. Version I is naturally contained in Version II.

Relation to completeness

The theorem depends on, and is equivalent to, the completeness of the real numbers. The intermediate value theorem does not apply to the rational numbers Q because gaps exist between rational numbers; irrational numbers fill those gaps. For example, the function

f(x)=x2

for

x\in\Q

satisfies

f(0)=0

and

f(2)=4

. However, there is no rational number

x

such that

f(x)=2

, because

\sqrt2

is an irrational number.

Proof

Proof version A

The theorem may be proven as a consequence of the completeness property of the real numbers as follows:[2]

We shall prove the first case,

f(a)<u<f(b)

. The second case is similar.

Let

S

be the set of all

x\in[a,b]

such that

f(x)<u

. Then

S

is non-empty since

a

is an element of

S

. Since

S

is non-empty and bounded above by

b

, by completeness, the supremum

c=\supS

exists. That is,

c

is the smallest number that is greater than or equal to every member of

S

.

Note that, due to the continuity of

f

at

a

, we can keep

f(x)

within any

\varepsilon>0

of

f(a)

by keeping

x

sufficiently close to

a

. Since

f(a)<u

is a strict inequality, consider the implication when

\varepsilon

is the distance between

u

and

f(a)

. No

x

sufficiently close to

a

can then make

f(x)

greater than or equal to

u

, which means there are values greater than

a

in

S

. A more detailed proof goes like this:

Choose

\varepsilon=u-f(a)>0

. Then

\exists\delta>0

such that

\forallx\in[a,b]

, |x-a|<\delta \implies |f(x)-f(a)|Consider the interval

[a,min(a+\delta,b))=I1

. Notice that

I1\subseteq[a,b]

and every

x\inI1

satisfies the condition

|x-a|<\delta

. Therefore for every

x\inI1

we have

f(x)<u

. Hence

c

cannot be

a

.

Likewise, due to the continuity of

f

at

b

, we can keep

f(x)

within any

\varepsilon>0

of

f(b)

by keeping

x

sufficiently close to

b

. Since

u<f(b)

is a strict inequality, consider the similar implication when

\varepsilon

is the distance between

u

and

f(b)

. Every

x

sufficiently close to

b

must then make

f(x)

greater than

u

, which means there are values smaller than

b

that are upper bounds of

S

. A more detailed proof goes like this:

Choose

\varepsilon=f(b)-u>0

. Then

\exists\delta>0

such that

\forallx\in[a,b]

, |x-b|<\delta \implies |f(x)-f(b)|u.Consider the interval

(max(a,b-\delta),b]=I2

. Notice that

I2\subseteq[a,b]

and every

x\inI2

satisfies the condition

|x-b|<\delta

. Therefore for every

x\inI2

we have

f(x)>u

. Hence

c

cannot be

b

.

With

ca

and

cb

, it must be the case

c\in(a,b)

. Now we claim that

f(c)=u

.

Fix some

\varepsilon>0

. Since

f

is continuous at

c

,

\exists\delta1>0

such that

\forallx\in[a,b]

,

|x-c|<\delta1\implies|f(x)-f(c)|<\varepsilon

.

Since

c\in(a,b)

and

(a,b)

is open,

\exists\delta2>0

such that

(c-\delta2,c+\delta2)\subseteq(a,b)

. Set

\delta=min(\delta1,\delta2)

. Then we havef(x)-\varepsilonfor all

x\in(c-\delta,c+\delta)

. By the properties of the supremum, there exists some

a*\in(c-\delta,c]

that is contained in

S

, and sof(c)Picking

a**\in(c,c+\delta)

, we know that

a**\not\inS

because

c

is the supremum of

S

. This means thatf(c)>f(a^)-\varepsilon \geq u-\varepsilon.Both inequalitiesu-\varepsilonare valid for all

\varepsilon>0

, from which we deduce

f(c)=u

as the only possible value, as stated.

Proof version B

We will only prove the case of

f(a)<u<f(b)

, as the

f(a)>u>f(b)

case is similar.[3]

Define

g(x)=f(x)-u

which is equivalent to

f(x)=g(x)+u

and lets us rewrite

f(a)<u<f(b)

as

g(a)<0<g(b)

, and we have to prove, that

g(c)=0

for some

c\in[a,b]

, which is more intuitive. We further define the set

S=\{x\in[a,b]:g(x)\leq0\}

. Because

g(a)<0

we know, that

a\inS

so, that

S

is not empty. Moreover, as

S\subseteq[a,b]

, we know that

S

is bounded and non-empty, so by Completeness, the supremum

c=\sup(S)

exists.

There are 3 cases for the value of

g(c)

, those being

g(c)<0,g(c)>0

and

g(c)=0

. For contradiction, let us assume, that

g(c)<0

. Then, by the definition of continuity, for

\epsilon=0-g(c)

, there exists a

\delta>0

such that

x\in(c-\delta,c+\delta)

implies, that

|g(x)-g(c)|<-g(c)

, which is equivalent to

g(x)<0

. If we just chose
x=c+\delta
N
, where
N>\delta
b-c
, then

g(x)<0

and

c<x<b

, so

x\inS

. It follows that

x

is an upper bound for

S

. However,

x>c

, contradicting the upper bound property of the least upper bound

c

, so

g(c)\geq0

. Assume then, that

g(c)>0

. We similarly chose

\epsilon=g(c)-0

and know, that there exists a

\delta>0

such that

x\in(c-\delta,c+\delta)

implies

|g(x)-g(c)|<g(c)

. We can rewrite this as

-g(c)<g(x)-g(c)<g(c)

which implies, that

g(x)>0

. If we now chose
x=c-\delta
2
, then

g(x)>0

and

a<x<c

. It follows that

x

is an upper bound for

S

. However,

x<c

, which contradict the least property of the least upper bound

c

, which means, that

g(c)>0

is impossible. If we combine both results, we get that

g(c)=0

or

f(c)=u

is the only remaining possibility.

Remark: The intermediate value theorem can also be proved using the methods of non-standard analysis, which places "intuitive" arguments involving infinitesimals on a rigorous footing.[4]

History

A form of the theorem was postulated as early as the 5th century BCE, in the work of Bryson of Heraclea on squaring the circle. Bryson argued that, as circles larger than and smaller than a given square both exist, there must exist a circle of equal area.[5] The theorem was first proved by Bernard Bolzano in 1817. Bolzano used the following formulation of the theorem:[6]

Let

f,\varphi

be continuous functions on the interval between

\alpha

and

\beta

such that

f(\alpha)<\varphi(\alpha)

and

f(\beta)>\varphi(\beta)

. Then there is an

x

between

\alpha

and

\beta

such that

f(x)=\varphi(x)

.

The equivalence between this formulation and the modern one can be shown by setting

\varphi

to the appropriate constant function. Augustin-Louis Cauchy provided the modern formulation and a proof in 1821.[7] Both were inspired by the goal of formalizing the analysis of functions and the work of Joseph-Louis Lagrange. The idea that continuous functions possess the intermediate value property has an earlier origin. Simon Stevin proved the intermediate value theorem for polynomials (using a cubic as an example) by providing an algorithm for constructing the decimal expansion of the solution. The algorithm iteratively subdivides the interval into 10 parts, producing an additional decimal digit at each step of the iteration.[8] Before the formal definition of continuity was given, the intermediate value property was given as part of the definition of a continuous function. Proponents include Louis Arbogast, who assumed the functions to have no jumps, satisfy the intermediate value property and have increments whose sizes corresponded to the sizes of the increments of the variable.Earlier authors held the result to be intuitively obvious and requiring no proof. The insight of Bolzano and Cauchy was to define a general notion of continuity (in terms of infinitesimals in Cauchy's case and using real inequalities in Bolzano's case), and to provide a proof based on such definitions.

Converse is false

A Darboux function is a real-valued function that has the "intermediate value property," i.e., that satisfies the conclusion of the intermediate value theorem: for any two values and in the domain of, and any between and, there is some between and with . The intermediate value theorem says that every continuous function is a Darboux function. However, not every Darboux function is continuous; i.e., the converse of the intermediate value theorem is false.

As an example, take the function defined by for and . This function is not continuous at because the limit of as tends to 0 does not exist; yet the function has the intermediate value property. Another, more complicated example is given by the Conway base 13 function.

In fact, Darboux's theorem states that all functions that result from the differentiation of some other function on some interval have the intermediate value property (even though they need not be continuous).

Historically, this intermediate value property has been suggested as a definition for continuity of real-valued functions;[9] this definition was not adopted.

Generalizations

Multi-dimensional spaces

The Poincaré-Miranda theorem is a generalization of the Intermediate value theorem from a (one-dimensional) interval to a (two-dimensional) rectangle, or more generally, to an n-dimensional cube.

Vrahatis[10] presents a similar generalization to triangles, or more generally, n-dimensional simplices. Let Dn be an n-dimensional simplex with n+1 vertices denoted by v0,...,vn. Let F=(f1,...,fn) be a continuous function from Dn to Rn, that never equals 0 on the boundary of Dn. Suppose F satisfies the following conditions:

Then there is a point z in the interior of Dn on which F(z)=(0,...,0).

It is possible to normalize the fi such that fi(vi)>0 for all i; then the conditions become simpler:

The theorem can be proved based on the Knaster–Kuratowski–Mazurkiewicz lemma. In can be used for approximations of fixed points and zeros.[11]

General metric and topological spaces

The intermediate value theorem is closely linked to the topological notion of connectedness and follows from the basic properties of connected sets in metric spaces and connected subsets of R in particular:

X

and

Y

are metric spaces,

f\colonX\toY

is a continuous map, and

E\subsetX

is a connected subset, then

f(E)

is connected. (

Notes and References

  1. Book: 10.1007/978-3-030-11036-9. Cauchy's Calcul Infinitésimal . 2019 . Cates . Dennis M. . 978-3-030-11035-2 . 132587955. 249 .
  2. Essentially follows Book: Clarke, Douglas A.. Foundations of Analysis. Appleton-Century-Crofts . 1971. 284.
  3. Slightly modified version of Book: Abbot, Stephen. Understanding Analysis. Springer . 2015. 123.
  4. Sanders. Sam . 1704.00281 . Nonstandard Analysis and Constructivism!. 2017. math.LO.
  5. Book: Bos, Henk J. M. . The legitimation of geometrical procedures before 1590 . 10.1007/978-1-4613-0087-8_2 . 1800805 . 23–36 . Springer . New York . Sources and Studies in the History of Mathematics and Physical Sciences . Redefining Geometrical Exactness: Descartes' Transformation of the Early Modern Concept of Construction . 2001.
  6. A translation of Bolzano's paper on the intermediate value theorem. S.B.. Russ. Historia Mathematica. 1980. 7. 2. 156–185. 10.1016/0315-0860(80)90036-1. free.
  7. Who Gave You the Epsilon? Cauchy and the Origins of Rigorous Calculus. Judith V.. Grabiner. The American Mathematical Monthly. March 1983. 90. 185–194. 10.2307/2975545. 3. 2975545.
  8. Karin Usadi Katz and Mikhail G. Katz (2011) A Burgessian Critique of Nominalistic Tendencies in Contemporary Mathematics and its Historiography. Foundations of Science. See link
  9. Book: Smorynski, Craig . MVT: A Most Valuable Theorem . 2017-04-07 . Springer . 9783319529561 . en.
  10. Vrahatis . Michael N. . 2016-04-01 . Generalization of the Bolzano theorem for simplices . Topology and Its Applications . en . 202 . 40–46 . 10.1016/j.topol.2015.12.066 . 0166-8641.
  11. Vrahatis . Michael N. . 2020-04-15 . Intermediate value theorem for simplices for simplicial approximation of fixed points and zeros . Topology and Its Applications . en . 275 . 107036 . 10.1016/j.topol.2019.107036 . 0166-8641. free .