Constructive analysis explained
In mathematics, constructive analysis is mathematical analysis done according to some principles of constructive mathematics.
Introduction
The name of the subject contrasts with classical analysis, which in this context means analysis done according to the more common principles of classical mathematics. However, there are various schools of thought and many different formalizations of constructive analysis.[1] Whether classical or constructive in some fashion, any such framework of analysis axiomatizes the real number line by some means, a collection extending the rationals and with an apartness relation definable from an asymmetric order structure. Center stage takes a positivity predicate, here denoted
, which governs an equality-to-zero
. The members of the collection are generally just called the
real numbers. While this term is thus overloaded in the subject, all the frameworks share a broad common core of results that are also theorems of classical analysis.
Constructive frameworks for its formulation are extensions of Heyting arithmetic by types including
, constructive
second-order arithmetic, or strong enough
topos-,
type- or constructive set theories such as
}, a constructive counter-part of
}. Of course, a
direct axiomatization may be studied as well.
Logical preliminaries
} is not automatically assumed for every
proposition. If a proposition
\neg\neg\existsx.\theta(x)
is provable, this exactly means that the non-existence claim
being provable would be absurd, and so the latter cannot also be provable in a consistent theory. The double-negated existence claim is a logically negative statement and implied by, but generally not equivalent to the existence claim itself. Much of the intricacies of constructive analysis can be framed in terms of the weakness of propositions of the logically negative form
, which is generally weaker than
. In turn, also an implication
(\existsx.\theta(x))\to\neg\forallx.\neg\theta(x)
can generally be not reversed.
While a constructive theory proves fewer theorems than its classical counter-part in its classical presentation, it may exhibit attractive meta-logical properties. For example, if a theory
} exhibits the
disjunction property, then if it proves a disjunction
}\vdash \phi\lor \psi then also
}\vdash \phi or
}\vdash \psi. Already in classical arithmetic, this is violated for the most basic propositions about sequences of numbers - as demonstrated next.
Undecidable predicates
A common strategy of formalization of real numbers is in terms of sequences or rationals,
and so we draw motivation and examples in terms of those. So to define terms, consider a
decidable predicate on the naturals, which in the constructive vernacular means
\foralln.(Q(n)\lor\negQ(n))
is provable, and let
be the characteristic function defined to equal
exactly where
is true. The associated sequence
qn:={style\sum}
\chiQ(n)/2n+1
is monotone, with values non-strictly growing between the bounds
and
. Here, for the sake of demonstration, defining an extensional equality to the zero sequence
(q\congext0):=\foralln.qn=0
, it follows that
q\congext0\leftrightarrow\foralln.Q(n)
. Note that the symbol "
" is used in several contexts here. For any theory capturing arithmetic, there are many yet undecided and even provenly independent such statements
. Two
-examples are the
Goldbach conjecture and the
Rosser sentence of a theory.
Consider any theory
} with quantifiers ranging over
primitive recursive, rational-valued sequences. Already
minimal logic proves the non-contradiction claim for any proposition, and that the negation of excluded middle for any given proposition would be absurd. This also means there is no consistent theory (even if anti-classical) rejecting the excluded middle disjunction for any given proposition. Indeed, it holds that
}\,\,\,\vdash\,\,\,\forall(x\in^).\,\neg\neg\big((x\cong_\mathrm 0)\lor\neg(x\cong_\mathrm 0)\big)This theorem is logically equivalent to the non-existence claim of a sequence for which the excluded middle disjunction about equality-to-zero would be disprovable. No sequence with that disjunction being rejected can be exhibited.Assume the theories at hand are
consistent and arithmetically sound. Now
Gödel's theorems mean that there is an explicit sequence
such that, for any fixed precision,
} proves the zero-sequence to be a good approximation to
, but it can also meta-logically be established that
}\,\nvdash\,(g\cong_\mathrm 0) as well as
}\,\nvdash\,\neg(g\cong_\mathrm 0).
[2] Here this proposition
again amounts to the proposition of universally quantified form.Trivially
}+\,\,\,\vdash\,\,\,\forall(x\in^).\,(x\cong_\mathrm 0)\lor\neg(x\cong_\mathrm 0)even if these disjunction claims here do not carry any information. In the absence of further axioms breaking the meta-logical properties, constructive entailment instead generally reflects provability. Taboo statements that ought not be decidable (if the aim is to respect the provability interpretation of constructive claims) can be designed for definitions of a custom equivalence "
" in formalizations below as well. For implications of disjunctions of yet not proven or disproven propositions, one speaks of weak Brouwerian counterexamples.
Order vs. disjunctions
The theory of the real closed field may be axiomatized such that all the non-logical axioms are in accordance with constructive principles. This concerns a commutative ring with postulates for a positivity predicate
, with a positive unit and non-positive zero, i.e.,
and
. In any such ring, one may define
, which constitutes a strict total order in its constructive formulation (also called linear order or, to be explicit about the context, a
pseudo-order). As is usual,
is defined as
.
This first-order theory is relevant as the structures discussed below are model thereof.[3] However, this section thus does not concern aspects akin to topology and relevant arithmetic substructures are not definable therein.
As explained, various predicates will fail to be decidable in a constructive formulation, such as these formed from order-theoretical relations. This includes "
", which will be rendered equivalent to a negation. Crucial disjunctions are now discussed explicitly.
Trichotomy
In intuitonistic logic, the disjunctive syllogism in the form
(\phi\lor\psi)\to(\neg\phi\to\psi)
generally really only goes in the
-direction. In a pseudo-order, one has
\neg(x>0\lor0>x)\tox\cong0
and indeed at most one of the three can hold at once. But the stronger,
logically positive law of trichotomy disjunction does not hold in general, i.e. it is not provable that for all reals,
See
analytical
}. Other disjunctions are however implied based on other positivity results, e.g.
. Likewise, the asymmetric order in the theory ought to fulfill the weak linearity property
for all
, related to locatedness of the reals.
The theory shall validate further axioms concerning the relation between the positivity predicate
and the algebraic operations including multiplicative inversion, as well as the
intermediate value theorem for polynomial. In this theory, between any two separated numbers, other numbers exist.
Apartness
In the context of analysis, the auxiliary logically positive predicate
may be independently defined and constitutes an
apartness relation. With it, the substitute of the principles above give tightness
\neg(x\#0)\leftrightarrow(x\cong0)
Thus, apartness can also function as a definition of "
", rendering it a negation. All negations are stable in intuitionistic logic, and therefore
\neg\neg(x\congy)\leftrightarrow(x\congy)
The elusive trichotomy disjunction itself then reads
Importantly, a
proof of the disjunction
carries positive information, in both senses of the word. Via
(\phi\to\neg\psi)\leftrightarrow(\psi\to\neg\phi)
it also follows that
. In words: A demonstration that a number is somehow apart from zero is also a demonstration that this number is non-zero. But constructively it does not follow that the doubly negative statement
would imply
. Consequently, many classically equivalent statements bifurcate into distinct statement. For example, for a fixed polynomial
and fixed
, the statement that the
'th coefficient
of
is apart from zero is stronger than the mere statement that it is non-zero. A demonstration of former explicates how
and zero are related, with respect to the ordering predicate on the reals, while a demonstration of the latter shows how negation of such conditions would imply to a contradiction. In turn, there is then also a strong and a looser notion of, e.g., being a third-order polynomial.
So the excluded middle for
is apriori stronger than that for
. However, see the discussion of possible further axiomatic principles regarding the strength of "
" below.
Non-strict partial order
Lastly, the relation
may be defined by or proven equivalent to the
logically negative statement
, and then
is defined as
. Decidability of positivity may thus be expressed as
, which as noted will not be provable in general. But neither will the totality disjunction
, see also
analytical
}.
By a valid De Morgan's law, the conjunction of such statements is also rendered a negation of apartness, and so
(x\gey\landy\gex)\leftrightarrow(x\congy)
The disjunction
implies
, but the other direction is also not provable in general. In a constructive real closed field,
the relation "
" is a negation and is not equivalent to the disjunction in general.
Variations
Demanding good order properties as above but strong completeness properties at the same time implies
}. Notably, the
MacNeille completion has better completeness properties as a collection, but a more intricate theory of its order-relation and, in turn, worse locatedness properties. While less commonly employed, also this construction simplifies to the classical real numbers when assuming
}.
Invertibility
In the commutative ring of real numbers, a provably non-invertible element equals zero. This and the most basic locality structure is abstracted in the theory of Heyting fields.
Formalization
Rational sequences
A common approach is to identify the real numbers with non-volatile sequences in
. The constant sequences correspond to rational numbers. Algebraic operations such as addition and multiplication can be defined component-wise, together with a systematic reindexing for speedup. The definition in terms of sequences furthermore enables the definition of a strict order "
" fulfilling the desired axioms. Other relations discussed above may then be defined in terms of it. In particular, any number
apart from
, i.e.
, eventually has an index beyond which all its elements are invertible.
[4] Various implications between the relations, as well as between sequences with various properties, may then be proven.
Moduli
As the maximum on a finite set of rationals is decidable, an absolute value map on the reals may be defined and Cauchy convergence and limits of sequences of reals can be defined as usual.
A modulus of convergence is often employed in the constructive study of Cauchy sequences of reals, meaning the association of any
to an appropriate index (beyond which the sequences are closer than
) is required in the form of an explicit, strictly increasing function
\varepsilon\mapstoN(\varepsilon)
. Such a modulus may be considered for a sequence of reals, but it may also be considered for all the reals themselves, in which case one is really dealing with a sequence of pairs.
Bounds and suprema
, negatively characterized using
. One may speak of least upper bounds with respect to "
". A
supremum is an upper bound given through a sequence of reals, positively characterized using "
". If a subset with an upper bound is well-behaved with respect to "
" (discussed below), it has a supremum.
Bishop's formalization
One formalization of constructive analysis, modeling the order properties described above, proves theorems for sequences of rationals
fulfilling the
regularity condition
|xn-xm|\le\tfrac{1}{n}+\tfrac{1}{m}
. An alternative is using the tighter
instead of
, and in the latter case non-zero indices ought to be used. No two of the rational entries in a regular sequence are more than
apart and so one may compute natural numbers exceeding any real. For the regular sequences, one defines the logically positive loose positivity property as
x>0:=\existsn.xn>\tfrac{1}{n}
, where the relation on the right hand side is in terms of rational numbers. Formally, a positive real in this language is a regular sequence together with a natural witnessing positivity. Further,
x\congy:=\foralln.|xn-yn|\le\tfrac{2}{n}
, which is logically equivalent to the negation
\neg\existsn.|xn-yn|>\tfrac{2}{n}
. This is provably transitive and in turn an
equivalence relation. Via this predicate, the regular sequences in the band
are deemed equivalent to the zero sequence. Such definitions are of course compatible with classical investigations and variations thereof were well studied also before. One has
as
. Also,
may be defined from a numerical non-negativity property, as
for all
, but then shown to be equivalent of the logical negation of the former.
[5] [6] Variations
The above definition of
uses a common bound
. Other formalizations directly take as definition that for any fixed bound
, the numbers
and
must eventually be forever at least as close.Exponentially falling bounds
are also used, also say in a real number condition
, and likewise for the equality of two such reals. And also the sequences of rationals may be required to carry a modulus of convergence. Positivity properties may defined as being eventually forever apart by some rational.
Function choice in
or stronger principles aid such frameworks.
Coding
It is worth noting that sequences in
can be coded rather compactly, as they each may be mapped to a unique subclass of
. A sequence rationals
may be encoded as set of quadruples
. In turn, this can be encoded as unique naturals
using the
fundamental theorem of arithmetic. There are more economic
pairing functions as well, or extension encoding tags or metadata. For an example using this encoding, the sequence
i\mapsto
| i\tfrac{1}{k} |
{style\sum} | |
| k=0 |
, or
1,2,\tfrac{5}{2},\tfrac{8}{3},...
, may be used to compute
Euler's number and with the above coding it maps to the subclass
\{15,90,24300,6561000,...\}
of
. While this example, an explicit sequence of sums, is a total recursive function to begin with, the encoding also means these objects are in scope of the quantifiers in second-order arithmetic.
Set theory
Cauchy reals
In some frameworks of analysis, the name real numbers is given to such well-behaved sequences or rationals, and relations such as
are called the
equality or real numbers. Note, however, that there are properties which can distinguish between two
-related reals.
In contrast, in a set theory that models the naturals
and validates the existence of even classically uncountable function spaces (and certainly say
} or even
}) the numbers equivalent with respect to "
" in
may be collected into a set and then this is called the
Cauchy real number. In that language, regular rational sequences are degraded to a mere representative of a Cauchy real. Equality of those reals is then given by the equality of sets, which is governed by the set theoretical
axiom of extensionality. An upshot is that the set theory will prove properties for the reals, i.e. for this class of sets, expressed using the logical equality. Constructive reals in the presence of appropriate choice axioms will be Cauchy-complete but not automatically order-complete.
[7] Dedekind reals
In this context it may also be possible to model a theory or real numbers in terms of Dedekind cuts of
. At least when assuming
} or dependent choice, these structures are isomorphic.
Interval arithmetic
Another approach is to define a real number as a certain subset of
, holding pairs representing inhabited, pairwise intersecting intervals.
Uncountability
Recall that the preorder on cardinals "
" in set theory is the primary notion defined as
injection existence. As a result, the constructive theory of cardinal order can diverge substantially from the classical one. Here, sets like
or some models of the reals can be taken to be
subcountable.
That said, Cantors diagonal construction proving uncountability of powersets like
and plain function spaces like
is
intuitionistically valid. Assuming
} or alternatively the
countable choice axiom, models of
are always uncountable also over a constructive framework.
[8] One variant of the diagonal construction relevant for the present context may be formulated as follows, proven using countable choice and for reals as sequences of rationals:
[9] For any two pair of reals
and any sequence of reals
, there exists a real
with
and
.Formulations of the reals aided by explicit moduli permit separate treatments.
According to Kanamori, "a historical misrepresentation has been perpetuated that associates diagonalization with non-constructivity" and a constructive component of the diagonal argument already appeared in Cantor's work.[10]
Category and type theory
All these considerations may also be undertaken in a topos or appropriate dependent type theory.
Principles
For practical mathematics, the axiom of dependent choice is adopted in various schools.
Markov's principle is adopted in the Russian school of recursive mathematics. This principle strengthens the impact of proven negation of strict equality. A so-called analytical form of it grants
or
. Weaker forms may be formulated.
The Brouwerian school reasons in terms of spreads and adopts the classically valid bar induction.
Anti-classical schools
Through the optional adoption of further consistent axioms, the negation of decidability may be provable. For example, equality-to-zero is rejected to be decidable when adopting Brouwerian continuity principles or Church's thesis in recursive mathematics.[11] The weak continuity principle as well as
even refute
. The existence of a
Specker sequence is proven from
. Such phenomena also occur in realizability topoi. Notably, there are two anti-classical schools as incompatible with one-another. This article discusses principles compatible with the classical theory and choice is made explicit.
Theorems
Many classical theorems can only be proven in a formulation that is logically equivalent, over classical logic. Generally speaking, theorem formulation in constructive analysis mirrors the classical theory closest in separable spaces. Some theorems can only be formulated in terms of approximations.
The intermediate value theorem
For a simple example, consider the intermediate value theorem (IVT).In classical analysis, IVT implies that, given any continuous function f from a closed interval [''a'',''b''] to the real line R, if f(a) is negative while f(b) is positive, then there exists a real number c in the interval such that f(c) is exactly zero.In constructive analysis, this does not hold, because the constructive interpretation of existential quantification ("there exists") requires one to be able to construct the real number c (in the sense that it can be approximated to any desired precision by a rational number).But if f hovers near zero during a stretch along its domain, then this cannot necessarily be done.
However, constructive analysis provides several alternative formulations of IVT, all of which are equivalent to the usual form in classical analysis, but not in constructive analysis.For example, under the same conditions on f as in the classical theorem, given any natural number n (no matter how large), there exists (that is, we can construct) a real number cn in the interval such that the absolute value of f(cn) is less than 1/n.That is, we can get as close to zero as we like, even if we can't construct a c that gives us exactly zero.
Alternatively, we can keep the same conclusion as in the classical IVT—a single c such that f(c) is exactly zero—while strengthening the conditions on f.We require that f be locally non-zero, meaning that given any point x in the interval [''a'',''b''] and any natural number m, there exists (we can construct) a real number y in the interval such that |y - x| < 1/m and |f(y)| > 0.In this case, the desired number c can be constructed.This is a complicated condition, but there are several other conditions that imply it and that are commonly met; for example, every analytic function is locally non-zero (assuming that it already satisfies f(a) < 0 and f(b) > 0).
For another way to view this example, notice that according to classical logic, if the locally non-zero condition fails, then it must fail at some specific point x; and then f(x) will equal 0, so that IVT is valid automatically.Thus in classical analysis, which uses classical logic, in order to prove the full IVT, it is sufficient to prove the constructive version. From this perspective, the full IVT fails in constructive analysis simply because constructive analysis does not accept classical logic. Conversely, one may argue that the true meaning of IVT, even in classical mathematics, is the constructive version involving the locally non-zero condition, with the full IVT following by "pure logic" afterwards.Some logicians, while accepting that classical mathematics is correct, still believe that the constructive approach gives a better insight into the true meaning of theorems, in much this way.
The least-upper-bound principle and compact sets
Another difference between classical and constructive analysis is that constructive analysis does not prove the least-upper-bound principle, i.e. that any subset of the real line R would have a least upper bound (or supremum), possibly infinite.However, as with the intermediate value theorem, an alternative version survives; in constructive analysis, any located subset of the real line has a supremum.(Here a subset S of R is located if, whenever x < y are real numbers, either there exists an element s of S such that x < s, or y is an upper bound of S.)Again, this is classically equivalent to the full least upper bound principle, since every set is located in classical mathematics.And again, while the definition of located set is complicated, nevertheless it is satisfied by many commonly studied sets, including all intervals and all compact sets.
Closely related to this, in constructive mathematics, fewer characterisations of compact spaces are constructively valid—or from another point of view, there are several different concepts that are classically equivalent but not constructively equivalent.Indeed, if the interval [''a'',''b''] were sequentially compact in constructive analysis, then the classical IVT would follow from the first constructive version in the example; one could find c as a cluster point of the infinite sequence (cn)n∈N.
See also
Further reading
- Book: Bishop, Errett . Errett Bishop . 1967 . Foundations of Constructive Analysis . 4-87187-714-0 .
- Book: Bridger, Mark . Real Analysis: A Constructive Approach . Wiley . Hoboken . 2007 . 0-471-79230-6 .
Notes and References
- Troelstra, A. S., van Dalen D., Constructivism in mathematics: an introduction 1; Studies in Logic and the Foundations of Mathematics; Springer, 1988;
- Book: Smith . Peter . An introduction to Gödel's Theorems . 2007 . Cambridge University Press . Cambridge, U.K. . 978-0-521-67453-9 . 2384958.
- Erik Palmgren, An Intuitionistic Axiomatisation of Real Closed Fields, Mathematical Logic Quarterly, Volume 48, Issue 2, Pages: 163-320, February 2002
- Bridges D., Ishihara H., Rathjen M., Schwichtenberg H. (Editors), Handbook of Constructive Mathematics; Studies in Logic and the Foundations of Mathematics; (2023) pp. 201-207
- Errett Bishop, Foundations of Constructive Analysis, July 1967
- Stolzenberg, Gabriel. Review: Errett Bishop, Foundations of Constructive Analysis. Bull. Amer. Math. Soc.. 1970. 76. 2. 301–323. 10.1090/s0002-9904-1970-12455-7. free.
- Robert S. Lubarsky, On the Cauchy Completeness of the Constructive Cauchy Reals, July 2015
- Bauer, A., Hanson, J. A. "The countable reals", 2022
- See, e.g., Theorem 1 in Bishop, 1967, p. 25
- [Akihiro Kanamori]
- 1804.05495. Constructive Reverse Mathematics. math.LO. Diener. Hannes. 2020.