Epsilon-induction explained
In set theory,
-induction
, also called epsilon-induction
or set-induction
, is a principle that can be used to prove that all sets satisfy a given property. Considered as an axiomatic principle, it is called the axiom schema of set induction
.The principle implies transfinite induction and recursion. It may also be studied in a general context of induction on well-founded relations.
Statement
The schema is for any given property
of sets and states that, if for every set
, the truth of
follows from the truth of
for all elements of
, then this property
holds for all sets. In symbols:
\forallx.((\forall(y\inx).\psi(y))\to\psi(x))\to\forallz.\psi(z)
Note that for the "bottom case" where
denotes the
empty set
, the subexpression
is
vacuously true for all propositions and so that implication is proven by just proving
.
In words, if a property is persistent when collecting any sets with that property into a new set (and this also requires establishing the property for the empty set), then the property is simply true for all sets. Said differently, persistence of a property with respect to set formation suffices to reach each set in the domain of discourse.
In terms of classes
by
.Let
be
and use the informal
as abbreviation for
. The principle then says that for any
,
\forall(x\subseteq\Psi).x\in\Psi\leftrightarrow\Psi={U}
Here the quantifier ranges over all
sets.In words this says that any class that contains all of its subsets is simply just the class of all sets.
Assuming bounded separation,
is a proper class. So the property
\forall(x\subseteq\Psi).x\in\Psi
is exhibited only by the proper class
, and in particular by no set. Indeed, note that any set is a subset of itself and under some more assumptions, already the self-membership will be ruled out.
For comparison to another property, note that for a class
to be
-
transitive means
\forall(x\in\Sigma).x\subseteq\Sigma
There are many transitive sets - in particular the set theoretical
ordinals.
Related notions of induction
Exportation proves
(A\to(B\toC))\leftrightarrow(B\to(A\toC))
. If
is
for some predicate
, it thus follows that
\forall(x\in\Sigma).((\forall(y\in(x\cap\Sigma)).P(y))\toP(x))\to\forall(z\in\Sigma).P(z)
where
is defined as
. If
is the universal class, then this is again just an instance of the schema.But indeed if
is any
-transitive class, then still
\forall(x\in\Sigma).(x\cap\Sigma=x)
and a version of set induction for
holds inside of
.
Ordinals
Ordinals may be defined as transitive sets of transitive sets. The induction situation in the first infinite ordinal
, the set of natural numbers, is discussed in more detail below. As set induction allows for induction in transitive sets containing
, this gives what is called
transfinite induction and definition by transfinite recursion using, indeed, the whole
proper class of ordinals. With ordinals, induction proves that all sets have ordinal rank and the rank of an ordinal is itself.
The theory of Von Neumann ordinals describes such sets and, there,
models the order relation
, which
classically is provably
trichotomous and total. Of interest there is the successor operation
that maps ordinals to ordinals. In the classical case, the induction step for successor ordinals can be simplified so that a property must merely be preserved between successive ordinals (this is the formulation that is typically understood as transfinite induction.) The sets are
-well-founded.
Well-founded relations
For a binary relation
on a set
,
well-foundedness can be defined by requiring a tailored induction property:
in the condition is abstracted to
, i.e. one always assumes
in place of the intersection
used in the statement above. It can be shown that for a well-founded relation
, there are no infinite descending
-sequences and so also
.Further, function definition by recursion with
can be defined on their domains, and so on.
Classically, well-foundedness of a relation on a set can also be characterized by the strong property of minimal element existence for every subset.With dependent choice, it can also be characterized by the weak property of non-existence of infinite descending chains.
For negative predicates
This section concerns the case of set induction and its consequences for predicates which are of a negated form,
.
Constructively, the resulting statements are generally weaker than set induction for general predicates. To establish equivalences, valid principles such as
\forallx.(A(x)\to\negB(x))\leftrightarrow\neg\existsx.(A(x)\landB(x))
,is commonly made use of, both sides saying that two predicates
and
can not, for any value, be validated simultaneously. The situation when double-negation elimination is permitted is discussed in the section right after.
Denoting the class
by
, this amounts to the special case of above with, for any
,
equal to the false statement
.One has
denoting
\neg\exists(y\in\Sigma).y\inx
. Writing
for the statement that all sets are not members of the class
, the induction schema reduces to
\neg\exists(x\in\Sigma).x\cap\Sigma=\{\}\leftrightarrow\Sigma=\{\}
In words, a property (a class) such that there is no
-minimal set for it is simply the false property (the empty set). (A minimal
for a relation
is one for which there does not exist another
with
. Here the membership relation restricted to
is considered, i.e. a minimal element with respect to
is one without a
.)
Infinite descending chains
The antecedent in the above implication may be expressed as
\forall(x\in\Sigma).\neg\neg\exists(y\in\Sigma).y\inx
. It holds for empty set
vacuously. In the presence of any descending membership chain as a function on
, the axiom of replacement proves existence of a set
that also fulfills this. So assuming the induction principle makes the existence of such a chain contradictory.
In this paragraph, assume the axiom of dependent choice in place of the induction principle. Any consequences of the above antecedent is also implied by the
-statement obtained by removing the double-negation, which constructively is a stronger condition. Consider a
set
with this
-property. Assuming the set is
inhabited, dependent choice implies the existence of an infinite descending membership chain as sequence, i.e. a function
on the naturals. So establishing (or even postulating) the non-existence of such a chain for a set with the
-property implies the assumption was wrong, i.e. also
.
So set induction relates to the postulate of non-existence of infinite descending chains. But given the extra assumptions required in the latter case, the mere non-existence postulate is relatively weak in comparison.
Self-membership
For a contradiction, assume there exists an inhabited set
with the particular property that it is equal to its own singleton set,
. Formally,
\forally.(y\ins\leftrightarrowy=s)
, from which it follows that
, and also that all members of
share all its properties, e.g.
. From the previous form of the principle it follow that
, a contradiction.
Discussed using the other auxiliary terminologies above, one studies set induction for the class
of sets that are not equal to such an
. So in terms of the negated predicate,
is the predicate
, meaning a set that exhibits
has the defining properties of
. Using the set builder notation, one is concerned with
. Assuming the special property of
, any empty intersection statement
simplifies to just
. The principle in the formulation in terms of
reduces to
, again a contradiction. Back to the very original formulation, it is concluded that
and
is simply the domain of all sets. In a theory with set induction, a
with the described recursive property is not actually a set in the first place.
A similar analysis may be applied also to more intricate scenarios. For example, if
and
were both sets, then the inhabited
would exists by
pairing, but this also has the
-property.
Contrapositive
The contrapositive of the form with negation is constructively even weaker but it is only one double negation elimination away from the regularity claim for
,
\Sigma ≠ \{\}\to\neg\neg\exists(x\in\Sigma).x\cap\Sigma=\{\}
With double-negations in antecedent and conclusion, the antecedent may equivalently be replaced with
.
Classical equivalents
Disjunctive form
The excluded middle statement for a universally quantified predicate can classically be expressed as follows: Either it holds for all terms, or there exist a term for which the predicate fails
\forallz.P(z)\lor\existsx.\negP(x)
With this, using the disjunctive syllogism, ruling out the possibility of counter-examples classically proves a property for all terms.This purely logical principle is unrelated to other relations between terms, such elementhood (or succession, see below).Using that
is classically an equivalence, and also using double-negation elimination, the induction principle can be translated to the following statement:
\forallz.P(z)\lor\existsx.(\negP(x)\land\forall(y\inx).P(y))
This expresses that, for any predicate
, either it holds for all sets, or there is some set
for which
does not hold while
is at the same time true for all elements of
. Relating it back to the original formulation: If one can, for any set
, prove that
implies
, which includes a proof of the bottom case
, then the failure case is ruled out and, then, by the
disjunctive syllogism the disjunct
holds.
For the task of proving
by ruling out the existence of counter-examples, the induction principle thus plays a similar role as the excluded middle disjunction, but the former is commonly also adopted in constructive frameworks.
Relation to regularity
The derivation in a previous section shows that set induction classically implies
\Sigma ≠ \{\}\to\exists(x\in\Sigma).x\cap\Sigma=\{\}
In words, any property that is exhibited by some set is also exhibited by a "minimal set"
, as defined above. In terms of classes, this states that every non-empty class
has a member
that is disjoint from it.
In first-order set theories, the common framework, the set induction principle is an axiom schema, granting an axiom for any predicate (i.e. class). In contrast, the axiom of regularity is a single axiom, formulated with a universal quantifier only over elements of the domain of discourse, i.e. over sets. If
is a set and the induction schema is assumed, the above is the instance of the axiom of regularity for
. Hence, assuming set induction over a classical logic (i.e. assuming the
law of excluded middle), all instances of regularity hold.
In a context with an axiom of separation, regularity also implies excluded middle (for the predicates allowed in ones separation axiom). Meanwhile, the set induction schema does not imply excluded middle, while still being strong enough to imply strong induction principles, as discussed above. In turn, the schema is, for example, adopted in the constructive set theory CZF, which has type theoretic models. So within such a set theory framework, set induction is a strong principle strictly weaker than regularity. When adopting the axiom of regularity and full Separation, CZF equals standard ZF.
History
, a theory without regularity or replacement.
The theory
does not prove all set induction instances. Regularity is classically equivalent to the contrapositive of set induction for negated statements, as demonstrated. The bridge from sets back to classes is demonstrated below.
Set induction from regularity and transitive sets
Assuming regularity, one may use classical principles, like the reversal of a contrapositive. Moreover, an induction schema stated in terms of a negated predicate
is then just as strong as one in terms of a predicate variable
, as the latter simply equals
. As the equivalences with the contrapositive of set induction have been discussed, the task is to translate regularity back to a statement about a general class
. It works in the end because the axiom of separation allows for intersection between sets and classes. Regularity only concerns intersection
inside a set and this can be flattened using transitive sets.
The proof is by manipulation of the regularity axiom instance
s ≠ \{\}\to\exists(x\ins).x\caps=\{\}
for a particular sub
set
of the class
. Observe that given a class
and any transitive set
, one may define
which has
x\ins\to(x\in\Sigma\landx\subseteqt)
and also
(x\subseteqt)\to(x\caps=x\cap\Sigma)
. With this, the set
may always be replaced with the class
in the conclusion of the regularity instance.
The remaining aim is to obtain a statement that also has it replaced in the antecedent, that is, establish the principle holds when assuming the more general
. So assume there is some
, together with the existence of some transitive set
that has
as subset. An intersection
may be constructed as described and it also has
. Consider excluded middle for whether or not
is disjoint from
, i.e.
. If
is empty, then also
and
itself always fulfills the principle. Otherwise,
by regularity and one can proceed to manipulate the statement by replacing
with
as discussed. In this case, one even obtains a slightly stronger statement than the one in the previous section, since it carries the sharper information that
and not just
.
Transitive set existence
The proof above assumes the existence of some transitive set containing any given set. This may be postulated, the transitive containment axiom.
Existence of the stronger transitive closure with respect to membership, for any set, can also be derived from some stronger standard axioms. This needs the axiom of infinity for
as a set, recursive functions on
, the axiom of replacement on
and finally the
axiom of union. I.e. it needs many standard axioms, just sparing the
axiom of powerset. In a context without strong separation, suitable function space principles may have to be adopted to enable recursive function definition.
} minus infinity also only proves the existence of transitive closures when
Regularity is promoted to Set induction.
Comparison of epsilon and natural number induction
of the standard natural numbers is the first infinite ordinal. There, the binary membership relation "
" of set theory exactly models the strict ordering of natural numbers "
". Then, the principle derived from set induction is complete induction.
} (or
Heyting arithmetic
}). The
signature includes the constant symbol "
", the successor function symbol "
" and the addition and multiplication function symbols "
" resp "
". With it, the naturals form a semiring, which always come with a canonical non-strict preorder "
", and the irreflexive
may be defined in terms of that. Similarly, the binary ordering relation
is also definable as
.
For any predicate
, the complete induction principle reads
\foralln.((\forall(k<n).Q(k))\toQ(n))\to\forallm.Q(m)
Making use of
(\forall(k<Sn).Q(k))\leftrightarrow(\forall(k<n).Q(k))\landQ(n)
, the principle is already implied by
standard form of the mathematical induction schema. The latter is expressed not in terms of the decidable order relation "
" but the primitive symbols,
(\phi(0)\land\foralln.(\phi(n)\to\phi(Sn)))\to\forallm.\phi(m)
Lastly, a statement may be proven that merely makes use of the successor symbol and still mirrors set induction: Define a new predicate
as
(n=0)\lor\existsp.(Sp=n\landQ(p))
. It holds for zero by design and so, akin to the bottom case in set induction, the implication
is equivalent to just
. Using induction,
} proves that every
is either zero or has a computable unique predecessor, a
with
. Hence
Q-1(Sq)\leftrightarrowQ(q)
. When
is the successor of
, then
expresses
. By case analysis, one obtains
\foralln.(Q-1(n)\toQ(n))\to\forallm.Q(m)
Classical equivalents
Using the classical principles mentioned above, the above may be expressed as
\forallm.Q(m)\lor\existsn.(\negQ(n)\land\forall(k<n).Q(k))
It expresses that, for any predicate
, either
hold for all numbers, or there is some natural number
for which
does not hold despite
holding for all predecessors.
Instead of
, one may also use
and obtain a related statement. It constrains the task of ruling out counter-examples for a property of natural numbers: If the bottom case
is validated and one can prove, for any number
, that the property
is always passed on to
, then this already rules out a failure case. Moreover, if a failure case exists, one can use the
least number principle to even prove the existence of a
minimal such failure case.
Least number principle
As in the set theory case, one may consider induction for negated predicates and take the contrapositive. After use of a few classical logical equivalences, one obtains a conditional existence claim.
Let
denote the set of natural numbers
validating a property
. In the Neumann model, a natural number
is extensionally equal to
, the set of numbers smaller than
. The
least number principle, obtained from complete induction, here expressed in terms of sets, reads
\Theta ≠ \{\}\to\neg\neg\exists(n\in\Theta).n\cap\Theta=\{\}
In words, if it cannot be ruled out that some number has the property
, then it can also not be consistently ruled out that a least such number
exists. In classical terms, if there is any number validating
, then there also exists a least such number validating
. Least here means that no other number
is validating
. This principle should be compared with regularity.
For decidable
and any given
with
, all
can be tested.Moreover, adopting
Markov's principle in arithmetic allows removal of double-negation for decidable
in general.
See also