This article examines the implementation of mathematical concepts in set theory. The implementation of a number of basic mathematical concepts is carried out in parallel in ZFC (the dominant set theory) and in NFU, the version of Quine's New Foundations shown to be consistent by R. B. Jensen in 1969 (here understood to include at least axioms of Infinity and Choice).
What is said here applies also to two families of set theories: on the one hand, a range of theories including Zermelo set theory near the lower end of the scale and going up to ZFC extended with large cardinal hypotheses such as "there is a measurable cardinal"; and on the other hand a hierarchy of extensions of NFU which is surveyed in the New Foundations article. These correspond to different general views of what the set-theoretical universe is like, and it is the approaches to implementation of mathematical concepts under these two general views that are being compared and contrasted.
It is not the primary aim of this article to say anything about the relative merits of these theories as foundations for mathematics. The reason for the use of two different set theories is to illustrate that multiple approaches to the implementation of mathematics are feasible. Precisely because of this approach, this article is not a source of "official" definitions for any mathematical concept.
The following sections carry out certain constructions in the two theories ZFC and NFU and compare the resulting implementations of certain mathematical structures (such as the natural numbers).
Mathematical theories prove theorems (and nothing else). So saying that a theory allows the construction of a certain object means that it is a theorem of that theory that that object exists. This is a statement about a definition of the form "the x such that
\phi
\phi
\phi
\phi
ZFC and NFU share the language of set theory, so the same formal definitions "the x such that
\phi
\{x\mid\phi\}
x\inA\leftrightarrow\phi
\phi
\{x\inB\mid\phi\}
\{x\midx\inB\wedge\phi\}
\{f(x1,\ldots,xn)\mid\phi\}
\{z\mid\existsx1,\ldots,xn(z=f(x1,...,xn)\wedge\phi)\}
f(x1,\ldots,xn)
Expressions definable in set-builder notation make sense in both ZFC and NFU: it may be that both theories prove that a given definition succeeds, or that neither do (the expression
\{x\midx\not\inx\}
\omega
\omega
Whatever is proven to exist in a theory clearly provably exists in any extension of that theory; moreover, analysis of the proof that an object exists in a given theory may show that it exists in weaker versions of that theory (one may consider Zermelo set theory instead of ZFC for much of what is done in this article, for example).
These constructions appear first because they are the simplest constructions in set theory, not because they are the first constructions that come to mind in mathematics (though the notion of finite set is certainly fundamental). Even though NFU also allows the construction of set ur-elements yet to become members of a set, the empty set is the unique set with no members:
\left.\varnothing\right.\overset{def.
x
\{x\}
x
\left\{x\right\}\overset{def.
x
y
\{x,y\}
x
y
\left\{x,y\right\}\overset{def.
\left.x\cupy\right.\overset{def.
n
n
\left\{x1,\ldots,xn,xn+1\right\}\overset{def.
x\cupy=cup\{x,y\}
See main article: article and Ordered pair. First, consider the ordered pair. The reason that this comes first is technical: ordered pairs are needed to implement relations and functions, which are needed to implement other concepts which may seem to be prior.The first definition of the ordered pair was the definition
(x,y)\overset{def
(x,y)\overset{def.
(x,y)
(x,y)
(x,y)=(z,w) \equiv x=z\wedgey=w
Relations are sets whose members are all ordered pairs. Where possible, a relation
R
\{(x,y)\midxRy\}
\{z\mid\pi1(z)R\pi2(z)\}
R
xRy
\left(x,y\right)\inR
In ZFC, some relations (such as the general equality relation or subset relation on sets) are 'too large'to be sets (but may be harmlessly reified as proper classes). In NFU, some relations (such as the membership relation) are not sets because their definitions are not stratified: in
\{(x,y)\midx\iny\}
x
y
x
y
Let
R
S
The converse of
R
\left\{\left(y,x\right):xRy\right\}
The domain of
R
\left\{x:\existsy\left(xRy\right)\right\}
The range of
R
R
\left\{y:\existsx\left(xRy\right)\right\}
The field of
R
R
The preimage of a member
x
R
\left\{y:yRx\right\}
The downward closure of a member
x
R
D
x
zRy
y\inD
R
The relative product
R|S
R
S
\left\{\left(x,z\right):\existsy\left(xRy\wedgeySz\right)\right\}
Notice that with our formal definition of a binary relation, the range and codomain of a relation are not distinguished. This could be done by representing a relation
R
B
\left(R,B\right)
In ZFC, any relation whose domain is a subset of a set
A
B
A x B=\left\{\left(a,b\right):a\inA\wedgeb\inB\right\}
l{P}\left(A\cupB\right)
\left\{\left(x,y\right)\inA x B:xRy\right\}
x
y
R
xRy
A binary relation
R
xRx
x
R
\forallx,y(xRy\toyRx)
\forallx,y,z(xRy\wedgeyRz → xRz)
\forallx,y(xRy\wedgeyRx → x=y)
S
R
\existsx\inS
R
S
x,y
R
x=y
x
y
R
Relations having certain combinations of the above properties have standard names. A binary relation
R
R
R
R
x,y
R
xRy
yRx
R
R
R
F
\forallx,y,z\left(xFy\wedgexFz\toy=z\right).
F
\left\{\left(x,y\right):xFy\right\}
F
\forallx,y,z\left(\left(x,y\right)\inF\wedge\left(x,z\right)\inF\toy=z\right).
F\left(x\right)
y
xFy
x
F
y
f
x
y
y
\left(x,y\right)\inF
F\left(x\right)
F
Outside of formal set theory, we usually specify a function in terms of its domain and codomain, as in the phrase "Let
f:A\toB
A
B
A
B
f
A
B
A
C
C
B
Indeed, no matter which set we consider to be the codomain of a function, the function does not change as a set since by definition it is just a set of ordered pairs. That is, a function does not determine its codomain by our definition. If one finds this unappealing then one can instead define a function as the ordered pair
(f,B)
f
B
(x,y,z)=(x,(y,z))
(f,A,B)
R\subseteqA x B
R
domR\subseteqA
ranR\subseteqB
In NFU,
x
F\left(x\right)
F
F\left(x\right)
F\left[A\right]
\left\{y:\existsx\left(x\inA\wedgey=F\left(x\right)\right)\right\}
A
\left\{F\left(x\right):x\inA\right\}
A
F
F\left[A\right]
F\left[A\right]
A
F
F\left[A\right]
The function
I
I\left(x\right)=x
I
S
S\left(x\right)=\left\{x\right\}
S
Let
f
g
f
g
g\circf
f|g
g\circf
\left(g\circf\right)\left(x\right)=g\left(f\left(x\right)\right)
f
g
f
f\left(-1\right)
f
A
iA
\left\{\left(x,x\right)\midx\inA\right\}
A function is an injective (also called one-to-one) if it has an inverse function.
A function
f
A
B
A
B
f
A
B
A
B
f
B
A
B
f
Defining functions as ordered pairs
(f,B)
(f,A,B)
A
B
B
In both ZFC and NFU, two sets A and B are the same size (or are equinumerous) if and only if there is a bijection f from A to B. This can be written as
|A|=|B|
|A|
|B|
A\simB
Similarly, define
|A|\leq|B|
It is straightforward to show that the relation of equinumerousness is an equivalence relation: equinumerousness of A with A is witnessed by
iA
|A|=|B|
f-1
|B|=|A|
|A|=|B|
|B|=|C|
g\circf
|A|=|C|
It can be shown that
|A|\leq|B|
|A|\leq|B|\wedge|B|\leq|A| → |A|=|B|
|A|\leq|B|\vee|B|\leq|A|
Natural numbers can be considered either as finite ordinals or finite cardinals. Here consider them as finite cardinal numbers. This is the first place where a major difference between the implementations in ZFC and NFU becomes evident.
The Axiom of Infinity of ZFC tells us that there is a set A which contains
\emptyset
y\cup\{y\}
y\inA
\{x\inA\mid\forallB(\emptyset\inB\wedge\forally(y\inB → y\cup\{y\}\inB) → x\inB)\}
y\mapstoy\cup\{y\}
In ZFC, a set
A
n\inN
|n|=|A|
|A|
The usual operations of arithmetic can be defined recursively and in a style very similar to that in which the set of natural numbers itself is defined. For example, + (the addition operation on natural numbers) can be defined as the smallest set which contains
((x,\emptyset),x)
x
((x,y\cup\{y\}),z\cup\{z\})
((x,y),z)
In NFU, it is not obvious that this approach can be used, since the successor operation
y\cup\{y\}
The standard definition of the natural numbers, which is actually the oldest set-theoretic definition of natural numbers, is as equivalence classes of finite sets under equinumerousness. Essentially the same definition is appropriate to NFU (this is not the usual definition, but the results are the same): define Fin, the set of finite sets, as
\{A\mid\forallF(\emptyset\inF\wedge\forallx,y(x\inF → x\cup\{y\}\inF) → A\inF)\}
A\inFin
|A|
\{B\midA\simB\}
\{|A|\midA\inFin\}
The Axiom of Infinity of NFU can be expressed as
V\not\inFin
|A|
|A\cup\{x\}|
x\not\inA
The operations of arithmetic can be defined in a style similar to the style given above (using the definition of successor just given). They can also be defined in a natural set theoretical way: if A and B are disjoint finite sets, define |A|+|B| as
|A\cupB|
\{A\mid\existsB,C(B\inm\wedgeC\inn\wedgeB\capC=\emptyset\wedgeA=B\cupC)\}
The two implementations are quite different. In ZFC, choose a representative of each finite cardinality (the equivalence classes themselves are too large to be sets); in NFU the equivalence classes themselves are sets, and are thus an obvious choice for objects to stand in for the cardinalities. However, the arithmetic of the two theories is identical: the same abstraction is implemented by these two superficially different approaches.
A general technique for implementing abstractions in set theory is the use of equivalence classes. If an equivalence relation R tells us that elements of its field A are alike in some particular respect, then for any set x, regard the set
[x]R=\{y\inA\midxRy\}
For any set A, a set
P
A=cupP
For every equivalence relation R with field A,
\{[x]R\midx\inA\}
\{(x,y)\mid\existsA\inP(x\inA\wedgey\inA)\}
This technique has limitations in both ZFC and NFU. In ZFC, since the universe is not a set, it seems possible to abstract features only from elements of small domains. This can be circumvented using a trick due to Dana Scott: if R is an equivalence relation on the universe, define
[x]R
yRx
zRx
[x]R
[x]R
x\mapsto[x]R
\{x\}\mapsto[x]R
[x]R
Two well-orderings
W1
W2
W1\simW2
W1
W2
xW1y\leftrightarrowf(x)W2f(y)
Similarity is shown to be an equivalence relation in much the same way that equinumerousness was shown to be an equivalence relation above.
In New Foundations (NFU), the order type of a well-ordering W is the set of all well-orderings which are similar to W. The set of ordinal numbers is the set of all order types of well-orderings.
This does not work in ZFC, because the equivalence classes are too large. It would be formally possible to use Scott's trick to define the ordinals in essentially the same way, but a device of von Neumann is more commonly used.
For any partial order
\leq
\{(x,y)\midx\leqy\wedgex ≠ y\}
A set A is said to be transitive if
cupA\subseteqA
In ZFC, the order type of a well-ordering W is then defined as the unique von Neumann ordinal which is equinumerous with the field of W and membership on which is isomorphic to the strict well-ordering associated with W. (the equinumerousness condition distinguishes between well-orderings with fields of size 0 and 1, whose associated strict well-orderings are indistinguishable).
In ZFC there cannot be a set of all ordinals. In fact, the von Neumann ordinals are an inconsistent totality in any set theory: it can be shown with modest set theoretical assumptions that every element of a von Neumann ordinal is a von Neumann ordinal and the von Neumann ordinals are strictly well-ordered by membership. It follows that the class of von Neumann ordinals would be a von Neumann ordinal if it were a set: but it would then be an element of itself, which contradicts the fact that membership is a strict well-ordering of the von Neumann ordinals.
The existence of order types for all well-orderings is not a theorem of Zermelo set theory: it requires the Axiom of replacement. Even Scott's trick cannot be used in Zermelo set theory without an additional assumption (such as the assumption that every set belongs to a rank which is a set, which does not essentiallystrengthen Zermelo set theory but is not a theorem of that theory).
In NFU, the collection of all ordinals is a set by stratified comprehension. The Burali-Forti paradox is evaded in an unexpected way. There is a natural order on the ordinals defined by
\alpha\leq\beta
W1\in\alpha
W2\in\beta
\Omega
\Omega
\Omega
\Omega
\alpha
\alpha
\alpha
\alpha
\alpha
T4(\alpha)
\alpha
T(\alpha)
W\iota=\{(\{x\},\{y\})\midxWy\}
W\in\alpha
\Omega
T4(\Omega)
T4(\Omega)<\Omega
T4
T2
This shows that the T operation is nontrivial, which has a number of consequences. It follows immediately that the singleton map
x\mapsto\{x\}
W\iota
T4(\Omega)<\Omega
\Omega>T(\Omega)>T2(\Omega)\ldots
Ordinals fixed by T are called Cantorian ordinals, and ordinals which dominate only cantorian ordinals (which are easily shown to be cantorian themselves) are said to be strongly cantorian. There can be no set of cantorian ordinals or set of strongly cantorian ordinals.
It is possible to reason about von Neumann ordinals in NFU. Recall that a von Neumann ordinal is a transitive set A such that the restriction of membership to A is a strict well-ordering. This is quite a strong condition in the NFU context, since the membership relation involves a difference of type. A von Neumann ordinal A is not an ordinal in the sense of NFU, but
\in\lceilA
\alpha
\alpha
T(\alpha)
The only von Neumann ordinals which can be shown to exist in NFU without additional assumptions are the concrete finite ones. However, the application of a permutation method can convert any model of NFU to a model in which every strongly cantorian ordinal is the order type of a von Neumann ordinal. This suggests that the concept "strongly cantorian ordinal of NFU" might be a better analogue to "ordinal of ZFC" than is the apparent analogue "ordinal of NFU".
Cardinal numbers are defined in NFU in a way which generalizes the definition of naturalnumber: for any set A,
|A|\overset{def
In ZFC, these equivalence classes are too large as usual. Scott's trick could be used (and indeed is used in ZF),
|A|
The natural order on cardinal numbers is seen to be a well-ordering: that it is reflexive, antisymmetric (on abstract cardinals, which are now available) and transitive has been shown above. That it is a linear order follows from the Axiom of Choice: well-order two sets and an initial segment of one well-ordering will be isomorphic to the other, so one set will have cardinality smaller than that of the other. That it is a well-ordering follows from the Axiom of Choice in a similar way.
With each infinite cardinal, many order types are associated for the usual reasons (in either set theory).
Cantor's theorem shows (in both theories) that there are nontrivial distinctions between infinite cardinal numbers. In ZFC, one proves
|A|<|P(A)|.
|P1(A)|<|P(A)|
P1(A)
|P1(V)|<|P(V)|
x\mapsto\{x\}
P1(V)
|P1(V)|<|P(V)|\ll|V|
\ll
T(|A|)=|P1(A)|
A set A is said to be cantorian just in case
|A|=|P1(A)|=T(|A|)
|A|
(x\mapsto\{x\})\lceilA
The operations of cardinal arithmetic are defined in a set-theoretically motivated way in both theories.
|A|+|B|=\{C\cupD\midC\simA\wedgeD\simB\wedgeC\capD=\emptyset\}
|A| ⋅ |B|
|A x B|
|A| ⋅ |B|
T-2(|A x B|)
Defining the exponential operation on cardinals requires T in an essential way: if
BA
|B||A|
T-3(|BA|)
T-1
T-3
2|V|
|B||A|
|BA|
The exponential operation is total and behaves exactly as expected on cantorian cardinals, since T fixes such cardinals and it is easy to show that a function space between cantorian sets is cantorian (as are power sets, cartesian products, and other usual type constructors). This offers further encouragement to the view that the "standard" cardinalities in NFU are the cantorian (indeed, the strongly cantorian) cardinalities, just as the "standard" ordinals seem to be the strongly cantorian ordinals.
Now the usual theorems of cardinal arithmetic with the axiom of choice can be proved, including
\kappa ⋅ \kappa=\kappa
|V| ⋅ |V|=|V|
|V| ⋅ |V|=T-2(|V x V|)
|V|
|V x V|=T2(|V|)=
2(V)| | |
|P | |
1 |
(a,b)
\{\{c\}\}
(a,b)
\{\{c\}\}
(a,b)
So there are two different implementations of the natural numbers in NFU (though they are the same in ZFC): finite ordinals and finite cardinals. Each of these supports a T operation in NFU (basically the same operation). It is easy to prove that
T(n)
|N|
\omega
T(n)=n
T(n)=n
|\{1,\ldots,n\}|=n
|\{1,\ldots,n\}|=T2(n)
A consequence of Counting is that N is a strongly cantorian set (again, this is an equivalent assertion).
The type of any variable restricted to a strongly cantorian set A can be raised or lowered as desired by replacing references to
a\inA
cupf(a)
f(a)
f-1(\{a\})
f(a)=\{a\}
a\inA
Any subset of a strongly cantorian set is strongly cantorian. The power set of a strongly cantorian set is strongly cantorian. The cartesian product of two strongly cantorian sets is strongly cantorian.
Introducing the Axiom of Counting means that types need not be assigned to variables restricted to N or to P(N), R (the set of reals) or indeed any set ever considered in classical mathematics outside of set theory.
There are no analogous phenomena in ZFC. See the main New Foundations article for stronger axioms that can be adjoined to NFU to enforce "standard" behavior of familiar mathematical objects.
Represent positive fractions as pairs of positive natural numbers (0 is excluded):
pq | |
(p,q)
pq | |
= |
rs | |
\leftrightarrow |
ps=qr
\sim
(p,q)\sim(r,s)\leftrightarrowps=qr
Represent magnitudes (positive reals) as nonempty proper initial segments of the positive rationals with no largest element. The operations of addition and multiplication on magnitudes are implemented by elementwise addition of the positive rational elements of the magnitudes. Order is implemented as set inclusion.
Represent real numbers as differences
m-n
(m,n)
\sim
(m,n)\sim(r,s)\leftrightarrowm+s=n+r
This is the briefest sketch of the constructions. Note that the constructions are exactly the same in ZFC and in NFU, except for the difference in the constructions of the natural numbers: since all variables are restricted to strongly cantorian sets, there is no need to worry about stratification restrictions. Without the Axiom of Counting, it might be necessary to introduce some applications of T in a full discussion of these constructions.
In this class of constructions it appears that ZFC has an advantage over NFU: though the constructions are clearly feasible in NFU, they are more complicated than in ZFC for reasons having to do with stratification.
Throughout this section assume a type-level ordered pair. Define
(x1,x2,\ldots,xn)
(x1,(x2,\ldots,xn))
General cartesian products are defined similarly:
A1 x A2 x \ldots x An=A1 x (A2 x \ldots x An)
The definitions are the same in ZFC but without any worries about stratification (the grouping given here is opposite to that more usually used, but this is easily corrected for).
Now consider the infinite cartesian product
\PiiAi
f(i)\inAi
Ai
In NFU, this is requires attention to type. Given a set I and set valued function A whose value at
\{i\}
P1(I)
Ai
\PiiAi
f(i)\inAi
f(i)\inAi=A(\{i\})
Ai
Now consider the product
\Pii|Ai|
\PiiAi
|Ai|
T-1(|\PiiAi|)
Repeat this for disjoint unions of families of sets and sums of families of cardinals. Again, let A be a set-valued function with domain
P1(I)
Ai
A(\{i\})
\SigmaiAi
\{(i,a)\mida\inAi\}
Ai
The correct definition of the sum
\Sigmai|Ai|
|\SigmaiAi|
It is possible to extend these definitions to handle index sets which are not sets of singletons, but this introduces an additional type level and is not needed for most purposes.
In ZFC, define the disjoint union
\SigmaiAi
\{(i,a)\mida\inAi\}
Ai
A(i)
Permutation methods can be used to show relative consistency with NFU of the assertion that for every strongly cantorian set A there is a set I of the same size whose elements are self-singletons:
i=\{i\}
In ZFC, define the cumulative hierarchy as the ordinal-indexed sequence of sets satisfying the following conditions:
V0=\emptyset
V\alpha+1=P(V\alpha)
Vλ=cup\{V\beta\mid\beta<λ\}
λ
\alpha
A\inV\alpha+1-V\alpha
The cardinal
|P(V\omega)|
\beth\alpha
This construction cannot be carried out in NFU because the power set operation is not a set function in NFU (
P(A)
The sequence of cardinals
\beth\alpha
2|A|
T-1(|\{0,1\}A|)
\{0,1\}
|\{0,1\}A|=|P(A)|
\beth
|N|
2|A|
|A|
A convention for ordinal indexing of any well-ordering
W\alpha
W
W
\{y\midyWx\}
\alpha
\beth\alpha
\alpha
\beth
\aleph\alpha
\alpha
\aleph0=|N|
\alpha
W\alpha
Each set A of ZFC has a transitive closure
TC(A)
\in\lceilTC(A)
\in\lceilTC(A)
This suggests that (an initial segment of) the cumulative hierarchy can be studied by considering the isomorphism classes of set pictures. These isomorphism classes are sets and make up a set in NFU. There is a natural set relation analogous to membership on isomorphism classes of set pictures: if
x
[x]
[x]E[y]
[x]
A\inB
There is a T operation on isomorphism classes of set pictures analogous to the T operation on ordinals: if x is a set picture, so is
x\iota=\{(\{a\},\{b\})\mid(a,b)\inx\}
T([x])
[x\iota]
[x]E[y]\leftrightarrowT([x])=T([y])
An axiom of extensionality for this simulated set theory follows from E's extensionality. From its well-foundedness follows an axiom of foundation. There remains the question of what comprehension axiom E may have. Consider any collection of set pictures
\{x\iota\midx\inS\}
x\iota
\{a\}
x\iota
(x,\{a\})
[x\iota]=T([x])
[y]
In particular, there will be an isomorphism type [v] whose preimage under E is the collection of all T[''x'']'s (including T[''v'']). Since T[''v''] E v and E is well-founded,
T[v] ≠ v
There are ranks of isomorphism classes of set pictures just as there are ranks of sets in the usual set theory. For any collection of set pictures A, define S(A) as the set of all isomorphism classes of set pictures whose preimage under E is a subset of A; call A a "complete" set if every subset of A is a preimage under E. The collection of "ranks" is the smallest collection containing the empty set and closed under the S operation (which is a kind of power set construction) and under unions of its subcollections. It is straightforward to prove (much as in the usual set theory) that the ranks are well-ordered by inclusion, and so the ranks have an index in this well-order: refer to the rank with index
\alpha
R\alpha
|R\alpha|=\beth\alpha
R\alpha
R\alpha
RT(\alpha)
T(\alpha)<\alpha
[x]\inNFU[y]
T([x])E[y]\wedge[y]\inRT(\alpha)+1
ENFU
\in
So there is a natural construction inside NFU of the cumulative hierarchy of sets which internalizes the natural construction of a model of NFU in Zermelo-style set theory.
Under the Axiom of Cantorian Sets described in the New Foundations article, the strongly cantorian part of the set of isomorphism classes of set pictures with the E relation as membership becomes a (proper class) model of ZFC (in which there are n-Mahlo cardinals for each n; this extension of NFU is strictly stronger than ZFC). This is a proper class model because the strongly cantorian isomorphism classes do not make up a set.
Permutation methods can be used to create from any model of NFU a model in which every strongly cantorian isomorphism type of set pictures is actually realized as the restriction of the true membership relation to the transitive closure of a set.