In mathematics, the Cantor set is a set of points lying on a single line segment that has a number of unintuitive properties. It was discovered in 1874 by Henry John Stephen Smith[1] [2] [3] [4] and mentioned by German mathematician Georg Cantor in 1883.[5] [6]
Through consideration of this set, Cantor and others helped lay the foundations of modern point-set topology. The most common construction is the Cantor ternary set, built by removing the middle third of a line segment and then repeating the process with the remaining shorter segments. Cantor mentioned this ternary construction only in passing, as an example of a perfect set that is nowhere dense ([5] Anmerkungen zu §10, /p. 590).
More generally, in topology, a Cantor space is a topological space homeomorphic to the Cantor ternary set (equipped with its subspace topology). By a theorem of L. E. J. Brouwer, this is equivalent to being perfect, nonempty, compact, metrizable and zero dimensional.[7]
The Cantor ternary set
l{C}
style\left[0,1\right]
[0,1]
C0:=[0,1]
Cn:=
Cn-1 | |
3 |
\cup\left(
2 | + | |
3 |
Cn-1 | |
3 |
\right)=
13 | |
l(C |
n-1\cup\left(2+Cn-1\right)r)
n\ge1
l{C}:=
{\color{Blue}\limn\toinftyCn}
=
infty | |
cap | |
n=0 |
Cn=
infty | |
cap | |
n=m |
Cn
m\ge0
The first six steps of this process are illustrated below.
Using the idea of self-similar transformations,
TL(x)=x/3,
TR(x)=(2+x)/3
Cn=TL(Cn-1)\cupTR(Cn-1),
l{C}=[0,1]\setminus
infty | |
cup | |
n=0 |
3n-1 | ||
cup | \left( | |
k=0 |
3k+1 | , | |
3n+1 |
3k+2 | |
3n+1 |
\right),
infty | |
l{C}=cap | |
n=1 |
3n-1-1 | |
cup | |
k=0 |
\left(\left[
3k+0 | , | |
3n |
3k+1 | |
3n |
\right]\cup\left[
3k+2 | , | |
3n |
3k+3 | |
3n |
\right]\right),
This process of removing middle thirds is a simple example of a finite subdivision rule. The complement of the Cantor ternary set is an example of a fractal string.
[0,1]
In The Fractal Geometry of Nature, mathematician Benoit Mandelbrot provides a whimsical thought experiment to assist non-mathematical readers in imagining the construction of
l{C}
CURDLING: The construction of the Cantor bar results from the process I call curdling. It begins with a round bar. It is best to think of it as having a very low density. Then matter "curdles" out of this bar's middle third into the end thirds, so that the positions of the latter remain unchanged. Next matter curdles out of the middle third of each end third into its end thirds, and so on ad infinitum until one is left with an infinitely large number of infinitely thin slugs of infinitely high density. These slugs are spaced along the line in the very specific fashion induced by the generating process. In this illustration, curdling (which eventually requires hammering!) stops when both the printer's press and our eye cease to follow; the last line is indistinguishable from the last but one: each of its ultimate parts is seen as a gray slug rather than two parallel black slugs.
Since the Cantor set is defined as the set of points not excluded, the proportion (i.e., measure) of the unit interval remaining can be found by total length removed. This total is the geometric progression
infty | |
\sum | |
n=0 |
2n | |
3n+1 |
=
1 | |
3 |
+
2 | |
9 |
+
4 | |
27 |
+
8 | |
81 |
+ … =
1 | \left( | |
3 |
1 | |||
|
\right)=1.
So that the proportion left is 1 − 1 = 0.
This calculation suggests that the Cantor set cannot contain any interval of non-zero length. It may seem surprising that there should be anything left—after all, the sum of the lengths of the removed intervals is equal to the length of the original interval. However, a closer look at the process reveals that there must be something left, since removing the "middle third" of each interval involved removing open sets (sets that do not include their endpoints). So removing the line segment from the original interval [0, 1] leaves behind the points and . Subsequent steps do not remove these (or other) endpoints, since the intervals removed are always internal to the intervals remaining. So the Cantor set is not empty, and in fact contains an uncountably infinite number of points (as follows from the above description in terms of paths in an infinite binary tree).
It may appear that only the endpoints of the construction segments are left, but that is not the case either. The number, for example, has the unique ternary form 0.020202... = . It is in the bottom third, and the top third of that third, and the bottom third of that top third, and so on. Since it is never in one of the middle segments, it is never removed. Yet it is also not an endpoint of any middle segment, because it is not a multiple of any power of 1/3.All endpoints of segments are terminating ternary fractions and are contained in the set
\left\{x\in[0,1]\mid\existsi\in\N0:x3i\in\Z\right\} l(\subset\N0
-\N0 | |
3 |
r)
It can be shown that there are as many points left behind in this process as there were to begin with, and that therefore, the Cantor set is uncountable. To see this, we show that there is a function f from the Cantor set
l{C}
l{C}
l{C}
l{C}
To construct this function, consider the points in the [0, 1] interval in terms of base 3 (or ternary) notation. Recall that the proper ternary fractions, more precisely: the elements of
l(\Z\setminus\{0\}r) ⋅
-\N0 | |
3 |
This can be summarized by saying that those numbers with a ternary representation such that the first digit after the radix point is not 1 are the ones remaining after the first step.
The second step removes numbers of the form 0.01xxxx...3 and 0.21xxxx...3, and (with appropriate care for the endpoints) it can be concluded that the remaining numbers are those with a ternary numeral where neither of the first two digits is 1.
Continuing in this way, for a number not to be excluded at step n, it must have a ternary representation whose nth digit is not 1. For a number to be in the Cantor set, it must not be excluded at any step, it must admit a numeral representation consisting entirely of 0s and 2s.
It is worth emphasizing that numbers like 1, = 0.13 and = 0.213 are in the Cantor set, as they have ternary numerals consisting entirely of 0s and 2s: 1 = 0.222...3 = 3, = 0.0222...3 = 3 and = 0.20222...3 = 3.All the latter numbers are "endpoints", and these examples are right limit points of
l{C}
l{C}
\Z ⋅
-\N0 | |
3 |
l{C}
l{C}
This set of endpoints is dense in
l{C}
l{C}
The function from
l{C}
f(\sumk\inak3-k)=\sumk\in
ak | |
2 |
2-k
\forallk\in\N:ak\in\{0,2\}.
For any number y in [0,1], its binary representation can be translated into a ternary representation of a number x in
l{C}
= 3 (which is a right limit point of
l{C}
= 3 (which is a left limit point of
l{C}
1/ | |
\begin{array}{lcl} fl({} | |
3 |
r)=f(0.0\overline{2}3)=0.0\overline{1}2=&0.12&=0.1\overline{0}2=f(0.2\overline{0}3)=
2/ | |
fl({} | |
3 |
r).\\ &\parallel\\ &
1/ | |
{} | |
2 \end{array} |
a\in[-1,1]
x,y\inl{C}
a=y-x
\{(x,y)\inR2\midy=x+a\} \cap (l{C} x l{C}) ≠ \emptyset
a\in[-1,1]
[-1,1]
l{C} x l{C}
|l{C} x l{C}|\geq|[-1,1]|=ak{c}
|A x A|=|A|
A
|l{C}|=ak{c}
The Cantor set contains as many points as the interval from which it is taken, yet itself contains no interval of nonzero length. The irrational numbers have the same property, but the Cantor set has the additional property of being closed, so it is not even dense in any interval, unlike the irrational numbers which are dense in every interval.
It has been conjectured that all algebraic irrational numbers are normal. Since members of the Cantor set are not normal, this would imply that all members of the Cantor set are either rational or transcendental.
The Cantor set is the prototype of a fractal. It is self-similar, because it is equal to two copies of itself, if each copy is shrunk by a factor of 3 and translated. More precisely, the Cantor set is equal to the union of two functions, the left and right self-similarity transformations of itself,
TL(x)=x/3
TR(x)=(2+x)/3
TL(l{C})\congTR(l{C})\congl{C}=TL(l{C})\cupTR(l{C}).
Repeated iteration of
TL
TR
\{TL,TR\}
The automorphisms of the binary tree are its hyperbolic rotations, and are given by the modular group. Thus, the Cantor set is a homogeneous space in the sense that for any two points
x
y
l{C}
h:l{C}\tol{C}
h(x)=y
h
\{0,1\}
h:\{0,1\}\N\to\{0,1\}\N
hn(u):=un+xn+yn\mod2
x
y
It has been found that some form of conservation law is always responsible behind scaling and self-similarity. In the case of Cantor set it can be seen that the
df
df=ln(2)/ln(3)
N=2n
1/3n
n
x1,x2,\ldots,
x | |
2n |
df
df | |
x | |
1 |
df | |
+x | |
2 |
df | |
+ … +x | |
2n |
=1
x1=x2= …
=x | |
2n |
=1/3n
The Hausdorff dimension of the Cantor set is equal to ln(2)/ln(3) ≈ 0.631.
Although "the" Cantor set typically refers to the original, middle-thirds Cantor set described above, topologists often talk about "a" Cantor set, which means any topological space that is homeomorphic (topologically equivalent) to it.
As the above summation argument shows, the Cantor set is uncountable but has Lebesgue measure 0. Since the Cantor set is the complement of a union of open sets, it itself is a closed subset of the reals, and therefore a complete metric space. Since it is also totally bounded, the Heine–Borel theorem says that it must be compact.
For any point in the Cantor set and any arbitrarily small neighborhood of the point, there is some other number with a ternary numeral of only 0s and 2s, as well as numbers whose ternary numerals contain 1s. Hence, every point in the Cantor set is an accumulation point (also called a cluster point or limit point) of the Cantor set, but none is an interior point. A closed set in which every point is an accumulation point is also called a perfect set in topology, while a closed subset of the interval with no interior points is nowhere dense in the interval.
Every point of the Cantor set is also an accumulation point of the complement of the Cantor set.
For any two points in the Cantor set, there will be some ternary digit where they differ — one will have 0 and the other 2. By splitting the Cantor set into "halves" depending on the value of this digit, one obtains a partition of the Cantor set into two closed sets that separate the original two points. In the relative topology on the Cantor set, the points have been separated by a clopen set. Consequently, the Cantor set is totally disconnected. As a compact totally disconnected Hausdorff space, the Cantor set is an example of a Stone space.
As a topological space, the Cantor set is naturally homeomorphic to the product of countably many copies of the space
\{0,1\}
2N=\{(xn)\midxn\in\{0,1\}forn\inN\},
which can also be identified with the set of 2-adic integers. The basis for the open sets of the product topology are cylinder sets; the homeomorphism maps these to the subspace topology that the Cantor set inherits from the natural topology on the real line. This characterization of the Cantor space as a product of compact spaces gives a second proof that Cantor space is compact, via Tychonoff's theorem.
From the above characterization, the Cantor set is homeomorphic to the p-adic integers, and, if one point is removed from it, to the p-adic numbers.
The Cantor set is a subset of the reals, which are a metric space with respect to the ordinary distance metric; therefore the Cantor set itself is a metric space, by using that same metric. Alternatively, one can use the p-adic metric on
2N
(xn),(yn)\in2N
d((xn),(yn))=2-k
k
xk\neyk
We have seen above that the Cantor set is a totally disconnected perfect compact metric space. Indeed, in a sense it is the only one: every nonempty totally disconnected perfect compact metric space is homeomorphic to the Cantor set. See Cantor space for more on spaces homeomorphic to the Cantor set.
The Cantor set is sometimes regarded as "universal" in the category of compact metric spaces, since any compact metric space is a continuous image of the Cantor set; however this construction is not unique and so the Cantor set is not universal in the precise categorical sense. The "universal" property has important applications in functional analysis, where it is sometimes known as the representation theorem for compact metric spaces.[13]
For any integer q ≥ 2, the topology on the group G = Zqω (the countable direct sum) is discrete. Although the Pontrjagin dual Γ is also Zqω, the topology of Γ is compact. One can see that Γ is totally disconnected and perfect - thus it is homeomorphic to the Cantor set. It is easiest to write out the homeomorphism explicitly in the case q = 2. (See Rudin 1962 p 40.)
The Cantor set can be seen as the compact group of binary sequences, and as such, it is endowed with a natural Haar measure. When normalized so that the measure of the set is 1, it is a model of an infinite sequence of coin tosses. Furthermore, one can show that the usual Lebesgue measure on the interval is an image of the Haar measure on the Cantor set, while the natural injection into the ternary set is a canonical example of a singular measure. It can also be shown that the Haar measure is an image of any probability, making the Cantor set a universal probability space in some ways.
In Lebesgue measure theory, the Cantor set is an example of a set which is uncountable and has zero measure.[14] In contrast, the set has a Hausdorff measure of 1 in its dimension of log 2 / log 3.[15]
If we define a Cantor number as a member of the Cantor set, then[16]
The Cantor set is a meagre set (or a set of first category) as a subset of [0,1] (although not as a subset of itself, since it is a Baire space). The Cantor set thus demonstrates that notions of "size" in terms of cardinality, measure, and (Baire) category need not coincide. Like the set
Q\cap[0,1]
l{C}
Q\cap[0,1]
\aleph0
l{C}
ak{c}
l{C}(n)
λ=(n-1)/n
l{C}(n)
l{A}\cupl{A}c=[0,1]
l{A}c
\mu(l{A})=1
l{A}c
See main article: Smith–Volterra–Cantor set.
Instead of repeatedly removing the middle third of every piece as in the Cantor set, we could also keep removing any other fixed percentage (other than 0% and 100%) from the middle. In the case where the middle of the interval is removed, we get a remarkably accessible case — the set consists of all numbers in [0,1] that can be written as a decimal consisting entirely of 0s and 9s. If a fixed percentage is removed at each stage, then the limiting set will have measure zero, since the length of the remainder
(1-f)n\to0
n\toinfty
f
0<f\leq1
On the other hand, "fat Cantor sets" of positive measure can be generated by removal of smaller fractions of the middle of the segment in each iteration. Thus, one can construct sets homeomorphic to the Cantor set that have positive Lebesgue measure while still being nowhere dense. If an interval of length
rn
r\leq1/3
λ=(1-3r)/(1-2r)
r=1/3
0<r<1/3
0<λ<1
r=1/4
1/2
One can modify the construction of the Cantor set by dividing randomly instead of equally. Besides, to incorporate time we can divide only one of the available intervals at each step instead of dividing all the available intervals. In the case of stochastic triadic Cantor set the resulting process can be described by the following rate equation[11] [12]
\partialc(x,t) | =- | |
\partialt |
x2 | |
2 |
c(x,t)+
infty | |
2\int | |
x |
(y-x)c(y,t)dy,
and for the stochastic dyadic Cantor set[18]
{{\partialc(x,t)}\over{\partial
infty | |
t}}=-xc(x,t)+(1+p)\int | |
x |
c(y,t)dy,
where
c(x,t)dx
x
x+dx
0.5616
0.6309
p
ln(1+p)/ln2
c(x,t)
-(1+df) | |
t |
e-xt
df=p
df
Cantor dust is a multi-dimensional version of the Cantor set. It can be formed by taking a finite Cartesian product of the Cantor set with itself, making it a Cantor space. Like the Cantor set, Cantor dust has zero measure.[19]
A different 2D analogue of the Cantor set is the Sierpinski carpet, where a square is divided up into nine smaller squares, and the middle one removed. The remaining squares are then further divided into nine each and the middle removed, and so on ad infinitum.[20] One 3D analogue of this is the Menger sponge.
Cantor introduced what we call today the Cantor ternary set
lC
lC
z=c1/3
2 | |
+c | |
2/3 |
+ … +
\nu | |
c | |
\nu/3 |
+ …
c\nu
A topological space
P
P'
lC
Cantor was led to the study of derived sets by his results on uniqueness of trigonometric series. The latter did much to set him on the course for developing an abstract, general theory of infinite sets.
Benoit Mandelbrot wrote much on Cantor dusts and their relation to natural fractals and statistical physics.[23] He further reflected on the puzzling or even upsetting nature of such structures to those in the mathematics and physics community. In The Fractal geometry of Nature, he described how "When I started on this topic in 1962, everyone was agreeing that Cantor dusts are at least as monstrous as the Koch and Peano curves," and added that "every self-respecting physicist was automatically turned off by a mention of Cantor, ready to run a mile from anyone claiming
lC