Calkin–Wilf tree explained

In number theory, the Calkin–Wilf tree is a tree in which the vertices correspond one-to-one to the positive rational numbers. The tree is rooted at the number 1, and any rational number expressed in simplest terms as the fraction a/b has as its two children the numbers a/(a + b) and (a + b)/b. Every positive rational number appears exactly once in the tree. It is named after Neil Calkin and Herbert Wilf, but appears in other works including Kepler's Harmonices Mundi.

The sequence of rational numbers in a breadth-first traversal of the Calkin–Wilf tree is known as the Calkin–Wilf sequence. Its sequence of numerators (or, offset by one, denominators) is Stern's diatomic series, and can be computed by the fusc function.

History

The Calkin–Wilf tree is named after Neil Calkin and Herbert Wilf, who considered it in a 2000 paper. In a 1997 paper, Jean Berstel and Aldo de Luca^[1] called the same tree the Raney tree, since they drew some ideas from a 1973 paper by George N. Raney.^[2] Stern's diatomic series was formulated much earlier by Moritz Abraham Stern, a 19th-century German mathematician who also invented the closely related Stern–Brocot tree. Even earlier, a similar tree (including only the fractions between 0 and 1) appears in Kepler's Harmonices Mundi (1619).^[3]

Definition and structure

The Calkin–Wilf tree may be defined as a directed graph in which each positive rational number occurs as a vertex and has one outgoing edge to another vertex, its parent, except for the root of the tree, the number 1, which has no parent.

The parent of any rational number can be determined after placing the number into simplest terms, as a fraction a/b for which the greatest common divisor of a and b is 1. If a/b < 1, the parent of a/b is a/(b − a); if a/b > 1, the parent of a/b is (a − b)/b. Thus, in either case, the parent is a fraction with a smaller sum of numerator and denominator, so repeated reduction of this type must eventually reach the number 1. As a graph with one outgoing edge per vertex and one root reachable by all other vertices, the Calkin–Wilf tree must indeed be a tree.
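The parent rule can be sketched in a few lines of Python. This is only an illustration: the names parent and path_to_root are hypothetical, and the sketch relies on the standard fractions.Fraction type keeping every value in simplest terms.

```python
from fractions import Fraction

def parent(q):
    """Return the Calkin-Wilf parent of a positive rational q other than 1."""
    a, b = q.numerator, q.denominator  # Fraction stores a/b in lowest terms
    if a < b:
        return Fraction(a, b - a)      # case a/b < 1
    if a > b:
        return Fraction(a - b, b)      # case a/b > 1
    raise ValueError("the root 1 has no parent")

def path_to_root(q):
    """Follow parent links from q until the root 1 is reached."""
    path = [q]
    while path[-1] != 1:
        path.append(parent(path[-1]))
    return path

print(path_to_root(Fraction(3, 5)))
```

Each step lowers the sum of numerator and denominator (8, 5, 3, 2 for the path from 3/5), which is why the loop always terminates.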

The children of any vertex in the Calkin–Wilf tree may be computed by inverting the formula for the parents of a vertex. Each vertex a/b has one child whose value is less than 1, a/(a + b), because of course a + b > a. Similarly, each vertex a/b has one child whose value is greater than 1, (a + b)/b.^[4]
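As a small illustration of the child rule, the following Python sketch computes both children of a vertex. The function name children is hypothetical, and fractions.Fraction is assumed as the representation of a vertex.

```python
from fractions import Fraction

def children(q):
    """Return (left, right) children of q = a/b: a/(a+b) and (a+b)/b."""
    a, b = q.numerator, q.denominator
    return Fraction(a, a + b), Fraction(a + b, b)

left, right = children(Fraction(3, 2))
print(left, right)  # 3/5 5/2
```

The left child is always below 1 and the right child always above 1, so the two children of a vertex can never coincide.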

As each vertex has two children, the Calkin–Wilf tree is a binary tree. However, it is not a binary search tree: its inorder traversal does not coincide with the sorted order of its vertices. It is nonetheless closely related to a different binary search tree on the same set of vertices, the Stern–Brocot tree: the vertices at each level of the two trees coincide, and are related to each other by a bit-reversal permutation.

Breadth-first traversal

The Calkin–Wilf sequence is the sequence of rational numbers generated by a breadth-first traversal of the Calkin–Wilf tree,

1/1, 1/2, 2/1, 1/3, 3/2, 2/3, 3/1, 1/4, 4/3, 3/5, 5/2, 2/5, 5/3, 3/4, ….

Because the Calkin–Wilf tree contains every positive rational number exactly once, so does this sequence.^[5] The denominator of each fraction equals the numerator of the next fraction in the sequence. The Calkin–Wilf sequence can also be generated directly by the formula

q_{i+1} = \frac{1}{2\lfloor q_i \rfloor - q_i + 1}

where q_i denotes the ith number in the sequence, starting from q_1 = 1, and ⌊q_i⌋ represents the integral part of q_i.^[6]
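The following Python sketch compares the two ways of producing the sequence: the successor formula above, and a level-by-level traversal of the tree. The generator names are hypothetical, and the sketch assumes exact arithmetic with fractions.Fraction.

```python
from fractions import Fraction
from itertools import islice
from math import floor

def calkin_wilf_newman():
    """Yield q_1, q_2, ... using q_{i+1} = 1 / (2*floor(q_i) - q_i + 1)."""
    q = Fraction(1)
    while True:
        yield q
        q = 1 / (2 * floor(q) - q + 1)

def calkin_wilf_bfs():
    """Yield the same sequence by visiting the tree one level at a time."""
    level = [(1, 1)]
    while True:
        for a, b in level:
            yield Fraction(a, b)
        # children of a/b are a/(a+b) and (a+b)/b, left to right
        level = [c for a, b in level for c in ((a, a + b), (a + b, b))]

print([str(q) for q in islice(calkin_wilf_newman(), 7)])
```

The formula needs only the previous term, whereas the traversal has to hold an entire level of the tree in memory.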

It is also possible to calculate q_i directly from the run-length encoding of the binary representation of i: the number of consecutive 1s starting from the least significant bit (this count is 0 when i is even), then the number of consecutive 0s starting from the first block of 1s, and so on. The sequence of numbers generated in this way gives the continued fraction representation of q_i. Example:

i = 1081 = 10000111001₂: The continued fraction is [1;2,3,4,1], hence q_1081 = 53/37.

i = 1990 = 11111000110₂: The continued fraction is [0;1,2,3,5], hence q_1990 = 37/53.
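The run-length method can be sketched as follows. The helper names are hypothetical; the first run counts trailing 1s and is allowed to be empty, matching the leading 0 in the second example.

```python
from fractions import Fraction

def run_lengths_from_lsb(i):
    """Run lengths of the bits of i, read from the least significant bit,
    starting with the (possibly empty) run of 1s."""
    runs, bit = [], 1
    while i > 0:
        count = 0
        while i > 0 and (i & 1) == bit:
            count += 1
            i >>= 1
        runs.append(count)
        bit ^= 1  # alternate between runs of 1s and runs of 0s
    return runs

def continued_fraction_value(terms):
    """Evaluate [t0; t1, ..., tk] exactly, working from the last term back."""
    value = Fraction(terms[-1])
    for t in reversed(terms[:-1]):
        value = t + 1 / value
    return value

def calkin_wilf_term(i):
    """Return q_i for i >= 1 from the binary representation of i."""
    return continued_fraction_value(run_lengths_from_lsb(i))

print(run_lengths_from_lsb(1081), calkin_wilf_term(1081))  # [1, 2, 3, 4, 1] 53/37
```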

In the other direction, using the continued fraction of any q_i as the run-length encoding of a binary number gives back i itself. Example:

q_i = 3/4: The continued fraction is [0;1,3], hence i = 1110₂ = 14.

q_i = 4/3: The continued fraction is [1;3]. But to use this method, the length of the continued fraction must be an odd number. So [1;3] should be replaced by the equivalent continued fraction [1;2,1]. Hence i = 1001₂ = 9.
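A minimal Python sketch of the reverse direction follows. The function names are hypothetical. It assumes the continued fraction is produced by the Euclidean algorithm, in which case an even-length expansion always ends in a term of at least 2, so that term can be split as t = (t − 1) + 1/1 to obtain odd length.

```python
from fractions import Fraction

def continued_fraction(q):
    """Continued fraction terms of a positive rational, via the Euclidean algorithm."""
    terms = []
    a, b = q.numerator, q.denominator
    while b:
        terms.append(a // b)
        a, b = b, a % b
    return terms

def index_in_sequence(q):
    """Return i such that q is the ith term of the Calkin-Wilf sequence."""
    terms = continued_fraction(q)
    if len(terms) % 2 == 0:
        # make the length odd: [...; t] equals [...; t - 1, 1]
        terms[-1] -= 1
        terms.append(1)
    i, shift, bit = 0, 0, 1
    for run in terms:
        if bit:
            i |= ((1 << run) - 1) << shift  # write a run of 1s
        shift += run                         # a run of 0s only moves the position
        bit ^= 1
    return i

print(index_in_sequence(Fraction(3, 4)), index_in_sequence(Fraction(4, 3)))  # 14 9
```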

A similar conversion between run-length-encoded binary numbers and continued fractions can also be used to evaluate Minkowski's question mark function; however, in the Calkin–Wilf tree the binary numbers are integers (positions in the breadth-first traversal) while in the question mark function they are real numbers between 0 and 1.

Stern's diatomic sequence

Stern's diatomic sequence is the integer sequence

0, 1, 1, 2, 1, 3, 2, 3, 1, 4, 3, 5, 2, 5, 3, 4, 1, ….

Using zero-based numbering, the nth value in the sequence is the value fusc(n) of the fusc function, named^[7] according to the obfuscating appearance of the sequence of values and defined by the recurrence relations

\begin{align} \operatorname{fusc}(2n)&=\operatorname{fusc}(n)\\ \operatorname{fusc}(2n+1)&=\operatorname{fusc}(n)+\operatorname{fusc}(n+1), \end{align}

with the base cases fusc(0) = 0 and fusc(1) = 1.
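The recurrence translates almost directly into a memoized Python function. This is a sketch rather than an optimized implementation; the base cases fusc(0) = 0 and fusc(1) = 1 are the standard ones for Stern's diatomic sequence.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def fusc(n):
    """Stern's diatomic sequence, computed from its defining recurrence."""
    if n < 2:
        return n  # base cases: fusc(0) = 0, fusc(1) = 1
    half, odd = divmod(n, 2)
    if odd:
        return fusc(half) + fusc(half + 1)  # fusc(2k+1) = fusc(k) + fusc(k+1)
    return fusc(half)                        # fusc(2k) = fusc(k)

print([fusc(n) for n in range(17)])
# [0, 1, 1, 2, 1, 3, 2, 3, 1, 4, 3, 5, 2, 5, 3, 4, 1]
```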

The nth rational number in a breadth-first traversal of the Calkin–Wilf tree is the number fusc(n)/fusc(n + 1).^[8] Thus, the diatomic sequence forms both the sequence of numerators and the sequence of denominators of the numbers in the Calkin–Wilf sequence.
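One way to see this connection in code is to compute the pair (fusc(n), fusc(n + 1)) by reading the bits of n from the most significant end. In the sketch below (function names hypothetical), a 0 bit applies the left-child rule a/b → a/(a + b) and a 1 bit applies the right-child rule a/b → (a + b)/b, so the bits of n after the leading 1 spell out the path from the root to the nth vertex.

```python
from fractions import Fraction

def fusc_pair(n):
    """Return (fusc(n), fusc(n + 1)) by scanning the bits of n from the top."""
    a, b = 0, 1  # (fusc(0), fusc(1))
    for bit in bin(n)[2:]:
        if bit == "1":
            a = a + b  # n -> 2n + 1: pair becomes (a + b, b)
        else:
            b = a + b  # n -> 2n:     pair becomes (a, a + b)
    return a, b

def calkin_wilf_term(n):
    """Return the nth term (n >= 1) of the Calkin-Wilf sequence."""
    a, b = fusc_pair(n)
    return Fraction(a, b)

print([str(calkin_wilf_term(n)) for n in range(1, 8)])
```

Because consecutive terms share fusc(n + 1), the denominator of each term is the numerator of the next.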

The function fusc(n + 1) is the number of odd binomial coefficients of the form C(n − r, r) with 0 ≤ 2r ≤ n,^[9] and also counts the number of ways of writing n as a sum of powers of two in which each power occurs at most twice. This can be seen from the recurrence defining fusc: the expressions as a sum of powers of two for an even number 2n either have no 1s in them (in which case they are formed by doubling each term in an expression for n) or two 1s (in which case the rest of the expression is formed by doubling each term in an expression for n − 1), so the number of representations is the sum of the number of representations for n and for n − 1, matching the recurrence. Similarly, each representation for an odd number 2n + 1 is formed by doubling a representation for n and adding 1, again matching the recurrence.^[10] For instance,

6 = 4 + 2 = 4 + 1 + 1 = 2 + 2 + 1 + 1

has three representations as a sum of powers of two with at most two copies of each power, so fusc(6 + 1) = 3.
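Both counting interpretations can be checked numerically for small n. The sketch below uses hypothetical helper names: it counts the representations of n by brute force and counts odd binomial coefficients C(n − r, r) over 0 ≤ 2r ≤ n, then compares each count with fusc(n + 1). It is a sanity check under these assumptions, not a proof.

```python
from functools import lru_cache
from math import comb

@lru_cache(maxsize=None)
def fusc(n):
    """Stern's diatomic sequence from its recurrence."""
    if n < 2:
        return n
    half, odd = divmod(n, 2)
    return fusc(half) + fusc(half + 1) if odd else fusc(half)

@lru_cache(maxsize=None)
def hyperbinary_count(n, power=1):
    """Ways to write n using power, 2*power, 4*power, ..., each at most twice."""
    if n == 0:
        return 1
    if power > n:
        return 0
    # use the current power 0, 1, or 2 times, then move on to the next power
    return sum(hyperbinary_count(n - k * power, 2 * power)
               for k in range(3) if k * power <= n)

def odd_binomial_count(n):
    """Number of odd values among C(n - r, r) for 0 <= 2r <= n."""
    return sum(comb(n - r, r) % 2 for r in range(n // 2 + 1))

print(hyperbinary_count(6), odd_binomial_count(6), fusc(7))  # 3 3 3
```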

Relation to Stern–Brocot tree

The Calkin–Wilf tree resembles the Stern–Brocot tree in that both are binary trees with each positive rational number appearing exactly once. Additionally, the top levels of the two trees appear very similar, and in both trees, the same numbers appear at the same levels. One tree can be obtained from the other by performing a bit-reversal permutation on the numbers at each level of the trees. Alternatively, the number at a given node of the Calkin–Wilf tree can be converted into the number at the same position in the Stern–Brocot tree, and vice versa, by a process involving the reversal of the continued fraction representations of these numbers.

However, in other ways, they have different properties. For instance, the Stern–Brocot tree is a binary search tree: the left-to-right traversal order of the tree is the same as the numerical order of the numbers in it. This property is not true of the Calkin–Wilf tree.
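The bit-reversal relationship can be checked level by level with a short Python sketch. The function names are hypothetical, and the Stern–Brocot levels are built here by the usual mediant construction between the bounds 0/1 and 1/0, which is an assumption of this sketch rather than something spelled out above.

```python
from fractions import Fraction

def calkin_wilf_level(depth):
    """Fractions on one level of the Calkin-Wilf tree, left to right (root is depth 0)."""
    level = [(1, 1)]
    for _ in range(depth):
        level = [c for a, b in level for c in ((a, a + b), (a + b, b))]
    return [Fraction(a, b) for a, b in level]

def stern_brocot_level(depth):
    """Fractions on one level of the Stern-Brocot tree, left to right."""
    # each node is represented by its bounding pair; its value is their mediant
    level = [((0, 1), (1, 0))]
    for _ in range(depth):
        nxt = []
        for (ln, ld), (rn, rd) in level:
            mediant = (ln + rn, ld + rd)
            nxt.append(((ln, ld), mediant))
            nxt.append((mediant, (rn, rd)))
        level = nxt
    return [Fraction(ln + rn, ld + rd) for (ln, ld), (rn, rd) in level]

def bit_reverse(p, width):
    """Reverse the width-bit binary representation of p."""
    return int(format(p, "b").zfill(width)[::-1], 2) if width else 0

cw, sb = calkin_wilf_level(3), stern_brocot_level(3)
print(all(cw[p] == sb[bit_reverse(p, 3)] for p in range(8)))
```

On the same level the Stern–Brocot list comes out in increasing order while the Calkin–Wilf list does not, which reflects the binary search tree property discussed above.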

References

- Dijkstra, Edsger W. EWD 570: An exercise for Dr. R. M. Burstall, pp. 215–216, and EWD 578: More about the function "fusc" (A sequel to EWD570), pp. 230–232, reprints of notes originally written in 1976.

Notes and References

Berstel & de Luca (1997), Section 6.
Raney (1973).
Kepler (1619).
The description here is dual to the original definition by Calkin and Wilf, which begins by defining the child relationship and derives the parent relationship as part of a proof that every rational appears once in the tree. As defined here, every rational appears once by definition, and instead the fact that the resulting structure is a tree requires a proof.
Calkin & Wilf (2000): "a list of all positive rational numbers, each appearing once and only once, can be made by writing down 1/1, then the fractions on the level just below the top of the tree, reading from left to right, then the fractions on the next level down, reading from left to right, etc." Gibbons, Lester & Bird (2006) discuss efficient functional programming techniques for performing this breadth-first traversal.
Aigner & Ziegler (2004) credit this formula to Moshe Newman.
The fusc name was given in 1976 by Edsger W. Dijkstra; see EWD570 and EWD578.
Calkin & Wilf (2000), Theorem 1.
Carlitz (1964).
The OEIS entry credits this fact to Carlitz (1964) and to uncited work of Lind. However, Carlitz's paper describes a more restricted class of sums of powers of two, counted by fusc(n) instead of by fusc(n + 1).