Interleave lower bound explained

In the theory of optimal binary search trees, the interleave lower bound is a lower bound on the number of operations required by a Binary Search Tree (BST) to execute a given sequence of accesses.

Several variants of this lower bound have been proven.^[1] ^[2] ^[3] This article is based on a variation of the first Wilber's bound.^[4] This lower bound is used in the design and analysis of Tango tree. Furthermore, this lower bound can be rephrased and proven geometrically, Geometry of binary search trees.

Definition

The bound is based on a fixed perfect BST

, called the lower bound tree, over the keys

\{1,2,...,n\}

. For example, for

n=7

can be represented by the following parenthesis structure:

[([1] 2 [3]) 4 ([5] 6 [7])]

For each node

, define:

Left(y)

to be the set of nodes in the left sub-tree of

, including

Right(y)

to be the set of nodes in the right sub-tree of

Consider the following access sequence:

X=x_1,x_2,...,x_m

. For a fixed node

, and for each access

x_i

, define the label of

x_i

with respect to

as:

"L" - if

x_i

is in

Left(y)

"R" - if

x_i

is in

Right(y)

;

Null - otherwise.

The label of

is the concatenation of the labels from all the accesses. For example, if the sequence of accesses is:

7,6,3

then the label of the root

(4)

is: "RRL", the label of 6 is: "RL", and the label of 2 is: "R".

For every node

, define the amount of interleaving through y as the number of alternations between L and R in the label of

. In the above example, the interleaving through

and

and the interleaving through all other nodes is

The interleave bound,

IB(X)

, is the sum of the interleaving through all the nodes of the tree. The interleave bound of the above sequence is

The Lower Bound Statement and its Proof

The interleave bound is summarized by the following theorem.

The following proof is based on.^[4]

Proof

Let

X=x_1,x_2,...,x_m

be an access sequence. Denote by

T_i

the state of an arbitrary BST at time

i.e. after executing the sequence

x_1,x_2,...,x_i

. We also fix a lower bound BST

For a node

, define the transition point for

at time

to be the minimum-depth node

in the BST

T_i

such that the path from the root of

T_i

includes both a node from Left(y) and a node from Right(y). Intuitively, any BST algorithm on

T_i

that accesses an element from Right(y) and then an element from Left(y) (or vice versa) must touch the transition point of

at least once. In the following Lemma, we will show that transition point is well-defined.

The second lemma that we need to prove states that the transition point is stable. It will not change until it is touched.

The last Lemma toward the proof states that every node

y\inP

has its unique transition point.

Now, we are ready to prove the theorem. First of all, observe that the number of touched transition points by the offline BST algorithm is a lower bound on its cost, we are counting less nodes than the required for the total cost.

We know by Lemma 3 that at any time

, any node

T_i

can be only a transition for at most one node in

. Thus, It is enough to count the number of touches of a transition node of

, the sum over all

Therefore, for a fixed node

y\inP

, let

\ell

and

to be defined as in Lemma 1. The transition point of

is among these two nodes. In fact, it is the deeper one. Let

x
	i₁

x
	i₂

,...,

x
	i_p

be a maximal ordered access sequence to nodes that alternate between

Left(y)

and

Right(y)

. Then

is the amount of interleaving through the node

. Suppose that the even indexed accesses are in the

Left(y)

, and the odd ones are in

Right(y)

i.e.

x
	i_2j

\inLeft(y)

and

x
	i_2j

\inRight(y)

. We know by the properties of lowest common ancestor that an access to a node in

Left(y)

, it must touch

\ell

. Similarly, an access to a node in

Right(y)

must touch

. Consider every

j\in[1,\lfloorp/2\rfloor]

. For two consecutive accesses

x
	i_2j

and

x
	i_2j

, if they avoid touching the access point of

, then

\ell

and

must change in between. However, by Lemma 2, such change requires touching the transition point. Consequently, the BST access algorithm touches the transition point of

at least once in the interval of

[i_2j,i_2j]

. Summing over all

j\in[1,\lfloorp/2\rfloor]

, the best algorithm touches the transition point of

at least

\lfloorp/2\rfloor\geqp/2-1

. Summing over all

\sum_yp_y/2-1\geqIB(X)/2-n

where

p_y

is the amount of interleave through

. By definition, the

p_y

's add up to

IB(X)

. That concludes the proof.

Notes and References

10.1137/0218004. Lower Bounds for Accessing Binary Search Trees with Rotations. SIAM Journal on Computing. 18. 56–67. 1989. Wilber . R. .
10.1137/S0097539795291598. Optimal Biweighted Binary Trees and the Complexity of Maintaining Partial Sums. SIAM Journal on Computing. 28. 1–9. 1998. Hampapuram . H. . Fredman . M. L. .
10.1137/S0097539705447256. Logarithmic Lower Bounds in the Cell-Probe Model. SIAM Journal on Computing. 35. 4. 932. 2006. Patrascu . M. . Demaine . E. D. . cs/0502041.
10.1137/S0097539705447347. Dynamic Optimality—Almost. SIAM Journal on Computing. 37. 240–251. 2007. Demaine . E. D. . Harmon . D. . Iacono . J. . Pătraşcu . M. .

Interleave lower bound explained

Definition

The Lower Bound Statement and its Proof

Proof

See also

Notes and References