Rokhlin lemma explained

In mathematics, the Rokhlin lemma, or Kakutani–Rokhlin lemma is an important result in ergodic theory. It states that an aperiodic measure preserving dynamical system can be decomposed to an arbitrary high tower of measurable sets and a remainder of arbitrarily small measure. It was proven by Vladimir Abramovich Rokhlin and independently by Shizuo Kakutani. The lemma is used extensively in ergodic theory, for example in Ornstein theory and has many generalizations.

Rokhlin lemma belongs to the group mathematical statements such as Zorn's lemma in set theory and Schwarz lemma in complex analysis which are traditionally called lemmas despite the fact that their roles in their respective fields are fundamental.

Terminology

A Lebesgue space is a measure space

(X,lB,\mu)

composed of two parts. One atomic part with finite/countably many atoms, and one continuum part isomorphic to an interval on \R.

We consider only measure-preserving maps. As typical in measure theory, we can freely discard countably many sets of measure zero.

An ergodic map is a map

T

such that if

T-1(A)=A

(except on a measure-zero set) then

A

or

X-A

has measure zero.

An aperiodic map is a map such that the set of periodic points is measure zero:\mu(\cup_\) = 0 A Rokhlin tower is a family of sets S, TS, \dots, T^S that are disjoint. S is called the base of the tower, and each T^nS is a rung or level of the tower. N is the height of the tower. The tower itself is

R:=(S\cupTS\cup...\cupTN-1S)

. The set outside the tower X - R is the error set.

There are several Rokhlin lemmas. Each states that, under some assumptions, we can construct Rokhlin towers that are arbitrarily high with arbitrarily small error sets.

Theorems

[1] [2]

Applications

The Rokhlin lemma can be used to prove some theorems. For example, (Section 2.5)

(Section 4.6)

Ornstein isomorphism theorem (Chapter 6).

Topological Rokhlin lemmas

Let

style(X,T)

be a topological dynamical system consisting of a compact metric space

styleX

and a homeomorphism

styleT:XX

. The topological dynamical system

style(X,T)

is called minimal if it has no proper non-empty closed

styleT

-invariant subsets. It is called (topologically) aperiodic if it has no periodic points (

Tkx=x

for some

x\inX

and

k\in{Z

} implies

k=0

). A topological dynamical system

style(Y,S)

is called a factor of

style(X,T)

if there exists a continuous surjective mapping

style\varphi:XY

which is equivariant, i.e.,

style\varphi(Tx)=S\varphi(x)

for all

stylex\inX

.

Elon Lindenstrauss proved the following theorem:[3]

Theorem: Let

style(X,T)

be a topological dynamical system which has an aperiodic minimal factor. Then for integer

stylen\in\N

there is a continuous function

stylef\colonX\R

such that the set

styleE=\{x\inX\midf(Tx)f(x)+1\}

satisfies

styleE,TE,\ldots,Tn-1E

are pairwise disjoint.

Gutman proved the following theorem:[4]

Theorem: Let

(X,T)

be a topological dynamical system which has an aperiodic factor with the small boundary property. Then for every

\varepsilon>0

, there exists a continuous function

f\colonX\R

such that the set

styleE=\{x\inX\midf(Tx)f(x)+1\}

satisfies

\operatorname{ocap}(styleE)<\varepsilon

, where

\operatorname{ocap}

denotes orbit capacity.

Other generalizations

Proofs

Proofs taken from.

Useful results

Proposition. An ergodic map on an atomless Lebesgue space is aperiodic.

Proof. If the map is not aperiodic, then there exists a number n, such that the set of periodic points of period n has positive measure. Call the set S. Since measure is preserved, points outside of S do not map into it, nor the other way. Since the space is atomless, we can divide S into two halves, and T maps each into itself, so T is not ergodic.

Proposition. If there is an aperiodic map on a Lebesgue space of measure 1, then the space is atomless.

Proof. If there are atoms, then by measure-preservation, each atom can only map into another atom of greater or equal measure. If it maps into an atom of greater measure, it would drain out measure from the lighter atoms, so each atom maps to another atom of equal measure. Since the space has finite total measure, there are only finitely many atoms of a certain measure, and they must cycle back to the start eventually.

Proposition. If T is ergodic, then any set A > 0 satisfies (up to a null set)X = \cup_ T^k A = \cup_ T^k AProof. T^(\cup_ T^k A) is a subset of \cup_ T^k A, so by measure-preservation they are equal. Thus \cup_ T^k A is a factor of T, and since it contains A> 0, it is all of X.

Similarly, T(\cup_ T^k A) is a subset of \cup_ T^k A, so by measure-preservation they are equal, etc.

Ergodic case

Let A be a set of measure < \epsilon. Since T is ergodic, X = \cup_ T^k A, almost any point sooner or later falls into A. So we define a “time till arrival” function: f(x) := \min\ with f(x) := +\infty if x never falls into A. The set of \ is null.

Now let S = \.

Aperiodic case

Simplify

By a previous proposition,

X

is atomless, so we can map it to the unit interval (0, 1).

If we can pick a near-zero set with near-full coverage, namely some A = O(\epsilon) such that X - \cup_ T^k A = O(\epsilon), then there exists some n, such that X - \cup_ T^k A = O(\epsilon), and since

T-i(TnA)\supsetTn-iA

for each

i=0,1,2,...

, we haveX - \cup_ T^k (T^nA) = O(\epsilon)Now, repeating the previous construction with

TnA

, we obtain a Rokhlin tower of height N and coverage 1-O(\epsilon).

Thus, our task reduces to picking a near-zero set with near-full coverage.

Constructing A

Pick M > 1/\epsilon. Let S be the family of sets A such that A, T^A, \dots, T^A are disjoint. Since T preserves measure, any A \in S has size < \epsilon.

The set S nonempty, because \emptyset \in S. It is preordered by A < B iff \mu(B-A) = 0. Any totally ordered chain contains an upper bound. So by a simple Zorn-lemma–like argument, there exists a maximal element A in it. This is the desired set.

We prove by contradiction that X = \cup_T^k A. Assume not, then we will construct a set I\cap E > 0, disjoint from A, such that A \cup (I \cap E) \in S, which makes A no longer a maximal element, a contradiction.

Constructing E

Since we assumed X - \cup_T^k A= \epsilon' > 0, with positive probability,

x\not\in\cupk\inTkA

.

Since T is aperiodic, with probability 1, (x \neq Tx) \wedge(x \neq T^2x) \wedge \dots \wedge (x \neq T^Mx) And so, for a small enough \delta, with probability > 1- \epsilon'/2, (|x - Tx| > \delta) \wedge(|x - T^2 x| > \delta) \wedge \dots \wedge (|x - T^M x| > \delta) And so, for a small enough \delta, with probability > \epsilon'/2, these two events occur simultaneously. Let the event be E.

Invertible case

Simplify

It suffices to prove the case where only the base of the tower is probabilistically independent of the partition. Once that case is proved, we can apply the base case to the partition P \vee T^ P \vee \dots \vee T^P.

Since events with zero probability can be ignored, we only consider partitions where each event P_k has positive probability.

The goal is to construct a Rokhlin tower R' with base S', such that

\mu(S'\capPi)=

1-\epsilon
N

\mu(Pi)

for each i \in 0:K-1.

Given a partition P and a map T, we can trace out the orbit of every point x as a string of symbols a_0(x), a_1(x), a_2(x), \dots, such that each T^i x \in P_. That is, we follow x to T^ix, then check which partition it has ended up in, and write that partition’s name as a_i(x).

Given any Rokhlin tower of height N, we can take its base S, and divide it into K^N equivalence classes. The equivalence is defined thus: two elements are equivalent iff their names have the same first-N symbols.

Let E \subset S be one such equivalence class, then we call E, TE, \dots, T^E a column of the Rokhlin tower.

For each word a_\in (0:K-1)^N, let the corresponding equivalence class be E_.

Since T is invertible, the columns partition the tower. One can imagine the tower made of string cheese, cut up the base of the tower into the K^N equivalence classes, then pull it apart into K^N columns.

First Rokhlin tower R

Let \delta \ll \epsilon be very small, and let M \gg N be very large. Construct a Rokhlin tower with M levels and error set of size \delta. Let its base be S. The tower R = S\cup TS \cup\dots\cup T^S has mass 1-\delta.

Divide its base into K^N equivalence classes, as previously described. This divides it into K^N columns \_ where a ranges over the possible words (0:K-1)^N.

Because of how we defined the equivalence classes, each level in each column T^nE_a falls entirely within one of the partitions P_0, \dots, P_. Therefore, the column levels \_ almost make up a refinement of the partition P, except for an error set of size \delta.

That is,\mu(R\cap P_i) = \sum_\mu(T^nE_a) = \mu(P_i) + O(\delta)The critical idea: If we partition each T^nE_a equally into N parts, and put one into a new Rokhlin tower base S', we will have\mu(S'\cap P_i) = \frac\mu(P_i) + O(\delta)

Second Rokhlin tower R

Now we construct a new base S' as follows: For each column based on E_a, add to S', in a staircase pattern, the setsE_, TE_, \dots, T^E_then wrap back to the start:T^NE_, T^E_, \dots, T^E_and so on, until the column is exhausted.The new Rokhlin tower base S' is almost correct, but needs to be trimmed slightly into another set

S''

, which would satisfy

\mu(S''\capPi)=

1-\epsilon
N

\mu(Pi)

for each i \in 0:K-1, finishing the construction. (Only now do we use the assumption that there are only finitely many partitions. If there are countably many partitions, then the trimming cannot be done.)

The new Rokhlin tower S', TS', \dots, T^S', contains almost as much mass as the original Rokhlin tower. The only lost mass is due to a small corner on the top right and bottom left of each column, which takes up \leq \frac proportion of the whole column’s mass. If we set M \gg N/\delta, this lost mass is still O(\delta). Thus, the new Rokhlin tower still has a very small error set.

Even after accounting for the mass lost from cutting off the column corners, we still have\begin\mu(S'\cap P_i) &= \frac\mu(P_i) + O(\delta) + O(\delta) \\&= \frac\mu(P_i) + O(\delta) \\&= \frac\mu(P_i)\times (1 + O(N\delta/\mu(P_i)))\quad\forall i = 0, 1, \dots, K-1

\end

Since there are only finitely many partitions, we can set \delta = o(\frac), we then have \mu(S'\cap P_i) = \frac\mu(P_i)\times (1 + o(1) \epsilon) In other words, we have real numbers c_0, c_1, \dots, c_ = o(1) such that

\mu(S'\capPi)=

1-ci\epsilon
N

\mu(Pi)

.

Now for each column i = 0, 1, \dots, K-1, trim away a part of S'\cap P_i into S\cap P_i, so that

\mu(S''\capPi)=

1-\epsilon
N

\mu(Pi)

. This finishes the construction.

Notes

Notes and References

  1. Book: Shields, Paul . The theory of Bernoulli shifts . The University of Chicago Press . 1973 . Chicago Lectures in Mathematics . Chicago, Illinois and London . Chapter 3.
  2. Book: Kalikow, Steven . An outline of ergodic theory . McCutcheon . Randall . 2010 . Cambridge Univ. Press . 978-0-521-19440-2 . 1. publ . Cambridge studies in advanced mathematics . Cambridge . 2.4. Rohlin tower theorem.
  3. Mean dimension, small entropy factors and an embedding theorem. Publications Mathématiques de l'IHÉS. 1999-12-01. 0073-8301. 227–262. 89. 1. 10.1007/BF02698858. Elon. Lindenstrauss. 2413058. Elon Lindenstrauss.
  4. Gutman, Yonatan. "Embedding ℤk-actions in cubical shifts and ℤk-symbolic extensions." Ergodic Theory and Dynamical Systems 31.2 (2011): 383-403.
  5. Kornfeld . Isaac . 2004 . Some old and new Rokhlin towers . Contemporary Mathematics . 356 . 145–169 . 10.1090/conm/356/06502 . 9780821833131.
  6. Avila . Artur . Artur Avila . Candela . Pablo . 2016 . Towers for commuting endomorphisms, and combinatorial applications . Annales de l'Institut Fourier . 66 . 4 . 1529–1544 . 1507.07010 . 10.5802/aif.3042 . free.
  7. Ornstein . Donald S. . Donald Samuel Ornstein . Weiss . Benjamin . Benjamin Weiss . 1987-12-01 . Entropy and isomorphism theorems for actions of amenable groups . . 48 . 1 . 1–141 . 10.1007/BF02790325 . 0021-7670 . 120653036 .
  8. Ionescu Tulcea . Alexandra . Alexandra Bellow . 1965-01-01 . On the Category of Certain Classes of Transformations in Ergodic Theory . . 114 . 1 . 261–279 . 10.2307/1994001 . 1994001.