In mathematics, subshifts of finite type are used to model dynamical systems, and in particular are the objects of study in symbolic dynamics and ergodic theory. They also describe the set of all possible sequences executed by a finite state machine. The most widely studied shift spaces are the subshifts of finite type.
A (one-sided) shift of finite type is the set of all sequences, infinite on one end only, that can be made up of the letters $A, B, C$, such as $AAA\cdots$, $ABAB\cdots$, and so on.
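A subshift can be defined by a directed graph on these letters, such as the graph $A \to B \to C \to A$. The subshift consists of the sequences whose consecutive letters follow the arrows of the graph; for this graph there are only three one-sided sequences: $ABCABC\cdots$, $BCABCA\cdots$, $CABCAB\cdots$.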
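Other directed graphs on the same letters produce other subshifts. For example, adding another arrow $A \to C$ to the graph produces a subshift with uncountably many sequences, because every visit to $A$ can now be followed by either $B$ or $C$.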
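The effect of changing the graph can be checked by brute force on finite words. The following is a minimal Python sketch; the function name `admissible_words` and the encoding of arrows as pairs of letters are illustrative choices, not notation from the text. It lists every word of a given length whose consecutive letters follow an arrow of the graph.

```python
from itertools import product

def admissible_words(edges, alphabet, length):
    """Return all words of the given length whose consecutive letters follow an arrow."""
    allowed = set(edges)
    return [
        "".join(w)
        for w in product(alphabet, repeat=length)
        if all((a, b) in allowed for a, b in zip(w, w[1:]))
    ]

# The 3-cycle A -> B -> C -> A, and the same graph with the extra arrow A -> C.
cycle = [("A", "B"), ("B", "C"), ("C", "A")]
extended = cycle + [("A", "C")]

for n in (3, 4, 8):
    print(n, len(admissible_words(cycle, "ABC", n)), len(admissible_words(extended, "ABC", n)))
```

For the 3-cycle the count stays at three for every length, one word per starting letter. With the arrow $A \to C$ added there are five words of length 3 and seven of length 4, and the count keeps growing with the length.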
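Given a Markov transition matrix and an invariant distribution on the states, we can impose a probability measure on the set of subshifts. For example, consider the Markov chain on the states $A, B_1, B_2$ shown in the accompanying figure, with invariant distribution $\pi = (2/7, 4/7, 1/7)$. If we forget the distinction between $B_1$ and $B_2$, the space of subshifts on $A, B_1, B_2$ projects onto a space of subshifts on $A, B$, and the probability measure projects to a probability measure on the subshifts on $A, B$.
The curious thing is that the probability measure on the subshifts on $A, B$ is not generated by a Markov chain on $A, B$ of any order. Intuitively, the longer the observed run $B^n$, the closer $\Pr(A \mid B^n)$ gets to $2/3$, so the prediction depends on arbitrarily long histories.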
Conversely, there exists a space of subshifts on 6 symbols, projected to subshifts on 2 symbols, such that any Markov measure on the smaller subshift has a preimage measure that is not Markov of any order (Example 2.6).
Let $V$ be a finite set of symbols (alphabet). Let $X$ denote the set $V^{\mathbb{Z}}$ of all bi-infinite sequences of elements of $V$ together with the shift operator $T$. We endow $V$ with the discrete topology and $X$ with the product topology. A symbolic flow or subshift is a closed $T$-invariant subset $Y$ of $X$,[1] and the associated language $L_Y$ is the set of finite subsequences of $Y$.[2]
Now let $A$ be an $n \times n$ adjacency matrix with entries in $\{0, 1\}$. Using these elements we construct a directed graph $G = (V, E)$ with $V$ the set of vertices and $E$ the set of edges, where $E$ contains the directed edge $x \to y$ if and only if $A_{x,y} = 1$. Let $Y$ be the set of all infinite admissible sequences of edges, where admissible means that the sequence is a walk of the graph, and the sequence can be either one-sided or two-sided infinite. Let $T$ be the left shift operator on such sequences; it plays the role of the time-evolution operator of the dynamical system. A subshift of finite type is then defined as a pair $(Y, T)$ obtained in this way. If the sequence extends to infinity in only one direction, it is called a one-sided subshift of finite type, and if it is bilateral, it is called a two-sided subshift of finite type.
Formally, one may define the sequences of edges as
$$\Sigma_A^{+} = \left\{ (x_0, x_1, \ldots) : x_j \in V,\ A_{x_j x_{j+1}} = 1,\ j \in \mathbb{N} \right\}.$$
This is the space of all sequences of symbols such that the symbol $p$ can be followed by the symbol $q$ only if the $(p, q)$-th entry of the matrix $A$ is 1. The space of all bi-infinite sequences is defined analogously:
$$\Sigma_A = \left\{ (\ldots, x_{-1}, x_0, x_1, \ldots) : x_j \in V,\ A_{x_j x_{j+1}} = 1,\ j \in \mathbb{Z} \right\}.$$
The shift operator $T$ maps a sequence in the one- or two-sided shift to another by shifting all symbols to the left, i.e.
$$(T(x))_j = x_{j+1}.$$
Clearly this map is only invertible in the case of the two-sided shift.
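The admissibility condition and the failure of invertibility in the one-sided case can be illustrated on finite prefixes. The sketch below is a hedged illustration only: the names `is_admissible` and `shift`, the encoding of the vertices as the integers 0, 1, 2, and the particular matrix are assumptions made for the example.

```python
# Hypothetical adjacency matrix on V = {0, 1, 2} with edges 0->1, 0->2, 1->2, 2->0.
A = [
    [0, 1, 1],
    [0, 0, 1],
    [1, 0, 0],
]

def is_admissible(word, A):
    """True if every consecutive pair (x_j, x_{j+1}) satisfies A[x_j][x_{j+1}] == 1."""
    return all(A[x][y] == 1 for x, y in zip(word, word[1:]))

def shift(word):
    """Left shift on a finite prefix: (T(x))_j = x_{j+1}, so the first symbol is dropped."""
    return word[1:]

# Two different admissible prefixes that agree from index 1 onwards.
x = (0, 2, 0, 1, 2, 0)
y = (1, 2, 0, 1, 2, 0)

print(is_admissible(x, A), is_admissible(y, A))  # both are walks of the graph
print(shift(x) == shift(y))                      # the one-sided shift forgets x_0
```

Because `x` and `y` differ only in their first symbol, they have the same image under the shift, which is why the one-sided shift has no inverse. In the two-sided case no symbol is discarded, only the origin moves.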
A subshift of finite type is called transitive if $G$ is strongly connected: there is a sequence of edges from any one vertex to any other vertex. Transitive subshifts of finite type are precisely those that correspond to dynamical systems with dense orbits.
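Strong connectivity can be tested directly from the adjacency matrix. The following is a minimal sketch, assuming the vertices are indexed 0 to n-1 and using a plain depth-first search; `reachable` and `is_transitive` are hypothetical helper names.

```python
def reachable(A, start):
    """Set of vertices that can be reached from `start` along directed edges."""
    seen = {start}
    stack = [start]
    while stack:
        u = stack.pop()
        for v, has_edge in enumerate(A[u]):
            if has_edge and v not in seen:
                seen.add(v)
                stack.append(v)
    return seen

def is_transitive(A):
    """True if every vertex reaches every other vertex (strong connectivity)."""
    n = len(A)
    return all(len(reachable(A, v)) == n for v in range(n))

cycle = [[0, 1, 0], [0, 0, 1], [1, 0, 0]]   # 0 -> 1 -> 2 -> 0
one_way = [[1, 1], [0, 1]]                  # 0 -> 1, but no way back from 1 to 0

print(is_transitive(cycle), is_transitive(one_way))
```

The search counts the starting vertex as reached, so a graph with a single vertex is reported as connected even when it has no edge; that edge case would need separate handling.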
An important special case is the full $n$-shift: it has a graph with an edge from every vertex to every vertex, itself included; that is, all of the entries of the adjacency matrix are 1. The full $n$-shift corresponds to the Bernoulli scheme without the measure.
By convention, the term shift is understood to refer to the full $n$-shift. A subshift is then any subspace of the full shift that is shift-invariant (that is, a subspace that is invariant under the action of the shift operator), non-empty, and closed for the product topology defined below. Some subshifts can be characterized by a transition matrix, as above; such subshifts are then called subshifts of finite type. Often, subshifts of finite type are called simply shifts of finite type. Subshifts of finite type are also sometimes called topological Markov shifts.
Many chaotic dynamical systems are isomorphic to subshifts of finite type; examples include systems with transverse homoclinic connections, diffeomorphisms of closed manifolds with a positive metric entropy, the Prouhet–Thue–Morse system, the Chacon system (this is the first system shown to be weakly mixing but not strongly mixing), Sturmian systems and Toeplitz systems.[3]
A sofic system is an image of a subshift of finite type where different edges of the transition graph may be mapped to the same symbol. For example, if one only watches the output from a hidden Markov chain, then the output appears to be a sofic system.[4] It may be regarded as the set of labellings of paths through an automaton: a subshift of finite type then corresponds to an automaton which is deterministic.[5] Such systems correspond to regular languages.
Context-free systems are defined analogously, and are generated by phrase structure grammars.
A renewal system is defined to be the set of all infinite concatenations of some fixed finite collection of finite words.
Subshifts of finite type are identical to free (non-interacting) one-dimensional Potts models ($n$-letter generalizations of Ising models), with certain nearest-neighbor configurations excluded. Interacting Ising models are defined as subshifts together with a continuous function of the configuration space (continuous with respect to the product topology, defined below); the partition function and Hamiltonian are explicitly expressible in terms of this function.
Subshifts may be quantized in a certain way, leading to the idea of quantum finite automata.
A subshift has a natural topology, derived from the product topology on $V^{\mathbb{Z}}$, where
$$V^{\mathbb{Z}} = \prod_{n \in \mathbb{Z}} V = \{ x = (\ldots, x_{-1}, x_0, x_1, \ldots) : x_k \in V \ \forall k \in \mathbb{Z} \}$$
and $V$ is given the discrete topology. A basis for the topology of $V^{\mathbb{Z}}$, which induces the topology of the subshift, is the family of cylinder sets
$$C_t[a_0, \ldots, a_s] = \{ x \in V^{\mathbb{Z}} : x_t = a_0, \ldots, x_{t+s} = a_s \}.$$
The cylinder sets are clopen sets in $V^{\mathbb{Z}}$. Every open set in $V^{\mathbb{Z}}$ is a countable union of cylinder sets. Every open set in the subshift is the intersection of an open set of $V^{\mathbb{Z}}$ with the subshift. With respect to this topology, the shift $T$ is a homeomorphism; that is, with respect to this topology, it is continuous with continuous inverse.
The space $V^{\mathbb{Z}}$ is homeomorphic to a Cantor set.
A variety of different metrics can be defined on a shift space. One can define a metric on a shift space by considering two points to be "close" if they have many initial symbols in common; this is the $p$-adic metric. In fact, both the one- and two-sided shift spaces are compact metric spaces.
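One concrete choice of such a metric on one-sided sequences can be sketched as follows. The formula $d(x, y) = 2^{-k}$, with $k$ the first index at which the sequences disagree, is an assumption made for this illustration (the base 2 is arbitrary), and the function works on finite prefixes only, so it returns 0 when the prefixes cannot be told apart.

```python
def prefix_distance(x, y, base=2.0):
    """Distance base**(-k), where k is the first index at which x and y differ.

    x and y are finite prefixes of one-sided sequences; if no difference is
    found within the shorter prefix, the distance is reported as 0.0.
    """
    for k, (a, b) in enumerate(zip(x, y)):
        if a != b:
            return base ** (-k)
    return 0.0

x = "ABCABC"
y = "ABCACA"   # agrees with x on indices 0..3, differs at index 4
z = "ACABCA"   # differs from x already at index 1

print(prefix_distance(x, y))  # 2**-4 = 0.0625
print(prefix_distance(x, z))  # 2**-1 = 0.5
```

Sequences that share a longer initial block are closer, and the distance satisfies the strong triangle inequality $d(x, z) \le \max(d(x, y), d(y, z))$.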
A subshift of finite type may be endowed with any one of several different measures, thus leading to a measure-preserving dynamical system. A common object of study is the Markov measure, which is an extension of a Markov chain to the topology of the shift.
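A Markov chain is a pair $(P, \pi)$ consisting of the transition matrix, an $n \times n$ matrix $P = (p_{ij})$ for which all $p_{ij} \ge 0$ and
$$\sum_{j=1}^{n} p_{ij} = 1$$
for all $i$. The stationary probability vector $\pi = (\pi_i)$ has all $\pi_i \ge 0$, $\sum_i \pi_i = 1$, and has
$$\sum_{i=1}^{n} \pi_i p_{ij} = \pi_j.$$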
A Markov chain, as defined above, is said to be compatible with the shift of finite type if $p_{ij} = 0$ whenever $A_{ij} = 0$. The Markov measure of a cylinder set may then be defined by
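The three conditions (rows of the transition matrix summing to 1, stationarity of the probability vector, and compatibility with the adjacency matrix) can be verified exactly with rational arithmetic. The sketch below uses a hypothetical three-state chain chosen for illustration; the matrices `A`, `P`, the vector `pi`, and the helper names are not taken from the text.

```python
from fractions import Fraction as F

# Hypothetical example: edges 0->1, 0->2, 1->2, 2->0.
A = [[0, 1, 1],
     [0, 0, 1],
     [1, 0, 0]]
P = [[F(0), F(1, 2), F(1, 2)],
     [F(0), F(0), F(1)],
     [F(1), F(0), F(0)]]
pi = [F(2, 5), F(1, 5), F(2, 5)]

def is_stochastic(P):
    """All entries non-negative and every row sums to 1."""
    return all(all(p >= 0 for p in row) and sum(row) == 1 for row in P)

def is_stationary(pi, P):
    """pi is a probability vector with sum_i pi_i * p_ij == pi_j for every j."""
    n = len(P)
    return (all(x >= 0 for x in pi) and sum(pi) == 1 and
            all(sum(pi[i] * P[i][j] for i in range(n)) == pi[j] for j in range(n)))

def is_compatible(P, A):
    """p_ij == 0 whenever A_ij == 0."""
    n = len(P)
    return all(P[i][j] == 0 for i in range(n) for j in range(n) if A[i][j] == 0)

print(is_stochastic(P), is_stationary(pi, P), is_compatible(P, A))
```

Using `Fraction` avoids floating-point round-off, so the equalities are checked exactly rather than up to a tolerance.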
$$\mu(C_t[a_0, \ldots, a_s]) = \pi_{a_0} p_{a_0, a_1} \cdots p_{a_{s-1}, a_s}$$
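The cylinder formula is a product of one stationary weight and the transition probabilities along the word. A minimal sketch, assuming a hypothetical three-state chain (`P` and `pi` below are illustrative values, with `pi` stationary for `P`) and a helper named `cylinder_measure`:

```python
from fractions import Fraction as F
from itertools import product

# Hypothetical chain on states 0, 1, 2; pi is its stationary vector.
P = [[F(0), F(1, 2), F(1, 2)],
     [F(0), F(0), F(1)],
     [F(1), F(0), F(0)]]
pi = [F(2, 5), F(1, 5), F(2, 5)]

def cylinder_measure(word, pi, P):
    """mu(C_t[a_0, ..., a_s]) = pi[a_0] * p[a_0][a_1] * ... * p[a_{s-1}][a_s]."""
    measure = pi[word[0]]
    for a, b in zip(word, word[1:]):
        measure *= P[a][b]
    return measure

print(cylinder_measure((0, 2, 0), pi, P))   # 2/5 * 1/2 * 1 = 1/5
print(cylinder_measure((0, 0), pi, P))      # forbidden transition, measure 0

# The cylinders of a fixed length partition the space, so their measures sum to 1.
total = sum(cylinder_measure(w, pi, P) for w in product(range(3), repeat=4))
print(total)
```

The position $t$ does not appear in the function because the value does not depend on it: stationarity of `pi` makes the measure shift-invariant.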
The Kolmogorov–Sinai entropy with relation to the Markov measure is
$$s_\mu = -\sum_{i=1}^{n} \pi_i \sum_{j=1}^{n} p_{ij} \log p_{ij}$$
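The entropy formula can be evaluated numerically in a few lines. The sketch below assumes the usual convention that terms with $p_{ij} = 0$ contribute nothing, uses the natural logarithm, and applies the hypothetical helper `markov_entropy` to two illustrative chains that are not taken from the text.

```python
from math import log

def markov_entropy(pi, P):
    """-sum_i pi_i sum_j p_ij log p_ij, skipping zero entries (0 log 0 := 0)."""
    return -sum(
        pi[i] * p * log(p)
        for i, row in enumerate(P)
        for p in row
        if p > 0
    )

# Hypothetical three-state chain: only state 0 has a genuine choice (two options).
P = [[0.0, 0.5, 0.5],
     [0.0, 0.0, 1.0],
     [1.0, 0.0, 0.0]]
pi = [0.4, 0.2, 0.4]
print(markov_entropy(pi, P))          # equals 0.4 * log(2)

# Uniform measure on the full 2-shift: every step is a fair coin flip.
P_full = [[0.5, 0.5], [0.5, 0.5]]
pi_full = [0.5, 0.5]
print(markov_entropy(pi_full, P_full))  # equals log(2)
```

Deterministic rows contribute zero, so in the first chain all of the entropy comes from state 0, weighted by how often the chain visits it.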
The Artin–Mazur zeta function is defined as the formal power series
$$\zeta(z) = \exp\left( \sum_{n=1}^{\infty} \left| \mathrm{Fix}(T^n) \right| \frac{z^n}{n} \right),$$
where $\mathrm{Fix}(T^n)$ is the set of fixed points of the $n$-fold shift. It has a product formula
$$\zeta(z) = \prod_{\gamma} \left( 1 - z^{|\gamma|} \right)^{-1}$$
where $\gamma$ runs over the closed orbits.[6] For subshifts of finite type, the zeta function is a rational function of $z$:[7]
$$\zeta(z) = (\det(I - zA))^{-1}.$$
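The determinant formula can be checked numerically against the defining series. The sketch below is an illustration under stated assumptions: it uses a hypothetical 2-state adjacency matrix (the transition 1 to 1 is forbidden), counts fixed points of $T^n$ by brute force as cyclic words of length $n$ whose transitions, including the wrap-around, are all allowed, and truncates the series after 14 terms. All function names are illustrative.

```python
from itertools import product
from math import exp

# Hypothetical adjacency matrix: every transition allowed except 1 -> 1.
A = [[1, 1],
     [1, 0]]

def count_fixed_points(A, n):
    """Number of x with T^n(x) = x: admissible cyclic words of length n."""
    V = range(len(A))
    return sum(
        all(A[w[i]][w[(i + 1) % n]] for i in range(n))
        for w in product(V, repeat=n)
    )

def det(M):
    """Determinant by Laplace expansion along the first row (fine for small M)."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def zeta_from_fixed_points(A, z, terms=14):
    """Truncated series exp(sum_{n=1}^{terms} |Fix(T^n)| z^n / n)."""
    return exp(sum(count_fixed_points(A, n) * z ** n / n for n in range(1, terms + 1)))

def zeta_from_determinant(A, z):
    """Closed form 1 / det(I - zA)."""
    n = len(A)
    I_minus_zA = [[(1 if i == j else 0) - z * A[i][j] for j in range(n)] for i in range(n)]
    return 1 / det(I_minus_zA)

z = 0.1
print([count_fixed_points(A, n) for n in range(1, 6)])
print(zeta_from_fixed_points(A, z), zeta_from_determinant(A, z))
```

For this matrix $\det(I - zA) = 1 - z - z^2$, so both expressions should agree with $1/0.89$ at $z = 0.1$ up to the truncation error. The brute-force count is exponential in $n$ and is meant only as a check for small cases.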