Unavoidable pattern explained
In mathematics and theoretical computer science, a pattern is an unavoidable pattern if it is unavoidable on any finite alphabet.
Definitions
Pattern
Like a word, a pattern (also called term) is a sequence of symbols over some alphabet.
The minimum multiplicity of the pattern
is
where
is the number of occurrence of symbol
in pattern
. In other words, it is the number of occurrences in
of the least frequently occurring symbol in
. Instance
Given finite alphabets
and
, a word
is an instance of the pattern
if there exists a non-erasing semigroup morphism
such that
, where
denotes the
Kleene star of
. Non-erasing means that
for all
, where
denotes the
empty string.
Avoidance / Matching
A word
is said to
match, or
encounter, a pattern
if a factor (also called
subword or
substring) of
is an instance of
. Otherwise,
is said to avoid
, or to be
-free. This definition can be generalized to the case of an infinite
, based on a generalized definition of "substring".
Avoidability / Unavoidability on a specific alphabet
A pattern
is
unavoidable on a finite alphabet
if each sufficiently long word
must match
; formally: if
\existsn\in\Nu. \forallx\in\Sigma*. (|x|\geqn\impliesxmatchesp)
. Otherwise,
is
avoidable on
, which implies there exist infinitely many words over the alphabet
that avoid
.
By Kőnig's lemma, pattern
is avoidable on
if and only if there exists an
infinite word
that avoids
.
Maximal p-free word
Given a pattern
and an alphabet
. A
-free word
is a maximal
-free word over
if
and
match
.
Avoidable / Unavoidable pattern
A pattern
is an unavoidable pattern (also called blocking term
) if
is unavoidable on any finite alphabet.If a pattern is unavoidable and not limited to a specific alphabet, then it is unavoidable for any finite alphabet by default. Conversely, if a pattern is said to be avoidable and not limited to a specific alphabet, then it is avoidable on some finite alphabet by default.
k-avoidable / k-unavoidable
A pattern
is
-avoidable if
is avoidable on an alphabet
of size
. Otherwise,
is
-unavoidable, which means
is unavoidable on every alphabet of size
.[1] If pattern
is
-avoidable, then
is
-avoidable for all
.
Given a finite set of avoidable patterns
, there exists an infinite word
such that
avoids all patterns of
. Let
denote the size of the minimal alphabet
such that
\existw'\in{\Sigma'}\omega
avoiding all patterns of
.
Avoidability index
The avoidability index of a pattern
is the smallest
such that
is
-avoidable, and
if
is unavoidable.Properties
is avoidable if
is an instance of an avoidable pattern
.[2]
be a factor of pattern
, then
is also avoidable.
is unavoidable if and only if
is a factor of some unavoidable pattern
.- Given an unavoidable pattern
and a symbol
not in
, then
is unavoidable.- Given an unavoidable pattern
, then the reversal
is unavoidable.- Given an unavoidable pattern
, there exists a symbol
such that
occurs exactly once in
.
represent the number of distinct symbols of pattern
. If
, then
is avoidable.
Zimin words
See main article: Sesquipower. Given alphabet
, Zimin words (patterns) are defined recursively
for
and
.
Unavoidability
All Zimin words are unavoidable.
A word
is unavoidable if and only if it is a factor of a Zimin word.[3] Given a finite alphabet
, let
represent the smallest
such that
matches
for all
. We have following properties:
[4]
}_
is the longest unavoidable pattern constructed by alphabet
since
.
Pattern reduction
Free letter
Given a pattern
over some alphabet
, we say
is free for
if there exist subsets
of
such that the following hold:
is a factor of
and
↔
is a factor of
and
x\inA\backslashB\cupB\backslashA
For example, let
, then
is free for
since there exist
satisfying the conditions above.Reduce
A pattern
reduces to pattern
if there exists a symbol
such that
is free for
, and
can be obtained by removing all occurrence of
from
. Denote this relation by
.For example, let
, then
can reduce to
since
is free for
.Locked
A word
is said to be locked if
has no free letter; hence
can not be reduced.[5] Transitivity
Given patterns
, if
reduces to
and
reduces to
, then
reduces to
. Denote this relation by
.Unavoidability
A pattern
is unavoidable if and only if
reduces to a word of length one; hence
such that
and
.[6] Graph pattern avoidance[7]
Avoidance / Matching on a specific graph
Given a simple graph
, a edge coloring
matches pattern
if there exists a simple path
in
such that the sequence c(P)=[c(e1),c(e2),...,c(er)]
matches
. Otherwise,
is said to avoid
or be
-free.Similarly, a vertex coloring
matches pattern
if there exists a simple path
in
such that the sequence
matches
.Pattern chromatic number
The pattern chromatic number
is the minimal number of distinct colors needed for a
-free vertex coloring
over the graph
.
Let
\pip(n)=max\{\pip(G):G\inGn\}
where
is the set of all simple graphs with a maximum degree no more than
. Similarly,
and
are defined for edge colorings.
Avoidability / Unavoidability on graphs
A pattern
is avoidable on graphs if
is bounded by
, where
only depends on
. - Avoidance on words can be expressed as a specific case of avoidance on graphs; hence a pattern
is avoidable on any finite alphabet if and only if
for all
, where
is a graph of
vertices concatenated.
Probabilistic bound on
There exists an absolute constant
, such that
for all patterns
with
.
Given a pattern
, let
represent the number of distinct symbols of
. If
, then
is avoidable on graphs.
Explicit colorings
Given a pattern
such that
is even for all
, then
for all
, where
is the
complete graph of
vertices.
Given a pattern
such that
, and an arbitrary
tree
, let
be the set of all avoidable subpatterns and their reflections of
. Then
.
Given a pattern
such that
, and a
tree
with degree
. Let
be the set of all avoidable subpatterns and their reflections of
, then
.
Examples
and
.
. The word over the alphabet
obtained by taking the first difference of the Thue–Morse sequence is an example of an infinite square-free word.[8] [9]
and
are unavoidable on any alphabet, since they are factors of the Zimin words.[10]
for
are 2-avoidable.- All binary patterns can be divided into three categories:[11]
are unavoidable.
xx,xxy,xyy,xxyx,xxyy,xyxx,xyxy,xyyx,xxyxx,xxyxy,xyxyy
have avoidability index of 3.
- others have avoidability index of 2.
has avoidability index of 4, as well as other locked words.
has avoidability index of 5.
[12]
is the infimum of exponents
such that
is avoidable on an alphabet of size
. Also see
Dejean's theorem.
Open problems
- Is there an avoidable pattern
such that the avoidability index of
is 6?
- Given an arbitrarily pattern
, is there an algorithm to determine the avoidability index of
?
References
- Book: Allouche . Jean-Paul . Shallit . Jeffrey . Jeffrey Shallit . 978-0-521-82332-6 . . Automatic Sequences: Theory, Applications, Generalizations . 2003 . 1086.11015 .
- Book: Berstel . Jean . Lauve . Aaron . Reutenauer . Christophe . Saliola . Franco V. . Combinatorics on words. Christoffel words and repetitions in words . CRM Monograph Series . 27 . Providence, RI . . 2009 . 978-0-8218-4480-9 . 1161.68043 .
- Book: Lothaire, M. . M. Lothaire . Algebraic combinatorics on words . With preface by Jean Berstel and Dominique Perrin . Reprint of the 2002 hardback . Encyclopedia of Mathematics and Its Applications . 90. . 2011 . 978-0-521-18071-9 . 1221.68183 .
- Book: Pytheas Fogg, N. . Valérie . Berthé . Valérie Berthé. Ferenczi . Sébastien . Mauduit . Christian . Siegel . A. . Substitutions in dynamics, arithmetics and combinatorics . Lecture Notes in Mathematics . 1794 . Berlin . . 2002 . 3-540-44141-7 . 1014.11015 .
Notes and References
- Book: Combinatorics on Words: Christoffel Words and Repetitions in Words. American Mathematical Soc.. 978-0-8218-7325-0. 127. en.
- Schmidt. Ursula. 1987-08-01. Long unavoidable patterns. Acta Informatica. en. 24. 4. 433–445. 10.1007/BF00292112. 7928450. 1432-0525.
- Zimin. A. I.. Blocking Sets of Terms. 1984. Mathematics of the USSR-Sbornik. en. 47. 2. 353–364. 10.1070/SM1984v047n02ABEH002647. 1984SbMat..47..353Z. 0025-5734.
- Book: Joshua. Cooper. Bounds on Zimin Word Avoidance. Rorabaugh. Danny. 2013. 1409.3080. 2014arXiv1409.3080C.
- Baker. Kirby A.. McNulty. George F.. Taylor. Walter. 1989-12-18. Growth problems for avoidable words. Theoretical Computer Science. en. 69. 3. 319–345. 10.1016/0304-3975(89)90071-6. 0304-3975. free.
- Bean. Dwight R.. Ehrenfeucht. Andrzej. McNulty. George F.. 1979. Avoidable patterns in strings of symbols.. Pacific Journal of Mathematics. en. 85. 2. 261–294. 10.2140/pjm.1979.85.261. 0030-8730. free.
- Grytczuk. Jarosław. 2007-05-28. Pattern avoidance on graphs. Discrete Mathematics. The Fourth Caracow Conference on Graph Theory. en. 307. 11. 1341–1346. 10.1016/j.disc.2005.11.071. 0012-365X. free.
- Book: Combinatorics on Words: Christoffel Words and Repetitions in Words. American Mathematical Soc.. 978-0-8218-7325-0. 97. en.
- Book: Fogg, N. Pytheas. Substitutions in Dynamics, Arithmetics and Combinatorics. 2002-09-23. Springer Science & Business Media. 978-3-540-44141-0. 104. en.
- Book: Allouche. Jean-Paul. Automatic Sequences: Theory, Applications, Generalizations. Shallit. Jeffrey. Shallit. Professor Jeffrey. 2003-07-21. Cambridge University Press. 978-0-521-82332-6. 24. en.
- Book: Lothaire. M.. Algebraic Combinatorics on Words. registration. Cambridge University Press. 2002. 9780521812207.
- Clark. Ronald J.. 2006-04-01. The existence of a pattern which is 5-avoidable but 4-unavoidable. International Journal of Algebra and Computation. 16. 2. 351–367. 10.1142/S0218196706002950. 0218-1967.