Trace monoid explained

In computer science, a trace is a set of strings, wherein certain letters in the string are allowed to commute, but others are not. It generalizes the concept of a string, by not forcing the letters to always be in a fixed order, but allowing certain reshufflings to take place. Traces were introduced by Pierre Cartier and Dominique Foata in 1969 to give a combinatorial proof of MacMahon's master theorem. Traces are used in theories of concurrent computation, where commuting letters stand for portions of a job that can execute independently of one another, while non-commuting letters stand for locks, synchronization points or thread joins.[1]

The trace monoid or free partially commutative monoid is a monoid of traces. In a nutshell, it is constructed as follows: sets of commuting letters are given by an independency relation. These induce an equivalence relation of equivalent strings; the elements of the equivalence classes are the traces. The equivalence relation then partitions up the free monoid (the set of all strings of finite length) into a set of equivalence classes; the result is still a monoid; it is a quotient monoid and is called the trace monoid. The trace monoid is universal, in that all dependency-homomorphic (see below) monoids are in fact isomorphic.

Trace monoids are commonly used to model concurrent computation, forming the foundation for process calculi. They are the object of study in trace theory. The utility of trace monoids comes from the fact that they are isomorphic to the monoid of dependency graphs; thus allowing algebraic techniques to be applied to graphs, and vice versa. They are also isomorphic to history monoids, which model the history of computation of individual processes in the context of all scheduled processes on one or more computers.

Trace

Let

\Sigma*

denote the free monoid, that is, the set of all strings written in the alphabet

\Sigma

. Here, the asterisk denotes, as usual, the Kleene star. An independency relation

I

on

\Sigma

then induces a (symmetric) binary relation

\sim

on

\Sigma*

, where

u\simv

if and only if there exist

x,y\in\Sigma*

, and a pair

(a,b)\inI

such that

u=xaby

and

v=xbay

. Here,

u,v,x

and

y

are understood to be strings (elements of

\Sigma*

), while

a

and

b

are letters (elements of

\Sigma

).

The trace is defined as the reflexive transitive closure of

\sim

. The trace is thus an equivalence relation on

\Sigma*

, and is denoted by

\equivD

, where

D

is the dependency relation corresponding to

I,

that is

D=(\Sigma x \Sigma)\setminusI

and conversely

I=(\Sigma x \Sigma)\setminusD.

Clearly, different dependencies will give different equivalence relations.

The transitive closure implies that

u\equivv

if and only if there exists a sequence of strings

(w0,w1,,wn)

such that

u\simw0

and

v\simwn

and

wi\simwi+1

for all

0\lei<n

. The trace is stable under the monoid operation on

\Sigma*

(concatenation) and is therefore a congruence relation on

\Sigma*

.

The trace monoid, commonly denoted as

M(D)

, is defined as the quotient monoid

M(D)=\Sigma*/\equivD.

The homomorphism

*\to
\phi
D:\Sigma

M(D)

is commonly referred to as the natural homomorphism or canonical homomorphism. That the terms natural or canonical are deserved follows from the fact that this morphism embodies a universal property, as discussed in a later section.

One will also find the trace monoid denoted as

M(\Sigma,I)

where

I

is the independency relation. Confusingly, one can also find the commutation relation used instead of the independency relation (it differs by including all the diagonal elements).

Examples

Consider the alphabet

\Sigma=\{a,b,c\}

. A possible dependency relation is

\begin{matrix}D&=&\{a,b\} x \{a,b\}\cup\{a,c\} x \{a,c\}\\ &=&\{a,b\}2\cup\{a,c\}2\\ &=&\{(a,b),(b,a),(a,c),(c,a),(a,a),(b,b),(c,c)\}\end{matrix}

The corresponding independency is

ID=\{(b,c),(c,b)\}

Therefore, the letters

b,c

commute. Thus, for example, a trace equivalence class for the string

abababbca

would be

[abababbca]D=\{abababbca,abababcba,ababacbba\}

The equivalence class

[abababbca]D

is an element of the trace monoid.

Properties

The cancellation property states that equivalence is maintained under right cancellation. That is, if

w\equivv

, then

(w ÷ a)\equiv(v ÷ a)

. Here, the notation

w ÷ a

denotes right cancellation, the removal of the first occurrence of the letter a from the string w, starting from the right-hand side. Equivalence is also maintained by left-cancellation. Several corollaries follow:

w\equivv

if and only if

xwy\equivxvy

for strings x and y. Thus, the trace monoid is a syntactic monoid.

ua\equivvb

and

a\neb

, then a is independent of b. That is,

(a,b)\inID

. Furthermore, there exists a string w such that

u\equivwb

and

v\equivwa

.

w\equivv

, then

\pi\Sigma(w)\equiv\pi\Sigma(v)

.

A strong form of Levi's lemma holds for traces. Specifically, if

uv\equivxy

for strings u, v, x, y, then there exist strings

z1,z2,z3

and

z4

such that

(w2,w3)\inID

for all letters

w2\in\Sigma

and

w3\in\Sigma

such that

w2

occurs in

z2

and

w3

occurs in

z3

, and

u\equivz1z2,    v\equivz3z4,

x\equivz1z3,    y\equivz2z4.

[2]

Universal property

A dependency morphism (with respect to a dependency D) is a morphism

\psi:\Sigma*\toM

to some monoid M, such that the "usual" trace properties hold, namely:

1.

\psi(w)=\psi(\varepsilon)

implies that

w=\varepsilon

2.

(a,b)\inID

implies that

\psi(ab)=\psi(ba)

3.

\psi(ua)=\psi(v)

implies that

\psi(u)=\psi(v ÷ a)

4.

\psi(ua)=\psi(vb)

and

a\neb

imply that

(a,b)\inID

Dependency morphisms are universal, in the sense that for a given, fixed dependency D, if

\psi:\Sigma*\toM

is a dependency morphism to a monoid M, then M is isomorphic to the trace monoid

M(D)

. In particular, the natural homomorphism is a dependency morphism.

Normal forms

There are two well-known normal forms for words in trace monoids. One is the lexicographic normal form, due to Anatolij V. Anisimov and Donald Knuth, and the other is the Foata normal form due to Pierre Cartier and Dominique Foata who studied the trace monoid for its combinatorics in the 1960s.[3]

Unicode's Normalization Form Canonical Decomposition (NFD) is an example of a lexicographic normal form - the ordering is to sort consecutive characters with non-zero canonical combining class by that class.

Trace languages

Just as a formal language can be regarded as a subset of

\Sigma*

, the set of all possible strings, so a trace language is defined as a subset of

M(D)

all possible traces.

Alternatively, but equivalently, a language

L\subseteq\Sigma*

is a trace language, or is said to be consistent with dependency D if

L=[L]D

where

[L]D=cupw[w]D

is the trace closure of a set of strings.

See also

References

General references

Seminal publications

Notes and References

  1. Sándor & Crstici (2004) p.161
  2. Proposition 2.2, Diekert and Métivier 1997.
  3. Section 2.3, Diekert and Métivier 1997.