History monoid explained

In mathematics and computer science, a history monoid is a way of representing the histories of concurrently running computer processes as a collection of strings, each string representing the individual history of a process. The history monoid provides a set of synchronization primitives (such as locks, mutexes or thread joins) for providing rendezvous points between a set of independently executing processes or threads.

History monoids occur in the theory of concurrent computation, and provide a low-level mathematical foundation for process calculi, such as CSP the language of communicating sequential processes, or CCS, the calculus of communicating systems. History monoids were first presented by M.W. Shields.[1]

History monoids are isomorphic to trace monoids (free partially commutative monoids) and to the monoid of dependency graphs. As such, they are free objects and are universal. The history monoid is a type of semi-abelian categorical product in the category of monoids.

Product monoids and projection

Let

A=(\Sigma1,\Sigma2,\ldots,\Sigman)

\Sigmak

. Let

P(A)

denote all possible combinations of one finite-length string from each alphabet:
*
P(A)=\Sigma
1

x

*
\Sigma
2

x x

*
\Sigma
n

(In more formal language,

P(A)

is the Cartesian product of the free monoids of the

\Sigmak

. The superscript star is the Kleene star.) Composition in the product monoid is component-wise, so that, for

u=(u1,u2,\ldots,un)

and

v=(v1,v2,\ldots,vn)

then

uv=(u1v1,u2v2,\ldots,unvn)

for all

u,v

in

P(A)

. Define the union alphabet to be

\Sigma=\Sigma1\cup\Sigma2\cup\cup\Sigman.

(The union here is the set union, not the disjoint union.) Given any string

w\in\Sigma*

, we can pick out just the letters in some
*
\Sigma
k
using the corresponding string projection
*
\pi
k
. A distribution

\pi:\Sigma*\toP(A)

is the mapping that operates on

w\in\Sigma*

with all of the

\pik

, separating it into components in each free monoid:

\pi(w)\mapsto(\pi1(w),\pi2(w),\ldots,\pin(w)).

Histories

For every

a\in\Sigma

, the tuple

\pi(a)

is called the elementary history of a. It serves as an indicator function for the inclusion of a letter a in an alphabet

\Sigmak

. That is,

\pi(a)=(a1,a2,\ldots,an)

where

ak=\begin{cases}aifa\in\Sigmak\\ \varepsilonotherwise. \end{cases}

Here,

\varepsilon

denotes the empty string. The history monoid

H(A)

is the submonoid of the product monoid

P(A)

generated by the elementary histories:

H(A)=\{\pi(a)|a\in\Sigma\}*

(where the superscript star is the Kleene star applied with a component-wise definition of composition as given above). The elements of

H(A)

are called global histories, and the projections of a global history are called individual histories.

Connection to computer science

The use of the word history in this context, and the connection to concurrent computing, can be understood as follows. An individual history is a record of the sequence of states of a process (or thread or machine); the alphabet

\Sigmak

is the set of states of the process.

A letter that occurs in two or more alphabets serves as a synchronization primitive between the various individual histories. That is, if such a letter occurs in one individual history, it must also occur in another history, and serves to "tie" or "rendezvous" them together.

Consider, for example,

\Sigma1=\{a,b,c\}

and

\Sigma2=\{a,d,e\}

. The union alphabet is of course

\Sigma=\{a,b,c,d,e\}

. The elementary histories are

(a,a)

,

(b,\varepsilon)

,

(c,\varepsilon)

,

(\varepsilon,d)

and

(\varepsilon,e)

. In this example, an individual history of the first process might be

bcbcc

while the individual history of the second machine might be

ddded

. Both of these individual histories are represented by the global history

bcbdddcced

, since the projection of this string onto the individual alphabets yields the individual histories. In the global history, the letters

b

and

c

can be considered to commute with the letters

d

and

e

, in that these can be rearranged without changing the individual histories. Such commutation is simply a statement that the first and second processes are running concurrently, and are unordered with respect to each other; they have not (yet) exchanged any messages or performed any synchronization.

The letter

a

serves as a synchronization primitive, as its occurrence marks a spot in both the global and individual histories, that cannot be commuted across. Thus, while the letters

b

and

c

can be re-ordered past

d

and

e

, they cannot be reordered past

a

. Thus, the global history

bcdabe

and the global history

bdcaeb

both have as individual histories

bcab

and

dae

, indicating that the execution of

d

may happen before or after

c

. However, the letter

a

is synchronizing, so that

e

is guaranteed to happen after

c

, even though

e

is in a different process than

c

.

Properties

A history monoid is isomorphic to a trace monoid, and as such, is a type of semi-abelian categorical product in the category of monoids. In particular, the history monoid

H(\Sigma1,\Sigma2,\ldots,\Sigman)

is isomorphic to the trace monoid

M(D)

with the dependency relation given by

D=\left(\Sigma1 x \Sigma1\right)\cup \left(\Sigma2 x \Sigma2\right)\cup\cup \left(\Sigman x \Sigman\right).

In simple terms, this is just the formal statement of the informal discussion given above: the letters in an alphabet

\Sigmak

can be commutatively re-ordered past the letters in an alphabet

\Sigmaj

, unless they are letters that occur in both alphabets. Thus, traces are exactly global histories, and vice versa.

Conversely, given any trace monoid

M(D)

, one can construct an isomorphic history monoid by taking a sequence of alphabets

\Sigmaa,=\{a,b\}

where

(a,b)

ranges over all pairs in

D

.

Notes

  1. M.W. Shields "Concurrent Machines", Computer Journal, (1985) 28 pp. 449–465.

References