Π-calculus explained

In theoretical computer science, the -calculus (or pi-calculus) is a process calculus. The -calculus allows channel names to be communicated along the channels themselves, and in this way it is able to describe concurrent computations whose network configuration may change during the computation.

The -calculus has few terms and is a small, yet expressive language (see). Functional programs can be encoded into the -calculus, and the encoding emphasises the dialogue nature of computation, drawing connections with game semantics. Extensions of the -calculus, such as the spi calculus and applied, have been successful in reasoning about cryptographic protocols. Beside the original use in describing concurrent systems, the -calculus has also been used to reason about business processes^[1] and molecular biology.

Informal definition

The -calculus belongs to the family of process calculi, mathematical formalisms for describing and analyzing properties of concurrent computation. In fact, the -calculus, like the λ-calculus, is so minimal that it does not contain primitives such as numbers, booleans, data structures, variables, functions, or even the usual control flow statements (such as if-then-else, while).

Process constructs

Central to the -calculus is the notion of name. The simplicity of the calculus lies in the dual role that names play as communication channels and variables.

The process constructs available in the calculus are the following^[2] (a precise definition is given in the following section):

concurrency, written

P\midQ

, where

and

are two processes or threads executed concurrently.

communication, where
- input prefixing

c\left(x\right).P

is a process waiting for a message that was sent on a communication channel named

before proceeding as binding the name received to the name Typically, this models either a process expecting a communication from the network or a label c usable only once by a goto c operation.

- output prefixing

\overline{c}\langley\rangle.P

describes that the name

is emitted on channel

before proceeding as Typically, this models either sending a message on the network or a goto c operation.

replication, written

, which may be seen as a process which can always create a new copy of Typically, this models either a network service or a label c waiting for any number of goto c operations.

creation of a new name, written

\left(\nux\right)P

, which may be seen as a process allocating a new constant within The constants of are defined by their names only and are always communication channels. Creation of a new name in a process is also called restriction.

the nil process, written

, is a process whose execution is complete and has stopped.

Although the minimalism of the -calculus prevents us from writing programs in the normal sense, it is easy to extend the calculus. In particular, it is easy to define both control structures such as recursion, loops and sequential composition and datatypes such as first-order functions, truth values, lists and integers. Moreover, extensions of the have been proposed which take into account distribution or public-key cryptography. The applied due to Abadi and Fournet https://www.soe.ucsc.edu/~abadi/Papers/isss02.pdf put these various extensions on a formal footing by extending the with arbitrary datatypes.

A small example

Below is a tiny example of a process which consists of three parallel components. The channel name is only known by the first two components.

\begin{align} (\nux)& ( \overline{x}\langlez\rangle. 0\\ & | x(y). \overline{y}\langlex\rangle. x(y). 0 )\\ & | z(v). \overline{v}\langlev\rangle.0 \end{align}

The first two components are able to communicate on the channel, and the name becomes bound to . The next step in the process is therefore

\begin{align} (\nux)& ( 0\\ & | \overline{z}\langlex\rangle. x(y). 0 )\\ & | z(v). \overline{v}\langlev\rangle. 0 \end{align}

Note that the remaining is not affected because it is defined in an inner scope.The second and third parallel components can now communicate on the channel name, and the name becomes bound to . The next step in the process is now

\begin{align} (\nux)& ( 0\\ & | x(y). 0\\ & | \overline{x}\langlex\rangle. 0 ) \end{align}

Note that since the local name has been output, the scope of is extended to cover the third component as well. Finally, the channel can be used for sending the name . After that all concurrently executing processes have stopped

\begin{align} (\nux)& ( 0\\ & | 0\\ & | 0 ) \end{align}

Formal definition

Syntax

Let Χ be a set of objects called names. The abstract syntax for the -calculus is built from the following BNF grammar (where x and y are any names from Χ):^[3]

\begin{align} P,Q::=& x(y).P&Receiveonchannelx,bindtheresulttoy,thenrunP\\ & | \overline{x}\langley\rangle.P&Sendthevalueyoverchannelx,thenrunP\\ & | P|Q&RunPandQsimultaneously\\ & | (\nux)P&CreateanewchannelxandrunP\\ & | !P&RepeatedlyspawncopiesofP\\ & | 0&Terminatetheprocess \end{align}

In the concrete syntax below, the prefixes bind more tightly than the parallel composition (|), and parentheses are used to disambiguate.

Names are bound by the restriction and input prefix constructs. Formally, the set of free names of a process in –calculus are defined inductively by the table below. The set of bound names of a process are defined as the names of a process that are not in the set of free names.

Construct	Free names
0	None
\overline{a}\langlex\rangle.P	a; x; all free names of P
a(x).P	a; free names of P except for x
P	Q	All free names of P and Q
(\nux)P	Free names of P except for x
!P	All free names of P

Structural congruence

Central to both the reduction semantics and the labelled transition semantics is the notion of structural congruence. Two processes are structurally congruent, if they are identical up to structure. In particular, parallel composition is commutative and associative.

More precisely, structural congruence is defined as the least equivalence relation preserved by the process constructs and satisfying:

Alpha-conversion:

P\equivQ

can be obtained from

by renaming one or more bound names in

Axioms for parallel composition:

P|Q\equivQ|P

(P|Q)|R\equivP|(Q|R)

P|0\equivP

Axioms for restriction:

(\nux)(\nuy)P\equiv(\nuy)(\nux)P

(\nux)0\equiv0

Axiom for replication:

!P\equivP|!P

Axiom relating restriction and parallel:

(\nux)(P|Q)\equiv(\nux)P|Q

if is not a free name of

This last axiom is known as the "scope extension" axiom. This axiom is central, since it describes how a bound name may be extruded by an output action, causing the scope of to be extended. In cases where is a free name of

, alpha-conversion may be used to allow extension to proceed.

Reduction semantics

We write

P → P'

can perform a computation step, following which it is now

.This reduction relation

→

is defined as the least relation closed under a set of reduction rules.

The main reduction rule which captures the ability of processes to communicate through channels is the following:

\overline{x}\langlez\rangle.P|x(y).Q → P|Q[z/y]

where

Q[z/y]

denotes the process

in which the free name

has been substituted for the free occurrences of

. If a free occurrence of

occurs in a location where

would not be free, alpha-conversion may be required.

There are three additional rules:

P → Q

then also

P|R → Q|R

This rule says that parallel composition does not inhibit computation.

P → Q

, then also

(\nux)P → (\nux)Q

This rule ensures that computation can proceed underneath a restriction.

P\equivP'

and

P' → Q'

and

Q'\equivQ

, then also

P → Q

The latter rule states that processes that are structurally congruent have the same reductions.

The example revisited

Consider again the process

(\nux)(\overline{x}\langlez\rangle.0|x(y).\overline{y}\langlex\rangle.x(y).0)|z(v).\overline{v}\langlev\rangle.0

Applying the definition of the reduction semantics, we get the reduction

(\nux)(\overline{x}\langlez\rangle.0|x(y).\overline{y}\langlex\rangle.x(y).0)|z(v).\overline{v}\langlev\rangle.0 → (\nux)(0|\overline{z}\langlex\rangle.x(y).0)|z(v).\overline{v}\langlev\rangle.0

Note how, applying the reduction substitution axiom, free occurrences of

are now labeled as

Next, we get the reduction

(\nux)(0|\overline{z}\langlex\rangle.x(y).0)|z(v).\overline{v}\langlev\rangle.0 → (\nux)(0|x(y).0|\overline{x}\langlex\rangle.0)

Note that since the local name has been output, the scope of is extended to cover the third component as well. This was captured using the scope extension axiom.

Next, using the reduction substitution axiom, we get

(\nux)(0|0|0)

Finally, using the axioms for parallel composition and restriction, we get

Labelled semantics

Alternatively, one may give the pi-calculus a labelled transition semantics (as has been done with the Calculus of Communicating Systems).
In this semantics, a transition from a state

to some other state

after an action

\alpha

is notated as:

P\xrightarrow{\overset{}\alpha}P'

Where states

and

represent processes and

\alpha

is either an input action

a(x)

, an output action \overline{a}\langlex\rangle

, or a silent action .^[4]

A standard result about the labelled semantics is that it agrees with the reduction semantics up to structural congruence, in the sense that

P → P'

if and only if

P\xrightarrow{\overset{}\tau}\equivP'

^[5]

Extensions and variants

The syntax given above is a minimal one. However, the syntax may be modified in various ways.

A nondeterministic choice operator

P+Q

can be added to the syntax.

A test for name equality

[x=y]P

can be added to the syntax. This match operator can proceed as

if and only if and

are the same name.Similarly, one may add a mismatch operator for name inequality. Practical programs which can pass names (URLs or pointers) often use such functionality: for directly modeling such functionality inside the calculus, this and related extensions are often useful.

The asynchronous -calculus^[6] ^[7] allows only outputs with no continuation, i.e. output atoms of the form

\overline{x}\langley\rangle

, yielding a smaller calculus. However, any process in the original calculus can be represented by the smaller asynchronous -calculus using an extra channel to simulate explicit acknowledgement from the receiving process. Since a continuation-free output can model a message-in-transit, this fragment shows that the original -calculus, which is intuitively based on synchronous communication, has an expressive asynchronous communication model inside its syntax. However, the nondeterministic choice operator defined above cannot be expressed in this way, as an unguarded choice would be converted into a guarded one; this fact has been used to demonstrate that the asynchronous calculus is strictly less expressive than the synchronous one (with the choice operator).^[8]

The polyadic -calculus allows communicating more than one name in a single action:

\overline{x}\langlez_1,...,z_n\rangle.P

(polyadic output) and

x(z_1,...,z_n).P

(polyadic input). This polyadic extension, which is useful especially when studying types for name passing processes, can be encoded in the monadic calculus by passing the name of a private channel through which the multiple arguments are then passed in sequence. The encoding is defined recursively by the clauses

\overline{x}\langley_{1, … ,y}_n\rangle.P

is encoded as

(\nuw)\overline{x}\langlew\rangle.\overline{w}\langley_{1\rangle. … .\overline{w}\langle}y_n\rangle.[P]

x(y_{1, … ,y}_n).P

is encoded as

x(w).w(y_{1). … .w(y}_n).[P]

All other process constructs are left unchanged by the encoding.

In the above,

[P]

denotes the encoding of all prefixes in the continuation

in the same way.

The full power of replication

is not needed. Often, one only considers replicated input

!x(y).P

, whose structural congruence axiom is

!x(y).P\equivx(y).P|!x(y).P

Replicated input process such as

!x(y).P

can be understood as servers, waiting on channel to be invoked by clients. Invocation of a server spawns a new copy ofthe process

P[a/y]

, where a is the name passed by the client to theserver, during the latter's invocation.

A higher order -calculus can be defined where not only names but processes are sent through channels.The key reduction rule for the higher order case is

\overline{x}\langleR\rangle.P|x(Y).Q → P|Q[R/Y]

Here,

denotes a process variable which can be instantiated by a process term. Sangiorgiestablished that the ability to pass processes does notincrease the expressivity of the -calculus: passing a process P can besimulated by just passing a name that points to P instead.

Properties

Turing completeness

The -calculus is a universal model of computation. This was first observed by Milner in his paper "Functions as Processes",^[9] in which he presents two encodings of the lambda-calculus in the -calculus. One encoding simulates the eager (call-by-value) evaluation strategy, the other encoding simulates the normal-order (call-by-name) strategy. In both of these, the crucial insight is the modeling of environment bindings – for instance, " is bound to term $M$ " – as replicating agents that respond to requests for their bindings by sending back a connection to the term

The features of the -calculus that make these encodings possible are name-passing and replication (or, equivalently, recursively defined agents). In the absence of replication/recursion, the -calculus ceases to be Turing-complete. This can be seen by the fact that bisimulation equivalence becomes decidable for the recursion-free calculus and even for the finite-control -calculus where the number of parallel components in any process is bounded by a constant.^[10]

Bisimulations in the -calculus

Early and late bisimilarity

Early and late bisimilarity were both formulated by Milner, Parrow and Walker in their original paper on the -calculus.^[11]

A binary relation

over processes is an early bisimulation if for every pair of processes

(p,q)\inR

whenever

p\xrightarrow{a(x)}p'

then for every name

there exists some

such that

q\xrightarrow{a(x)}q'

and

(p'[y/x],q'[y/x])\inR

;

for any non-input action

\alpha

, if

{ p\xrightarrow{\overset{}{\alpha}}p' }

then there exists some

such that

q\xrightarrow{\overset{}{\alpha}}q'

and

(p',q')\inR

;

and symmetric requirements with

and

interchanged.

Processes

and

are said to be early bisimilar, written

p\sim_eq

if the pair

(p,q)\inR

for some early bisimulation

In late bisimilarity, the transition match must be independent of the name being transmitted.A binary relation

over processes is a late bisimulation if for every pair of processes

(p,q)\inR

whenever

p\xrightarrow{a(x)}p'

then for some

it holds that

q\xrightarrow{a(x)}q'

and

(p'[y/x],q'[y/x])\inR

for every name y;

for any non-input action

\alpha

, if

p\xrightarrow{\overset{}{\alpha}}p'

implies that there exists some

such that

q\xrightarrow{\overset{}{\alpha}}q'

and

(p',q')\inR

;

and symmetric requirements with

and

interchanged.Processes

and

are said to be late bisimilar, written

p\sim_lq

if the pair

(p,q)\inR

for some late bisimulation

Both

\sim_e

and

\sim_l

suffer from the problem that they are not congruence relations in the sense that they are not preserved by all process constructs. More precisely, there exist processes

and

such that

p\sim_eq

but

a(x).p\not\sim_ea(x).q

. One may remedy this problem by considering the maximal congruence relations included in

\sim_e

and

\sim_l

, known as early congruence and late congruence, respectively.

Open bisimilarity

Fortunately, a third definition is possible, which avoids this problem, namely that of open bisimilarity, due to Sangiorgi.^[12]

A binary relation

over processes is an open bisimulation if for every pair of elements

(p,q)\inR

and for every name substitution

\sigma

and every action

\alpha

, whenever

p\sigma\xrightarrow{\overset{}{\alpha}}p'

then there exists some

such that

q\sigma\xrightarrow{\overset{}{\alpha}}q'

and

(p',q')\inR

Processes

and

are said to be open bisimilar, written

p\sim_oq

if the pair

(p,q)\inR

for some open bisimulation

Early, late and open bisimilarity are distinct

Early, late and open bisimilarity are distinct. The containments are proper, so

\sim_o\subsetneq\sim_l\subsetneq\sim_e

In certain subcalculi such as the asynchronous pi-calculus, late, early and open bisimilarity are known to coincide. However, in this setting a more appropriate notion is that of asynchronous bisimilarity.In the literature, the term open bisimulation usually refers to a more sophisticated notion, where processes and relations are indexed by distinction relations; details are in Sangiorgi's paper cited above.

Barbed equivalence

Alternatively, one may define bisimulation equivalence directly from the reduction semantics. We write

p\Downarrowa

if process

immediately allows an input or an output on name

A binary relation

over processes is a barbed bisimulation if it is a symmetric relation which satisfies that for every pair of elements

(p,q)\inR

we have that

(1)

p\Downarrowa

if and only if

q\Downarrowa

for every name

and

(2) for every reduction

p → p'

there exists a reduction

q → q'

such that

(p',q')\inR

We say that

and

are barbed bisimilar if there exists a barbed bisimulation

where

(p,q)\inR

Defining a context as a term with a hole [] we say that two processes P and Q are barbed congruent, written

P\sim_bQ

, if for every context

C[]

we have that

C[P]

and

C[Q]

are barbed bisimilar. It turns out that barbed congruence coincides with the congruence induced by early bisimilarity.

Applications

The -calculus has been used to describe many different kinds of concurrent systems. In fact, some of the most recent applications lie outside the realm of traditional computer science.

In 1997, Martin Abadi and Andrew Gordon proposed an extension of the -calculus, the Spi-calculus, as a formal notation for describing and reasoning about cryptographic protocols. The spi-calculus extends the -calculus with primitives for encryption and decryption. In 2001, Martin Abadi and Cedric Fournet generalised the handling of cryptographic protocols to produce the applied calculus. There is now a large body of work devoted to variants of the applied calculus, including a number of experimental verification tools. One example is the tool ProVerif http://www.proverif.ens.fr/ due to Bruno Blanchet, based on a translation of the applied -calculus into Blanchet's logic programming framework. Another example is Cryptyc http://www.cryptyc.org, due to Andrew Gordon and Alan Jeffrey, which uses Woo and Lam's method of correspondence assertions as the basis for type systems that can check for authentication properties of cryptographic protocols.

Around 2002, Howard Smith and Peter Fingar became interested that -calculus would become a description tool for modeling business processes. By July 2006, there is discussion in the community about how useful this would be. Most recently, the -calculus has formed the theoretical basis of Business Process Modeling Language (BPML), and of Microsoft's XLANG.^[13]

The -calculus has also attracted interest in molecular biology. In 1999, Aviv Regev and Ehud Shapiro showed that one can describe a cellular signaling pathway (the so-called RTK/MAPK cascade) and in particular the molecular "lego" which implements these tasks of communication in an extension of the -calculus.^[14] Following this seminal paper, other authors described the whole metabolic network of a minimal cell.^[15] In 2009, Anthony Nash and Sara Kalvala proposed a -calculus framework to model the signal transduction that directs Dictyostelium discoideum aggregation.^[16]

History

The -calculus was originally developed by Robin Milner, Joachim Parrow and David Walker in 1992, based on ideas by Uffe Engberg and Mogens Nielsen.^[17] It can be seen as a continuation of Milner's work on the process calculus CCS (Calculus of Communicating Systems). In his Turing lecture, Milner describes the development of the -calculus as an attempt to capture the uniformity of values and processes in actors.^[18]

Implementations

The following programming languages implement the -calculus or one of its variants:

Business Process Modeling Language (BPML)
occam-π
Pict
JoCaml (based on the Join-calculus)
RhoLang

References

Book: Milner, Robin. Robin Milner. Communicating and Mobile Systems: The π-calculus. 1999. Cambridge University Press. Cambridge, UK. 0-521-65869-1. registration.
Book: Milner, Robin. Robin Milner. F. L. Hamer . W. Brauer . H. Schwichtenberg. Logic and Algebra of Specification. http://www.lfcs.inf.ed.ac.uk/reports/91/ECS-LFCS-91-180/ECS-LFCS-91-180.ps. 1993. Springer-Verlag. The Polyadic π-Calculus: A Tutorial.
Book: Sangiorgi. Davide. Davide Sangiorgi. Walker. David. David Walker (computer scientist). The π-calculus: A Theory of Mobile Processes. 2001. Cambridge University Press. Cambridge, UK. 0-521-78177-9.

Notes and References

OMG Specification (2011). "Business Process Model and Notation (BPMN) Version 2.0", Object Management Group. p.21
Web site: FAQ on π-Calculus. Wing. Jeannette M.. 27 December 2002.
http://www.lfcs.inf.ed.ac.uk/reports/89/ECS-LFCS-89-85/ A Calculus of Mobile Processes part 1
Robin Milner, Communicating and Mobile Systems: The Pi Calculus, Cambridge University Press, . 1999
Sangiorgi, D., & Walker, D. (2003). p51, The Pi-Calculus. Cambridge University Press.
Book: Boudol. G.. Asynchrony and the -calculus. Technical Report 1702, INRIA, Sophia-Antipolis. 1992.
Book: Honda . K. . Tokoro . M. . An Object Calculus for Asynchronous Communication. ECOOP 91. Springer Verlag. 1991.
Palamidessi. Catuscia. Catuscia Palamidessi. Comparing the expressive power of the Synchronous and the Asynchronous pi-calculus. Proceedings of the 24th ACM Symposium on Principles of Programming Languages. 1997. 256–265. cs/9809008. 1998cs........9008P.
Milner. Robin. Robin Milner. Functions as Processes. Mathematical Structures in Computer Science. 119–141. 1992. 2. 2. 10.1017/s0960129500001407. 20.500.11820/159b09c0-1147-4f32-baf0-23bed198f12a. 36446818 . free.
Dam. Mads. On the Decidability of Process Equivalences for the pi-Calculus. Theoretical Computer Science. 2. 215–228. 1997. 183. 10.1016/S0304-3975(96)00325-8.
Milner. R.. J. Parrow . D. Walker. A calculus of mobile processes. Information and Computation. 1. 1–40. 1992. 10.1016/0890-5401(92)90008-4. 100. 20.500.11820/cdd6d766-14a5-4c3e-8956-a9792bb2c6d3. free.
Sangiorgi. D.. A theory of bisimulation for the π-calculus. Acta Informatica. 33. 69–97. 1996. 10.1007/s002360050036. 18155730 .
http://www.bpmi.org/downloads/BPML-BPEL4WS.pdf "BPML | BPEL4WS: A Convergence Path toward a Standard BPM Stack."
Aviv. Regev. Aviv Regev. William Silverman . Ehud Y. Shapiro. 2001. Representation and Simulation of Biochemical Processes Using the pi-Calculus Process Algebra. Pacific Symposium on Biocomputing. 459–470. 10.1142/9789814447362_0045 . 11262964 . 978-981-02-4515-3 .
Davide. Chiarugi. Pierpaolo Degano . Roberto Marangoni. 2007. A computational approach to the functional screening of genomes. PLOS Computational Biology. 3. 9. 1801–1806. 1994977. 10.1371/journal.pcbi.0030174 . free. 17907794. 2007PLSCB...3..174C .
Nash, A.. Kalvala, S.. A Framework Proposition for Cellular Locality of Dictyostelium Modelled in π-Calculus. CoSMoS 2009. 2009 .
Engberg, U.. Nielsen, M.. 1986. A Calculus of Communicating Systems with Label Passing. DAIMI Report Series. 15. 208. 10.7146/dpb.v15i208.7559. free.
Robin Milner. 1993. Elements of interaction: Turing award lecture. Commun. ACM . 36. 1. 78–89. 10.1145/151233.151240. free.