Linda | |
Paradigm: | Multi-paradigm[1] |
Family: | Coordination language |
Designer: | David Gelernter |
Developer: | Scientific Computing Associates |
File Formats: | --> |
Influenced: | Java[2] |
In computer science, Linda is a coordination model that aids communication in parallel computing environments. Developed by David Gelernter, it is meant to be used alongside a full-fledged computation language like Fortran or C where Linda's role is to "create computational activities and to support communication among them".[3] [4] [5]
David Gelernter wrote the first version of Linda as a Ph.D. candidate in 1979, naming it after Linda Lovelace, who appeared in the pornographic film Deep Throat. At the time, the main language for parallel processing was Ada, developed by the U.S. Department of Defense and a tribute to Ada Lovelace, which Gelernter considered an "inelegant and bulky" language.[6]
It was widely released in 1986, when Gelernter, along with his Yale colleague Nicholas Carriero and Sudhir Ahuja at AT&T Bell Laboratories, published "Linda and Friends" in an IEEE journal.
By the early 1990s, Linda was widely used by corporations to more efficiently conduct big data analyses, including Wall Street brokerages as well as AT&T, Boeing, and United Technologies. There were even companies that specialized in creating specialized parallel computing applications based on Linda, the largest of which was Scientific Computing Associates, a New Haven-based company founded by several Yale computer scientists (Gelernter occasionally consulted for them but did not work there).[6] Interest in Linda dropped in the mid-1990s, only to make a comeback in the late 1990s with several corporations implementing Linda in Java, including Sun Microsystems and IBM.[3]
The Linda model provides a distributed shared memory, known as a tuple space because its basic addressable unit is a tuple, an ordered sequence of typed data objects; specifically in Linda, a tuple is a sequence of up to 16 typed fields enclosed in parentheses". The tuple space is "logically shared by processes" which are referred to as workers that store and retrieve tuples.[7]
One of Linda's main advantages is that it's simple, with only six operations that workers perform on the tuples to access tuplespace:[8]
out
: Puts a tuple into the tuplespacein
: Takes out a tuple that matches a given pattern from the tuplespace (if there's no match, the operation is blocked)rd
: Copies a tuple that matches a given pattern from the tuplespace (if there's no match, the operation is blocked)eval
: Creates a new process to evaluate tuplesinp
: A non-blocking version of in
(if there's no match, an error message is returned)rdp
: A non-blocking version of rd
(if there's no match, an error message is returned)Compared to other parallel-processing models, Linda is more orthogonal in treating process coordination as a separate activity from computation, and it is more general in being able to subsume various levels of concurrency - uniprocessor, multi-threaded multiprocessor, or networked - under a single model. Its orthogonality allows processes computing in different languages and platforms to interoperate using the same primitives. Its generality allows a multi-threaded Linda system to be distributed across multiple computers without change.
Whereas message-passing models require tightly coupled processes sending messages to each other in some sequence or protocol, Linda processes are decoupled from other processes, communicating only through the tuplespace; a process need have no notion of other processes except for the kinds of tuples consumed or produced. Criticisms of Linda from the multiprocessing community tend to focus on the decreased speed of operations in Linda systems as compared to Message Passing Interface (MPI) systems. While not without justification, these claims were largely refuted for an important class of problems.[9] Detailed criticisms of the Linda model can also be found in Steven Ericsson-Zenith's book Process Interaction Models.[10]
Researchers have proposed more primitives to support different types of communication and co-ordination between (open distributed) computer systems, and to solve particular problems arising from various uses of the model. Researchers have also experimented with various means of implementing the virtual shared memory for this model. Many of these researchers proposed larger modifications to the original Linda model, developing a family of systems known as Linda-like systems and implemented as orthogonal technology (unlike original version). An example of this is the language Ease designed by Steven Ericsson-Zenith.
Linda's approach has also been compared to that of flow-based programming.[11]
The Linda-calculus is a formalisation of the above model with the difference that in the following
outeval
We abstract the concrete representation of tuples.We just assume that we have a set of tuples
t\inl{T}
\sigma
t=(hello,x)
\sigma=\{world/x\}
t
(hello,world)
The Linda-calculus processes are defined by the following grammar.
P,Q::=t | outeval(P).Q | rd(t).P | in(t).P | P+Q | recX.P | X
The syntax includes the aftermentioned Linda operations, non-deterministic choice, and recursion. The substitution function is extended to processes recursively.
A tuple space is represented as a multiset of the processes. We write
M|P
M\uplus\{P\}
M
\{P\}
\uplus
M → M'
\begin{array}{lrcll} (outeval)&M|outeval(P).Q& → &M|P|Q&\\ (read)&M|rd(t).P|t'& → &M|P\sigma|t'&ifexists\sigmasuchthatt\sigma=t'\\ (input)&M|in(t).P|t'& → &M|P\sigma&ifexists\sigmasuchthatt\sigma=t'\\ (leftchoice)&M|P+Q& → &M|P&\\ (rightchoice)&M|P+Q& → &M|Q&\\ (recursion)&M|recX.P& → &M|P\{recX.P/X\}&\\ \end{array}
Note that (input) consumes the tuple
t'
Linda was originally implemented in C and Fortran, but has since been implemented in many programming languages, including:
Some of the more notable Linda implementations include: