Pregroup grammar explained

Pregroup grammar (PG) is a grammar formalism intimately related to categorial grammars. Much like categorial grammar (CG), PG is a kind of type logical grammar. Unlike CG, however, PG does not have a distinguished function type. Rather, PG uses inverse types combined with its monoidal operation.

Definition of a pregroup

(A,1,,-l,-r,\leq)

such that

(A,1,)

is a monoid, satisfying the following relations:

xlx\leq1    xxr\leq1

    (contraction)

1\leqxxl    1\leqxrx

    (expansion)

The contraction and expansion relations are sometimes called Ajdukiewicz laws.

From this, it can be proven that the following equations hold:

1l=1=1r

xlr=x=xrl

(xy)l=ylxl    (xy)r=yrxr

xl

and

xr

are called the left and right adjoints of x, respectively.

The symbols

and

\leq

are also written

and

\to

respectively. In category theory, pregroups are also known as autonomous categories[1] or (non-symmetric) compact closed categories.[2] More typically,

xy

will just be represented by adjacency, i.e. as

xy

.

Definition of a pregroup grammar

A pregroup grammar consists of a lexicon of words (and possibly morphemes) L, a set of atomic types T which freely generates a pregroup, and a relation

:

that relates words to types. In simple pregroup grammars, typing is a function that maps words to only one type each.

Examples

Some simple, intuitive examples using English as the language to model demonstrate the core principles behind pregroups and their use in linguistic domains.

Let L =, let T =, and let the following typing relation hold:

it{John}:N    it{Mary}:N    it{the}:N

l
N
0

   it{dog}:N0    it{cat}:N0

it{met}:NrSNl    it{barked}:NrS    it{at}:SrNrrNrSNl

A sentence S that has type T is said to be grammatical if

T\leqS

. We can prove this by use of a chain of

\leq

. For example, we can prove that

it{John} it{met} it{Mary}:NNrSNlN

is grammatical by proving that

NNrSNlN\leqS

:

NNrSNlN~\leq~SNlN~\leq~S

by first using contraction on

NNr

and then again on

NlN

. A more convenient notation exists, however, that indicates contractions by connecting them with a drawn link between the contracting types (provided that the links are nested, i.e. don't cross). Words are also typically placed above their types to make the proof more intuitive. The same proof in this notation is simply

A more complex example proves that the dog barked at the cat is grammatical:

Historical notes

Pregroup grammars were introduced by Joachim Lambek in 1993 as a development of his syntactic calculus, replacing the quotients by adjoints.[3] Such adjoints had already been used earlier by Harris but without iterated adjoints and expansion rules.Adding such adjoints was interesting to handle more complex linguistic cases, where the fact that

alla

is needed. It was also motivated by a more algebraic viewpoint: the definition of a pregroup is a weakening of that of a group, introducing a distinction between the left and right inverses and replacing the equality by an order. This weakening was needed because using types from a free group would not work: an adjective would get the type

NN-1=1

, hence it could be inserted at any position in the sentence.[4]

Pregroup grammars have then been defined and studied for various languages (or fragments of them) including English,[5] Italian,[6] French,[7] Persian[8] and Sanskrit.[9] Languages with a relatively free word order such as Sanskrit required to introduce commutation relations to the pregroup, using precyclicity.

Semantics of pregroup grammars

Because of the lack of function types in PG, the usual method of giving a semantics via the λ-calculus or via function denotations is not available in any obvious way. Instead, two different methods exist, one purely formal method that corresponds to the λ-calculus, and one denotational method analogous to (a fragment of) the tensor mathematics of quantum mechanics.

Purely formal semantics

The purely formal semantics for PG consists of a logical language defined according to the following rules:

mn

is a term

Some examples of terms are f(x), g(a,h(x,y)),

g(x,b)[x]

. A variable x is free in a term t if [''x''] does not appear in t, and a term with no free variables is a closed term. Terms can be typed with pregroup types in the obvious manner.

The usual conventions regarding α conversion apply.

For a given language, we give an assignment I that maps typed words to typed closed terms in a way that respects the pregroup structure of the types. For the English fragment given above we might therefore have the following assignment (with the obvious, implicit set of atomic terms and function symbols):

\begin{align} I(it{John}:N)&=j:E \\ I(it{Mary}:N)&=m:E \\ I(the:N

l)
N
0

&=\iota(p)[p]:E

l \\ I(dog
E
0

:N0)&=dog:E0 \\ I(cat:N0)&=cat:E0 \\ I(met:NrSNl)&=[x]met(x,y)[y]:ErTEl \\ I(barked:NrS)&=[x]barked(x):ErT \\ I(at:SrNrrNrSNl)&=[x]y[y]at(x,z)[z]:TrErrErTEl \end{align}

where E is the type of entities in the domain, and T is the type of truth values.

Together with this core definition of the semantics of PG, we also have a reduction rules that are employed in parallel with the type reductions. Placing the syntactic types at the top and semantics below, we have

For example, applying this to the types and semantics for the sentence

it{John} it{met} it{Mary}:N(NrSNl)N

(emphasizing the link being reduced)

For the sentence

l)
it{the} it{dog} it{barked} it{at} it{the} it{cat}:(NN
0

N0(NrS)(SrNrrNrSNl)(N

l)
N
0

N0

:

See also

References

Notes and References

  1. Springer. 813. 289–233. Selinger. Peter. A survey of graphical languages for monoidal categories. New Structures for Physics. Lecture Notes in Physics. 2011. 0908.3347. 2009arXiv0908.3347S.
  2. 20. 4. 419–443. Preller. Anne. Mehrnoosh Sadrzadeh. Semantic Vector Models and Functional Models for Pregroup Grammars. Journal of Logic, Language and Information. 2011. 10.1007/s10849-011-9132-2. 207175357.
  3. Springer. 1582. 1–27. Alain Lecomte. Lambek. Joachim. Joachim Lambek. Type Grammar revisited. Logical Aspects of Computational Linguistics. Heidelberg. LNAI. 1999.
  4. 17. 2. 141–160. Lambek. Joachim. Joachim Lambek. Pregroup Grammars and Chomsky's Earliest Examples. Journal of Logic, Language and Information. 2008. 10.1007/s10849-007-9053-2. 30256603.
  5. Lambek 2008
  6. Book: Casadio , Claudia . Springer. 3540422730. 110–124. Joachim Lambek . Logical Aspects of Computational Linguistics. An algebraic analysis of clitic pronouns in Italian. 2001.
  7. 53–84. Preller. Anne. Violaine Prince. Pregroup grammars with linear parsing of the French verb phrase. CL2008. 2008. etal.
  8. 121–144. Sadrzadeh. Mehrnoosh. Pregroup analysis of Persian sentences. Computational Algebraic Approaches to Natural Language, Polimetrica, Milano, Italy. 2008. 10.1.1.163.5505.
  9. Book: Casadio , Claudia . Springer International Publishing. 978-3-319-06879-4. 8464. 229–249. Franck van Breugel. Elham Kashefi . Elham Kashefi. Catuscia Palamidessi . Catuscia Palamidessi. Jan Rutten. Mehrnoosh Sadrzadeh . Horizons of the Mind. A Tribute to Prakash Panangaden. Word Order Alternation in Sanskrit via Precyclicity in Pregroup Grammars. Lecture Notes in Computer Science. 2014.