Admissible rule explained

In logic, a rule of inference is admissible in a formal system if the set of theorems of the system does not change when that rule is added to the existing rules of the system. In other words, every formula that can be derived using that rule is already derivable without that rule, so, in a sense, it is redundant. The concept of an admissible rule was introduced by Paul Lorenzen (1955).

Definitions

Admissibility has been systematically studied only in the case of structural (i.e. substitution-closed) rules in propositional non-classical logics, which we will describe next.

Let a set of basic propositional connectives be fixed (for instance,

\{\to,\land,\lor,\bot\}

in the case of superintuitionistic logics, or

\{\to,\bot,\Box\}

in the case of monomodal logics). Well-formed formulas are built freely using these connectives from a countably infinite set of propositional variables p0, p1, .... A substitution σ is a function from formulas to formulas that commutes with applications of the connectives, i.e.,

\sigmaf(A1,...,An)=f(\sigmaA1,...,\sigmaAn)

for every connective f, and formulas A1, ..., An. (We may also apply substitutions to sets Γ of formulas, making) A Tarski-style consequence relation[1] is a relation

\vdash

between sets of formulas, and formulas, such that for all formulas A, B, and sets of formulas Γ, Δ. A consequence relation such thatfor all substitutions σ is called structural. (Note that the term "structural" as used here and below is unrelated to the notion of structural rules in sequent calculi.) A structural consequence relation is called a propositional logic. A formula A is a theorem of a logic

\vdash

if

\varnothing\vdashA

.

For example, we identify a superintuitionistic logic L with its standard consequence relation

\vdashL

generated by modus ponens and axioms, and we identify a normal modal logic with its global consequence relation

\vdashL

generated by modus ponens, necessitation, and (as axioms) the theorems of the logic.

A structural inference rule[2] (or just rule for short) is given by a pair (Γ, B), usually written as

A1,...,An
B    or   

A1,...,An/B,

where Γ =  is a finite set of formulas, and B is a formula. An instance of the rule is

\sigmaA1,...,\sigmaAn/\sigmaB

for a substitution σ. The rule Γ/B is derivable in

\vdash

, if

\Gamma\vdashB

. It is admissible if for every instance of the rule, σB is a theorem whenever all formulas from σΓ are theorems.[3] In other words, a rule is admissible if it, when added to the logic, does not lead to new theorems.[4] We also write

\Gammal{|\sim}B

if Γ/B is admissible. (Note that

\phantom{.}{|\sim}

is a structural consequence relation on its own.)

Every derivable rule is admissible, but not vice versa in general. A logic is structurally complete if every admissible rule is derivable, i.e.,

{\vdash}={|\sim}

.[5]

In logics with a well-behaved conjunction connective (such as superintuitionistic or modal logics), a rule

A1,...,An/B

is equivalent to

A1\land...\landAn/B

with respect to admissibility and derivability. It is therefore customary to only deal with unary rules A/B.

Examples

\top

if v(p) = 1, and σp =

\bot

if v(p) = 0. Then σA is a theorem, but σB is not (in fact, ¬σB is a theorem). Thus the rule A/B is not admissible either. (The same argument applies to any multi-valued logic L complete with respect to a logical matrix all of whose elements have a name in the language of L.)
(KPR)    \negp\toq\lorr
(\negp\toq)\lor(\negp\tor)

is admissible in the intuitionistic propositional calculus (IPC). In fact, it is admissible in every superintuitionistic logic.[7] On the other hand, the formula

(\negp\toq\lorr)\to((\negp\toq)\lor(\negp\tor))

is not an intuitionistic theorem; hence KPR is not derivable in IPC. In particular, IPC is not structurally complete.

\Boxp
p

is admissible in many modal logics, such as K, D, K4, S4, GL (see this table for names of modal logics). It is derivable in S4, but it is not derivable in K, D, K4, or GL.

\Diamondp\land\Diamond\negp
\bot

is admissible in normal logic

L\supseteqS4.3

.[8] It is derivable in GL and S4.1, but it is not derivable in K, D, K4, S4, or S5.
(LR)    \Boxp\top
p

is admissible (but not derivable) in the basic modal logic K, and it is derivable in GL. However, LR is not admissible in K4. In particular, it is not true in general that a rule admissible in a logic L must be admissible in its extensions.

Decidability and reduced rules

The basic question about admissible rules of a given logic is whether the set of all admissible rules is decidable. Note that the problem is nontrivial even if the logic itself (i.e., its set of theorems) is decidable: the definition of admissibility of a rule A/B involves an unbounded universal quantifier over all propositional substitutions. Hence a priori we only know that admissibility of rule in a decidable logic is

0
\Pi
1
(i.e., its complement is recursively enumerable). For instance, it is known that admissibility in the bimodal logics Ku and K4u (the extensions of K or K4 with the universal modality) is undecidable.[11] Remarkably, decidability of admissibility in the basic modal logic K is a major open problem.

Nevertheless, admissibility of rules is known to be decidable in many modal and superintuitionistic logics. The first decision procedures for admissible rules in basic transitive modal logics were constructed by Rybakov, using the reduced form of rules.[12] A modal rule in variables p0, ..., pk is called reduced if it has the form

0p
vee
1\Box
i,j
pjr)
j\landwedge
p0

,

where each
u
\neg
i,j
is either blank, or negation

\neg

. For each rule r, we can effectively construct a reduced rule s (called the reduced form of r) such that any logic admits (or derives) r if and only if it admits (or derives) s, by introducing extension variables for all subformulas in A, and expressing the result in the full disjunctive normal form. It is thus sufficient to construct a decision algorithm for admissibility of reduced rules.

Let

n\varphi
stylevee
i/p

0

be a reduced rule as above. We identify every conjunction

\varphii

with the set
0p
\{\neg
j,\neg
1\Box
i,j

pj\midj\lek\}

of its conjuncts. For any subset W of the set

\{\varphii\midi\len\}

of all conjunctions, let us define a Kripke model

M=\langleW,R,{\Vdash}\rangle

by

\varphii\Vdashpj\iffpj\in\varphii,

\varphiiR\varphii'\iff\forallj\lek(\Boxpj\in\varphii\{pj,\Boxpj\}\subseteq\varphii').

Then the following provides an algorithmic criterion for admissibility in K4:[13]

Theorem. The rule

n\varphi
stylevee
i/p

0

is not admissible in K4 if and only if there exists a set

W\subseteq\{\varphii\midi\len\}

such that

\varphii\nVdashp0

for some

i\len,

\varphii\Vdash\varphii

for every

i\len,

  1. for every subset D of W there exist elements

\alpha,\beta\inW

such that the equivalences

\alpha\Vdash\Boxpj

if and only if

\varphi\Vdashpj\land\Boxpj

for every

\varphi\inD

\alpha\Vdash\Boxpj

if and only if

\alpha\Vdashpj

and

\varphi\Vdashpj\land\Boxpj

for every

\varphi\inD

hold for all j.

Similar criteria can be found for the logics S4, GL, and Grz.[14] Furthermore, admissibility in intuitionistic logic can be reduced to admissibility in Grz using the Gödel–McKinsey–Tarski translation:[15]

A|\simIPCB

if and only if

T(A)|\simGrzT(B).

Rybakov (1997) developed much more sophisticated techniques for showing decidability of admissibility, which apply to a robust (infinite) class of transitive (i.e., extending K4 or IPC) modal and superintuitionistic logics, including e.g. S4.1, S4.2, S4.3, KC, Tk (as well as the above-mentioned logics IPC, K4, S4, GL, Grz).[16]

Despite being decidable, the admissibility problem has relatively high computational complexity, even in simple logics: admissibility of rules in the basic transitive logics IPC, K4, S4, GL, Grz is coNEXP-complete.[17] This should be contrasted with the derivability problem (for rules or formulas) in these logics, which is PSPACE-complete.[18]

Projectivity and unification

Admissibility in propositional logics is closely related to unification in the equational theory of modal or Heyting algebras. The connection was developed by Ghilardi (1999, 2000). In the logical setup, a unifier of a formula A in the language of a logic L (an L-unifier for short) is a substitution σ such that σA is a theorem of L. (Using this notion, we can rephrase admissibility of a rule A/B in L as "every L-unifier of A is an L-unifier of B".) An L-unifier σ is less general than an L-unifier τ, written as στ, if there exists a substitution υ such that

\vdashL\sigmap\leftrightarrow\upsilon\taup

for every variable p. A complete set of unifiers of a formula A is a set S of L-unifiers of A such that every L-unifier of A is less general than some unifier from S. A most general unifier (MGU) of A is a unifier σ such that is a complete set of unifiers of A. It follows that if S is a complete set of unifiers of A, then a rule A/B is L-admissible if and only if every σ in S is an L-unifier of B. Thus we can characterize admissible rules if we can find well-behaved complete sets of unifiers.

An important class of formulas that have a most general unifier are the projective formulas: these are formulas A such that there exists a unifier σ of A such that

A\vdashLB\leftrightarrow\sigmaB

for every formula B. Note that σ is an MGU of A. In transitive modal and superintuitionistic logics with the finite model property, one can characterize projective formulas semantically as those whose set of finite L-models has the extension property:[19] if M is a finite Kripke L-model with a root r whose cluster is a singleton, and the formula A holds at all points of M except for r, then we can change the valuation of variables in r so as to make A true at r as well. Moreover, the proof provides an explicit construction of an MGU for a given projective formula A.

In the basic transitive logics IPC, K4, S4, GL, Grz (and more generally in any transitive logic with the finite model property whose set of finite frame satisfies another kind of extension property), we can effectively construct for any formula A its projective approximation Π(A):[20] a finite set of projective formulas such that

P\vdashLA

for every

P\in\Pi(A),

  1. every unifier of A is a unifier of a formula from Π(A).

It follows that the set of MGUs of elements of Π(A) is a complete set of unifiers of A. Furthermore, if P is a projective formula, then

P|\simLB

if and only if

P\vdashLB

for any formula B. Thus we obtain the following effective characterization of admissible rules:[21]

A|\simLB

if and only if

\forallP\in\Pi(A)(P\vdashLB).

Bases of admissible rules

Let L be a logic. A set R of L-admissible rules is called a basis[22] of admissible rules, if every admissible rule Γ/B can be derived from R and the derivable rules of L, using substitution, composition, and weakening. In other words, R is a basis if and only if

\phantom{.}{|\simL}

is the smallest structural consequence relation that includes

\vdashL

and R.

Notice that decidability of admissible rules of a decidable logic is equivalent to the existence of recursive (or recursively enumerable) bases: on the one hand, the set of all admissible rules is a recursive basis if admissibility is decidable. On the other hand, the set of admissible rules is always co-recursively enumerable, and if we further have a recursively enumerable basis, set of admissible rules is also recursively enumerable; hence it is decidable. (In other words, we can decide admissibility of A/B by the following algorithm: we start in parallel two exhaustive searches, one for a substitution σ that unifies A but not B, and one for a derivation of A/B from R and

\vdashL

. One of the searches has to eventually come up with an answer.) Apart from decidability, explicit bases of admissible rules are useful for some applications, e.g. in proof complexity.[23]

For a given logic, we can ask whether it has a recursive or finite basis of admissible rules, and to provide an explicit basis. If a logic has no finite basis, it can nevertheless have an independent basis: a basis R such that no proper subset of R is a basis.

In general, very little can be said about existence of bases with desirable properties. For example, while tabular logics are generally well-behaved, and always finitely axiomatizable, there exist tabular modal logics without a finite or independent basis of rules.[24] Finite bases are relatively rare: even the basic transitive logics IPC, K4, S4, GL, Grz do not have a finite basis of admissible rules,[25] though they have independent bases.[26]

Examples of bases

\Diamondp\land\Diamond\negp
\bot.
n(p
\displaystylel(wedgeqi)\topn+1\lorpn+2r)\lorr
i\to
n+2
\displaystylevee
n
l(wedge
i=1
(pi\toqi)\topjr)\lorr
j=1

,    n\ge1

are a basis of admissible rules in IPC or KC.[28]

\displaystyle\Boxl(\Box
n\Box
q\tovee
i=1
pir)\lor\Boxr
n\Box(q\land\Box
\displaystyleveeq\topi)\lorr
i=1

,    n\ge0

are a basis of admissible rules of GL.[29] (Note that the empty disjunction is defined as

\bot

.)
\displaystyle\Boxl(\Box(q\to\Box
n\Box
q)\tovee
i=1
pir)\lor\Boxr
n\Box(\Box
\displaystyleveeq\topi)\lorr
i=1

,    n\ge0

are a basis of admissible rules of S4 or Grz.[30]

Semantics for admissible rules

F=\langleW,R\rangle

, if the following is true for every valuation

\Vdash

in F:

if for all

A\in\Gamma

\forallx\inW(x\VdashA)

, then

\forallx\inW(x\VdashB)

.(The definition readily generalizes to general frames, if needed.)

Let X be a subset of W, and t a point in W. We say that t is

We say that a frame F has reflexive (irreflexive) tight predecessors, if for every finite subset X of W, there exists a reflexive (irreflexive) tight predecessor of X in W.

We have:[31]

Note that apart from a few trivial cases, frames with tight predecessors must be infinite. Hence admissible rules in basic transitive logics do not enjoy the finite model property.

Structural completeness

While a general classification of structurally complete logics is not an easy task, we have a good understanding of some special cases.

Intuitionistic logic itself is not structurally complete, but its fragments may behave differently. Namely, any disjunction-free rule or implication-free rule admissible in a superintuitionistic logic is derivable.[32] On the other hand, the Mints rule

(p\toq)\top\lorr
((p\toq)\top)\lor((p\toq)\tor)
is admissible in intuitionistic logic but not derivable, and contains only implications and disjunctions.

We know the maximal structurally incomplete transitive logics. A logic is called hereditarily structurally complete, if any extension is structurally complete. For example, classical logic, as well as the logics LC and Grz.3 mentioned above, are hereditarily structurally complete. A complete description of hereditarily structurally complete superintuitionistic and transitive modal logics was given respectively by Citkin and Rybakov. Namely, a superintuitionistic logic is hereditarily structurally complete if and only if it is not valid in any of the five Kripke frames[9]

Similarly, an extension of K4 is hereditarily structurally complete if and only if it is not valid in any of certain twenty Kripke frames (including the five intuitionistic frames above).[9]

There exist structurally complete logics that are not hereditarily structurally complete: for example, Medvedev's logic is structurally complete,[33] but it is included in the structurally incomplete logic KC.

Variants

A rule with parameters is a rule of the form

A(p1,...,pn,s1,...,sk)
B(p1,...,pn,s1,...,sk)

,

whose variables are divided into the "regular" variables pi, and the parameters si. The rule is L-admissible if every L-unifier σ of A such that σsi = si for each i is also a unifier of B. The basic decidability results for admissible rules also carry to rules with parameters.[34]

A multiple-conclusion rule is a pair (Γ,Δ) of two finite sets of formulas, written as

A1,...,An
B1,...,Bm

   or    A1,...,An/B1,...,Bm.

Such a rule is admissible if every unifier of Γ is also a unifier of some formula from Δ.[35] For example, a logic L is consistent iff it admits the rule
\bot

,

and a superintuitionistic logic has the disjunction property iff it admits the rule
p\lorq
p,q

.

Again, basic results on admissible rules generalize smoothly to multiple-conclusion rules.[36] In logics with a variant of the disjunction property, the multiple-conclusion rules have the same expressive power as single-conclusion rules: for example, in S4 the rule above is equivalent to
A1,...,An
\BoxB1\lor...\lor\BoxBm

.

Nevertheless, multiple-conclusion rules can often be employed to simplify arguments.

In proof theory, admissibility is often considered in the context of sequent calculi, where the basic objects are sequents rather than formulas. For example, one can rephrase the cut-elimination theorem as saying that the cut-free sequent calculus admits the cut rule

\Gamma\vdashA,\Delta    \Pi,A\vdashΛ
\Gamma,\Pi\vdash\Delta,Λ

.

(By abuse of language, it is also sometimes said that the (full) sequent calculus admits cut, meaning its cut-free version does.) However, admissibility in sequent calculi is usually only a notational variant for admissibility in the corresponding logic: any complete calculus for (say) intuitionistic logic admits a sequent rule if and only if IPC admits the formula rule that we obtain by translating each sequent

\Gamma\vdash\Delta

to its characteristic formula

wedge\Gamma\tovee\Delta

.

References

Notes and References

  1. Blok & Pigozzi (1989), Kracht (2007)
  2. Rybakov (1997), Def. 1.1.3
  3. Rybakov (1997), Def. 1.7.2
  4. http://www.illc.uva.nl/D65/artemov.pdf From de Jongh’s theorem to intuitionistic logic of proofs
  5. Rybakov (1997), Def. 1.7.7
  6. Chagrov & Zakharyaschev (1997), Thm. 1.25
  7. Prucnal (1979), cf. Iemhoff (2006)
  8. Rybakov (1997), p. 439
  9. Rybakov (1997), Thms. 5.4.4, 5.4.8
  10. Cintula & Metcalfe (2009)
  11. Wolter & Zakharyaschev (2008)
  12. Rybakov (1997), §3.9
  13. Rybakov (1997), Thm. 3.9.3
  14. Rybakov (1997), Thms. 3.9.6, 3.9.9, 3.9.12; cf. Chagrov & Zakharyaschev (1997), §16.7
  15. Rybakov (1997), Thm. 3.2.2
  16. Rybakov (1997), §3.5
  17. Jeřábek (2007)
  18. Chagrov & Zakharyaschev (1997), §18.5
  19. Ghilardi (2000), Thm. 2.2
  20. Ghilardi (2000), p. 196
  21. Ghilardi (2000), Thm. 3.6
  22. Rybakov (1997), Def. 1.4.13
  23. Mints & Kojevnikov (2004)
  24. Rybakov (1997), Thm. 4.5.5
  25. Rybakov (1997), §4.2
  26. Jeřábek (2008)
  27. Rybakov (1997), Cor. 4.3.20
  28. Iemhoff (2001, 2005), Rozière (1992)
  29. Jeřábek (2005)
  30. Jeřábek (2005, 2008)
  31. Iemhoff (2001), Jeřábek (2005)
  32. Rybakov (1997), Thms. 5.5.6, 5.5.9
  33. Prucnal (1976)
  34. Rybakov (1997), §6.1
  35. Jeřábek (2005); cf. Kracht (2007), §7
  36. Jeřábek (2005, 2007, 2008)