Extension by definitions explained

In mathematical logic, more specifically in the proof theory of first-order theories, extensions by definitions formalize the introduction of new symbols by means of a definition. For example, it is common in naive set theory to introduce a symbol

\emptyset

for the set that has no member. In the formal setting of first-order theories, this can be done by adding to the theory a new constant

\emptyset

and the new axiom

\forallx(x\notin\emptyset)

, meaning "for all x, x is not a member of

\emptyset

". It can then be proved that doing so adds essentially nothing to the old theory, as should be expected from a definition. More precisely, the new theory is a conservative extension of the old one.

Definition of relation symbols

Let

be a first-order theory and

\phi(x_1,...,x_n)

a formula of

such that

x₁

, ...,

x_n

are distinct and include the variables free in

\phi(x_1,...,x_n)

. Form a new first-order theory

from

by adding a new

-ary relation symbol

, the logical axioms featuring the symbol

and the new axiom

\forallx_1...\forallx_n(R(x_1,...,x_{n)\leftrightarrow\phi(x}_1,...,x_n))

,called the defining axiom of

\psi

is a formula of

, let

\psi^\ast

be the formula of

obtained from

\psi

by replacing any occurrence of

R(t_1,...,t_n)

\phi(t_1,...,t_n)

(changing the bound variables in

\phi

if necessary so that the variables occurring in the

t_i

are not bound in

\phi(t_1,...,t_n)

). Then the following hold:

\psi\leftrightarrow\psi^\ast

is provable in

, and

is a conservative extension of

The fact that

is a conservative extension of

shows that the defining axiom of

cannot be used to prove new theorems. The formula

\psi^\ast

is called a translation of

\psi

into

. Semantically, the formula

\psi^\ast

has the same meaning as

\psi

, but the defined symbol

has been eliminated.

Definition of function symbols

Let

be a first-order theory (with equality) and

\phi(y,x_1,...,x_n)

a formula of

such that

x₁

, ...,

x_n

are distinct and include the variables free in

\phi(y,x_1,...,x_n)

. Assume that we can prove

\forallx_1...\forallx_n\exists!y\phi(y,x_1,...,x_n)

, i.e. for all

x₁

, ...,

x_n

, there exists a unique y such that

\phi(y,x_1,...,x_n)

. Form a new first-order theory

from

by adding a new

-ary function symbol

, the logical axioms featuring the symbol

and the new axiom

\forallx_1...\forallx_n\phi(f(x_1,...,x_n),x_1,...,x_n)

,called the defining axiom of

Let

\psi

be any atomic formula of

. We define formula

\psi^\ast

recursively as follows. If the new symbol

does not occur in

\psi

, let

\psi^\ast

\psi

. Otherwise, choose an occurrence of

f(t_1,...,t_n)

\psi

such that

does not occur in the terms

t_i

, and let

\chi

be obtained from

\psi

by replacing that occurrence by a new variable

. Then since

occurs in

\chi

one less time than in

\psi

, the formula

\chi^\ast

has already been defined, and we let

\psi^\ast

\forallz(\phi(z,t_1,...,t

	\ast)

	n) → \chi

(changing the bound variables in

\phi

if necessary so that the variables occurring in the

t_i

are not bound in

\phi(z,t_1,...,t_n)

). For a general formula

\psi

, the formula

\psi^\ast

is formed by replacing every occurrence of an atomic subformula

\chi

\chi^\ast

. Then the following hold:

\psi\leftrightarrow\psi^\ast

is provable in

, and

is a conservative extension of

The formula

\psi^\ast

is called a translation of

\psi

into

. As in the case of relation symbols, the formula

\psi^\ast

has the same meaning as

\psi

, but the new symbol

has been eliminated.

The construction of this paragraph also works for constants, which can be viewed as 0-ary function symbols.

Extensions by definitions

A first-order theory

obtained from

by successive introductions of relation symbols and function symbols as above is called an extension by definitions of

. Then

is a conservative extension of

, and for any formula

\psi

we can form a formula

\psi^\ast

, called a translation of

\psi

into

, such that

\psi\leftrightarrow\psi^\ast

is provable in

. Such a formula is not unique, but any two of them can be proved to be equivalent in T.

In practice, an extension by definitions

of T is not distinguished from the original theory T. In fact, the formulas of

can be thought of as abbreviating their translations into T. The manipulation of these abbreviations as actual formulas is then justified by the fact that extensions by definitions are conservative.

Examples

Traditionally, the first-order set theory ZF has

(equality) and

\in

(membership) as its only primitive relation symbols, and no function symbols. In everyday mathematics, however, many other symbols are used such as the binary relation symbol

\subseteq

, the constant

\emptyset

, the unary function symbol P (the power set operation), etc. All of these symbols belong in fact to extensions by definitions of ZF.

be a first-order theory for groups in which the only primitive symbol is the binary product ×. In T, we can prove that there exists a unique element y such that x×y = y×x = x for every x. Therefore we can add to T a new constant e and the axiom

\forallx(x x e=xande x x=x)

and what we obtain is an extension by definitions

. Then in

we can prove that for every x, there exists a unique y such that x×y=y×x=e. Consequently, the first-order theory

T''

obtained from

by adding a unary function symbol

and the axiom

\forallx(x x f(x)=eandf(x) x x=e)

is an extension by definitions of

. Usually,

f(x)

is denoted

x^-1

Bibliography

S. C. Kleene (1952), Introduction to Metamathematics, D. Van Nostrand
E. Mendelson (1997). Introduction to Mathematical Logic (4th ed.), Chapman & Hall.
J. R. Shoenfield (1967). Mathematical Logic, Addison-Wesley Publishing Company (reprinted in 2001 by AK Peters)

Extension by definitions explained

Definition of relation symbols

Definition of function symbols

Extensions by definitions

Examples

See also

Bibliography