In mathematical logic, model theory is the study of the relationship between formal theories (a collection of sentences in a formal language expressing statements about a mathematical structure), and their models (those structures in which the statements of the theory hold).[1] The aspects investigated include the number and size of models of a theory, the relationship of different models to each other, and their interaction with the formal language itself. In particular, model theorists also investigate the sets that can be defined in a model of a theory, and the relationship of such definable sets to each other.As a separate discipline, model theory goes back to Alfred Tarski, who first used the term "Theory of Models" in publication in 1954.[2] Since the 1970s, the subject has been shaped decisively by Saharon Shelah's stability theory.
Compared to other areas of mathematical logic such as proof theory, model theory is often less concerned with formal rigour and closer in spirit to classical mathematics.This has prompted the comment that "if proof theory is about the sacred, then model theory is about the profane".[3] The applications of model theory to algebraic and Diophantine geometry reflect this proximity to classical mathematics, as they often involve an integration of algebraic and model-theoretic results and techniques. Consequently, proof theory is syntactic in nature, in contrast to model theory, which is semantic in nature.
The most prominent scholarly organization in the field of model theory is the Association for Symbolic Logic.
This page focuses on finitary first order model theory of infinite structures. The relative emphasis placed on the class of models of a theory as opposed to the class of definable sets within a model fluctuated in the history of the subject, and the two directions are summarised by the pithy characterisations from 1973 and 1997 respectively:
model theory = universal algebra + logic[4]
where universal algebra stands for mathematical structures and logic for logical theories; and
model theory = algebraic geometry − fields.
where logical formulas are to definable sets what equations are to varieties over a field.[5]
Nonetheless, the interplay of classes of models and the sets definable in them has been crucial to the development of model theory throughout its history. For instance, while stability was originally introduced to classify theories by their numbers of models in a given cardinality, stability theory proved crucial to understanding the geometry of definable sets.
See main article: First-order logic.
A first-order formula is built out of atomic formulas such as
R(f(x,y),z)
y=x+1
\neg,\land,\lor, →
\forallv
\existsv
\varphi
\varphi(x)
x
\varphi
\psi
\psi(x)
\begin{array}{lcl}\varphi&=&\forallu\forallv(\existsw(x x w=u x v) → (\existsw(x x w=u)\lor\existsw(x x w=v)))\landx\ne0\landx\ne1, \\\psi&=&\forallu\forallv((u x v=x) → (u=x)\lor(v=x))\landx\ne0\landx\ne1.\end{array}
(Note that the equality symbol has a double meaning here.) It is intuitively clear how to translate such formulas into mathematical meaning. In the σsmr-structure
lN
n
\varphi
n
\psi
\models
lN\models\varphi(n)\iffn
lN\models\psi(n)\iffn
A set
T
lM\modelsT
T
It's a consequence of Gödel's completeness theorem (not to be confused with his incompleteness theorems) that a theory has a model if and only if it is consistent, i.e. no contradiction is proved by the theory.Therefore, model theorists often use "consistent" as a synonym for "satisfiable".
A signature or language is a set of non-logical symbols such that each symbol is either a constant symbol, or a function or relation symbol with a specified arity. Note that in some literature, constant symbols are considered as function symbols with zero arity, and hence are omitted. A structure is a set
M
M
Example: A common signature for ordered rings is
\sigmaor=(0,1,+, x ,-,<)
0
1
+
x
-
<
\Q
+
\Q2
\Q
<
\Q2
(\Q,\sigmaor)
A structure
l{N}
T
T
l{N}
l{N}
T
T
lA
lB
A substructure is said to be elementary if for any first-order formula
\varphi
lA
lA\models\varphi(a1,...,an)
lB\models\varphi(a1,...,an)
\varphi
lA
lB
lA\models\varphi
lB\models\varphi
Example: While the field of algebraic numbers
\overline{Q
C
Q
C
Q
An embedding of a σ-structure
lA
lB
lA
lB
A field or a vector space can be regarded as a (commutative) group by simply ignoring some of its structure. The corresponding notion in model theory is that of a reduct of a structure to a subset of the original signature. The opposite relation is called an expansion - e.g. the (additive) group of the rational numbers, regarded as a structure in the signature can be expanded to a field with the signature or to an ordered group with the signature .
Similarly, if σ' is a signature that extends another signature σ, then a complete σ'-theory can be restricted to σ by intersecting the set of its sentences with the set of σ-formulas. Conversely, a complete σ-theory can be regarded as a σ'-theory, and one can extend it (in more than one way) to a complete σ'-theory. The terms reduct and expansion are sometimes applied to this relation as well.
The compactness theorem states that a set of sentences S is satisfiable if every finite subset of S is satisfiable. The analogous statement with consistent instead of satisfiable is trivial, since every proof can have only a finite number of antecedents used in the proof. The completeness theorem allows us to transfer this to satisfiability. However, there are also several direct (semantic) proofs of the compactness theorem.As a corollary (i.e., its contrapositive), the compactness theorem says that every unsatisfiable first-order theory has a finite unsatisfiable subset. This theorem is of central importance in model theory, where the words "by compactness" are commonplace.
Another cornerstone of first-order model theory is the Löwenheim-Skolem theorem. According to the Löwenheim-Skolem Theorem, every infinite structure in a countable signature has a countable elementary substructure. Conversely, for any infinite cardinal κ every infinite structure in a countable signature that is of cardinality less than κ can be elementarily embedded in another structure of cardinality κ (There is a straightforward generalisation to uncountable signatures). In particular, the Löwenheim-Skolem Theorem implies that any theory in a countable signature with infinite models has a countable model as well as arbitrarily large models.
In a certain sense made precise by Lindström's theorem, first-order logic is the most expressive logic for which both the Löwenheim–Skolem theorem and the compactness theorem hold.[6]
In model theory, definable sets are important objects of study. For instance, in
N
\forallu\forallv(\existsw(x x w=u x v) → (\existsw(x x w=u)\lor\existsw(x x w=v)))\landx\ne0\landx\ne1
\existsy(2 x y=x)
l{M}n
y=x x x
(x,y)
y=x2
Both of the definitions mentioned here are parameter-free, that is, the defining formulas don't mention any fixed domain elements. However, one can also consider definitions with parameters from the model.For instance, in
R
y=x x x+\pi
\pi
R
In general, definable sets without quantifiers are easy to describe, while definable sets involving possibly nested quantifiers can be much more complicated.
This makes quantifier elimination a crucial tool for analysing definable sets: A theory T has quantifier elimination if every first-order formula φ(x1, ..., xn) over its signature is equivalent modulo T to a first-order formula ψ(x1, ..., xn) without quantifiers, i.e.
\forallx1...\forallxn(\phi(x1,...,xn)\leftrightarrow\psi(x1,...,xn))
If a theory does not have quantifier elimination, one can add additional symbols to its signature so that it does. Axiomatisability and quantifier elimination results for specific theories, especially in algebra, were among the early landmark results of model theory.[7] But often instead of quantifier elimination a weaker property suffices:
A theory T is called model-complete if every substructure of a model of T which is itself a model of T is an elementary substructure. There is a useful criterion for testing whether a substructure is an elementary substructure, called the Tarski–Vaught test. It follows from this criterion that a theory T is model-complete if and only if every first-order formula φ(x1, ..., xn) over its signature is equivalent modulo T to an existential first-order formula, i.e. a formula of the following form:
\existsv1...\existsvm\psi(x1,...,xn,v1,...,vm)
In every structure, every finite subset
\{a1,...,an\}
x=a1\vee...\veex=an
This leads to the concept of a minimal structure. A structure
l{M}
A\subseteql{M}
l{M}
On the other hand, the field
R
\varphi(x) = \existsy(y x y=x)
\varphi
R
l{M}
A\subseteql{M}
l{M}
See main article: Interpretation (model theory). Particularly important are those definable sets that are also substructures, i. e. contain all constants and are closed under function application. For instance, one can study the definable subgroups of a certain group.However, there is no need to limit oneself to substructures in the same signature. Since formulas with n free variables define subsets of
l{M}n
a\inl{M}
\varphi(x)
l{M}
\varphi(a)
One can even go one step further, and move beyond immediate substructures.Given a mathematical structure, there are very often associated structures which can be constructed as a quotient of part of the original structure via an equivalence relation. An important example is a quotient group of a group. One might say that to understand the full structure one must understand these quotients. When the equivalence relation is definable, we can give the previous sentence a precise meaning. We say that these structures are interpretable.A key fact is that one can translate sentences from the language of the interpreted structures to the language of the original structure. Thus one can show that if a structure
l{M}
l{M}
See main article: Type (model theory).
For a sequence of elements
a1,...,an
l{M}
l{M}
\varphi(x1,...,xn)
a1,...,an
a1,...,an
l{M}
a1,...,an
b1,...,bn
a1,...,an
b1,...,bn
The real number line
R
a\inR
a1,a2
a1<a2
a1=a2
a2<a1
Z\subseteqR
More generally, whenever
l{M}
l{M}
l{N}
l{M}
l{M | |
S | |
n |
T
l{M}
Sn(T)
T
\varphi
l{M}
\varphi → \psi
\psi
Since the real numbers
R
\{n<x|n\inZ\}
Z\subseteqR
R
A subset of
l{M}n
l{M}n
M
n
A
A[x1,\ldots,xn]
While not every type is realised in every structure, every structure realises its isolated types.If the only types over the empty set that are realised in a structure are the isolated types, then the structure is called atomic.
On the other hand, no structure realises every type over every parameter set; if one takes all of
l{M}
l{M}
l{M}
a\inl{M}
l{M}
l{M}
A\subsetl{M}
l{M}
While an automorphism that is constant on A will always preserve types over A, it is generally not true that any two sequences
a1,...,an
b1,...,bn
l{M}
l{M}
The real number line is atomic in the language that contains only the order
<
a1,...,an
R
a1,...,an
Z
Q
Q
The set of definable subsets of
l{M}n
A
n
A
\{p|\varphi\inp\}
\varphi
While types in algebraically closed fields correspond to the spectrum of the polynomial ring, the topology on the type space is the constructible topology: a set of types is basic open iff it is of the form
\{p:f(x)=0\inp\}
\{p:f(x) ≠ 0\inp\}
Constructing models that realise certain types and do not realise others is an important task in model theory.Not realising a type is referred to as omitting it, and is generally possible by the (Countable) Omitting types theorem:
Let
l{T}
\Phi
Then there is a model
l{M}
l{T}
\Phi
This implies that if a theory in a countable signature has only countably many types over the empty set, then this theory has an atomic model.
On the other hand, there is always an elementary extension in which any set of types over a fixed parameter set is realised:
Let
l{M}
\Phi
A\subsetl{M}.
Then there is an elementary extension
l{N}
l{M}
\Phi
However, since the parameter set is fixed and there is no mention here of the cardinality of
l{N}
Ultraproducts are used as a general technique for constructing models that realise certain types.An ultraproduct is obtained from the direct product of a set of structures over an index set by identifying those tuples that agree on almost all entries, where almost all is made precise by an ultrafilter on . An ultraproduct of copies of the same structure is known as an ultrapower.The key to using ultraproducts in model theory is Łoś's theorem:
Let
l{M}i
\varphi([(ai)i])
l{M}i
U
i\inI
l{M}i\models\varphi(ai)
In particular, any ultraproduct of models of a theory is itself a model of that theory, and thus if two models have isomorphic ultrapowers, they are elementarily equivalent.The Keisler-Shelah theorem provides a converse:
If and are elementary equivalent, then there is a set and an ultrafilter on such that the ultrapowers by of and : are isomorphic.[14]
Therefore, ultraproducts provide a way to talk about elementary equivalence that avoids mentioning first-order theories at all. Basic theorems of model theory such as the compactness theorem have alternative proofs using ultraproducts,[15] and they can be used to construct saturated elementary extensions if they exist.[16]
See main article: Categorical theory.
A theory was originally called categorical if it determines a structure up to isomorphism. It turns out that this definition is not useful, due to serious restrictions in the expressivity of first-order logic. The Löwenheim–Skolem theorem implies that if a theory T has an infinite model for some infinite cardinal number, then it has a model of size for any sufficiently large cardinal number . Since two models of different sizes cannot possibly be isomorphic, only finite structures can be described by a categorical theory.
However, the weaker notion of -categoricity for a cardinal has become a key concept in model theory. A theory T is called -categorical if any two models of T that are of cardinality are isomorphic. It turns out that the question of -categoricity depends critically on whether is bigger than the cardinality of the language (i.e.
\aleph0+|\sigma|
-categorical theories can be characterised by properties of their type space:
For a complete first-order theory T in a finite or countable signature the following conditions are equivalent:
The theory of
(Q,<)
(R,<)
p(x1,...,xn)
xi
\omega
x=1+...+1
\aleph0
A complete first-order theory T in a finite or countable signature is -categorical if and only if its automorphism group is oligomorphic.
The equivalent characterisations of this subsection, due independently to Engeler, Ryll-Nardzewski and Svenonius, are sometimes referred to as the Ryll-Nardzewski theorem.
In combinatorial signatures, a common source of -categorical theories are Fraïssé limits, which are obtained as the limit of amalgamating all possible configurations of a class of finite relational structures.
Michael Morley showed in 1963 that there is only one notion of uncountable categoricity for theories in countable languages.[17]
If a first-order theory T in a finite or countable signature is -categorical for some uncountable cardinal, then T is κ-categorical for all uncountable cardinals .
Morley's proof revealed deep connections between uncountable categoricity and the internal structure of the models, which became the starting point of classification theory and stability theory. Uncountably categorical theories are from many points of view the most well-behaved theories.In particular, complete strongly minimal theories are uncountably categorical. This shows that the theory of algebraically closed fields of a given characteristic is uncountably categorical, with the transcendence degree of the field determining its isomorphism type.
A theory that is both -categorical and uncountably categorical is called totally categorical.
See main article: Stable theory.
A key factor in the structure of the class of models of a first-order theory is its place in the stability hierarchy.
A complete theory T is called
λ
λ
l{M}
A\subsetl{M}
λ
λ
λ
λ
\aleph0
\omega
A fundamental result in stability theory is the stability spectrum theorem, which implies that every complete theory T in a countable signature falls in one of the following classes:
λ
λ
λ
\aleph0 | |
λ |
=λ
\aleph0 | |
λ |
λ
λ\geq
\aleph0 | |
2 |
\aleph0 | |
2 |
A theory of the first type is called unstable, a theory of the second type is called strictly stable and a theory of the third type is called superstable.Furthermore, if a theory is
\omega
\omega
Many construction in model theory are easier when restricted to stable theories; for instance, every model of a stable theory has a saturated elementary extension, regardless of whether the generalised continuum hypothesis is true.[18]
Shelah's original motivation for studying stable theories was to decide how many models a countable theory has of any uncountable cardinality.[19] If a theory is uncountably categorical, then it is
\omega
λ
2λ
λ
The stability hierarchy is also crucial for analysing the geometry of definable sets within a model of a theory. In
\omega
A theory T in which every definable set has well-defined Morley Rank is called totally transcendental; if T is countable, then T is totally transcendental if and only if T is
\omega
infty
More recently, stability has been decomposed into simplicity and "not the independence property" (NIP). Simple theories are those theories in which a well-behaved notion of independence can be defined, while NIP theories generalise o-minimal structures.They are related to stability since a theory is stable if and only if it is NIP and simple,[20] and various aspects of stability theory have been generalised to theories in one of these classes.
Model-theoretic results have been generalised beyond elementary classes, that is, classes axiomatisable by a first-order theory.
Model theory in higher-order logics or infinitary logics is hampered by the fact that completeness and compactness do not in general hold for these logics. This is made concrete by Lindstrom's theorem, stating roughly that first-order logic is essentially the strongest logic in which both the Löwenheim-Skolem theorems and compactness hold. However, model theoretic techniques have been developed extensively for these logics too. It turns out, however, that much of the model theory of more expressive logical languages is independent of Zermelo-Fraenkel set theory.[21]
More recently, alongside the shift in focus to complete stable and categorical theories, there has been work on classes of models defined semantically rather than axiomatised by a logical theory.One example is homogeneous model theory, which studies the class of substructures of arbitrarily large homogeneous models. Fundamental results of stability theory and geometric stability theory generalise to this setting.[22] As a generalisation of strongly minimal theories, quasiminimally excellent classes are those in which every definable set is either countable or co-countable. They are key to the model theory of the complex exponential function.The most general semantic framework in which stability is studied are abstract elementary classes, which are defined by a strong substructure relation generalising that of an elementary substructure. Even though its definition is purely semantic, every abstract elementary class can be presented as the models of a first-order theory which omit certain types. Generalising stability-theoretic notions to abstract elementary classes is an ongoing research program.[23]
Among the early successes of model theory are Tarski's proofs of quantifier elimination for various algebraically interesting classes, such as the real closed fields, Boolean algebras and algebraically closed fields of a given characteristic. Quantifier elimination allowed Tarski to show that the first-order theories of real-closed and algebraically closed fields as well as the first-order theory of Boolean algebras are decidable, classify the Boolean algebras up to elementary equivalence and show that the theories of real-closed fields and algebraically closed fields of a given characteristic are unique. Furthermore, quantifier elimination provided a precise description of definable relations on algebraically closed fields as algebraic varieties and of the definable relations on real-closed fields as semialgebraic sets [24] [25]
In the 1960s, the introduction of the ultraproduct construction led to new applications in algebra. This includes Ax's work on pseudofinite fields, proving that the theory of finite fields is decidable, and Ax and Kochen's proof of as special case of Artin's conjecture on diophantine equations, the Ax-Kochen theorem.[26] The ultraproduct construction also led to Abraham Robinson's development of nonstandard analysis, which aims to provide a rigorous calculus of infinitesimals.
More recently, the connection between stability and the geometry of definable sets led to several applications from algebraic and diophantine geometry, including Ehud Hrushovski's 1996 proof of the geometric Mordell-Lang conjecture in all characteristics[27] In 2001, similar methods were used to prove a generalisation of the Manin-Mumford conjecture.In 2011, Jonathan Pila applied techniques around o-minimality to prove the André-Oort conjecture for products of Modular curves.[28]
In a separate strand of inquiries that also grew around stable theories, Laskowski showed in 1992 that NIP theories describe exactly those definable classes that are PAC-learnable in machine learning theory. This has led to several interactions between these separate areas. In 2018, the correspondence was extended as Hunter and Chase showed that stable theories correspond to online learnable classes.[29]
Model theory as a subject has existed since approximately the middle of the 20th century, and the name was coined by Alfred Tarski, a member of the Lwów–Warsaw school, in 1954.[30] However some earlier research, especially in mathematical logic, is often regarded as being of a model-theoretical nature in retrospect.[31] The first significant result in what is now model theory was a special case of the downward Löwenheim–Skolem theorem, published by Leopold Löwenheim in 1915. The compactness theorem was implicit in work by Thoralf Skolem,[32] but it was first published in 1930, as a lemma in Kurt Gödel's proof of his completeness theorem. The Löwenheim–Skolem theorem and the compactness theorem received their respective general forms in 1936 and 1941 from Anatoly Maltsev.The development of model theory as an independent discipline was brought on by Alfred Tarski during the interbellum. Tarski's work included logical consequence, deductive systems, the algebra of logic, the theory of definability, and the semantic definition of truth, among other topics. His semantic methods culminated in the model theory he and a number of his Berkeley students developed in the 1950s and '60s.
In the further history of the discipline, different strands began to emerge, and the focus of the subject shifted. In the 1960s, techniques around ultraproducts became a popular tool in model theory.[33] At the same time, researchers such as James Ax were investigating the first-order model theory of various algebraic classes, and others such as H. Jerome Keisler were extending the concepts and results of first-order model theory to other logical systems. Then, inspired by Morley's problem, Shelah developed stability theory. His work around stability changed the complexion of model theory, giving rise to a whole new class of concepts. This is known as the paradigm shift. [34] Over the next decades, it became clear that the resulting stability hierarchy is closely connected to the geometry of sets that are definable in those models; this gave rise to the subdiscipline now known as geometric stability theory. An example of an influential proof from geometric model theory is Hrushovski's proof of the Mordell–Lang conjecture for function fields.[35]
See main article: Finite model theory. Finite model theory, which concentrates on finite structures, diverges significantly from the study of infinite structures in both the problems studied and the techniques used.[36] In particular, many central results of classical model theory that fail when restricted to finite structures. This includes the compactness theorem, Gödel's completeness theorem, and the method of ultraproducts for first-order logic. At the interface of finite and infinite model theory are algorithmic or computable model theory and the study of 0-1 laws, where the infinite models of a generic theory of a class of structures provide information on the distribution of finite models.[37] Prominent application areas of FMT are descriptive complexity theory, database theory and formal language theory.[38]
Any set theory (which is expressed in a countable language), if it is consistent, has a countable model; this is known as Skolem's paradox, since there are sentences in set theory which postulate the existence of uncountable sets and yet these sentences are true in our countable model. Particularly the proof of the independence of the continuum hypothesis requires considering sets in models which appear to be uncountable when viewed from within the model, but are countable to someone outside the model.[39]
The model-theoretic viewpoint has been useful in set theory; for example in Kurt Gödel's work on the constructible universe, which, along with the method of forcing developed by Paul Cohen can be shown to prove the (again philosophically interesting) independence of the axiom of choice and the continuum hypothesis from the other axioms of set theory.[40]
In the other direction, model theory is itself formalised within Zermelo-Fraenkel set theory. For instance, the development of the fundamentals of model theory (such as the compactness theorem) rely on the axiom of choice, and is in fact equivalent over Zermelo-Fraenkel set theory without choice to the Boolean prime ideal theorem.[41] Other results in model theory depend on set-theoretic axioms beyond the standard ZFC framework. For example, if the Continuum Hypothesis holds then every countable model has an ultrapower which is saturated (in its own cardinality). Similarly, if the Generalized Continuum Hypothesis holds then every model has a saturated elementary extension. Neither of these results are provable in ZFC alone. Finally, some questions arising from model theory (such as compactness for infinitary logics) have been shown to be equivalent to large cardinal axioms.[42]