Category of Markov kernels explained

In mathematics, the category of Markov kernels, often denoted Stoch, is the category whose objects are measurable spaces and whose morphisms are Markov kernels.It is analogous to the category of sets and functions, but where the arrows can be interpreted as being stochastic.

Several variants of this category are used in the literature. For example, one can use subprobability kernels instead of probability kernels, or more general s-finite kernels.Also, one can take as morphisms equivalence classes of Markov kernels under almost sure equality; see below.

Definition

Recall that a Markov kernel between measurable spaces

(X,l{F})

and

(Y,l{G})

is an assignment

k:X x l{G}\toR

which is measurable as a function on

and which is a probability measure on

l{G}

. We denote its values by

k(B|x)

for

x\inX

and

B\inl{G}

, which suggests an interpretation as conditional probability.

The category Stoch has:

As objects, measurable spaces;
As morphisms, Markov kernels between them;
For each measurable space

(X,l{F})

, the identity morphism is given by the kernel

\delta(A|x)=1_A(x)=\begin{cases} 1&x\inA;\\ 0&x\notinA\end{cases}

for all

x\inX

and

A\inl{F}

;

Given kernels

k:(X,l{F})\to(Y,l{G})

and

h:(Y,l{G})\to(Z,l{H})

, the composite morphism

h\circk:(X,l{F})\to(Z,l{H})

is given by

(h\circk)(C|x)=\int_Yh(C|y)k(dy|x)

for all

x\inX

and

C\inl{H}

.This composition formula is sometimes called the Chapman-Kolmogorov equation.

This composition is unital, and associative by the monotone convergence theorem, so that one indeed has a category.

Basic properties

Probability measures

. Morphisms in the form

1\toX

can be equivalently seen as probability measures on

, since they correspond to functions

1\toPX

, i.e. elements of

Given kernels

p:1\toX

and

k:X\toY

, the composite kernel

k\circp:1\toY

gives the probability measure on

with values

(k\circp)(B)=\int_Xk(B|x)p(dx),

for every measurable subset

(X,l{F},p)

and

(Y,l{G},q)

, a measure-preserving Markov kernel

(X,l{F},p)\to(Y,l{G},q)

is a Markov kernel

k:(X,l{F})\to(Y,l{G})

such that for every measurable subset

B\inl{G}

q(B)=\int_Xk(B|x)p(dx).

(Hom_{Stoch(1,-),Stoch)}

Measurable functions

Every measurable function

f:(X,l{F})\to(Y,l{G})

defines canonically a Markov kernel

\delta_{f:(X,l{F})\to(Y,l{G})}

as follows,

\delta_f(B|x)=1_B(f(x))=\begin{cases} 1&f(x)\inB;\\ 0&f(x)\notinB \end{cases}

for every

x\inX

and every

B\inl{G}

. This construction preserves identities and compositions, and is therefore a functor from Meas to Stoch.

Isomorphisms

By functoriality, every isomorphism of measurable spaces (in the category Meas) induces an isomorphism in Stoch. However, in Stoch there are more isomorphisms, and in particular, measurable spaces can be isomorphic in Stoch even when the underlying sets are not in bijection.

Relationship with other categories

Stoch is the Kleisli category of the Giry monad. This in particular implies that there is an adjunction

Hom_Stoch(X,Y)\congHom_Meas(X,PY)

between Stoch and the category of measurable spaces.

L:Meas\toStoch

of the adjunction above is the identity on objects, and on morphisms it gives the canonical Markov kernel induced by a measurable function described above.

(Hom_{Stoch(1,-),Stoch)}

(Hom_{Stoch(1,-),L)}

Particular limits and colimits

Since the functor

L:Meas\toStoch

is left adjoint, it preserves colimits. Because of this, all colimits in the category of measurable spaces are also colimits in Stoch. For example,

The initial object is the empty set, with its trivial measurable structure;
The coproduct is given by the disjoint union of measurable spaces, with its canonical sigma-algebra.
The sequential colimit of a decreasing filtration is given by the intersection of sigma-algebras.

In general, the functor

does not preserve limits. This in particular implies that the product of measurable spaces is not a product in Stoch in general. Since the Giry monad is monoidal, however, the product of measurable spaces still makes Stoch a monoidal category.

A limit of particular significance for probability theory is de Finetti's theorem, which can be interpreted as the fact that the space of probability measures (Giry monad) is the limit in Stoch of the diagram formed by finite permutations of sequences.

Almost sure version

Sometimes it is useful to consider Markov kernels only up to almost sure equality, for example when talking about disintegrations or about regular conditional probability.

(X,l{F},p)

and

(Y,l{G},q)

, we say that two measure-preserving kernels

k,h:(X,l{F},p)\to(Y,l{G},q)

are almost surely equal if and only if for every measurable subset

B\inl{G}

k(B|x)=h(B|x)

for

-almost all

x\inX

.This defines an equivalence relation on the set of measure-preserving Markov kernels

k,h:(X,l{F},p)\to(Y,l{G},q)

Probability spaces and equivalence classes of Markov kernels under the relation defined above form a category. When restricted to standard Borel probability spaces, the category is often denoted by Krn.

References

Web site: Lawvere. F. W.. The Category of Probabilistic Mappings. 1962.
Chentsov. N. N. . The categories of mathematical statistics. Dokl. Akad. SSSR. 164. 1965.
Book: Giry, Michèle . Categorical Aspects of Topology and Analysis . A categorical approach to probability theory . Lecture Notes in Mathematics . 1982 . 915 . 68–85 . Springer. 10.1007/BFb0092872 . 978-3-540-11211-2 . https://link.springer.com/chapter/10.1007/BFb0092872.
Panangaden. Prakash. The category of Markov kernels. Electronic Notes in Theoretical Computer Science. 22. 1999. 171–187. 10.1016/S1571-0661(05)80602-4. free.
Book: Riehl , Emily . Category Theory in Context. 2016 . Dover. 9780486809038.
Book: Kallenberg, Olav . Random Measures, Theory and Applications . Probability Theory and Stochastic Modelling. 2017 . 77 . Springer. 10.1007/978-3-319-41598-7. 978-3-319-41596-3.
Dahlqvist . Fredrik. Danos . Vincent. Garnier . Ilias. Silva . Alexandra . Alexandra Silva. Borel Kernels and their Approximation, Categorically. MFPS 2018: Proceedings of Mathematical Foundations of Programming Semantics. 2018. 1803.02651.
Fritz . Tobias. A synthetic approach to Markov kernels, conditional independence and theorems on sufficient statistics. Advances in Mathematics. 370. 2020. 10.1016/j.aim.2020.107239. 1908.07021. 201103837.

Category of Markov kernels explained

Definition

Basic properties

Probability measures

Measurable functions

Isomorphisms

Relationship with other categories

Particular limits and colimits

Almost sure version

See also

References

Further reading