Pi-system explained

\Omega

is a collection

P

of certain subsets of

\Omega,

such that

P

is non-empty.

A,B\inP

then

A\capB\inP.

That is,

P

is a non-empty family of subsets of

\Omega

that is closed under non-empty finite intersections.[1] The importance of -systems arises from the fact that if two probability measures agree on a -system, then they agree on the -algebra generated by that -system. Moreover, if other properties, such as equality of integrals, hold for the -system, then they hold for the generated -algebra as well. This is the case whenever the collection of subsets for which the property holds is a -system. -systems are also useful for checking independence of random variables.

This is desirable because in practice, -systems are often simpler to work with than -algebras. For example, it may be awkward to work with -algebras generated by infinitely many sets

\sigma(E1,E2,\ldots).

So instead we may examine the union of all -algebras generated by finitely many sets \bigcup_n \sigma(E_1, \ldots, E_n). This forms a -system that generates the desired -algebra. Another example is the collection of all intervals of the real line, along with the empty set, which is a -system that generates the very important Borel -algebra of subsets of the real line.

Definitions

A -system is a non-empty collection of sets

P

that is closed under non-empty finite intersections, which is equivalent to

P

containing the intersection of any two of its elements. If every set in this -system is a subset of

\Omega

then it is called a

\Sigma

of subsets of

\Omega,

there exists a -system

l{I}\Sigma,

called the , that is the unique smallest -system of

\Omega

containing every element of

\Sigma.

It is equal to the intersection of all -systems containing

\Sigma,

and can be explicitly described as the set of all possible non-empty finite intersections of elements of

\Sigma:

\left\.

A non-empty family of sets has the finite intersection property if and only if the -system it generates does not contain the empty set as an element.

Examples

a

and

b,

the intervals

(-infty,a]

form a -system, and the intervals

(a,b]

form a -system if the empty set is also included.

f:\Omega\to\Reals,

the set  

l{I}f=\left\{f-1((-infty,x]):x\in\Reals\right\}

defines a -system, and is called the -system by

f.

(Alternatively,

\left\{f-1((a,b]):a,b\in\Reals,a<b\right\}\cup\{\varnothing\}

defines a -system generated by

f.

)

P1

and

P2

are -systems for

\Omega1

and

\Omega2,

respectively, then

\{A1 x A2:A1\inP1,A2\inP2\}

is a -system for the Cartesian product

\Omega1 x \Omega2.

Relationship to -systems

A -system on

\Omega

is a set

D

of subsets of

\Omega,

satisfying

\Omega\inD,

A\inD

then

\Omega\setminusA\inD,

A1,A2,A3,\ldots

is a sequence of (pairwise) subsets in

D

then
infty
stylecup\limits
n=1

An\inD.

Whilst it is true that any -algebra satisfies the properties of being both a -system and a -system, it is not true that any -system is a -system, and moreover it is not true that any -system is a -algebra. However, a useful classification is that any set system which is both a -system and a -system is a -algebra. This is used as a step in proving the - theorem.

The - theorem

Let

D

be a -system, and let  

l{I}\subseteqD

be a -system contained in

D.

The - theorem[2] states that the -algebra

\sigma(l{I})

generated by

l{I}

is contained in

D~:~

\sigma(l{I})\subseteqD.

The - theorem can be used to prove many elementary measure theoretic results. For instance, it is used in proving the uniqueness claim of the Carathéodory extension theorem for -finite measures.[3]

The - theorem is closely related to the monotone class theorem, which provides a similar relationship between monotone classes and algebras, and can be used to derive many of the same results. Since -systems are simpler classes than algebras, it can be easier to identify the sets that are in them while, on the other hand, checking whether the property under consideration determines a -system is often relatively easy. Despite the difference between the two theorems, the - theorem is sometimes referred to as the monotone class theorem.[2]

Example

Let

\mu1,\mu2:F\to\Reals

be two measures on the -algebra

F,

and suppose that

F=\sigma(I)

is generated by a -system

I.

If

\mu1(A)=\mu2(A)

for all

A\inI,

and

\mu1(\Omega)=\mu2(\Omega)<infty,

then

\mu1=\mu2.

This is the uniqueness statement of the Carathéodory extension theorem for finite measures. If this result does not seem very remarkable, consider the fact that it usually is very difficult or even impossible to fully describe every set in the -algebra, and so the problem of equating measures would be completely hopeless without such a tool.

Idea of the proof[3] Define the collection of setsD = \left\.By the first assumption,

\mu1

and

\mu2

agree on

I

and thus

I\subseteqD.

By the second assumption,

\Omega\inD,

and it can further be shown that

D

is a -system. It follows from the - theorem that

\sigma(I)\subseteqD\subseteq\sigma(I),

and so

D=\sigma(I).

That is to say, the measures agree on

\sigma(I).

-Systems in probability

-systems are more commonly used in the study of probability theory than in the general field of measure theory. This is primarily due to probabilistic notions such as independence, though it may also be a consequence of the fact that the - theorem was proven by the probabilist Eugene Dynkin. Standard measure theory texts typically prove the same results via monotone classes, rather than -systems.

Equality in distribution

X:(\Omega,lF,\operatornameP)\to\Reals

in terms of its cumulative distribution function. Recall that the cumulative distribution of a random variable is defined asF_X(a) = \operatorname[X \leq a], \qquad a \in \Reals,whereas the seemingly more general of the variable is the probability measure \mathcal_X(B) = \operatorname\left[X^{-1}(B)\right] \quad \text B \in \mathcal(\Reals),where

l{B}(\Reals)

is the Borel -algebra. The random variables

X:(\Omega,lF,\operatornameP)\to\Reals

and

Y:(\tilde\Omega,\tilde{lF},\tilde{\operatornameP})\to\Reals

(on two possibly different probability spaces) are (or), denoted by

X\stackrel{lD}{=}Y,

if they have the same cumulative distribution functions; that is, if

FX=FY.

The motivation for the definition stems from the observation that if

FX=FY,

then that is exactly to say that

l{L}X

and

l{L}Y

agree on the -system

\{(-infty,a]:a\in\Reals\}

which generates

l{B}(\Reals),

and so by the example above:

l{L}X=l{L}Y.

A similar result holds for the joint distribution of a random vector. For example, suppose

X

and

Y

are two random variables defined on the same probability space

(\Omega,l{F},\operatorname{P}),

with respectively generated -systems

l{I}X

and

l{I}Y.

The joint cumulative distribution function of

(X,Y)

isF_(a, b) = \operatorname[X \leq a, Y \leq b] = \operatorname\left[X^{-1}((-\infty, a]) \cap Y^((-\infty, b])\right], \quad \text a, b \in \Reals.

However,

A=X-1((-infty,a])\inl{I}X

and

B=Y-1((-infty,b])\inl{I}Y.

Because \mathcal_ = \left\is a -system generated by the random pair

(X,Y),

the - theorem is used to show that the joint cumulative distribution function suffices to determine the joint law of

(X,Y).

In other words,

(X,Y)

and

(W,Z)

have the same distribution if and only if they have the same joint cumulative distribution function.

In the theory of stochastic processes, two processes

(Xt)t,(Yt)t

are known to be equal in distribution if and only if they agree on all finite-dimensional distributions; that is, for all

t1,\ldots,tn\inT,n\in\N,

\left(X_, \ldots, X_\right) \,\stackrel\, \left(Y_, \ldots, Y_\right).

The proof of this is another application of the - theorem.[4]

Independent random variables

The theory of -system plays an important role in the probabilistic notion of independence. If

X

and

Y

are two random variables defined on the same probability space

(\Omega,l{F},\operatorname{P})

then the random variables are independent if and only if their -systems

l{I}X,l{I}Y

satisfy for all

A\inl{I}X

and

B\inl{I}Y,

\operatorname[A \cap B] ~=~ \operatorname[A] \operatorname[B],which is to say that

l{I}X,l{I}Y

are independent. This actually is a special case of the use of -systems for determining the distribution of

(X,Y).

Example

Let

Z=\left(Z1,Z2\right),

where

Z1,Z2\siml{N}(0,1)

are iid standard normal random variables. Define the radius and argument (arctan) variablesR = \sqrt, \qquad \Theta = \tan^\left(Z_2 / Z_1\right).

Then

R

and

\Theta

are independent random variables.

To prove this, it is sufficient to show that the -systems

l{I}R,l{I}\Theta

are independent: that is, for all

\rho\in[0,infty)

and

\theta\in[0,2\pi],

\operatorname[R \leq \rho, \Theta \leq \theta] = \operatorname[R \leq \rho] \operatorname[\Theta \leq \theta].

Confirming that this is the case is an exercise in changing variables. Fix

\rho\in[0,infty)

and

\theta\in[0,2\pi],

then the probability can be expressed as an integral of the probability density function of

Z.

\begin\operatorname P [R \leq \rho, \Theta \leq \theta] &= \int_ \frac\exp\left(\right) dz_1 \, dz_2 \\[5pt]& = \int_0^ \int_0^\rho \frace^ \; r \, dr \, d\tilde\theta \\[5pt]& = \left(\int_0^\theta \frac \, d\tilde \theta\right) \; \left(\int_0^\rho e^ \; r \, dr\right) \\[5pt]& = \operatorname P[\Theta \leq \theta]\operatorname P[R \leq \rho].\end

References

Notes and References

  1. The nullary (0-ary) intersection of subsets of

    \Omega

    is by convention equal to

    \Omega,

    which is not required to be an element of a -system.
  2. Kallenberg, Foundations Of Modern Probability, p. 2
  3. Durrett, Probability Theory and Examples, p. 404
  4. Kallenberg, Foundations Of Modern Probability, p. 48