Regular conditional probability explained

In probability theory, regular conditional probability is a concept that formalizes the notion of conditioning on the outcome of a random variable. The resulting conditional probability distribution is a parametrized family of probability measures called a Markov kernel.

Definition

Conditional probability distribution

Consider two random variables

X,Y:\Omega\toR

. The conditional probability distribution of Y given X is a two variable function

\kappa_Y\mid:R x l{B}(R)\to[0,1]

If the random variable X is discrete

\kappa_Y\mid(x,A)=P(Y\inA\midX=x)=\begin{cases}

	P(Y\inA,X=x)
	P(X=x)

&ifP(X=x)>0\\[3pt] arbitraryvalue&otherwise. \end{cases}

If the random variables X, Y are continuous with density

f_X,Y(x,y)

\kappa_Y\mid(x,A)=\begin{cases}

	\int_Af_X,Y(x,y)dy
	\int_Rf_X,Y(x,y)dy

& if\int_Rf_X,Y(x,y)dy>0\\[3pt] arbitraryvalue&otherwise. \end{cases}

A more general definition can be given in terms of conditional expectation. Consider a function

e_Y:R\to[0,1]

satisfying

e_Y(X(\omega))=\operatornameE[1_Y\midX](\omega)

for almost all

\omega

.Then the conditional probability distribution is given by

\kappa_Y\mid(x,A)=e_Y(x).

As with conditional expectation, this can be further generalized to conditioning on a sigma algebra

l{F}

. In that case the conditional distribution is a function

\Omega x l{B}(R)\to[0,1]

\kappa_Y\midl{F

}(\omega, A) = \operatorname E[1_{Y \in A} \mid \mathcal{F}](\omega)

Regularity

For working with

\kappa_Y\mid

, it is important that it be regular, that is:

For almost all x,

A\mapsto\kappa_Y\mid(x,A)

is a probability measure

For all A,

x\mapsto\kappa_Y\mid(x,A)

is a measurable functionIn other words

\kappa_Y\mid

is a Markov kernel.

The second condition holds trivially, but the proof of the first is more involved. It can be shown that if Y is a random element

\Omega\toS

in a Radon space S, there exists a

\kappa_Y\mid

that satisfies the first condition.^[1] It is possible to construct more general spaces where a regular conditional probability distribution does not exist.^[2]

Relation to conditional expectation

For discrete and continuous random variables, the conditional expectation can be expressed as

\begin{aligned} \operatornameE[Y\midX=x]&=\sum_yyP(Y=y\midX=x)\\ \operatornameE[Y\midX=x]&=\intyf_Y\mid(x,y)dy \end{aligned}

where

f_Y\mid(x,y)

is the conditional density of given .

This result can be extended to measure theoretical conditional expectation using the regular conditional probability distribution:

\operatornameE[Y\midX](\omega)=\inty\kappa_{Y\mid\sigma(X)}(\omega,dy).

Formal definition

Let

(\Omega,lF,P)

be a probability space, and let

T:\Omega → E

be a random variable, defined as a Borel-measurable function from

\Omega

to its state space

(E,lE)

.One should think of

as a way to "disintegrate" the sample space

\Omega

into

\{T^-1(x)\}_x

.Using the disintegration theorem from the measure theory, it allows us to "disintegrate" the measure

into a collection of measures,one for each

x\inE

. Formally, a regular conditional probability is defined as a function

\nu:E x lF → [0,1],

called a "transition probability", where:

For every

x\inE

\nu(x, ⋅ )

is a probability measure on

. Thus we provide one measure for each

x\inE

For all

A\inlF

\nu( ⋅ ,A)

(a mapping

E\to[0,1]

) is

-measurable, and

For all

A\inlF

and all

B\inlE

^[3]

P(A\capT^-1(B))=\int_B\nu(x,A)(P\circT^-1)(dx)

where

P\circT^-1

is the pushforward measure

T_*P

of the distribution of the random element

x\in\operatorname{supp}T,

i.e. the support of the

T_*P

.Specifically, if we take

B=E

, then

A\capT^-1(E)=A

, and so

P(A)=\int_E\nu(x,A)(P\circT^-1)(dx),

where

\nu(x,A)

can be denoted, using more familiar terms

P(A | T=x)

Alternate definition

\Omega

(that is a probability measure defined on a Radon space endowed with the Borel sigma-algebra) and a real-valued random variable T. As discussed above, in this case there exists a regular conditional probability with respect to T. Moreover, we can alternatively define the regular conditional probability for an event A given a particular value t of the random variable T in the following manner:

P(A\midT=t)=\lim_U\supset

} \frac,

where the limit is taken over the net of open neighborhoods U of t as they become smaller with respect to set inclusion. This limit is defined if and only if the probability space is Radon, and only in the support of T, as described in the article. This is the restriction of the transition probability to the support of T. To describe this limiting process rigorously:

For every

\varepsilon>0,

there exists an open neighborhood U of the event, such that for every open V with

\{T=t\}\subsetV\subsetU,

\left\|	P(A\capV)
	P(V)

-L\right|<\varepsilon,

where

L=P(A\midT=t)

is the limit.

References

Book: Klenke . Achim . Probability theory : a comprehensive course . 30 August 2013 . London . 978-1-4471-5361-0 . Second.
Faden, A.M., 1985. The existence of regular conditional probabilities: necessary and sufficient conditions. The Annals of Probability, 13(1), pp. 288–298.
D. Leao Jr. et al. Regular conditional probability, disintegration of probability and Radon spaces. Proyecciones. Vol. 23, No. 1, pp. 15–29, May 2004, Universidad Católica del Norte, Antofagasta, Chile PDF

External links

Regular Conditional Probability on PlanetMath

Regular conditional probability explained

Definition

Conditional probability distribution

Regularity

Relation to conditional expectation

Formal definition

Alternate definition

See also

References

External links