Markov property explained

In probability theory and statistics, the term Markov property refers to the memoryless property of a stochastic process, which means that its future evolution is independent of its history. It is named after the Russian mathematician Andrey Markov.^[1] The term strong Markov property is similar to the Markov property, except that the meaning of "present" is defined in terms of a random variable known as a stopping time.

The term Markov assumption is used to describe a model where the Markov property is assumed to hold, such as a hidden Markov model.

A Markov random field extends this property to two or more dimensions or to random variables defined for an interconnected network of items.^[2] An example of a model for such a field is the Ising model.

A discrete-time stochastic process satisfying the Markov property is known as a Markov chain.

Introduction

A stochastic process has the Markov property if the conditional probability distribution of future states of the process (conditional on both past and present values) depends only upon the present state; that is, given the present, the future does not depend on the past. A process with this property is said to be Markov or Markovian and known as a Markov process. Two famous classes of Markov process are the Markov chain and Brownian motion.

Note that there is a subtle, often overlooked and very important point that is often missed in the plain English statement of the definition. Namely that the statespace of the process is constant through time. The conditional description involves a fixed "bandwidth". For example, without this restriction we could augment any process to one which includes the complete history from a given initial condition and it would be made to be Markovian. But the state space would be of increasing dimensionality over time and does not meet the definition.

Definition

Let

(\Omega,l{F},P)

be a probability space with a filtration

(l{F}_s, s\inI)

, for some (totally ordered) index set

; and let

(S,l{S})

be a measurable space. A

(S,l{S})

-valued stochastic process

X=\{X_t:\Omega\toS\}_t\in

adapted to the filtration is said to possess the Markov property if, for each

A\inl{S}

and each

s,t\inI

with

s<t

P(X_t\inA\midl{F}_s)=P(X_t\inA\midX_s).

^[3]

In the case where

is a discrete set with the discrete sigma algebra and

I=N

, this can be reformulated as follows:

P(X_n+1=x_n+1\midX_n=x_n,...,X_1=x_1)=P(X_n+1=x_n+1\midX_n=x_n)foralln\inN.

Alternative formulations

Alternatively, the Markov property can be formulated as follows.

\operatorname{E}[f(X_t)\midl{F}_{s]=\operatorname{E}[f(X}_{t)\mid\sigma(X}_s)]

for all

t\geqs\geq0

and

f:S → R

bounded and measurable.^[4]

Strong Markov property

Suppose that

X=(X_t:t\geq0)

is a stochastic process on a probability space

(\Omega,l{F},P)

with natural filtration

\{l{F}_t\}_t\geq

. Then for any stopping time

\tau

\Omega

, we can define

l{F}_\tau=\{A\inl{F}:\forallt\geq0,\{\tau\leqt\}\capA\inl{F}_t\}

Then

is said to have the strong Markov property if, for each stopping time

\tau

, conditional on the event

\{\tau<infty\}

, we have that for each

t\ge0

X_\tau

is independent of

l{F}_\tau

given

X_\tau

The strong Markov property implies the ordinary Markov property since by taking the stopping time

\tau=t

, the ordinary Markov property can be deduced.^[5]

In forecasting

In the fields of predictive modelling and probabilistic forecasting, the Markov property is considered desirable since it may enable the reasoning and resolution of the problem that otherwise would not be possible to be resolved because of its intractability. Such a model is known as a Markov model.

Examples

Assume that an urn contains two red balls and one green ball. One ball was drawn yesterday, one ball was drawn today, and the final ball will be drawn tomorrow. All of the draws are "without replacement".

Suppose you know that today's ball was red, but you have no information about yesterday's ball. The chance that tomorrow's ball will be red is 1/2. That's because the only two remaining outcomes for this random experiment are:

Day	Outcome 1	Outcome 2
Yesterday	Red	Green
Today	Red	Red
Tomorrow	Green	Red

On the other hand, if you know that both today and yesterday's balls were red, then you are guaranteed to get a green ball tomorrow.

This discrepancy shows that the probability distribution for tomorrow's color depends not only on the present value, but is also affected by information about the past. This stochastic process of observed colors doesn't have the Markov property. Using the same experiment above, if sampling "without replacement" is changed to sampling "with replacement," the process of observed colors will have the Markov property.^[6]

An application of the Markov property in a generalized form is in Markov chain Monte Carlo computations in the context of Bayesian statistics.

Notes and References

Markov, A. A. (1954). Theory of Algorithms. [Translated by Jacques J. Schorr-Kon and PST staff] Imprint Moscow, Academy of Sciences of the USSR, 1954 [Jerusalem, Israel Program for Scientific Translations, 1961; available from Office of Technical Services, [[United States Department of Commerce]]] Added t.p. in Russian Translation of Works of the Mathematical Institute, Academy of Sciences of the USSR, v. 42. Original title: Teoriya algorifmov. [QA248.M2943 Dartmouth College library. U.S. Dept. of Commerce, Office of Technical Services, number OTS 60-51085.]
[Yadolah Dodge|Dodge, Yadolah]
[Rick Durrett|Durrett, Rick]
Book: Øksendal, Bernt K. . Bernt Øksendal . Stochastic Differential Equations: An Introduction with Applications . Springer, Berlin . 2003 . 3-540-04758-1.
Ethier, Stewart N. and Kurtz, Thomas G. Markov Processes: Characterization and Convergence. Wiley Series in Probability and Mathematical Statistics, 1986, p. 158.
Web site: Example of a stochastic process which does not have the Markov property . . 2020-07-07.