Notation in probability and statistics explained

Probability theory and statistics have some commonly used conventions, in addition to standard mathematical notation and mathematical symbols.

Probability theory

P(X=x)

is written, then it represents the probability that a particular realisation of a random variable (e.g., height, number of cars, or bicycle colour), X, would be equal to a particular value or category (e.g., 1.735 m, 52, or purple), x. It is important that X and x are not confused into meaning the same thing. X is an idea, x is a value. Clearly they are related, but they do not have identical meanings.

P(X\lex)

to distinguish the random variable from its realization.[1]

P

to distinguish it from other functions and measure P to avoid having to define "P is a probability" and

P(X\inA)

is short for

P(\{\omega\in\Omega:X(\omega)\inA\})

, where

\Omega

is the event space,

X

is a random variable that is a function of

\omega

(i.e., it depends upon

\omega

), and

\omega

is some outcome of interest within the domain specified by

\Omega

(say, a particular height, or a particular colour of a car).

\Pr(A)

notation is used alternatively.

P(A\capB)

or

P[B\capA]

indicates the probability that events A and B both occur. The joint probability distribution of random variables X and Y is denoted as

P(X,Y)

, while joint probability mass function or probability density function as

f(x,y)

and joint cumulative distribution function as

F(x,y)

.

P(A\cupB)

or

P[B\cupA]

indicates the probability of either event A or event B occurring ("or" in this case means one or the other or both).

lF

for the set of sets on which we define the probability P)

f(x)

, or

fX(x)

.

F(x)

, or

FX(x)

.

\overline{F}(x)=1-F(x)

, or denoted as

S(x)

,

X\perpY

or

X\perp\perpY

, and X is independent of Y given W is often written

X\perp\perpY|W

or

X\perpY|W

styleP(A\midB)

, the conditional probability, is the probability of

styleA

given

styleB

Statistics

\widehat{\theta}

is an estimator for

\theta

.

\bar{x}

, pronounced "x bar".

\bar{x}

,

x(k)

is used for the

kth

order statistic, where

x(1)

is the sample minimum and

x(n)

is the sample maximum from a total sample size n.[3]

Critical values

The α-level upper critical value of a probability distribution is the value exceeded with probability \alpha, that is, the value x_\alpha such that F(x_\alpha) = 1-\alpha, where F is the cumulative distribution function. There are standard notations for the upper critical values of some commonly used distributions in statistics:

{\chi\alpha,\nu

}^2 or

{\chi}2(\alpha,\nu)

for the chi-squared distribution with \nu degrees of freedom
F
\alpha,\nu1,\nu2
or F(\alpha,\nu_1,\nu_2) for the F-distribution with \nu_1 and \nu_2 degrees of freedom

Linear algebra

Abbreviations

Common abbreviations include:

\nu

)

\{Ani.o.\}=capNcupn\geqAn

\{Anult.\}=cupNcapn\geqAn

See also

External links

Notes and References

  1. Web site: 2021-08-09 . Calculating Probabilities from Cumulative Distribution Function . 2024-02-26.
  2. Web site: 1999-02-13 . Letters of the Greek Alphabet and Some of Their Statistical Uses . 2024-02-26 . les.appstate.edu/.
  3. Web site: Order Statistics . 2024-02-26 . colorado.edu.