Misconceptions about the normal distribution explained

Students of statistics and probability theory sometimes develop misconceptions about the normal distribution, ideas that may seem plausible but are mathematically untrue. For example, it is sometimes mistakenly thought that two linearly uncorrelated, normally distributed random variables must be statistically independent. However, this is untrue, as can be demonstrated by counterexample. Likewise, it is sometimes mistakenly thought that a linear combination of normally distributed random variables will itself be normally distributed, but again, counterexamples prove this wrong.

To say that the pair

(X,Y)

of random variables has a bivariate normal distribution means that every linear combination

aX+bY

of

X

and

Y

for constant (i.e. not random) coefficients

a

and

b

(not both equal to zero) has a univariate normal distribution. In that case, if

X

and

Y

are uncorrelated then they are independent.[1] However, it is possible for two random variables

X

and

Y

to be so distributed jointly that each one alone is marginally normally distributed, and they are uncorrelated, but they are not independent; examples are given below.

Examples

A symmetric example

Suppose

X

has a normal distribution with expected value 0 and variance 1. Let

W

have the Rademacher distribution, so that

W=1

or

W=-1

, each with probability 1/2, and assume

W

is independent of

X

. Let

Y=WX

. Then

X

and

Y

are uncorrelated, as can be verified by calculating their covariance. Moreover, both have the same normal distribution. And yet,

X

and

Y

are not independent.[2] [3] [4]

To see that

X

and

Y

are not independent, observe that

|Y|=|X|

or that

\operatorname{Pr}(Y>1|-1/2<X<1/2)=\operatorname{Pr}(X>1|-1/2<X<1/2)=0

.

Finally, the distribution of the simple linear combination

X+Y

concentrates positive probability at 0:

\operatorname{Pr}(X+Y=0)=1/2

. Therefore, the random variable

X+Y

is not normally distributed, and so also

X

and

Y

are not jointly normally distributed (by the definition above).[2]

An asymmetric example

Suppose

X

has a normal distribution with expected value 0 and variance 1. LetY=\left\

Notes and References

  1. Book: Probability and Statistical Inference . 2001 . Hogg . Robert . Robert V. Hogg . Tanis . Elliot . Elliot Tanis . 6th . Chapter 5.4 The Bivariate Normal Distribution . 258–259 . Prentice Hall . 0130272949.
  2. Web site: Lecture 21. The Multivariate Normal Distribution . Lectures on Statistics . Robert B. . Ash . https://web.archive.org/web/20070714013309/http://www.math.uiuc.edu/~r-ash/Stat/StatLec21-25.pdf . dead . 2007-07-14.
  3. Web site: A Rant About Uncorrelated Normal Random Variables . Rosenthal . Jeffrey S. . Jeff Rosenthal . 2005.
  4. Book: Joesph P. . Romano . Andrew F. . Siegel . Counterexamples in Probability and Statistics . Counterexamples in Probability and Statistics . Wadsworth & Brooks/Cole . 1986 . 0-534-05568-0 . 65–66.