Cauchy distribution

The Cauchy distribution, named after Augustin Cauchy, is a continuous probability distribution. It is also known, especially among physicists, as the Lorentz distribution (after Hendrik Lorentz), the Cauchy–Lorentz distribution, the Lorentz(ian) function, or the Breit–Wigner distribution. The Cauchy distribution f(x; x_0, \gamma) is the distribution of the x-intercept of a ray issuing from (x_0, \gamma) with a uniformly distributed angle. It is also the distribution of the ratio of two independent normally distributed random variables with mean zero.

The Cauchy distribution is often used in statistics as the canonical example of a "pathological" distribution since both its expected value and its variance are undefined (but see below). The Cauchy distribution does not have finite moments of order greater than or equal to one; only fractional absolute moments exist.[1] The Cauchy distribution has no moment generating function.

In mathematics, it is closely related to the Poisson kernel, which is the fundamental solution for the Laplace equation in the upper half-plane.

It is one of the few stable distributions with a probability density function that can be expressed analytically, the others being the normal distribution and the Lévy distribution.

History

A function with the form of the density function of the Cauchy distribution was studied geometrically by Fermat in 1659, and later was known as the witch of Agnesi, after Agnesi included it as an example in her 1748 calculus textbook. Despite its name, the first explicit analysis of the properties of the Cauchy distribution was published by the French mathematician Poisson in 1824, with Cauchy only becoming associated with it during an academic controversy in 1853.[2] Poisson noted that if the mean of observations following such a distribution were taken, the mean error did not converge to any finite number. As such, Laplace's use of the central limit theorem with such a distribution was inappropriate, as it assumed a finite mean and variance. Despite this, Poisson did not regard the issue as important, in contrast to Bienaymé, who was to engage Cauchy in a long dispute over the matter.

Constructions

Here are the most important constructions.

Rotational symmetry

If one stands in front of a line and kicks a ball with a direction (more precisely, an angle) uniformly at random towards the line, then the distribution of the point where the ball hits the line is a Cauchy distribution.

More formally, consider a point at (x_0, \gamma) in the x-y plane, and select a line passing through the point, with its direction (the angle it makes with the x-axis) chosen uniformly at random between -90° and +90°. The intersection of the line with the x-axis follows the Cauchy distribution with location x_0 and scale \gamma.

This definition gives a simple way to sample from the standard Cauchy distribution. Let u be a sample from a uniform distribution on [0, 1]; then we can generate a sample x from the standard Cauchy distribution using

x = \tan\left(\pi\left(u - \tfrac{1}{2}\right)\right)
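As a minimal sketch of this recipe in Python (the helper name sample_standard_cauchy, the seed, and the sample size are illustrative, not from the source):

    import numpy as np

    rng = np.random.default_rng(seed=0)

    def sample_standard_cauchy(n, rng):
        """Draw n standard Cauchy variates via x = tan(pi * (u - 1/2))."""
        u = rng.uniform(0.0, 1.0, size=n)  # u ~ Uniform[0, 1]
        return np.tan(np.pi * (u - 0.5))

    x = sample_standard_cauchy(100_000, rng)
    print(np.median(x))  # near 0: the sample median estimates the location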

When U and V are two independent normally distributed random variables with expected value 0 and variance 1, then the ratio U/V has the standard Cauchy distribution.

More generally, if (U, V) has a rotationally symmetric distribution on the plane, then the ratio U/V has the standard Cauchy distribution.
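This construction is easy to check numerically; a sketch (assuming SciPy is available, with arbitrary sample size and seed) compares the ratio of two independent standard normals against the standard Cauchy CDF:

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(seed=1)
    u = rng.standard_normal(100_000)
    v = rng.standard_normal(100_000)

    # A Kolmogorov-Smirnov test of U/V against the standard Cauchy CDF
    # should report a large p-value (no evidence against the Cauchy model).
    print(stats.kstest(u / v, stats.cauchy.cdf))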

Probability density function (PDF)

The Cauchy distribution is the probability distribution with the following probability density function (PDF)[1] [3]

f(x; x_0, \gamma) = \frac{1}{\pi\gamma\left[1 + \left(\frac{x - x_0}{\gamma}\right)^2\right]} = \frac{1}{\pi}\left[\frac{\gamma}{(x - x_0)^2 + \gamma^2}\right],

where x_0 is the location parameter, specifying the location of the peak of the distribution, and \gamma is the scale parameter, which specifies the half-width at half-maximum (HWHM); alternatively, 2\gamma is the full width at half maximum (FWHM). \gamma is also equal to half the interquartile range and is sometimes called the probable error. Augustin-Louis Cauchy exploited such a density function in 1827 with an infinitesimal scale parameter, defining what would now be called a Dirac delta function.
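For illustration, a small sketch (the helper cauchy_pdf is ours) evaluates the density and confirms the HWHM interpretation of \gamma:

    import numpy as np

    def cauchy_pdf(x, x0=0.0, gamma=1.0):
        """f(x; x0, gamma) = (1/pi) * gamma / ((x - x0)^2 + gamma^2)."""
        return gamma / (np.pi * ((x - x0) ** 2 + gamma ** 2))

    x0, gamma = 2.0, 0.5
    peak = cauchy_pdf(x0, x0, gamma)          # amplitude 1/(pi * gamma)
    half = cauchy_pdf(x0 + gamma, x0, gamma)  # density one HWHM from the peak
    print(np.isclose(peak, 1 / (np.pi * gamma)))  # True
    print(np.isclose(half / peak, 0.5))           # True: gamma is the HWHM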

Properties of PDF

The maximum value or amplitude of the Cauchy PDF is \frac{1}{\pi\gamma}, located at x = x_0.

It is sometimes convenient to express the PDF in terms of the complex parameter \psi = x_0 + i\gamma:

f(x; \psi) = \frac{1}{\pi}\operatorname{Im}\left(\frac{1}{x - \psi}\right) = \frac{1}{\pi}\operatorname{Re}\left(\frac{-i}{x - \psi}\right)

The special case when x_0 = 0 and \gamma = 1 is called the standard Cauchy distribution, with the probability density function[4] [5]

f(x; 0, 1) = \frac{1}{\pi(1 + x^2)}.

In physics, a three-parameter Lorentzian function is often used:

f(x; x_0, \gamma, I) = \frac{I}{1 + \left(\frac{x - x_0}{\gamma}\right)^2} = I\left[\frac{\gamma^2}{(x - x_0)^2 + \gamma^2}\right],

where I is the height of the peak. The three-parameter Lorentzian function indicated is not, in general, a probability density function, since it does not integrate to 1, except in the special case where

I = \frac{1}{\pi\gamma}.

Cumulative distribution function (CDF)

The Cauchy distribution is the probability distribution with the following cumulative distribution function (CDF):

F(x; x_0, \gamma) = \frac{1}{\pi}\arctan\left(\frac{x - x_0}{\gamma}\right) + \frac{1}{2}

and the quantile function (inverse cdf) of the Cauchy distribution is

Q(p;x0,\gamma)=x0+\gamma\tan\left[\pi\left(p-\tfrac{1}{2}\right)\right].

It follows that the first and third quartiles are (x_0 - \gamma, x_0 + \gamma), and hence the interquartile range is 2\gamma.

The CDF of the standard Cauchy distribution can be written in terms of \arctan(x):

F(x; 0, 1) = \frac{1}{\pi}\arctan(x) + \frac{1}{2}
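As a quick sketch (function names are ours), the CDF and the quantile function invert each other and reproduce the quartiles x_0 ± \gamma:

    import numpy as np

    def cauchy_cdf(x, x0=0.0, gamma=1.0):
        """F(x; x0, gamma) = (1/pi) * arctan((x - x0) / gamma) + 1/2."""
        return np.arctan((x - x0) / gamma) / np.pi + 0.5

    def cauchy_quantile(p, x0=0.0, gamma=1.0):
        """Q(p; x0, gamma) = x0 + gamma * tan(pi * (p - 1/2))."""
        return x0 + gamma * np.tan(np.pi * (p - 0.5))

    x0, gamma = 1.0, 2.0
    print(cauchy_quantile(0.25, x0, gamma))  # first quartile: x0 - gamma = -1.0
    print(cauchy_quantile(0.75, x0, gamma))  # third quartile: x0 + gamma = 3.0
    p = np.linspace(0.01, 0.99, 5)
    print(np.allclose(cauchy_cdf(cauchy_quantile(p, x0, gamma), x0, gamma), p))  # True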

Other constructions

The standard Cauchy distribution is the Student's t-distribution with one degree of freedom, and so it may be constructed by any method that constructs the Student's t-distribution.

If \Sigma is a p \times p positive-semidefinite covariance matrix with strictly positive diagonal entries, then for independent and identically distributed X, Y \sim N(0, \Sigma) and any random p-vector w independent of X and Y such that w_1 + \cdots + w_p = 1 and w_i \geq 0, i = 1, \ldots, p (defining a categorical distribution), it holds that

\sum_{j=1}^{p} w_j \frac{X_j}{Y_j} \sim \operatorname{Cauchy}(0, 1).

[6]

Properties

The Cauchy distribution is an example of a distribution which has no mean, variance or higher moments defined. Its mode and median are well defined and are both equal to x_0.

The Cauchy distribution is an infinitely divisible probability distribution. It is also a strictly stable distribution.[7]

Like all stable distributions, the location-scale family to which the Cauchy distribution belongs is closed under linear transformations with real coefficients. In addition, the family of Cauchy-distributed random variables is closed under linear fractional transformations with real coefficients.[8] In this connection, see also McCullagh's parametrization of the Cauchy distributions.

Sum of Cauchy-distributed random variables

If X_1, X_2, \ldots, X_n are an IID sample from the standard Cauchy distribution, then their sample mean \bar{X} = \frac{1}{n}\sum_i X_i is also standard Cauchy distributed. In particular, the average does not converge to the mean, and so the standard Cauchy distribution does not obey the law of large numbers.

This can be proved by repeated integration with the PDF, or more conveniently, by using the characteristic function of the standard Cauchy distribution (see below):

\varphi_X(t) = \operatorname{E}\left[e^{iXt}\right] = e^{-|t|}.

With this, we have \varphi_{\sum_i X_i}(t) = e^{-n|t|}, and so \varphi_{\bar{X}}(t) = \varphi_{\sum_i X_i}(t/n) = e^{-|t|}; that is, \bar{X} has a standard Cauchy distribution.
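This stability is easy to observe in simulation; a sketch (sizes and seed are arbitrary, SciPy assumed) shows that the means of many independent standard Cauchy samples are themselves indistinguishable from standard Cauchy draws:

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(seed=2)
    samples = rng.standard_cauchy(size=(10_000, 100))  # 10,000 samples of size 100
    means = samples.mean(axis=1)

    # The sample means should again be standard Cauchy distributed.
    print(stats.kstest(means, stats.cauchy.cdf))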

More generally, if X_1, X_2, \ldots, X_n are independent and Cauchy distributed with location parameters x_1, \ldots, x_n and scales \gamma_1, \ldots, \gamma_n, and a_1, \ldots, a_n are real numbers, then \sum_i a_i X_i is Cauchy distributed with location \sum_i a_i x_i and scale \sum_i |a_i| \gamma_i. We see that there is no law of large numbers for any weighted sum of independent Cauchy distributions.

This shows that the condition of finite variance in the central limit theorem cannot be dropped. It is also an example of a more generalized version of the central limit theorem that is characteristic of all stable distributions, of which the Cauchy distribution is a special case.

Central limit theorem

If X_1, X_2, \ldots are an IID sample with PDF \rho such that

\lim_{c \to \infty} \frac{1}{c} \int_{-c}^{c} x^2 \rho(x)\,dx = \frac{2\gamma}{\pi}

is finite but nonzero, then \frac{1}{n}\sum_{i=1}^{n} X_i converges in distribution to a Cauchy distribution with scale \gamma.[9]

Characteristic function

Let X denote a Cauchy distributed random variable. The characteristic function of the Cauchy distribution is given by

\varphi_X(t) = \operatorname{E}\left[e^{iXt}\right] = \int_{-\infty}^{\infty} f(x; x_0, \gamma)\, e^{ixt}\, dx = e^{i x_0 t - \gamma|t|},

which is just the Fourier transform of the probability density. The original probability density may be expressed in terms of the characteristic function, essentially by using the inverse Fourier transform:

f(x; x_0, \gamma) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \varphi_X(t; x_0, \gamma)\, e^{-ixt}\, dt

The nth moment of a distribution is the nth derivative of the characteristic function evaluated at t = 0. Observe that the characteristic function is not differentiable at the origin: this corresponds to the fact that the Cauchy distribution does not have well-defined moments higher than the zeroth moment.
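Because e^{iXt} is bounded, its sample average converges even though X itself has no moments; a Monte Carlo sketch (parameters and seed are arbitrary) of the closed form above:

    import numpy as np

    rng = np.random.default_rng(seed=3)
    x0, gamma, t = 1.5, 0.7, 2.0

    # Empirical characteristic function of X ~ Cauchy(x0, gamma).
    x = x0 + gamma * rng.standard_cauchy(1_000_000)
    print(np.mean(np.exp(1j * t * x)))           # Monte Carlo estimate
    print(np.exp(1j * x0 * t - gamma * abs(t)))  # closed form, approximately equal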

Kullback–Leibler divergence

The Kullback–Leibler divergence between two Cauchy distributions has the following symmetric closed-form formula:[10]

\operatorname{KL}\left(p_{x_{0,1},\gamma_1} : p_{x_{0,2},\gamma_2}\right) = \log \frac{\left(\gamma_1 + \gamma_2\right)^2 + \left(x_{0,1} - x_{0,2}\right)^2}{4\gamma_1\gamma_2}.

Any f-divergence between two Cauchy distributions is symmetric and can be expressed as a function of the chi-squared divergence.[11] Closed-form expressions for the total variation, Jensen–Shannon divergence, Hellinger distance, etc. are available.
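A direct transcription of the closed form (the function name is ours), cross-checked by a Monte Carlo estimate of E_p[\log p(X) - \log q(X)]:

    import numpy as np
    from scipy import stats

    def kl_cauchy(x01, g1, x02, g2):
        """Closed-form KL divergence between Cauchy(x01, g1) and Cauchy(x02, g2)."""
        return np.log(((g1 + g2) ** 2 + (x01 - x02) ** 2) / (4 * g1 * g2))

    x01, g1, x02, g2 = 0.0, 1.0, 2.0, 3.0
    print(kl_cauchy(x01, g1, x02, g2))
    print(kl_cauchy(x02, g2, x01, g1))  # identical: the divergence is symmetric

    # Monte Carlo cross-check with X ~ p.
    rng = np.random.default_rng(seed=4)
    x = stats.cauchy.rvs(loc=x01, scale=g1, size=1_000_000, random_state=rng)
    print(np.mean(stats.cauchy.logpdf(x, x01, g1) - stats.cauchy.logpdf(x, x02, g2)))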

Entropy

The entropy of the Cauchy distribution is given by:

H(\gamma) = -\int_{-\infty}^{\infty} f(x; x_0, \gamma) \log(f(x; x_0, \gamma))\,dx = \log(4\pi\gamma)

The derivative of the quantile function, the quantile density function, for the Cauchy distribution is:

Q'(p; \gamma) = \gamma\pi\,\sec^2\left[\pi\left(p - \tfrac{1}{2}\right)\right].

The differential entropy of a distribution can be defined in terms of its quantile density,[12] specifically:

H(\gamma) = \int_0^1 \log(Q'(p; \gamma))\,dp = \log(4\pi\gamma)
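Both integral expressions are easy to verify numerically; a sketch with SciPy quadrature and an arbitrary \gamma:

    import numpy as np
    from scipy import integrate

    gamma = 2.5

    # Entropy via the density: minus the integral of f log f over the real line.
    f = lambda x: gamma / (np.pi * (x ** 2 + gamma ** 2))
    h_density, _ = integrate.quad(lambda x: -f(x) * np.log(f(x)), -np.inf, np.inf)

    # Entropy via the quantile density: integral of log Q'(p) over [0, 1],
    # with Q'(p) = gamma * pi * sec^2(pi * (p - 1/2)).
    qd = lambda p: gamma * np.pi / np.cos(np.pi * (p - 0.5)) ** 2
    h_quantile, _ = integrate.quad(lambda p: np.log(qd(p)), 0, 1)

    print(h_density, h_quantile, np.log(4 * np.pi * gamma))  # all approximately equal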

The Cauchy distribution is the maximum entropy probability distribution for a random variate X for which[13]

\operatorname{E}\left[\log\left(1 + (X - x_0)^2 / \gamma^2\right)\right] = \log 4

Moments

The Cauchy distribution is usually used as an illustrative counterexample in elementary probability courses, as a distribution with no well-defined (or "indefinite") moments.

Sample moments

If we take an IID sample X_1, X_2, \ldots from the standard Cauchy distribution, then the sequence of their sample means is S_n = \frac{1}{n}\sum_{i=1}^{n} X_i, which also has the standard Cauchy distribution. Consequently, no matter how many terms we take, the sample average does not converge.

Similarly, the sample variance V_n = \frac{1}{n}\sum_{i=1}^{n} (X_i - S_n)^2 also does not converge. A typical trajectory of S_1, S_2, \ldots looks like long periods of slow convergence to zero, punctuated by large jumps away from zero, but never getting too far away. A typical trajectory of V_1, V_2, \ldots looks similar, but the jumps accumulate faster than the decay, diverging to infinity.

Sample moments of order lower than 1 converge to zero, while sample moments of order higher than 2 diverge to infinity even faster than the sample variance.
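These trajectories can be reproduced with a short simulation (a sketch; sample size and seed are arbitrary):

    import numpy as np

    rng = np.random.default_rng(seed=5)
    x = rng.standard_cauchy(100_000)
    n = np.arange(1, x.size + 1)

    s = np.cumsum(x) / n                # running sample means S_n
    v = np.cumsum(x ** 2) / n - s ** 2  # running sample variances V_n

    # S_n keeps jumping and never settles; V_n grows without bound.
    for k in (10, 1_000, 100_000):
        print(k, s[k - 1], v[k - 1])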

Mean

If a probability distribution has a density function f(x), then the mean, if it exists, is given by

\int_{-\infty}^{\infty} x f(x)\,dx.

We may evaluate this two-sided improper integral by computing the sum of two one-sided improper integrals. That is,

\int_{-\infty}^{a} x f(x)\,dx + \int_{a}^{\infty} x f(x)\,dx

for an arbitrary real number a.

For the integral to exist (even as an infinite value), at least one of the terms in this sum should be finite, or both should be infinite and have the same sign. But in the case of the Cauchy distribution, both of the terms in this sum are infinite and have opposite sign. Hence the integral, and thus the mean, is undefined.[14] When the mean of a probability distribution is undefined, no one can compute a reliable average over the experimental data points, regardless of the sample's size.

Note that the Cauchy principal value of the mean of the Cauchy distribution,

\lim_{a \to \infty} \int_{-a}^{a} x f(x)\,dx,

is zero. On the other hand, the related integral

\lim_{a \to \infty} \int_{-2a}^{a} x f(x)\,dx

is not zero, as can be seen by computing the integral. This again shows that the mean cannot exist.

Various results in probability theory about expected values, such as the strong law of large numbers, fail to hold for the Cauchy distribution.[14]

Smaller moments

The absolute moments for p \in (-1, 1) are defined. For X \sim \operatorname{Cauchy}(0, \gamma) we have

\operatorname{E}[|X|^p] = \gamma^p \sec(\pi p / 2).
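This identity can be checked by numerical integration (a sketch; the values of p and \gamma are arbitrary):

    import numpy as np
    from scipy import integrate

    p, gamma = 0.5, 2.0

    # E[|X|^p] for X ~ Cauchy(0, gamma).
    pdf = lambda x: gamma / (np.pi * (x ** 2 + gamma ** 2))
    moment, _ = integrate.quad(lambda x: abs(x) ** p * pdf(x), -np.inf, np.inf)

    print(moment)                              # numerical value
    print(gamma ** p / np.cos(np.pi * p / 2))  # gamma^p * sec(pi*p/2)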

Higher moments

The Cauchy distribution does not have finite moments of any integer order. Some of the higher raw moments do exist and have a value of infinity, for example, the raw second moment:

\begin{align}
\operatorname{E}[X^2] &\propto \int_{-\infty}^{\infty} \frac{x^2}{1 + x^2}\,dx = \int_{-\infty}^{\infty} \left(1 - \frac{1}{1 + x^2}\right) dx \\[8pt]
&= \int_{-\infty}^{\infty} dx - \int_{-\infty}^{\infty} \frac{1}{1 + x^2}\,dx = \int_{-\infty}^{\infty} dx - \pi = \infty.
\end{align}

By re-arranging the formula, one can see that the second moment is essentially the infinite integral of a constant (here 1). Higher even-powered raw moments will also evaluate to infinity. Odd-powered raw moments, however, are undefined, which is distinctly different from existing with the value of infinity. The odd-powered raw moments are undefined because their values are essentially equivalent to \infty - \infty, since the two halves of the integral both diverge and have opposite signs. The first raw moment is the mean, which, being odd, does not exist. (See also the discussion above about this.) This in turn means that all of the central moments and standardized moments are undefined, since they are all based on the mean. The variance (the second central moment) is likewise non-existent, despite the fact that the raw second moment exists with the value infinity.

The results for higher moments follow from Hölder's inequality, which implies that higher moments (or halves of moments) diverge if lower ones do.

Moments of truncated distributions

Consider the truncated distribution defined by restricting the standard Cauchy distribution to the interval . Such a truncated distribution has all moments (and the central limit theorem applies for i.i.d. observations from it); yet for almost all practical purposes it behaves like a Cauchy distribution.[15]

Estimation of parameters

Because the parameters of the Cauchy distribution do not correspond to a mean and variance, attempting to estimate the parameters of the Cauchy distribution by using a sample mean and a sample variance will not succeed.[16] For example, if an i.i.d. sample of size n is taken from a Cauchy distribution, one may calculate the sample mean as:

\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i

Although the sample values x_i will be concentrated about the central value x_0, the sample mean will become increasingly variable as more observations are taken, because of the increased probability of encountering sample points with a large absolute value. In fact, the distribution of the sample mean will be equal to the distribution of the observations themselves; i.e., the sample mean of a large sample is no better (or worse) an estimator of x_0 than any single observation from the sample. Similarly, calculating the sample variance will result in values that grow larger as more observations are taken.

Therefore, more robust means of estimating the central value x_0 and the scaling parameter \gamma are needed. One simple method is to take the median value of the sample as an estimator of x_0 and half the sample interquartile range as an estimator of \gamma. Other, more precise and robust methods have been developed.[17] [18] For example, the truncated mean of the middle 24% of the sample order statistics produces an estimate for x_0 that is more efficient than using either the sample median or the full sample mean.[19] [20] However, because of the fat tails of the Cauchy distribution, the efficiency of the estimator decreases if more than 24% of the sample is used.[19] [20]

Maximum likelihood can also be used to estimate the parameters x_0 and \gamma. However, this tends to be complicated by the fact that it requires finding the roots of a high degree polynomial, and there can be multiple roots that represent local maxima.[21] Also, while the maximum likelihood estimator is asymptotically efficient, it is relatively inefficient for small samples.[22] [23] The log-likelihood function for the Cauchy distribution for sample size n is:

\hat\ell(x_1, \ldots, x_n \mid x_0, \gamma) = -n \log(\gamma\pi) - \sum_{i=1}^{n} \log\left(1 + \left(\frac{x_i - x_0}{\gamma}\right)^2\right)

Maximizing the log-likelihood function with respect to x_0 and \gamma by taking the first derivative produces the following system of equations:

\frac{d\ell}{dx_0} = \sum_{i=1}^{n} \frac{2(x_i - x_0)}{\gamma^2 + (x_i - x_0)^2} = 0

\frac{d\ell}{d\gamma} = \sum_{i=1}^{n} \frac{2(x_i - x_0)^2}{\gamma\left(\gamma^2 + (x_i - x_0)^2\right)} - \frac{n}{\gamma} = 0

Note that

\sum_{i=1}^{n} \frac{(x_i - x_0)^2}{\gamma^2 + (x_i - x_0)^2}

is a monotone function in \gamma and that the solution \gamma must satisfy

\min |x_i - x_0| \le \gamma \le \max |x_i - x_0|.

Solving just for x_0 requires solving a polynomial of degree 2n - 1,[21] and solving just for \gamma requires solving a polynomial of degree 2n. Therefore, whether solving for one parameter or for both parameters simultaneously, a numerical solution on a computer is typically required. The benefit of maximum likelihood estimation is asymptotic efficiency; estimating x_0 using the sample median is only about 81% as asymptotically efficient as estimating x_0 by maximum likelihood.[20] [24] The truncated sample mean using the middle 24% order statistics is about 88% as asymptotically efficient an estimator of x_0 as the maximum likelihood estimate.[20] When Newton's method is used to find the solution for the maximum likelihood estimate, the middle 24% order statistics can be used as an initial solution for x_0, as in the sketch below.
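A sketch of this procedure in Python (helper names are ours, and SciPy's general-purpose optimizer stands in for Newton's method), initializing the location at the 24% truncated mean and the scale at half the sample interquartile range:

    import numpy as np
    from scipy import optimize, stats

    def neg_log_likelihood(params, x):
        x0, log_gamma = params  # optimize log(gamma) so that gamma stays positive
        return -np.sum(stats.cauchy.logpdf(x, loc=x0, scale=np.exp(log_gamma)))

    rng = np.random.default_rng(seed=6)
    x = stats.cauchy.rvs(loc=3.0, scale=2.0, size=5_000, random_state=rng)

    # Initial values: truncated mean of the middle 24% order statistics for x0,
    # half the sample interquartile range for gamma.
    xs = np.sort(x)
    lo, hi = int(0.38 * x.size), int(0.62 * x.size)
    x0_init = xs[lo:hi].mean()
    gamma_init = 0.5 * (np.quantile(x, 0.75) - np.quantile(x, 0.25))

    res = optimize.minimize(neg_log_likelihood, [x0_init, np.log(gamma_init)], args=(x,))
    print(res.x[0], np.exp(res.x[1]))  # close to the true values (3.0, 2.0)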

The scale can also be estimated using the median of absolute values, since for location 0 Cauchy variables X \sim \operatorname{Cauchy}(0, \gamma), we have \operatorname{median}(|X|) = \gamma, the scale parameter.

Multivariate Cauchy distribution

A random vector X = (X_1, \ldots, X_k)^T is said to have the multivariate Cauchy distribution if every linear combination of its components Y = a_1 X_1 + \cdots + a_k X_k has a Cauchy distribution. That is, for any constant vector a \in \mathbb{R}^k, the random variable Y = a^T X should have a univariate Cauchy distribution.[25] The characteristic function of a multivariate Cauchy distribution is given by:

\varphi_X(t) = e^{i x_0(t) - \gamma(t)},

where x_0(t) and \gamma(t) are real functions with x_0(t) a homogeneous function of degree one and \gamma(t) a positive homogeneous function of degree one.[25] More formally:[25]

x_0(at) = a x_0(t),
\gamma(at) = |a| \gamma(t),

for all t.

An example of a bivariate Cauchy distribution can be given by:[26]

f(x, y; x_0, y_0, \gamma) = \frac{1}{2\pi}\left[\frac{\gamma}{\left((x - x_0)^2 + (y - y_0)^2 + \gamma^2\right)^{3/2}}\right].

Note that in this example, even though the covariance between x and y is 0, x and y are not statistically independent.[26]
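As a numerical sanity check of this density (a sketch; the parameter values are arbitrary), it integrates to 1 over the plane:

    import numpy as np
    from scipy import integrate

    x0, y0, gamma = 0.0, 0.0, 1.0

    # Bivariate Cauchy density; scipy's dblquad passes arguments as f(y, x).
    def f(y, x):
        return gamma / (2 * np.pi * ((x - x0) ** 2 + (y - y0) ** 2 + gamma ** 2) ** 1.5)

    total, _ = integrate.dblquad(f, -np.inf, np.inf, -np.inf, np.inf)
    print(total)  # approximately 1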

This formula can also be written for a complex variable. The probability density function of the complex Cauchy distribution is then:

f(z; z_0, \gamma) = \frac{1}{2\pi}\left[\frac{\gamma}{\left(|z - z_0|^2 + \gamma^2\right)^{3/2}}\right].

Just as the standard Cauchy distribution is the Student's t-distribution with one degree of freedom, the multidimensional Cauchy density is the multivariate Student distribution with one degree of freedom. The density of a k-dimensional Student distribution with one degree of freedom is:

f(x; \mu, \Sigma, k) = \frac{\Gamma\left(\frac{1+k}{2}\right)}{\Gamma\left(\frac{1}{2}\right)\, \pi^{k/2}\, \left|\Sigma\right|^{1/2}\, \left[1 + (x - \mu)^T \Sigma^{-1} (x - \mu)\right]^{\frac{1+k}{2}}}.

The properties of multidimensional Cauchy distribution are then special cases of the multivariate Student distribution.

Transformation properties

If X \sim \operatorname{Cauchy}(x_0, \gamma), then kX + \ell \sim \operatorname{Cauchy}(x_0 k + \ell, \gamma|k|).

If X \sim \operatorname{Cauchy}(x_0, \gamma_0) and Y \sim \operatorname{Cauchy}(x_1, \gamma_1) are independent, then X + Y \sim \operatorname{Cauchy}(x_0 + x_1, \gamma_0 + \gamma_1) and X - Y \sim \operatorname{Cauchy}(x_0 - x_1, \gamma_0 + \gamma_1).

If X \sim \operatorname{Cauchy}(0, \gamma), then \tfrac{1}{X} \sim \operatorname{Cauchy}(0, \tfrac{1}{\gamma}).[27]

Expressing a Cauchy distribution in terms of one complex parameter \psi = x_0 + i\gamma, define X \sim \operatorname{Cauchy}(\psi) to mean X \sim \operatorname{Cauchy}(x_0, |\gamma|). If X \sim \operatorname{Cauchy}(\psi) then:

\frac{aX + b}{cX + d} \sim \operatorname{Cauchy}\left(\frac{a\psi + b}{c\psi + d}\right),

where a, b, c and d are real numbers.

Using the same convention, if X \sim \operatorname{Cauchy}(\psi) then:[27]

\frac{X - i}{X + i} \sim \operatorname{CCauchy}\left(\frac{\psi - i}{\psi + i}\right),

where \operatorname{CCauchy} is the circular Cauchy distribution.

Lévy measure

The Cauchy distribution is the stable distribution of index 1. The Lévy–Khintchine representation of such a stable distribution of parameter \gamma is given, for X \sim \operatorname{Stable}(\gamma, 0, 0), by:

\operatorname{E}\left(e^{ixX}\right) = \exp\left(\int \left(e^{ixy} - 1\right) \Pi_\gamma(dy)\right)

where

\Pi_\gamma(dy) = \left(c_{1,\gamma} \frac{1}{y^{1+\gamma}} 1_{\{y > 0\}} + c_{2,\gamma} \frac{1}{|y|^{1+\gamma}} 1_{\{y < 0\}}\right) dy

and c_{1,\gamma}, c_{2,\gamma} can be expressed explicitly.[28] In the case \gamma = 1 of the Cauchy distribution, one has c_{1,\gamma} = c_{2,\gamma}.

This last representation is a consequence of the formula

\pi|x| = \operatorname{PV} \int_{\mathbb{R}} \left(1 - e^{ixy}\right) \frac{dy}{y^2}

Related distributions

\operatorname{Cauchy}(0, 1) \sim \operatorname{t}(df = 1): the standard Cauchy distribution is the Student's t distribution with one degree of freedom.

\operatorname{Cauchy}(\mu, \sigma) \sim \operatorname{t}_{(df=1)}(\mu, \sigma): the Cauchy distribution is a non-standardized Student's t distribution with one degree of freedom.

If X, Y \sim \operatorname{N}(0, 1) are independent, then \tfrac{X}{Y} \sim \operatorname{Cauchy}(0, 1).

If X \sim \operatorname{U}(0, 1), then \tan\left(\pi\left(X - \tfrac{1}{2}\right)\right) \sim \operatorname{Cauchy}(0, 1).

If X \sim \operatorname{Log-Cauchy}(0, 1), then \ln(X) \sim \operatorname{Cauchy}(0, 1).

If X \sim \operatorname{Cauchy}(x_0, \gamma), then \tfrac{1}{X} \sim \operatorname{Cauchy}\left(\tfrac{x_0}{x_0^2 + \gamma^2}, \tfrac{\gamma}{x_0^2 + \gamma^2}\right).

If X \sim \operatorname{Stable}(1, 0, \gamma, \mu), then X \sim \operatorname{Cauchy}(\mu, \gamma).

If X \sim \operatorname{N}(0, 1) and Z \sim \operatorname{Inverse-Gamma}(1/2, s^2/2) are independent, then Y = \mu + X\sqrt{Z} \sim \operatorname{Cauchy}(\mu, s). For half-Cauchy distributions, the relation holds by setting X \sim \operatorname{N}(0, 1)\, I\{X \ge 0\}.

Relativistic Breit–Wigner distribution

See main article: Relativistic Breit–Wigner distribution. In nuclear and particle physics, the energy profile of a resonance is described by the relativistic Breit–Wigner distribution, while the Cauchy distribution is the (non-relativistic) Breit–Wigner distribution.

Occurrence and applications

The analysis of the maximum likelihood estimator \hat{\beta} for the equation x_{t+1} = \beta x_t + \varepsilon_{t+1}, \beta > 1, where the maximum likelihood estimator is found using ordinary least squares, showed that the sampling distribution of the statistic is the Cauchy distribution.[30]

Notes and References

  1. Book: N. L. Johnson . S. Kotz . N. Balakrishnan . Continuous Univariate Distributions, Volume 1. Wiley. New York. 1994., Chapter 16.
  2. Cauchy and the Witch of Agnesi in Statistics on the Table, S M Stigler Harvard 1999 Chapter 18
  3. Book: Feller, William. An Introduction to Probability Theory and Its Applications, Volume II. 2. John Wiley & Sons Inc.. New York. 1971. 704. 978-0-471-25709-7. registration.
  4. Book: Riley. Ken F.. Hobson. Michael P.. Bence. Stephen J.. Mathematical Methods for Physics and Engineering. limited. 3. Cambridge University Press. Cambridge, UK. 2006. 1333. 978-0-511-16842-0.
  5. Book: Balakrishnan. N.. Nevrozov. V. B.. A Primer on Statistical Distributions. 1. John Wiley & Sons Inc.. Hoboken, New Jersey. 2003. 305. 0-471-42798-5.
  6. Pillai N. . Meng, X.L. . 2016 . An unexpected encounter with Cauchy and Lévy . . 44 . 5 . 2089–2097 . 1505.01957 . 10.1214/15-AOS1407 . 31582370.
  7. Book: Campbell B. Read . N. Balakrishnan . Brani Vidakovic . Samuel Kotz . 2006 . Encyclopedia of Statistical Sciences . 778 . 2nd . . 978-0-471-15044-2. Encyclopedia of Statistical Sciences .
  8. Franck B. . Knight. A characterization of the Cauchy type. Proceedings of the American Mathematical Society. 55. 1. 1976. 130–135. 10.2307/2041858. 2041858. free.
  9. Web site: Updates to the Cauchy Central Limit . Quantum Calculus . 13 November 2022 . 21 June 2023.
  10. Frederic . Chyzak . Nielsen . Frank . 2019 . A closed-form formula for the Kullback-Leibler divergence between Cauchy distributions . cs.IT . 1905.10965.
  11. Nielsen . Frank . Okamura . Kazuki . 2023 . On f-Divergences Between Cauchy Distributions . IEEE Transactions on Information Theory . 69 . 5 . 3150–3171 . 10.1109/TIT.2022.3231645 . 2101.12459. 231728407 .
  12. Vasicek . Oldrich . 1976 . A Test for Normality Based on Sample Entropy . Journal of the Royal Statistical Society, Series B . 38 . 1 . 54–59. 10.1111/j.2517-6161.1976.tb01566.x .
  13. Park . Sung Y. . Bera . Anil K. . 2009 . Maximum entropy autoregressive conditional heteroskedasticity model . dead . Journal of Econometrics . Elsevier . 150 . 2 . 219–230 . 10.1016/j.jeconom.2008.12.014 . https://web.archive.org/web/20110930062639/http://www.econ.yorku.ca/cesg/papers/berapark.pdf . 2011-09-30 . 2011-06-02.
  14. Web site: Cauchy Distribution . Kyle Siegrist . Random . 5 July 2021 . 9 July 2021 . https://web.archive.org/web/20210709183100/http://www.randomservices.org/random/special/Cauchy.html . live .
  15. .
  16. Web site: Illustration of instability of sample means. 2014-11-22. 2017-03-24. https://web.archive.org/web/20170324193842/http://www.statistics4u.info/fundstat_eng/ee_distri_cauchy.html. live.
  17. Cane . Gwenda J. . 1974 . Linear Estimation of Parameters of the Cauchy Distribution Based on Sample Quantiles . Journal of the American Statistical Association . 69 . 345 . 243–245 . 2285535 . 10.1080/01621459.1974.10480163.
  18. Zhang . Jin . 2010 . A Highly Efficient L-estimator for the Location Parameter of the Cauchy Distribution . Computational Statistics . 25 . 1 . 97–105 . 10.1007/s00180-009-0163-y. 123586208 .
  19. Rothenberg . Thomas J. . Fisher. Franklin, M.. Tilanus. C.B.. 1964. 59. 306. Journal of the American Statistical Association. A note on estimation from a Cauchy sample. 460–463. 10.1080/01621459.1964.10482170.
  20. Bloch. Daniel. 1966. 61 . 316 . Journal of the American Statistical Association. A note on the estimation of the location parameters of the Cauchy distribution. 852 - 855. 2282794. 10.1080/01621459.1966.10480912.
  21. Ferguson. Thomas S.. Thomas S. Ferguson . 1978 . Journal of the American Statistical Association . 73. 361. 211–213 . Maximum Likelihood Estimates of the Parameters of the Cauchy Distribution for Samples of Size 3 and 4. 2286549 . 10.1080/01621459.1978.10480031.
  22. The Pitman estimator of the Cauchy location parameter. Cohen Freue. Gabriella V.. Journal of Statistical Planning and Inference. 137. 6. 2007. 1901. 10.1016/j.jspi.2006.05.002. dead. https://web.archive.org/web/20110816002255/http://faculty.ksu.edu.sa/69424/USEPAP/Coushy%20dist.pdf. 2011-08-16.
  23. Book: Introduction to Robust Estimation & Hypothesis Testing . Wilcox . Rand . 2012 . Elsevier.
  24. Barnett. V. D.. 1966. Journal of the American Statistical Association . 61. 316. 1205–1218. Order Statistics Estimators of the Location of the Cauchy Distribution. 2283210. 10.1080/01621459.1966.10482205.
  25. Ferguson. Thomas S.. A Representation of the Symmetric Bivariate Cauchy Distribution. The Annals of Mathematical Statistics . 33. 4. 1256–1266. 1962 . 2237984. 10.1214/aoms/1177704357. 2017-01-07 . free.
  26. Non-linear Integral Equations to Approximate Bivariate Densities with Given Marginals and Dependence Function. Molenberghs. Geert. Lesaffre. Emmanuel. Statistica Sinica. 7. 1997. 713 - 738. dead. https://web.archive.org/web/20090914055538/http://www3.stat.sinica.edu.tw/statistica/oldpdf/A7n310.pdf. 2009-09-14.
  27. McCullagh, P.
  28. Book: Kyprianou, Andreas . 2009 . Lévy processes and continuous-state branching processes:part I . 11 . 2016-05-04 . 2016-03-03 . https://web.archive.org/web/20160303235654/http://www.maths.bath.ac.uk/~ak257/LCSB/part1.pdf . live .
  29. Book: E. Hecht . 1987 . Optics . 603 . 2nd . .
  30. White, J.S. . December 1958 . The Limiting Distribution of the Serial Correlation Coefficient in the Explosive Case . The Annals of Mathematical Statistics . 29 . 4 . 1188–1197 . 10.1214/aoms/1177706450 . free.
  31. Gull, S.F. (1988) Bayesian Inductive Inference and Maximum Entropy. Kluwer Academic Publishers, Berlin. https://doi.org/10.1007/978-94-009-3049-0_4
  32. Tong Liu (2012), An intermediate distribution between Gaussian and Cauchy distributions. https://arxiv.org/pdf/1208.5109.pdf