The Lexis ratio[1] is used in statistics as a measure which seeks to evaluate differences between the statistical properties of random mechanisms where the outcome is two-valued - for example "success" or "failure", "win" or "lose". The idea is that the probability of success might vary between different sets of trials in different situations. This ratio is not much used currently having been largely replaced by the use of the chi-squared test in testing for the homogeneity of samples.
This measure compares the between-set variance of the sample proportions (evaluated for each set) with what the variance should be if there were no difference between in the true proportions of success across the different sets. Thus the measure is used to evaluate how data compares to a fixed-probability-of-success Bernoulli distribution. The term "Lexis ratio" is sometimes referred to as L or Q, where
L2=Q2=
s2 | ||||||
|
.
Where
s2
2 | |
\sigma | |
0 |
This ratio (Q) is a measure that can be used to distinguish between three types of variation in sampling for attributes: Bernoullian, Lexian and Poissonian. The Lexis ratio is sometimes also referred to as L.
Let there be k samples of size n1, n3, n3, ..., nk and these samples have the proportion of the attribute being examined of p1, p2, p3, ..., pk respectively. Then the Lexis ratio is
Q=
\sum{ni(pi-p)2 | |
}{ |
(k-1)p(1-p)}
E(Q)=1
var(Q)=
2 | |
n-1 |
where E is the expectation and var is the variance. The formula for the variance is approximate and holds only for large values of n.
An alternative definition is
Q=
s2 | ||||||||
|
here
s2
2 | |
\sigma | |
0 |
A closely related concept is the Lexis variation. Let k samples each of size n be drawn at random. Let the probability of success (p) be constant and let the actual probability of success in the kth sample be p1, p2, ..., pk.
The average probability of success (p) is
p=
1 | |
k |
\sum{pi}
The variance in the number of successes is
var(successes)=np(1-p)+n(n-1)var(pi)
where var(pi) is the variance of the pi.
If all the pi are equal the sampling is said to be Bernoullian; where the pi differ the sampling is said to be Lexian and the dispersion is said to be supranormal.
Lexian sampling occurs in sampling from non homogenous strata.
Wilhelm Lexis introduced this statistic to test the then commonly held assumption that sampling data could be regarded as homogeneous.