Pareto interpolation is a method of estimating the median and other properties of a population that follows a Pareto distribution. It is used in economics when analysing the distribution of incomes in a population, when one must base estimates on a relatively small random sample taken from the population.
The family of Pareto distributions is parameterized by
Pareto interpolation can be used when the available information includes the proportion of the sample that falls below each of two specified numbers a < b. For example, it may be observed that 45% of individuals in the sample have incomes below a = $35,000 per year, and 55% have incomes below b = $40,000 per year.
Let
Pa = proportion of the sample that lies below a;
Pb = proportion of the sample that lies below b.
Then the estimates of κ and θ are
\widehat{\kappa}= \left(
Pb-Pa | |
\left(1/a\widehat{\theta |
\right)-\left(1/b\widehat{\theta
and
\widehat{\theta} =
log(1-Pa)-log(1-Pb) | |
log(b)-log(a) |
.
The estimate of the median would then be
estimatedmedian=\widehat{\kappa} ⋅ 21/\widehat{\theta
since the actual population median is
median=\kappa21/\theta.