Lee–Carter model explained

The Lee–Carter model is a numerical algorithm used in mortality forecasting and life expectancy forecasting.[1] The input to the model is a matrix of age specific mortality rates ordered monotonically by time, usually with ages in columns and years in rows. The output is a forecasted matrix of mortality rates in the same format as the input.

The model uses singular value decomposition (SVD) to find:

kt

that captures 80–90% of the mortality trend (here the subscript

t

refers to time),

bx

that describes the relative mortality at each age (here the subscript

x

refers to age), and

s1

but unnamed in the literature).

Surprisingly,

kt

is usually linear, implying that gains to life expectancy are fairly constant year after year in most populations. Prior to computing SVD, age specific mortality rates are first transformed into

Ax,t

, by taking their logarithms, and then centering them by subtracting their age-specific means over time. The age-specific mean over time is denoted by

ax

. The subscript

x,t

refers to the fact that

Ax,t

spans both age and time.

Many researchers adjust the

kt

vector by fitting it to empirical life expectancies for each year, using the

ax

and

bx

generated with SVD. When adjusted using this approach, changes to

kt

are usually small.

To forecast mortality,

kt

(either adjusted or not) is projected into

n

future years using an ARIMA model. The corresponding forecasted

Ax,t+n

is recovered by multiplying

kt+n

by

bx

and the first diagonal element of S (when

US

V*

=svd(Ax,t)

). The actual mortality rates are recovered by taking exponentials of this vector.

Because of the linearity of

kt

, it is generally modeled as a random walk with trend. Life expectancy and other life table measures can be calculated from this forecasted matrix after adding back the means and taking exponentials to yield regular mortality rates.

In most implementations, confidence intervals for the forecasts are generated by simulating multiple mortality forecasts using Monte Carlo Methods. A band of mortality between 5% and 95% percentiles of the simulated results is considered to be a valid forecast. These simulations are done by extending

kt

into the future using randomization based on the standard error of

kt

derived from the input data.

Algorithm

The algorithm seeks to find the least squares solution to the equation:

ln{(mx,t)}=ax+ktbx+\epsilonx,t

where

mx,t

is a matrix of mortality rate for each age

x

in each year

t

.
  1. Compute

ax

which is the average over time of

ln{(mx,t)}

for each age:

ax=

T
\sum{ln{(mx,t)
t=1
}}
  1. Compute

Ax,t

which will be used in SVD:

Ax,t=ln{(mx,t)}-ax

  1. Compute the singular value decomposition of

Ax,t

:

US

V*

=svd(Ax,t)

  1. Derive

bx

,

s1

(the scaling eigenvalue), and

kt

from

U

,

S

, and
V*
:

bx=(u1,1,u2,1,...,ux,1)

kt=(v1,1,v1,2,...,v1,t)

  1. Forecast

kt

using a standard univariate ARIMA model to

n

additional years:

kt+n=ARIMA(kt,n)

  1. Use the forecasted

kt+n

, with the original

bx

, and

ax

to calculate the forecasted mortality rate for each age:

mx,t+n=\exp(ax+s1kt+nbx)

Discussion

Without applying SVD or some other method of dimension reduction the table of mortality data is a highly correlated multivariate data series, and the complexity of these multidimensional time series makes them difficult to forecast. SVD has become widely used as a method of dimension reduction in many different fields, including by Google in their page rank algorithm.

The Lee–Carter model was introduced by Ronald D. Lee and Lawrence Carter in 1992 with the article "Modeling and Forecasting U.S. Mortality".[2] The model grew out of their work in the late 1980s and early 1990s attempting to use inverse projection to infer rates in historical demography.[3] The model has been used by the United States Social Security Administration, the US Census Bureau, and the United Nations. It has become the most widely used mortality forecasting technique in the world today.[4]

There have been extensions to the Lee–Carter model, most notably to account for missing years, correlated male and female populations, and large scale coherency in populations that share a mortality regime (western Europe, for example). Many related papers can be found on Professor Ronald Lee's website.

Implementations

There are surprisingly few software packages for forecasting with the Lee–Carter model.

Notes and References

  1. Web site: The Lee-Carter Method for Forecasting Mortality, with Various Extensions and Applications | SOA . September 28, 2010 . March 7, 2019 . https://web.archive.org/web/20190307054148/https://www.soa.org/library/journals/north-american-actuarial-journal/2000/january/naaj0001_5.pdf . dead .
  2. Lee . Ronald D . Carter . Lawrence R . Modeling and Forecasting U.S. Mortality . Journal of the American Statistical Association . September 1992 . 87 . 419 . 659–671 . 10.2307/2290201.
  3. Web site: Reflections on Inverse Projection: Its Origins, Development, Extensions, and Relation to Forecasting. June 5, 2003. Lee. Ronald.
  4. Web site: Understanding the Lee-Carter Mortality Forecasting Method. 12 April 2023. Harvard University. Federico Girosi. Gary King.