# Working with constraints

Regular readers of this blog will be aware of the importance of stochastic mortality models in insurance work. Of these models, the best-known is that from Lee & Carter (1992):

$\log \mu_{x,y} = \alpha_x + \beta_x\kappa_y\qquad(1)$

where $$\mu_{x,y}$$ is the force of mortality at age $$x$$ in year $$y$$ and $$\alpha_x$$, $$\beta_x$$ and $$\kappa_y$$ are parameters to be estimated. Lee & Carter used singular value decomposition (SVD) to estimate their parameters, but the modern approach is to use the method of maximum likelihood — by making an explicit distributional assumption for the number of deaths, the fitting process can make proper allowance for the amount of information available at each age. Once the parameters have been estimated, forecasting takes place by making an assumption over the process driving $$\kappa_y$$, e.g. it follows a random walk with drift or an ARIMA process.

Regardless of how the Lee-Carter model is fitted, however, the analyst must also set identifiability constraints. These are necessary because the model specified in Equation (1) has an infinite number of possible solutions. To see this, consider the following simple transformation:

$\beta'_x = c\beta_x \\ \kappa'_y = \kappa_y/c$

for some non-zero constant $$c$$ (we previously discussed a very specific instance of this when $$c=-1$$). As is clear from putting $$\beta'_x$$ and $$\kappa'_y$$ into Equation (1), this will result in an identical fitted value for $$\log \mu_{x,y}$$. To tie things down, we need a constraint on the parameter values to counter this infinite number of transformations.

As it happens, there is another such transformation possible involving $$\alpha_x$$, so in order to fit the Lee-Carter model we actually need two constraints on the parameters to make their values unique. For example, the constraint system used by Lee & Carter (1992) is as follows:

$\sum_y\kappa_y = 0\\ \sum_x\beta_x = 1$

where summation is over the data, i.e. the summation over $$y$$ is over the fitted values of $$\kappa_y$$ only and does not include the projected values; this permits the fitted parameters to be invariant to the forecast.

The constraint system from Lee & Carter (1992) is the most commonly used one, not least because it is linear. However, there are other constraint systems, such as the one used by Girosi & King (2008):

$\sum_y\kappa_y = 0\\ \sum_x\beta_x^2 = 1$

and also the constraint system used by Richards & Currie (2009):

$\sum_y\kappa_y = 0\\ \sum_y\kappa^2_y = 1$

These latter two constraint systems are non-linear, but offer some technical advantages in the fitting process. For example, the constraints used by Richards & Currie (2009) were chosen because they fitted the Lee-Carter model as paired Generalized Linear Models (GLMs): conditioning on the values of $$\hat\kappa_y$$, $$\hat\alpha_x$$ and $$\hat\beta_x$$ can be estimated from fitting a GLM, then a second GLM can be used to obtain updated estimates of $$\hat\kappa_y$$ by conditioning on the revised values of $$\hat\alpha$$ and $$\hat\beta_x$$. Repeating this pairwise procedure leads to rapid convergence of the parameter estimates, and with this algorithm it is convenient to apply both the constraints to $$\hat\kappa_y$$ in the second step.

The choice of constraint system is up to the analyst fitting the model — the fitted values of $$\log \mu_{x,y}$$ will be the same in all three cases, although the parameter estimates will be different. The constraints an analyst chooses will depend on personal preferences, and the above three systems are not the only options. However, it seems to make most sense to stick to those constraint systems that have been published in either textbooks or the peer-reviewed literature.

References:

Girosi, F. and King, G. (2008) Demographic Forecasting, Princeton University Press, ISBN 978-0-691-13095-8.

Lee, R. D. and Carter, L. (1992) Modeling and forecasting US mortality, Journal of the American Statistical Association, 87, 659–671.

Richards, S. J. and Currie, I. D. (2009) Longevity risk and annuity pricing with the Lee-Carter model, British Actuarial Journal, Volume 15(II) No. 65, 317–365 (with discussion).

Written by: Stephen Richards
Publication Date:
Last Updated:

## Constraints in the Projections Toolkit

Users can select which constraint system they wish to use in the Lee-Carter family of models.  For other models including cohort terms, users can either opt to apply minimal constraints or else have models over-constrained to control the pattern in the cohort terms.

## Previous posts

### The age pattern of mortality

Heligman and Pollard published a famous paper in 1980 with the title "The age pattern of mortality". In their paper they proposed an additive, three-component model of mortality:

$q_x/p_x = f_I(x) + f_S(x) + f_A(x)$

Tags: Filter information matrix by tag: Heligman-Pollard model, Filter information matrix by tag: accident hump, Filter information matrix by tag: shape penalty, Filter information matrix by tag: smoothness penalty

### Old drugs, new tricks

Breakthrough science in the longevity space doesn't always require the development of new medicines. In fact, there are significant advantages to repurposing medicines already in use, since some of the most expensive aspects of drug development lie in establishing human safety in the trial phase.
Tags: Filter information matrix by tag: longevity, Filter information matrix by tag: research, Filter information matrix by tag: regenerative medicine, Filter information matrix by tag: cognitive impairment, Filter information matrix by tag: diabetes