Article Text
Statistics from Altmetric.com
It has been pointed out (D Eayres, personal communication) that Chiang’s formula^{1} for estimation of the standard error of the expectation of life (which allows for the fact that mortality rates increase with age within each age band—clearly relevant with abridged life tables and advancing age) can be improved by combining it with the fact that the final age band is openended so that its width is a random variable and therefore contributes to the standard error of expectation of life.^{2} However, in simulations Eayres notes (personal communication) that there is still bias in the estimation of expectation of life, especially with small populations. The purpose of this communication is to suggest an improvement to reduce the effect of this bias. Errors in estimation of the population may also cause spurious variation in e^{o}_{x} between populations, but these are not considered here.
METHODS
In Silcocks et al^{2} the expectation of life (e^{o}_{x}) is expressed in terms of functions of the age specific mortality rates. The (theoretically) openended final age band contributes to the point estimate of e^{o}_{x} and its variance by assigning to the final age band a notional fixed width, which in turn is a function of that age band’s mortality rate.
Assuming exponential survival, the mean survival time in the final age band is 1/m_{n} years; the total persontime is l_{n−1}/m_{n} where l_{n−1} is the number alive at the start of the final age band. With exponential survival there is no fixed width to the final age band but the correct persontime is obtained as if given by the area of a triangle, of height l_{n−1} and base equivalent to a notional width of the final age band fixed at 2/m_{n} so that
where m_{n} = annual mortality rate in the final age band and r_{n} = number of deaths in one year in the final age band.
The variance of e^{o}_{x} is then estimated by applying the delta rule to the function relating e^{o}_{x} to the age specific mortality rates.
However, although the estimated mortality rate m_{n} is an unbiased estimate of the true mortality rate, z_{n}overestimates the width of the age band on average.
An approximation for the bias can be obtained mathematically as follows:
Putting z_{n} = g(m_{n}) and denoting the expected value of m_{n} by μ a Taylor series expansion gives:
Considering only the first three terms and taking expectations of both sides (noting that the expected value of (m_{n}−μ) is 0, because m_{n} is an unbiased estimate of the true mortality rate) then after rearranging, an approximate estimate of the bias^{3} is the expected value of ½(m_{n}−μ)^{2}d^{2}g/dm^{2}_{n} = ½var(m_{n})d^{2}g/dm^{2}_{n}
Using a binomial distribution for the number of deaths (regarding m_{n} as an annual probability of dying), the bias corrected estimate is z_{n}bias, which works out to be:
The fewer the number of deaths, r_{n}, the bigger the difference between z_{n} and z*_{n} Conversely if r_{n} is large then 1/r_{n} is small and the bias is small.
The variance of z_{n} under the binomial assumption is given by:
Alternatively the point estimate could instead be multiplied by a shrinkage factor to reduce the mean squared error,^{3} which will also reduce bias because the expectation of the mean squared error = bias^{2}+variance.
As a practical example, consider an English electoral ward with a total population of perhaps 6000 people with about 76 people in the 85+ age band, among whom there would be 13 or so deaths per year. The point estimate of the age band width would be 11.69 years. I compared this estimate and its variance (8.72) with the mean and variance of the estimated widths based on 5000 simulations (using MINITAB v10 to simulate a binomial distribution with n = 76 and 13 expected deaths—that is, with binomial parameter 0.171).
Key points

The variance of expectation of life must be allowed for in studies of small populations

With small populations, the small number of deaths in the oldest age band may bias the estimate of the final age band width, and hence the expectation of life

A simple bias correction is proposed to alleviate this
RESULTS
In this exercise we know that the “true” value for z_{n} is given by the point estimate z_{n} = 2/m_{n}. The mean of the simulated values shows the extent to which an uncorrected estimate of the notional width is an overestimate (from table 1 on average about 7.5% greater).
The simulations also show that the variance of the uncorrected estimate of z_{n} was 68% greater than predicted from the formula (4).
The mean of the bias corrected simulated values was closer to the true value (99.9%) than was the mean of the shrunken simulated values (99.2%). The point estimate of variance was smaller than the variances of the shrunken and bias corrected estimates but was closer to the latter.
The bias correction also improved the approximation to a normal distribution by reducing the skewness of the uncorrected estimate and performed better than the shrunk estimate—although because the width of the age band is a reciprocal transformation of the (approximately normal) mortality rate, zero skewness is an unrealistic target.
Policy implications

Apparently high expectation of life estimates in small populations may be the result of correctable bias, but errors in estimating the population at risk may also contribute.
DISCUSSION
The limited simulation exercise performed here suggests that a correction based on a Taylor series has the following advantages:

It removes some of the bias associated with the original estimate for the width of the final age band

The point estimate of variance is then closer to the variance of the simulated corrected estimates

The skewness is substantially reduced, improving the overall fit of the estimated expectation of life to a normal distribution.
The temptation to adjust the estimated variance of the age band width by pluggingin the corrected point estimate should be resisted as this will then underestimate the variance. Instead the original variance estimate should be used.
REFERENCES
Footnotes

Funding: none.

Competing interests: none declared.
Linked Articles
 In this issue