Article Text
Statistics from Altmetric.com
It has been pointed out (D Eayres, personal communication) that Chiang’s formula1 for estimation of the standard error of the expectation of life (which allows for the fact that mortality rates increase with age within each age band—clearly relevant with abridged life tables and advancing age) can be improved by combining it with the fact that the final age band is open-ended so that its width is a random variable and therefore contributes to the standard error of expectation of life.2 However, in simulations Eayres notes (personal communication) that there is still bias in the estimation of expectation of life, especially with small populations. The purpose of this communication is to suggest an improvement to reduce the effect of this bias. Errors in estimation of the population may also cause spurious variation in eox between populations, but these are not considered here.
METHODS
In Silcocks et al2 the expectation of life (eox) is expressed in terms of functions of the age specific mortality rates. The (theoretically) open-ended final age band contributes to the point estimate of eox and its variance by assigning to the final age band a notional fixed width, which in turn is a function of that age band’s mortality rate.
Assuming exponential survival, the mean survival time in the final age band is 1/mn years; the total person-time is ln−1/mn where ln−1 is the number alive at the start of the final age band. With exponential survival there is no fixed width to the final age band but the correct person-time is obtained as if given by the area of a triangle, of height ln−1 and base equivalent to a notional width of the final age band fixed at 2/mn so that
where mn = annual mortality rate in the final age band and rn = number of deaths in one year in the final age band.
The variance of eox is then estimated by applying the delta rule to the function relating eox to the age specific mortality rates.
However, although the estimated mortality rate mn is an unbiased estimate of the true mortality rate, znoverestimates the width of the age band on average.
An approximation for the bias can be obtained mathematically as follows:
Putting zn = g(mn) and denoting the expected value of mn by μ a Taylor series expansion gives:
Considering only the first three terms and taking expectations of both sides (noting that the expected value of (mn−μ) is 0, because mn is an unbiased estimate of the true mortality rate) then after rearranging, an approximate estimate of the bias3 is the expected value of ½(mn−μ)2d2g/dm2n = ½var(mn)d2g/dm2n
Using a binomial distribution for the number of deaths (regarding mn as an annual probability of dying), the bias corrected estimate is zn-bias, which works out to be:
The fewer the number of deaths, rn, the bigger the difference between zn and z*n Conversely if rn is large then 1/rn is small and the bias is small.
The variance of zn under the binomial assumption is given by:
Alternatively the point estimate could instead be multiplied by a shrinkage factor to reduce the mean squared error,3 which will also reduce bias because the expectation of the mean squared error = bias2+variance.
As a practical example, consider an English electoral ward with a total population of perhaps 6000 people with about 76 people in the 85+ age band, among whom there would be 13 or so deaths per year. The point estimate of the age band width would be 11.69 years. I compared this estimate and its variance (8.72) with the mean and variance of the estimated widths based on 5000 simulations (using MINITAB v10 to simulate a binomial distribution with n = 76 and 13 expected deaths—that is, with binomial parameter 0.171).
Key points
-
The variance of expectation of life must be allowed for in studies of small populations
-
With small populations, the small number of deaths in the oldest age band may bias the estimate of the final age band width, and hence the expectation of life
-
A simple bias correction is proposed to alleviate this
RESULTS
In this exercise we know that the “true” value for zn is given by the point estimate zn = 2/mn. The mean of the simulated values shows the extent to which an uncorrected estimate of the notional width is an overestimate (from table 1 on average about 7.5% greater).
Comparison of point, shrunk, and bias corrected estimates
The simulations also show that the variance of the uncorrected estimate of zn was 68% greater than predicted from the formula (4).
The mean of the bias corrected simulated values was closer to the true value (99.9%) than was the mean of the shrunken simulated values (99.2%). The point estimate of variance was smaller than the variances of the shrunken and bias corrected estimates but was closer to the latter.
The bias correction also improved the approximation to a normal distribution by reducing the skewness of the uncorrected estimate and performed better than the shrunk estimate—although because the width of the age band is a reciprocal transformation of the (approximately normal) mortality rate, zero skewness is an unrealistic target.
Policy implications
-
Apparently high expectation of life estimates in small populations may be the result of correctable bias, but errors in estimating the population at risk may also contribute.
DISCUSSION
The limited simulation exercise performed here suggests that a correction based on a Taylor series has the following advantages:
-
It removes some of the bias associated with the original estimate for the width of the final age band
-
The point estimate of variance is then closer to the variance of the simulated corrected estimates
-
The skewness is substantially reduced, improving the overall fit of the estimated expectation of life to a normal distribution.
The temptation to adjust the estimated variance of the age band width by plugging-in the corrected point estimate should be resisted as this will then under-estimate the variance. Instead the original variance estimate should be used.
REFERENCES
Footnotes
-
Funding: none.
-
Competing interests: none declared.
Linked Articles
- In this issue