Article Text
Abstract
Study objective: To derive methods of calculating confidence limits for the relative index of inequality, defined by Kunst and Mackenbach as a measure of the influence of socioeconomic status on an adverse health index, such as mortality rate. The methods may be used for a health outcome recorded on a continuous scale, as a Poisson count or as a binomial variable.
Results and Conclusion: The confidence limits depend on the sampling variability of both the mean mortality rate and the slope of the regression line of mortality on the socioeconomic status scale variable. The best method for a continuous health outcome is based on Fieller’s theorem but a good approximation is obtained by substituting the confidence limits for the slope of the regression line into the formula for the calculation of the index, or by using the variance of the logarithmic transform of the index. The last method is the most appropriate for the construction of significance tests comparing indices. The mortality rates may show statistically significant departure from linearity, while not suggesting that a linear relation is inappropriate, and the main decision is whether to base the confidence limits on the conventional standard error of the slope derived from the regression analysis or whether to use the standard deviation of the estimates of mortality rates.
 socioeconomic factors
 small area analysis
 confidence intervals
 health status indicators
Statistics from Altmetric.com
The relation between health and socioeconomic status is an important topic and a measurement of inequality in health is required. We consider the specific measurement of inequality in mortality, or any other adverse health index, over socioeconomic status, the relative index of inequality (RII), and the construction of confidence limits. Following Pamuk^{1} the socioeconomic status categories are ranked on a scale from the lowest to the highest. Each category covers a range on the scale proportional to its population size and is given a value, x, on the scale corresponding to the midpoint of its range. The scale has a range from 0 to 1, so that the lower value of the range of a category is equal to the proportion of the whole population that is of lower socioeconomic status. For each category a measure of mortality y, such as a standardised death rate, is available.
The data are plotted as in figure 1 and a regression line fitted by least squares. Pamuk^{1,}^{2} recommended that weighted least squares should be used with the weights proportional to the population size of each category, in order to minimise the effect of deviant rates based on small numbers.
The regression line has the formPamuk ^{1,}^{2} defined the relative index of inequality aswhere is the overall mean death rate. Note that β is negative so that the index is positive.
This index was modified by Kunst and Mackenbach ^{3,}^{4} asThis index is the ratio of the mortality of the most disadvantaged (x=0) to the most advantaged (x=1). Thus if the index is 1.5 then the mortality rate of the most disadvantaged is 1.5 times as high as that of the most advantaged. Note that the values x = 0 and 1 do not correspond to the lowest and highest categories but to the extremes of these categories. They therefore represent extreme, possibly hypothetical, subgroups. In earlier papers Kunst and Mackenbach^{5,}^{6} reversed the x scale and defined the index as the proportional increase in mortality of the most disadvantaged relative to the most advantaged; this index is equal to RII_{KM} − 1. The RII is sometimes calculated within age groups so inequality is assessed separately for each age group.
Kunst and Mackenbach^{3,}^{4} used ordinary least squares, rather than weighted least squares, and we will follow this approach on the grounds that the variability about the regression line may exceed the sampling variability associated with each mortality rate and then the weights give too much weight to the larger categories. It is straightforward to modify the method to use weights if required.
In this paper we consider calculation of the standard error of an estimate of the relative index of inequality. Writing γ for RII_{KM} and noting that α = − β thenIt is convenient to work with this form as β and are uncorrelated in the least squares estimation. Clearly the variance of γ depends on the variances of β and . The variance of γ may be estimated approximately asThis giveswhere c is the critical 5% value from the appropriate distribution used in the calculation of the variances.
An alternative is to work with the natural logarithm of γ givingA third method is to use Fieller’s theorem^{7} to derive confidence limits directly without the intermediate derivation of a standard error. Details are in the appendix.
Because x is in the range 0 to 1 the variance of will be smaller than the variance of β by an order of magnitude. Also β^{2} will be smaller than ^{2} provided that γ is less than 3 (this is a very high rate of inequality). Therefore in the expression β^{2}var + ^{2}varβ in equations (5) and (6) the latter term predominates. A fourth method is then to ignore the variability in ; equation (6) then becomesAlternatively the limits may be calculated by substituting the limits of β in equation (4); these are the Fieller limits (see equation (11) in appendix).
When it is required to compare two groups an estimate of variance is required and the Fieller limits cannot be used. It is appropriate to work with the logarithmic transform and use equations (6) or (7).
There are two methods of estimating the variances of β and . The first is to use the regression output from fitting the regression equation working with the variance based on the standard deviation about the regression line fitted by least squares. This has the disadvantage that there are usually only a few socioeconomic status categories and hence there are few degrees of freedom for the error (in our example, 3 df and c = t_{3,0.05} = 3.182) giving an increase in the width of the confidence intervals. Also the death rates that are used within each category have a known accuracy dependent on the sample size from which they have been estimated. This variability contributes to the variability around the regression line but the latter also includes a component because of lack of linearity in the true relation between the death rate and x, or because of a lack of fit of the line for other reasons. The second method is to ignore any lack of linearity or fit and base the variances of β and on the sampling variability of the death rates. Then c is the standardised normal deviate, z=1.96.
An alternative approach is to use Poisson regression. If d is the number of deaths, and the expectation of d, based on the size and age distribution of the group, is m then the Poisson regression iswhere μ is the expectation of d taking account of x. As the death rate is μ/m this model is linear between the logarithm of death rate and x. Thenand a confidence interval for γ is obtained directly as exp [−β ±c ×SE(β)].
A second alternative is to use logistic regression, and this is appropriate for data from casecontrol studies in which the odds ratio (OR) is approximately equal to relative risk. The relative index of inequality, γ, is the odds ratio for x=0 relative to x=1 and equals exp(−β), and a confidence interval is obtained in the same way as for Poisson regression.
It should be noted that when either Poisson or logistic regression is used the RII is obtained directly as the exponential of the slope and the intercept term is not required. For a continuous outcome variable, this is not the case, as the slope is then an absolute measure of difference in health status. Wagstaff ^{8} refers to this as the slope index of inequality (SII) and notes that it may be interpreted as the absolute effect on health of moving from the lowest socioeconomic group to the highest. A relative measure can only be constructed either by dividing by the overall rate, as in equation (2), or by taking the ratio of the fitted values at x=0 and x=1 to produce the KunstMackenbach measure, equation (3).
EXAMPLE
In a study of inequality in mortality in the Australian state of New South Wales 1975 to 1994, local government areas were sorted into quintiles based on a measure of social disadvantage. For each five year period by gender the relative index of inequality was used to summarise the relation between the age standardised mortality rate (ASMR, y) and the quintile cumulative population proportions (x). The data in table 1 and figure 1(A) are for men in 1990–1994. The standard errors shown for each ASMR were calculated assuming a Poisson distribution of the number of deaths in each age group. β is estimated as −310.1 and γ as 2.033. The 95% confidence limits of γ are given in table 2.
Key points

The relation between health and socioeconomic status is an important topic.

The relative index of inequality (RII) is a frequently used summary measure of socioeconomic inequality in health.

The RII confidence limits depend on two sources of sampling variability, the health outcome measure (such as mortality rate) and the fitted regression line of health outcome on the socioeconomic status scale variable.

This paper shows that while the best method for deriving confidence limits is based on Fieller’s theorem, a very good approximation may be obtained using a simpler method.
Within the columns of table 2 the limits are similar for all the methods except for those calculated using SE(γ). The limits calculated using the limits of β are only slightly narrower than those based on Fieller’s method as may be predicted as ^{2}varβ is 30 times β^{2}var so that ignoring the latter has little effect. The limits using the standard deviation from estimation of the age standardised mortality rates are much narrower than those using the standard deviation about the regression line. This is mainly because the latter standard deviation is 4.3 times as great as the former and also because of the effect of using a t value on 3 df of 3.18 instead of a z value of 1.96. A formal test of linearity is the ratio of the variances (22.9^{2}/5.33^{2} =18.5 as a F with 3 and ∞ df, p< 0.001).
Figure 1(B) shows similar data for 1975–1979. The index is estimated as 1.81 with 95% confidence interval using equation (7) of 1.18 to 2.79 including nonlinearity, or 1.74 to 1.88 ignoring nonlinearity. The former are very wide as a consequence of the wide variability about the best fitting line of the two lowest points on the socioeconomic status scale. Suppose it is required to test whether there has been a change in the index between 1975–1979 and 1990–1994. Then taking account of nonlinearity var(lnγ) may be recalculated for each period using the combined estimate of the variance about the fitted lines that is 1604 with 6 df. It should be noted that although the separate estimates (2684 for 1975–79 and 524 for 1990–1994) differ by a factor of 5 this is well within the bounds of chance when each is estimated on only 3 df. Then the test statistic is calculated as the difference between the two values of lnγ divided by the square root of the sum of their variances, which gives 0.59 as a t statistic with 6 df. Clearly the result is not significant as is clear from the wide confidence intervals. If the nonlinearity is ignored then using the variances calculated from equation (7) for each period the test statistic is 3.68 as a z statistic giving a highly significant effect as is again clear from the confidence intervals.
DISCUSSION
The measurement of inequalities in health is an important topic and the relative index of inequality is a frequently used measure.^{8–}^{17} Poisson regression^{9–}^{12} or logistic regression^{12–}^{17} are often used, while analysis of a continuous health outcome seems to be less common. Hence it is necessary to be able to estimate the sampling variability of an estimate of this measure in order to produce confidence intervals and to compare two measures.
The Fieller method of calculating the confidence limits may be expected to be superior as it takes account of the variability in both β and , and does not assume that either γ or its logarithm are normally distributed. However, in practice the limits obtained by substituting the limits of the regression coefficient β in the calculation of γ and ignoring the variability in the estimation of the mean mortality rate are a very good approximation. Another reason for basing the limits only on varβ is so that the 95% confidence limits and the 5% significance test will be consistent, as a significance test of the index is obtained by testing β. An alternative to Fieller’s theorem is to calculate limits using SE[ln(γ)] and this is the best method for estimating the sampling variability for the calculation of a test statistic to compare two groups. Kunst and Mackenbach^{5,}^{6} gave confidence intervals of the index which were based on the sampling variability of β. As they used Poisson regression methods exp(−β) gave the index and was uninvolved.
The main decision is which standard deviation to use in the calculation of the limits. The conventional approach is to use the standard deviation about the regression line. This would certainly be the advised course if an assumption of a real linear relation between the mortality rate and the measure of socioeconomic status was considered essential. However, there is no a priori reason to suppose that the relation is necessarily linear and the fitting of a straight line may then be regarded as a convenient approximation to the general trend. It is then more appropriate to ignore the nonlinearity and use the standard deviation calculated from the original estimation of the mortality rates. When fitting a Poisson regression the choice is between estimating the variance of β under the Poisson distribution (ignoring nonlinearity if present) or inflating the variance to take account of extra Poisson dispersion due to nonlinearity. The latter situation is termed “overdispersion” (see McCullagh and Nelder^{18}).
In the example there was clear evidence of excess variability about the fitted line and yet figure 1 suggests that linear relations are the most appropriate relations that could be used. The lack of fit may be attributable to nonlinearity in the true relation or to imprecision in defining the socioeconomic status variable. In the former case it seems appropriate to ignore the nonlinearity, as it is a systematic rather than a random effect, whereas in the latter the lack of fit indicates random variability that should be taken into account. It is conservative to base the limits on the standard deviation about the regression when assessing whether changes in inequality have occurred over time.
APPENDIX
Fieller’s method
DefineThenThe best estimate of γ is when z=0 and by Fieller’s theorem the confidence limits for γ are the solutions of the equationwhere c is the critical value of the appropriate distribution. Therefore the confidence limits for γ are the solutions ofThis is a quadratic equation in γ and the two roots are the lower and upper confidence limits.
After some algebraic manipulation the limits are found to bewhereandIf the variance of is small relative to ^{2}varβ/β^{2} then the limits may be obtained by putting var equal to zero in the above to give as limitsThat is, the limits for γ are obtained by substituting the limits of β in the calculation of γ (equation 4).
Acknowledgments
We thank Professors Susan Quine and Richard Taylor for their comments and encouragement.
REFERENCES
Footnotes

Funding: this work was funded, in part, by a Public Health and Research Development Committee research scholarship from the National Health and Medical Research Council awarded to Lillian Hayes.

Conflicts of interest: none.