Study objective: To examine social inequalities in minor psychiatric morbidity as measured by the GHQ-12 using lagged models of psychiatric morbidity and changing job status.
Design: GHQ scores were modelled using two level hierarchical regression models with measurement occasions nested within individuals. The paper compares and contrasts three different ways of describing social position: income, social advantage and lifestyle (the Cambridge scale), and social class (the new National Statistics Socio-Economic Classification), and adjusts for attrition.
Setting: Survey interviews for a nationally representative sample of adults of working age living in Britain.
Participants: 8091 original adult respondents in 1991 who remain of working age during 1991–1998 from the British Household Panel Survey (BHPS).
Main results: There was a relation of GHQ-12 to social position when social position was combined with employment status. This relation itself varied according to a person’s psychological health in the previous year.
Conclusions: The relation between social position and minor psychiatric morbidity depended on whether or not a person was employed, unemployed, or economically inactive. It was stronger in those with previously less good psychological health. Among employed men and women in good health, GHQ-12 varied little according to social class, status, or income. There was a “classic” social gradient in psychiatric morbidity, with worse health in less advantaged groups, among the economically inactive. Among the unemployed, a “reverse” gradient was found: the impact of unemployment on minor psychiatric morbidity was higher for those who were previously in a more advantaged social class position.
- psychiatric morbidity
Statistics from Altmetric.com
The General Health Questionnaire (GHQ) is a widely used measure of minor psychiatric morbidity, validated and intended for large scale community surveys.1 Although most studies using other measures that operationalise depression and psychological function show clear relations to socioeconomic position, studies that use the GHQ show inconsistent results, and most show no social gradient. In this paper we address this puzzle in three ways: by taking a longitudinal perspective that enables us to examine year on year changes in GHQ, by taking account of employment status, and by using three measures of socioeconomic position and circumstances. Data are taken from the British Household Panel Survey (BHPS), which includes the 12 item version of the GHQ every year in its data collection. The BHPS is unique in that we have for the first time GHQ-12 self reports over eight years for the British population. Our analysis focuses on the years 1991, when the study originated, until 1998. We treat the BHPS as a genuine panel study rather than as a series of repeat cross sectional analyses.
Kreiger et al2 argue that there are “diverse aspects and dimensions of … socio-economic position … [and] … numerous studies suggest that measures at each level, over time, may be informative.” These authors and, more recently, Fryers et al3 make a convincing argument for more precision in the measurement of socioeconomic position and circumstances. Following their plea we examine the relations to mental health of three separately operationalised dimensions of social inequality. These are: the new UK Office for National Statistics socioeconomic classification, the NS-SEC,4 a key alternative developed by Prandy and colleagues5 that reflects household social advantage and lifestyle (the Cambridge score), and income. The NS-SEC is a measure of occupational social class based on employment relations and conditions, and the Cambridge scale measures general social advantage based on patterns of social mixing and affiliation. In these respects, they attempt to reveal the meaning of social inequality in terms of relationships between individuals in society. In contrast, income only directly reflects a material aspect of social inequality.
Whereas evidence for the relation between minor psychiatric morbidity and social class as measured by the registrar general’s schema is weak or inconsistent,3,6 employment status (employed, unemployed, economically inactive), standard of living, and poverty have been shown to be consistently related to minor psychiatric morbidity as measured by the GHQ in a number of studies.7–11
Here we report findings for eight waves of BHPS that use all of the information in the GHQ-12 as a continuous outcome, recognise clustering in the data, and focus not only upon income and its relation to GHQ but lifestyle (Cambridge) and employment relations (NS-SEC). We control for a person’s job status in any given year, along with their previous year’s GHQ score. The loss of information involved in attrition from the panel is also addressed by the use of Schafer’s method of data imputation.12
SAMPLE AND MEASURES
Our analysis focuses on 8091 people who were aged between 16 and 64 in 1991, when the study originated, until 1998. For a full description of the sample design and selection probabilities see Taylor et al.13 In subsequent modelling we apply cross sectional weights for 1991 to adjust our analyses of imputed data. (see Methods and the appendix for details of our strategy for handling missing data).
This was a summative score based upon the 12 item version of the GHQ (GHQ-12) for each wave. Respondents who report any statement (for example, have you recently lost much sleep over worry?) on the scale as applying to them “rather more than usual” or “more than usual” score 1 whereas those reporting “less than usual” or “not at all” score 0. Summated scores with a value 3 or more (out of 12) are typically used to classify cases of common mental disorder. In our application we use the total score rather than a dichotomy (mean = 2.05, SD = 1.99).
Person level characteristics
Age (in years) and sex as recorded in the first wave of the survey, 1991.
Wave or occasion specific characteristics
Social position was measured at each wave in three ways: the NS-SEC, the Cambridge scale, and income. NS-SEC is a sevenfold categorisation, which classifies occupations according to employment relations and conditions (the degree of job security, opportunity for promotion, work autonomy, and control over the work of others). Economically inactive respondents (for example, retired) were classified according to their last full time occupation. The Cambridge scale was derived from a multidimensional scaling of social distance scores between all pairs of occupations in several field studies undertaken in the UK. The scaling resulted in a single dimension that is labelled “general social advantage and shared lifestyle”. Scores reflect the degree to which members of different occupations associated together on a basis of equality. In this analysis it is treated as a household measure where a household is assumed to contain a couple. Whenever this is the case the higher of the individual scores is assigned to the household. In instances where there is no couple present in the household (for example, a shared adult household) individual Cambridge scores are assigned. Income was based upon an adjusted measure of the total gross household income (the Clements’s scale14) to allow for differing needs according household size and composition. To ensure that our comparative analysis used broadly equivalent categorisations of social position we created septiles for the Cambridge scale and our income measure. Allocation to socioeconomic categories is based upon a person’s position in each wave for the fully imputed data.
Current economic activity is based on whether or not a person describes themselves as currently employed, unemployed but actively seeking work, or inactive. Previous psychiatric status is simply the previous year’s GHQ-12 score referred to in subsequent tables as “GHQ lag”. Beyond a one or two year lag there is very little covariance among the multivariate GHQ-outcomes (results not shown). This suggests that a person’s current GHQ-12 score is most affected by their immediate history.
Hierarchical modelling to match the data structure over time
Hierarchical linear models are appropriate analytical tools to handle panel data where individual sample members generate observations over time.15,16 Each wave or occasion (level 1) is nested within a person at level 2 in the hierarchy. We are able to separate out two simple variance component estimates to describe the variation in GHQ scores. One of these describes the variance between individuals and the other component describes the variance within individuals between occasions of measurement. The variation between individuals can be modelled by including person specific characteristics that are constant throughout the period of the study (for example, sex). The variation within individuals over time can be explained by including circumstances that may change from one measurement occasion to the next (for example, social position, economic activity). The ratio of the between person variance to the total variance is described as the intraunit correlation. It indicates the extent to which our observations are clustered within individuals.
Tackling information loss
Missing observations have been dealt with by using multiple imputation techniques.12,17–21 Using 1991 as our sample base only 59% (4805) of the original respondents remain in the study by 1998. Another 32% (2615) leave at any one of the subsequent waves (1992–98) and never return to the study. Others are in the study in 1991, leave in one or more subsequent waves but later return for one or more years. They make up around 8% (671) of the original sample. If we ignored anyone who is absent from the BHPS in one or more waves we would lose up to 41% of the original data. Table 1 provides a resume of the extent of attrition. Schafer’s12 data augmentation procedures for filling in provide a full set of data for any pattern of forward missingness (that is, data for anyone who joined the study after 1991 are not imputed). A fuller account of the strategy is given in the appendix. For recent applications of Schafer’s methodology see Schafer et al,22 Schneker et al,23 and McCulloch and Joshi.24
Analyses are carried out using hierarchical linear modelling25 where observations are nested over time within individuals in a two level hierarchy. We adopted a forward selection strategy, conditioning for age and sex (base model) then adding each main effect in turn before testing for two and three way interactions. All models are simple variance component models, which are fitted using the panel data functions for longitudinal data available in Stata (release 6.0, College Station, TX).
All figures (fig 1, 2, 3, and 4) and analyses (tables 2–7) show predicted GHQ scores based on augmented data.
Augmentation procedures have brought the distributions in the BHPS sample available for analysis closer to that of the population of UK as reflected in 1991 census data (see appendix table A1).
Table 2 shows the marginal frequencies for each measure of socioeconomic position and circumstances: NS-SEC, Cambridge, and household income after data augmentation. The first category in each measure represents the highest level of income or prestige.
Table 3 summarises the mean GHQ score by economic activity after imputation. The unemployed have a marginally higher score than the economically inactive although the unemployed outnumber the economically inactive by about 4:1.
Table 4 presents the initial modelling in terms of our null model that contains no explanatory variables but shows the amount of variation at occasion and individual levels, and a base model where age and sex controls are entered. As evidenced by the intraunit correlation coefficients (ρ values) in the null model there is a clear confirmation that there is a substantial amount of within person wave on wave variation in GHQ scores as well as between person variance to explain (ρ = 0.38). Adding controls for sex and age produces a modest reduction in the between persons variance (about 3%, 2.89 compared with 2.81), and no reduction in the amount of within person wave on wave variance left to explain.
Next we include each measure of social position in turn at the wave level to examine the impact on the base model. Table 5 shows the relative reduction in the variance components. Social position, however measured, explains a negligible amount of the wave on wave variation. Nor does the introduction of the measures of social position explain any great amount of the between person differences. If anything knowing a person’s income level is comparatively more informative than knowing their social status as measured by the Cambridge scale or their social class based on employment relations using NS-SEC.
Although the measures of socioeconomic position do not explain a great deal of the variance in GHQ scores, turning to the fixed part of these models in figure 1 we can see some evidence of a gradient of predicted GHQ scores for two of the measures. This figure as with all subsequent figures shows the predicted difference in the GHQ score above and below the reference category for each measure of social position (always the most advantaged group).
Broadly, GHQ levels rise as income levels decline, with a similar but shallower gradient as Cambridge levels fall. For NS-SEC there is some weak evidence for a relation. The difference between the most advantaged groups (septile 1) and the least advantaged (septile 7) is never more than a third of a GHQ point. Notably, the relative rise in GHQ for SEC 4 (self employed and small employers) in relation to their nearest neighbours could mark the risk of a relatively higher GHQ score in the self employed.
The addition of economic activity status and prior health (GHQ lag) adds considerably to the explanatory power of the models. From table 6 we see that a person’s current job status results in a further modest reduction in between person variance (about 3%) whereas a person’s previous year’s GHQ score reduces the between person variance considerably (about 48%). The inclusion of GHQ lag also indicates a modest increase (4.6%) in the within person wave on wave variance.
For the impact of job status and GHQ lag as main effects in the model examine figures 2 and 3. Figure 2 shows that GHQ scores are lowest for the employed, highest for the unemployed, and intermediate for the inactive. It also shows that gradients according to socioeconomic position differ according to employment status. For the employed the gradients are virtually flat. In the inactive, GHQ scores are higher in those in less advantaged socioeconomic position according to all three measures. However, the differential between the most and least advantaged is rarely greater than half a GHQ point. But the relative impact of unemployment for who were previously high earners and experienced more favourable employment relations (along the NS-SEC dimension) suggests a reverse gradient. Here the most advantaged rise almost a whole GHQ point above their counterparts. The gradient for the unemployed along the dimension of household prestige (Cambridge scale) is less evident.
Figure 3 shows the impact of a person’s previous year’s GHQ score. Three GHQ scores have been used to illustrate the impact of the previous year’s score. For those with no previous GHQ symptoms (always the majority in any wave) there is no social gradient. As previous year’s GHQ scores rise we begin to see some evidence of a gradient by socioeconomic position with notable “humps” at higher levels for the small employers and self employed (as defined by the NS-SEC) where their previous year’s GHQ score was high. Although the impact of a relatively high GHQ score in the previous year never pushes the current score much above half a point for the least advantaged.
The final stage in our modelling combines the effect of socioeconomic position and economic activity status on GHQ in each year controlling for the previous year’s GHQ score. For each measure of social position, three way interactions between social position, job status, and the GHQ lag score were statistically significant. Thus to interpret the relation between social position and GHQ scores we have to simultaneously take account of current economic activity status and a person’s previous GHQ score as well as their age and sex. Table 7 summarises the predicted change in GHQ score for persons of average age with one of two hypothetical GHQ scores in the previous year (zero and 3) based on these models. At all combinations of levels of job status, social position, and previous year’s GHQ score, women tend to have a higher reported GHQ score than men of the same age (about half a point higher). The interaction between job status and the previous year’s GHQ score shows evidence of a greater effect of social position in economically inactive persons who had a high GHQ score in the previous year. As a result we concentrate on the economically inactive in figure 4.
Figure 4 focuses on the relation of social position measures to GHQ score in men and women only at a time when they were economically inactive. It shows that even in this group, the presence of a social gradient depended on their psychological health in the previous year. In those without any GHQ symptoms in the previous year there is little to no evidence of a social gradient in GHQ. There is slight evidence along the income and Cambridge scale dimensions for higher scores among those in the bottom two septiles. Beyond that, clearer gradients are seen along the prestige dimension (Cambridge scale), the higher the level of reported GHQ in the previous year. Apart from a downturn for the lowest septile this is supported along the income scale. For NS-SEC we see similar amounts of relative increase as the previous year’s GHQ scores become higher. Again, small employers and self employed (SEC 4) among those currently inactive and having previously high levels of GHQ peak above any of the other NS-SEC positions.
DISCUSSION AND CONCLUSION
As the availability and richness of longitudinal data on the social determinants of health increase rapidly, methods similar to those we adopt in this paper are beginning to be extended to social epidemiological studies outside the field of health inequality. Panel data have the potential to improve our ability to test the causal nature of observed associations by demonstrating dose-response relations over time. However, they present two major challenges. The first of these is how best to exploit long sequences of events, in which individuals move in and out of conditions in contrast with conventional survival analysis. The second is the issue of sample attrition.
Most recently Weich et al26 analysed BHPS data using the first seven waves of data to examine the effect of social roles and gender differences on mental health assessed using the GHQ. Their analytical strategy addresses both of these challenges. Firstly, they analyse the sequences of GHQ scores by treating observations as clustered over time within individuals and using random effects models.27 This is equivalent to the hierarchical modelling we have adopted via simple variance component modelling in Stata. Secondly, they tackle the problem of attrition by analysing those participants aged 16–70, (n = 9947), who answered the GHQ on at least two consecutive occasions. Our preference has been to fill in or augment the wave by case data matrix before our analyses using multiple imputation for the whole person by year matrix rather than condition upon consecutive pairs of observations. This encouraged us to extend a similar approach to inequalities in mental health.
The approach has several advantages. One is that it can allocate variation in mental health as between the effects of varying social and economic circumstances within the same person, and the effect of characteristics that distinguish individuals throughout the period of observation. Over 20% of the variance in minor psychiatric morbidity in this sample of the British population was shown to be explained in terms of time varying social and economic circumstances over the eight year period, and only 1.2% in terms of stable individual characteristics. In this paper we have been concerned to show the way this approach may be applied to the study of health inequality, and have for simplicity only examined a very limited number of both time invariant individual and time varying factors: there is obvious scope for exploring more elaborated models.
There is a perennial problem in health inequality research that associations between social conditions and health may result from the effect of health itself on, for example, income (“selection”). In the approach we have taken there are two safeguards to this. The first is that the method assumes that the employment statuses, socioeconomic positions, and health statuses of any person over time are not totally independent events but will tend to “cluster”. The method used in this paper makes it quite feasible for future work to try and explain the stability or “state dependence” of individual work histories in terms of hypothetically time invariant level 2 characteristics such as educational ability or childhood experiences. The second way in which we have addressed the problem of selection is by the use of a one year lag in the GHQ. This decision was supported by an empirical evaluation of the inter-relations between the successive GHQ scores for each wave by treating them as a set of multivariate outcomes nested within each person.15 Beyond a one year lag covariance estimates between GHQ scores were weak. Indeed, GHQ score in the previous year was found to be by far the strongest explanation of GHQ score in any current year. However, lagged GHQ did not explain away the relation of socioeconomic position to current GHQ. Rather, the results show that social inequality has a different effect according to mental health one year ago. If anything the relation of advantage and disadvantage, however measured, is greater in those with previously less good health.
Using the GHQ-12 for the first eight years of the BHPS we have been able to show the way in which variation in mental health can be attributed in part to the effects of varying social and economic circumstances within the same individual, and in part to the effect of characteristics that distinguish individuals
The relation of socioeconomic position and circumstances to minor psychiatric morbidity depends upon whether or not a person is employed, unemployed, or economically inactive
Social inequality has a different effect according to a person’s mental health a year ago
The relation of advantage and disadvantage, however measured, is greater in those with previously less good health
For the unemployed there is evidence of a sharp “reverse” gradient whose previous jobs were more advantaged
For the economically inactive there is evidence of a social gradient in mental health
Among the economically inactive, women report higher levels of psychological distress in every socioeconomic category
The interaction between socioeconomic position and employment status is in some ways a very surprising result, but it is in fact consistent with previous literature. The impact of unemployment on mental health may depend on the extent of “relative deprivation” produced by job loss.28–30 Platt and Kreitman31 also found that the relative risk of parasuicide among the unemployed was higher in middle class than working class areas. This is consistent with the very high rates of GHQ caseness found here among unemployed persons in more advantaged households and who had previously been in more socially advantaged occupations as measured by the NS-SEC.
In sharp contrast with unemployment, economic inactivity (the employment status of retirement, permanent sickness, or full time home care) appears to amplify the effect of socioeconomic position on mental health as measured by the GHQ. Taken together with a person’s immediate GHQ history, it does seem that in this group relative income will afford some protection against a high GHQ score in the current year. This is in line with the findings of previous studies that have found relations of GHQ to poverty and deprivation. However in this study, the risk of a high GHQ score was also increased throughout the lowest three septiles of the income distribution, not only in the poorest.
It seems that it is in the economically inactive group that low household income or prestige have the most significantly deleterious effect on mental health, particularly on the ability to recover from poor health in the previous year (interaction of lagged GHQ with socioeconomic position measure among the inactive). This result applies most strongly to the women in the sample: some 45% of all the person years of economic inactivity in the eight years of observation were experienced by women in the form of home care. It is well known that women without paid employment tend to have poorer mental health than those with jobs.31–33 However, the social gradient seen in this group is not often commented upon.
The large amount of within person variation over short to medium term spans means that a high GHQ score in any one year does not necessarily imply continued high levels of this type of psychological distress. This paper has shown that to understand the part played by changes in socioeconomic position and economic status in changes in psychological health, consideration must be given to ways in which these experiences interact, and how their effects are influenced by pre-existing psychological health.
MORE ON TACKLING INFORMATION LOSS
Any prospective subject loss has been managed by treating the person year dataset as a series of case observations across time. This allows the analyst to use multiple imputation to fill in any missing observations.12 The analytical model is then applied to each filled in dataset in turn, and the parameter estimates averaged using the appropriate formulas. This strategy gives parameter estimates adjusted for non-response bias (under certain assumptions that will be discussed further on) and maximises the amount of information available for analysis. An alternative approach to adjust for non-response bias would be to use one of the weighting schemes routinely available with the BHPS. The longitudinal weights “gross up” estimates based on those individuals who have complete data across all waves (caaw), and the cross sectional weights gross up those who have complete data within waves (cww). Analyses are reasonably straightforward to apply using Stata but a substantial proportion of information from incomplete cases is lost from the analysis.
In this paper, the multiple imputation method of Schafer is used.12 Schafer’s approach fills in missing values using a procedure, which adjusts for non-response bias provided that (a) the survey variables follow a multivariate normal distribution and (b) the data are “missing at random” (MAR). (De Leeuw et al20 contains a useful description of missing data mechanisms and treatment strategies.) Essentially, any “gap” or block of missing items in a longitudinal record is filled in. Of course, this might represent a whole year’s set of observations. The analyst then conducts whatever analysis is planned over a number of filled in data replicates. The recommended number of replicates arises from Little and Rubin’s34 work on multiple imputation. The actual number is decided upon by the extent of missingness in our rectangular person by year matrix. This is equivalent to the percentage of missing data entries (case by variable) to the total number of entries if the whole matrix were complete ( around 24% for our data). This suggests that we use five replicates to achieve around 95% efficiency. Final parameter estimates (for example, regression coefficients) in subsequent regression modelling are then averaged across the replicates and corresponding standard errors are calculated using Rubin’s rule. Typically, standard errors will be larger than those obtained by simply analysing those cases with complete records because they are inflated to account for uncertainty about the filled in values. The multiple imputation algorithm, applied before analysis, is that implemented in the free software NORM, which assumes that the data is multivariate normally distributed as well as any missingness being MAR. Survey variables with continuous, non-normally distributed outcomes were transformed to normality; filled in values for categorical outcome variables were truncated to the nearest integer value.* To be included in our analysis someone has to be present in the BHPS in the year of the study’s origin, 1991. Analyses for the filled in datasets are then simply adjusted by applying the cross sectional weights for that year as routinely supplied by BHPS.13 These weights bring the observed sample in line with 1991 population distributions based on the census. Table A1 shows how well this procedure works for age and sex distributions.
In modelling analyses (not presented) we adopted two strategies that did not entail any multiple imputation. The first simply uses those 8091 cases that have complete sets of observations within any wave or year (complete within wave or cww). These data can be handled routinely under hierarchical linear modelling procedures. The second approach would simply be to apply the longitudinal weights provided by the BHPS to those 4805 cases who have complete observations across all eight waves (caaw). Essentially, a longitudinal weight is a composite of many sub-categories of data that are “best” predictors of loss (personal communication). In terms of assumptions about missingness this would be equivalent to assuming that items were “missing completely at random” (MCAR) in the subclass defined by the longitudinal weight. We judge Schafer’s approach to be an improvement over this approach because it does not assume that the missing people in a subclass are simply a random sample of people in that subclass. Under MAR missing values are assumed to depend on the values of the characteristics of those for whom we do have observations and not on the characteristics that they share in common. In pragmatic terms a filled in dataset resulting from our MAR based imputation results in a dramatic gain over caaw (a 68% increase in available data: 323640 (8091 cases × 8 waves × 5 items) data points compared with 192200 (4805 × 8 × 5) under caaw). Whereas the cww analysis represents a compromise that draws upon 251980 data points or a 31% gain over caaw). Following Schafer and Graham35 we are inclined to trust our MAR based analysis, which does appear to be supported by the data. Firstly, the pre-imputation and post-imputation results are sensible: post-imputation mean GHQ scores are higher than pre-imputation scores, which suggests that those who have missing data tend to be in lower social positions and non-employed (table A2). Secondly, it is unlikely that the reasons for drop out or compliance for any item are solely related to GHQ status alone.
Schafer and Graham investigated drop out in a survey of drug use where it was postulated that the data on drug use were “missing not at random” (MNAR), and so NORM would give biased results. However, they found that, while drop out depended strongly on factors associated with drug use, these factors were also strongly associated with survey variables with high response rates, and so the MAR assumption was valid. We subscribe to their view that this is the exception rather than the rule with rich social surveys.
In Results, findings are presented for the augmented dataset alone (averaged across five replicates). Comparing the caaw and cww models with the models based on augmented or filled in data we see a consistent disparity in the value of the intraclass class correlation around 0.20 for the first compared with 0.22 for the second. This suggests that by ignoring those with incomplete or partial data we will overemphasise the power of the final model to explain both the variance in GHQ scores between individuals as well as that within individuals over time. Secondly, the relative magnitude of the variances for the between individual component (level 2) compared with the within individual component (level 1) tends to shift. Filled in models tend to account for relatively more of the within individual level variance whereas the caaw and cww models show a consistently smaller between individuals component. This might tentatively be interpreted in terms of greater homogeneity among loyal responders. Among the fixed effects gender reflects a consistent difference that is, interestingly, relatively stronger for the complete cases across all waves data (0.46 compared with 0.38). Otherwise the pattern of fixed effects is reasonably consistent although, augmented data analyses produce three way terms with consistently smaller coefficients. This would tend to render our conclusions to be relatively more conservative than they would be if based on fewer cases over time.
The authors wish to express their thanks to colleagues in the MRC team at UCL, notably Dr Tarani Chandola and Dr Paul Clarke.
↵* Procedures to routinely handle categorical data are not available as freeware.
Funding: this work was funded as part of the Medical Research Council’s “Health of the Public” initiative. Grant number: 9900586.
Conflicts of interest: none declared.