To map out area effects on health research, this study had the following aims: (1) to inventory multilevel investigations of area effects on self rated health, cardiovascular diseases and risk factors, and mortality among adults; (2) to describe and critically discuss methodological approaches employed and results observed; and (3) to formulate selected recommendations for advancing the study of area effects on health. Overall, 86 studies were inventoried. Although several innovative methodological approaches and analytical designs were found, small areas are most often operationalised using administrative and statistical spatial units. Most studies used indicators of area socioeconomic status derived from censuses, and few provided information on the validity and reliability of measures of exposures. A consistent finding was that a significant portion of the variation in health is associated with area context independently of individual characteristics. Area effects on health, although significant in most studies, often depend on the health outcome studied, the measure of area exposure used, and the spatial scale at which associations are examined.
- small area analysis
- multilevel analysis
- scoping study
Statistics from Altmetric.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
A brief search of published reports on area effects on health shows a striking increase over the past decade in the number of studies adopting a multilevel approach to the study of social determinants of health. The impetus for such research probably results from a convergence of conceptual and methodological innovations, including an appreciation of the importance of the social environment to health and greater accessibility of multilevel modelling techniques and software. However, multilevel investigations of area effects on health abound with conceptual and methodological challenges which have given rise to numerous debates. Debated issues are summarised in table 1.
In a previous review of social determinant studies examining effects of area socioeconomic status (SES) on health, 23 of 25 studies reported significant associations between at least one measure of area SES and health, while controlling for individual SES.1 The investigators concluded that data supported the existence of modest small area effects on health but that extant data were replete with methodological problems. More specifically, they stated: “It is clear from our review that investigations of the role of neighbourhood level [small area] social factors on health are characteristics of preliminary, exploratory studies in epidemiology. Certain aspects of study design are in need of improvement before the field can advance […] We hope that this review will show what has already been achieved and point the way to more sophisticated studies of societal determinants of health” (pp 120–121).
In an effort to map out multilevel research on social determinants of health, to identify the types of evidence available, and to gauge whether or not “more sophisticated studies” are being conducted, we undertook a scoping study of research of area effects on health published between July 1998 and December 2005. Unlike the more familiar systematic review, a scoping study addresses broad research topics where many different study designs are applied, with the aim of comprehensively examining the extent, range, and nature of research activity and to identify key concepts and results.36 37
Given the broad diversity of studies, we restricted the scoping review to multilevel investigations of area effects on self rated health (SRH), cardiovascular disease and risk factors, and mortality among adults. These health indicators were selected because of their relevance to understanding the broader socio-spatial patterning of health. SRH is a highly predictive measure of morbidity and mortality, independent of other medical, behavioural, or psychosocial factors,38 and cardiovascular disease is one of the leading causes of mortality in developed countries.
We further restricted study selection to multilevel investigations allowing for estimation of between-area variation (random effects). As pointed out by Merlo and colleagues,24 “clustering of individual health within neighbourhoods (areas) is not a statistical nuisance that only needs to be considered for obtaining correct statistical estimations, but a key concept in social epidemiology that yields important information by itself” (p 443). As measures of variation provide information on the portion of health differences among people that may be attributable to the areas in which they live, they are central to understand the significance of specific contexts for health.24
In keeping with the framework for conducting a scoping study proposed by Arksey and O’Malley,36 the specific objectives of the scoping study were: (1) to provide an inventory of multilevel investigations of area effects on SRH, cardiovascular disease and risk factors, and mortality; (2) to describe and critically discuss the methodological approaches employed and results observed; and (3) to formulate selected recommendations for advancing the study of area effects on health.
The scoping study involved three steps.
In the first step, the identification of studies published between July 1998 and December 2005 ensued from a comprehensive search strategy using the Medline database. We first used “neighbourhood/neighbourhood or area” and “multilevel/multi-level or hierarchical” as words in the title or abstract of articles; this search yielded 634 entries. Inclusion criteria for studies were: publication in the English language, peer reviewed journals, data from adult populations in industrialised countries, use of a multilevel design with at least two units of analysis including individuals and areas, and measurement and analysis of health indicators at the individual level. Studies exploring between country variation in health were excluded, though areas operationalised by subregions of a country—for example, states—were included. Of the 634 studies, 67 meet the inclusion criteria.
We conducted additional searches using “contextual effects”, “place effects”, “census tracts”, and “community” as words in title or abstract, and “residence characteristics and small area analysis” as medical subject headings (MeSH), again limiting the search to studies referring to multilevel methods in their title or abstract. Three additional studies were identified. References lists of all studies compiled and of other sources were reviewed, yielding an additional 16 studies. In all, 86 studies meeting inclusion criteria were retained. In two studies, the investigators reported findings for both self rated health and cardiovascular disease risk factors; these studies appear in both categories.39 40 Overall the sample of the scoping study comprised 88 multilevel investigations of area effects on health.
In the second step, studies were coded by one of us (MR) along the following dimensions: citation and study location; health indicator/analytical variable; research design, year of data collection for individual sample; individual sample size and sex/age distribution; individual characteristics adjusted for; area sample size and operational definition; area level exposures; crude between-area variation; adjusted between-area variation (for individual level variables); and summary of significant findings of adjusted area effects. LG and TAB cross validated half the studies. The coding scheme and abbreviations are summarised in table 2.
In the third step, in order to gauge the accuracy of data compiled, we established interauthor agreement in a random sample of about 25% of studies (n = 21) where the coding of one us (MR) was compared with that of another (TAB) for all coding dimensions except “citation/location” and “summary findings of area effects”. For every dimension, each source of information was equated with one observation. Discrepancies in values reported were considered a disagreement. Overall interauthor agreement was 92.0% (43 disagreements in 513 observations), with agreement ranging between 81.0% and 100% across dimensions. Finally, summary statistics were compiled using the total sample of investigations (n = 88) as the unit of analysis.
Results of study coding appear in the supplementary table, which can be viewed on the journal website (http://www.jech.com/supplemental). Studies are listed alphabetically by surname of first author within each category of health indicator—that is, self rated health (n = 39), cardiovascular morbidity and risk factors (n = 32), and mortality (n = 17). Table 3 presents summary statistics for research design, operational definition of area contours, and exposure as a function of health indicator and time period.
Year of publication and location of studies
There has been a marked increase in the number of studies published on area effects on self rated health, cardiovascular disease and risk factors, and mortality over the 1998–2005 time period, which almost doubled from 2004 to 2005 (fig 1). Most results are from area effects examined in the USA (n = 37) and the United Kingdom (n = 14), although several studies involved data collected in Canada (n = 10), the Netherlands (n = 8), and Sweden (n = 8).
Research design and analytical variables
As shown in table 3, a majority of studies (80.7%) had cross sectional designs, whereas others adopted longitudinal designs (17.0%) wherein a majority of studies involved data from a cohort that were matched with vital statistics records to examine associations with mortality and cardiovascular disease at a later time (designated as “follow up” in the supplementary table). Linear multilevel models for continuous and logistic multilevel models for dichotomous outcomes were the most commonly used statistical models, although some analyses were done on models for ordinal outcomes.45 48 57 64 67 76
Individual data: sample size and variables
Sample size of individuals ranged between 57792 and 2 637 628,101 with a median of 8606 individuals. Sixteen per cent of studies had a sample size over 100 000 individuals, but the majority of studies (61%) had a sample size under 10 000.
Most studies controlled for age, sex, SES, and marital status, but some controlled for other individual characteristics such as health related behaviours, medical conditions, perception of area characteristics, social network, and years of residency in the area. Seven studies did not control for individual SES.56 103 104 112 116 121 124 Most studies targeted general populations, but some restricted their focus to men,96 118 older adults,64 88 92 and racial/ethnic groups.54 55 59 80 83 95
Area data: operational definition, sample size, and exposures
As shown in table 3, the majority of studies (89.8%) operationalised areas using statistical (for example, census tracts) or administrative spatial units (for example, city defined neighbourhoods, boroughs, local authorities), or both. One study delimited areas using geographical information systems,107 and others clustered statistical/administrative spatial units based on similarities in terms of SES, demographic composition, and type of area.41 Most studies had a two level structure, with individuals nested within areas, though some had more complex structures, including cross classification,110 and three level structures—for example, individuals nested within households within areas63 or individuals nested within several hierarchically structured area units.39 44 58 85 111 119 121
Six studies did not report area level sample size. Among studies for which data were reported, sample size ranged from 956 to 12 344 areas.111 Average within-area sample size ranged from 1100 to 36 387 individuals.117 Half the studies (52.4%) had an average within-area sample size of 50 individuals; for 10% of the studies, the within-area sample size was less than five individuals.
Area level indicators of SES such as deprivation, education, and unemployment were generally aggregates of individual level variables derived from censuses and survey data. Others derived measures of area social context (for example, social cohesion, social capital) by aggregating individuals’ perceptions or by the application of ecometric procedures.45 46 48 Over one third of the studies (38.6%) operationalised area exposures using integral measures—that is, features of areas only measurable at an ecological level (table 4). The most commonly used integral measure was income inequality, but other studies relied on characteristics of the social and built environment,50 67 75 92 112 urban sprawl,87 and availability of services and parks.81 88 92 95 106 107
Summary of findings about area effects
Among 47 studies that reported on between-area variation after adjusting for individual characteristics (minimally age, sex, and SES), 27 studies (57.4%) reported significant between-area variation for at least one subgroup (defined either by individual or area level characteristics). Although several studies reported one or the other, several did not report variance components for both unadjusted and adjusted models.
Of the 88 studies, six focused on between-area variation in health only56 60 63 73 89 94 and 82 examined main area effects. Of these studies, six did not report significant direct area effects on health.51 75 93 96 120 122 All other studies reported significant associations between at least one measure of area exposure and at least one health indicator. Significant cross level interactions were observed, indicating that subgroups of individuals may be differentially influenced by certain area characteristics, and by interacting area characteristics. More specifically, effects of area deprivation on poor health, unhealthy behaviours, and risk of mortality were often greater among low SES individuals and among women.
Self rated health
Thirty nine studies examined area effects on self rated health. Four studies focused on between-area variation only, and 35 were cross sectional investigations of associations between area SES and SRH. In all but two studies,51 75 significant associations were observed between at least one measure of area SES and SRH. More specifically, less favourable area socioeconomic conditions were associated with poorer SRH. Area affluence, positive perceptions of area environment, and higher area levels of collective efficacy and social capital were predictive of better SRH, although area residential stability was associated with poorer health.46 48 One study reported associations between poor SRH and unfavourable area level opportunity structures and social functioning, such as poor physical quality of residential environment, lower political engagement, and lower transport wealth.50 Several studies reported significant effects of social processes, characteristics of the built environment, and perceptions of area characteristics as potential mediating pathways of the association between area deprivation/inequality and SRH.46 48 51 53 76
Cardiovascular morbidity and risk factors
Thirty two studies were investigations of cardiovascular disease and risk factors, of which two examined between-area variation only.89 94 Some studies employed follow up96 101 102 and case–control research designs,99 but most were cross sectional.
Twenty three studies examined the direct main effect of area deprivation on cardiovascular disease and risk factors. In all studies, at least one measure of area deprivation was associated with a greater risk for cardiovascular disease and with at least one but not all of the risk factors investigated. In general, greater levels of area deprivation and inequality were associated with a greater likelihood of unhealthy dietary habits, smoking, overweight and obesity, and physical inactivity. However, positive associations between affluence and smoking and drinking behaviours were also observed,78 79 104 and in one study, greater state inequality was associated with lower body mass index among white women.80 A greater likelihood of walking was observed in more deprived areas,88 106 but also in more socially cohesive areas.88
In studies where area exposures were operationalised with integral variables, results showed that in less sprawling areas, the risk of being overweight or obese was lower and levels of walking were higher.87 Involvement in physical activity and walking was more likely in areas characterised by greater availability, accessibility, and density of selected services and green spaces.87 88 92 106 107 Dietary habits were associated with the presence of supermarkets and full service restaurants in the area,95 and greater mean distance to alcohol outlets was associated with lower alcohol consumption.98 High convenience store density and lower distance to convenience store were associated with smoking, although this association was not significant in models controlling for area SES.81
Seventeen studies examined area effects on mortality, most of which reported results from matching of cohort data with vital statistics.
In all but two studies,120 122 results showed that at least one measure of area SES was associated with all cause and cause specific mortality, such that greater area deprivation and income inequality were significantly associated with greater risk of mortality. When adjusting for area SES, greater area religious affiliation was associated with lower risk of all cause mortality,113 and areas with higher social capital were associated with lower risk of all cause mortality and mortality from heart disease,116 while lower levels of social cohesion were associated with higher risk of all cause mortality.118 Area SES confounded the association between air pollution and all cause and cause specific mortality.115
Results of the scoping study showed that the typical methodological approach for multilevel investigations of area effects remains a cross sectional two level study wherein individuals are nested within areas delimited by administrative/statistical spatial units, area level indicators are operationalised using aggregates of individual variables, and direct associations between area exposures and individual level outcomes are adjusted for selected individual characteristics. Various large datasets produced several publications, and it should be noted that findings emanating from the same dataset clearly are not independent.
Over-reliance on cross sectional research designs raises the critical issue of “self selection”—that is, the fact that people will be selected into residential areas based on individual attributes which are themselves related to health.2 To deal with this issue, one frequently used strategy consists of controlling for individual level variables that are potential confounders of associations between area characteristics and health outcomes. Although useful, this modelling strategy does not overcome problems associated with misspecification resulting from omitted or mismeasured individual level variables, or with lack of statistical power. Addressing these methodological issues rests on crafting longitudinal studies, assessing individual characteristics over the life course, and endeavouring to study cross level interactions and mediating pathways. In this regard, several investigators have emphasised the value of innovative methodological approaches.125
In multilevel studies, there are at least two units of analysis: individuals and areas. However, the attention that is usually devoted to measuring individual attributes is infrequently carried over to areas. For example, in six studies area sample size was not reported, and most studies provided little information on the validity and reliability of area level measures of exposure. When measuring exposure, most studies relied on indicators of area SES derived from censuses and other surveys. Although easily accessible, such measures provide only truncated information about the context of areas,7 17 and may in fact be endogenous to the composition of the areas as they are determined by individual characteristics of residents.2 35 To this end, some studies have tapped into measuring area exposures by means of ecometric procedures.20 23 45 46 48
Most studies employed administrative or statistical spatial units to define area contours, despite their recognised limitations—that is, their potential lack of intrinsic meaning in relation to health.1 5 8 13–15 Such limitations are evidenced by variations in the strength and magnitude of area effects on health according to the operational definition of areas.44 58 65 111 We view as particularly innovative recent initiatives to define area contours by delimiting a radius around individual residential location and post codes.107 126 127
Between-area variation was reported using a variety of statistical parameters including variance components and standard error coefficients, intraclass correlation coefficients for continuous and dichotomous outcomes, plausible value ranges, and others, which precludes comparisons across studies. Furthermore, several studies did not report variance components for either unadjusted or adjusted models. This is unfortunate given the importance of measures of variation for understanding the socio-spatial patterning of health. There is clearly a need for more detailed and consistent reporting of between-area variation.24
Power estimation in multilevel studies is complicated by the need to account for the nested structure of the data.128 Only one study reported power calculations.91 Routine reporting of a priori power estimates and post hoc effect size calculations is warranted in multilevel studies.
Although significant area effects were reported in a majority of studies, critical examination of findings revealed inconsistencies within and across studies. For example, one study examined associations between area deprivation and SRH for different operational definitions of area; significant area effects were observed for one type of area only.65 Others have examined effects of area deprivation on several cardiovascular disease risk factors, but observed significant effects only for some of them.78 83 Within individual studies, associations between health and several indicators of area SES were frequently investigated. Although there is probably multicolinearity between measures of area exposures,44 81 119 124 often only certain indicators were significantly associated with the health outcome. This suggests that indicators of area SES such as average income, educational attainment, and income inequality may be tapping into different aspects of the social environment and may be differently associated with specific health outcomes. In addition, a few studies reported no significant area effects. Of concern, findings of the scoping study show that area effects on a specific health indicator may be dependent on the measure of area exposure and the spatial level (area unit) at which associations are investigated. This clearly underscores the importance of conceptualising plausible causal pathways in the search for new knowledge.
The diversity in research designs employed and in the reported results calls attention to the need to move toward a set of reporting guidelines for multilevel investigations of area effects on health. This could include, but not be exclusive to, the following: first, greater details accorded to the description of the nested structure of the database with systematic reporting of sample size at both individual and area levels, and a more explicit discussion of the validity and reliability of area level measures of exposures; second, description of the analytical strategy in such a way that would allow replication of analyses; and third, better description of the between-cluster variation in health outcomes, both in unadjusted and adjusted models for individual characteristics.24
What is already known
A significant portion of the variation in health is associated with area context independently of individual characteristics.
The main limitation of our scoping study pertains to inclusion criteria for studies. A first issue concerns keyword and MeSH searches in the Medline database. Because the area of research emerged only recently, keywords listed by authors do not necessarily correspond to keywords used for searching the literature in the MeSH system. Rather, when searching for “neighbourhood” and “area”, MeSH uses the terms “residence characteristics” and “small area analysis”. Thus if these MeSH terms were not used to classify an article, and if the selected keyword—that is, “neighbourhood”, “area”, “multilevel”, and “hierarchical”—were not specified either in the title or in the abstract of an article, then it would not have been identified. To overcome this problem, more encompassing terms were used and reference lists of all identified articles were reviewed to identify studies. Nonetheless, we acknowledge that some studies may not have been identified with our search strategy.
What this paper adds
This paper is a scoping study of 86 multilevel investigations of area effects on self rated health, cardiovascular morbidity and risk factors, and mortality among adults. Area effects on health, although significant in most studies, often depend on the health outcome studied, the measure of area exposure used, and the spatial scale at which associations are examined.
This highlights the importance conceptualising causal pathways linking area context to health outcomes and devising appropriate methodological strategies in future research.
A second limitation pertains to the exclusion of the literature on area effects on mental health, non-cardiovascular disease morbidity, paediatric populations, maternal health, and health services utilisation. Several of these studies have used innovative methodological approaches and analytical procedures which could provide the reader with other perspectives on area effects on health.126 127 129 130 It seems relevant to undertake a review of these studies as well to ascertain similarities and differences in the range and types of investigations conducted.
Various health outcomes are influenced by area context although the specific processes through which such influences occur remain unclear. The implementation and evaluation of policy interventions aimed at changing area exposures represents an opportunity to fill this knowledge gap.
The results of the scoping study raise several important issues. One issue pertains to whether or not “true” area effects are concealed by less than adequate methodologies. This may indeed be the case. Several studies showed significant between-area variation and area effects independently of individual characteristics. These are quite considerable findings given that most studies suffered from methodological limitations and lack of precision when operationalising and measuring context. Significant results garnered with such measurement error probably underestimate effect sizes.
In recent years, increased attention has been directed towards formulating and testing theoretically based pathways between more specific area level measures of exposures and more specific health indicators, such as the effect of the density of food stores on dietary habits, or the effect of urban form on physical activity involvement. These measures of area exposures may be more proximal to influence everyday health related behaviours, thus operating on the pathways between area SES and broader health outcomes.
A second issue is whether or not widely used methodologies for studying areas and health are well suited to the task. Multilevel modelling, without being a panacea, is a suitable statistical procedure that can be used to analyse data with nested sources of variability, while accounting for the non-independence of within-cluster observations—that is, addressing the non-random processes situating people with similar characteristics into certain types of areas.25 As operationalisation of area contours essentially falls back on using readily available spatial units such as census tracts, space is fragmented into seemingly independent area units, therefore ignoring spatial associations between areas. As pointed out by Chaix and colleagues,126 multilevel modelling procedures are based on the assumption that spatial correlations can be reduced to within-area correlation. For this reason, multilevel approaches may provide only limited information on the spatial distribution of health outcomes and ecological exposures, both when modelling variations and when investigating associations.126 Further debates underscore limitations of multilevel models in detecting causal effects of area exposures on health outcomes.2 15 35 131
Another issue relates to the most promising approaches in this area of research. In recent years, didactic and conceptual tutorials linking social epidemiological concepts to multilevel analysis have been published.24 26 28 132 Innovative methodological approaches and analytical procedures have been applied to examining area effects on health, such as geographical information systems used to operationalise area contours and ecological exposures,107 126 127 development of measurement technology (for example, ecometrics20 22 45 46 48), geostatistical modelling and spatial analysis,47 115 126 133 multilevel path analyses,92 and multilevel structural equation modelling to test potential mediating pathways between area exposures and health outcomes.53 88 Others have underscored the value of developing experimental research designs such as randomised community trials,35 and natural134 and social experiments.135
What seems to emerge from the accumulating evidence on area effects on health is a “specific” research agenda. As argued by other investigators,17 133 136 we espouse the view that the adoption of a specific research approach to examine area effects on health—that is, one that would conceptualise, operationalise, and measure associations between specific health outcomes and specific area exposures—across specific spatial area units may yield more informative evidence of area effects. Adopting a specific approach shows the greatest promise for advancing theoretically based pathways, providing a basis for more precise definitions and measures of ecological exposures, and improved delimitations of area contours.
The supplementary table can be viewed on the journal website, http://www.jech.com/supplemental
MR is recipient of a Canada graduate scholarships doctoral award and TAB is recipient of a postdoctoral fellowship, both from the Canadian Institutes of Health Research. Both are working under the supervision of LG.
We thank Steven Cummins for his helpful comments on an earlier draft of the manuscript.
- medical subject heading
- socioeconomic status
- self rated health