The exposure term in the case-crossover design consists in the difference between the ambient concentration on the event day and the concentration(s) on some control day(s). So far, all air pollution case-crossover studies presented the distribution of the daily ambient pollutant concentrations but do not publish the distributional properties of the relevant exposure term—that is, the concentration difference. This article shows that this difference can be very small for a large fraction of event days, therefore, seriously limiting the statistical power to refute the null hypothesis. Publishing the distribution of the relevant differences will improve the interpretation and discussion of findings from case-crossover studies, particularly in cases with statistically non-significant associations.
- air pollution
- study design
- statistical power
Statistics from Altmetric.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
Various study designs have been used to investigate the statistical association between daily levels of air pollution and consecutive acute changes in mortality and morbidity rates. Time series studies, for example, AHPEA in Europe or NMMAPS in the USA make up the vast majority of these investigations.1,2 However, the case-crossover design, first described in 1991,3 is a very attractive alternative approach to investigate acute effects. The design samples cases only, and effect estimates are based on the within subject comparison of exposure at failure (or event) times with exposure at control times.4 Each subject serves as their own control, leading to the benefit that stable subject specific covariates such as genetic predisposition, smoking, or other risk factors cannot confound the effect estimates. In contrast with time series studies, the case-crossover approach permits the use of individual characteristics such as age, sex, health status, or lifestyle factors to assess effects of air pollution among potentially susceptible subgroups or to explore modification of air pollution effects by individual characteristics.5,6
More than 20 ambient air pollution studies have adopted the case-crossover design for clinical events or death6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26 and others have discussed methodological aspects of the application.27–33 None of the studies, however, published information on the distribution of the relevant exposure term—that is, the concentration differences between event and control days. In this paper, we show that the distribution of these differences is an important missing characteristic for the evaluation of power, the interpretation of findings, and the comparison across studies. For a broader discussion of strengths and limitations of the case-crossover design we refer the interested reader to a recent review of this approach in air pollution epidemiology5 and the above mentioned methodological papers.
THE RELEVANT EXPOSURE TERM IN CASE-CROSSOVER STUDIES IS A DIFFERENCE
In the traditional air pollution time series studies such as APHEA or NMMAPS34,35 the relevant exposure term is the daily level of some ambient air pollutants. Accordingly, these studies publish characteristics of the distribution of these concentrations such as the mean, interquartile, or total range. Keeping all other factors constant, it may be argued that effects are more easily estimated in a city with a wide range across daily concentrations. However, only the short term component of this variation is actually relevant for the power of this type of statistical analysis whereas long term and seasonal fluctuations in air pollutant concentrations are filtered out by the trend and seasonal variables incorporated in the respective models.
The case-crossover design compares ambient air pollutant concentrations at or shortly before an event—for example, a cardiac arrest17—with concentrations on a control day, using conditional logistic regression. (As usual in air pollution epidemiology, ambient concentrations are used as a surrogate measure or indicators of exposure whereas true personal exposure to pollution from ambient origin is not known. To reflect the use of an indicator of exposure, we set “exposure” in quotes). Under the null hypothesis, “exposure” on control days should not be different from “exposure” at (or shortly before) the event. Control days are selected in various ways, most often 7, 14, or 21 days before and/or after the event, to control for time varying cofactors that might be associated with day of the week. Bias is reduced if control days are selected bi-directionally, considering the mean concentrations on the days k weeks before and after the event (anterior and posterior controls, respectively) as the “control exposure”, where k commonly equals 1, 2, 3, or 4.31 As usual in epidemiological studies, the distribution of the exposure term (that is, in our case the concentration difference between event and control days) is a driving force of statistical power. If ambient air pollution has a true effect on the event probability, the null hypothesis can only be rejected if the nature of the ambient air pollution time series is such that the difference between event and control days shows sufficient variation around a non-zero mean value. For example, if ambient concentrations show very little variability between event and control days, this leads to similar “exposure” levels in the two periods, thus limiting the power to detect effects. The more variable these concentration differences are, the higher the power to detect a true association (see appendix, available on the journal web site http://www.jech.com/supplemental).
So far, none of the air pollution case-crossover studies published the distributional properties of this exposure term (that is, the difference in “exposure” between event and control days). To show the importance of distinguishing between this relevant exposure term and the absolute daily levels of pollutants we use pollution data from three communities. Formal results regarding statistical power in the case-crossover and the Poisson time series design are given in the appendix (available on line http://www.jech.com/supplemental).
METHODS AND RESULTS
Table 1 lists the “exposure” distribution as typically provided in case-crossover air pollution studies. We only show those studies with sample sizes (that is, number of events) up to 1000 and studies with non-significant findings. However, similar information can be found in all publications.
To show the distinction between the distribution of daily mean concentrations—relevant in time series studies—and the distribution of the concentration differences between “event” and “control” days—relevant for case-crossover studies—we used daily nitrogen dioxide (NO2) data from three Swiss locations.
We chose the 1999–2000 daily fixed site monitor data from a rather clean alpine community, Davos, the city of Basle in the more populated area of the upper Rhine, and the city of Lugano in the southern part of Switzerland with the highest concentrations. These three communities belong to the SAPALDIA study, and daily NO2 data were easily available.36 However, the choice of the pollutant is not relevant for our purposes, and the conclusions apply to all exposure terms used in case-crossover studies.
The first part of table 2 shows the distributional characteristics of these daily ambient NO2 concentrations as usually provided in time series analyses and case-crossover studies. The second part of the table presents the distribution of the exposure term that would be relevant if you were to conduct a case-crossover study in these communities.
In this hypothetical example, we assume that events may occur every day (“event” days). We further assume a bi-directional approach. The relevant term of daily “exposure” was therefore calculated as the difference between the concentration on the respective day and the average concentration over the two “control” days, chosen seven days before and after (bidirectional), as done in some studies.12,20
In Davos, on 75% of the days (or events), the “control” values differ by less than 7.7 μg/m3 from those of the corresponding “event” days. For half of all days, event and control “exposure” will differ by less than 2.8 μg/m3, whereas the interquartile range of daily NO2 values is 13 μg/m3. Moreover, 90% of “exposure” differences do not exceed 17.1 μg/m3, whereas the 80% range of daily concentrations is 29 μg/m3. Restriction of the time series to summer periods, where fluctuations in NO2 tend to be lower, further decreases the expected contrast between “event” and “control” concentrations with 90% of differences not exceeding 5.2 μg/m3 (last part of table 2).
We have shown that the exposure term relevant in case-crossover studies can have rather narrow distributions, therefore statistical power (see appendix) may be jeopardised. The distribution of the difference between concentrations on “event” and “control” days—the only relevant exposure term in the case-crossover design—needs to be presented. The distribution of the daily concentrations does not reveal this relevant information. In fact, the daily concentrations usually spread over much broader ranges. Thus focusing on that distribution in the planning and evaluation of case-crossover studies is misleading. As shown in the appendix, the variation of concentration differences rather than of absolute levels determines the statistical power of this type of analysis.
Comparing tables 1 and 2 (first part) shows that the cities chosen in our hypothetical example have ranges in the pollution levels comparable to those reported in many case-crossover studies. The spread (for example, 10th to 90th centile) is similar in the Swiss cities (that is, 29 to 40 μg/m3) and slightly lower in Seattle, but higher in Boston and Kaohsiung. Therefore, we consider the Swiss examples a reasonable model.
As shown in these data, the nature of the pollution time series can be such that for a large number of potential “event” days concentrations would be very similar to those on the days chosen as controls. Adjustment for seasonal and weather factors, as usually done, would most probably further reduce the contrast between “event” and “control” exposure terms. In this case, the ability to observe a statistical difference between event and control days is substantially reduced and effect estimates tend to be driven by events occurring during the winter period or a few influential event-control “exposure” pairs. In summary, power to detect a statistically significant effect of ambient air pollution is a priori low if short term variability of exposure is small.
The power calculation in the appendix shows that sample size, as usual, is an important element. As case-crossover studies in contrast with time series analyses often involve rather small samples, for example, a few hundred events11,13,14,17,25 statistical power is further reduced. A prior assessment of the distribution of the relevant exposure term and its impact on power would lead investigators to strategies aiming at increasing the event sample size. For example, the distribution of NO2 concentration differences in Davos is so narrow that you would either need a very large number of events or choose a different location to set up a case-crossover study with sufficient statistical power to detect, for example, an effect of air pollution on mortality. The studies from Seattle, Taiwan, and Vancouver, and also some studies from France13,17,20,25,26 have comparatively small samples. In all theses cases, “exposure” on case and control days might not have differed enough to warrant sufficient statistical power for detecting existing effects. Based on the published data of one study it seems that in Vancouver the range of daily concentrations was even smaller than in our example from Davos. The IQR of NO2 was only 5.4 μg/m3 in Vancouver compared with about 13 μg/m3 in Davos. The range of the relevant exposure term might have been extremely small in this study that relied on less than 100 events.26 In case of the Seattle studies, the reported distribution of daily PM values (table 1) suggests that daily variability of air pollution was comparable to the one seen in Davos (table 2).
Our three examples show that the exposure conditions would be more favourable if you were to conduct a case-crossover study in the city of Basle or Lugano where differences can be expected to be roughly two to three times (summer) as large as in Davos. The respective information is not provided in any case-crossover study and cannot be inferred from table 1 data. In fact, the distribution of the difference depends on the choice of the lag between event and control days, the day to day variability, and thus, ultimately, on prevailing weather and emission patterns, which may differ across regions, seasons, and/or over time.
In light of null findings, a rigorous discussion of power is required (see appendix). Data presented in the format of table 1 permit only speculation that power was insufficient in some studies. Presentation of data in the format of table 2 (second part) would show whether the size of the difference is small for a large fraction of days or not.
What this paper adds
It is good epidemiological practice to publish the exposure distribution. So far, air pollution case-crossover studies have neither reported nor discussed the exposure term relevant for this design.
This paper emphasises the potential lack of power in case-crossover studies. It makes a strong case for full disclosure of the relevant exposure information.
It is important to distinguish “no effects” from a lack of statistical power to detect existing effects. The proposition to publish the relevant exposure information increases an understanding of this distinction. This leads to appropriate interpretations of research findings on acute effects of ambient air pollution that ultimately frame policies.
The need to investigate and publish the relevant exposure term is also amplified in case-crossover studies comparing associations for a set of pollutants or providing multi-pollutant models. Multi-pollutant models9,13,16 are often used to investigate the varying contribution of specific pollutants or sources. The ability to disentangle pollutant effects depends on their correlation. However, in the case-crossover study the correlation between concentration differences rather than across absolute concentration values is of relevance and should therefore be reported. In fact, correlation patterns of absolute daily levels are likely to be distinct from those of the concentration differences relevant in the case-crossover design. In addition, the issue is important in the evaluation and control of other covariates, for example, weather factors. Here, again, the relevant correlation is the one between concurrent differences and not the one between concurrent absolute daily values.
Given that the daily level of ambient air pollution is not the exposure term relevant for case-crossover studies, it is misleading to report associations, as done in many papers, for the interquartile range of this measure. Estimates for different pollutants will be comparable only if presented for the interquartile range of the relevant exposure term—that is, the concentration difference between “event” and “control” days.
Reporting of the relevant study specific “exposure” data (table 2) will permit the comparison of results across studies, for meta-analyses of comparable studies to be conducted, the advancement of our understanding of results, and the improvement in the ability to decide which null findings may have resulted from a lack of power. Future studies should aim at the statistical power necessary for detecting true effects and for determining whether variation in these effects may reflect geographical differences in the toxicity of pollutants or in the susceptibility of populations. Thus, we strongly recommend that case-crossover studies investigate and publish the distributional properties of the difference between “event” and “control” exposure and the correlation of this measure between pollutants. The mere distribution of daily values does not disclose relevant properties of the concentration difference between event and control days.
The appendix is available as a downloadable PDF (printer friendly file).
If you do not have Adobe Reader installed on your computer,
you can download this free-of-charge, please Click here
Files in this Data Supplement:
- [view PDF] -
1. Details of the power calculation for the case-crossover-model.
2. Details of the power calculation for the Poisson-regression model.
- [view PDF] - 1. Details of the power calculation for the case-crossover-model.
Funding: Professor Kuenzli is supported by the Southern California Environmental Health Sciences Center (National Institute of Environmental Health Sciences grant P30 ES07048) and the Hastings Foundation.
Conflicts of interest: none declared.