Article Text

Download PDFPDF
A bootstrap method to avoid the effect of concurvity in generalised additive models in time series studies of air pollution
  1. Adolfo Figueiras1,
  2. Javier Roca-Pardiñas2,
  3. Carmen Cadarso-Suárez3
  1. 1Department of Preventive Medicine, University of Santiago de Compostela, Spain
  2. 2Department of Statistics and Operations Research, University of Vigo, Spain
  3. 3Unit of Biostatistics, Department of Statistics and Operations Research, University of Santiago de Compostela
  1. Correspondence to:
 Dr A Figueiras-Guzmán
 Dto de Medicina Preventiva y Salud Pública, Facultad de Medicina, c/San Francisco s/n, 15705 Santiago de Compostela (A Coruña), Spain; aldolfo.figueirasusc.es

Abstract

Background: In recent years a great number of studies have applied generalised additive models (GAMs) to time series data to estimate the short term health effects of air pollution. Lately, however, it has been found that concurvity—the non-parametric analogue of multicollinearity—might lead to underestimation of standard errors of the effects of independent variables. Underestimation of standard errors means that for concurvity levels commonly present in the data, the risk of committing type I error rises by over threefold.

Methods: This study developed a conditional bootstrap methology that consists of assuming that the outcome in any observation is conditional upon the values of the set of independent variables used. It then tested this procedure by means of a simulation study using a Poisson additive model. The response variable of this model is a function of an unobserved confounding variable (that introduces trend and seasonality), real black smoke data, and temperature. Scenarios were created with different coefficients and degrees of concurvity.

Results: Conditional bootstrap provides confidence intervals with coverages close to nominal (95%), irrespective of the degree of concurvity, number of variables in the model or magnitude of the coefficient to be estimated (for example, for a concurvity of 0.85, bootstrap confidence interval coverage is 95% compared with 71% in the case of the asymptotic interval obtained directly with S-plus gam function).

Conclusions: The bootstrap method avoids the problem of concurvity in time series studies of air pollution, and is easily generalised to non-linear dose-risk effects. All bootstrap calculations described in this paper can be performed using S-Plus gam.boot software.

  • GAM, generalised additive models
  • BS, black smoke
  • air pollutants
  • computing methodologies
  • epidemiological research design
  • risk assessment

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Footnotes

  • Funding: Dr Adolfo Figueiras’ work on this project was funded by Health Research Fund (Fondo de Investigación Sanitaria) grants 00/0010-05 and 99/1189 from the Spanish Ministry of Health, Javier Roca-Pardiñas’ work was funded by a grant from the University of Vigo (Vigo, Spain), and Dr Carmen Cadarso-Suárez’s work was funded by grant BMF2002-03213 from the Spanish Ministry of Science and Technology.

  • Conflicts of interest: none.

Linked Articles