Register for email alerts and news feeds:
This journal | BMJ Group
rss
Journal of Epidemiology and Community Health 2005;59:881-884; doi:10.1136/jech.2004.026740
Copyright © 2005 by the BMJ Publishing Group Ltd.

THEORY AND METHODS

A bootstrap method to avoid the effect of concurvity in generalised additive models in time series studies of air pollution

Adolfo Figueiras1, Javier Roca-Pardiñas2 and Carmen Cadarso-Suárez3

1 Department of Preventive Medicine, University of Santiago de Compostela, Spain
2 Department of Statistics and Operations Research, University of Vigo, Spain
3 Unit of Biostatistics, Department of Statistics and Operations Research, University of Santiago de Compostela

Correspondence to:
Correspondence to:
Dr A Figueiras-Guzmán
Dto de Medicina Preventiva y Salud Pública, Facultad de Medicina, c/San Francisco s/n, 15705 Santiago de Compostela (A Coruña), Spain; aldolfo.figueiras{at}usc.es

Background: In recent years a great number of studies have applied generalised additive models (GAMs) to time series data to estimate the short term health effects of air pollution. Lately, however, it has been found that concurvity—the non-parametric analogue of multicollinearity—might lead to underestimation of standard errors of the effects of independent variables. Underestimation of standard errors means that for concurvity levels commonly present in the data, the risk of committing type I error rises by over threefold.

Methods: This study developed a conditional bootstrap methology that consists of assuming that the outcome in any observation is conditional upon the values of the set of independent variables used. It then tested this procedure by means of a simulation study using a Poisson additive model. The response variable of this model is a function of an unobserved confounding variable (that introduces trend and seasonality), real black smoke data, and temperature. Scenarios were created with different coefficients and degrees of concurvity.

Results: Conditional bootstrap provides confidence intervals with coverages close to nominal (95%), irrespective of the degree of concurvity, number of variables in the model or magnitude of the coefficient to be estimated (for example, for a concurvity of 0.85, bootstrap confidence interval coverage is 95% compared with 71% in the case of the asymptotic interval obtained directly with S-plus gam function).

Conclusions: The bootstrap method avoids the problem of concurvity in time series studies of air pollution, and is easily generalised to non-linear dose-risk effects. All bootstrap calculations described in this paper can be performed using S-Plus gam.boot software.

Abbreviations: GAM, generalised additive models; BS, black smoke

Keywords: air pollutants; computing methodologies; epidemiological research design; risk assessment


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?

Relevant Article

Elephants in the living room and "ISMS" of all kinds–whimsical and reflective contributions from the board
Carlos Alvarez-Dardet and John R Ashton
J Epidemiol Community Health 2005 59: 813. [Extract] [Full Text] [PDF]

This Article

Services
Citing Articles
Google Scholar
PubMed
Topic Collections
Bookmark with

Register for free content

The full back archive is now available for all BMJ Journals. Institutional subscribers may access the entire archive as part of their subscription. Personal subscribers will also have access to all content when logged in. Non-subscribers who register have free access to all articles published before 2006 right back to volume 1 issue 1. Register here to access the free archive of all BMJ Journals.

Don't forget to sign up for content alerts so you keep up to date with all the articles as they are published.

BMJ Careers - Latest infectious diseases and epidemilogy jobs

Infectious diseases and epidemilogy jobs