Selection bias and patterns of confounding in cohort studies: the case of the NINFEA web-based birth cohort
- Costanza Pizzi1,2,
- Bianca L De Stavola2,
- Neil Pearce2,3,
- Fulvio Lazzarato1,
- Paola Ghiotti4,
- Franco Merletti1,
- Lorenzo Richiardi1
- 1Cancer Epidemiology Unit, CeRMS and CPO-Piemonte, University of Turin, Turin, Italy
- 2Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK
- 3Centre for Public Health Research, Massey University, Wellington, New Zealand
- 4Department of Health, Piedmont Region, Turin, Italy
- Correspondence to Costanza Pizzi, Cancer Epidemiology Unit, CeRMS and CPO-Piemonte, University of Turin, Via Santena 7, Turin 10126, Italy;
Contributors All authors are responsible for the reported research and have made substantial contributions to the conception and design of the study, acquisition of data, analysis and interpretation of data, drafting the article or revising it critically for important intellectual content. All authors have seen and approved the final version of the manuscript.
- Accepted 10 November 2011
- Published Online First 6 December 2011
Background Several studies have examined the effects of sample selection on the exposure–outcome association estimates in cohort studies, but the reasons why this selection may induce bias have not been fully explored.
Aims To investigate how sample selection of the web-based NINFEA birth cohort may change the confounding patterns present in the source population.
Methods The characteristics of the NINFEA participants (n=1105) were compared with those of the wider source population—the Piedmont Birth Registry (PBR)—(n=36 092), and the association of two exposures (parity and educational level) with two outcomes (low birth weight and birth by caesarean section), while controlling for other risk factors, was studied. Specifically the associations among measured risk factors within each dataset were examined and the exposure–outcome estimates compared in terms of relative ORs.
Results The associations of educational level with the other risk factors (alcohol consumption, folic acid intake, maternal age, pregnancy weight gain, previous miscarriages) partly differed between PBR and NINFEA. This was not observed for parity. Overall, the exposure–outcome estimates derived from NINFEA only differed moderately from those obtained in PBR, with relative ORs ranging between 0.74 and 1.03.
Conclusions Sample selection in cohort studies may alter the confounding patterns originally present in the general population. However, this does not necessarily introduce selection bias in the exposure–outcome estimates, as sample selection may reduce some of the residual confounding present in the general population.
- Sample selection
- selection bias
- residual confounding
- web-based studies
- cohort studies
- directed acyclic graph
- longitudinal studies
- medical statistics
Funding This work was supported by Compagnia SanPaolo/FIRMS, the Piedmont Region, the Italian Ministry of University and Research (MIUR), the Italian Association for Research on Cancer (AIRC) and the Massey University Research Fund (MURF). The Centre for Public Health Research is supported by a Programme Grant from the Health Research Council of New Zealand.
Competing interests None.
Ethics approval The study was approved by the Ethical Committee of the San Giovanni Battista Hospital—A.S.O. C.T.O./C.R.F./Maria Adelaide, Turin, Italy.
Provenance and peer review Not commissioned; externally peer reviewed.