Elsevier

NeuroImage

Volume 163, December 2017, Pages 115-124
NeuroImage

Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker

https://doi.org/10.1016/j.neuroimage.2017.07.059Get rights and content

Highlights

  • Chronological age can be accurately predicted using convolutional neural networks.

  • Age predicted is accurate even using raw structural neuroimaging data.

  • Brain-predicted age can be generated in a clinically applicable timeframe.

  • Brain-predicted age is significantly heritable.

  • Brain-predicted age is highly reliable, both within and between scanners.

Abstract

Machine learning analysis of neuroimaging data can accurately predict chronological age in healthy people. Deviations from healthy brain ageing have been associated with cognitive impairment and disease. Here we sought to further establish the credentials of ‘brain-predicted age’ as a biomarker of individual differences in the brain ageing process, using a predictive modelling approach based on deep learning, and specifically convolutional neural networks (CNN), and applied to both pre-processed and raw T1-weighted MRI data.

Firstly, we aimed to demonstrate the accuracy of CNN brain-predicted age using a large dataset of healthy adults (N = 2001). Next, we sought to establish the heritability of brain-predicted age using a sample of monozygotic and dizygotic female twins (N = 62). Thirdly, we examined the test-retest and multi-centre reliability of brain-predicted age using two samples (within-scanner N = 20; between-scanner N = 11). CNN brain-predicted ages were generated and compared to a Gaussian Process Regression (GPR) approach, on all datasets. Input data were grey matter (GM) or white matter (WM) volumetric maps generated by Statistical Parametric Mapping (SPM) or raw data.

CNN accurately predicted chronological age using GM (correlation between brain-predicted age and chronological age r = 0.96, mean absolute error [MAE] = 4.16 years) and raw (r = 0.94, MAE = 4.65 years) data. This was comparable to GPR brain-predicted age using GM data (r = 0.95, MAE = 4.66 years). Brain-predicted age was a heritable phenotype for all models and input data (h2 ≥ 0.5). Brain-predicted age showed high test-retest reliability (intraclass correlation coefficient [ICC] = 0.90–0.99). Multi-centre reliability was more variable within high ICCs for GM (0.83–0.96) and poor-moderate levels for WM and raw data (0.51–0.77).

Brain-predicted age represents an accurate, highly reliable and genetically-influenced phenotype, that has potential to be used as a biomarker of brain ageing. Moreover, age predictions can be accurately generated on raw T1-MRI data, substantially reducing computation time for novel data, bringing the process closer to giving real-time information on brain health in clinical settings.

Introduction

The human brain changes across the adult lifespan. This process of brain ageing occurs in accord with a general decline in cognitive performance, cognitive ageing. Although the changes associated with brain ageing are not explicitly pathological, with increasing age comes increasing risk of neurodegenerative disease and dementia (Abbott, 2011). However, the wide range of onset ages for age-associated brain diseases indicates that the effects of ageing on the brain vary greatly between individuals. Thus, advancing our understanding of brain ageing and identifying biomarkers of the process are vital to help improve detection of early-stage neurodegeneration and predict age-related cognitive decline.

One promising approach to identifying individual differences in brain ageing derives from the research showing that neuroimaging data can be used to accurately predict chronological age in healthy individuals, using machine learning (Dosenbach et al., 2010, Franke et al., 2010). By ‘learning’ the correspondence between patterns in structural or functional neuroimaging data and an age ‘label’, machine-learning algorithms can formulate massively high-dimensional regression models, fitting large neuroimaging datasets as independent variables to predict chronological age as the dependent variable. The resulting brain-based age predictions are generally highly accurate, particularly when algorithms learn from large training datasets and are applied to novel or ‘left-out’ data (i.e., test datasets).

Neuroimaging-derived age predictions have been explored in the context of different brain diseases. By training models on healthy individuals, brain-based predictions of age can then be made in independent clinical samples. If ‘brain-predicted age’ is greater than an individual's chronological age, this is thought to reflect some aberrant accumulation of age-related changes to the brain. The degree of this ‘added’ brain ageing can be simply quantified by subtracting chronological age from brain-predicted age. This approach is being used more frequently and has demonstrated increased brain-predicted age in adults with mild cognitive impairment who progress to Alzheimer's (Franke and Gaser, 2012, Gaser et al., 2013), after traumatic brain injury (Cole et al., 2015), in schizophrenia (Koutsouleris et al., 2013, Schnack et al., 2016), HIV (Cole et al., 2017c), epilepsy (Pardoe et al., 2017), Down's syndrome (Cole et al., 2017a) and diabetes (Franke et al., 2013). At the same time, brain-predicted age has been used to demonstrate protective influences on brain ageing, including meditation (Luders et al., 2016) and increased levels of education and physical exercise (Steffener et al., 2016). Evidently, the extent to which one's brain resembles the typical structure or function appropriate for one's age can be affected by both positive and negative influences. By conceptualising brain ageing in this manner, highly-complex multivariate datasets and statistical procedures can be reformulated into an intuitively straightforward and widely-applicable biomarker. However, the practicality of using such a marker clinically, its reliability and relevance for normal variation in brain ageing need to be further demonstrated.

One hindrance to clinical applications for neuroimaging generally is the time needed for image ‘post-processing’ after acquisition (referred to as ‘pre-processing’ by neuroimagers), which can take hours or days, while clinical decisions often need to occur in minutes or less. Regardless of learning algorithm, previous brain-predicted age studies have required several pre-processing stages. Such steps are typically a sequence of data transformations that produce a representation of the original images that is sufficiently structured, compact and informative to support machine learning. These include the removal of non-brain tissue (i.e., skull stripping or brain extraction), affine or non-linear image registration, interpolation and smoothing. While pre-processing may reduce noise and permit voxelwise inter-individual statistical comparisons, there are numerous additional assumptions required for any pre-processing pipeline. These assumptions are often not met, particularly when analysing brain images containing gross pathology (Avants et al., 2008, Liu et al., 2015) and can even be an increased source of error. Recently, however, modelling methods that require little or no image pre-processing have become available, including so-called ‘deep learning’.

The resurgence of interest in artificial neural networks for learning data representations, deep learning, offers a new way of approaching statistical modelling in neuroimaging, thanks to improvements in computing infrastructure. When sufficiently large volumes of data are available, no ‘hand-engineering’ (i.e., manually selecting a priori which features should be used as input) is needed as the deep learning algorithm is able to infer a compact representation of the data, starting only with raw images as input, which is optimally tailored for the particular predictive modelling task at hand. In this respect, deep learning offers several practical advantages for high-dimensional prediction tasks, that should enable the learning of both physiologically-relevant representations and latent relationships (Plis et al., 2014). Of particular interest to us is the potential for deep learning techniques, such as convolutional neural networks (CNN), to make predictions from raw, unprocessed neuroimaging data, thus obviating the reliance on time-consuming pre-processing and improving the clinical applicability of models of brain ageing.

Beyond improving clinical applicability, a biomarker of brain ageing needs to relate to naturally occurring variation, such as that caused by genetic factors. Many aspects of brain ageing and susceptibility to age-related brain disease are thought to be under genetic influence (Lee and Sachdev, 2014, Lu et al., 2004, Peters, 2006, Teter and Finch, 2004). Therefore, demonstrating a brain ageing biomarker is sensitive to genetic influences gives some external, genetic, validity to the measure. Furthermore, if a neuroimaging biomarker is heritable, this motivates further research into specific candidate genes, or sets of genes, that may affect this aspect of brain ageing. These candidate genes can then, in turn, provide biological targets for pharmacological interventions which aim to improve brain health in older adults.

Another important facet of any biomarker is reliability. If a biomarker is to be evaluated longitudinally, in clinical trials or research settings, to track change over time, establishing test-retest reliability is vital. Furthermore, as many neuroimaging studies are now international collaborative efforts, data collection often takes place across multiple scanning sites. Therefore, between-scanner reliability, which indicates that a method of obtaining a biomarker is generalizable to data acquired from other sites, is of increasing importance.

In this work, we sought to establish the credentials of CNN-predicted age as a potential biomarker of brain ageing in three different ways: 1) Demonstrate that CNNs can accurately predict age using structural neuroimaging data and compare predictions using pre-processed and ‘raw’ input data; 2) Establish the heritability of brain-predicted age using a sample of monozygotic and dizygotic twins; 3) Assess both the test-retest (i.e., within-scanner) and multi-centre (i.e., between-scanner) reliability of brain-predicted age.

Section snippets

Datasets

All neuroimaging data used in the study were T1-weighted MRI scans. Details of the participants in the specific samples and the respective acquisition parameters used are outlined below:

Convolutional neural networks accurately predict age using neuroimaging

Analysis showed that our CNN method could accurately predict the chronological age of healthy adults, using either processed volumetric maps or raw T1-MRI data (see Table 1). Prediction accuracy was similar for GPR. The lowest MAE achieved was using GM data and CNN analysis (MAE = 4.16 years), though other predictions were generally comparable. Using single tissues (i.e., GM or WM) did not appreciably alter the prediction accuracy compared to using all available input data for each subject

Discussion

Using 3D convolutional neural networks, we accurately estimated chronological age from raw T1-weighted MRI brain scans of healthy adults. The accuracy of CNN for age prediction was also high when using processed GM and WM voxelwise images, and was comparable with age estimations made using GPR. Brain-predicted age estimates were significantly heritable and showed high levels of within-scanner and between-scanner reliability. These findings support the idea that deep learning methods can

Conclusions

Deep learning models based on T1-MRI can accurately predict chronological age in healthy individuals. This can be achieved using raw MRI data, with a minimum of processing necessary to generate an accurate age prediction. These estimates of brain-predicted age are also considerably heritable, giving external, genetic, validity to the measure and motivating its use in genetic studies of brain ageing. Finally, our analysis showed the brain-predicted age is highly reliable and thus appropriate for

Acknowledgements

The TwinsUK study was funded by the Wellcome Trust, Medical Research Council, European Commision’s Seventh Framework Program (FP7/2007-2013, GA No 259749). The study also receives support from the National Institute for Health Research (NIHR), BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy's and St Thomas' NHS Foundation Trust in partnership with King's College London. The STudy Of Reliability of MRI (STORM) was funded by the NIHR Biomedical Research Centre

References (68)

  • J. Kleesiek et al.

    Deep MRI brain extraction: a 3D convolutional neural network for skull stripping

    NeuroImage

    (2016)
  • A. Klein et al.

    Evaluation of volume-based and surface-based brain image registration methods

    NeuroImage

    (2010)
  • E. Konukoglu et al.

    Neighbourhood approximation using randomized forests

    Med. Image Anal.

    (2013)
  • W.S. Kremen et al.

    Genetic and environmental influences on the size of specific brain regions in midlife: the VETSA MRI study

    NeuroImage

    (2010)
  • T. Lee et al.

    Genetic influences on cognitive functions in the elderly: a selective review of twin studies

    Brain Res. Rev.

    (2010)
  • E. Luders et al.

    Estimating brain age using high-resolution pattern recognition: younger brains in long-term meditation practitioners

    NeuroImage

    (2016)
  • B. Mwangi et al.

    Prediction of individual subject's age across the human lifespan using diffusion tensor imaging: a machine learning approach

    NeuroImage

    (2013)
  • H.R. Pardoe et al.

    Structural brain changes in medically refractory focal epilepsy resemble premature brain aging

    Epilepsy Res.

    (2017)
  • H.R. Pardoe et al.

    Motion and morphometry in clinical and nonclinical populations

    NeuroImage

    (2016)
  • J. Steffener et al.

    Differences between chronological and brain age are related to education and self-reported physical activity

    Neurobiol. Aging

    (2016)
  • B. Teter et al.

    Caliban's heritance and the genetics of neuronal aging

    Trends Neurosci.

    (2004)
  • A.M. Winkler et al.

    Cortical thickness or grey matter volume? The importance of selecting the phenotype for imaging genetics studies

    NeuroImage

    (2010)
  • A. Abbott

    Dementia: a problem for our age

    Nature

    (2011)
  • H. Akaike

    A new look at the statistical model identification

    IEEE Trans. Automat. Control

    (1974)
  • W.F.C. Baaré et al.

    Quantitative genetic modeling of variation in human brain morphology

    Cereb. Cortex

    (2001)
  • S.A.H. Batouli et al.

    Heritability of brain volumes in older adults: the older Australian twins study

    Neurobiol. Aging

    (2014)
  • S. Boker et al.

    OpenMx: an open source extended structural equation modeling framework

    Psychometrika

    (2011)
  • T.J. Bouchard

    The Wilson Effect: the increase in heritability of IQ with age

    Twin Res. Hum. Genet.

    (2013)
  • X.W. Chen et al.

    Big data deep learning: challenges and perspectives

    IEEE Access

    (2014)
  • J.H. Cole et al.

    Brain-predicted age in Down Syndrome is associated with β-amyloid deposition and cognitive decline

    Neurobiol. Aging

    (2017)
  • J.H. Cole et al.

    Prediction of brain age suggests accelerated atrophy after traumatic brain injury

    Ann. Neurol.

    (2015)
  • J.H. Cole et al.

    Brain age predicts mortality

    Mol. Psychiatry

    (2017)
  • J.H. Cole et al.

    Increased brain-predicted aging in treated HIV disease

    Neurology

    (2017)
  • L. Deng et al.

    Deep learning: methods and applications

    Found. Trends Signal Process.

    (2013)
  • Cited by (579)

    • Application of AI in biological age prediction

      2024, Current Opinion in Structural Biology
    • Deep Learning and Geriatric Mental Health

      2024, American Journal of Geriatric Psychiatry
    View all citing articles on Scopus
    View full text