Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker

doi:10.1016/j.neuroimage.2017.07.059

NeuroImage

Volume 163, December 2017, Pages 115-124

https://doi.org/10.1016/j.neuroimage.2017.07.059

Highlights

• Chronological age can be accurately predicted using convolutional neural networks.
• Age predicted is accurate even using raw structural neuroimaging data.
• Brain-predicted age can be generated in a clinically applicable timeframe.
• Brain-predicted age is significantly heritable.
• Brain-predicted age is highly reliable, both within and between scanners.

Abstract

Machine learning analysis of neuroimaging data can accurately predict chronological age in healthy people. Deviations from healthy brain ageing have been associated with cognitive impairment and disease. Here we sought to further establish the credentials of ‘brain-predicted age’ as a biomarker of individual differences in the brain ageing process, using a predictive modelling approach based on deep learning, and specifically convolutional neural networks (CNN), and applied to both pre-processed and raw T1-weighted MRI data.

Firstly, we aimed to demonstrate the accuracy of CNN brain-predicted age using a large dataset of healthy adults (N = 2001). Next, we sought to establish the heritability of brain-predicted age using a sample of monozygotic and dizygotic female twins (N = 62). Thirdly, we examined the test-retest and multi-centre reliability of brain-predicted age using two samples (within-scanner N = 20; between-scanner N = 11). CNN brain-predicted ages were generated and compared to a Gaussian Process Regression (GPR) approach, on all datasets. Input data were grey matter (GM) or white matter (WM) volumetric maps generated by Statistical Parametric Mapping (SPM) or raw data.

CNN accurately predicted chronological age using GM (correlation between brain-predicted age and chronological age r = 0.96, mean absolute error [MAE] = 4.16 years) and raw (r = 0.94, MAE = 4.65 years) data. This was comparable to GPR brain-predicted age using GM data (r = 0.95, MAE = 4.66 years). Brain-predicted age was a heritable phenotype for all models and input data (h² ≥ 0.5). Brain-predicted age showed high test-retest reliability (intraclass correlation coefficient [ICC] = 0.90–0.99). Multi-centre reliability was more variable, with high ICCs for GM (0.83–0.96) and poor-to-moderate ICCs for WM and raw data (0.51–0.77).

Brain-predicted age represents an accurate, highly reliable and genetically-influenced phenotype that has the potential to be used as a biomarker of brain ageing. Moreover, age predictions can be accurately generated on raw T1-MRI data, substantially reducing computation time for novel data and bringing the process closer to giving real-time information on brain health in clinical settings.

Introduction

The human brain changes across the adult lifespan. This process of brain ageing occurs in accord with a general decline in cognitive performance, cognitive ageing. Although the changes associated with brain ageing are not explicitly pathological, with increasing age comes increasing risk of neurodegenerative disease and dementia (Abbott, 2011). However, the wide range of onset ages for age-associated brain diseases indicates that the effects of ageing on the brain vary greatly between individuals. Thus, advancing our understanding of brain ageing and identifying biomarkers of the process are vital to help improve detection of early-stage neurodegeneration and predict age-related cognitive decline.

One promising approach to identifying individual differences in brain ageing derives from the research showing that neuroimaging data can be used to accurately predict chronological age in healthy individuals, using machine learning (Dosenbach et al., 2010, Franke et al., 2010). By ‘learning’ the correspondence between patterns in structural or functional neuroimaging data and an age ‘label’, machine-learning algorithms can formulate massively high-dimensional regression models, fitting large neuroimaging datasets as independent variables to predict chronological age as the dependent variable. The resulting brain-based age predictions are generally highly accurate, particularly when algorithms learn from large training datasets and are applied to novel or ‘left-out’ data (i.e., test datasets).
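The accuracy of such age predictions is commonly summarised by the correlation between predicted and chronological age (r) and by the mean absolute error (MAE) on left-out data. The following is a minimal sketch of both metrics in plain Python. The function names and the four example ages are made up for illustration and are not taken from the study.

```python
import math

def mean_absolute_error(true_ages, predicted_ages):
    """Average absolute difference, in years, between true and predicted ages."""
    if len(true_ages) != len(predicted_ages) or not true_ages:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(t - p) for t, p in zip(true_ages, predicted_ages))
    return total / len(true_ages)

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)

# Hypothetical test-set ages (years); illustrative values only.
chronological = [25.0, 40.0, 55.0, 70.0]
predicted = [28.0, 38.0, 59.0, 67.0]

mae = mean_absolute_error(chronological, predicted)
r = pearson_r(chronological, predicted)
print(f"MAE = {mae:.2f} years, r = {r:.3f}")
```

Both metrics should be computed on data the model did not see during training. Otherwise they would overstate how well the model generalises.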

Neuroimaging-derived age predictions have been explored in the context of different brain diseases. By training models on healthy individuals, brain-based predictions of age can then be made in independent clinical samples. If ‘brain-predicted age’ is greater than an individual's chronological age, this is thought to reflect some aberrant accumulation of age-related changes to the brain. The degree of this ‘added’ brain ageing can be simply quantified by subtracting chronological age from brain-predicted age. This approach is being used more frequently and has demonstrated increased brain-predicted age in adults with mild cognitive impairment who progress to Alzheimer's (Franke and Gaser, 2012, Gaser et al., 2013), after traumatic brain injury (Cole et al., 2015), in schizophrenia (Koutsouleris et al., 2013, Schnack et al., 2016), HIV (Cole et al., 2017c), epilepsy (Pardoe et al., 2017), Down's syndrome (Cole et al., 2017a) and diabetes (Franke et al., 2013). At the same time, brain-predicted age has been used to demonstrate protective influences on brain ageing, including meditation (Luders et al., 2016) and increased levels of education and physical exercise (Steffener et al., 2016). Evidently, the extent to which one's brain resembles the typical structure or function appropriate for one's age can be affected by both positive and negative influences. By conceptualising brain ageing in this manner, highly-complex multivariate datasets and statistical procedures can be reformulated into an intuitively straightforward and widely-applicable biomarker. However, the practicality of using such a marker clinically, its reliability and relevance for normal variation in brain ageing need to be further demonstrated.
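The subtraction described above can be sketched in a few lines of Python. The participant identifiers and ages are invented for illustration, and the function name `brain_age_gap` is a hypothetical label rather than terminology from the study.

```python
def brain_age_gap(brain_predicted_age, chronological_age):
    """Return brain-predicted age minus chronological age, in years.

    Positive values mean the brain appears older than expected for the
    person's age. Negative values mean it appears younger.
    """
    return brain_predicted_age - chronological_age

# Hypothetical records: (participant id, brain-predicted age, chronological age).
participants = [
    ("sub-01", 52.0, 47.0),
    ("sub-02", 63.5, 66.0),
    ("sub-03", 30.0, 30.0),
]

gaps = {pid: brain_age_gap(pred, chron) for pid, pred, chron in participants}
mean_gap = sum(gaps.values()) / len(gaps)

for pid, gap in gaps.items():
    print(f"{pid}: {gap:+.1f} years")
print(f"group mean gap: {mean_gap:+.2f} years")
```

In a group comparison, the mean gap in a clinical sample would typically be compared with that of healthy controls scanned and processed in the same way.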

One hindrance to clinical applications for neuroimaging generally is the time needed for image ‘post-processing’ after acquisition (referred to as ‘pre-processing’ by neuroimagers), which can take hours or days, while clinical decisions often need to occur in minutes or less. Regardless of learning algorithm, previous brain-predicted age studies have required several pre-processing stages. Such steps are typically a sequence of data transformations that produce a representation of the original images that is sufficiently structured, compact and informative to support machine learning. These include the removal of non-brain tissue (i.e., skull stripping or brain extraction), affine or non-linear image registration, interpolation and smoothing. While pre-processing may reduce noise and permit voxelwise inter-individual statistical comparisons, there are numerous additional assumptions required for any pre-processing pipeline. These assumptions are often not met, particularly when analysing brain images containing gross pathology (Avants et al., 2008, Liu et al., 2015) and can even be an increased source of error. Recently, however, modelling methods that require little or no image pre-processing have become available, including so-called ‘deep learning’.

The resurgence of interest in artificial neural networks for learning data representations, deep learning, offers a new way of approaching statistical modelling in neuroimaging, thanks to improvements in computing infrastructure. When sufficiently large volumes of data are available, no ‘hand-engineering’ (i.e., manually selecting a priori which features should be used as input) is needed as the deep learning algorithm is able to infer a compact representation of the data, starting only with raw images as input, which is optimally tailored for the particular predictive modelling task at hand. In this respect, deep learning offers several practical advantages for high-dimensional prediction tasks, that should enable the learning of both physiologically-relevant representations and latent relationships (Plis et al., 2014). Of particular interest to us is the potential for deep learning techniques, such as convolutional neural networks (CNN), to make predictions from raw, unprocessed neuroimaging data, thus obviating the reliance on time-consuming pre-processing and improving the clinical applicability of models of brain ageing.
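To make the core operation of a 3D CNN concrete, the sketch below applies a single 3D filter to a small volume in plain Python. It illustrates the general technique only and is not the architecture used in this study. The volume, the kernel values and the function names are assumptions chosen for the example. As is conventional in deep learning, the "convolution" is implemented as a sliding dot product (cross-correlation) without flipping the kernel.

```python
def conv3d_valid(volume, kernel):
    """Slide `kernel` over `volume` with no padding and stride 1.

    Both arguments are nested lists indexed as [z][y][x]. Each output
    voxel is the sum of element-wise products between the kernel and
    the patch of the volume it currently covers.
    """
    depth, height, width = len(volume), len(volume[0]), len(volume[0][0])
    k_d, k_h, k_w = len(kernel), len(kernel[0]), len(kernel[0][0])
    output = []
    for z in range(depth - k_d + 1):
        plane = []
        for y in range(height - k_h + 1):
            row = []
            for x in range(width - k_w + 1):
                total = 0.0
                for dz in range(k_d):
                    for dy in range(k_h):
                        for dx in range(k_w):
                            total += volume[z + dz][y + dy][x + dx] * kernel[dz][dy][dx]
                row.append(total)
            plane.append(row)
        output.append(plane)
    return output

def relu(feature_map):
    """Apply the rectified linear unit to every voxel of a 3D feature map."""
    return [[[max(0.0, v) for v in row] for row in plane] for plane in feature_map]

# Toy 4x4x4 "image" of ones and a 3x3x3 averaging-style kernel of ones.
volume = [[[1.0] * 4 for _ in range(4)] for _ in range(4)]
kernel = [[[1.0] * 3 for _ in range(3)] for _ in range(3)]

feature_map = relu(conv3d_valid(volume, kernel))
print(len(feature_map), len(feature_map[0]), len(feature_map[0][0]))
```

In a trained network the kernel values are learned from data rather than fixed, and many such filters are stacked in layers. A practical implementation would use an optimised library rather than nested Python loops.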

Beyond improving clinical applicability, a biomarker of brain ageing needs to relate to naturally occurring variation, such as that caused by genetic factors. Many aspects of brain ageing and susceptibility to age-related brain disease are thought to be under genetic influence (Lee and Sachdev, 2014, Lu et al., 2004, Peters, 2006, Teter and Finch, 2004). Therefore, demonstrating a brain ageing biomarker is sensitive to genetic influences gives some external, genetic, validity to the measure. Furthermore, if a neuroimaging biomarker is heritable, this motivates further research into specific candidate genes, or sets of genes, that may affect this aspect of brain ageing. These candidate genes can then, in turn, provide biological targets for pharmacological interventions which aim to improve brain health in older adults.
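As a rough illustration of how twin data can inform heritability, the sketch below uses Falconer's formula, h² = 2(r_MZ − r_DZ), where r_MZ and r_DZ are the within-pair correlations for monozygotic and dizygotic twins. This is a simplified textbook estimator swapped in for illustration. It is not claimed to be the method used in this study, which may rely on more formal twin modelling. The twin values and function names are invented, and a plain Pearson correlation over arbitrarily ordered pairs stands in for an intraclass correlation.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def falconer_heritability(mz_pairs, dz_pairs):
    """Estimate h^2 = 2 * (r_MZ - r_DZ) from lists of (twin1, twin2) values."""
    r_mz = pearson_r([a for a, _ in mz_pairs], [b for _, b in mz_pairs])
    r_dz = pearson_r([a for a, _ in dz_pairs], [b for _, b in dz_pairs])
    return 2.0 * (r_mz - r_dz), r_mz, r_dz

# Hypothetical phenotype values (e.g. a brain-age gap in years) per twin pair.
mz_pairs = [(1.0, 1.0), (2.0, 2.0), (3.0, 3.0), (4.0, 4.0)]
dz_pairs = [(1.0, 2.0), (2.0, 1.0), (3.0, 4.0), (4.0, 3.0)]

h2, r_mz, r_dz = falconer_heritability(mz_pairs, dz_pairs)
print(f"r_MZ = {r_mz:.2f}, r_DZ = {r_dz:.2f}, h2 = {h2:.2f}")
```

The intuition is that monozygotic twins share essentially all of their segregating genetic variation while dizygotic twins share about half on average, so a larger MZ correlation relative to the DZ correlation points to genetic influence on the phenotype.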

Another important facet of any biomarker is reliability. If a biomarker is to be evaluated longitudinally, in clinical trials or research settings, to track change over time, establishing test-retest reliability is vital. Furthermore, as many neuroimaging studies are now international collaborative efforts, data collection often takes place across multiple scanning sites. Therefore, between-scanner reliability, which indicates that a method of obtaining a biomarker is generalizable to data acquired from other sites, is of increasing importance.
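Reliability of this kind is often quantified with an intraclass correlation coefficient (ICC). The sketch below computes one common variant, the two-way random-effects, absolute-agreement, single-measurement ICC, often written ICC(2,1). Which ICC variant the study used is not stated in this section, so the choice here is an assumption. The scan values and function name are likewise invented for illustration.

```python
def icc_2_1(scores):
    """ICC(2,1) for a table of scores: rows are subjects, columns are sessions.

    Uses the standard two-way ANOVA mean squares:
    ICC = (MSR - MSE) / (MSR + (k - 1) * MSE + k * (MSC - MSE) / n)
    """
    n = len(scores)          # number of subjects
    k = len(scores[0])       # number of sessions (or scanners)
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in scores for v in row)
    ss_error = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_error / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

# Hypothetical brain-predicted ages (years) for three people scanned twice.
identical_sessions = [[30.0, 30.0], [45.0, 45.0], [60.0, 60.0]]
offset_sessions = [[30.0, 32.0], [45.0, 47.0], [60.0, 62.0]]

icc_identical = icc_2_1(identical_sessions)
icc_offset = icc_2_1(offset_sessions)
print(f"identical: {icc_identical:.3f}, constant offset: {icc_offset:.3f}")
```

The second example shows why the choice of variant matters for between-scanner comparisons: a constant offset between sessions preserves the rank order of subjects but still lowers an absolute-agreement ICC.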

In this work, we sought to establish the credentials of CNN-predicted age as a potential biomarker of brain ageing in three different ways: 1) Demonstrate that CNNs can accurately predict age using structural neuroimaging data and compare predictions using pre-processed and ‘raw’ input data; 2) Establish the heritability of brain-predicted age using a sample of monozygotic and dizygotic twins; 3) Assess both the test-retest (i.e., within-scanner) and multi-centre (i.e., between-scanner) reliability of brain-predicted age.

Datasets

All neuroimaging data used in the study were T1-weighted MRI scans. Details of the participants in the specific samples and the respective acquisition parameters used are outlined below:

Convolutional neural networks accurately predict age using neuroimaging

Analysis showed that our CNN method could accurately predict the chronological age of healthy adults, using either processed volumetric maps or raw T1-MRI data (see Table 1). Prediction accuracy was similar for GPR. The lowest MAE achieved was using GM data and CNN analysis (MAE = 4.16 years), though other predictions were generally comparable. Using single tissues (i.e., GM or WM) did not appreciably alter the prediction accuracy compared to using all available input data for each subject

Discussion

Using 3D convolutional neural networks, we accurately estimated chronological age from raw T1-weighted MRI brain scans of healthy adults. The accuracy of CNN for age prediction was also high when using processed GM and WM voxelwise images, and was comparable with age estimations made using GPR. Brain-predicted age estimates were significantly heritable and showed high levels of within-scanner and between-scanner reliability. These findings support the idea that deep learning methods can

Conclusions

Deep learning models based on T1-MRI can accurately predict chronological age in healthy individuals. This can be achieved using raw MRI data, with a minimum of processing necessary to generate an accurate age prediction. These estimates of brain-predicted age are also considerably heritable, giving external, genetic, validity to the measure and motivating its use in genetic studies of brain ageing. Finally, our analysis showed the brain-predicted age is highly reliable and thus appropriate for

Acknowledgements

The TwinsUK study was funded by the Wellcome Trust, Medical Research Council, European Commission’s Seventh Framework Program (FP7/2007-2013, GA No 259749). The study also receives support from the National Institute for Health Research (NIHR), BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy's and St Thomas' NHS Foundation Trust in partnership with King's College London. The STudy Of Reliability of MRI (STORM) was funded by the NIHR Biomedical Research Centre

References (68)

J. Ashburner. A fast diffeomorphic image registration algorithm. NeuroImage (2007)
B.B. Avants et al. Symmetric diffeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain. Med. Image Anal. (2008)
J. Barnes et al. A comparison of methods for the automated calculation of volumes and atrophy rates in the hippocampus. NeuroImage (2008)
S.A.H. Batouli et al. The heritability of volumes of brain structures and its relationship to age: a review of twin and family studies. Ageing Res. Rev. (2014)
A.M. Fjell et al. Critical ages in the life course of the adult brain: nonlinear subcortical aging. Neurobiol. Aging (2013)
K. Franke et al. Estimating the age of healthy subjects from T1-weighted MRI scans using kernel methods: exploring the influence of various parameters. NeuroImage (2010)
S.E. Harris et al. The genetics of cognitive ability and cognitive ageing in healthy older people. Trends Cognitive Sci. (2011)
M. Jenkinson et al. A global optimisation method for robust affine registration of brain images. Med. Image Anal. (2001)
J. Jovicich et al. Reliability in multi-site structural MRI studies: effects of gradient non-linearity correction on phantom and human data. NeuroImage (2006)
K. Kamnitsas et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Med. Image Anal. (2017)
J. Kleesiek et al. Deep MRI brain extraction: a 3D convolutional neural network for skull stripping. NeuroImage (2016)
A. Klein et al. Evaluation of volume-based and surface-based brain image registration methods. NeuroImage (2010)
E. Konukoglu et al. Neighbourhood approximation using randomized forests. Med. Image Anal. (2013)
W.S. Kremen et al. Genetic and environmental influences on the size of specific brain regions in midlife: the VETSA MRI study. NeuroImage (2010)
T. Lee et al. Genetic influences on cognitive functions in the elderly: a selective review of twin studies. Brain Res. Rev. (2010)
E. Luders et al. Estimating brain age using high-resolution pattern recognition: younger brains in long-term meditation practitioners. NeuroImage (2016)
B. Mwangi et al. Prediction of individual subject's age across the human lifespan using diffusion tensor imaging: a machine learning approach. NeuroImage (2013)
H.R. Pardoe et al. Structural brain changes in medically refractory focal epilepsy resemble premature brain aging. Epilepsy Res. (2017)
H.R. Pardoe et al. Motion and morphometry in clinical and nonclinical populations. NeuroImage (2016)
J. Steffener et al. Differences between chronological and brain age are related to education and self-reported physical activity. Neurobiol. Aging (2016)
B. Teter et al. Caliban's heritance and the genetics of neuronal aging. Trends Neurosci. (2004)
A.M. Winkler et al. Cortical thickness or grey matter volume? The importance of selecting the phenotype for imaging genetics studies. NeuroImage (2010)
A. Abbott. Dementia: a problem for our age. Nature (2011)
H. Akaike. A new look at the statistical model identification. IEEE Trans. Automat. Control (1974)
W.F.C. Baaré et al. Quantitative genetic modeling of variation in human brain morphology. Cereb. Cortex (2001)
S.A.H. Batouli et al. Heritability of brain volumes in older adults: the older Australian twins study. Neurobiol. Aging (2014)
S. Boker et al. OpenMx: an open source extended structural equation modeling framework. Psychometrika (2011)
T.J. Bouchard. The Wilson Effect: the increase in heritability of IQ with age. Twin Res. Hum. Genet. (2013)
X.W. Chen et al. Big data deep learning: challenges and perspectives. IEEE Access (2014)
J.H. Cole et al. Brain-predicted age in Down Syndrome is associated with β-amyloid deposition and cognitive decline. Neurobiol. Aging (2017)
J.H. Cole et al. Prediction of brain age suggests accelerated atrophy after traumatic brain injury. Ann. Neurol. (2015)
J.H. Cole et al. Brain age predicts mortality. Mol. Psychiatry (2017)
J.H. Cole et al. Increased brain-predicted aging in treated HIV disease. Neurology (2017)
L. Deng et al. Deep learning: methods and applications. Found. Trends Signal Process. (2013)
